Making Image Comparison Fast

Question

Say I have a set of 10,000 images that I'd like to classify based on similarity. A number of people have recommended that comparing histograms is a cheap way to measure similarity. This thread, for example, recommends using 6 histograms for each comparison.

If I compare each image's histograms with those of every other image in the set, that's O(n^2): 10,000*9,999/2 (about 50 million) image pairs, each needing 6 histogram comparisons, which is very slow. How can I speed this up?

Third answer from the top response on the thread you link purports to be much faster. — Brandon Frohbieter, Mar 05 '11 at 06:15

score 0 · Answer 1 · answered Mar 05 '11 at 06:14

Hash the histogram in some way, make a sorted list of the hashes, find adjacent values that are similar (within some limit), then compare those histograms.

However, making the histograms is likely to be the slow step.
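The hash-and-sort idea above could be sketched roughly as follows. This is only an illustration under stated assumptions: the answer does not say which hash to use, so the sketch assumes a hypothetical scalar key (the mean bin index of the normalized histogram) and an L1 distance for the full comparison. The names `scalar_key`, `l1_distance`, `similar_pairs`, `key_limit` and `dist_limit` are made up for this example.

```python
from typing import List, Sequence, Tuple

Histogram = Sequence[float]


def normalize(hist: Histogram) -> List[float]:
    """Scale a histogram so its bins sum to 1 (all zeros if it is empty)."""
    total = float(sum(hist))
    return [h / total for h in hist] if total else [0.0] * len(hist)


def scalar_key(hist: Histogram) -> float:
    """Assumed 'hash': mean bin index of the normalized histogram.

    Similar histograms get similar keys, but dissimilar histograms can
    share a key, so the key only selects candidates.
    """
    return sum(i * p for i, p in enumerate(normalize(hist)))


def l1_distance(a: Histogram, b: Histogram) -> float:
    """Full comparison: sum of absolute bin differences after normalizing."""
    return sum(abs(x - y) for x, y in zip(normalize(a), normalize(b)))


def similar_pairs(
    hists: Sequence[Histogram], key_limit: float, dist_limit: float
) -> List[Tuple[int, int]]:
    """Return index pairs whose keys and full histograms are both close."""
    keys = [scalar_key(h) for h in hists]
    # Sort image indices by key so that candidates sit next to each other.
    order = sorted(range(len(hists)), key=keys.__getitem__)
    pairs = []
    for pos, i in enumerate(order):
        nxt = pos + 1
        # Scan forward only while the keys stay within the limit.
        while nxt < len(order) and keys[order[nxt]] - keys[i] <= key_limit:
            j = order[nxt]
            if l1_distance(hists[i], hists[j]) <= dist_limit:
                pairs.append((min(i, j), max(i, j)))
            nxt += 1
    return sorted(pairs)


if __name__ == "__main__":
    demo = [[10, 0, 0, 0], [9, 1, 0, 0], [0, 0, 0, 10], [0, 0, 1, 9], [5, 0, 0, 5]]
    print(similar_pairs(demo, key_limit=0.5, dist_limit=0.5))  # [(0, 1), (2, 3)]
```

Sorting costs O(n log n), and the full comparison only runs on neighbours whose keys fall within `key_limit`, so the total work depends on how tightly the keys cluster rather than on all n^2 pairs. With several histograms per image, one option would be to concatenate them before computing the key.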

Martin Beckett
