0

In my grabber script I need to check every new grabbed picture for duplicates. I use php on linux server. Does anybody know fast way to make that check. I have about 200k-300k pictures and dupes check takes a lot of time. Easiest MD5 check is already implemented. But I need to find dupes when pictures size not match. Thanks.

  • "When picture's size do not match" - how would you classify two images to be duplicates then? – Dogbert Apr 24 '13 at 12:13
  • Its not a dupe if the images are different in some way. – Lawrence Cherone Apr 24 '13 at 12:17
  • 2
    Try [Libpuzzle](http://www.pureftpd.org/project/libpuzzle/php) ... except you want a pure GD solution – Baba Apr 24 '13 at 12:20
  • http://www.imgseek.net/isk-daemon has an php api that can find a similarity percentage. I think that can help you. Or look at this topic: http://stackoverflow.com/questions/25977/how-can-i-measure-the-similarity-between-two-images its provide you with the right information to check it yourself – S.Visser Apr 24 '13 at 12:27

0 Answers0