Discard images from a group of similar images

Question

I am generating images (thumbnails) from a video every 3 seconds. Now I need to discard/remove all the similar images. Is there a way I could this?

I generate thumbnails using FFMPEG. I read about various image-diff solutions like given in this SO post, but I do not want to do this manually. How and what parameters should be considered that could tell if a particular image is similar to other images present.

Try the [mpdecimate](https://ffmpeg.org/ffmpeg-filters.html#mpdecimate) filter. — Gyan, May 30 '18 at 19:16

score 1 · Accepted Answer · answered May 30 '18 at 19:11

You can calculate the Structural Similarity Index between images and based on the score keep or discard an image. There are other measures you can use, but basically a method that returns a score. Try PIL or OpenCV https://pillow.readthedocs.io/en/3.1.x/reference/ImageChops.html?highlight=difference https://www.pyimagesearch.com/2017/06/19/image-difference-with-opencv-and-python/

Yiannis Ath · Answer 2 · 2018-05-31T02:03:07.940

I dont have enough reputation to comment my idea on your problem, so i will just go ahead and post it as an answer in hope of helping you.

I am quite confused about the term "similar" but since you are reffering on video frames i am going to assume that you want to avoid having "similar" frames that have been captured because of poor camera movement. If that's the case you might want to consider using salient point descriptors.

To be more specific you can detect salient points (using for instance Harris) and then use a point descriptor algorithm (such as SURF) and discard the frames that have been found to have "too many" similar points with a pre-selected frame.

Keep in mind that in order for the above process to be successful, the frames must be as sharp as possible, i guess you don't want to extract as a thubnail a blurred frame anyway. So applying a blurred images detection might be useful in your case.

Discard images from a group of similar images

2 Answers2