The main idea is this, I have a large collection of IGCSE past papers, I need to find which paper a particular question was from, and all I have is the screenshot of one question. I want to make a program that can input an image of a question, then scan a set of pdfs to find the said question, then output the pdf containing the said question. I have experience in programming but I'm a bit stuck into how to approach the problem at hand.
Solutions I have tried:
- I tried combining pdfs into one mega pdf so I could just search the mega pdf, can't do that as the file is too large.
Solutions I think might work but not sure:
- Making a program to read through every single pdf to find the keywords in the image.