I am trying to make a program that could automatically scan the images or texts on a user's desktop and then convert it to a .txt
file for text analysis.
So far I have found source codes to convert PDF
and HTML
into .txt
. However I would like to make my program automatically scan the desktop screen at certain time intervals rather than manually inputting the source such as:
$pdf2txt.py samples/simple1.pdf
I don't know where to start so any suggestion will be appreciated.