The tesseract OCR on screenshots gives rather erratic results. Only some of the text seems to be recognized correctly even though the image is completely black with white text over it. Even after I resize the image to 300dpi
the accuracy remains low and most of the text is gibberish.
I read the similar question on StackOverflow at: Best way to recognize characters in screenshot?
As mentioned, the writer of the question was able to get nearly 100% accuracy by training the tesseract engine with his font.
The font in my image is Arial. How can I still improve the accuracy???
Here is a sample of the kind of Images I have: Image Sample