So my project involves transcribing texts in jpg files into text files, and we are currently using tesseract. However, at this current level, tesseract is not doing so well at transcribing the texts in jpg files. So I decided to use some image preprocessing program to make the image files more optimal to be fed into tesseract.
The example of image file is like this: http://i46.tinypic.com/opramo.jpg
basically old news in image forms.
Any suggestion on which image processing engine to use??? Thank you !