I have been developing an application for android which uses tesseract OCR (optical character recognition) and was wondering if there is a method for improving the results for small text.
I have tried re-compiling the standard dictionary with my own frequent and normal word lists (using wordlist2dawg) and have seen no improvement (I can't even tell if it is helping!). I have also heard it is possible to alter the threshold at which tesseract uses dictionary words but I have no idea how to do this.
If anybody has an idea of how I could improve the results tesseract gives me I would really appreciate it!