I got this image of a simple math expression Tesseract fails to read:
I've tested a screenshot of the same expression written on an Android phone and it was read pretty well. So I thought it's a font problem. I considered:
- Preprocess the image by inverting or removing the red areas
- Training Tesseract with images (StackOverflow question with no answers)
- Using WhatFontIs.com to find similar font then training Tesseract with the font file with TrainYourTesseract