I have an image of text, where the words are outlined rather than filled in. Tesseract is struggling to get any of the words correct - does anyone have a solution to these types of problems?
I have tried simple operations like inversion, but to no affect. I'm guessing tesseract already handles this.
Img example:
Typical output for Next: New
Typical output for Previous: Pflevuows
(my very simple) Code, takes the image as an argument:
import pytesseract
import sys
from PIL import Image
print(pytesseract.image_to_string(Image.open(sys.argv[1])))
print(sys.argv[1])
EDIT: Applying a threshold binary can get me next, but does not seem to get previous still.