1

I've been trying to get tesseract to recognize the numbers on this image: Filtered Img

but when running the script the output is empty meaning it can't

Any idea how to make it work? it doesn't seem like it should have a bad time converting the image into text and the same happens 7 segment digital digits and when trying to run tesseract on a noisier colored version of this image this does actually seem to work well in this example:

Noisy img

Any hints on how to get it to work?

Thanks for helping

Dmitrii Z.
  • 2,287
  • 3
  • 19
  • 29
MikeLemo
  • 521
  • 1
  • 5
  • 11

1 Answers1

1

Tesseract is very sensitive to the image borders, so adding about 20 pix border around your filtered image will solve the issue.

Regarding 7-segment font as stated in tesseract issue - LSTM (tess 4.*) might not be the best approach and there are different libraries which can handle this better. You can try this tessdata for tesseract 3.4;

Dmitrii Z.
  • 2,287
  • 3
  • 19
  • 29