I need to run OCR on a batch of images. I have been using Tessnet and it works pretty well, but it struggles with some images, so I thought it might do better if I adjusted the images' brightness, contrast, etc. first. Also, the images are in .jpg format, but I read that .tiff is optimal.

What can I do? Should I just convert the JPEGs to TIFFs?

jscs
FrioneL

1 Answer

There's no point in converting the JPEG images to a lossless format like TIFF: the JPEG compression artifacts are already baked into the pixels, so they will be carried over into the TIFF as well. You could try applying a sharpening kernel to the image before running OCR on it.

Look at this page for more information.
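
To make the sharpening suggestion concrete, here is a minimal sketch of a 3x3 sharpening convolution in plain Python. It is not Tessnet's API and not necessarily what the linked page describes; the names `SHARPEN_KERNEL` and `sharpen` are hypothetical, the kernel is just one common choice, and the image is assumed to be grayscale, stored as a list of rows of 0-255 integers.

```python
# Illustrative only: hypothetical names, not part of Tessnet or any library.

# One common 3x3 sharpening kernel. The weights sum to 1, so flat
# regions keep their value while edges are exaggerated.
SHARPEN_KERNEL = [
    [0, -1, 0],
    [-1, 5, -1],
    [0, -1, 0],
]


def sharpen(image):
    """Apply SHARPEN_KERNEL to a grayscale image (list of rows of 0-255 ints)."""
    height = len(image)
    width = len(image[0]) if height else 0
    result = []
    for y in range(height):
        row = []
        for x in range(width):
            total = 0
            for ky in range(3):
                for kx in range(3):
                    # Clamp coordinates so border pixels reuse their nearest neighbour.
                    sy = min(max(y + ky - 1, 0), height - 1)
                    sx = min(max(x + kx - 1, 0), width - 1)
                    total += SHARPEN_KERNEL[ky][kx] * image[sy][sx]
            # Keep the output inside the valid 0-255 range.
            row.append(min(max(total, 0), 255))
        result.append(row)
    return result


# A soft edge between 100 and 120 becomes a steeper edge after sharpening.
print(sharpen([[100, 100, 120, 120]]))
```

In practice you would read the pixel values with whatever imaging library you already use, run a filter like this (or the library's built-in equivalent), and pass the result to the OCR engine. Sharpening also amplifies noise and JPEG blocking, so it is worth comparing OCR results with and without it.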

onemasse