Optimal configuration for Tessnet -- is image format conversion good enough?

Question

I need to do OCR on a group of images. I have been using Tessnet and it works pretty well. The problem is that it seems to have problems with some images, so I thought that it might work better if I modify the images' brightness, contrast, etc. Also, the images are in .jpg format, but I read that .tiff is optimal.

What can I do? Should I just convert the JPEGs to TIFFs?

onemasse · Answer 1 · 2011-08-04T11:03:05.477

0

There's no point in converting the jpeg images to a lossless format like tiff, you will convert the artifacts as well. You could try and apply a sharpness kernel on the image before you try to do ocr on it.

Look at this page for more information.

edited Aug 04 '11 at 11:03

answered Aug 03 '11 at 17:12

onemasse

6,514
8
32
37

1

sharpeness kernel? What it is about? – FrioneL Aug 04 '11 at 08:03

Optimal configuration for Tessnet -- is image format conversion good enough?

1 Answers1