EasyOCR - Table extraction

Question

I use EasyOCR to extract tables from photos and scanned PDFs, but I have trouble arranging the recognized text back into a table. I tried building a searchable PDF from the extracted coordinates, but when I convert it to CSV the rows and columns are not aligned. I would appreciate any guidance on this.

Please provide enough code so others can better understand or reproduce the problem. — Community, Jul 01 '22 at 16:59

zhoujun · Answer 1 · 2022-08-17T03:41:19.873


As far as I know, EasyOCR does not currently support table recognition. The best option for table recognition that I know of is PaddleOCR's PP-Structure model. It is what I use now, and the results are very good. You can try it.

link: https://github.com/PaddlePaddle/PaddleOCR/blob/dygraph/ppstructure/README.md
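If you want to stay with EasyOCR, the misaligned CSV described in the question can sometimes be improved by grouping the detected boxes into rows by their vertical position before writing them out. The following is only a minimal sketch of that idea, not a table recognizer: it assumes input in the shape that EasyOCR's `readtext` returns (a list of `(bbox, text, confidence)` where `bbox` is four `[x, y]` corner points). The helper names `boxes_to_rows` and `rows_to_csv` and the `row_tol` pixel tolerance are hypothetical and chosen for illustration.

```python
import csv
import io


def boxes_to_rows(results, row_tol=10):
    """Group OCR boxes into rows by vertical centre, then sort each row left to right.

    `results` is assumed to look like EasyOCR's readtext output:
    [(bbox, text, confidence), ...] with bbox = four [x, y] corner points.
    `row_tol` is a hypothetical pixel tolerance; tune it to your image resolution.
    """
    items = []
    for bbox, text, _conf in results:
        xs = [p[0] for p in bbox]
        ys = [p[1] for p in bbox]
        # Keep the vertical centre and the left edge of each box.
        items.append((sum(ys) / len(ys), min(xs), text))
    items.sort(key=lambda it: it[0])

    rows = []
    for y, x, text in items:
        if rows and abs(y - rows[-1]["y"]) <= row_tol:
            row = rows[-1]
            row["cells"].append((x, text))
            # Update the running mean of the row's vertical centre.
            row["y"] += (y - row["y"]) / len(row["cells"])
        else:
            rows.append({"y": y, "cells": [(x, text)]})

    # Order the cells of each row by their left edge.
    return [[t for _x, t in sorted(r["cells"], key=lambda c: c[0])] for r in rows]


def rows_to_csv(rows):
    """Serialize the grouped rows as CSV text."""
    buf = io.StringIO()
    csv.writer(buf).writerows(rows)
    return buf.getvalue()
```

With EasyOCR installed, the input would typically come from something like `easyocr.Reader(['en']).readtext('table.png')`. Note that this sketch does not detect empty cells or merged cells, so rows with missing values will still shift columns; a layout model such as PP-Structure handles those cases better.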



Can you share more info? Which language are you recognizing tables in, and how many images did you use for fine-tuning (if you did any fine-tuning)? – masoud parpanchi Jan 09 '23 at 17:11
