3

I use easyocr to extract table from a photo or scanned PDF, but I have a problem in fine tuning the data as a table. I try to make a searchable pdf according to extracted coordinates but when I convert it to csv, the lines are not tune. I would appreciate if someone guide me about this.

mahya
  • 31
  • 1
  • 2

1 Answers1

2

As far as I know, easyocr currently does not support table recognition. The best table recognition should be PaddleOCR's PP-Structure model. This is what I use now, and the effect is very good. You can try it.

link: https://github.com/PaddlePaddle/PaddleOCR/blob/dygraph/ppstructure/README.md

zhoujun
  • 69
  • 4
  • Can you share more info? which language you are recognizing table, how many images did you used for fine-tuning(if you had fine tuning). – masoud parpanchi Jan 09 '23 at 17:11