Scraping table content from a PDF file

Question

I am working on scraping tables from a PDF file using Python.

Can someone suggest a good module that fetches only the required table? I have tried pypdf, pdf2html, OCR, and slate, but nothing works.

Thanks

Can you please explain what it is you are trying to do? – lindelof, Jun 07 '12 at 06:15

score 3 · Answer 1 · edited May 23 '17 at 10:27

And then, using an HTML parsing library, parse the HTML generated from the PDF. See "BeautifulSoup HTML table parsing".
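As a rough illustration of the parsing step, here is a minimal sketch. It uses the standard library's `html.parser` rather than BeautifulSoup so that it runs without extra installs; with BeautifulSoup the same idea would be `soup.find_all("tr")` followed by `row.find_all(["td", "th"])`. The `TableExtractor` class, the `extract_tables` helper, and the sample HTML are made up for this example. It also assumes the PDF-to-HTML converter emits real `<table>`/`<tr>`/`<td>` markup, which is not guaranteed, and it does not handle nested tables.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect every <table> in an HTML document as a list of rows."""

    def __init__(self):
        super().__init__()
        self.tables = []   # one list of rows per <table> seen so far
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that sits inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            if self._cell is not None and self._row is not None:
                # Join the fragments and collapse runs of whitespace.
                self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None
            self._cell = None


def extract_tables(html):
    """Return all tables found in `html` as lists of rows of cell strings."""
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    return parser.tables


# Hypothetical stand-in for HTML produced from a PDF.
SAMPLE_HTML = """
<html><body>
<p>Some text before the table.</p>
<table>
  <tr><th>Item</th><th>Qty</th></tr>
  <tr><td>Bolt</td><td>12</td></tr>
  <tr><td>Nut</td><td> 7 </td></tr>
</table>
</body></html>
"""

tables = extract_tables(SAMPLE_HTML)
for row in tables[0]:
    print(row)
```

To fetch only the required table, pick it out of the returned list, for example by index or by checking its header row.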

answered Jun 07 '12 at 06:41 by Priyank Patel (last edited by Community)
