Convert specific PDF file to HTML in PHP

Question

is there any way how to covert PDF to HTML? I need a text from the file and when I tried PDFtoText library, I got the text, but unsorted and without any rules for parsing. I noticed, that some PDFtoHTML online services works great with the file. So, any tips please? Here is the PDF file and I need only one specific row in the right column.

http://stackoverflow.com/questions/956508/convert-pdf-to-html — Mohit Bumb, Dec 07 '11 at 12:55
You should try this answer: http://stackoverflow.com/a/2249962/765854 and only take the portion that you care about. — Rakesh Sankar, Dec 07 '11 at 12:54

score 0 · Accepted Answer · answered Dec 07 '11 at 12:51

0

Try integrating the PDFtoHTML from the poppler project; that should support table recognition.

answered Dec 07 '11 at 12:51

A T

13,008
21
97
158

score 0 · Answer 2 · answered Dec 07 '11 at 12:55

0

pdftohtml works fine : fast, stable but the html result is ugly at best. I have used it for quite some time for a web site that has many job resumes.

It is a good solution for extracting textual content however.

I would give the scribd API a try

http://www.scribd.com/developers/api

or the google apps document API. GOogle does a great job a displaying and converting pdf files

answered Dec 07 '11 at 12:55

Mohit Bumb

2,466
5
33
52

Source : stackoverflow.com/questions/956508/convert-pdf-to-html – Mohit Bumb Dec 07 '11 at 12:56

Convert specific PDF file to HTML in PHP

2 Answers2