Is there a proper library which I can use to convert PDF to HTML or some other format that can be converted to HTML easily?
I searched similar questions, but to no luck.
I want to be able to extract text from PDF's, possibly images. I'm not looking to embed the PDF inside the HTML.