I have a PDF that contains many underlines and strikethroughs in the text. I would like to be able to convert this PDF to HTML. I have tried many different tools, and all of them will sometimes catch the underlines and strikethroughs as text formatting, and at other times will convert the underlines and strikethroughs to graphics, which is (as far as I can tell) useless to me.
I would really like to know how these programs differentiate between underlines that format text and underlines that are converted to graphics, and how I might be able to access the document and capture everything as text formatting.
I may be taking the wrong approach with this, and am open to any possible solutions, I think I just need to be pointed in the right direction.
Thank you in advance for any assistance.