Is there a library to parse HTML into a String object, either in Java, C# or any other programming language out there.
This is my situation:
I have some documentation which came from an wysiwyg editor and contain some basic html tags such as <p><br>
and others like this:
<p>This </p><font>etc</font><br>
<span> and this, etc.
When exported to some other tool it gets converted to plain text making it unreadable. What I'm doing right now is to: "Select all" > "Save as x.html" > "Open in browser" > "Select all" > "Paste"
Which I know could be automatized with a program.
Is there an library to do this? That is, to "render" simple HTML? Preferably to a string to I can put it into my clipboard. Removing the html tags is not enough for I would get a very long line without carriage returns.
etc. I updated the question to clarify that. – OscarRyz Oct 23 '12 at 15:35