I need to decode HTML into plain text. I know there are a lot of questions like this, but I noticed one problem with the suggested solutions and don't know how to solve it.
For example, take this piece of HTML:
<h1><strong>Some text</strong></h1><p><br></p><p>Some more text</p>
I tried regex solutions and the HttpUtility.HtmlDecode method, and all of them give this output: Some textSome more text. The words get joined where they should be separate. Is there a way to decode the string without merging words?
` with spaces before extracting the plain text content. – Andy G Feb 08 '19 at 13:09
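A minimal sketch of what the comment above seems to suggest. The comment is cut off, so which tags to replace is an assumption here: block-level tags and `<br>` are turned into spaces first, any remaining inline tags are removed without a space, and only then are entities decoded. `HtmlText` and `ToPlainText` are hypothetical names for illustration, and `WebUtility.HtmlDecode` from `System.Net` stands in for `HttpUtility.HtmlDecode`. Regex-based stripping is not a real HTML parser, so this is only likely to hold up on simple, well-formed fragments like the one in the question.

```csharp
using System;
using System.Net;
using System.Text.RegularExpressions;

// Hypothetical helper for illustration; not part of any library.
public static class HtmlText
{
    // Assumed list of tags that should act as word separators.
    private static readonly Regex BlockTags = new Regex(
        @"</?(p|div|br|h[1-6]|li|ul|ol|table|t[dhr]|blockquote)\b[^>]*>",
        RegexOptions.IgnoreCase);

    // Any tag that is still present after the block tags were handled.
    private static readonly Regex AnyTag = new Regex(@"<[^>]+>");

    private static readonly Regex Whitespace = new Regex(@"\s+");

    public static string ToPlainText(string html)
    {
        // 1. Replace block-level tags and <br> with a space so neighbouring blocks stay separate.
        string spaced = BlockTags.Replace(html, " ");

        // 2. Remove the remaining (inline) tags without inserting a space,
        //    so markup inside a word does not split it.
        string withoutTags = AnyTag.Replace(spaced, string.Empty);

        // 3. Decode entities such as &amp; into their characters.
        string decoded = WebUtility.HtmlDecode(withoutTags);

        // 4. Collapse runs of whitespace into one space and trim the ends.
        return Whitespace.Replace(decoded, " ").Trim();
    }
}
```

For the HTML in the question, `HtmlText.ToPlainText("<h1><strong>Some text</strong></h1><p><br></p><p>Some more text</p>")` returns `Some text Some more text`. Entities are decoded after the tags are removed so that an escaped `&lt;` in the text is not mistaken for markup.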