Is there any way to transform HTML into Markdown text?
- HTML is pretty simple - it's just a div element with several
<p>
elements inside. - There may be images - I want to preserve them as well.
- Also, there may be several "inline" images like this one:
<img src='images/symbol_minus.gif' width='9' height='3' alt='−' border='0' style='vertical-align:middle;' />
which is just a minus sign - I want to replace them with text.
Nokogiri::XML::Element#text
is almost something that I want, but it strips off images.