So I have this html piece:
<p class="tbtx">
MWF
<br></br>
TH
</p>
which is completely mangled it seems. I need to extract the data i.e. ['MWF', 'TH'].
The only solution I could think of is to replace all newlines and spaces in the html, then split it at
and rebuild html structure and then extract .text but it's a bit ridiculous.
Any proper solutions for this?