I have a HTML file containing words like <i>rūpa</i>
.
How to convert it into rūpa
(rūpa)?
Is there any way to convert it?
Also i get to know that these are the html representation of extended binary code,(correct me if i am wrong).
python is preferred, but solution in any language is appreciated.