I have many strings with HTML markup, like this:
<font face='verdana'>A great project with many companies
A <b>plumber</b> company as external.
An <i>electricity</i> company.
And a <font color='FF6600'>security </font>guard <u><i><b>during 2 weeks </b></i></u>.
</font>
For each piece of markup I want a new string holding the tag together with its contents, and the plain text between tags should go into its own string. For my example the result would be:
string1 = "A great project with many companies
A"
string2 = "<b>plumber</b>"
string3 = " company as external.
An"
string4 = "<i>electricity</i>"
string5 = " company.
And a"
string6 = "<font color='FF6600'>security </font>"
string7 = "guard"
string8 = "<u><i><b>during 2 weeks </b></i></u>"
string9 = "."
string10 = "</font>"
The first <font face='verdana'> and the last </font> can be deleted.
I tried using a regex with the matcher method, but the result wasn't exactly what I wanted; maybe my regex was wrong.
I also tried searching for the first opening tag and the first closing tag that corresponds to it, but that didn't work because all font tags (<font color='FF6600'> and <font face='verdana'>) have the same closing tag (</font>).
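To make the nesting problem concrete, here is a minimal sketch of one possible approach: instead of pairing the first opening tag with the first closing tag, count how deep the tags are nested and cut a segment only when the depth returns to zero. It assumes Java and java.util.regex (guessed from the mention of the matcher method), that the outer <font face='verdana'> wrapper has already been removed, that the markup is well formed, and that attribute values contain no ">" character. The class name MarkupSplitter and the method split are hypothetical names chosen for illustration. Void tags written without a slash (such as <br>) are not handled.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: splits a string into top-level elements and the text between them.
class MarkupSplitter {
    // Group 1 is "/" for a closing tag, group 2 is "/" for a self-closing tag.
    private static final Pattern TAG = Pattern.compile("<(/?)[A-Za-z][^>]*?(/?)>");

    static List<String> split(String html) {
        List<String> parts = new ArrayList<>();
        Matcher m = TAG.matcher(html);
        int depth = 0;        // number of tags currently open
        int segmentStart = 0; // start index of the segment being collected

        while (m.find()) {
            boolean closing = !m.group(1).isEmpty();
            boolean selfClosing = !m.group(2).isEmpty();

            if (closing) {
                if (depth == 0) {
                    continue; // stray closing tag: leave it inside the text segment
                }
                depth--;
                if (depth == 0) {
                    // the outermost element just closed: emit it with its markup
                    parts.add(html.substring(segmentStart, m.end()));
                    segmentStart = m.end();
                }
            } else {
                if (depth == 0) {
                    // flush the plain text that precedes this top-level element
                    if (m.start() > segmentStart) {
                        parts.add(html.substring(segmentStart, m.start()));
                    }
                    segmentStart = m.start();
                }
                if (!selfClosing) {
                    depth++;
                } else if (depth == 0) {
                    // a top-level self-closing tag is a segment on its own
                    parts.add(html.substring(segmentStart, m.end()));
                    segmentStart = m.end();
                }
            }
        }
        // trailing plain text after the last element
        if (segmentStart < html.length()) {
            parts.add(html.substring(segmentStart));
        }
        return parts;
    }
}
```

Calling MarkupSplitter.split on the inner part of the example should give one list entry per text run and one per top-level element, with nested tags such as <u><i><b>...</b></i></u> kept together. Because the depth counter ignores tag names, a <font> nested inside another <font> stays in a single segment. For real-world HTML a proper parser would probably be more robust than a regex.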