Data:
<tr>
<td>
<a href="somelink">
some. .data...
</a>
</td>
<td>Black</td>
<td>57234</td>
<td>5431.60</td>
<td><font class="down"> -125.02</font></td>
</tr><tr>
<td>
<a href="somelink">
some. .data...
</a>
</td>
<td>Blue</td>
<td>57234</td>
<td>5431.60</td>
<td><font class="up"> -125.02</font></td>
</tr><tr>
<td>
<a href="somelink">
some. .data...
</a>
</td>
<td>Brown</td>
<td>57234</td>
<td>5431.60</td>
<td><font class="down"> -125.02</font></td>
</tr>
...more data...
I want to extract 'some. .data...'; 'Black'; '57234'; '5431.60'; at one time. [fifth td
data is not required.]
Initially,
<tr><td><a.*>([a-zA-Z0-9 -]+)</a></td><td>(\w+)</td><td>([\d]+\.\d+)</td><td>(\d+\.\d+)</td>
was working. (via hit and miss approach)
But, now it's broke.
Now, when I use <td>(.*)</td>
or <\w+>(.*)</\w+>
: it shows data from last four td
s in every tr. But then, Why won't it show <a href...>...</a>
and how can I get data I want?