Trying to scrape data from elements in a plain table, not all cells are required. The information is contained into the cells like the sample provided below:
<TD class=padded vAlign=top width="10%">
<SPAN class=bold>Record No:</SPAN>
<BR>40597
</TD>
In this example I am trying to extract the value for the field, which is 40597.
I have been able to use jQuery so far to find each td element like so:
function getHtmlDoc(data){
var el = document.createElement('html');
el.innerHTML = data;
$.each($('.padded',el),function(index,item){
if($(this).text().indexOf("Record No:")>=0){
console.log(index + " " + $(this).text());
}
});
}
This returns
Record No:
40597
I just want the last part.
I could add steps to remove the text Record No: and than trim the whitespace to obtain the value.
Is there a better solution? I have to do this method a few items and there are numerous entries on each page using a similar displayed above.