I'm very new to regex and haven't found an explanation descriptive enough to get me to a solution.
I use a script that scrapes HTML from Yahoo Finance to get financial options table data. Yahoo recently changed their HTML and the old algorithm no longer works. The old expression was the following:
Main_Pattern = '.*?</table><table[^>]*>(.*?)</table';
Tables = regexp(urlText, Main_Pattern, 'tokens');
Where Tables used to return data, it no longer does. Inspecting the HTML suggests to me that the data is no longer in <table>, but rather in <tbody> ...
My question is "what does the Main_Pattern regex mean in layman's terms?" I'm trying to figure out how to modify that expression so that it is applicable to the current HTML.
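For anyone who wants to reproduce this, here is a minimal sketch of how the old expression behaves. The urlText string is a made-up stand-in that I wrote for illustration, not actual Yahoo markup, so it only shows the kind of page layout the pattern seems to assume (two tables back to back).

```matlab
% Hypothetical stand-in for the downloaded page (NOT real Yahoo HTML):
% two tables placed directly one after the other.
urlText = ['<table id="a"><tr><td>x</td></tr></table>' ...
           '<table class="b"><tr><td>1.23</td></tr></table>'];

% The old expression, unchanged.
Main_Pattern = '.*?</table><table[^>]*>(.*?)</table';

% 'tokens' returns a cell array with one cell per match; each of those
% holds the text captured by the parenthesised group (.*?).
Tables = regexp(urlText, Main_Pattern, 'tokens');

% Show the captured contents of the first match.
disp(Tables{1}{1})
```

On this sample the single match captures the contents of the second table, i.e. the rows between its opening tag and its closing </table. If the page has no </table> followed immediately by <table, Tables comes back empty, which looks like what I am seeing now.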