I am trying to scrape a page that has Hebrew text on it. It contains the following piece of HTML:
<div id="AgeRating">דירוג גיל: 12+</div>
I just want the 12+ part here (in fact: I only want the '12' part). I am currently doing to with this piece of regex for other languages:
new Regex(@"<div id=""AgeRating"">.*(\d{1,2})\+</div>", RegexOptions.Compiled);
But I just can't get this to match. I tried all the regex options like RightToLeft, CultureInvariant, SingleLine, MultiLine, etc. but nothing works. It does work fine with plenty other languages though.
Note: I'm aware of HtmlAgilityPack for proper parsing of HTML. This is question about why seemingly correct RegEx fails to match particular string (as this a sample I have currently).