0

I would like to scrape innertext from html block using regex.

Here is the block along with my best parsing effort which is far from perfect: http://www.regexr.com/3andc

What I would like to extract is the string "10 Extremely Popular Youtubers Who Are Now Millionaires"

Thanks.

Andy Lester
  • 91,102
  • 13
  • 100
  • 152
hvs
  • 518
  • 1
  • 5
  • 21

1 Answers1

2

Please do not use regex to scrape HTML... RegEx match open tags except XHTML self-contained tags

maybe use HTMLAgilityPack? https://htmlagilitypack.codeplex.com/

Edit :

Then try this : http://regexr.com/3ane7

Community
  • 1
  • 1
Bruno
  • 4,685
  • 7
  • 54
  • 105
  • Not an option in this case. I am using a proprietary piece of software that will only accept regex input – hvs Mar 30 '15 at 18:18