0

Currently I am only getting the links in the header, i cannot receive the links that are apart of the updated body.

private static void HTMLtoLinkList(string HTMLTEXT)
{
    var LParser = new Regex(@"\b(?:https?://|www\.)\S+\b", RegexOptions.Compiled | RegexOptions.IgnoreCase);
    foreach (Match m in LParser.Matches(HTMLTEXT))
    {
        Console.WriteLine(m);
        Console.ReadLine();            
    }
}
CC Inc
  • 5,842
  • 3
  • 33
  • 64
JinX
  • 1
  • 1
  • 3
    Can you include samples of text it will match and samples you want it to match? – CC Inc Dec 09 '16 at 00:14
  • 1
    Try parsing the html instead with https://htmlagilitypack.codeplex.com/ . Also relevant: http://stackoverflow.com/a/1732454/4665 and http://stackoverflow.com/a/590789/4665 – Jon P Dec 09 '16 at 00:47

0 Answers0