Lets say we want to extract the link in a tag like this:
input:
<p><a href="http://www.google.com/home/etc"><b>some text</b></a></p>
desired output:
http://www.google.com/home/etc
the first solution is to find the match with reference using this href=[\'"]?([^\'" >]+)
regex
but what I want to achieve is to match the link followed by href. so trying this (?=href\")...
(lookahead assertion: matches without consuming) is still matching the href
itself.
It is a regex only question.