I am trying something that I found on another answer but I am having some problems:
I know that there are better regex for URLs but consider this for example:
@links=($content =~ m/(https?)?.*[.]com/g);
*$content has text or html
The part (https?)?
is for links like www.google.com
, but having the parenthesis it returns "http"
to $1
which is put into @links
! That is a problem, since I want the whole link.
What would globally extract simple links (or whatever regex is specified) from text and put them into a list?
By simple, I mean:
http://www.google.com
www.google.com
google.com
https://www.google.com