So I am trying to build a web crawler. I have started by passing the request and getting all the HTML of the page in response.
Next I thought of using regular expressions for extracting links from the HTML page. However the more I try to learn them the more tricky them seem.
Are there any alternatives to regular expressions (it may seem a discussion question but it is not I have searched the internet and haven't found a satisfactory answer).