I am trying to parse text from web pages, starting at this page. This page links to the final pages (to save coding effort, those links could also be copied into a text file by hand). On a final page there is a page index on the left-hand side, and each page also has its own page index in its top section. From these item lists I only need to extract the lines starting with 'Configuring', 'Configuration Examples' or 'Example'.
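To make the filtering step concrete, here is a minimal sketch of what I have in mind. It assumes the index entries are ordinary <a href="..."> links, which I have not verified for the real pages. It uses the standard-library html.parser rather than BeautifulSoup, only so that it runs without installing anything. The sample HTML, the example.com URL, and the names IndexLinkParser and matching_entries are all made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# The three prefixes named in the question.
PREFIXES = ("Configuring", "Configuration Examples", "Example")


class IndexLinkParser(HTMLParser):
    """Collect (text, absolute_url) for every <a href=...> in a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._href = None   # set while we are inside an <a> element
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse runs of whitespace inside the link text.
            text = " ".join("".join(self._text).split())
            self.links.append((text, self._href))
            self._href = None


def matching_entries(html, base_url):
    """Return the links whose text starts with one of PREFIXES."""
    parser = IndexLinkParser(base_url)
    parser.feed(html)
    parser.close()
    return [(t, u) for t, u in parser.links if t.startswith(PREFIXES)]


# Hypothetical page index, standing in for the real one.
SAMPLE = """
<ul>
  <li><a href="#intro">Information About Foo</a></li>
  <li><a href="#cfg">Configuring Foo</a></li>
  <li><a href="#ex">Configuration Examples for Foo</a></li>
  <li><a href="#ex1">Example: Basic Foo</a></li>
</ul>
"""

for text, url in matching_entries(SAMPLE, "https://example.com/guide/page.html"):
    print(text, url, sep="\t")
```

The prefix test is a plain str.startswith with a tuple, so adding or removing a prefix is a one-line change.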
This task looks simple when done manually, but it is daunting and hard to keep track of. Is there a tool that can crawl the pages and log the items in hierarchical order as it finds them? Ideally the output would be in some simple format that includes the hyperlink, or at least a plain tab-separated text file.
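As a rough illustration of the output I am after, the sketch below walks links depth-first and writes one tab-separated row per link with its depth, text and URL. It is only a sketch under assumptions: the two-page FAKE_SITE dictionary and the example.com URLs are invented so that it runs offline, and the names LinkCollector, crawl and to_tsv are hypothetical. The fetch argument is whatever function returns the HTML for a URL.

```python
import csv
import io
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkCollector(HTMLParser):
    """Collect (text, absolute_url) pairs for every <a href> on a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a" and dict(attrs).get("href"):
            self._href = urljoin(self.base_url, dict(attrs)["href"])
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((" ".join("".join(self._text).split()), self._href))
            self._href = None


def crawl(url, fetch, max_depth=2, depth=0, seen=None):
    """Depth-first walk; yields (depth, text, url) in the order links are found."""
    seen = set() if seen is None else seen
    page = urldefrag(url).url  # ignore #fragments when tracking visited pages
    if page in seen or depth > max_depth:
        return
    seen.add(page)
    collector = LinkCollector(page)
    collector.feed(fetch(page))
    collector.close()
    for text, link in collector.links:
        yield depth, text, link
        yield from crawl(link, fetch, max_depth, depth + 1, seen)


def to_tsv(rows):
    """Format rows as tab-separated text with a header line."""
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow(["depth", "text", "url"])
    writer.writerows(rows)
    return out.getvalue()


# Hypothetical two-page site held in memory so the sketch runs offline.
FAKE_SITE = {
    "https://example.com/start.html":
        '<a href="guide.html">Foo Guide</a>',
    "https://example.com/guide.html":
        '<a href="#cfg">Configuring Foo</a><a href="#ex">Example: Basic Foo</a>',
}

rows = list(crawl("https://example.com/start.html", lambda u: FAKE_SITE.get(u, "")))
print(to_tsv(rows))
```

For real pages, fetch could presumably be something like a small wrapper around urllib.request.urlopen that returns the decoded body, and the rows could be filtered by prefix before writing. The crawl here follows every link it finds, so a real run would likely need a same-site check as well as the depth limit.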
The information on the web pages is public and downloadable. If it is hard to extract over the web, I could also download the pages and work on them offline.
I researched this and looked at LinksGrabber, WebParser, BeautifulSoup, and parsing text with regex, any of which might work with some tweaking. But I am still a few light-years away from implementing this idea.
Is what I am trying to do achievable with Python, and what would be a realistic way to approach it?
PS: I understand this is web scraping, but I am doing this purely for personal education; it has no commercial value or affiliation.