I am trying to work on a pet project that needs me to crawl through a list of Wikipedia: Picture of the day pages by month. As an example: https://en.wikipedia.org/wiki/Wikipedia:Picture_of_the_day/May_2004 has a list of images followed by a brief caption for each image. I want to do the following 2 things here:
- Scrape all the images from the page and the respective caption. (Preferably a dictionary to store an Image: Caption pair)
- Crawl through other months and repeat 1.
Any help on how to accomplish this would be highly appreciated.
Thank you very much.