I'm using Scrapy to scrape a website. The item page that I want to scrape looks like: http://www.somepage.com/itempage/&page=x. Where x
is any number from 1
to 100
. Thus, I have an SgmlLinkExractor
Rule with a callback function specified for any page resembling this.
The website does not have a listpage with all the items, so I want to somehow well scrapy to scrape those urls (from 1
to 100
). This guy here seemed to have the same issue, but couldn't figure it out.
Does anyone have a solution?