According to this question How Scrapy filters the crawled urls?, there is a file called requests.seen
in the directory defined by the JOBDIR variable
Please where can I find the JOBDIR variable ?
According to this question How Scrapy filters the crawled urls?, there is a file called requests.seen
in the directory defined by the JOBDIR variable
Please where can I find the JOBDIR variable ?
According to official tutorial(Jobs: pausing and resuming crawls) JOBDIR can be set from command line:
scrapy crawl somespider -s JOBDIR=crawls/somespider-1