I get this warning when running scrapy :
[scrapy] WARNING: Disabled HttpCacheMiddleware: Unable to find scrapy.cfg file to infer project data dir
I get this warning when running scrapy :
[scrapy] WARNING: Disabled HttpCacheMiddleware: Unable to find scrapy.cfg file to infer project data dir
Where is your scrapy.cfg
file? Are you running from the command line? If so, you should be running from the same directory as scrapy.cfg
.
What does your project directory look like. It shoud look something like:
.
├── dirbot
│ ├── __init__.py
│ ├── items.py
│ ├── pipelines.py
│ ├── settings.py
│ └── spiders
│ ├── dmoz.py
│ └── __init__.py
├── README.rst
└── scrapy.cfg
UPDATE
If you are using cron
, try not to (meaning don't) use the root
user. You can add the PATH info right into the crontab: How to get CRON to call in the correct PATHs
PATH=/usr/local/sbin:...:/absolute/path/to/my/scrapy/project