Scrapy Crawlera was working just well in my Windows machine, but it gets error 111 when I run it in my linux server. Why is that?
When I use curl, I got this error:
curl: (7) Failed connect to proxy.crawlera.com:8010; Connection refused
Scrapy Crawlera was working just well in my Windows machine, but it gets error 111 when I run it in my linux server. Why is that?
When I use curl, I got this error:
curl: (7) Failed connect to proxy.crawlera.com:8010; Connection refused
It turned out when dealing with ports, CPanel (or maybe Linux?) blocks ports by default if it is not whitelisted in the firewall. I opened it via WHM since I use CPanel, and everything works fine now.
It has most likely nothing to do with Windows or Linux, but with the user-agent of Scrapy. Try doing sth. like this in settings.py
:
USER_AGENT = 'Mozilla/5.0 (X11; Linux x86_64; rv:7.0.1) Gecko/20100101 Firefox/7.7