I have an issue with Scrapy
, Crawlera
and Splash
when trying the fetch responses from this site.
I tried the following without luck:
- pure
Scrapy
shell - times out Scrapy
+Crawlera
- times out- Scrapinghub
Splash
instance (small) - times out
However I can scrape the site with the Firefox webdriver of Selenium
. But I want to move away from that and use Splash
instead.
Is there a workaround to avoid these timeouts?
NOTE:
If I use local Splash
instances set up by aquarium the site loads, though it still takes 20+ seconds compared to the Firefox webdriver's 10 seconds.