I need to scrape this page (the ads): https://www.sahibinden.com/en/cars/used?date=1day&a5_min=2005&a5_max=2020
When I open it too many times I get blocked, changing the IP doesn't help either. The problem is that when I open this page from browser from my PC it works just fine. But it seems to get blocked from webkit.
await page.route("**/*", (route) => {
if (!firstReq) route.continue();
else {
firstReq = false;
route.continue({
method: method,
postData: data,
headers: headers,
});
}
});
let pageRes = await page.goto(url);
await page.waitForNavigation()
await page.unroute("**/*");
return pageRes;
I realize that it is site trying to block bots but what are the practices to avoid that. I have tried waits, ip rotation as well as useragent rotation - nothing seems to be working