6

How to disable images/CSS in Puppeteer? I've seen this tutorial https://www.scrapehero.com/how-to-increase-web-scraping-speed-using-puppeteer/ but I don't know how to translate it to Python

Leo
  • 370
  • 7
  • 12

2 Answers2

13

Based on example from https://github.com/miyakogi/pyppeteer/blob/dev/pyppeteer/page.py#L312:

await page.setRequestInterception(True)
async def intercept(request):
    if any(request.resourceType == _ for _ in ('stylesheet', 'image', 'font')):
        await request.abort()
    else:
        await request.continue_()
page.on('request', lambda req: asyncio.ensure_future(intercept(req)))
mbit
  • 2,763
  • 1
  • 10
  • 16
3

This below code will disable resource by type: fetch, image, media, and font.

    page.setRequestInterception(true)

    page.on ( 'request', async request => {
        if ( request.resourceType () === 'fetch' || request.resourceType () === 'image' || request.resourceType () === 'media' || request.resourceType () === 'font' ) {
            request.abort ()
        } else {
            request.continue ()
        }
    })
Edi Imanto
  • 2,119
  • 1
  • 11
  • 17