Can't find span by class while scraping with requests and beautiful soup

Question

I try to scrape all bonus items of a supermarket. After inspecting the HTML code I found the name of each bonus in a span with class named "line-clamp_root__3yA0X line-clamp_active__2502b"

However, when I try to find this spand by class name I can't find it. Here is my code:

import requests
from bs4 import BeautifulSoup
    
url='https://www.ah.nl/bonus'
    
response = requests.get(url)
soup = BeautifulSoup(response.content, 'html.parser')
    
soup.find_all('span', {'class': 'line-clamp_root__3yA0X line-clamp_active__2502b'})

Output is [ ]

Does anyone have an idea what I am doing wrong?

Many thanks in advance!

Ps. My final goal is to scrape all bonus item names :)

score 0 · Answer 1 · answered Nov 05 '21 at 12:44

0

That class attribute has two classes in it. To select elements using two classes, you'll either need to match the exact value of the full attribute using _=:

soup.find_all('span', class_='line-clamp_root__3yA0X line-clamp_active__2502b')

Or you'll need to use a CSS selector:

soup.find_all('span.line-clamp_root__3yA0X.line-clamp_active__2502b')

answered Nov 05 '21 at 12:44

Sean

6,873
4
21
46

Thanks for your quick response. I just tried both options, but neither seems to work in this case. Unfortunately both options outputs the same [ ] – Barry Nov 05 '21 at 12:55
Those elements are loaded by javascript after the initial page load. You may need to use a different scraping library: https://stackoverflow.com/questions/2148493/scrape-html-generated-by-javascript-with-python – Sean Nov 05 '21 at 12:59

Can't find span by class while scraping with requests and beautiful soup

1 Answers1