Authentication while scraping with BeautifulSoup 4

Asked Jan 23 '17 at 17:17

Active Jan 23 '17 at 17:30

Viewed 751 times

I am having trouble understanding the processes in finding out how to scrape a web sight that I have access to. I would like to scrape my College email but it seems to be blocking the program form running properly.

from bs4 import BeautifulSoup
import requests

url = ""

response = requests.get(url)
soup = BeautifulSoup(response.text, "html.parser")

links = soup.find_all('hpc-numeric hpc-positive')

for link in links:
    print(link.text)
    print(link['hpc-table-row hpc-even'])

I know I will need to add some thing to let the server know I am who I say I am, I just don't know how.

edited Jan 23 '17 at 17:30

coder

12,832
5
39
53

asked Jan 23 '17 at 17:17

Nick Read

If it is your email, then you have to login first. Take a look at this answer for more info: https://stackoverflow.com/questions/11892729/how-to-log-in-to-a-website-using-pythons-requests-module – coder Jan 23 '17 at 17:32
As @coder points out to you in the similar question, you need your scraper to first authenticate itself to the website to be able to scrape your emails. – Fruitspunchsamurai Jan 23 '17 at 17:37

Authentication while scraping with BeautifulSoup 4

0 Answers0