403 Forbidden error scraping using BeautifulSoup with Mozilla Headers

Question

I am trying to scrape news headlines and links using BeautifulSoup from https://www.blade.com/news but keep running into HTTPError: HTTP Error 403: Forbidden.

I am using headers and requests, but still get the error, here is my code:

from urllib.request import urlopen, Request
from bs4 import BeautifulSoup 
import datetime

blade = "https://www.blade.com/news"
req = Request(blade, headers={'User-Agent':'Mozilla/5.0'})
bladehtml = urlopen(req)
soup3 = BeautifulSoup(bladehtml, 'lxml')
type(soup3)

I am not sure why I keep getting the error with the browser header in place. Any help would be appreciated!

teja chintham · Answer 1 · 2021-08-27T21:27:55.297

1

The Website you are trying to scrape is cloudfire protected,Use selenium if in a hurry this might help Python - Request being blocked by Cloudflare

edited Aug 27 '21 at 21:27

answered Aug 27 '21 at 21:22

teja chintham

66
4

403 Forbidden error scraping using BeautifulSoup with Mozilla Headers

1 Answers1