0

Im working on a project for which I need the content of an HTML page given ts url.

Im doing something like this

con=urllib.request.urlopen(url)
a=con.read()
con.close()

soup = BeautifulSoup(a)


print(soup.get_text())

But the problem is Im getting all the java script , and other things as well. I just need the displayed content of a webpage. Any pointers on how to go about it?

ashish g
  • 429
  • 1
  • 7
  • 16

0 Answers0