AttributeError: 'module' object has no attribute 'urlopen'

Question

I'm trying to use Python to download the HTML source code of a website but I'm receiving this error.

Traceback (most recent call last):  
    File "C:\Users\Sergio.Tapia\Documents\NetBeansProjects\DICParser\src\WebDownload.py", line 3, in <module>
     file = urllib.urlopen("http://www.python.org")
AttributeError: 'module' object has no attribute 'urlopen'

I'm following the guide here: http://www.boddie.org.uk/python/HTML.html

import urllib

file = urllib.urlopen("http://www.python.org")
s = file.read()
f.close()

#I'm guessing this would output the html source code?
print(s)

I'm using Python 3.

score 338 · Accepted Answer · edited Apr 27 '19 at 10:13

338

This works in Python 2.x.

For Python 3 look in the docs:

import urllib.request

with urllib.request.urlopen("http://www.python.org") as url:
    s = url.read()
    # I'm guessing this would output the html source code ?
    print(s)

edited Apr 27 '19 at 10:13

vvvvv

25,404
19
49
81

answered Oct 19 '10 at 15:07

eumiro

207,213
34
299
261

4

Hi Eumiro, using the 'with' statement in Python I'm guessing it closes the connection automatically once it's done using it? Similar to a use statement in C#? – Oct 19 '10 at 15:13
@Sergio: exactly! And through the indentation you see where your file is still opened. – eumiro Oct 19 '10 at 15:16
Hello @eumiro, I have an error of "IndentationError: expected an indented block" when I type `s = url.read()`, may I ask how can I solve it please? x – Karen Chan Mar 25 '16 at 04:38
@KarenChan you're missing an indent before `s=url.read()`; have you got 4 spaces before it? – numbermaniac Jun 12 '17 at 05:46
I get `HTTPError: HTTP Error 503: Service Unavailable` when I use your method – Mona Jalal Apr 05 '21 at 23:05

Martin Thoma · Answer 2 · 2016-05-20T10:15:08.717

27

A Python 2+3 compatible solution is:

import sys

if sys.version_info[0] == 3:
    from urllib.request import urlopen
else:
    # Not Python 3 - today, it is most likely to be Python 2
    # But note that this might need an update when Python 4
    # might be around one day
    from urllib import urlopen


# Your code where you can use urlopen
with urlopen("http://www.python.org") as url:
    s = url.read()

print(s)

edited May 20 '16 at 10:15

answered Dec 07 '15 at 07:11

Martin Thoma

124,992
159
614
958

1

`with urlopen("http://www.python.org") as url:` does not work in python2 with `AttributeError: addinfourl instance has no attribute '__exit__'`. Need to write `url = urlopen("http://www.python.org") ` – orshachar Apr 22 '18 at 12:16

score 21 · Answer 3 · answered Apr 13 '15 at 17:15

21

import urllib.request as ur
s = ur.urlopen("http://www.google.com")
sl = s.read()
print(sl)

In Python v3 the "urllib.request" is a module by itself, therefore "urllib" cannot be used here.

answered Apr 13 '15 at 17:15

Manu Mariaraj

211
2
3

Steven B. Peutz · Answer 4 · 2018-09-13T14:46:33.583

10

To get 'dataX = urllib.urlopen(url).read()' working in python3 (this would have been correct for python2) you must just change 2 little things.

1: The urllib statement itself (add the .request in the middle):

dataX = urllib.request.urlopen(url).read()

2: The import statement preceding it (change from 'import urlib' to:

import urllib.request

And it should work in python3 :)

edited Sep 13 '18 at 14:46

answered Jun 14 '17 at 14:33

Steven B. Peutz

141
1
8

score 9 · Answer 5 · answered Feb 22 '20 at 23:56

Change TWO lines:

import urllib.request #line1

#Replace
urllib.urlopen("http://www.python.org")
#To
urllib.request.urlopen("http://www.python.org") #line2

If You got ERROR 403: Forbidden Error exception try this:

siteurl = "http://www.python.org"

req = urllib.request.Request(siteurl, headers={'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.100 Safari/537.36'})
pageHTML = urllib.request.urlopen(req).read()

I hope your problem resolved.

score 3 · Answer 6 · answered Apr 27 '15 at 04:43

3

import urllib.request as ur

filehandler = ur.urlopen ('http://www.google.com')
for line in filehandler:
    print(line.strip())

answered Apr 27 '15 at 04:43

Kamran

31
3

score 1 · Answer 7 · answered Jul 13 '18 at 07:38

For python 3, try something like this:

import urllib.request
urllib.request.urlretrieve('http://crcv.ucf.edu/THUMOS14/UCF101/UCF101/v_YoYo_g19_c02.avi', "video_name.avi")

It will download the video to the current working directory

I got help from HERE

score 1 · Answer 8 · answered Jul 17 '18 at 07:46

1

Solution for python3:

from urllib.request import urlopen

url = 'http://www.python.org'
file = urlopen(url)
html = file.read()
print(html)

answered Jul 17 '18 at 07:46

Banjali

181
1
5

Simple and easy to understand for beginners. Thanks – SHR Nov 05 '18 at 22:25

score 0 · Answer 9 · edited Sep 23 '21 at 21:04

0

If your code uses Python version 2.x, you can do the following:

from urllib.request import urlopen
urlopen(url)

By the way, I suggest another module called requests, which is more friendly to use. You can use pip install it, and use it like this:

import requests
requests.get(url)
requests.post(url)

edited Sep 23 '21 at 21:04

Xiddoc

3,369
3
11
37

answered Aug 20 '18 at 10:22

jason.lu

1

score 0 · Answer 10 · edited Jun 19 '19 at 15:57

0

import urllib
import urllib.request
from bs4 import BeautifulSoup


with urllib.request.urlopen("http://www.newegg.com/") as url:
    s = url.read()
    print(s)
soup = BeautifulSoup(s, "html.parser")
all_tag_a = soup.find_all("a", limit=10)

for links in all_tag_a:
    #print(links.get('href'))
    print(links)

edited Jun 19 '19 at 15:57

MrPromethee

721
9
18

answered Jun 19 '19 at 15:12

user11649630

23
5

score 0 · Answer 11 · answered Aug 28 '19 at 16:46

0

One of the possible way to do it:

import urllib
...

try:
    # Python 2
    from urllib2 import urlopen
except ImportError:
    # Python 3
    from urllib.request import urlopen

answered Aug 28 '19 at 16:46

Vasyl Lyashkevych

1,920
2
23
38

score 0 · Answer 12 · edited Sep 24 '21 at 09:32

0

Use the third-party six module to make your code compatible between Python2 and Python3.

from six.moves import urllib
urllib.request.urlopen("<your-url>")

edited Sep 24 '21 at 09:32

Alex Waygood

6,304
3
24
46

answered May 19 '20 at 16:24

Rajat Shukla

17
2
6

1

You can import six module in this way from six.moves import urllib – Rajat Shukla Jun 03 '20 at 20:04

score -3 · Answer 13 · edited May 07 '21 at 01:56

-3

imgResp = urllib3.request.RequestMethods.urlopen(url)

Add this RequestMethods before using urlopen

edited May 07 '21 at 01:56

Buddy Bob

5,829
1
13
44

answered May 06 '21 at 12:48

khadersha shaik

1

AttributeError: 'module' object has no attribute 'urlopen'

13 Answers13

Linked

Related