How do I read image data from a URL in Python?

Question

What I'm trying to do is fairly simple when we're dealing with a local file, but the problem comes when I try to do this with a remote URL.

Basically, I'm trying to create a PIL image object from a file pulled from a URL. Sure, I could always just fetch the URL and store it in a temp file, then open it into an image object, but that feels very inefficient.

Here's what I have:

Image.open(urlopen(url))

It flakes out complaining that seek() isn't available, so then I tried this:

Image.open(urlopen(url).read())

But that didn't work either. Is there a Better Way to do this, or is writing to a temporary file the accepted way of doing this sort of thing?

See also: [How to save an image locally using Python whose URL address I already know?](http://stackoverflow.com/q/8286352/562769) — Martin Thoma, Mar 14 '16 at 11:25
There must be an issue where the requests is not able to fetch the image from the url. Try the same ( just for testing purpose) from another url. — Aashish Chaubey, Nov 17 '21 at 11:46

Andres Kull · Answer 1 · 2014-05-06T08:30:57.507

437

In Python3 the StringIO and cStringIO modules are gone.

In Python3 you should use:

from PIL import Image
import requests
from io import BytesIO

response = requests.get(url)
img = Image.open(BytesIO(response.content))

edited May 06 '14 at 08:30

answered May 06 '14 at 08:21

Andres Kull

4,756
2
15
13

1

How to get back the image from response.content ? – Amresh Giri Feb 07 '19 at 11:33
`requests` package throws 503 status code while fetching an image from a URL. Instead, I had to resort to `http.client` to get the image. – MSS Aug 08 '19 at 06:46
When I try this I get: AttributeError: module 'requests' has no attribute 'get'. – apiljic Aug 26 '19 at 23:21
37

Manually wrapping in BytesIO is no longer needed since PIL >= 2.8.0. Just use ```Image.open(response.raw)```. PIL automatically checks for that now and does the BytesIO wrapping under the hood. From: https://pillow.readthedocs.io/en/3.0.x/releasenotes/2.8.0.html – Vinícius M Feb 06 '20 at 15:21
@ViníciusM your answer should be at the top! thank you – alanho Jul 26 '20 at 10:53
2

Kinda annoying that 3 libraries are necessary... Pillow should just add this functionality! – Nic Scozzaro Apr 29 '21 at 22:15

score 180 · Accepted Answer · edited Apr 29 '21 at 22:20

180

The following works for Python 3:

from PIL import Image
import requests

im = Image.open(requests.get(url, stream=True).raw)

References:

edited Apr 29 '21 at 22:20

Nic Scozzaro

6,651
3
42
46

answered Dec 03 '16 at 04:14

Giovanni Cappellotto

4,597
1
30
33

5

urllib2 was for Python2 I think, which is outdated. For Python 3 it's urllib.requests: `urllib.request.urlopen(url).read()` – wordsforthewise Jan 24 '21 at 01:03
2

As mentioned by @wordsforthewise urllib is outdated. I used the second option as I was using 'requests' anyway in my code and it worked, so upvoting. Should the urllib part of the solution be removed so that readers don't spend time on trying the first approach just to realize that it doesn't work and then move to the next one? – Mugdha Mar 04 '21 at 13:40
1

Hi this worked great for my project! Just wondering, does this build up any buffer or cache? Do I need to close these images / clear anything? – Pissed Off Banker Dec 14 '22 at 17:21

score 173 · Answer 3 · edited Mar 24 '21 at 16:16

173

Using a StringIO

import urllib, cStringIO

file = cStringIO.StringIO(urllib.urlopen(URL).read())
img = Image.open(file)

edited Mar 24 '21 at 16:16

iacob

20,084
6
92
119

answered Sep 12 '11 at 18:07

Fábio Diniz

10,077
3
38
45

Thanks, would just like to add that the same exact code will work with urllib2 (with Python2) – sofly Dec 03 '14 at 18:59
22

in python 3 it would be from urllib.request import urlopen and io.io.BytesIO instead of StringIO – matyas Dec 13 '16 at 16:38
2

HELP, IOError: cannot identify image file <_io.BytesIO object at 0x7fb91b6a29b0> my url is: ...model=product.template&id=16&field=image_medium – С. Дэлгэрцэцэг Sep 03 '18 at 12:43

score 61 · Answer 4 · edited Mar 24 '21 at 16:16

61

Using requests:

from PIL import Image
import requests
from StringIO import StringIO

response = requests.get(url)
img = Image.open(StringIO(response.content))

edited Mar 24 '21 at 16:16

iacob

20,084
6
92
119

answered Oct 23 '12 at 06:19

Saurav

3,096
3
19
12

3

For some reason urllib didn't work for some URLs, but requests worked where that failed – mirri66 Mar 04 '15 at 08:17
I couldn't find the PIL package, but it looks like pillow have taken over the PIL effort and you can install for python3 with `pip3.4 install pillow`. – disruptive Nov 23 '15 at 09:02
3

Note that requests will load the entire response into memory, and then PIL will load the entire thing again as an image, so you have two full copies resident in memory. The previous answer using urllib method streams the data, so you only end up with one copy plus the streaming buffer size. You can stream the data with requests too, but because the response does not support read() semantics, you would have to build an adapter. – sirdodger Feb 02 '16 at 22:14
@sirdodger Are you referring to urllib2 or urllib? – CMCDragonkai May 28 '18 at 02:59
@CMCDragonkai I was referring to the accepted urllib answer. If memory overhead is a concern, it is better than using this requests answer. (However, like I mentioned, a different solution using requests could achieve the same effect.) – sirdodger May 29 '18 at 08:01
@sirdodger PIL Apparently now supports the streaming too. =) https://pillow.readthedocs.io/en/3.0.x/releasenotes/2.8.0.html – Vinícius M Feb 06 '20 at 15:24

Miladiouss · Answer 5 · 2019-01-30T04:28:51.537

50

Python 3

from urllib.request import urlopen
from PIL import Image

img = Image.open(urlopen(url))
img

Jupyter Notebook and IPython

import IPython
url = 'https://newevolutiondesigns.com/images/freebies/colorful-background-14.jpg'
IPython.display.Image(url, width = 250)

Unlike other methods, this method also works in a for loop!

edited Jan 30 '19 at 04:28

answered Sep 05 '18 at 05:18

Miladiouss

4,270
1
27
34

Dan D. · Answer 6 · 2023-03-28T03:18:02.647

30

This answer was written for Python 2.7.

For Python 3, urlopen was moved from urllib to urllib.requests. And StringIO.StringIO was replaced by io.BytesIO.

Use StringIO to turn the read string into a file-like object:

from StringIO import StringIO
from PIL import Image
import urllib

Image.open(StringIO(urllib.urlopen(url).read()))

edited Mar 28 '23 at 03:18

answered Sep 12 '11 at 18:06

Dan D.

73,243
15
104
123

1

This response is clean and helpful, but the import statement should read ```from io import StringIO``` – Vid Stropnik Mar 23 '23 at 09:07

john-hen · Answer 7 · 2019-06-19T01:09:26.127

26

The arguably recommended way to do image input/output these days is to use the dedicated package ImageIO. Image data can be read directly from a URL with one simple line of code:

from imageio import imread
image = imread('https://cdn.sstatic.net/Sites/stackoverflow/img/logo.png')

Many answers on this page predate the release of that package and therefore do not mention it. ImageIO started out as component of the Scikit-Image toolkit. It supports a number of scientific formats on top of the ones provided by the popular image-processing library PILlow. It wraps it all in a clean API solely focused on image input/output. In fact, SciPy removed its own image reader/writer in favor of ImageIO.

edited Jun 19 '19 at 01:09

answered Jun 18 '19 at 13:21

john-hen

4,410
2
23
40

Very slow. skimage methods would be better option if you want to do in one line – Anupam Tripathi Sep 03 '21 at 00:06
2

This *is* the skimage (Scikit-Image) method, as the answer explains. And it's as slow as your internet connection. – john-hen Sep 20 '21 at 09:44

score 24 · Answer 8 · answered Oct 17 '15 at 15:59

For those doing some sklearn/numpy post processing (i.e. Deep learning) you can wrap the PIL object with np.array(). This might save you from having to Google it like I did:

from PIL import Image
import requests
import numpy as np
from StringIO import StringIO

response = requests.get(url)
img = np.array(Image.open(StringIO(response.content)))

Shivid · Answer 9 · 2018-05-30T20:53:32.897

select the image in chrome, right click on it, click on Copy image address, paste it into a str variable (my_url) to read the image:

import shutil
import requests

my_url = 'https://www.washingtonian.com/wp-content/uploads/2017/06/6-30-17-goat-yoga-congressional-cemetery-1-994x559.jpg'
response = requests.get(my_url, stream=True)
with open('my_image.png', 'wb') as file:
    shutil.copyfileobj(response.raw, file)
del response

open it;

from PIL import Image

img = Image.open('my_image.png')
img.show()

score 4 · Answer 10 · answered Aug 24 '20 at 18:39

Manually wrapping in BytesIO is no longer needed since PIL >= 2.8.0. Just use Image.open(response.raw)

Adding on top of Vinícius's comment:

You should pass stream=True as noted https://requests.readthedocs.io/en/master/user/quickstart/#raw-response-content

So

img = Image.open(requests.get(url, stream=True).raw)

score 2 · Answer 11 · edited Jul 07 '23 at 12:28

USE urllib.request.urlretrieve() AND PIL.Image.open() TO DOWNLOAD AND READ IMAGE DATA :

import requests
import urllib.request
import PIL

urllib.request.urlretrieve("https://i.imgur.com/ExdKOOz.png", "sample.png")
img = PIL.Image.open("sample.png")
img.show()

or Call requests.get(url) with url as the address of the object file to download via a GET request. Call io.BytesIO(obj) with obj as the content of the response to load the raw data as a bytes object. To load the image data, call PIL.Image.open(bytes_obj) with bytes_obj as the bytes object:

import io

response = requests.get("https://i.imgur.com/ExdKOOz.png")
image_bytes = io.BytesIO(response.content)
img = PIL.Image.open(image_bytes)
img.show()

score 2 · Answer 12 · answered Nov 16 '21 at 13:09

from PIL import Image
import cv2
import numpy as np
import requests
image=Image.open(requests.get("https://previews.123rf.com/images/darrenwhi/darrenwhi1310/darrenwhi131000024/24022179-photo-of-many-cars-with-one-a-different-color.jpg", stream=True).raw)
#image =resize((420,250))

image_array=np.array(image)
image

score 1 · Answer 13 · answered Oct 24 '20 at 03:10

1

To directly get image as numpy array without using PIL

import requests, io
import matplotlib.pyplot as plt 

response = requests.get(url).content
img = plt.imread(io.BytesIO(response), format='JPG')
plt.imshow(img)

answered Oct 24 '20 at 03:10

AdithyaM

11
2

score 0 · Answer 14 · answered Mar 03 '23 at 05:10

The solutions mentioned above might work, but it misses one point that I would like to highlight i.e. when we fetch or retrieve the image url to read, we might not always get the actual image content if we don't pass the headers while making the get request.

for example:

request without Headers

import requests
url = "https://www.roaringcreationsfilms.com/rcsfilms-media/chankya-quotes-in-hindi-32.jpg"
data = requests.get(url).content

if we check the data:

print(data)
b'<head><title>Not Acceptable!</title></head><body><h1>Not Acceptable!</h1><p>An 
appropriate representation of the requested resource could not be found on this server.
This error was generated by Mod_Security.</p></body></html>'

you see, we don't actually get the content of the image.

request with Headers

import requests
url = "https://www.roaringcreationsfilms.com/rcsfilms-media/chankya-quotes-in-hindi-32.jpg"
headers = {"User-Agent": "PostmanRuntime/7.31.1"}
data = requests.get(url, headers=headers).content

and, if we now check the data:

print(data)
b'\xff\xd8\xff\xe0\x00\x10JFIF\x00\x01\x01\x00\x00\x01\x00\x01\x00\x00\xff\xdb\x00C\x00\t\x06\x06\........\xfb\x04El\xb3\xa8L\xbc\xa12\xc6<\xc4\x891\xf2L|\xf7\x9eV\x18\xc5\xd8\x8f\x02\xca\xdc\xb1c+-\x96\'\x86\xcb,l\xb12;\x16\xd4j\xfd/\xde\xbf\xff\xd9'

Now, we get the actual content of the image.

Things to note are that different urls might require different combinations of the headers (such as "User-Agent", "Accept", "Accept-Encoding", etc.) to successfully get the data and some even might not require any headers. But it's always a good practice to pass "User-Agent" as a minimum required header while making the request.

score 0 · Answer 15 · answered Apr 27 '23 at 09:53

For Python 3 using OpenCV:

import cv2
from urllib.request import urlopen

image_url = "IMAGE-URL-GOES-HERE"
resp = urlopen(image_url)
image = np.asarray(bytearray(resp.read()), dtype="uint8")
image = cv2.imdecode(image, cv2.IMREAD_COLOR) # The image object

# Optional: For testing & viewing the image
cv2.imshow('image',image)

For Python 3 using OpenCV and Google Colab/Jupyter Notebook:

import cv2
from google.colab.patches import cv2_imshow
from urllib.request import urlopen

image_url = "IMAGE-URL-GOES-HERE"
resp = urlopen(image_url)
image = np.asarray(bytearray(resp.read()), dtype="uint8")
image = cv2.imdecode(image, cv2.IMREAD_COLOR) # The image object

# Optional: For testing & viewing the image
cv2_imshow(image)

How do I read image data from a URL in Python?

15 Answers15

Python 3

Jupyter Notebook and IPython

Linked

Related