17

I am looking to create base64 inline encoded data of images for display in a table using canvases. Python generates and creates the web page dynamically. As it stands python uses the Image module to create thumbnails. After all of the thumbnails are created Python then generates base64 data of each thumbnail and puts the b64 data into hidden spans on the user's webpage. A user then clicks check marks by each thumbnail relative to their interest. They then create a pdf file containing their selected images by clicking a generate pdf button. The JavaScript using jsPDF generates the hidden span b64 data to create the image files in the pdf file and then ultimately the pdf file.

I am looking to hopefully shave down Python script execution time and minimize some disk I/O operations by generating the base64 thumbnail data in memory while the script executes.

Here is an example of what I would like to accomplish.

import os, sys
import Image
size = 128, 128
    im = Image.open("/original/image/1.jpeg")
    im.thumbnail(size)
    thumb = base64.b64encode(im)

This doesn't work sadly, get a TypeErorr -

TypeError: must be string or buffer, not instance

Any thoughts on how to accomplish this?

Martijn Pieters
  • 1,048,767
  • 296
  • 4,058
  • 3,343
0xhughes
  • 2,703
  • 4
  • 26
  • 38

4 Answers4

31

You first need to save the image again in JPEG format; using the im.tostring() method would otherwise return raw image data that no browser would recognize:

from io import BytesIO  
output = BytesIO()
im.save(output, format='JPEG')
im_data = output.getvalue()

This you can then encode to base64:

image_data = base64.b64encode(im_data)
if not isinstance(image_data, str):
    # Python 3, decode from bytes to string
    image_data = image_data.decode()
data_url = 'data:image/jpg;base64,' + image_data

Here is one I made with this method:



Unfortunately the Markdown parser doesn't let me use this as an actual image, but you can see it in action in a snippet instead:

<img src=""/>
Martijn Pieters
  • 1,048,767
  • 296
  • 4,058
  • 3,343
  • Both of the answers provided work for what I want to do, but this one was more in line and fluid with my program :) So I will go with you on this! I wish I could accept both as answers as I will be borrowing from both, but StringIO seems to work real well for me! Thanks everyone! – 0xhughes Apr 18 '13 at 15:36
3

In Python 3, you may need to use BytesIO:

from io import BytesIO

...

outputBuffer = BytesIO()
bg.save(outputBuffer, format='JPEG')
bgBase64Data = outputBuffer.getvalue()

# http://stackoverflow.com/q/16748083/2603230
return 'data:image/jpeg;base64,' + base64.b64encode(bgBase64Data).decode()
He Yifei 何一非
  • 2,592
  • 4
  • 38
  • 69
2
 thumb = base64.b64encode(im.tostring())

I think would work

Joran Beasley
  • 110,522
  • 12
  • 160
  • 179
  • Nope, that won't work, as `im.tostring()` returns the *raw* image matrix, not JPEG encoded data. – Martijn Pieters Apr 17 '13 at 17:48
  • but it should work for `Image.fromtext(b64decode(my_encoded_raw))`? or is that a lie(I didnt try it)? ... – Joran Beasley Apr 17 '13 at 17:53
  • Actually; my reading comprehension could do with polishing as well; looks like the OP indeed wants to roundtrip, not display these as data URLs. Oops. Still; I'd.think jsPDF or the canvas might want to use JPEGs. – Martijn Pieters Apr 17 '13 at 18:42
  • yeah tbh i didnt know you could just use a url like that (neat trick) – Joran Beasley Apr 17 '13 at 18:46
  • Thanks for the tips guys, I did try out the tostring() method in some testing and had some interesting results. in my various testings. I can't seem to get the b64 encoded data to work in jsPDF too well, I am going to tinker some more, i've noticed different libraries are picky with how b64 data is presented to it! – 0xhughes Apr 18 '13 at 15:34
2

I use PNG when I save to the buffer. With JPEG the numpy arrays are a bit different.

import base64
import io

import numpy as np
from PIL import Image

image_path = 'dog.jpg'

img2 = np.array(Image.open(image_path))

# Numpy -> b64
buffered = io.BytesIO()
Image.fromarray(img2).save(buffered, format="PNG")
b64image = base64.b64encode(buffered.getvalue())

# b64 -> Numpy
img = np.array(Image.open(io.BytesIO(base64.b64decode(b64image))))

print(img.shape)
np.testing.assert_almost_equal(img, img2)

Note that it will be slower.

Philippe Remy
  • 2,967
  • 4
  • 25
  • 39