I want the src URL of an image when I parse some HTML, but I am getting back base64-encoded image data instead of a link. What am I doing wrong?
Given a url like: "http://www.amazon.com/Cheese-Plate-multi-purpose-mounting-plate/dp/B00CI06DWE/"
And a desktop user agent:
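from io import StringIO

import requests
from lxml import etree

# url is the product URL above; agent is a headers dict carrying a desktop User-Agent
page = requests.get(url, headers=agent)
page_txt = page.text
html_parser = etree.HTMLParser()
tree = etree.parse(StringIO(page_txt), html_parser)
# Select the main product image by its id
path = '//img[@id="landingImage"]'
img = tree.xpath(path)
img_src = img[0].get('src')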
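As a sanity check on what comes back, something like the following can tell an inline data URI apart from an ordinary link. This is only a rough sketch: `is_data_uri` is a name I made up for illustration, and `sample_src` is a shortened stand-in value, not the full attribute from the page.

```python
def is_data_uri(value):
    """Return True if an attribute value is an inline data: URI, not a link."""
    # The src value may start with whitespace (here, a newline), so strip first.
    return value.strip().lower().startswith("data:")


# Shortened stand-in for the kind of value that ends up in img_src.
sample_src = "\ndata:image/jpeg;base64,/9j/4AAQSkZJR"
print(is_data_uri(sample_src))                      # True
print(is_data_uri("http://example.com/image.jpg"))  # False
```

This only classifies a value; the value in question is the `img_src` produced by the requests and lxml code.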
Using that code, I'm getting back:
'\ndata:image/jpeg;base64,/9j/4AAQSkZJR' (truncated)
when I want: