0

I have the following function using selenium that works - but I am unable to pass it on. I get the error: InvalidSchema: No connection adapters were found for ' (full image link below - it works when I stick in a browser tab) when it gets to the response.get() portion of the code.

Is there a way to bypass opening and saving the image? I just want to be able to upload the full picture of Barry O to Twitter without saving to my local machine. Or do I have to download, save, and re-upload for my "push to Twitter" function? Am I missing a decoding here? Or mis-using the request.get()? If I don't have to use requests, that's fine by me!

import requests
import base64
from selenium import webdriver
from selenium.webdriver.common.keys import Keys

# test image for scrape
testing = "Barack Obama"

# Function to scrape picture ----
def pic_scrape(plain_name):

    # selenium setup
    driver = webdriver.Chrome(executable_path = r"/Users/username/Documents/chromedriver")
    driver.get('https://www.google.com/imghp?hl=en&authuser=0&ogbl')
    search_box = driver.find_element_by_xpath("//input[@name='q']")

    # search test term ('testing' variable above)
    search_box.send_keys(str(plain_name), Keys.ENTER)

    # finds elements - all works
    elements = driver.find_element_by_class_name('rg_i')
    elements.click()
    element = driver.find_element_by_class_name('v4dQwb')
    celeb_pic = element.find_element_by_class_name('n3VNCb')
    pic_url = celeb_pic.get_attribute("src")
    driver.close()

    # where the code fails
    response = requests.get(pic_url)
    print(response)

    return response

# Main function ----
if __name__ == '__main__':
    pic_scrape(testing)

Full error for requests.get() with image coding (image url works, if put in browser search):

InvalidSchema: No connection adapters were found for ''
papelr
  • 468
  • 1
  • 11
  • 42

1 Answers1

0

The response is not a URL its a base64 encode for MIME data:image/jpeg, so you cannot use it for request.get().

import base64
imgdata = base64.b64decode(src.split(",")[1])
filename = 'some_image.jpg'  # I assume you have a way of picking unique filenames
with open(filename, 'wb') as f:
    f.write(imgdata)

if you want to save the image you could use something like the above code

PDHide
  • 18,113
  • 2
  • 31
  • 46
  • Is there a way I can just pass it on, without saving? I just want to scrape and then upload directly to Twitter. Rather not save it – papelr Mar 31 '21 at 02:09
  • Img data will have the data you want , try returning imgdata – PDHide Mar 31 '21 at 02:16
  • Was able to return the `imgdata` into my push function - but question - would that return the actual picture, or will I have to convert it to an actual image for upload? – papelr Mar 31 '21 at 13:19
  • it depends on what is your endpoint expects – PDHide Mar 31 '21 at 13:24
  • Guessing, from this post: https://stackoverflow.com/questions/31748444/how-to-update-twitter-status-with-image-using-image-url-in-tweepy - that I *must* download the file and then remove it. Sad, but oh well – papelr Mar 31 '21 at 16:34