I want to scrape a Heritrix home page using pythons requests module. When I try to open this page on chrome, I get the error:
This server could not prove that it is 10.100.121.41; its security
certificate is not trusted by your computer's operating system. This
may be caused by a misconfiguration or an attacker intercepting your
connection.
But I can proceed to the page. When I tried to scrape the same page using requests, I got SSL error and after a bit of digging up, I used the following code from a SO question: r=requests.get(url,auth=(username, password),verify=False
. That is giving me the following warning /usr/lib/python2.6/site-packages/requests/packages/urllib3/connectionpool.py:734: InsecureRequestWarning: Unverified HTTPS request is being made. Adding certificate verification is strongly advised. See: https://urllib3.readthedocs.org/en/latest/security.html
, and the returning status code of 401. How to solve this problem?