Just a practical question. I need to retrieve the HTTP status code of a site as well as its IP address.
Since I normally need to process between 10k and 150k domains, I was wondering which method is the most efficient.
I've seen that urllib2.urlopen(site) attempts to download the entire response body for the URL. At the same time, urllib2 doesn't offer a method to convert a hostname into an IP address.
Given that I'm only interested in the HEAD part of the response, to collect information like the HTTP status code and the IP address of that specific server, what is the best way to proceed?
Should I just use the socket module directly? Thanks
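
To make the question concrete, here is a rough sketch of the kind of thing I have in mind. It is not a tested solution, and it assumes Python 3, where httplib is called http.client. The names head_status_and_ip and check_many are made up for illustration. It only handles plain HTTP on IPv4, does not follow redirects, and the worker count of 50 is an arbitrary guess.

```python
import http.client
import socket
from concurrent.futures import ThreadPoolExecutor


def head_status_and_ip(host, port=80, timeout=5.0):
    """Return (status_code, ip) for a HEAD request to http://host:port/."""
    # Resolve the name once so the reported IP is the one actually contacted.
    ip = socket.gethostbyname(host)  # IPv4 only
    conn = http.client.HTTPConnection(ip, port, timeout=timeout)
    try:
        # HEAD asks for the status line and headers only, so no body is sent.
        # The Host header carries the original name for virtual hosting.
        conn.request("HEAD", "/", headers={"Host": host})
        return conn.getresponse().status, ip
    finally:
        conn.close()


def check_many(hosts, workers=50):
    """Map each host to (status, ip), or to None if lookup or request failed."""
    hosts = list(hosts)

    def safe(host):
        try:
            return head_status_and_ip(host)
        except (OSError, http.client.HTTPException):
            # DNS failures and timeouts are OSError subclasses.
            return None

    # Threads are used because the work is network-bound.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(zip(hosts, pool.map(safe, hosts)))
```

Calling check_many(["example.com", "example.org"]) would return a dict keyed by hostname. Is something along these lines reasonable for 150k domains, or is there a more efficient approach?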