I'm writing a webcrawler using Python and enjoying it a lot! But I've noticed some differences between the result produced by urlopen(url).read()
on Python and by curl
on terminal. I tried to install the pycurl module with no success. Is there a simple way to produce the CURL result on Python?
UPDATE
In this case I parsed this URL. I passed the same headers on both requests User-Agent: Mozilla/5.0
. Here are the outputs:
- cURL output: http://pastebin.com/PmmNhsbba
- urlopen output: http://pastebin.com/7Wrt8pQZ