This is my code to get a web page's image's URLs
for some webpage, it works very well, while it' dosen't work for some web page
this is my code: #!/usr/bin/python
import urllib2
import re
#bufOne = urllib2.urlopen(r"http://vgirl.weibo.com/5show/user.php?fid=17262", timeout=4).read()
bufTwo = urllib2.urlopen(r"http://541626.com/pages/38307", timeout=4).read()
jpgRule = re.findall(r'http://[\w/]*?jpg', bufOne, re.IGNORECASE)
jpgRule = re.findall(r'http://[\w/]*?jpg', bufTwo, re.IGNORECASE)
print jpgRule
bufOne
work well, but bufTwo
didn't work. so how to write a ruler for it to make bufTwo work well?