I've just started screen scraping using BeautifulSoup in Python 2.7.2, and I want to get data off of this website:
http://www.trainsimple.com/SelfPaced.aspx
However, using urllib2 to open this URL and using LXML to parse it yields a lot of garbage if I use the .prettify() function.
After viewing the page source, I see that the page is actually rendered using javascript and that the div's are loaded dynamically.
Does anyone have any idea of how to get the data from this website?
Thank you