While I know how to extract contents of a website by URLConnection
and BufferedReader
and get its source code, sometimes a website is itself getting data from elsewhere and showing onto the page.
e.g. I am now working on this page http://bet.hkjc.com/marksix/userinfo.aspx?file=lucky_ocbs.asp&lang=en
and the 10 branches name and other details in the table in the page is not in the source code of the page.
Question:
Instead of extracting data from source code, is there any way to extract wordings simply from the final text showing in a page? If yes, how could it be done?
Thanks a lot.