I'm working on a project of web data mining to extract information directly from HTML by crawling server pages. My effort is concentrated only in an specific website which has a java web server, with caucho resin installed.
Parameters are passed by value pairs in url, like www.xxxxxx.com/jm/search?act=see&id=909&...
I have decoded many parameters by try but of course, results are comming very slowly.
My question is... do you Java Gurus know how to get all valid parameters of this kind of server? it is possible?
I don't have access to server and I don't know nothing about caucho resin, I'm coding an utility in Java to do the job.