a. I am using a simple URL crawler (from "How do I make a simple crawler in PHP?") on xyz.com/items/advsearch. That page lists the results of an advanced search (in my case, all results), and I need to scrape those results. The problem is the "Next>" link: its URL is encoded, and when the crawler follows it, I land back on the main Advanced Search page, which shows 0 results.
b. Another thing I noticed: when I browse the site manually, the URL of "Next>" does not contain jsessionid as a parameter, but when I fetch the page's HTML with file_get_contents(), it does. Why is that?
I am finding it quite difficult to deal with the encoded URLs and the session handling, and as a result I cannot crawl past the first page. Urgent help needed.
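
For reference, here is a rough sketch of what I am thinking of trying next. It rests on two guesses that I have not confirmed: that the site tracks the search session with a cookie (and only falls back to putting jsessionid in the URL when the client sends no cookie), and that the "encoded" part of the "Next>" URL is just HTML entities such as `&amp;`. The function names are made up for illustration, and extracting the "Next>" href from the HTML is left to the existing crawler.

```php
<?php
// Sketch only. Assumptions (unconfirmed): the session lives in a cookie,
// and the "Next>" href only needs its HTML entities decoded.

/** Decode HTML entities (e.g. &amp;) in an href copied from raw HTML. */
function decodeHref($href)
{
    return html_entity_decode($href, ENT_QUOTES, 'UTF-8');
}

/** Create one cURL handle that keeps cookies in memory between requests. */
function newSessionHandle()
{
    $ch = curl_init();
    curl_setopt_array($ch, array(
        CURLOPT_RETURNTRANSFER => true, // return the body instead of printing it
        CURLOPT_FOLLOWLOCATION => true, // follow redirects
        CURLOPT_COOKIEFILE     => '',   // empty string turns on the cookie engine
    ));
    return $ch;
}

/** Fetch a page with the shared handle so cookies from earlier pages are re-sent. */
function fetchPage($ch, $url)
{
    curl_setopt($ch, CURLOPT_URL, $url);
    return curl_exec($ch); // HTML string, or false on failure
}

// Intended usage (placeholder URLs):
// $ch    = newSessionHandle();
// $first = fetchPage($ch, 'http://xyz.com/items/advsearch');
// ... extract the raw "Next>" href from $first into $nextHref ...
// $next  = fetchPage($ch, decodeHref($nextHref));
```

The idea is to reuse a single handle for every request, so that whatever cookie the first page sets is sent again when requesting "Next>". I do not know yet whether this is actually the cause of the 0 results.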