0

I had built a web crawler in Perl.

I am using

HTML::ContentExtractor

LWP::UserAgent 

HTML::LinkExtor

to extract text form webpages.

Reference link for sample code web cralwer perl

Issue:

The issue is that it does not get text from web pages that have the extension as .aspx . It works perfectly for other webpages.I could not figure out the issue why this crawler fails for aspx pages.

Community
  • 1
  • 1
Nagaraju
  • 1,853
  • 2
  • 27
  • 46

1 Answers1

1

To be able to access a process a website with JavaScript, use WWW::Mechanize::Firefox

Miller
  • 34,962
  • 4
  • 39
  • 60