dutchman70 has asked for the wisdom of the Perl Monks concerning the following question:

Hi, I've been playing around with WWW::Spyder, which seems to be a nice module and does most of what I need. Unfortunately, I don't see a way to analyze the URL to be crawled before the spider crawls it, which makes it hard to control the crawling process. Below is the while loop I'm using. Does anyone have a suggestion, or is there a better module to use?

    while ( my $page = $spyder->crawl ) {
        # process page...
    }

Replies are listed 'Best First'.
Re: WWW::Spyder check url before loading it?
by Anonymous Monk on Feb 15, 2010 at 01:46 UTC
      Anonymous, thanks very much for the code - exactly what I was looking for...! Thanks!
      for example if the url is part of the docroot.
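
For readers wondering what a "check the URL before loading it" test might look like, here is a minimal sketch. It is not the code from the reply above, and it does not use any WWW::Spyder hook. The helper name url_in_docroot and the example.com URLs are made up for illustration. The sketch assumes the URI module is installed (it comes along with LWP) and that "part of the docroot" means same scheme, same host and port, and a path under the docroot's path.

```perl
use strict;
use warnings;
use URI;

# Hypothetical helper (not part of WWW::Spyder): returns 1 if $url lives
# under $docroot, meaning same scheme, same host:port, and a path that
# starts with the docroot's path. Returns 0 otherwise.
sub url_in_docroot {
    my ( $url, $docroot ) = @_;

    my $u = URI->new($url)->canonical;
    my $r = URI->new($docroot)->canonical;

    # Only server-based URLs (http, https, ...) have a host to compare;
    # this rejects things like mailto: links and relative URLs.
    return 0 unless $u->can('host_port') && $r->can('host_port');
    return 0 unless ( $u->scheme // '' ) eq ( $r->scheme // '' );
    return 0 unless ( $u->host_port // '' ) eq ( $r->host_port // '' );

    # Compare on whole path segments so that /docs does not match /docs-old.
    my $root_path = $r->path;
    $root_path .= '/' unless $root_path =~ m{/\z};

    my $path = $u->path;
    $path = '/' if $path eq '';

    return 1 if "$path/" eq $root_path;    # the docroot itself, no trailing slash
    return index( $path, $root_path ) == 0 ? 1 : 0;
}

# Usage: filter candidate links before handing them to whatever fetches them.
my $docroot    = 'http://example.com/docs/';
my @candidates = (
    'http://example.com/docs/intro.html',
    'http://example.com/blog/post.html',
    'http://other.example.org/docs/intro.html',
);
for my $link (@candidates) {
    my $verdict = url_in_docroot( $link, $docroot ) ? 'crawl' : 'skip';
    print "$verdict $link\n";
}
```

How such a predicate gets wired into the crawl loop depends on the spider module and its version, so that part is left out here.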