in reply to web spider for searching multiple sites

Use the obvious modules (LWP or WWW::Mechanize) to fetch the content, then craft a plugin system (one plugin per site) to deal with the content of the fetched sites.

Make sure the plugin API deals efficiently with the terms you're most concerned about in the pages to be scraped.
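A rough sketch of what I mean, not a finished design. Everything named here is made up for illustration: the Plugin::ExampleSite package, the handles/extract method pair, the example.com URL pattern, and the helper subs. The only real API used is LWP::UserAgent (new, get, is_success, status_line, decoded_content). Each plugin says which URLs it handles and counts the terms you care about in that site's pages.

```perl
use strict;
use warnings;

# Hypothetical plugin for one site. Every plugin exposes the same two
# class methods: handles($url) and extract($html, \@terms).
package Plugin::ExampleSite;

sub handles {
    my ($class, $url) = @_;
    return $url =~ m{^https?://(?:www\.)?example\.com/}i;
}

sub extract {
    my ($class, $html, $terms) = @_;
    (my $text = $html) =~ s/<[^>]+>/ /g;    # crude tag stripping, sketch only
    my %count;
    for my $term (@$terms) {
        # Count case-insensitive occurrences of the literal term.
        my $n = () = $text =~ /\Q$term\E/gi;
        $count{$term} = $n;
    }
    return \%count;
}

package main;

# Assumed registry: add one package name per site you want to scrape.
my @PLUGINS = ('Plugin::ExampleSite');

# Return the first plugin that claims the URL, or nothing.
sub plugin_for {
    my ($url) = @_;
    for my $plugin (@PLUGINS) {
        return $plugin if $plugin->handles($url);
    }
    return;
}

# Fetch a page with LWP. The module is loaded lazily so the plugin
# code can be exercised offline without LWP installed.
sub fetch {
    my ($url) = @_;
    require LWP::UserAgent;
    my $res = LWP::UserAgent->new(timeout => 10)->get($url);
    die "GET $url failed: ", $res->status_line, "\n" unless $res->is_success;
    return $res->decoded_content;
}

# Dispatch a URL to its plugin. Pass $html to skip the network fetch.
sub search_site {
    my ($url, $terms, $html) = @_;
    my $plugin = plugin_for($url) or return;
    $html = fetch($url) unless defined $html;
    return $plugin->extract($html, $terms);
}
```

Calling search_site($url, \@terms) fetches the page and returns a hashref of term counts, or nothing if no plugin claims the URL. For real pages you would likely swap the regex tag stripping for a proper HTML parser inside each plugin; that is the part that differs from site to site.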

-David
