in reply to Foreign directory search with LWP part 2?

They use a program called a spider that looks for a file named "robots.txt" in the root directory of the specified (registered) domain. So they try to get "http://www.perlmonks.org/robots.txt". This file specifies what the spider may read and which directories it may search. Even then, that alone is not enough: the server still has to grant access.
You can read about this in the help sections of these search engines, which describe what such a file must and can look like and how it works.
CPAN has modules for this, for example WWW::RobotRules and LWP::RobotUA.
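
To make the robots.txt check concrete, here is a minimal sketch using WWW::RobotRules from CPAN. The robot name "ExampleBot/1.0" and the robots.txt content are made up for illustration; they are not what perlmonks.org actually serves. The file is parsed from a string, so nothing is fetched over the network.

```perl
use strict;
use warnings;
use WWW::RobotRules;

# Hypothetical robot name, for illustration only.
my $rules = WWW::RobotRules->new('ExampleBot/1.0');

# Assumed robots.txt content; a real spider would download it first.
my $robots_url = 'http://www.perlmonks.org/robots.txt';
my $robots_txt = <<'EOT';
User-agent: *
Disallow: /private/
EOT

# parse() takes the URL the file came from and the file's content.
$rules->parse($robots_url, $robots_txt);

# Ask whether each URL may be fetched under the parsed rules.
for my $url ('http://www.perlmonks.org/index.html',
             'http://www.perlmonks.org/private/notes.html') {
    my $verdict = $rules->allowed($url) ? 'allowed' : 'disallowed';
    print "$url: $verdict\n";
}
```

If you would rather not do the check by hand, LWP::RobotUA is a drop-in LWP::UserAgent subclass that fetches and obeys robots.txt for you. Either way, this only tells you what a polite robot may request; it does not give you a directory listing the server does not offer.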

Have a nice day
All decision is left to your taste