in reply to WWW::mechanize not obeying rules
well, the robots.txt forbids to access help/policies/, so a search-engine-bot will not add those pages in its index.
but your favorite web-browser is able to access that pages, right? that's why there is no reason that WWW::Mechanize (or similar stuff like LWP::Simple) can't poll those pages too.