in reply to Extract Web Page

Something else you're probably going to want to look at is HTML::TokeParser. You can try to parse a web page using regexes; I know because I've done it, and it's neither easy nor pretty. HTML::TokeParser breaks the page into tokens (start tags, end tags, text) for you, which makes the job much faster and easier.
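To give a rough idea of what that looks like, here's a minimal sketch that pulls the title and the links out of a page. The HTML is a made-up sample inlined in the script (in practice you'd fetch the page first, e.g. with LWP), and the variable names are just for illustration.

```perl
use strict;
use warnings;
use HTML::TokeParser;

# Hypothetical sample page, inlined so the sketch runs without a network fetch.
my $html = <<'HTML';
<html><head><title>Sample Page</title></head>
<body>
<a href="http://example.com/one">First link</a>
<a href="/two">Second <b>link</b></a>
</body></html>
HTML

# Passing a scalar reference makes the parser read from the string
# instead of treating the argument as a file name.
my $p = HTML::TokeParser->new(\$html)
    or die "Could not create parser";

# Skip ahead to <title> and grab the text up to </title>.
# This assumes the page has a title; if it doesn't, get_tag runs to the
# end of the document and the link loop below finds nothing.
my $title = '';
if ($p->get_tag('title')) {
    $title = $p->get_trimmed_text('/title');
}

# Walk every <a> start tag. get_tag returns an array ref whose second
# element is a hash of the tag's attributes.
my @links;
while (my $tag = $p->get_tag('a')) {
    my $href = $tag->[1]{href};
    next unless defined $href;                 # skip anchors with no href
    my $text = $p->get_trimmed_text('/a');     # link text, nested tags skipped
    push @links, [$href, $text];
}

print "Title: $title\n";
print "$_->[0]\t$_->[1]\n" for @links;
```

Note there isn't a single regex in there, and nested markup inside the link text (like the `<b>` above) doesn't trip it up.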
Useless trivia: In the 2004 Las Vegas phone book there are approximately 28 pages of ads for massage, but almost 200 for lawyers.