in reply to Newbie regex question

extract text from a file (which holds the content of a web page)

You probably don't really want to parse HTML with regexes, at least as a matter of habit. Instead, use something suited to handling HTML, like HTML::TreeBuilder or HTML::TokeParser::Simple. Other good advice may be found by Super Searching for "don't parse HTML regex".

HTH,

planetscape