in reply to Help with regular expression - real file

This looks like a perfect opportunity to use XML::LibXML's HTML parsing mode to scrape the page. There's a very short sample snippet at "Get O'Reilly's top 25 into Template Toolkit", and I have a column on the technique as well.

Parsing HTML with regular expressions can be messy, since regexes that correctly handle angle brackets in unexpected places (inside attribute values or comments, for example) are hard to write.
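As a rough illustration of the parser approach, here's a minimal sketch that pulls links out of a page with XML::LibXML. The sample HTML, the XPath expression, and the variable names are all made up for the example, since I'm not working from your real file here. It uses load_html, which needs a reasonably recent XML::LibXML; older versions would go through a parser object instead. Note the ">" inside the title attribute, which is exactly the sort of thing that trips up a naive regex.

```perl
use strict;
use warnings;
use XML::LibXML;

# Hypothetical sample page; a real script would fetch or read the HTML first.
my $html = <<'HTML';
<html><body>
<ul>
<li><a href="/one.html" title="a > b">First</a></li>
<li><a href="/two.html">Second</a></li>
</ul>
</body></html>
HTML

# recover => 1 asks libxml2 to keep going on sloppy real-world markup;
# the suppress_* options keep its complaints off STDERR.
my $dom = XML::LibXML->load_html(
    string            => $html,
    recover           => 1,
    suppress_errors   => 1,
    suppress_warnings => 1,
);

# Collect [href, link text] pairs for every anchor that has an href.
my @links;
for my $anchor ($dom->findnodes('//a[@href]')) {
    push @links, [ $anchor->getAttribute('href'), $anchor->textContent ];
}

printf "%s => %s\n", @$_ for @links;
```

Once the page is a DOM, changing what you extract is usually just a matter of changing the XPath, rather than rewriting a pattern.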

-- Randal L. Schwartz, Perl hacker
Be sure to read my standard disclaimer if this is a reply.
