in reply to Parsing HTML

When faced with this kind of task, a lot of Perl coders: At least, that's how me and my co-workers did it once :)

So I'd suggest looking at HTML::Parser or one of its relatives. I used HTML::TreeBuilder to parse some quite large and unreliable HTML files, and it worked great. The tricky bit with HTML::Parser itself is learning to code in the callback style it requires, but you can get lots of help on that here once you've started.
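
To give a feel for the callback style, here's a minimal sketch using HTML::Parser's version 3 API. The sample HTML string and the variable names (@links, $title, $in_title) are made up for illustration, so treat it as a starting point rather than the one right way to do it. You register a handler for start tags, end tags and text, and the parser calls them as it works through the document:

```perl
use strict;
use warnings;
use HTML::Parser ();

# Illustrative state: collect every <a href> and the text inside <title>.
my @links;
my $title    = '';
my $in_title = 0;

my $parser = HTML::Parser->new(
    api_version => 3,

    # Called for each start tag; the argspec string picks what gets passed in.
    start_h => [ sub {
        my ($tagname, $attr) = @_;
        $in_title = 1 if $tagname eq 'title';
        push @links, $attr->{href}
            if $tagname eq 'a' && defined $attr->{href};
    }, 'tagname, attr' ],

    # Called for each end tag.
    end_h => [ sub {
        my ($tagname) = @_;
        $in_title = 0 if $tagname eq 'title';
    }, 'tagname' ],

    # Called for text; 'dtext' is the text with entities decoded.
    text_h => [ sub {
        my ($text) = @_;
        $title .= $text if $in_title;
    }, 'dtext' ],
);

# Made-up, deliberately sloppy input: unclosed <a> and <p>, no </html>.
my $html = '<html><head><title>Demo</title></head>'
         . '<body><p>Unclosed <a href="/one">one<a href="/two">two</body>';

$parser->parse($html);
$parser->eof;    # signal end of document so buffered text is flushed

print "title: $title\n";
print "link: $_\n" for @links;
```

The handlers keep their own state (here just a flag for "inside title"), which is the part that takes getting used to. If you'd rather not track state yourself, HTML::TreeBuilder builds the whole tree for you to walk afterwards.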