in reply to HTML::Parser??

You could also try HTML::TreeBuilder. This layers ontop of HTML::Parser and allows you to query your html doc as a tree.

Just my 2p :)