in reply to Should I use; Html Parser, table extract, Extractor

As GrandFather advised, I would also use HTML::TreeBuilder. Now for the   thing, I take a slightly different approach for removing non-breaking spaces, which I explained in Re: How do I create non-breaking space (  ) with HTML::Element? :
my $NBSP = HTML::Entities::decode_entities(' '); $foo =~ s/$NBSP//g ;
Hope this helps.

Gu