in reply to how to remove html tags

HTML::FormatText does a nice job if the HTML is not too complicated and you'd like some plain text formatting:

use strict; use HTML::FormatText; use HTML::TreeBuilder; my $tree = HTML::TreeBuilder->new(); $tree->parse("<H1>hello</H1>"); my $formatter = HTML::FormatText->new(); print $formatter->format($tree);