in reply to HTML <=> Text convertion
Nevertheless, HTML::Parser comes with an example script that extracts plain text from HTML: eg/htext.
Two: there's an easy HTML::Parser tutorial on gellyfish's own site. The very first example extracts only the pure text.
If you want to build your own, starting out from ovid's HTML::TokeParser::Simple, a simpler, more OO-style interface on top of HTML::TokeParser, looks easier to me, than from the raw HTML::Parser.
In Section Seekers of Perl Wisdom