http://qs1969.pair.com?node_id=313789


in reply to HTML <=> Text convertion

Nevertheless, HTML::Parser comes with an example script that extracts plain text from HTML: eg/htext.

Also, there's an easy HTML::Parser tutorial on gellyfish's own site. Its very first example extracts only the plain text.

If you want to build your own, starting from ovid's HTML::TokeParser::Simple (a simpler, more OO-style interface on top of HTML::TokeParser) looks easier to me than starting from the raw HTML::Parser.
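
For what it's worth, here is a rough sketch of what the HTML::TokeParser::Simple route might look like. The html_to_text name and the sample input are made up for illustration, and skipping script/style content and decoding entities with HTML::Entities are my own assumptions about what "plain text" should mean here, so adjust to taste:

```perl
use strict;
use warnings;
use HTML::TokeParser::Simple;
use HTML::Entities qw(decode_entities);

# Hypothetical helper (not part of any module): return the text
# content of an HTML string, ignoring <script> and <style> bodies.
sub html_to_text {
    my ($html) = @_;

    # A scalar reference makes the parser read from the string itself.
    my $parser = HTML::TokeParser::Simple->new( \$html );

    my $text = '';
    my $skip = 0;    # non-zero while inside <script> or <style>

    while ( my $token = $parser->get_token ) {
        if ( $token->is_start_tag('script') or $token->is_start_tag('style') ) {
            $skip++;
            next;
        }
        if ( $token->is_end_tag('script') or $token->is_end_tag('style') ) {
            $skip-- if $skip;
            next;
        }
        next if $skip or not $token->is_text;

        # as_is gives the raw text, so entities such as &amp; are
        # decoded here (an assumption about the desired output).
        $text .= decode_entities( $token->as_is );
    }
    return $text;
}

print html_to_text('<p>Hello, <b>world</b> &amp; friends</p>'), "\n";
```

This keeps whitespace exactly as it appears in the source, so you would probably still want to collapse runs of whitespace and add line breaks for block-level tags yourself.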