note
bart
Nevertheless, [cpan://HTML::Parser] comes with an example script that extracts plain text from HTML: [http://search.cpan.org/src/GAAS/HTML-Parser-3.34/eg/htext|eg/htext].
<P>Two: there's an easy [http://www.gellyfish.com/htexamples/|HTML::Parser tutorial] on [gellyfish]'s own site. The very first example extracts only the pure text.
<P>If you want to build your own, starting out from [ovid]'s [cpan://HTML::TokeParser::Simple], a simpler, more OO-style interface on top of [cpan://HTML::TokeParser], looks easier to me, than from the raw [cpan://HTML::Parser].
313685
313685