in reply to HTML::Parser question

If you just want to extract the text from a Website HTML-Strip could be interessting too.

AddOn:

#!/usr/bin/perl use strict; use warnings; use LWP::Simple; use HTML::Strip; my $hs = HTML::Strip->new(); print $hs->parse( get('http://www.perlmonks.org/') ); $hs->eof;

Replies are listed 'Best First'.
Re: HTML::Parser question
by mkurtis (Scribe) on Mar 07, 2004 at 22:48 UTC
    I tried your code but noticed that it doesnt get all of yahoo's content,just the main part with the directory and none of it is tabbed box text,(where the news is). I dont understand why it wouldnt extract all the text, however it is not combining words anymore, thanks for that, there is no more docs about it on cpan, ill try and find others, im using your exact code except i changed perlmonks to yahoo.

    Thanks