in reply to How would you extract *content* from websites?
HTML::Strip, for example?
use HTML::Strip; my $hs = HTML::Strip->new(); my $clean_text = $hs->parse( $raw_html ); $hs->eof;
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: How would you extract *content* from websites?
by Ovid (Cardinal) on Jun 17, 2005 at 18:31 UTC |