drake50 has asked for the wisdom of the Perl Monks concerning the following question:
    my $stream = HTML::TokeParser->new( \$document );

and then pulling out the tokens of type href... Or is it better to use a regex on the page, like

    while ($document =~ m/href\s*=\s*"*([^"\s]+)"*\s*>/gi) {

or

    while( $document =~ m/<a href=\"(.*?)\"/ig ) {

Or is there something totally different I should be looking at?
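For reference, a minimal sketch of the HTML::TokeParser route mentioned in the question. It assumes HTML::TokeParser (part of the HTML-Parser distribution on CPAN, not core Perl) is installed; the sub name `extract_links` and the sample HTML are made up for illustration and are not from the original post.

```perl
use strict;
use warnings;
use HTML::TokeParser;   # from the HTML-Parser distribution (assumed installed)

# Hypothetical helper: return every href found on an <a> start tag
# in the given HTML string.
sub extract_links {
    my ($document) = @_;
    my $stream = HTML::TokeParser->new( \$document )
        or die "Could not create parser";
    my @links;
    # get_tag("a") skips ahead to the next <a ...> start tag and returns
    # an array ref of the form [ $tag, \%attr, \@attrseq, $text ].
    while ( my $tag = $stream->get_tag("a") ) {
        my $href = $tag->[1]{href};
        push @links, $href if defined $href;   # skips <a name="..."> anchors
    }
    return @links;
}

# Sample input invented for this sketch.
my $document = <<'HTML';
<p><a href="http://example.com/one">one</a>
<A HREF='two.html' class=x>two</A>
<a name="anchor">no link here</a></p>
HTML

print "$_\n" for extract_links($document);
```

The second link in the sample uses single quotes around the attribute value, which a pattern that insists on `<a href="` with double quotes would not match; the parser handles both quoting styles and lowercases tag and attribute names by default.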
Replies are listed 'Best First'.

Re: extracting web links
by Corion (Patriarch) on Dec 27, 2003 at 19:03 UTC
by bart (Canon) on Dec 27, 2003 at 19:31 UTC
by drake50 (Pilgrim) on Dec 27, 2003 at 22:12 UTC
by Corion (Patriarch) on Dec 27, 2003 at 22:22 UTC
by PodMaster (Abbot) on Dec 28, 2003 at 09:53 UTC
by dominix (Deacon) on Dec 27, 2003 at 22:49 UTC
by drake50 (Pilgrim) on Dec 27, 2003 at 22:55 UTC

Re: extracting web links
by revdiablo (Prior) on Dec 27, 2003 at 23:10 UTC