in reply to Problems with cruft-finding script

You need to read the docs a little closer, especially WHAT'S A LINK-type tag

As a general strategy, when in doubt, Dumper :)

use HTML::LinkExtractor; use Data::Dumper; my $input = q{ <blockquote cite="http://crazyinsomniac.perlmonk.org/index2.html"> Now that's some goood feedass (ass back it's all good) %') </blockquote> If <a href="http://perl.com/"> I am a LINK!!! </a> <IMG SRC="YODAYODAYODAYODAYODAYODAYODAYODAYOD.png" ALT="It's YODA +!"> }; my $LX = new HTML::LinkExtractor(); $LX->parse(\$input); print Dumper($LX->links); __END__ $VAR1 = [ { 'cite' => 'http://crazyinsomniac.perlmonk.org/index2.html' +, '_TEXT' => '<blockquote cite="http://crazyinsomniac.perlmo +nk.org/index2.html"> Now that\'s some goood feedass (ass back it\'s all good) %\') </blockquote>', 'tag' => 'blockquote' }, { '_TEXT' => '<a href="http://perl.com/"> I am a LINK!!! </a +>', 'href' => 'http://perl.com/', 'tag' => 'a' }, { 'alt' => 'It\'s YODA!', 'src' => 'YODAYODAYODAYODAYODAYODAYODAYODAYOD.png', 'tag' => 'img' } ];

MJD says "you can't just make shit up and expect the computer to know what you mean, retardo!"
I run a Win32 PPM repository for perl 5.6.x and 5.8.x -- I take requests (README).
** The third rule of perl club is a statement of fact: pod is sexy.

Replies are listed 'Best First'.
Re: Re: Problems with cruft-finding script
by aarestad (Sexton) on Dec 10, 2003 at 18:12 UTC
    Ah, from the author himself. :) Yes, you're right - I thought I read the docs carefully, but I missed that. :( So I suppose a good way to get the IMG SRCs is:

    foreach(@{$LX->links}) { if ($$_{tag} eq 'img') { push @allLinks, $URI->new_abs($$_{src}, $base); } # do other stuff }
    Being a bit of a Perl n00bie, I did not think to use Data::Dumper. Thanks for expanding my mind. :)