my $stream = HTML::TokeParser->new( \$document ); and then pulling out type href... Or better to use regex on the page like while ($document =~ m/href\s*=\s*"*([^"\s]+)"*\s*>/gi) { or while( $document =~ m/