search.cpan.org is your friend. Try the HTML::LinkExtor and/or HTML::LinkExtractor modules. Both modules let you specify a callback that can be used to filter the extracted links.
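As a rough sketch of the callback approach with HTML::LinkExtor: passing a base URL as the second argument to `new` should make the module hand back absolute links, and the callback can discard whatever tags you do not want. The base URL, the HTML snippet, and the name `collect_anchor_links` are made up for illustration; they are not from the original question.

```perl
use strict;
use warnings;
use HTML::LinkExtor;

# Hypothetical base URL and HTML snippet, for illustration only.
my $base = 'http://www.example.com/docs/';
my $html = <<'HTML';
<a href="page.html">relative</a>
<a href="/top.html">root-relative</a>
<img src="logo.png">
<a href="http://other.example.org/">absolute</a>
HTML

my @links;

# The callback receives the tag name followed by attribute/value pairs
# for the link attributes found on that tag.
sub collect_anchor_links {
    my ($tag, %attr) = @_;
    return unless $tag eq 'a';      # filter: keep only <a> tags, skip <img> etc.
    # With a base URL supplied, the value is a URI object; stringify it.
    push @links, "$attr{href}" if defined $attr{href};
}

my $parser = HTML::LinkExtor->new(\&collect_anchor_links, $base);
$parser->parse($html);
$parser->eof;

print "$_\n" for @links;
```

Without the base URL argument the links come back exactly as written in the HTML, so relative links stay relative. To fetch a live page instead of a string, you would feed the parser the content retrieved with something like LWP::UserAgent.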
Additionally, this (or a very similar) question has already been asked on perlmonks. Please use the search feature to find some of the archived threads and responses.
Regards,
Mahesh
In reply to Re: Extracting full links from HTML by smahesh, in thread Extracting full links from HTML by wojtyk