I recommend looking at HTML::LinkExtractor and using Super Search on that module to see if that does what you want it to. When you get the URL, you can use LWP::Simple to fetch it, then feed that into HTML::LinkExtractor and you should be able to get all the attributes and text that you need.
If HTML::LinkExtractor doesn't quite do what you need, I'd move on to HTML::Parser if the HTML is not well-formed XML, or XML::Twig if it is. These should have the flexibility to do what you want, at the expense of a bit more code to write.
In reply to Re: Links and data extractor script
by Tanktalus
in thread Links and data extractor script
by sandal
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |