http://qs1969.pair.com?node_id=74059

abultm74 has asked for the wisdom of the Perl Monks concerning the following question:

Help! I'm working on a class project. We are caching copies of HTML files. The problem: HTML 'href's and 'src's need to be changed. Relative links need to be changed to hard links, so that all our database has to hold is HTML text, not images, etc. Anyway, there is a myriad of ways of making HTML href and img tags: No quotes, quotes, relative, relative with '..'s, leading slashes, trailing slashes, ones with 'http://', with only 'www', etc. I need to find all HTML 'href' and 'src' links and make them hard links. Any ideas? Is there a module that does this, or do I have to do a million regexps? I need some help... 'Mad Props' to anyone who can shed some light... Adam