in reply to RE: Extract and modify IMG SRC tags in an HTML document. in thread Extract and modify IMG SRC tags in an HTML document.
This is a very old thread, but it's really important for me...
I need to modify the href and src attributes so that they go through a proxy,
but the problem is that the links come in several different forms:
href="/page.html"
href="../page.html"
href="http://url"
href=url
...
src="/abc.jpg"....
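One possible way to deal with the different forms is to resolve every link against the URL the page was fetched from before doing anything else. Below is a minimal sketch in Python using the standard `urllib.parse.urljoin` (Perl's `URI->new_abs` does the same job). The base URL and the `to_absolute` helper are made up for illustration; they are not from the original post.

```python
from urllib.parse import urljoin

# Assumption: the page was fetched from this address (hypothetical example).
BASE = "http://www.yahoo.com/news/index.html"

def to_absolute(link, base=BASE):
    """Resolve a root-relative, relative, or absolute link against base."""
    return urljoin(base, link.strip())

if __name__ == "__main__":
    # The link forms listed above, plus a plain relative file name.
    for link in ["/page.html", "../page.html", "http://url", "page.html"]:
        print(link, "->", to_absolute(link))
```

Once every link is absolute, the proxy prefix can be put in front of all of them the same way, whatever form they had in the page.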
What I am doing is downloading the page source using Lynx.
Then I have to rewrite every link so that it points at the proxy; for example, a link to www.yahoo.com becomes
www.abc.com/cgi-bin/proxy.pl?http://www.yahoo.com
Clicking any link on that page then downloads the next page's source, and the process repeats in the same way.
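To make the rewriting step concrete, here is a rough sketch in Python (the same idea carries over to Perl). It is only an illustration under some assumptions: the `http://` scheme on the proxy address, the `rewrite_links` name, and the choice to leave fragment, `mailto:` and `javascript:` links alone are mine, not from the post. It uses a regular expression to find `href` and `src` values, which is a simplification; a real HTML parser would be more robust against unusual markup.

```python
import re
from urllib.parse import urljoin

# Proxy prefix from the post; the "http://" scheme is an assumption.
PROXY = "http://www.abc.com/cgi-bin/proxy.pl?"

# Matches href=... or src=... with a double-quoted, single-quoted,
# or unquoted value.
ATTR_RE = re.compile(
    r'''(?P<attr>\b(?:href|src)\s*=\s*)'''
    r'''(?:"(?P<dq>[^"]*)"|'(?P<sq>[^']*)'|(?P<bare>[^\s>]+))''',
    re.IGNORECASE,
)

# Links that should not be sent through the proxy (assumed policy).
SKIP_PREFIXES = ("#", "mailto:", "javascript:")

def rewrite_links(html, base_url, proxy=PROXY):
    """Return html with every href/src made absolute and proxy-prefixed."""
    def repl(match):
        value = match.group("dq") or match.group("sq") or match.group("bare") or ""
        value = value.strip()
        if value.lower().startswith(SKIP_PREFIXES):
            return match.group(0)  # leave the attribute untouched
        absolute = urljoin(base_url, value)
        return f'{match.group("attr")}"{proxy}{absolute}"'
    return ATTR_RE.sub(repl, html)

if __name__ == "__main__":
    page = '<a href="/page.html">a</a> <a href=../page.html>b</a> <img src="/abc.jpg">'
    print(rewrite_links(page, "http://www.yahoo.com/news/index.html"))
```

The target URL is appended unencoded here to match the format shown above. If the target has its own query string, it may need to be URL-encoded, depending on how proxy.pl reads its argument.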