in reply to Extract and modify IMG SRC tags in an HTML document.
(I've seen few HTML docs that are over 100000 bytes in size)open FILE,"filename"; read FILE,$file,100000; close FILE;
@lines = split(/\<IMG/,$file);
$newfile = shift @lines;
5. Do whatever with the $newfile:foreach $line (@lines) { $pos = index($line,'>'); $tag = substr($line,0,$pos+1); $restofline = substr($line,$pos+1); $tag =~ s/SRC\=\"(.*?)\"/SRC\=\"$newurls{$1}\"/gi; $newfile .= $tag . $restofline . "\n"; }
Complete code: open FILE,"filename"; read FILE,$file,100000; close FILE; @lines = split(/\<IMG/,$file); $newfile = shift @lines; foreach $line (@lines) { $pos = index($line,'>'); $tag = substr($line,0,$pos+1); $restofline = substr($line,$pos+1); $tag =~ s/SRC\=\"(.*?)\"/SRC\=\"$newurls{$1}\"/gi; $newfile .= $tag . $restofline . "\n"; } %print $newfile;
|
|---|