in reply to substitution of illegal chars in filename
But trying to maintain the linkages among the href's inside each file is a bit more challenging; jeffa's reply has the basic approach: convert all the wget-assigned file names to sensible names first (making sure to avoid collisions), rename the files, and keep the old-new relations in a hash; then, for each file in the harvest, replace all occurrences of a wget-style (cgi-based) file name string with the corresponding sensible name. Tedious, but not so difficult.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Re: substitution of illegal chars in filename
by lahf (Initiate) on Oct 10, 2003 at 14:06 UTC |