Hi,
I'd like to combine the files of each type into a single compressed tar file per type. There are about 5 types: one has roughly 1 million files, and the other 4 have roughly 300k to 500k files each.
Yes, the files need to be deleted afterwards to make disk space for incoming.
Colin
If there is a process adding new files to the directory while your "tar up" script is running, then you need to face the twin issues of deleting files which haven't been put in the tar file, and putting empty or half-written files into the tarball. If possible, you need to be able to stop the process from adding any new files while the script is running; but if you can't, then the following should be safe.
Use the script I gave you above to, for example, move all files starting with 'b' into a b/ subdirectory. Then wait a few minutes, or however long it could reasonably take for the process to finish writing the current file, then from the command line, simply:
$ tar -czf .../some-path/b.tar.gz b/
$ tar -tzf .../some-path/b.tar.gz > /tmp/foo
View /tmp/foo in a text editor to see if it looks reasonable, then
$ rm -rf b/
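With hundreds of thousands of entries, eyeballing the listing may miss a short archive, so a count comparison before the rm step is a possible extra check. This is only a rough sketch: verify_tarball is a made-up helper name, it assumes b/ is a flat directory of regular files, and it assumes no filename contains a newline.

```shell
# Hypothetical helper (not from the thread): succeed only if the tarball
# lists as many non-directory members as there are regular files on disk.
verify_tarball() {
    dir=$1
    tarball=$2
    # Count regular files on disk (assumes no newlines in filenames).
    on_disk=$(find "$dir" -type f | wc -l | tr -d ' ')
    # Count archive members; directory entries end in '/', so skip those.
    in_tar=$(tar -tzf "$tarball" | grep -c -v '/$')
    [ "$on_disk" -eq "$in_tar" ]
}

# Possible usage, run from the parent of b/ with the real tarball path:
#   verify_tarball b/ b.tar.gz && rm -rf b/
```

A matching count does not prove every file was complete when it was archived, so the waiting period before running tar still matters.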
If the rm fails due to too many files, then write another perl script similar to the one above, but using 'unlink' to remove each file one by one.
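If you would rather not write the Perl for that, a shell alternative is find, which walks the directory itself and so never builds one huge argument list. Note this is a swapped-in technique, not the unlink script described above, and remove_dir_one_by_one is just an illustrative name. It assumes b/ contains only regular files and no subdirectories.

```shell
# Hypothetical helper (not from the thread): empty a directory of regular
# files, then remove the directory itself.
remove_dir_one_by_one() {
    dir=$1
    # find passes the names to rm in batches sized to fit the system's
    # argument-length limit, so the file count does not matter.
    find "$dir" -type f -exec rm -f {} +
    # rmdir only succeeds on an empty directory, so it doubles as a check
    # that nothing was left behind.
    rmdir "$dir"
}

# Possible usage:
#   remove_dir_one_by_one b
```

If rmdir complains that the directory is not empty, something other than regular files is in there (or the writer process has added more), and it is worth looking before deleting anything else.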
Dave.