Reading a zipped file (win32)

spikey_wan has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: Reading a zipped file (win32) by pbeckingham (Parson) on Jun 29, 2004 at 12:57 UTC
Take a look at Compress::Zlib and Archive::ZIP. While they do not quite serve your purpose, you could wrap them with your own code to provide for your needs.	[reply]
Re: Reading a zipped file (win32) by gellyfish (Monsignor) on Jun 29, 2004 at 13:05 UTC
Well somewhere along the line something is going to have to unzip the file but if you want to have a transparent read of the zipped file without any apparent imtermediate uncompression then you might create a PerlIO 'layer' using Archive::Zip that would allow you to do something like: `open FOO , '<:zip','zipfile.zip' \|\| die "$!\n"; while(<FOO>) { # ... }` [download] The module to do this doesn't exist yet but if you read the Perl IO documentation and An example implementation then it should be quite clear what you need to do. /J\	[reply] [d/l]
Re: Reading a zipped file (win32) by spikey_wan (Scribe) on Jun 29, 2004 at 13:17 UTC
But if I could read the zip file directly, line by line, then the amount of memory or disk space required would be minimal, wouldn't it? Far less than uncompressing the file, surely? The log files are now so large that I had to give up slurping them and process them line by line, quite a while ago. Unfortunately, I am struggling with the answers I have so far, can anyone make it a bit clearer to my thick head?. Thanks, Spike.	[reply]
Re^2: Reading a zipped file (win32) by guha (Priest) on Jun 29, 2004 at 14:08 UTC
A quick glance at the docs for Archive::Zip reveals the existance of a low level routine named readChunks, which I expect will get you further in your quest. You will however have to manage chunk boundaries and line endings logic yourself. Update: The chunkSize parameter refers to the source data, ie compressed, so expecting compression ratios in the order of 95 % you will need to set this parameter sufficiently small as the inflated data can expand to ~ 20 times your requested chunksize. HTH	[reply]
Re^3: Reading a zipped file (win32) by iburrell (Chaplain) on Jun 29, 2004 at 16:34 UTC
Re^2: Reading a zipped file (win32) by gellyfish (Monsignor) on Jun 29, 2004 at 13:57 UTC
It is quite simple really, you cannot read a compressed file by the lines of its content without uncompressing it first - the compression will not preserve the lines of the original data. BY the way you describe your problem it looks like it would be best to scan the file line by line before compressing it - as has already been suggested. /J\	[reply]
Re: Reading a zipped file (win32) by spikey_wan (Scribe) on Jun 29, 2004 at 14:49 UTC
Re^4: Reading a zipped file (win32) by BrowserUk (Patriarch) on Jun 29, 2004 at 15:05 UTC
Re^2: Reading a zipped file (win32) by pboin (Deacon) on Jun 29, 2004 at 13:53 UTC
Compressed data isn't really usable -- that's the trade off you make for space. If you can't temporarily decompress data, process it and then put it back, you have problems bigger than perl technique. You need more storage or a less wasteful record in the first place. If you post more details or start a different thread, maybe we can get you fixed up in a whole other way. One that will get the answer you need at the end of the day w/o skinning this particular cat.	[reply]
Re: Reading a zipped file (win32) by borisz (Canon) on Jun 29, 2004 at 12:57 UTC
Compress:Zlib can do what I think you want to do. Boris	[reply]
Re: Reading a zipped file (win32) by gri6507 (Deacon) on Jun 29, 2004 at 12:55 UTC
What's the problem with unzipping it? You could always zip it up after you're done, leaving the environment clean.	[reply]
Re: Reading a zipped file (win32) by spikey_wan (Scribe) on Jun 29, 2004 at 13:06 UTC
If the log file is very large, and there's not much disk space left, I may run out of space when unzipping it.	[reply]