rlb3 has asked for the wisdom of the Perl Monks concerning the following question:

Hello Monks,

I have been given a compressed file that needs some parsing. The problem is that when I try to uncompress it, the file is bigger than 2G. I know this 2G limit is part of the ext3 file system, so I created a 4G JFS filesystem, but when the file hit the 2G mark it errored out again. I get the same result with ReiserFS. I was wondering: is it possible to use a CPAN module to open, mangle, and compress to a new file without having to uncompress the whole file? Any help would be appreciated.

rlb3

Replies are listed 'Best First'.
Re: File > 2G under linux
by Abigail-II (Bishop) on Sep 11, 2003 at 10:37 UTC
    To be able to deal with files over 2G, your perl must be compiled with USE_LARGE_FILES. Under Linux, Configure enables this by default starting from version 5.6.0. See also the INSTALL file, section "Large file support".

    You may want to check the output of perl -V.
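    For example, from the command line:

        $ perl -V:uselargefiles
        uselargefiles='define';

    If it says 'define', large file support was compiled in.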

    Abigail

      I think that's the point: ext3 really does support files over 2GB, and so do the other filesystems mentioned!
Re: File > 2G under linux
by broquaint (Abbot) on Sep 11, 2003 at 10:57 UTC
    If you've got access to perl5.8.0+ then you can take advantage of the funky new IO layering system and use PerlIO::gzip. So you could do something like this:
        use PerlIO::gzip;

        open my $in_fh  => '<:gzip', 'input.gz'  or die "ack: $!";
        open my $out_fh => '>:gzip', 'output.gz' or die "ack: $!";

        while (<$in_fh>) {
            do_stuff($_) if /matches some condition/;
            print {$out_fh} $_;
        }
    See the PerlIO docs for more info on IO layers in perl5.8.0+, and PerlIO::gzip for info on the gzip layer used above.
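    The layer can also be pushed onto an already-open handle with binmode; a minimal sketch:

        binmode $in_fh, ':gzip' or die "can't push gzip layer: $!";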
    HTH

    _________
    broquaint

      Now that's cool ... makes me wish I had 5.8. ++
Re: File > 2G under linux
by edan (Curate) on Sep 11, 2003 at 10:23 UTC

    Not sure about the filesystem issues, but you could try just uncompressing the file to STDOUT, reading that line-by-line to do your parsing, and writing to a pipe that compresses. I have done this with gzip successfully (not with large files, just in general). Something like this (UNTESTED):

        open(INPUT, "/usr/bin/gzip -d -c '$filename' |");
        open(OUTPUT, "| /usr/bin/gzip > '$filename'");
        while (<INPUT>) {
            # munge
            print OUTPUT;
        }
        close INPUT;
        close OUTPUT;

    You could also look at Compress::Zlib, which might work for you...
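    For what it's worth, a minimal Compress::Zlib sketch of the same read-munge-write loop might look like this (also untested, with placeholder filenames):

        use Compress::Zlib;

        my $in  = gzopen('input.gz',  'rb') or die "gzopen read: $gzerrno";
        my $out = gzopen('output.gz', 'wb') or die "gzopen write: $gzerrno";

        my $line;
        while ($in->gzreadline($line) > 0) {
            # munge $line here
            $out->gzwrite($line);
        }

        $in->gzclose;
        $out->gzclose;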

    --
    3dan

      It was written:

          open(INPUT, "/usr/bin/gzip -d -c '$filename' |");
          open(OUTPUT, "| /usr/bin/gzip > '$filename'");

      Make sure that you use a different value for $filename on each of these calls, or you (may|will) clobber the contents of the file you are trying to read, that is, unless you are using an OS with versioned files.
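      That is, something along these lines (with hypothetical $in_filename/$out_filename in place of the single $filename):

          open(INPUT, "/usr/bin/gzip -d -c '$in_filename' |");
          open(OUTPUT, "| /usr/bin/gzip > '$out_filename'");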

      --MidLifeXis

        Quite right. Good thing I included the 'UNTESTED' disclaimer! :-) I cut and pasted some code from different places, one of which did the reading and the other the writing; I was solving a different problem than the one posed here. Good eye!

        --
        3dan
Re: File > 2G under linux
by hardburn (Abbot) on Sep 11, 2003 at 13:56 UTC

    On the IA-32 (read: Intel) architecture, all Linux filesystems starting with the 2.4 kernel series support 64-bit file sizes. Note that the plain POSIX open() only uses 32-bit file offsets, so C code has to use a different function (open64(), IIRC) to get at larger files. My guess is that your decompression program is using the older version of open(), which you should be able to fix by compiling a newer version (assuming it's one of the common Free Software compressors, like gzip or bzip2).
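    From Perl, the 64-bit open can be requested explicitly where the platform defines the flag; a minimal sketch (whether Fcntl exports O_LARGEFILE on your system, and whether perl's own file offsets are 64-bit, depends on your build):

        use Fcntl qw(O_RDONLY O_LARGEFILE);

        # 'huge_file.gz' is a placeholder name
        sysopen(my $fh, 'huge_file.gz', O_RDONLY | O_LARGEFILE)
            or die "sysopen: $!";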

    ----
    I wanted to explore how Perl's closures can be manipulated, and ended up creating an object system by accident.
    -- Schemer

    Note: All code is untested, unless otherwise stated