in reply to Reading gnu mailman archives

This works for me  (your sample file seems to be double-compressed, so it should more appropriately be named .txt.gz.gz):

use IO::Uncompress::Gunzip qw(gunzip $GunzipError) ; gunzip "2010-January.txt.gz" => "2010-January.tmp" or die "gunzip fail +ed: $GunzipError\n"; gunzip "2010-January.tmp" => "2010-January.txt" or die "gunzip fail +ed: $GunzipError\n";

P.S. it's not XML, but rather classic mbox format.

Replies are listed 'Best First'.
Re^2: Reading gnu mailman archives
by cormanaz (Deacon) on Jan 25, 2010 at 22:28 UTC
    How strange. When I run the same code and open the resulting file in a text editor I get:

    ‹†]Kÿ/var/lib/mailman/archives/private/mythtv-users/2010-January.txt

    (some of the chars are rendering as excaped html here; they're extended ascii on my screen) You're getting clear text?