in reply to Re: Merge and sort large data
in thread Merge and sort large data

Sorry I have been unclear, the foillowing code now works:
#!/usr/bin/perl -w use strict; use warnings; open F1, 'sort -k3 mergefile1|' or die "opening file 1"; open F2, 'sort -k2 mergefile2|' or die "opening file 2"; open OUF, '>', 'mergefile.out' or die "opening output file"; my @in1; my @in2; sub getrec1 { @in1 = (); if (!eof(F1)) { (@in1) = split /\t/, <F1>; chomp $in1[2]; } } sub getrec2 { @in2 = (); if (!eof(F2)) { (@in2) = split /\t/, <F2>; chomp $in2[1]; } } sub write1 { print OUF "write1 $in1[2]\t$in1[0]\t$in1[1]\tnull\tnull\n"; getrec1; } sub write2 { print OUF "write2 $in2[1]\tnull\tnull\t$in2[0]\t$in2[2]\n"; getrec2; } sub writeboth { print OUF "writeboth $in1[2]\t$in1[0]\t$in1[1]\t$in2[0]\t$in2[ +2]\n"; getrec1; getrec2; } # Prime the pump getrec1; getrec2; while (1) { last if $#in1<0 and $#in2<0; if ($#in1<0 or $#in2<0) { # Only one file is left... write2 if $#in1<0; write1 if $#in2<0; } elsif ($in1[2] eq $in2[1]) { # Matching records, merge & write 'em writeboth; } elsif ($in1[2] lt $in2[1]) { # unmatched item in file 1, write it & get next rec write1; } else { # unmatched item in file 2, write it & get next rec write2; } }
using the file mergefile1 below
15 20 foo 22 30 bar 30 33 baz 14 22 fubar
And the file mergefile2 below
alpha baz 17.30 gamma foobar 22.35 gamma bar 19.01 delta fromish 33.03 sigma bear 14.56
Can anyone tell me why this only works on Korn shell and not in command? Thank you

Replies are listed 'Best First'.
Re^3: Merge and sort large data
by Corion (Patriarch) on Jul 23, 2007 at 11:04 UTC

    I'm not sure why this "only works on Korn shell and not in command, but there are two possibilities:

    Traditionally, Unix shells perform argument wildcard expansion (globbing) on the command line, while command.com and cmd.exe, the shells on Windows do not. If you want to perform globbing yourself, use the glob version, best as follows:

    # Near the top of your program use File::Glob qw(bsd_glob); # sane whitespace handling my @files = glob('../INPUT/*');

    The other possibility is that you have two programs called sort.exe and $ENV{PATH} is set differently between your ksh and your cmd.exe. The sort.exe program that comes with Windows is incompatible with the sort.exe program that behaves like the Unixish programs do. The easiest way to make sure that the "right" sort.exe program is invoked is to give the full path, maybe C:\\programs\\cygwin\\usr\\bin\\sort.exe, instead of calling it implicitly.