Re: Joining two files on common field

Yet another way -- just a plain hash:

use strict;
my %data;

open(I1,"file1") or die "file1: $!"
$_ = <I1>;   # read column headings

while (<I1>) {  # get data
    chomp;
    my ($upc,$qty) = split /,/;
    $data{$upc} = $qty;
}

open(I2,"file2") or die "file2: $!"
$_ = <I2>;   # read column headings

while (<I2>) {  # get data
    chomp;
    my ($sku,$siz,$clr,$upc) = split /,/;
    $data{$upc} .= join ',', '', $sku, $siz, $clr;

# or: $data{$upc} = join ',', $data{$upc},$sku,$siz,$clr;
}

for (sort keys %data) {
    print "$_,$data{$_}\n";
}
[download]

That's pretty brittle in a number of ways: it won't handle properly quoted fields in a CSV file (e.g. '121,MD,"Green, Kelly",87654321'); it depends rigidly on a particular ordering and quantity of columns in each file; if file2 has a upc value not found in file1, it'll print ",$sku,$siz,$clr"; if file1 has a upc value not found in file2, it'll print "$upc,$qty".

But it's simple and should do the job as you described it.

Comment on Re: Joining two files on common field Download Code

Replies are listed 'Best First'.
Re^2: Joining two files on common field by Skeeve (Parson) on Sep 23, 2005 at 06:10 UTC
You said: That's pretty brittle in a number of ways: ... it depends rigidly on a particular ordering and quantity of columns in each file So let's fix that. `$_ = <I1>; # read column headings` becomes `$_ = <I1>; # read column headings my @columns= split /,/;` [download] And when reading the columns, instead of `my ($upc,$qty) = split /,/;` and `my ($sku,$siz,$clr,$upc) = split /,/;` we put `@line{@columns}= split /,/;` (provided you have a `my %line;` defined) So in full: Read more... (995 Bytes) `$\=~s;s.;q^\|D9JYJ^^qq^\//\\\///^;ex;print`	[reply] [d/l] [select]

Replies are listed 'Best First'.

Re^2: Joining two files on common field
by Skeeve (Parson) on Sep 23, 2005 at 06:10 UTC

That's pretty brittle in a number of ways: ... it depends rigidly on a particular ordering and quantity of columns in each file

$_ = <I1>; # read column headings

$_ = <I1>;   # read column headings
my @columns= split /,/;
[download]

my ($upc,$qty) = split /,/;

my ($sku,$siz,$clr,$upc) = split /,/;

@line{@columns}= split /,/;

my %line;