in reply to How to delete lines with similar data

A couple of (uncalled for ;--) comments on your script:

A couple of style points:

So your code modified:

#!/bin/perl -w use strict; open (REGFILE, "<$datadir/projects.dat") || die "Cannot open File1: $! +"; open (NEWFILE,">>$datadir/subscribe\.dat") || die "Cannot open File2: +$!"; my %seen_email; # hash email => 1 while( <REGFILE>) { my( $email, $name, $format); ($email, $name, $format, undef) = split /\|/; next if( $seen_email{$email}); print NEWFILE"$email|$name|$format\n"; $seen_email{$email}=1; # either just the email or the whole li +ne } close (NEWFILE); close REGFILE;

An other option, if you are on any kind of unix (or cygwin) is to preprocess the initial file using sort -u:

#!/bin/perl -w use strict; # the die is actually useless here open (REGFILE, "sort -u $datadir/projects.dat |") || die "Cannot open +File1: $!"; open (NEWFILE,">>$datadir/subscribe\.dat") || die "Cannot open File2: +$!"; while( <REGFILE>) { my( $email, $name, $format); ($email, $name, $format, undef) = split /\|/; print NEWFILE"$email|$name|$format\n"; } close (NEWFILE); close( REGFILE) || die "could not sort $datadir/projects.dat: $!"; # i +n this case the error happens when you close the file