in reply to Re: Very Large CSV Filter
in thread Very Large CSV Filter

If the files are in the proper order (sorted by email address, if I remember correctly), you can use a file merge type solution.

Conversion into perl is left as an exercise to the reader.

sort (OS level), sort (Perl level), open, close, eof, and perlop are all potentially helpful in this task.

Be aware that you are dealing with 1 billion records, so it is likely, depending on the complexity of the records and comparison, that the sort or filter step could take a while.

Benefits: only one record from each of the input and deletion files is in memory at a time.

--MidLifeXis