in reply to Re^4: 15 billion row text file and row deletes - Best Practice?
in thread 15 billion row text file and row deletes - Best Practice?

And if the list of items to delete is _really_ small, I just use egrep from the command line:

egrep -v '^(item1|item2|item3|item4|item5),' input_file > output_file

Replies are listed 'Best First'.
Re^6: 15 billion row text file and row deletes - Best Practice?
by djp (Hermit) on Dec 04, 2006 at 02:36 UTC
    Or use the -f option to GNU grep:
    grep -E -v -f deletes.txt infile > outfile