Thanks for the replies - I'll try to be more specific. The data in the files is comma separated db data. I'm trying to find the lines in file1 that aren't in file2 and vice versa. Merlyn posted an excellent approach for doing this with a hash table in a previous thread
. I'm not having any problems with the current script's logic, just with the amount of time it takes to run and the memory limitations.