in reply to Find duplicate lines from the file and write it into new file.
The traditional approach is to sort your file first, so that all duplicates appear one after another. To sort a huge amount of data, the best approach is to divide it into small parts that fit in memory, sort each part, and then merge the sorted parts (the merge step of a merge sort). During that merge you can already output the duplicates easily, because identical lines come out of the merge next to each other.
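To make the split, sort, and merge idea concrete, here is a minimal sketch in Python. It is only an illustration under a few assumptions, not a tuned implementation: the function names `find_duplicates` and `_write_sorted_chunk` and the `chunk_lines` parameter are made up for this example, each chunk is assumed to fit in memory, and each duplicated line is written to the output once, however often it occurs.

```python
import heapq
import itertools
import os
import tempfile


def _write_sorted_chunk(lines, tmp_dir):
    """Sort one in-memory chunk and write it to its own temporary file."""
    fd, path = tempfile.mkstemp(dir=tmp_dir, text=True)
    with os.fdopen(fd, "w") as chunk:
        chunk.writelines(sorted(lines))
    return path


def find_duplicates(in_path, out_path, chunk_lines=100_000):
    """Write every line that occurs more than once in in_path to out_path, once."""
    with tempfile.TemporaryDirectory() as tmp_dir:
        # Phase 1: split the input into sorted chunks that fit in memory.
        chunk_paths = []
        with open(in_path) as src:
            # Normalise endings so a final line without "\n" still compares equal.
            lines = (line.rstrip("\n") + "\n" for line in src)
            while True:
                chunk = list(itertools.islice(lines, chunk_lines))
                if not chunk:
                    break
                chunk_paths.append(_write_sorted_chunk(chunk, tmp_dir))

        # Phase 2: merge the sorted chunks; equal lines are now adjacent.
        files = [open(path) for path in chunk_paths]
        try:
            with open(out_path, "w") as out:
                previous = None
                reported = False
                for line in heapq.merge(*files):
                    if line == previous:
                        # Report a duplicated line only the first time it repeats.
                        if not reported:
                            out.write(line)
                            reported = True
                    else:
                        previous = line
                        reported = False
        finally:
            for handle in files:
                handle.close()
```

Calling `find_duplicates("big.txt", "dupes.txt")` (both file names are placeholders) would leave the duplicated lines in `dupes.txt` in sorted order. The sketch opens all chunk files at once during the merge, so for a very large input with a small `chunk_lines` you may run into the operating system's limit on open files; merging in several passes would avoid that.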
Re^2: Find duplicate lines from the file and write it into new file.
by tinita (Parson) on Jan 04, 2007 at 13:28 UTC | |
by monkey_boy (Priest) on Jan 04, 2007 at 13:34 UTC | |
by davidrw (Prior) on Jan 04, 2007 at 13:51 UTC |