in reply to How to match duplicate lines in a text file and extract only one of those lines to a new file

There is a way, yes :)
However, the solution depends on the details: Are all of the leading numbers duplicated? If so, are the numbers that follow them always identical? And so on. The solution can be very specific or very generic, depending on the answers.
If you try something yourself, post the code and we can take it from there.

I'm too lazy to be proud of being impatient.

Re^2: How to match duplicate lines in a text file and extract only one of those lines to a new file
by danica (Initiate) on Apr 04, 2012 at 13:04 UTC
    Yes, the numbers in the first column are duplicated. The first column is a unique ID number given to each individual. As you probably guessed, the file contains DNA sequences, and each individual is allocated 2 rows to represent their alternative alleles. However, I need to transform the data so that each individual has only 1 row instead of 2.
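
For illustration only, here is a minimal sketch of the two most likely readings of "one row per individual". It is written in Python rather than Perl, and it rests on assumptions the thread does not confirm: fields are whitespace-separated, the ID is the first field, and all rows for an ID should be treated alike. The function names and the sample rows are invented for the example and are not real data.

```python
def keep_first_row_per_id(lines):
    """Keep only the first row seen for each ID (first whitespace-separated field)."""
    seen = set()
    kept = []
    for line in lines:
        fields = line.split()
        if not fields:              # skip blank lines
            continue
        if fields[0] not in seen:   # first time this ID appears
            seen.add(fields[0])
            kept.append(line.rstrip("\n"))
    return kept


def merge_rows_per_id(lines):
    """Join all rows that share an ID into one row: the ID, then every remaining field."""
    merged = {}                     # dicts keep insertion order in Python 3.7+
    for line in lines:
        fields = line.split()
        if not fields:
            continue
        merged.setdefault(fields[0], []).extend(fields[1:])
    return [" ".join([row_id] + rest) for row_id, rest in merged.items()]


# Invented sample: two rows (alleles) per individual, ID in the first column.
sample = [
    "101 A C G T\n",
    "101 A T G T\n",
    "102 G G C A\n",
    "102 G G C C\n",
]

print(keep_first_row_per_id(sample))  # ['101 A C G T', '102 G G C A']
print(merge_rows_per_id(sample))      # ['101 A C G T A T G T', '102 G G C A G G C C']
```

Which variant fits depends on the answers to the questions above: the first throws away the second allele row, while the second keeps both alleles side by side on one line.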