in reply to finding and deleting repeated lines in a file

How long are the sequences and how long is the file? Can we have one line, for those of use who don't know what a peptide sequence looks like.

It sounds like a quick hash check keyed on the sequence itself. If the data source is large, it'll need a tie and maybe some MD5 action.

--
Steve Marvell

  • Comment on Re: finding and deleting repeated lines in a file