To be more specific about the "very large file" containing the DNA sequences: it holds 27,000 sequences and looks something like this:
>identifier 1
ATG...(50ish to 4000 base pairs long)
>identifier 2
ATG...etc.
>identifier 3
Etc....
Actually, the input file is the larger of the two. It contains much smaller chunks of DNA in the same format, and there are probably 200,000 sequences in these files.
I have several of these input files, and each one will be searched against the other "large file".
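To pin the format down, here is a minimal sketch in Python of how files like these could be read and compared. It is only an illustration under assumptions: the function names `read_fasta` and `find_matches` are made up, the demo sequences are invented, and a "match" is assumed to mean that a small chunk occurs as an exact substring of a larger sequence, which may not be the kind of matching needed here.

```python
import io


def read_fasta(handle):
    """Yield (identifier, sequence) pairs from an open file in the format above."""
    identifier, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A new header line closes the previous record, if there was one.
            if identifier is not None:
                yield identifier, "".join(chunks)
            identifier, chunks = line[1:].strip(), []
        elif identifier is not None:
            # Sequence data may be wrapped over several lines; join them later.
            chunks.append(line.upper())
    if identifier is not None:
        yield identifier, "".join(chunks)


def find_matches(small_handle, large_handle):
    """Yield (small_id, large_id) for every small chunk found in a large sequence.

    ASSUMPTION: a match is an exact substring hit. The 27,000-sequence file is
    held in memory, and the bigger input file is streamed one record at a time.
    """
    large = list(read_fasta(large_handle))
    for small_id, small_seq in read_fasta(small_handle):
        for large_id, large_seq in large:
            if small_seq and small_seq in large_seq:
                yield small_id, large_id


# Invented demo data, standing in for the real files.
large_demo = io.StringIO(">identifier 1\nATGCCGTA\nGGCT\n>identifier 2\nATGAAATTT\n")
small_demo = io.StringIO(">chunk A\nGTAGG\n>chunk B\nAAAT\n>chunk C\nCCCC\n")
matches = list(find_matches(small_demo, large_demo))
```

With real files, the `io.StringIO` objects would be replaced by `open(...)` handles. Note that this brute-force loop does roughly 200,000 × 27,000 comparisons per input file, so it shows the shape of the data rather than a quick way to search it.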
Sorry for the lack of detail. I never know how much to give without boring people with useless details.
Cheers,
Dr.J
In reply to Re: Re: Re: Quickest method for matching
by dr_jgbn
in thread Quickest method for matching
by dr_jgbn