Hello,
I am trying to create a script that will allow me to grab a small sample (~100) of sequences in a fasta file that contains over 200,000 sequences and put it into an output file. I have tried things like split and sed but they are to inefficient for such a process and produce unnecessary output files. Any pointers will help. Thank you.