bingalee has asked for the wisdom of the Perl Monks concerning the following question:
I have a huge file with many sequences. I need to separate them into different files.How do I go about making a perl script for it? .Thanks in advcance
Here's what my file looks like,
XLOC_000039 >chr1:983051-985037 CATGACTTTGTCGGAATTATGTTACTGCTCATTATCAATTCCACCATTAGCTTCATAGAG GAAAACAATGCCGGGAATGCTGCTGCTGCGCTTATGGCCCGCCTCGCACCAAAATCCAAG GTAAGCCCCACACCCTACTTACCACTCCTTTTTCTTCTCAATACTGCTTTTCATCATGTT ACACTCATTTTCTAGGTTTTACGTGATGGAACCTGGAGTGAAATGGACGCATCTTTGTTG GTGCCCGGTGACATAATCAGCATTAAACTTGGAGACATCATTCCGGCAGATGCGCGTCTT CTCGAGGGAGATCCGCTGAAAATTGACCAGGTCTTTCTTGTGTCTCAATCATAGTGTTCT TGGTAGAGCGGAAAAAAAAATATTCTGATATGAAAATTACATGAGACACTAAAACACATA XLOC_000456 >chr1:12600284-12601781 CAACAATCTCTGATGATGCGGCAGGGCCTTGCTCGCGGGGCGTGGTGCTACCTCGAGGAT GAGTTCCTTGGCCAAAGGGAATCCCGGGCGCTTCTACTTGAGACAAAATTCCGCAACTTC CGCCAAGAGTCCTTGAGCATCACTGACTACTGCCGCCAGCTTGAGTCAATGGCGGCATCC CTTGCCGGTTTCGGCGATCCCATCGGCGATAGGCAGATGGTGCTCACGCTCCTTCGTGGC CTCGGCGGCAAGTTCCGTCACATGGTGTCCATCCTCAAGATGCACCAGCCGTTCCCCACG TTCGCAGAGGCTCGTGCGCACCTGCTGTTGGAGGAGCTGGAAATCGACGCACGACCTCCA TCACCGCCATCGGCACTTGTTGCTGCAGCGCCGCGGCATGCGACTCCGGGGGCCCCAGTA
So I thought I could split each of them into arrays but I kinda dont know what to do after that.
this is what I got
#usr/bin/perl -w open(IN,"/home/datasets/maize/extracted_sequences.gtf"); mkdir sequence1; while($seq=<IN>) { @file=split(/\r/,$seq); open(OUT, ">sequences1/$file[0].txt"); print OUT ">$file[0]"; } close(IN); close(OUT);
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: put every sequence of a file in a different output file
by kennethk (Abbot) on Jun 13, 2013 at 15:09 UTC | |
by bingalee (Acolyte) on Jun 13, 2013 at 16:10 UTC | |
|
Re: put every sequence of a file in a different output file
by davido (Cardinal) on Jun 13, 2013 at 15:13 UTC | |
|
Re: put every sequence of a file in a different output file
by 2teez (Vicar) on Jun 13, 2013 at 16:54 UTC |