in reply to Splitting a multi-sequence fasta file into individual sequences in individual arrays
Why do you want an AOAs? Each sequence is a header + a single, wrapped sequence. Wouldn't a hash be more useful?
#! perl -slw use strict; use Data::Dump qw[ pp ]; my %seqs; { local $/ = ">"; my @seqs = <DATA>; chomp @seqs; s[\n][\t] for @seqs; tr[\n][]d for @seqs; shift @seqs; %seqs = map split( "\t" ), @seqs; } pp \%seqs; __DATA__ >sequence header 1. AAATATTATATATATTGCG ATTATTATATGCGCGGCGC >sequence header 2 AATTGGGCTCGCTGCTTTT AGGAGGAGGAGCCCTCTCC >sequence header 3 AATTGGCTGCTCGCTGCTC AATGTGTCGGCGCGCGTGC
Prints
[ 4:34:55.96] c:\test>junk40 { "sequence header 1." => "AAATATTATATATATTGCGATTATTATATGCGCGGCGC", "sequence header 2" => "AATTGGGCTCGCTGCTTTTAGGAGGAGGAGCCCTCTCC", "sequence header 3" => "AATTGGCTGCTCGCTGCTCAATGTGTCGGCGCGCGTGC", }
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: Splitting a multi-sequence fasta file into individual sequences in individual arrays
by krish28 (Acolyte) on Feb 09, 2011 at 15:15 UTC | |
by BrowserUk (Patriarch) on Feb 09, 2011 at 18:28 UTC |