in reply to printing fasta sequences from Hash

A more readable version of the question:

Hello Monks, I have a problem and I would greatly appreciate your wisdom to solve it. So, I am reading from a fasta file (containing protein seqeunces) while looping over and trying to print protein sequences.

My problem is: When trying to print each protein sequence that I have in my hash, the sequences are broken down as follows:

Protein sequence:STARTXXXXX Protein Sequence: XXXXXXXXX Protein sequence: XXXXXXEND

Question: How could I avoid this?

Here is my script:

use warnings; my %database; my $protein_id = ""; my $line = ""; my $datbase_filename = ""; my $start = ">DDB"; my $end = ""; #input1: open the file containg the protein sequences. print "Please enter the path to the fasta file of protein sequences\n" +; #put the file name into $filenam $database_filename = <STDIN>; chomp($database_filename); #remove newline #open the file containing protein sequences in fasta format open(MYFILE1, "<$database_filename") or die "Unable to open $database_filename: $!\n"; while (<MYFILE1>) { chomp; #remove newline $line = $_; print "Line i: $_\n"; #test if (($line =~ m/^>DDB/) and ($line =~ m/to\s\d+$/)) { $protein_id = $line; print "protein ID: $protein_id\n"; #test } else { # associating the protein sequence in my %proteins to its ID $proteins{$protein_id} = $line; print "sequence: $line\n"; #test } } exit;
Many thanks in advance.

Replies are listed 'Best First'.
Re^2: printing fasta sequences from Hash
by AnomalousMonk (Archbishop) on Feb 24, 2015 at 21:25 UTC

    kroach: I appreciate you're trying to be helpful, but I think it would have been better if Bijgom had re-formatted the OP him- or herself.


    Give a man a fish:  <%-(-(-(-<

Re^2: printing fasta sequences from Hash
by Bijgom (Initiate) on Feb 24, 2015 at 23:03 UTC

    Sorry guys for my mistake...And many thanks Kroach for editing my post...