in reply to Arrays and Files

What exactly counts as a sequence?? Is it the lines underneath

GTM1_MOUSE etc.. etc..

or is it the whole entry? I am going to assume it is the whole entry.

$file = "xyz"; if(!open(READ_FILE, "$file")){ die "Cannot open file [$file]\n"; } my $sequence = ""; while(my $line = <READ_FILE>){ if($line =~ m/^G.*\(EC/){ if($sequence){ #write sequence here. print "seq: $sequence\n"; $sequence = ""; } $sequence .= $line; }else{ $sequence .= $line; } } #write last sequence here print "last: $sequence\n";
That regex is somewhat hardcoded, if you are nervous about it, you can always change it to look for the empty line. I sort of assumed that every sequence would start with

G<something>

and would have

(EC <more-stuff>