comment on

hi i have been trying to extract a multiline sequence which consists of a string of letters spaning several lines followed by a blank line, then a header sequence ie

 
*header 
abcdefghik 
sdsaadsd 
addds 
 
*header 
a....
[download]

so far i have used the following, but it still only matches 1 line at a time! ie when i use the the array seqs later each $i element is only 1 line. i am out of books/ideas so all help would be appreciate.

 
 
while ($file=<INPUT1>){ 
    chomp $line; 
      if($file =~/^[a-z](.*?)(?=(\z))/msg){ 
            $seq=$1; 
            $seq=~ s/\n//; 
            push @seqs, $seq; 
            print OUTFILE2  @seqs; 
            exit; 
      } 
 
      elsif($file =~/*/){ 
            $header =$fastafile; 
             chomp $header; 
            $header=~s/>//; 
            push @headers, $header; 
            print OUTFILE "$header\n";  
  }
[download]

many thanks in advance, i know this is novice question :-) PC

In reply to multi-line match by pdotcdot

Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!

Titles consisting of a single word are discouraged, and in most cases are disallowed outright.

Read Where should I post X? if you're not absolutely sure you're posting in the right place.

Please read these before you post! —

Posts may use any of the Perl Monks Approved HTML tags:

a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr

You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)

	For:		Use:
	&		`&`
	<		`<`
	>		`>`
	[		`[`
	]		`]`

Link using PerlMonks shortcuts! What shortcuts can I use for linking?

See Writeup Formatting Tips and other pages linked from there for more info.