in reply to Regular Expressions Challenge

In the absence of being able to come up with a single line of code for regular expressions I've combined the split suggestions with a regular expression to achieve the desired result. Here is the working code:

my @section; my $i = -1; foreach (split /\n/, $str) { if (/^===Comments===$/) { ++$i; next; } next unless $i >= 0; $section[$i] .= "$_\n"; } my $comments1; my $comments2; my $comments3; if ($section[0] =~ /\n(.*?)=Microarray Data=/gs){ $comments1 = $1; } if ($section[1] =~ /\n(.*?)=Pathway Information=/gs){ $comments2 = $1; } if ($section[2] =~ /\n(.*?)=Aditional Info=/gs){ $comments3 = $1; } print $comments1, $comments2, $comments3;

Thanks for all pointers. If anyone can think of a way of making this neater and simpler that would be great. But otherwise it's working so I'm happy!

Thanks.

Replies are listed 'Best First'.
Re^2: Regular Expressions Challenge
by ig (Vicar) on May 18, 2010 at 10:00 UTC

    Not sure if this is neater or simpler, but certainly shorter - maybe you would like one of the following:

    my @sections = $str =~ m/^===Comments===\n(.*?)\n=.*?=$/gms; print Dumper(\@sections); print "\n\nOR\n\n"; my %sections = reverse( $str =~ m/^===Comments===\n(.*?)\n=(.*?)=$/gms + ); print Dumper(\%sections); print "\n\nOR\n\n"; while($str =~ m/^===Comments===\n(.*?)\n=(.*?)=$/gms) { print " $2 : $1\n"; }

    Which produces:

    $VAR1 = [ ' User comments are added here. A user may write whatever they may wish. ', ' Comments are related to the microarray data here. ', ' Comments related to the pathway information here. ' ]; OR $VAR1 = { 'Microarray Data' => ' User comments are added here. A user may write whatever they may wish. ', 'Pathway Information' => ' Comments are related to the microarray data here. ', 'Aditional Info' => ' Comments related to the pathway information here. ' }; OR Microarray Data : User comments are added here. A user may write whatever they may wish. Pathway Information : Comments are related to the microarray data here. Aditional Info : Comments related to the pathway information here.

    update: but I see the last two are wrong, now that I've posted it.

    my %x = map { /^=([^\n]*)=\n.*\n===Comments===\n(.*)$/s ? ( $1 => $2 ) : () } split(/(?=^=[^=])/m,$str); print Dumper(\%x);

    gives

    $VAR1 = { 'Microarray Data' => ' Comments are related to the microarray data here. ', 'Pathway Information' => ' Comments related to the pathway information here. ', 'Literature' => ' User comments are added here. A user may write whatever they may wish. ' };