Tidier and more efficient parsing code

JPaul has asked for the wisdom of the Perl Monks concerning the following question:


Re: Tidier and more efficient parsing code
by Fletch (Bishop) on Jan 24, 2002 at 08:58 UTC

One of MJD's red flags: whenever you've got variables named $fooone and $footwo, you probably want an array. Likewise here, you've got multiple arrays named with indices, which means you probably want a hash.

my $cur_section = 0;
my %sections;
while( <TEXTFILE> ) {
  if( /^;section(\d+)/ ) {
    $cur_section = $1;
    next;
  }

  unless( $cur_section ) {
    warn "No section defined for `$_'\n";
    next
  }

  push @{ $sections{ $cur_section } }, $_
}
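
For anyone who wants to try the hash-of-arrays idea without a file on disk, here is a minimal self-contained sketch. The sample text and the in-memory filehandle are assumptions made for illustration; the loop itself follows the code above, except that it chomps each line and silently skips lines before the first section.

```perl
use strict;
use warnings;

# Hypothetical sample input, read through an in-memory filehandle.
my $text = ";section1\nfoo\nbar\n;section2\nbaz\n";
open my $fh, '<', \$text or die "Cannot open in-memory file: $!";

my $cur_section = 0;
my %sections;
while ( my $line = <$fh> ) {
    if ( $line =~ /^;section(\d+)/ ) {
        $cur_section = $1;    # remember which section we are in
        next;
    }
    next unless $cur_section;    # skip anything before the first marker
    chomp $line;
    push @{ $sections{$cur_section} }, $line;
}
close $fh;

# Walk the sections in numeric order.
for my $n ( sort { $a <=> $b } keys %sections ) {
    print "section $n: @{ $sections{$n} }\n";
}
```

Because the section number is just a hash key, a third or fourth section needs no new variables.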

Re: Re: Tidier and more efficient parsing code

by JPaul (Hermit) on Jan 24, 2002 at 21:49 UTC

JP,
-- Alexander Widdlemouse undid his bellybutton and his bum dropped off --


(crazyinsomniac) Re: Tidier and more efficient parsing code
by crazyinsomniac (Prior) on Jan 24, 2002 at 09:05 UTC

my $section = 0;  my @sectone;  my @secttwo;
while (<TEXTFILE>) {
    if (/;section(1)/) {
        push(@sectone, $_);
    } elsif (/;section(2)/) {
        push(@secttwo, $_);
    } else {
        print "No section defined for: $_\n";
    }
}

my $section = 0;  my @sectone;  my @secttwo;
while (<TEXTFILE>) {
    if (/;section([12])/) {
        # push needs a real array, so pick a reference and dereference it
        push @{ $1 == 1 ? \@sectone : \@secttwo }, $_;
    }
    else {
        print "No section defined for: $_\n";
    }
}

my $section = 0;
my %sections = (1 => [], 2 => []);
while (<TEXTFILE>) {
    if (/;section([12])/) {
        push @{ $sections{$1} }, $_;
    }
    else {
        print "No section defined for: $_\n";
    }
}

update:

my $section = 0;
my %sections = (1 =>[],2=>[]);
while(<TEXTFILE>) {
    push @{$sections{$1}},$_
      and next
    if /;section([12])/;

    chomp( my $line = $_ );
    # no trailing newline, so warn appends the script line and "<TEXTFILE> line $."
    warn "No section defined for: $line";
}

incantation huh?:

Incantation two:

my $section;  my @sectone;  my @secttwo;
while (<TEXTFILE>) {
    if (/;section([12])/) {
        # keep a reference to the array for the current section
        $section = $1 == 1 ? \@sectone : \@secttwo;
    }
    else {
        push @{$section}, $_ and next if $section;
        warn "No section defined for: $_\n";
    }
}

my $section = 0;
my %sections = (1 => [], 2 => []);
while (<TEXTFILE>) {
    if (/;section([12])/) {
        $section = $1 and next;
    }
    # parentheses matter: without them "and next" also fires when $section is set
    push @{ $sections{ $section
                       || ( warn("No section defined for: $_\n") and next ) } },
      $_;
}

my $section = 0;
my %sections = (1 => [], 2 => []);
while (<TEXTFILE>) {
    $section = $1 and next if /;section([12])/;
    warn "No section defined for: $_\n" and next unless $section;
    push @{ $sections{$section} }, $_;
}

update:
I lobster, but I never flounder

my $section = 0;
my %sections = (1 => [], 2 => []);
while (<TEXTFILE>) {
    push @{ $sections{ ( /;section([12])/
                         and $section = $1
                         and next )
                       or ( $section
                            || ( warn("No section defined for: $_\n")
                                 and next ) ) } },
      $_;
}

______crazyinsomniac_____________________________
Of all the things I've lost, I miss my mind the most.
perl -e "$q=$_;map({chr unpack qq;H*;,$_}split(q;;,q*H*));print;$q/$q;"


Re: (crazyinsomniac) Re: Tidier and more efficient parsing code

by demerphq (Chancellor) on Jan 24, 2002 at 15:19 UTC

I just ran it and it seems not to... As far as I can tell the push will only happen when there is a ;section on the line.

Sorry.
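
A minimal sketch of the behaviour in question, with made-up sample lines: variant A pushes only when the current line itself matches the marker, so it collects the marker lines and nothing else. Variant B remembers the current section and pushes the lines that follow it. The names %by_match and %by_state are hypothetical and exist only for this comparison.

```perl
use strict;
use warnings;

# Hypothetical sample lines, as they would come from the file.
my @lines = ( ";section1\n", "foo\n", ";section2\n", "bar\n" );

# Variant A: push only when the line itself matches the marker.
my %by_match = ( 1 => [], 2 => [] );
for (@lines) {
    push @{ $by_match{$1} }, $_ if /;section([12])/;
}

# Variant B: remember the current section, push the lines that follow it.
my %by_state = ( 1 => [], 2 => [] );
my $section = 0;
for (@lines) {
    if (/;section([12])/) { $section = $1; next }
    push @{ $by_state{$section} }, $_ if $section;
}

print "A, section 1: $by_match{1}[0]";
print "B, section 1: $by_state{1}[0]";
```

Variant A ends up holding ";section1", while variant B holds "foo", which is what the original question seems to want.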

Yves / DeMerphq
--
When to use Prototypes?


Re: Tidier and more efficient parsing code
by demerphq (Chancellor) on Jan 24, 2002 at 15:06 UTC

crazyinsomniac

my $section = 0;
my @sections=(undef,[],[]);  
while(<TEXTFILE>) {
    $section = $1 and next if /;section([12])/;
    die "Bad section $section" unless $section;
    push @{$sections[$section]},$_;
}

Update:

Yves / DeMerphq
--
When to use Prototypes?


Re: Tidier and more efficient parsing code
by flocto (Pilgrim) on Jan 25, 2002 at 18:41 UTC

my $data = [];
my $section = 0;
while (<TEXT>)
{
    if (/^;section(\d)/)
    {
        $section = $1;
    }
    else
    {
        next unless $section;
        push (@{$data->[($section - 1)]}, $_);
    }
}

#my @section_one = @{$data->[0]}; # if absolutely
#my @section_two = @{$data->[1]}; # necessary only

Re: Tidier and more efficient parsing code
by lirm (Novice) on Jan 26, 2002 at 00:05 UTC

It's kinda verbose, but I couldn't figure out how to get rid
of the first element during split.

{
    undef $/;
    @temp = split /;/, <DATA>; # slurp file
    shift @temp; # Get rid of everything before first semicolon
    %sections = map { my($sect, @param) = split /[\n\r\f]+/; ($sect, \@param) } @temp;
}

# Just to test it
foreach (keys %sections) {
    print "$_ => [@{$sections{$_}}]\n";
}

__DATA__
;section1
foo
bar
;section2
bar
baz

Re: Re: Tidier and more efficient parsing code

by particle (Vicar) on Jan 26, 2002 at 00:10 UTC

{
    local $/;
    undef $/;
    (undef, @temp) = split /;/, <DATA>; # slurp file
    %sections = map { my($sect, @param) = split /[\n\r\f]+/; ($sect, \@param) } @temp;
}
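
For reference, a self-contained sketch of the same slurp-and-split idea that runs under strict. The sample text and the in-memory filehandle are assumptions for illustration, and it relies on `local $/;` alone, which already leaves $/ undefined inside the block.

```perl
use strict;
use warnings;

# Hypothetical sample input with some junk before the first section.
my $text = "junk before\n;section1\nfoo\nbar\n;section2\nbar\nbaz\n";
open my $fh, '<', \$text or die "Cannot open in-memory file: $!";

my %sections;
{
    local $/;    # undef inside this block only, so <$fh> slurps everything
    # Assigning to undef discards whatever precedes the first semicolon.
    my ( undef, @chunks ) = split /;/, <$fh>;
    %sections = map {
        my ( $name, @lines ) = split /[\n\r\f]+/;
        ( $name => \@lines );
    } @chunks;
}
close $fh;

print "$_ => [@{ $sections{$_} }]\n" for sort keys %sections;
```

The hash keys here are the full marker names ("section1", "section2"), as in the code above, not bare numbers.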

~Particle
