in reply to content triggered parsing in Spreadsheet-ParseExcel

Here is a version that is more dynamic but assumes data is columns A and B starting from row 1:

School 1 Dean John No.stu. 55 School 2 Dean Tony No. Students 60 School 3 Dean James No.stu. 56 No. Teacher 20

Most of the code is taken from amon's example on stackoverflow.com

use strict; use warnings; use Spreadsheet::ParseExcel; my ($infile) = @ARGV; my $parser = Spreadsheet::ParseExcel->new(); my $workbook = $parser->parse($infile); die $parser->error unless defined $workbook; my ($worksheet) = $workbook->worksheets(); my %data; # accumulate data here my $row = 0; my $school = 0; while(1){ my $cell = $worksheet->get_cell($row, 0); last unless defined($cell); my $key = $cell->value(); my $data = $worksheet->get_cell($row++, 1)->value(); if( $key eq "School" ) { $school = $data; } else { $data{$school}{$key} = $data; } } # see what we got foreach my $s (sort keys %data) { print "School $s:\n"; foreach my $fact (sort keys %{$data{$s}}) { print "\t$fact: $data{$s}{$fact}\n"; } }

which will print

School 1: Dean: John No.stu.: 55 School 2: Dean: Tony No. Students: 60 School 3: Dean: James No. Teacher: 20 No.stu.: 56

If you now add more facts underneath a school it will automatically add it to the hash.

Replies are listed 'Best First'.
Re^2: content triggered parsing in Spreadsheet-ParseExcel
by qingxia (Novice) on Mar 27, 2013 at 09:41 UTC
    Hi hdb,

    thanks very much for the code. i learned a lot from it but still have some questions. Most of them may seem straightforward to you but I hope get them clarified.

    1. last unless defined($cell); i think it is used to tell the loop when to stop, e.g. when the loop stops when it reaches the undefined cell.

    2. my $data = $worksheet->get_cell($row++, 1)->value(); I guess this line increments the row number by one AFTER it fetches the column value.

    3.

    if( $key eq "School" ) { $school = $data; } else { $data{$school}{$key} = $data; }

    this i am not sure, but it seems to me that the hash table %data has 2 layers, first is school, the second contains the rest facts. If it reaches the $school row, record it as first layer key. And record the rest as other keys? Please correct me.

    Thanks a lot in advance!

      No corrections required. You got it all correct.

        thanks for the quick respond!
        Hi again, I hope i am not asking too much. But then how could I out-write the hash data to a new xls file? Ideally, the new excel table should look like:
        col1 col2 col3 col4 row1 School Dean No.stu. No.teacher row2 1 John 55 row3 2 Tony 60 row4 3 James 56 20
        Best regards,
Re^2: content triggered parsing in Spreadsheet-ParseExcel
by qingxia (Novice) on Mar 25, 2013 at 22:08 UTC
    wow, it works really well. thanks a lot.