Can this script be Optimized?

Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: Can this script be Optimized? by kcott (Archbishop) on May 01, 2014 at 07:01 UTC
There are a number of improvements you could make. Here's a (not necessarily complete) list:

- Dereferencing variables has an overhead. Use a hash instead of a reference to a hash.
- You're getting a list of keys three times and sorting those lists twice. Just do all of this once.
- There's a builtin module List::Util which provides a `max()` function. There's no need to roll your own code for this. [As far as I know, all of the functions provided by that module are implemented in C, so you also get a performance bonus.] (See also: List::MoreUtils)
- Transforming the hash into an AoA is unnecessary. All the required functionality can be achieved by just using the initial hash.
- You've used a total of four `foreach` loops; in the code I've shown below there's only one. [`for` and `foreach` are synonymous: save yourself some typing by just using `for`]
- You can use the `-l` (ell) command switch so you don't need `$/` (or `"\n"`) after each `print` (see perlrun: Command Switches).

I took your original spec and produced the following script. While I kept to the spec, I made no attempt to reproduce any parts of your script.

    #!/usr/bin/env perl -l

    use strict;
    use warnings;

    use List::Util qw{max};

    my %Hash = (
        "A" => ["HYU"],
        "B" => ["TU6"],
        "C" => [ "11", "09", "88", "2" ],
        "D" => [ "01", "11" ]
    );

    my @keys = sort keys %Hash;

    print join ",\t" => @keys;

    {
        no warnings 'uninitialized';

        for my $i (0 .. max map { $#{$Hash{$_}} } @keys) {
            print join "\t" => map { $Hash{$_}[$i] } @keys;
        }
    }

Output:

    A,      B,      C,      D
    HYU     TU6     11      01
                    09      11
                    88
                    2

Using tabs for the output worked in this instance; however, with other data, columns may be out of alignment. Also, tab widths can vary, so what looks fine here may be misaligned elsewhere. Consider using printf to format your output.

Also have a read of "Perl Performance and Optimization Techniques" for more tips you can use in your other scripts.

-- Ken
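The printf suggestion above could look something like the following. This is only a minimal sketch, assuming one fixed column width taken from the longest key or value. `format_table` is a hypothetical helper name (it is not part of the original script), it takes a hash reference purely so it can be reused, and the `//` operator needs Perl 5.10 or later.

```perl
#!/usr/bin/env perl

use strict;
use warnings;

use List::Util qw{max};

my %Hash = (
    "A" => ["HYU"],
    "B" => ["TU6"],
    "C" => [ "11", "09", "88", "2" ],
    "D" => [ "01", "11" ],
);

# Hypothetical helper: returns the table as a list of formatted lines.
sub format_table {
    my ($data) = @_;
    my @keys = sort keys %$data;

    # Column width: the longest key or value anywhere in the table.
    my $width = max map { length($_) } @keys, map { @$_ } values %$data;

    # One left-justified, fixed-width field per column.
    my $fmt = join(' ', ("%-${width}s") x @keys) . "\n";

    my @lines = ( sprintf($fmt, @keys) );
    for my $i (0 .. max map { $#{ $data->{$_} } } @keys) {
        # Missing cells become empty strings rather than undef.
        push @lines, sprintf($fmt, map { $data->{$_}[$i] // '' } @keys);
    }
    return @lines;
}

print format_table(\%Hash);
```

Because every field is padded to the same width, the columns stay aligned regardless of the terminal's tab settings. A per-column width would be a straightforward refinement.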
Re^2: Can this script be Optimized? by tobyink (Canon) on May 01, 2014 at 08:00 UTC
"As far as I know, all of the functions provided by that module are implemented in C, so you also get a performance bonus." Indeed, they are. List::Util also happens to make very efficient use of Perl's API (e.g. using multicall), allowing its functions to perform at speeds approaching Perl's built-in list operators (`grep`, `map`, `sort`). "See also: List::MoreUtils" This is also written in C (and also makes use of multicall), but has a fallback Perl implementation allowing it to be installed on machines lacking a C compiler. If a function exists in both List::Util and List::MoreUtils (and because recent List::Util releases have been adding new functions, the overlap between them is growing), then prefer the List::Util one because it will be guaranteed to be implemented in C. `use Moops; class Cow :rw { has name => (default => 'Ermintrude') }; say Cow->new->name`	[reply] [d/l] [select]
Re^3: Can this script be Optimized? by kcott (Archbishop) on May 01, 2014 at 09:03 UTC
Thanks for the feedback. I had seen the C and pure-Perl implementation information in List::MoreUtils [from CPAN]; however, the List::Util [from perldoc.perl.org] documentation made no mention of this: hence the "As far as I know" qualifier.

I was a little surprised when you mentioned there was an overlap between these two modules as I wasn't aware of this. I did a little investigation and found:

- http://perldoc.perl.org/List/Util.html
  Indicates Perl v5.18.2 and only shows these functions: `first max maxstr min minstr reduce shuffle sum`. It does show `any`, `all`, et al, which currently exist in List::MoreUtils, as suggested additions which haven't been included. (I couldn't find any mention of a module version number.)
- http://search.cpan.org/~pevans/Scalar-List-Utils-1.38/lib/List/Util.pm
  This does show `any`, `all`, et al as being included.
- `$ perldoc List::Util`
  This appears to be the same doco as the CPAN version. Checking my versions: `List::Util 1.38` and `Perl 5.18.1`

I have, up until now, used the perldoc.perl.org documentation for all builtin modules. It's clear that this is out-of-date for `List::Util` (i.e. it's not the POD that ships with 5.18.2); unfortunately, this leaves me wondering what other parts of its doco aren't up-to-date.

-- Ken
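Rather than relying on online documentation, the installed module can be asked directly. A small sketch, assuming only that List::Util is installed (it ships with Perl); the list of function names checked here is just an illustrative selection:

```perl
use strict;
use warnings;

use List::Util ();

# $] is the version of the running perl; ->VERSION is the universal
# method that returns the module's $VERSION.
printf "perl %s, List::Util %s\n", $], List::Util->VERSION;

# can() returns a code reference if the installed module defines the
# named function, and a false value otherwise.
for my $name (qw{max first any all}) {
    printf "%-5s %s\n", $name,
        List::Util->can($name) ? 'available' : 'missing';
}
```

If `any` or `all` is reported as missing, the installed List::Util predates those additions, and List::MoreUtils (or a newer Scalar-List-Utils from CPAN) would be needed.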
Re^4: Can this script be Optimized? by tobyink (Canon) on May 01, 2014 at 23:14 UTC
Re^5: Can this script be Optimized? by kcott (Archbishop) on May 01, 2014 at 23:44 UTC
Re^2: Can this script be Optimized? by RonW (Parson) on May 01, 2014 at 16:40 UTC
In the interest of education, `map` is a "feature enhanced" version of `for`.

`map { expr; } @array;` is equivalent to: `for (@array) { push @newarray, expr; }`

Presumably, `expr` uses `$_` either implicitly or explicitly.

Similarly, `grep { expr; } @array;` is equivalent to: `for (@array) { push @newarray, $_ if expr; }`
Re^3: Can this script be Optimized? by kcott (Archbishop) on May 01, 2014 at 17:03 UTC
Actually, I'd say a closer equivalency would be `my @newarray = map { expr } @array;` and `my @newarray; for (@array) { push @newarray, expr; }`

Or, conversely, `map { expr } @array;` and `for (@array) { expr; }` or `expr for @array;`

Anyway, as that was a reply to my post, were you suggesting I replace a `map` with a `for`, or vice versa? Or, perhaps, something else?

-- Ken
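For anyone following along, the equivalences discussed in this sub-thread can be checked with a short script. This is just an illustrative sketch; the sample data and variable names are made up for the example.

```perl
use strict;
use warnings;

my @array = (1 .. 5);

# map builds a new list from the value of the block for each element ...
my @doubled_map = map { $_ * 2 } @array;

# ... which matches an explicit loop that pushes each result.
my @doubled_for;
for (@array) { push @doubled_for, $_ * 2 }

# grep keeps the elements for which the block is true ...
my @even_grep = grep { $_ % 2 == 0 } @array;

# ... which matches a loop with a conditional push.
my @even_for;
for (@array) { push @even_for, $_ if $_ % 2 == 0 }

print "map:  @doubled_map\n";
print "for:  @doubled_for\n";
print "grep: @even_grep\n";
print "for:  @even_for\n";
```

The `map` and `grep` forms return their results, so they fit where a new list is wanted; the plain `for` form is the natural choice when the block is run only for its side effects.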
Re^4: Can this script be Optimized? by RonW (Parson) on May 01, 2014 at 17:22 UTC
Re^5: Can this script be Optimized? by kcott (Archbishop) on May 01, 2014 at 18:51 UTC
Re: Can this script be Optimized? by NetWallah (Canon) on May 01, 2014 at 05:49 UTC
How about this:

    print join( ",\t" => my @kk=sort keys %$Hash ), $/;
    my $row = 0;
    my $items_in_row;
    do {
        $items_in_row = 0;
        my $v;
        for my $k (@kk){
            if (defined ($v = $Hash->{$k}[$row])) {
                $items_in_row ++;
            }else{
                $v="";
            }
            print "$v\t"
        }
        print "$/";
        $row++;
    } until $items_in_row == 0;

Produces the same output as your code.

Update: Another version:

    print join( ",\t" => my @kk=sort keys %$Hash ), $/;
    my $items_in_row;
    my $row=0;
    do {
        $items_in_row = 0;
        my $v;
        print +(map{
            if ( defined ($v = $Hash->{$_}[$row]) ){
                $items_in_row++ ;
                "$v\t"
            }else{
                "\t"
            }
        } @kk ), "$/";
        $row++;
    } until $items_in_row == 0;

What is the sound of Perl? Is it not the sound of a wall that people have stopped banging their heads against?
-Larry Wall, 1992
Re^2: Can this script be Optimized? by Anonymous Monk on May 01, 2014 at 06:11 UTC
Thanks a lot, NetWallah. I knew there was no need for all the multiple `for` loops I was using. Tim Toady takes the day.
Re^3: Can this script be Optimized? by NetWallah (Canon) on May 01, 2014 at 06:32 UTC
OK - one final (short/cryptic) version:

    print join( ",\t" => my @kk=sort keys %$Hash ), $/;
    for (my $i=0; scalar grep {+defined} (my @v=map {$Hash->{$_}[$i] } @kk) ; $i++){
        print join ("\t", map {defined $_ ? $_:""}@v),"$/";
    }

What is the sound of Perl? Is it not the sound of a wall that people have stopped banging their heads against?
-Larry Wall, 1992