RE: RE: Hash of hashes question

Replies are listed 'Best First'.

RE:(3) Hash of hashes question
by Russ (Deacon) on Jun 19, 2000 at 06:04 UTC

Being curious, I did a quick benchmark. However, its results defy logic (my (sort grep keys)[-1] is always faster than a O(1) "max()" algorithm!)

Since I am obviously missing something, would you be willing to post some benchmarking code showing the efficiency differences between "sort" and "max()" in this context?

Russ

P.S. Just for the sake of helping me find my silliness, here is what I was running. Changing the map to larger numbers (to make the source hash bigger) made no difference to the benchmark times.

#!/usr/bin/perl -w
use strict;
use Benchmark;
my $now = 8; # we'll pretend it's between 8 and 9 PM
my %url = (
  monday => {
    @{[map(($_,1), (1..1000))]}
  }
);
timethese(100000, {
  a => q{
    $now = (sort grep {$_ <= $now} keys %{$url{'monday'}})[-1];
  },
  b => q{
    $now = ($now < $_ && $_ < 8 ? $_ : $now) for keys %{$url{'monday'}
+};
  }
});
[download]

[reply]
[d/l]

RE: RE:(3) Hash of hashes question

by Aighearach (Initiate) on Jun 19, 2000 at 11:56 UTC

I think it is the because sort() is very fast, and you are comparing it with an if/then (?:) control structure, which is inherantly slow.

Well, another thing to do besides benchmarking is profiling:

#!/usr/bin/perl
use strict;
use Devel::OpProf qw(profile print_stats zero_stats );
profile(1);    # turn on profiling
my $now = 8; # we'll pretend it's between 8 and 9 PM
my %url = (
       monday => {
           @{[map(($_,1), (1..1000))]}
       }
       );
#start profiling
zero_stats;
$now = (sort grep {$_ <= $now} keys %{$url{'monday'}})[-1];
print_stats;

zero_stats;
$now = ($now < $_ && $_ < 8 ? $_ : $now) for keys %{$url{'monday'}};
print_stats;
[download]

Well, this tells us a lot!

First algorithm

null operation 1006

block 1001

private variable 1001

grep iterator 1000

numeric le (<=) 1000

scalar variable 1000

pushmark 7

next statement 5

glob value 3

constant item 2

subroutine entry 2

private hash 1

private array 1

sort 1

keys 1

conditional expression 1

list slice 1

hash dereference 1

hash element 1

array dereference 1

scalar assignment 1

block entry 1

grep 1

Second algorithm

private variable 3000

null operation 2006

logical and (&&) 2001

glob value 1996

numeric lt (<) 1992

scalar dereference 1992

next statement 1006

conditional expression 1001

foreach loop iterator 1001

scalar assignment 1000

iteration finalizer 1000

constant item 993

pushmark 4

subroutine entry 2

block 1

keys 1

foreach loop entry 1

array dereference 1

block entry 1

private array 1

private hash 1

hash dereference 1

hash element 1

loop exit 1

So we see that just the control loop by iself takes almost as much "action" as the whole first algorithm

Paris Sinclair    |    4a75737420416e6f74686572
pariss@efn.org    |    205065726c204861636b6572
I wear my Geek Code on my finger.

[reply]
[d/l]

RE:(5) Hash of hashes question

by Russ (Deacon) on Jun 19, 2000 at 23:10 UTC

use Devel::OpProf qw(profile print_stats zero_stats ); profile(1); # turn on profiling

Wow! Devel::OpProf. My toolbox just got bigger...

So, the follow-up question: is there a better way to code max() than the control structure I used? The theory is that sort() should be less efficient than max(). There must be a more efficient way to code max(), then. max() should be in O(1), where Perl's sort is (usually) in O(N log N). How can we get the best performance for the "greatest value less than another value" algorithm?

Russ

[reply]

RE: RE:(5) Hash of hashes question

by Aighearach (Initiate) on Jun 20, 2000 at 04:00 UTC

RE:(7) Hash of hashes question

by Russ (Deacon) on Jun 20, 2000 at 05:28 UTC

Some notes below your chosen depth have not been shown here