Re^3: Random hash element (low memory edition).

The advantage? I'm not sure there is one. It is a different approach than the one proposed by you in Re: Random hash element (low memory edition)., and it is not worse. That is interesting by itself.

It could even be beneficial in case of a tied hash, as a call to keys makes it loop through all the keys of the tied hash, just to get the count. So, in that case, your code will loop through them at least once, to get the keys, and a fraction of a cycle (up to a complete cycle) more than that. I assume that my approach could be faster, in that case.

But note that the Big O is still the same for both approaches: O(n), where n is the number of hash keys.

It might become more interesting if one wants a weighted probability, then in my code, you "just" have to replace the test condition by a different function of rand() and a weight that depends on the current key. But the structure of the code can remain the same. That's a huge advantage.

I have derived a usable function in a journal entry on use.perl.org.

With weighted probabilities (the weight is a positive number that is proportional to the desired odds that this item is picked; if it's zero, the item is skipped), the code can be:

my($threshold, $value);
while(my($k, $v) = each(%hash)) {
    if(my $weight = weight($k)) {  # function call
        my $rand = -log(1 - rand)/$weight;
        if(not defined $threshold or $rand < $threshold) {
            $threshold = $rand;
            $value = $v;
        }
    }
}
[download]

}

Comment on Re^3: Random hash element (low memory edition). Download Code

Replies are listed 'Best First'.
Re^4: Random hash element (low memory edition). by BrowserUk (Patriarch) on Jan 28, 2008 at 13:12 UTC
It is a different approach than the one proposed by you in Re: Random hash element (low memory edition)., But, the same as Roy Johnstone's 664125? Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error. "Science is about questioning the status quo. Questioning authority". In the absence of evidence, opinion is indistinguishable from prejudice. "Too many [] have been sedated by an oppressive environment of political correctness and risk aversion."	[reply]
Re^5: Random hash element (low memory edition). by bart (Canon) on Jan 28, 2008 at 13:40 UTC
It is. We even point to the same FAQ entry. The difference is that I adapted the code while he stuck to just handwaving.	[reply]
Re^6: Random hash element (low memory edition). by amarquis (Curate) on Jan 28, 2008 at 18:51 UTC
I appreciate both. The handwaving made me think and adapt the code, and your implementation let me check what I'd done.	[reply]