comment on

Update: This post was made to showcase another way to benchmark when involving parallel workers. Upon further review, the OP's post wants to increment $bc_pair_num which isn't done here. I will come back later and update the code. Also, sorting is not required when Perl has two fast ordered-hash implementations. I will try Hash::Ordered and MCE::Shared::Ordhash (constructed as non-shared). These modules are fast.

Update: The OP does not mention sorting. Therefore please disregard the mentioning of the two ordered-hash implementations. Like haukex said, sorting is helpful during debugging sessions. This is true.

Update: Posted a non-benchmark version, which increments the $bc_pair_num field.

Greetings,

Sometimes workers may not stop immediately like you want them to with various benchmark modules. The following is another way, based on karlgoethebier's example. For an array (mce_loop), the manager process chunks and sends the next chunk via IPC. For sequence (mce_loop_s) and with the bounds_only option, workers compute the next offset begin and end boundaries. Thus, runs with lesser overhead.

If you must set the chunk_size option, do not go over 8000 when processing an array. Perl performance degrades if you go higher. Better yet, simply comment out the chunk_size option or not set it. There's no easy formula for setting chunk_size. However, the default chunk_size => 'auto' for MCE Models do a good job for most cases.

For arrays, check to see if Perl has Sereal::Decoder and Sereal::Encoder 3.015 or later installed. MCE will use Sereal 3.015+ for serialization if available. Otherwise, it defaults to Storable. The results are mind-boggling. The reason is that for arrays, MCE involves the manager process which chunks the array elements and sends via IPC. Even with that overhead, MCE runs faster. That requires the our vs my keyword on %barcode_hash and $barcode_pair_35. Thank you, karlgoethebier for this enlightenment.

Results: run 50 times for 100000 keys: $max set to 1e5.

$ perl demo.pl haukex 50
duration (haukex):  1.236 seconds
found match: yes

$ perl demo.pl karlary 50
duration (karlary): 0.825 seconds
found match: yes

$ perl demo.pl karlseq 50
duration (karlseq): 0.313 seconds
found match: yes
[download]

Results: run 50 times for 1 million keys: $max set to 1e6.

$ perl demo.pl haukex 50
duration (haukex): 17.388 seconds
found match: yes

$ perl demo.pl karlary 50
duration (karlary): 7.633 seconds
found match: yes

$ perl demo.pl karlseq 50
duration (karlseq): 2.858 seconds
found match: yes
[download]

Demo script.

#!/usr/bin/env perl

use strict;
use warnings;
use feature qw( say );

use MCE::Loop;
use Time::HiRes qw( time );

sub usage {
    warn "usage: $0 ( haukex | karlary | karlseq ) [ count ]\n\n";
    exit 1;
}

my $func  = shift || usage();
my $count = shift || 50;

usage() unless main->can($func);

my $cpus = MCE::Util->get_ncpu() || 4;
my $max  = 100000;

MCE::Loop::init {
    max_workers => $cpus,
    chunk_size  => 8000,  # <-- do not go over 8000
    bounds_only => 1      # <-- applies to sequence
};

my $data =
  [ 'AGCTCGTTGTTCGATCCA', 'GAGAGATAGATGATAGTG', 'TTTT_CCCC', 0 ];

our %barcode_hash = map { $_ => $data } 1 .. $max - 2;

$barcode_hash{ ($max - 1) } =
  [ 'AGCTCGTTGTTCGATCCA', 'GAGAGATAGATGATAGTG', 'TTTT_AAAA', 0 ];

$barcode_hash{ ($max) } =
  [ 'AGCTCGTTGTTCGATCCA', 'GAGAGATAGATGATAGTG', 'TTTT_AAAA', 0 ];

our $barcode_pair_35 = 'TTTT_AAAA';

{
    no strict 'refs';
    my $start = time; my $ret;

    $ret = $func->() for 1 .. $count;

    printf "duration ($func): %0.03f seconds\n", time - $start;
    printf "found match: %s\n", $ret ? 'yes' : 'no';
}

exit 0;

sub haukex {
    # serial code
    my $ret = 0;
    for ( 1 .. $max ) {
        $ret = 1, last if $barcode_hash{$_}[2] eq $barcode_pair_35;
    }
    return $ret;
}

sub karlary {
    # workers receive next array chunk
    my @ret = mce_loop {
        my ( $mce, $chunk_ref, $chunk_id ) = @_;
        for ( @$chunk_ref ) {
            MCE->gather(1), MCE->abort(), last if (
               $barcode_hash{$_}[2] eq $barcode_pair_35
            );
        }
    }
    1 .. $max;  # <-- for array 1 .. $max
    return @ret ? 1 : 0;
}

sub karlseq {
    # workers receive next sequence 'begin' and 'end' boundaries
    my @ret = mce_loop_s {
        my ( $mce, $chunk_ref, $chunk_id ) = @_;
        for ( $chunk_ref->[0] .. $chunk_ref->[1] ) {
            MCE->gather(1), MCE->abort(), last if (
               $barcode_hash{$_}[2] eq $barcode_pair_35
            );
        }
    }
    1, $max;    # <-- for sequence 1, $max
    return @ret ? 1 : 0;
}
[download]

For the MCE bits, I used MCE->gather and MCE->abort. The abort method is helpful which stops all workers from processing more chunks. Thus, ending the job early.

Update: Results from a Windows 7 VM configured with 4 cores and Strawberry Perl. I think Perl makes extra copies. Thus, involves extra time during spawning.

Results: run 50 times for 100000 keys: $max set to 1e5.

$ perl demo.pl haukex 50
duration (haukex):  1.232 seconds
found match: yes

$ perl demo.pl karlary 50
duration (karlary): 1.482 seconds
found match: yes

$ perl demo.pl karlseq 50
duration (karlseq): 0.858 seconds
found match: yes
[download]

Results: run 50 times for 1 million keys: $max set to 1e6.

$ perl demo.pl haukex 50
duration (haukex):  20.108 seconds
found match: yes

$ perl demo.pl karlary 50
duration (karlary): 16.770 seconds
found match: yes

$ perl demo.pl karlseq 50
duration (karlseq): 11.419 seconds
found match: yes
[download]

Update: Also tested Perl from the Cygwin environment. Here, it seems workers are spawned instantly after the initial creation. This is see via the task manager.

Results: run 50 times for 100000 keys: $max set to 1e5.

$ perl demo.pl haukex 50
duration (haukex):  1.607 seconds
found match: yes

$ perl demo.pl karlary 50
duration (karlary): 1.529 seconds
found match: yes

$ perl demo.pl karlseq 50
duration (karlseq): 0.749 seconds
found match: yes
[download]

Results: run 50 times for 1 million keys: $max set to 1e6.

$ perl demo.pl haukex 50
duration (haukex):  25.194 seconds
found match: yes

$ perl demo.pl karlary 50
duration (karlary): 14.446 seconds
found match: yes

$ perl demo.pl karlseq 50
duration (karlseq):  7.051 seconds
found match: yes
[download]

Regards, Mario.

In reply to Re^2: search for particular elements of hash with multiple values by marioroy
in thread search for particular elements of hash with multiple values by pmpmmpmp

Are you posting in the right place? Check out Where do I post X? to know for sure.
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
Want more info? How to link or How to display code and escape characters are good places to start.


Your skill will accomplish what the force of many cannot
	PerlMonks