(tye)Re: japhy regex analysis: case study (RE question...)

Well, since you resorted to benchmarks (updated)...

            Rate    bt_c    mult      bt     opt     mod    chop
bt_c    180870/s      --     -1%     -5%    -28%    -62%    -74%
mult    181987/s      1%      --     -4%    -27%    -62%    -74%
bt      189426/s      5%      4%      --    -24%    -60%    -73%
opt     249612/s     38%     37%     32%      --    -48%    -64%
mod     476214/s    163%    162%    151%     91%      --    -31%
chop    692944/s    283%    281%    266%    178%     46%      --
[download]

So chop is over 45% faster than mod even though it had to make an extra copy of the string!

use Benchmark 'cmpthese';

$x = int (1_000_000 * rand 1_000_000);

cmpthese( -3, {
  mult => sub { $x =~ /(\d)*(\d)/  },
  bt_c => sub { $x =~ /(\d*)(\d)/  },
  bt   => sub { $x =~ /\d*(\d)/    },
  opt  => sub { $x =~ /(\d)$/      },
  mod  => sub { $x % 10            },
  chop => sub { my $x= $x; chop $x },
});
[download]

Following are the original bogus results. Thanks to dkubb for mentioning my over local. I realized I'd made a mistake and came back but not quick enough. So it looks like local is quite a bit slower than my (which makes sense), so I'd be interested in how japhy's machine compares the new code.

        Rate    mult   bt_c     bt    opt    mod     chop
mult 180759/s     --    -1%    -7%   -31%   -62%     -71%
bt_c 182581/s     1%     --    -6%   -31%   -62%     -70%
bt   193680/s     7%     6%     --   -26%   -60%     -69%
opt  263234/s    46%    44%    36%     --   -45%     -57%
mod  481067/s   166%   163%   148%    83%     --     -22%
chop 618559/s   242%   239%   219%   135%    29%       --
[download]

So chop is almost 30% faster than mod even though it had to make an extra copy of the string!

use Benchmark 'cmpthese';

$x = int (1_000_000 * rand 1_000_000);

cmpthese( -3, {
  mult => sub { $x =~ /(\d)*(\d)/ },
  bt_c => sub { $x =~ /(\d*)(\d)/ },
  bt   => sub { $x =~ /\d*(\d)/   },
  opt  => sub { $x =~ /(\d)$/     },
  mod  => sub { $x % 10           },
  chop => sub { local $x; chop $x },
});
[download]

- tye (but my friends call me "Tye")

Comment on (tye)Re: japhy regex analysis: case study (RE question...) Select or Download Code

Replies are listed 'Best First'.
Re: (tye)Re: japhy regex analysis: case study (RE question...) by japhy (Canon) on May 28, 2001 at 05:22 UTC
Even on a different machine, I get these results (the machine has the 5.005 version of `Benchmark.pm`): `timethese( -3, { mod => sub { $x % 10 }, chop => sub { chop(my $x = $x) }, substr => sub { substr($x, -1) }, }); __END__ (they ran for at least 3 seconds) chop: 8707.14/s (n= 29256) substr: 43620.45/s (n=136532) mod: 48906.87/s (n=163838)` [download] So on my machine, mod is much faster than `chop`; the unexplored `substr` approach is nearly as fast. `japhy` -- Perl and Regex Hacker	[reply] [d/l]
(tye)Re2: japhy regex analysis: case study (RE question...) by tye (Sage) on May 28, 2001 at 07:53 UTC
The main difference probably isn't your computer vs. my computer. You left out the regex solutions which (the first time that the first one is called) "modify" the global $x by providing it with a string value. Since the chop solution makes a copy in order to avoid changing the value of the global $x, it also doesn't give the global $x a string value. So if the chop solution is run first, it has to stringify $x for every single call (as in your two runs but not in mine). Your substr solution also gives the global $x a string value, but it is getting run after the chop solution so it doesn't help (in my run, all of the regex solutions were run before the chop solution). So for another way to compare the regular expression versions to the mod version, you could force a stringification per call. I didn't come up with an eligant way to do this (and I came up with some pretty interesting but non-intuitive and mutually contradictory benchmark numbers so I'll just leave this to someone else). - tye (but my friends call me "Tye")	[reply]
Re: Re: (tye)Re: japhy regex analysis: case study (RE question...) by snafu (Chaplain) on May 30, 2001 at 01:56 UTC
Geez! I could kick myself for not thinking of substr! >:\ ---------- - Jim	[reply]
Re: (tye)Re: japhy regex analysis: case study (RE question...) by japhy (Canon) on May 28, 2001 at 03:39 UTC
Except that you never actually stored anything in `local $x`. `use Benchmark 'cmpthese'; $x = int (1_000_000 * rand 1_000_000); cmpthese( -3, { mod => sub { $x % 10 }, chop => sub { local $x; chop $x }, chop2 => sub { chop(local $x = $x) }, }); __END__ Rate chop2 chop mod chop2 25430/s -- -88% -93% chop 204396/s 704% -- -41% mod 348051/s 1269% 70% --` [download] On my machine, `chop()` was slower. But the real `chop()` approach was slower still. `japhy` -- Perl and Regex Hacker	[reply] [d/l]