in reply to Re: speeding up a regex
in thread speeding up a regex

I like the resuts from cmpthese much better than those from timethese. Here is the same benchmark using cmpthese

use warnings; use strict; use Benchmark qw(cmpthese); our @list = ('create the world','blah blah', 'drop it already','foo schnerp', 'need to delete','flip schnitzel', 'send me that update!','a flibbertygibitz', 'mailing insert collection','grand central station'); our @wordlist = qw(create drop delete update insert); our @relist = map { qr/\b$_\b/ } @wordlist; our $bigre_t = '\b(?:'.join('|',@wordlist).')\b'; our $bigre = qr/$bigre_t/; print "bigre: $bigre\n"; sub several_re { my $match = 0; foreach my $s (@list) { foreach my $re (@relist) { if ($s =~ /$re/) { $match++; last; } } } $match; } sub one_re { my $match = 0; foreach my $s (@list) { if ($s =~ /$bigre/) { $match++; } } $match; } sub use_index { my $match = 0; foreach my $s (@list) { foreach my $word (@wordlist) { if (index($s,$word) >= 0) { $match++; last; } } } $match; } print "several_re: ", several_re(),"\n"; print "one_re: ", one_re(),"\n"; print "use_index: ", use_index(),"\n"; cmpthese(-1, { 'Several Regexp' => \&several_re, 'One Big Regexp' => \&one_re, 'With index()' => \&use_index, });

Prints:

bigre: (?-xism:\b(?:create|drop|delete|update|insert)\b) several_re: 5 one_re: 5 use_index: 5 Rate Several Regexp With index() One Big Regexp Several Regexp 24634/s -- -24% -50% With index() 32367/s 31% -- -34% One Big Regexp 49358/s 100% 52% --

DWIM is Perl's answer to Gödel