in reply to Pluggable regex engine in Perl

I'm the author of re::engine::RE2.

As for motivation, it was mostly to learn a bit about this area of Perl. However, I do see uses for RE2, since its matching is much faster than Perl's.

For example, combined with an mmapped scalar, I can match a regexp against 1 GiB of text in about 10 seconds (on a Core 2 Duo); Perl's RE doesn't even come close to that. You can see how Google's Code Search can be so fast.

There's some issues with Perl's UTF-8 handling (frankly it's insane), but once I've worked around that, re::engine::RE2 should be a nearly drop-in replacement for Perl's RE, only faster.

Re^2: Pluggable regex engine in Perl
by BrowserUk (Patriarch) on Dec 27, 2010 at 12:31 UTC
    There's some issues with Perl's UTF-8 handling (frankly it's insane),

    It could be informative to read some expansion on that position.


    Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error.
    "Science is about questioning the status quo. Questioning authority".
    In the absence of evidence, opinion is indistinguishable from prejudice.