in reply to Re^2: RegExp breaks in Perl 5.10
in thread RegExp breaks in Perl 5.10
I followed this thread advice (as I said above) and uploaded the module converted into UTF-8. The good news is that so far, all reports in both 5.10.0 and 5.8.8 have passed.
Bad news is that if you repeat your test but backwards, $latin =~ /utf8/, it also fails:
for my $chr (160 .. 255) { my $chr_byte = chr($chr); my $chr_utf8 = chr($chr); utf8::upgrade($chr_utf8); my $rx = qr{uci$chr_utf8|uci_}; my $s = "uci$chr_byte"; print $chr . " " . ($s =~ $rx ? "ok" : "not ok") . "\n"; }
Now that the module is UTF-8, I copied the test suite list of words into latin 1. As suggested by the test above, the new test suite fails. It fails for the same 15 words.
So, is ${^RE_TRIE_MAXBUF} = -1; the most general work-around? What implications does it have? What other options do I have?
Thank you for your kind help.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^4: RegExp breaks in Perl 5.10
by almut (Canon) on Mar 07, 2008 at 18:21 UTC | |
by jfraire (Beadle) on Mar 07, 2008 at 20:02 UTC |