in reply to Re: Re: s/.// increases length - bug or badly documented feature
in thread s/.// increases length - bug or badly documented feature
I'm reading from 'perlunicode' here, in my Perl V5.6.0 documentation:
Important Caveat
WARNING: The implementation of Unicode support in Perl is incomplete.
The following areas need further work.
Input and Output Disciplines
There is currently no easy way to mark data read from a file or other external source as being utf8. This will be one of the major areas of focus in the near future.
Regular Expressions
The existing regular expression compiler does not produce polymorphic opcodes. This means that the determination on whether to match Unicode characters is made when the pattern is compiled, based on whether the pattern contains Unicode characters, and not when the matching happens at run time. This needs to be changed to adaptively match Unicode if the string to be matched is Unicode.
use utf8 still needed to enable a few features
The utf8 pragma implements the tables used for Unicode support. These tables are automatically loaded on demand, so the utf8 pragma need not normally be used.
However, as a compatibility measure, this pragma must be explicitly used to enable recognition of UTF-8 encoded literals and identifiers in the source text.
|
---|
Replies are listed 'Best First'. | |
---|---|
Re: Re: Re: Re: s/.// increases length - bug or badly documented feature
by Juerd (Abbot) on Mar 01, 2002 at 19:32 UTC |