in reply to utf8::upgrade weirdness
\x{c3a9} is a single character, the code point U+C3A9, not the two bytes of a UTF-8-encoded é; I think you meant \xc3\xa9. But even that won't match, because perl still treats the string as a sequence of characters, the third of which is the Unicode code point U+00E9. If you want to create a string where each character is one byte of the UTF-8 encoding, you want to be using Encode, not the utf8:: functions:
    $string = encode("utf8", $string);

This should do exactly the same thing whether you've done utf8::upgrade($string) or not.
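
To make that concrete, here is a minimal sketch. The sample string "caf\x{e9}" and the variable names are my own stand-ins, not the data from the original question. It encodes one copy that has been through utf8::upgrade and one that has not, then compares the results:

```perl
use strict;
use warnings;
use Encode qw(encode);

# Hypothetical sample: "cafe" with an acute accent on the last letter,
# written with an escape so the source file stays plain ASCII.
my $string = "caf\x{e9}";

# A second copy whose internal storage has been upgraded.
# utf8::upgrade changes the representation, not the characters.
my $upgraded = $string;
utf8::upgrade($upgraded);

# Encode both copies; each character of the result is one byte of UTF-8.
my $bytes_plain    = encode("utf8", $string);
my $bytes_upgraded = encode("utf8", $upgraded);

printf "%d chars in, %d bytes out\n", length($string), length($bytes_plain);
print "matches \\xc3\\xa9\n"       if $bytes_plain =~ /\xc3\xa9/;
print "same result either way\n" if $bytes_plain eq $bytes_upgraded;
```

With this input, the four-character string becomes five bytes, the \xc3\xa9 pattern matches against the encoded result, and the two encoded copies compare equal.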
Re^2: utf8::upgrade weirdness
by graff (Chancellor) on Aug 09, 2006 at 03:24 UTC
by ysth (Canon) on Aug 09, 2006 at 17:32 UTC