PerlMonks
Re^22: Seeking Perl docs about how UTF8 flag propagates (Terminology)
by ikegami (Patriarch) on May 23, 2023 at 14:52 UTC ( [id://11152392]=note )
Oh, you have a problem with the fact that you can store a byte in a character. A character can be:
In Perl, it has the second definition. There are no other words for this. You apparently associate character with one of the last three. I don't know which. For example, take a look at Å [U+212B], Å [U+C5] and Å [U+41,U+30A].
So when you say character, do you think that all three of those things are the same? Only two? None of them? I have no idea. Unicode suggests most people would consider that list to have two characters: the Angstrom Sign, and Latin Capital Letter A with Ring Above. But most people isn't everyone. And that's why you should use a more precise term than character if you mean grapheme, glyph or code point. Standards exist for a reason.
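To make the distinction concrete, here is a minimal sketch that inspects the three spellings of Å mentioned above. It assumes a Perl with the core modules charnames and Unicode::Normalize available; `describe` is a hypothetical helper written for this illustration, not something from the thread or from any module.

```perl
use strict;
use warnings;
use charnames ();                    # provides charnames::viacode
use Unicode::Normalize qw(NFC);

# describe() is a hypothetical helper for this illustration only.
sub describe {
    my ($str) = @_;
    my @code_points = map { ord($_) } split //, $str;
    my $graphemes   = () = $str =~ /\X/g;    # count extended grapheme clusters
    return {
        elements  => length $str,            # string elements, i.e. Perl's "characters"
        graphemes => $graphemes,
        names     => [ map { charnames::viacode($_) } @code_points ],
        nfc       => join(' ', map { sprintf 'U+%04X', ord($_) } split //, NFC($str)),
    };
}

# The three spellings from the post, written as code point escapes.
my @forms = (
    [ 'U+212B'     => "\x{212B}"      ],
    [ 'U+C5'       => "\x{C5}"        ],
    [ 'U+41,U+30A' => "\x{41}\x{30A}" ],
);

for my $form (@forms) {
    my ($label, $str) = @$form;
    my $d = describe($str);
    printf "%-11s elements=%d graphemes=%d NFC=%s (%s)\n",
        $label, $d->{elements}, $d->{graphemes}, $d->{nfc},
        join(' + ', @{ $d->{names} });
}
```

Each of the three strings is a single grapheme cluster, and all three normalize to the same NFC code point, yet `length` reports 1, 1 and 2. Which of those numbers counts "characters" depends entirely on which definition you have in mind.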