in reply to Re^14: Seeking Perl docs about how UTF8 flag propagates (Terminology)
in thread Seeking Perl docs about how UTF8 flag propagates
> I'm not sure what you mean?
A byte is commonly defined as 8 bits.
Encode says a byte is
"a character in the range 0..255".
While most other sources - like Wikipedia - say "UTF-8 is encoding characters using one to four one-byte units".
This makes readers stumble into paradoxical mental loops. ( -> a character is encoded by 1-4 characters ... WTF? )
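To make the loop concrete, here is a minimal sketch, assuming a stock perl with the core Encode module; the variable names are only illustrative. One text character goes in, and what comes out is again a Perl string, now three elements long, each element a "byte" in Encode's sense.

```perl
use strict;
use warnings;
use Encode qw(encode);

# One text character: U+20AC EURO SIGN.
my $text = "\x{20AC}";

# Encoding returns another Perl string whose elements are all in 0..255.
my $octets = encode('UTF-8', $text);

# Hex dump of each element of the encoded string.
my $hex = join ' ', map { sprintf '%02X', ord } split //, $octets;

printf "text length:   %d\n", length $text;    # 1
printf "octets length: %d\n", length $octets;  # 3
printf "octets:        %s\n", $hex;            # E2 82 AC
```

Both calls to length count "characters"; only the meaning of the elements differs, which is exactly where the wording gets confusing.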
As you pointed out, most (not all) string operators in Perl are "character" based. (Those should maybe better be called "text operators".)
What comprises a "character" in a Perl string depends on the "UTF8 flag".
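A small sketch of what I mean, assuming only the built-in utf8:: functions and the core bytes module; the variable names are made up. The same one-character string is stored once with the flag off and once with it on, and the flag tells Perl how to read the internal buffer as characters.

```perl
use strict;
use warnings;
use bytes ();   # load the module without enabling byte semantics

# U+00E9 stored with the UTF8 flag off: one internal byte.
my $narrow = "\xE9";
utf8::downgrade($narrow);

# The same string stored with the flag on: two internal bytes.
my $wide = $narrow;
utf8::upgrade($wide);

printf "flag off: %d char(s), %d internal byte(s)\n",
    length($narrow), bytes::length($narrow);   # 1 and 1
printf "flag on:  %d char(s), %d internal byte(s)\n",
    length($wide), bytes::length($wide);       # 1 and 2
printf "equal:    %s\n", $narrow eq $wide ? 'yes' : 'no';   # yes
```

On the character level both variables hold the same string; only the storage, and with it the flag, differs.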
If a terminology invites misunderstandings, one should choose a new word.
Cheers Rolf
(addicted to the 𐍀𐌴𐍂𐌻 Programming Language :)
Wikisyntax for the Monastery
Re^16: Seeking Perl docs about how UTF8 flag propagates (Terminology)
by ikegami (Patriarch) on May 21, 2023 at 15:06 UTC
by LanX (Saint) on May 22, 2023 at 11:29 UTC
by ikegami (Patriarch) on May 22, 2023 at 16:39 UTC
by LanX (Saint) on May 22, 2023 at 20:48 UTC
by ikegami (Patriarch) on May 23, 2023 at 01:34 UTC