in reply to Re^12: Seeking Perl docs about how UTF8 flag propagates (Terminology)
in thread Seeking Perl docs about how UTF8 flag propagates
Well your stars are more golden than mine.
Seriously, it says 32bits or more° and I don't care that much as long as Perl and Unicode aren't expanded to cover all scripts of the galaxy.
And I cringe about calling a byte a character. A string - as a sequence of bytes˛ - can hold any kind of packed data which fits into memory. Like eg JPG. Perl has also plenty of string operators which don't assume text.
BUT ... "from all docs I skimmed thru yet" ... it's the best in having an axiomatic build up with clarifying the terminology first.
And as I said "taking the style as starting point."
Cheers Rolf
(addicted to the 𐍀𐌴𐍂𐌻 Programming Language :)
Wikisyntax for the Monastery
°) "...range 0 .. 2**32-1 (or more)"
˛) the idea seems to be to define a "logical" character as the sub-units of strings as returned by split // or 'length'. That's unfortunate wording IMHO.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^14: Seeking Perl docs about how UTF8 flag propagates (Terminology)
by ikegami (Patriarch) on May 19, 2023 at 15:20 UTC | |
by LanX (Saint) on May 20, 2023 at 11:05 UTC | |
by ikegami (Patriarch) on May 21, 2023 at 15:06 UTC | |
by LanX (Saint) on May 22, 2023 at 11:29 UTC | |
by ikegami (Patriarch) on May 22, 2023 at 16:39 UTC | |
|