wyt248er has asked for the wisdom of the Perl Monks concerning the following question:
I would like to convert between Unicode code points and UTF-8 character codes.
For example, the Unicode code point for the GREEK SMALL LETTER PI is U+03C0, and its UTF-8 character code is 0xCF80. So, if the string "U+03C0" (or "0x03C0") is entered, then I want the string "0xCF80" to be printed (without quotes). If the string "0xCF80" is entered, then I want the string "U+03C0" (or "0x03C0") to be printed (without quotes). Note that the desired output is NOT a character itself but a string showing the character code.
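To make the desired round trip concrete, here is a minimal sketch using the core Encode module. The helper names `codepoint_to_utf8_hex` and `utf8_hex_to_codepoint` are hypothetical, chosen only for this illustration, and the sketch assumes the input is exactly one code point ("U+..." or "0x...") or the UTF-8 bytes of exactly one character written as "0x" plus hex digits.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

# Hypothetical helper: "U+03C0" or "0x03C0" -> "0xCF80"
sub codepoint_to_utf8_hex {
    my ($input) = @_;
    my ($hex) = $input =~ /\A(?:U\+|0x)([0-9A-Fa-f]+)\z/
        or die "Unrecognized code point: $input\n";
    # Turn the code point into a character, then into its UTF-8 bytes.
    my $bytes = encode('UTF-8', chr(hex($hex)));
    # Show the bytes as an uppercase hex string.
    return '0x' . uc(unpack('H*', $bytes));
}

# Hypothetical helper: "0xCF80" -> "U+03C0"
sub utf8_hex_to_codepoint {
    my ($input) = @_;
    my ($hex) = $input =~ /\A0x((?:[0-9A-Fa-f]{2})+)\z/
        or die "Unrecognized UTF-8 byte string: $input\n";
    # Rebuild the raw bytes, then decode them strictly as UTF-8.
    my $bytes = pack('H*', $hex);
    my $char  = decode('UTF-8', $bytes, Encode::FB_CROAK);
    die "Expected exactly one character: $input\n"
        unless length($char) == 1;
    return sprintf('U+%04X', ord($char));
}

print codepoint_to_utf8_hex('U+03C0'), "\n";   # prints 0xCF80
print utf8_hex_to_codepoint('0xCF80'), "\n";   # prints U+03C0
```

Note that in this sketch an input such as "0x03C0" is ambiguous between the two directions, so the caller has to choose which helper to apply.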
By the way, if your terminal is configured to display Unicode wide characters, then the following commands will show you the GREEK SMALL LETTER PI.
perl -l12e 'print(chr(0x03c0))' -C
perl -l12e 'print(pack("U0W*", 0xCF, 0x80))' -C
Thank you in advance.
Replies are listed 'Best First'.
Re: How to convert between Unicode codepoint and UTF8 character code on Perl?
by hippo (Bishop) on Oct 24, 2021 at 11:13 UTC
by wyt248er (Initiate) on Oct 25, 2021 at 04:34 UTC
by hippo (Bishop) on Oct 25, 2021 at 10:19 UTC
by Anonymous Monk on Oct 25, 2021 at 07:38 UTC
Re: How to convert between Unicode codepoint and UTF8 character code on Perl?
by ikegami (Patriarch) on Oct 25, 2021 at 15:40 UTC
Re: How to convert between Unicode codepoint and UTF8 character code on Perl?
by Anonymous Monk on Oct 24, 2021 at 06:16 UTC