in reply to Re: Trying to determine the output length of a Unicode string
in thread Trying to determine the output length of a Unicode string

When I try this script I get

m
3
3
1
1

How can I get this little special m̧̈?

  • Comment on Re^2: Trying to determine the output length of a Unicode string

Replies are listed 'Best First'.
Re^3: Trying to determine the output length of a Unicode string
by Anonymous Monk on Sep 26, 2011 at 07:35 UTC
    You need a shell capable of displaying unicode (utf8)

      I suppose my shell is capable of displaying unicode (utf8).

      When I run this

      perl -CO -E 'say "\x{263a}"'

      I get this output

        It seems unlikely to me you're using a shell terminal that has the capability to perform the sophisticated typography needed to render a lowercase m with a diaeresis above it and a cedilla below it. It's a matter of computer typography, not of Unicode code point coverage.

Re^3: Trying to determine the output length of a Unicode string
by ikegami (Patriarch) on Sep 26, 2011 at 20:05 UTC
    We need a terminal capable of handling combining marks.
    $ perl -CS -MUnicode::Normalize -E'say NFC("\xE9")' é $ perl -CS -MUnicode::Normalize -E'say NFD("\xE9")' e