My answer was based upon ANSI C:

In this Perl discussion, that's as relevant as Java using 32-bit wide chars. I can understanding the mistake of bringing it up initially, but why bring it up again.

And it's wrong. ANSI C says nothing about wchar_t being 16-bit. sizeof(wchar_t) can be as small as 1, and it's commonly 4. In fact, your own program betrays you. Also from gcc on an Intel:

$ gcc -o a a.c $ a hello world size of a wide char is 4 bytes

4294967295 is just a 32 bit unsigned hex number.

And how did I get that number? By getting the character number of "\x{FFFFFFFF}". Therefore, I had a 32-bit character.

But basically 16 bits does it.

Your own reference contradicts you. 17 planes of 16 bits = way more than 16 bits. (21, to be precise.)

For example, these Chinese chars require more than 16 bits.

I don't know how many bits Hindi requires.

It varies by encoding, and it can even vary withing an encoding. But it's completely irrelevant. Perl supports all Unicode characters, including the Hindi ones.


In reply to Re^4: Perl Modules for handling Non English text by ikegami
in thread Perl Modules for handling Non English text by paragkalra

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.