in reply to Re^9: Perl Modules for handling Non English text
in thread Perl Modules for handling Non English text
Your hex math looks fine. I am just saying that Hindi, like most languages, can be described with <= 16 bits per character.
And I'm "just saying" no, not in a way Perl understands them. The question was about what Perl understands.
Mind you, decode and other means can be used to decode text in any number of encodings into something Perl understands. But only once it's decoded does Perl understand the text to be Hindi characters. And once they're decoded, the Hindi characters happen to take at least three bytes each.
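To make that concrete, here is a minimal sketch, assuming the input arrives as UTF-16LE bytes (the hand-written byte string and the variable names are just for illustration). It uses UTF-8 encoded length as a stand-in for the size of the decoded string, since Perl's internal format for such strings is UTF-8-like.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Assumed input: U+0939 U+093F (DEVANAGARI LETTER HA, VOWEL SIGN I)
# as little-endian 16-bit code units, two bytes per character.
my $bytes = "\x39\x09\x3F\x09";

# Until this point Perl only sees four opaque bytes.
my $text = decode('UTF-16LE', $bytes);

# After decoding, length() counts characters, not bytes.
my $chars = length($text);

# Encoding the characters as UTF-8 shows three bytes per character.
my $utf8_bytes = length(encode('UTF-8', $text));

printf "%d characters, %d bytes as UTF-8\n", $chars, $utf8_bytes;
```

The same two characters fit in four bytes on disk as UTF-16LE, but that says nothing about what Perl holds once the text has been decoded.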
There's nothing special about 16 bits, so I don't know why you keep bringing it up. You hold high the ability of the characters to be represented by UCS-2LE (passing it off as the only encoding), but that has nothing to do with the OP's question or Perl's abilities.
Replies are listed 'Best First'.
Re^11: Perl Modules for handling Non English text
by Marshall (Canon) on Mar 31, 2009 at 07:14 UTC
by ikegami (Patriarch) on Mar 31, 2009 at 07:17 UTC
by Marshall (Canon) on Apr 02, 2009 at 07:52 UTC