How to to get chinese charactor from unicode?

snaillu has asked for the wisdom of the Perl Monks concerning the following question:

I coded a short program, named unicode.pl. Below is the content.

use Encode;

$vall = '\u805a\u5408\u6216\u8be6\u7ec6'; 
$vall =~ s/u(\w\w\w\w)/x{$1}/g;
print $vall,"\n";

$str=Encode::encode("utf8", $vall);
print $str;
[download]

The result is:

\x{805a}\x{5408}\x{6216}\x{8be6}\x{7ec6}
\x{805a}\x{5408}\x{6216}\x{8be6}\x{7ec6}
[download]

In fact, I want to get the result like below:

\x{805a}\x{5408}\x{6216}\x{8be6}\x{7ec6}
聚合或详细
[download]

To the sum, I want to get chinese charactor from unicode.

Edit: g0n - code tags

Comment on How to to get chinese charactor from unicode? Select or Download Code

Replies are listed 'Best First'.
Re: How to to get chinese charactor from unicode? by ikegami (Patriarch) on Mar 02, 2007 at 06:16 UTC
You're trying to convert a Perl string literal to a string. To do that, use `perl`: `$s = '\u805a\u5408\u6216\u8be6\u7ec6'; $s =~ s/u(\w\w\w\w)/x{$1}/g; $s = eval qq{"$s"}; print($s);` [download] Alternatively, you could interpret the original string yourself `$s = '\u805a\u5408\u6216\u8be6\u7ec6'; $s =~ s/\\u([0-9A-Fa-f]{4})/chr(hex($1))/ge; print($s);` [download]	[reply] [d/l] [select]
Re^2: How to to get chinese charactor from unicode? by ikegami (Patriarch) on Mar 02, 2007 at 06:57 UTC
I wasn't too happy with the second snippet. It had no way of escaping `\`. This is a fix: `my $out = ''; for ($in) { /\G ([^\\]+) /xgc && ( $out .= $1 ); /\G \\u([0-9A-Fa-f]{4})/xgc && do { $out .= chr(hex($1)); redo; }; /\G \\(.) /xgc && do { $out .= $1; redo; }; }` [download]	[reply] [d/l] [select]
Re: How to to get chinese charactor from unicode? by zentara (Cardinal) on Mar 02, 2007 at 13:43 UTC
This might interest you Tk-reading-chinese-out-of-data. It's Tk, but maybe your problem is your shell can't display chinese characters? My system is screwed up that way. I'm not really a human, but I play one on earth. Cogito ergo sum a bum	[reply]