in reply to Re^3: print UTF-8 problem
in thread print UTF-8 problem

At the Windows console, I type "chcp 65001" (and then it says: "Code page: 65001") and I also change the console font to "Lucida Console". But that doesn't help.

Replies are listed 'Best First'.
Re^5: print UTF-8 problem
by ikegami (Patriarch) on Feb 15, 2012 at 20:30 UTC
    cp65001 is UTF-8, so that's good. What do you expect instead of <הער׳11>? It would also be good to have the input and output in hex form.

      It seems that it's not a Perl problem. I have a wide-character text file that looks fine in Notepad ("UTF-8 encoding"), in Notepad++, and when I cut-and-paste into MS Word it looks fine too. But when I open a "DOS box" (Windows console) and go: "type file.txt", it prints gibberish.

      And yes, I did all the recommendations for Unicode on Windows console: I opened the console using "cmd /u", I changed the font to Lucida, and I've entered: "chcp 65001".

        If the file is UTF-8 and the chcp is 65001, it should work.

        What do you get from

        perl -CS -E"say map chr, 0x2660, 0x100;"

        You should get the following (like I do)

        ♠Ā

        Is cygwin involved at all? (e.g. Are you using the bash shell?)

        What do you expect to get instead of <הער׳11>?