Hi,

Here you go

perlunitut: Unicode in Perl#I/O flow (the actual 5 minute tutorial)

Now ask your program (or mine) , who is doing the byte encoding (what statement, what function/method )?

#!/usr/bin/perl -- use strict; use warnings; use Data::Dump qw/ dd /; use Path::Tiny qw/ path /; use JSON::XS(); use JSON::PP(); my @humps = "\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}"; dd( JSON::XS->new->pretty(1)->encode( \@humps ) ); dd( JSON::PP->new->pretty(1)->encode( \@humps ) ); dd( JSON::XS->new->utf8(1)->pretty(1)->encode( \@humps ) ); dd( JSON::PP->new->utf8(1)->pretty(1)->encode( \@humps ) ); dd( JSON::XS->new->ascii(1)->pretty(1)->encode( \@humps ) ); dd( JSON::PP->new->ascii(1)->pretty(1)->encode( \@humps ) ); print "#" x 6, "\n"; path( 'deleteme.txt')->spew_raw( JSON::XS->new->pretty(1)->encode( \@h +umps ) ); dd( path( 'deleteme.txt')->slurp_raw ); path( 'deleteme.txt')->spew_raw( JSON::PP->new->pretty(1)->encode( \@h +umps ) ); dd( path( 'deleteme.txt')->slurp_raw ); print "#" x 6, "\n"; path( 'deleteme.txt')->spew_utf8( JSON::XS->new->pretty(1)->encode( \@ +humps ) ); dd( path( 'deleteme.txt')->slurp_raw ); dd( path( 'deleteme.txt')->slurp_utf8 ); path( 'deleteme.txt')->spew_utf8( JSON::PP->new->pretty(1)->encode( \@ +humps ) ); dd( path( 'deleteme.txt')->slurp_raw ); dd( path( 'deleteme.txt')->slurp_utf8 ); print "#" x 6, "\n"; path( 'deleteme.txt')->spew_utf8( JSON::XS->new->utf8(1)->pretty(1)->e +ncode( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); path( 'deleteme.txt')->spew_utf8( JSON::PP->new->utf8(1)->pretty(1)->e +ncode( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); print "#" x 6, "\n"; path( 'deleteme.txt')->spew_utf8( JSON::XS->new->ascii(1)->pretty(1)-> +encode( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); path( 'deleteme.txt')->spew_utf8( JSON::PP->new->ascii(1)->pretty(1)-> +encode( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); print "#" x 6, "\n"; path( 'deleteme.txt')->spew_raw( JSON::XS->new->utf8(1)->pretty(1)->en +code( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); path( 'deleteme.txt')->spew_raw( JSON::PP->new->utf8(1)->pretty(1)->en +code( \@humps ) ); dd( path( 'deleteme.txt')->slurp_utf8 ); # path( 'deleteme.txt')->remove; __END__ "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" "[\n \"\\ufeff\\ud83d\\udc2a one hump two humps \\ud83d\\udc2b\"\n]\ +n" "[\n \"\\ufeff\\ud83d\\udc2a one hump two humps \\ud83d\\udc2b\"\n]\ +n" ###### Wide character in print at C:/perl/site/lib/Path/Tiny.pm line 1848. "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" Wide character in print at C:/perl/site/lib/Path/Tiny.pm line 1848. "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" ###### "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" ###### "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" "[\n \"\xEF\xBB\xBF\xF0\x9F\x90\xAA one hump two humps \xF0\x9F\x90\ +xAB\"\n]\n" ###### "[\n \"\\ufeff\\ud83d\\udc2a one hump two humps \\ud83d\\udc2b\"\n]\ +n" "[\n \"\\ufeff\\ud83d\\udc2a one hump two humps \\ud83d\\udc2b\"\n]\ +n" ###### "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" "[\n \"\x{FEFF}\x{1F42A} one hump two humps \x{1F42B}\"\n]\n" "🐪 one hump two humps 🐫"
"🐪 one hump two humps 🐫"

In reply to Re^5: JSON::XS Cyrillic unicode not saving properly by Anonymous Monk
in thread JSON::XS Cyrillic unicode not saving properly by cormanaz

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.