(If you were expecting it to be some displayable character, then either you have the wrong code point in your data, or else you're saying/pretending it's unicode when in fact it is not. BTW, I notice that 0x97 is used in the MS "CP125*" code pages for "em dash", which is "officially" supposed to transliterate into U2014, which in turn should yield a 3-byte utf8 sequence: E2 80 94.)
I tried the test script that you posted in a reply above, and it seemed to put a U0097 character -- in utf8 encoding (i.e. as the two-byte sequence C2 97) -- for both "test1" and "test2" elements, in all of its outputs (the "print_out.xml" file, the "out.xml" file, and STDOUT; of course, I had to use a hex dump to actually "see" the character in all cases, since it is not displayable). Does that run contrary to your own findings?
(I'm running 5.8.1 on darwin. 5.8.5 shouldn't be any different...)
In reply to Re: problem with XML::Writer, unicode and Perl 5.6.0 upgrade
by graff
in thread problem with XML::Writer, unicode and Perl 5.6.0 upgrade
by santellij
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |