Ok I got this figued out.

The encoding files identify themselves in their header, and the name of the encoding in the header had better match the file name.

So, with a binary editor I edited the encoding name in the file from x-sjis-cp932 to shift_jis (you have to be careful not to change the file length, so you have to pad the encoding name with NUL characters, in vim that NUL can be entered with Ctrl-K N U.

That file is saved as shift_jis.enc, then my sample script runs fine. It wasn't anything to do with cygwin, hearty apologies for bad mouthing their fine dll.

A safer way to do this would be to remake the encoding files with the XML::Encoding package, but this works as a quick fix.

James


In reply to Re^2: XML::Parser Shift_JIS encoding by zeimusu
in thread XML::Parser Shift_JIS encoding by zeimusu

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.