if you write characters that violate the encoding

You can't write characters that violate the encoding. It's simply not possible. But you could encode characters using the wrong encoding and write the result to the XML document. This would produce a document that contains different characters than the intended ones. Detecting this would require heuristics, as per my original post.
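
To illustrate the wrong-encoding case, here is a minimal sketch in Python. The sample word and the variable names are made up for the example; it assumes the document declares ISO-8859-1 but the writer mistakenly encodes the text as UTF-8.

```python
# Hypothetical sample text: "café" (U+00E9 is e with acute accent).
intended = "caf\u00e9"

# Mistake: the writer encodes with UTF-8 although the XML declaration
# says ISO-8859-1. U+00E9 becomes the two bytes C3 A9.
wrong_bytes = intended.encode("utf-8")

# A parser that trusts the declaration decodes those bytes as ISO-8859-1.
# Nothing fails: each byte simply maps to some ISO-8859-1 character.
seen_by_parser = wrong_bytes.decode("iso-8859-1")

print(repr(intended))        # the 4 characters that were meant
print(repr(seen_by_parser))  # 5 characters, none of them invalid
```

No error is raised anywhere. The document is perfectly valid ISO-8859-1; it just contains characters other than the ones intended, which is why only a heuristic can flag it.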

And an XML file with encoding="ISO-8859-1" should not contain characters in another encoding.

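No. An XML file with encoding="ISO-8859-1" cannot contain characters in another encoding, because every byte in it is interpreted as ISO-8859-1. Characters outside that set can only appear as numeric character references such as &#2601;. It's simply not possible.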
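
A short Python sketch of why this holds, using only the standard codecs. The variable names are illustrative. It shows that every possible byte value decodes under ISO-8859-1, and that a character outside the set can only be written as a numeric character reference.

```python
# Every one of the 256 byte values maps to a character in ISO-8859-1,
# so decoding can never fail on "foreign" bytes.
all_bytes = bytes(range(256))
decoded = all_bytes.decode("iso-8859-1")

# A character outside ISO-8859-1 (U+0A29) cannot be encoded directly...
outside = "\u0a29"
try:
    outside.encode("iso-8859-1")
    direct_ok = True
except UnicodeEncodeError:
    direct_ok = False

# ...but it can be written as a numeric character reference, which
# consists only of ASCII characters.
as_reference = outside.encode("iso-8859-1", errors="xmlcharrefreplace")

print(len(decoded), direct_ok, as_reference)
```

This covers only the encoding layer. XML separately forbids some control characters, which is a question of character legality rather than of encoding.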


In reply to Re^3: Detect wrong characters by ikegami
in thread Detect wrong characters by LexPl
