more useful options | |
PerlMonks |
Re^4: XML Parser not well-formedby ktingle (Sexton) |
on Nov 02, 2004 at 20:01 UTC ( [id://404735]=note: print w/replies, xml ) | Need Help?? |
That character is 0x92, UTF-8 only maps up to 0x7F as a single byte. If the document is representing that character with just one byte then its not UTF-8 and a broken XML instance. That character is represented with 2 bytes in UTF-8. Whenever I get confused about UTF-8 I use this reference; http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8
In Section
Seekers of Perl Wisdom
|
|