http://qs1969.pair.com?node_id=85379


in reply to Re: Re: What are you expecting XML to be in?
in thread Converting character encodings

7-bit ISO-8859-1 (also called "ASCII" {grin}) is a proper subset of UTF-8, but 8-bit ISO-8859-1 is not. So yes, you'd need to declare the file as ISO-8859-1 if you wanted to use any "second half" characters (bytes 0x80 through 0xFF), but otherwise you can let it default to UTF-8.
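
To make that concrete, here is a rough sketch in Python (not Perl, and not anything from the original thread) using the standard library's xml.etree.ElementTree. The element name `w` and the sample word are made up for illustration, and the behaviour shown is that of the expat-based parser Python ships with; other parsers may report the error differently.

```python
import xml.etree.ElementTree as ET

# Pure 7-bit text produces identical bytes in both encodings.
assert "plain".encode("iso-8859-1") == "plain".encode("utf-8")

# A "second half" character does not: one byte in ISO-8859-1, two in UTF-8.
latin1_bytes = "caf\u00e9".encode("iso-8859-1")  # ends in the single byte 0xE9
utf8_bytes = "caf\u00e9".encode("utf-8")         # ends in the two bytes 0xC3 0xA9

# Same ISO-8859-1 payload, with and without an encoding declaration.
declared = (b'<?xml version="1.0" encoding="ISO-8859-1"?><w>'
            + latin1_bytes + b"</w>")
undeclared = b"<w>" + latin1_bytes + b"</w>"

# With the declaration, the parser decodes 0xE9 as e-acute.
declared_text = ET.fromstring(declared).text

# Without it, the parser assumes UTF-8 and rejects the stray 0xE9 byte.
try:
    ET.fromstring(undeclared)
    undeclared_ok = True
except ET.ParseError:
    undeclared_ok = False

# Real UTF-8 needs no declaration at all.
utf8_text = ET.fromstring(b"<w>" + utf8_bytes + b"</w>").text
```

The point of the sketch is only that the declaration matters exactly when a byte above 0x7F shows up; a file that stays within 7 bits parses the same either way.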

-- Randal L. Schwartz, Perl hacker

Re: Re: Re: Re: What are you expecting XML to be in?
by John M. Dlugosz (Monsignor) on Jun 04, 2001 at 07:23 UTC
    Oh yeah, that's right. 8859-1 is a subset of the Unicode character set (its 256 values are the first 256 Unicode code points), but its byte encoding is not a subset of UTF-8. I mixed them up.
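
A small Python sketch of that distinction, offered as an illustration rather than anything posted in the thread; the helper name `is_valid_utf8` is made up here.

```python
# The character e-acute: code point U+00E9.
ch = "\u00e9"

# Character-set level: every ISO-8859-1 byte value equals its Unicode code point.
same_code_points = all(
    ord(bytes([b]).decode("iso-8859-1")) == b for b in range(256)
)

# Encoding level: the same character becomes different byte sequences.
latin1 = ch.encode("iso-8859-1")  # one byte, 0xE9
utf8 = ch.encode("utf-8")         # two bytes, 0xC3 0xA9


def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes cleanly as UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False
```

So `is_valid_utf8(latin1)` comes back false while `is_valid_utf8(b"abc")` comes back true: the repertoire nests inside Unicode, but the bytes above 0x7F do not carry over.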

    So one can find existing XML files in various encodings? Does that mean a parser needs a conformance statement saying which encodings it can understand?