in reply to Re: Re: Re: Re: XML Simple Charset Q?
in thread XML Simple Charset Q?

Right (I updated my post to clarify). His regex takes single-byte characters in the range 80-ff nd recodes them as HTML escape codes. Same number, just a different way of persisting it to the output stream.

Inspired by that, I showed that the same idea can convert from UTF8 by using the utf8 pragma and the extended \x escape codes in the regex, and meanwhile encode to Latin-1 by using pack.

  • Comment on Re: Re: Re: Re: Re: XML Simple Charset Q?