in reply to Generating UTF-8 from nasty high ASCII input

This subject is discussed in the Perl XML FAQ. Given that you don't know what encoding(s?) the original data used, you might find the 'sanitise' function in the FAQ useful.

I regularly hit this problem when people paste stuff from MSWord since the 'smart quote' characters are not in the ISO-8859-1 set.

  • Comment on Re: Generating UTF-8 from nasty high ASCII input

Replies are listed 'Best First'.
Re: Re: Generating UTF-8 from nasty high ASCII input
by samtregar (Abbot) on Jul 10, 2002 at 16:37 UTC
    Thanks, that looks like it may be the solution. I'll try it this afternoon.

    -sam