in reply to Re: Re: Guess between UTF8 and Latin1/ISO-8859-1
in thread Guess between UTF8 and Latin1/ISO-8859-1

The whole point of this is that I do not want to reject stuff I don't have to.
But you will have to reject something; it's not possible to just guess the encoding (and be correct all the time, that is)... I'd go for being pedantic, and just contact them, saying they don't follow the standard and try to shame them into fixing it.

Joost.

  • Comment on Re: Re: Re: Guess between UTF8 and Latin1/ISO-8859-1

Replies are listed 'Best First'.
Re: Re: Re: Re: Guess between UTF8 and Latin1/ISO-8859-1
by Jenda (Abbot) on Jan 21, 2004 at 21:59 UTC

    For some applications your position "I can't have any incorrect data in the system, if it aint 100% sure I'll reject it" is definitely the right one. All I'm trying to say is that for me in this particular case to be safe means to import everything I possibly can even if it means that there might be an incorrect character now and then.

    There is not one correct way to handle errors. The right way depends on the application.

    Jenda
    Always code as if the guy who ends up maintaining your code will be a violent psychopath who knows where you live.
       -- Rick Osborne

    Edit by castaway: Closed small tag in signature