in reply to Re: UTF-8 Malformed Char Error -- how to find and remove bad chars
in thread UTF-8 Malformed Char Error -- how to find and remove bad chars
UTF-8 is pretty easy to understand. It is a way to encode Unicode characters that can be processed by tools that handle normal C strings. All ASCII characters have the same encoding. Larger characters are encoded in two or more bytes. If you get a malformed char error, it could mean your string was corrupted. More likely is that it isn't a UTF-8 string, but some 8-bit encoding like Latin-1. The proper solution is to translate the other encoding into UTF-8 and let Perl handle it.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^3: UTF-8 Malformed Char Error -- how to find and remove bad chars
by water (Deacon) on Jun 23, 2004 at 16:56 UTC | |
by iburrell (Chaplain) on Jun 23, 2004 at 18:18 UTC |