Would The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) be ok?
Oh, and of course, in perl: perldoc perlunicode.
In reply to Re^3: Malformed UTF-8 character by mirod in thread Malformed UTF-8 character by jeanluca