I see. I'm new to these encode stuff, but now I understand... check, guess try to encode, if not discard.
At the moment I want just to discard, later when I have time will do more tests
But my next question was... if I check for valid utf8 string and discard. Will this discard the string if it is ascii ?
No. U+0000 to U+007F (the first 128 Unicode characters) are represented in UTF8 by one byte - the same byte that is used in ASCII. So ASCII (7 bit ASCII, not eg ISO-8859-* or WINDOWS-1252) is a subset of UTF8.