I am processing files from many sources and languages, including chinese.
Sometimes the files are not encoded properly (or part of them) and they result in gibberish characters.
I don't need even to detect the encoding. I just want to skip the problematic lines of the file which are gibberish.
How do i accomplish it ?
In reply to gibberish detection by david2008
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |