in reply to How to Identify a language
If you just want a program to classify text you might also be interested in: TextCat.
It's a Perl script that uses "N-Gram-Based Text Categorization" and has worked for me in the past. Though I did not need to classify Asian languages, it's supposed to support CJK.
A list of languages and an article discussing the approach can be found on the page as well.
|
|---|