cormanaz has asked for the wisdom of the Perl Monks concerning the following question:

Good day fellow Monks. Hope you are all surviving the Apocalypse OK.

Does anyone know where I can get training tables for Lingua::Ident? The docs say "these tables can be generated using the trainlid(1) utility program." After some hunting I have found the utility program but I don't have training data to run through it. Are these available pre-generated anywhere?

Before someone says it, I know there is also a mod called Lingua::Identify that does not require external training files, but I am working on Windows and it is not available through ActiveState PPM, which usually means it won't work (despite what the testers say). If you have it working on Windows please let me know and I will try to install via CPAN.

Replies are listed 'Best First'.
Re: Training data for Lingua::Ident
by davido (Cardinal) on May 01, 2020 at 21:49 UTC

    I booted to Windows and installed Lingua::Identify quickly and easily with Strawberry Perl. You're using ActiveState, but ought to have similar results assuming you're able to install any other Perl module using the traditional Perl toolchain. Give it a try and check back if there are errors that you're not able to resolve.


    Dave

      I did, and much to my surprise it worked. On first attempt it choked on Text::Ngram but I updated that and Lingua::Identify installed after that. Thanks for the encouragement!
Re: Training data for Lingua::Ident
by Anonymous Monk on May 01, 2020 at 23:29 UTC