http://qs1969.pair.com?node_id=517245


in reply to Creating Dictionaries

Depending on your needs, you might wish to take a look at the following writeups:


NLP - natural language regex-collections?
Perl and Linguistics
Perl NLP
What are the monks doing with Perl and Linguistics?

Take a tour of CPAN with its many gems such as Ted Pedersen's Ngram Statistics Package... Or Google it... This is a mature area of study and there are many existing quality tools available.

HTH,

planetscape