in reply to Creating Dictionaries
Depending on your needs, you might wish to take a look at the following writeups:
NLP - natural language regex-collections?
Perl and Linguistics
Perl NLP
What are the monks doing with Perl and Linguistics?
Take a tour of CPAN with its many gems such as Ted Pedersen's Ngram Statistics Package... Or Google it... This is a mature area of study and there are many existing quality tools available.
HTH,
planetscape
In Section
Seekers of Perl Wisdom