What your you attempting to do? The book, PRACTICAL TEXT MINING WITH PERL, is very much worth the money. As far as objects go, finding the set of "uniq" words is straightforward in Perl. What's more interesting and applicable to comparing tests require less trivial-to-implement preprocessors such as Lingua::EN::Ngram or Lingua::EN::Tagger.