in reply to Trying to set up a concordance using linguistic rules

Look at what Lingua::EN::Inflect does. Its results aren't what you're looking for, but its data structures might be. At the very least, a good start would be "Can I inflect this word or not?" Articles and pronouns can't be inflected in and of themselves. (Well, not completely true, but a good start.)

Another solution would be to get your basic "word|number|where" going, then blacklisting the various things you don't care about. You will eventually have to be build the blacklist into the parser because some pronouns ("Joe") are also nouns ("joe"), but only context can tell the difference (usually).


My criteria for good software:
  1. Does it work?
  2. Can someone else come in, make a change, and be reasonably certain no bugs were introduced?
  • Comment on Re: Trying to set up a concordance using linguistic rules

Replies are listed 'Best First'.
Re^2: Trying to set up a concordance using linguistic rules
by telemachus (Friar) on Feb 25, 2008 at 17:39 UTC
    In English, pronouns inflect __more__ than other words not less (he, his, him; who, whose, whom; etc.). Also, I don't quite see how "Joe" with a capital letter is a pronoun; names are proper nouns.