in reply to Wishlist: help for people that don't differentiate homonyms

Is this trivial, hard, impossible?

It is hard (probably impossible) problem but it is probably well studied , and many (universities published) open source solutions exist , which are state of the art and produce good-enough results

Off the top of my head, you might start with Lingua::Wordnet or starts with Lingua::TreeTagger, identify the parts, then look-up each noun in a dictionary of homophones ( say common english homophones for dutch speakers), maybe with Text::Soundex

You'll notice WordNet description doesn't exactly mention homonyms, but that is where a non-linguist like me would start.

See also Perl and Linguistics, Text Analysis Tools to compare Slinker and Stinker?, Natural Language Software Registry, Perl and Linguistics

  • Comment on Re: Wishlist: help for people that don't differentiate homonyms