in reply to separating text words in similarity classes using levenshtein
Taking a quick look at your data, I might suggest using regexes and Regexp::Assemble. Perl Hacks demonstrates how to use Regexp::Assemble to build a dispatch table that you may find helpful.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: separating text words in similarity classes using levenshtein
by spx2 (Deacon) on Dec 10, 2007 at 20:18 UTC | |
|
Re^2: separating text words in similarity classes using levenshtein
by spx2 (Deacon) on Dec 10, 2007 at 20:27 UTC | |
by eric256 (Parson) on Dec 10, 2007 at 20:39 UTC | |
by spx2 (Deacon) on Dec 10, 2007 at 21:51 UTC | |
by eric256 (Parson) on Dec 10, 2007 at 21:57 UTC |