![]() |
|
Welcome to the Monastery | |
PerlMonks |
Re^2: Search for similar strings - to standardiseby educated_foo (Vicar) |
on Oct 31, 2009 at 03:33 UTC ( #804260=note: print w/replies, xml ) | Need Help?? |
Edit distance would be useful for comparison, but not so much for clustering strings with their common misspellings. For that, you might try n-grams (mentioned above) or locality-sensitive hashing (basically the same thing, but with gaps).
In Section
Seekers of Perl Wisdom
|
|