in reply to Re^2: Generate strings which sounds like source string
in thread Generate strings which sounds like source string
Soundex is so trivial of an algorithm it isn't too difficult to create a reverse lookup.
I know, I tried it, but the results are pretty useless. Most of these "matches" are nothing like the given words:
The problems with soundex include:
But for example, 'Cray' is a far better sound-alike for 'Gray' than most of those above.
And there are many words or phrases that might match 'Citrullus' that begin with 'S'. Say 'Sit with us'.
Hence matches like 'Gray; with 'giaour'.
Hence matches like 'Charleston' with 'carls' & 'creolizations'
The name Soundex is deceptive. It has little or nothing to do with the sound.
Metaphone is too specific. Many of the words in the OPs examples would never match anything if encoded at their full length, and if you reduce the encoding length across the board, you get far too many hits for other words. And to dynamically adjust the length of the encoding successfully, you need to encode your dictionary words at all lengths.
I am interesting in your idea (even without implementation).
The problem with developing my idea is that it would be a table-driven algorithm that would require considerable effort (programming & manual), in order to derive the tables. Not worth the effort unless the was at least an outside chance someone might make use of it.
Hence I'd like to know if the OP is serious. And, what he (or other people) might use it for. That might give me an idea as to whether it is worth the time and effort.
|
|---|