in reply to Comparing sets of phrases stored in a database?

Interesting question.

In PostgreSQL there are some text tools that may already be adequate for such a database:

The built-in full-text search (includes indexing, parsing, stemming, ranking). [1]

The extension pg_trgm (trigrams). Can be used to index, provides similarity functions. [2]

The extension fuzzystrmatch (with soundex, levenshtein etc.). [3]

[1] http://www.postgresql.org/docs/current/static/textsearch.html

[2] http://www.postgresql.org/docs/current/static/pgtrgm.html

[3] http://www.postgresql.org/docs/current/static/fuzzystrmatch.html