in reply to Re^2: What is the best way to store and look-up a "similarity vector"?
in thread What is the best way to store and look-up a "similarity vector" (correlating, similar, high-dimensional vectors)?
Located some dusty slides I remembered seeing (05-LSH). More keywords to research: Jaccard Similarity, MinHashing, Shingling, MinHash Signatures, etc.
Anyway, this is a spooky topic. These techniques are useful for de-anonymizing big data.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^4: What is the best way to store and look-up a "similarity vector"?
by isync (Hermit) on Nov 15, 2013 at 12:47 UTC |