Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:
Hello Monks,
Does anyone have code to cluster files (using k-means or any other algorithm) using vector notation? Each document is represented as a binary vector over the word list, where 1 means the word is present and 0 means the word is absent.
E.g., let the complete word list of a set of documents be art, brick, ball, monk, pearl, road. Then d1's vector is [0 1 0 0 1 0].
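To make the representation concrete, here is a minimal sketch of how such a vector could be built. It is written in Python rather than Perl purely for illustration; the names `VOCAB` and `to_binary_vector` are hypothetical, and the assumption that d1 contains exactly "brick" and "pearl" is read off the example vector above.

```python
# Word list in the order given in the question.
VOCAB = ["art", "brick", "ball", "monk", "pearl", "road"]

def to_binary_vector(words, vocab=VOCAB):
    """Return a 0/1 list with a 1 at each position whose vocab word occurs in `words`."""
    present = set(words)
    return [1 if term in present else 0 for term in vocab]

# Assumption: d1 contains exactly the words "brick" and "pearl".
d1 = to_binary_vector(["brick", "pearl"])
print(d1)  # [0, 1, 0, 0, 1, 0]
```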
Thank you.
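For reference, the kind of clustering asked about can be sketched as a plain k-means loop over such 0/1 vectors. This is only an illustration in Python under stated assumptions, not a recommended or canonical solution: it uses squared Euclidean distance (other measures may suit binary data better), picks the initial centroids by seeded random sampling, and the vectors for d2, d3 and d4 are made up for the demo. The function names are hypothetical.

```python
import random

def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean_vector(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]

def kmeans(vectors, k, max_iter=100, seed=0):
    """Basic k-means; returns (labels, centroids)."""
    rng = random.Random(seed)
    # Assumption: start from k randomly chosen input vectors.
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = None
    for _ in range(max_iter):
        # Assignment step: attach each vector to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(v, centroids[c]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments are stable, so the loop has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:  # leave an empty cluster's centroid where it was
                centroids[c] = mean_vector(members)
    return labels, centroids

# d1 is from the question; d2..d4 are invented for this demo.
docs = [
    [0, 1, 0, 0, 1, 0],  # d1
    [0, 1, 0, 0, 1, 1],  # d2
    [1, 0, 1, 0, 0, 0],  # d3
    [1, 0, 1, 1, 0, 0],  # d4
]
labels, centroids = kmeans(docs, k=2)
print(labels)
```

The cluster numbers themselves are arbitrary; what matters is which documents share a label.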
Replies are listed 'Best First'.
Re: Clustering documents using vectors.
by BrowserUk (Patriarch) on Dec 14, 2012 at 13:15 UTC
Re: Clustering documents using vectors.
by chromatic (Archbishop) on Dec 14, 2012 at 18:51 UTC