perlquestion
karey3341
<br>If my data looks like this:<br/>
<br>word 1: 100 101 101 102 102 102 106 106<br/>
<br>word 2: 101 104 106 110 113 129 131 148<br/>
<br>word 3: 101 153 175 180 381 <br/>
<br>word 4: 106 110 113 122 131 137 142 148<br/>
<br>word 5: 120 165 169 <br/>
<br><br/>
Here word 1 through word 5 stand for different words, and the numbers are the IDs of the papers in which each word has been used as a keyword.
<br><br/>
How can I calculate similarity between these words?
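One possible reading of "similarity" here is the overlap between the sets of papers for two words, for example the Jaccard index (shared papers divided by all distinct papers). The following is only a minimal sketch under that assumption, not a definitive answer: the hash name `%papers` and the sub name `jaccard` are made up for illustration, and repeated paper IDs (as in word 1) are assumed to count once.

```perl
use strict;
use warnings;

# Keyword => list of paper IDs, copied from the sample data above.
my %papers = (
    'word 1' => [100, 101, 101, 102, 102, 102, 106, 106],
    'word 2' => [101, 104, 106, 110, 113, 129, 131, 148],
    'word 3' => [101, 153, 175, 180, 381],
    'word 4' => [106, 110, 113, 122, 131, 137, 142, 148],
    'word 5' => [120, 165, 169],
);

# Jaccard index: size of the intersection divided by size of the union.
# Each list is treated as a set, so duplicate paper IDs are collapsed
# (an assumption; the question does not say how repeats should count).
sub jaccard {
    my ($list_a, $list_b) = @_;
    my %set_a = map { $_ => 1 } @$list_a;
    my %set_b = map { $_ => 1 } @$list_b;

    # grep in scalar context returns the number of matching elements.
    my $shared = grep { $set_b{$_} } keys %set_a;
    my $union  = keys(%set_a) + keys(%set_b) - $shared;

    # Two empty lists have no defined overlap; report 0 for that case.
    return $union ? $shared / $union : 0;
}

# Print the score for every unordered pair of words.
my @words = sort keys %papers;
for my $i (0 .. $#words) {
    for my $j ($i + 1 .. $#words) {
        printf "%s vs %s: %.3f\n", $words[$i], $words[$j],
            jaccard($papers{ $words[$i] }, $papers{ $words[$j] });
    }
}
```

With the sample data, word 2 and word 4 share five of their eleven distinct papers, so they score 5/11 (about 0.455), while word 5 shares no paper with any other word and scores 0 against all of them. If the repeat counts in word 1 are meaningful, a frequency-based measure such as cosine similarity over count vectors might be a better fit than a set-based one.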