This depends on the algorithm. Some information retrieval algorithms just work with boolean values, others keep track of the frequency of a term in a document.
If you want to keep track of the frequencies, you can either store position/frequency pairs or use two lists, one for the position, the other for the frequencies. The former approach is cleaner, the latter should be faster.
Hope this helps, -gjb-
In reply to Re: Re: Re: Re: Re: Refining a 'vector space search'.
by gjb
in thread Refining a 'vector space search'.
by Seumas
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |