
Noise words, important words, etc. tend to be domain-specific. What I do for my current project is this: for every search, I log:

This is written to a log file, and a cron job dumps the results into a MySQL db for easy reporting.
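
A rough sketch of that kind of logging and reporting, in Python. Everything here is an assumption for illustration: the logged fields (timestamp, query, result count), the tab-separated format, the function names `log_search` and `noise_candidates`, and the 50% threshold are all hypothetical, and the report reads the log file directly instead of going through a cron job and MySQL.

```python
import csv
import io
from collections import Counter
from datetime import datetime, timezone


def log_search(log_file, query, result_count):
    """Append one search as a tab-separated line (hypothetical fields)."""
    writer = csv.writer(log_file, delimiter="\t", lineterminator="\n")
    timestamp = datetime.now(timezone.utc).isoformat()
    writer.writerow([timestamp, query, result_count])


def noise_candidates(log_file, min_share=0.5):
    """Return terms that occur in at least min_share of the logged searches."""
    searches = 0
    term_counts = Counter()
    for _timestamp, query, _result_count in csv.reader(log_file, delimiter="\t"):
        searches += 1
        # Count each term once per search so repeats do not inflate it.
        term_counts.update(set(query.lower().split()))
    if searches == 0:
        return []
    return sorted(t for t, n in term_counts.items() if n / searches >= min_share)


# Usage with an in-memory file standing in for the real log.
log = io.StringIO()
for q in ["how do I match a question", "how to match text", "how is regex greedy"]:
    log_search(log, q, 10)
log.seek(0)
candidates = noise_candidates(log)
```

With these three made-up queries, `candidates` comes out as `['how', 'match']`. "match" would be a perfectly good keyword elsewhere, but in a log full of matching questions it shows up too often to discriminate, which is the domain-specific part.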

So to finally answer your question, you determine noise words by looking at what your users do. HTH