I realize nothing will be perfect. I was trying to see if anyone had done this before, and which model is most efficient. How do you recommend I run a "threshold check"? By percentage of keywords matched?
Percentage of keywords matched would work. You could even weight the keywords and keyphrases and take anything that scores at or above a cutoff, whether that's 2, 5, or 10 points. It's kind of a reverse search.
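
Here is a rough sketch of both ideas, in case it helps to see them side by side. Everything in it is an assumption made for illustration: the `WEIGHTS` table, the point values, and the function names `score`, `passes_threshold`, and `percent_matched` are all made up, and matching is plain whole-word, case-insensitive lookup with no stemming.

```python
import re

# Hypothetical keywords/keyphrases and point values, made up for illustration.
WEIGHTS = {
    "threshold": 3,
    "keyword": 2,
    "reverse search": 5,
}

def _found(phrase, text):
    """True if phrase occurs in text as whole words, ignoring case."""
    pattern = r"\b" + re.escape(phrase) + r"\b"
    return re.search(pattern, text, flags=re.IGNORECASE) is not None

def score(text, weights):
    """Sum the points of every keyword or keyphrase found in text.

    Each entry counts once, no matter how often it appears.
    """
    return sum(points for phrase, points in weights.items() if _found(phrase, text))

def passes_threshold(text, weights, threshold):
    """Weighted check: keep the text if its score reaches the cutoff."""
    return score(text, weights) >= threshold

def percent_matched(text, weights):
    """Unweighted check: percentage of the listed keywords found in text."""
    if not weights:
        return 0.0
    hits = sum(1 for phrase in weights if _found(phrase, text))
    return 100.0 * hits / len(weights)

sample = "A reverse search with a threshold"
print(score(sample, WEIGHTS))                # 8 (5 for "reverse search" + 3 for "threshold")
print(passes_threshold(sample, WEIGHTS, 5))  # True
print(percent_matched(sample, WEIGHTS))      # 2 of 3 entries matched
```

Whole-word matching means "keyword" will not match "keywords", so you would need to list variants or add stemming if that matters. The cutoff itself is something to tune against real examples rather than pick up front.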