Measuring the frequency of and distance between keywords in particular contexts is widely used in detecting plagiarism, and that may be the way forward for you, coupled with some fuzzy word matching to pick out appropriation of certain keywords or stolen factual information.