Re: Re: Re: Re: From a SpamAssassin developer

by Matts (Deacon)
on Aug 19, 2002 at 16:24 UTC

in reply to Re: Re: Re: From a SpamAssassin developer
in thread Bayesian Filtering for Spam

I'm not surprised. Not even slightly - see my original post.

The biggest thing about statistical analysis is that you simply cannot test it on the training data set. I get 100% accuracy when I do that, and that's not surprising. I'm speculating that's what PG did, but I could be wrong. There's also the fact that the training often overfits. None of this is news to anyone versed in machine learning (which I'm starting to be ;-)
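
To make the training-set point concrete, here is a minimal sketch of a word-count naive Bayes filter scored twice: once on the messages it was trained on and once on messages it has never seen. Everything in it is an assumption made for illustration: the toy messages are invented, the names `fit`, `classify` and `accuracy` are hypothetical, and it is not SpamAssassin's code or anyone's actual filter.

```python
import math
from collections import Counter

# Toy data invented for this illustration; not from any real mail corpus.
train = [
    ("buy cheap pills now", "spam"),
    ("cheap pills online", "spam"),
    ("win money now", "spam"),
    ("meeting at noon", "ham"),
    ("lunch at noon tomorrow", "ham"),
    ("project meeting notes", "ham"),
]
heldout = [
    ("cheap money online", "spam"),
    ("free offer tomorrow", "spam"),
    ("notes for lunch", "ham"),
    ("buy lunch now", "ham"),
]

def fit(messages):
    """Count tokens per class and documents per class."""
    counts = {"spam": Counter(), "ham": Counter()}
    docs = Counter()
    for text, label in messages:
        counts[label].update(text.split())
        docs[label] += 1
    vocab_size = len(set(counts["spam"]) | set(counts["ham"]))
    return counts, docs, vocab_size

def log_prob(tokens, label, model):
    """Log of prior times token likelihoods, with add-one smoothing."""
    counts, docs, vocab_size = model
    total = sum(counts[label].values())
    lp = math.log(docs[label] / sum(docs.values()))
    for token in tokens:
        lp += math.log((counts[label][token] + 1) / (total + vocab_size))
    return lp

def classify(text, model):
    tokens = text.split()
    spam_lp = log_prob(tokens, "spam", model)
    ham_lp = log_prob(tokens, "ham", model)
    return "spam" if spam_lp > ham_lp else "ham"

def accuracy(messages, model):
    hits = sum(1 for text, label in messages if classify(text, model) == label)
    return hits / len(messages)

model = fit(train)
train_accuracy = accuracy(train, model)      # scored on data the model has seen
heldout_accuracy = accuracy(heldout, model)  # scored on unseen data
print("training set:", train_accuracy)
print("held-out set:", heldout_accuracy)
```

On this toy data the filter scores 1.0 on its own training messages and 0.5 on the held-out ones. The exact numbers mean nothing beyond this example; the gap between them is the point, and it is why a figure measured on the training set says little about how a filter will do on new mail.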