http://qs1969.pair.com?node_id=191230


in reply to Re: Re: Re: From a SpamAssassin developer
in thread Bayesian Filtering for Spam

I'm not surprised. Not even slightly - see my original post.

The biggest thing about statistical analysis is you simply cannot test it on the training data set. I get 100% accuracy when I do that. And it's not surprising. I'm speculating that's what PG did. But I could be wrong. And also the fact that the training often overfits. None of this is news to anyone versed in machine learning (which I'm starting to be ;-)

Matt.

  • Comment on Re: Re: Re: Re: From a SpamAssassin developer