This is a cool "want to use", as I haven't done it yet. However, I spent $199 on a new Linux server, so I'm comitted :)

Basically, I think that a lot of the features that spammers use to defeat simple keyword filters can themselves be recognised by something a little smarter, like a Perl script.

I kind of like SpamCop, but it has issues with my current mail provider, and doesn't quite do what I want in terms of filtering.

So, I want to write some of my own, to "detect" based on the incoming mail, and to handle the "challange" to see if a live person is on the other end. All in Perl, on a Linux box.

So, besides any pointers from those who have already done something like this, I have a semi-of-topic request: point me in the right direction for good info on this kind of Linux server setup? Specifically, how to properly run the mail server (web server, etc.) and the firewall on the same physical box. And, make this a killer Perl machine in general :)

So this isn't totally OT, let me list some of the filter ideas that relate to regular expressions. On newsgroups, I see words that separate each letter with a non-letter mark, like "g.e.t. r.i.c.h!". Kills keyword filters, but easy to spot in and of itself, with real matching logic (i.e. Perl). Likewise, eliminating duplicate subject strings by adding a serial number, they always use a bunch of spaces (to scroll it off small screens?), and that's trivial to spot. Anything with more !'s than words can bite the dust, since I probably wouldn't want to read it even if it wasn't spam.


In reply to Email filtering by John M. Dlugosz

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.