Slashdot Mirror


Paul Graham on Fighting Spam

Ramakrishnan M writes "Paul Graham, the Lisp guru, is back with a great technique to fight spam. It is based on a trust metric, and he claims only 5 out of 1000 spams leaked through this system, with 0 false positives. Worth looking at."

14 of 675 comments

  1. I heard about this! by WilliamsDA · · Score: 2, Funny

    I got an email last night about this! Also, it asked me to help out his Nigerian cousin...

  2. only 5 per 1000? by jeffy124 · · Score: 2, Funny

    that means CmdrTaco reduces his spam intake to around 500/day.

    --
    The One Rule Of Chess You'll Ever Need: Don't play someone who carries a kit in their bookbag.
  3. Major geek bias there... by Kaa · · Score: 5, Funny

    From the article:

    Based on my corpus, "sex" indicates a .97 probability of the containing email being a spam, whereas "sexy" indicates .99 probability. And Bayes' Rule, equally unambiguous, says that an email containing both words would, in the (unlikely) absence of any other evidence, have a 99.97% chance of being a spam.
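The arithmetic in the quoted passage can be checked with a short sketch. It assumes the two words are independent pieces of evidence and that spam and non-spam are equally likely beforehand; the function name `combine` is made up for illustration and is not taken from the article.

```python
from math import prod


def combine(probs):
    """Combine per-word spam probabilities into one score.

    Assumes the words are independent evidence and that the prior
    odds of spam vs. non-spam are even (both are simplifications).
    """
    spam = prod(probs)                   # all words point to spam
    ham = prod(1.0 - p for p in probs)   # all words point to non-spam
    return spam / (spam + ham)


# The two figures quoted above: "sex" at .97 and "sexy" at .99.
combined = combine([0.97, 0.99])
print(f"{combined:.2%}")
```

With those two inputs the result rounds to the 99.97% figure in the quote.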

    Hmm.... take an average adult geek and yes, an email mentioning sex or sexy can go to /dev/null immediately without as much as a second glance... :-)

    On the other hand if you run the statistics on email of an average horny teenager, the probabilities might get a bit different.

    --

    Kaa
    Kaa's Law: In any sufficiently large group of people most are idiots.
    1. Re:Major geek bias there... by LordNimon · · Score: 2, Funny

      But what about the (unlikely) situation of a geek getting a girlfriend? All of her steamy email will be flagged as spam, and then she'll get upset and dump him. Oh, the irony!

      --
      And the men who hold high places must be the ones who start
      To mold a new reality... closer to the heart
  4. (spam) by Anonymous Coward · · Score: 0, Funny

    (insert (lisp joke (here)))

  5. Re:Best anti-Spam method is TMDA by Anonymous Coward · · Score: 1, Funny

    "verified that they are a real person and not a spammer."

    Heh, spammers are people too, you know :-)

  6. False positives... by dillon_rinker · · Score: 5, Funny

    From the article:

    In the spam filtering business, false positives are your biggest worry...Based on my corpus, "sex" indicates a .97 probability of the containing email being a spam, whereas "sexy" indicates .99 probability...an email containing both words would have a 99.97% chance of being a spam.

    False positives could be a HUGE problem in this case...imagine the agony if you missed this email from your wife: "I'm feeling REALLY sexy today - meet me at the motel off 12th street at noon for some lunch-hour sex!"

  7. Shifman by T-Kir · · Score: 2, Funny

    I wonder what Bernard Shifman would make of this article?

    What is our 'CS Consultant' up to these days?

    --
    Are you local? There's nothing for you here!
  8. You're being shortsighted by David Wong · · Score: 4, Funny

    It was with the help of spam that with just a simple herbal supplement I was able to add three inches to my penis (an increase of over 20%). I had assumed it was just a scam, and nobody was more surprised than me that it worked.

    Well, except my wife.

  9. Re:spamassasin by Unknown Bovine Group · · Score: 2, Funny

    Based on my corpus, "sex" indicates a .97 probability of the containing email being a spam, whereas "sexy" indicates .99 probability. And Bayes' Rule, equally unambiguous, says that an email containing both words would, in the (unlikely) absence of any other evidence, have a 99.97% chance of being a spam.


    Obviously, the author just isn't sexy.

    --
    m00.
  10. Great quote by fizban · · Score: 2, Funny

    This is the best paragraph of the whole article:

    So as spammers start using "c0ck" instead of "cock" to evade simple-minded spam filters based on individual words, Bayesian filters automatically notice. Indeed, "c0ck" is far more damning evidence than "cock", and Bayesian filters know precisely how much more.
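Why a misspelling ends up as stronger evidence can be sketched by estimating each word's spam probability from how often it appears in spam versus legitimate mail. Everything here is an assumption for illustration: the function name, the counts, and the specific choices (doubling the legitimate-mail counts, ignoring words seen fewer than 5 times, clamping to .01 and .99) are one plausible weighting, not a confirmed copy of the article's code.

```python
def word_spam_probability(word, good_counts, bad_counts, n_good, n_bad):
    """Estimate how strongly a single word indicates spam.

    good_counts / bad_counts map words to occurrence counts in
    legitimate mail and spam; n_good / n_bad are the message totals.
    Returns None when the word is too rare to judge.
    """
    g = 2 * good_counts.get(word, 0)  # assumed bias against false positives
    b = bad_counts.get(word, 0)
    if g + b < 5:                     # assumed minimum evidence threshold
        return None
    good_freq = min(1.0, g / n_good)
    bad_freq = min(1.0, b / n_bad)
    p = bad_freq / (good_freq + bad_freq)
    return max(0.01, min(0.99, p))    # never fully certain either way


# Hypothetical corpus: the plain spelling shows up in some real mail,
# the obfuscated spelling only ever shows up in spam.
good = {"cock": 3}
bad = {"cock": 40, "c0ck": 25}

p_plain = word_spam_probability("cock", good, bad, n_good=1000, n_bad=1000)
p_obfuscated = word_spam_probability("c0ck", good, bad, n_good=1000, n_bad=1000)
print(p_plain, p_obfuscated)
```

Because the obfuscated spelling never occurs in the legitimate mail of this made-up corpus, it scores at the upper clamp, above the plain spelling.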

    The Bayesian filter. You can run, but you can't hide!

    --

    +1 Insightful, -1 Troll. What can I say, I'm an Insightful Troll.

  11. Re:Easy way to beat spam 100% by alcmena · · Score: 4, Funny

    if you like, can put things like "don't use me" in the ALT attribute of the image to avoid curious people that browse in text/disable graphics mode.

    Better yet, use the alt text "CLICK HERE!" and everyone will assume it's some sort of ad and they will refuse to touch it with a ten foot pole. "CLICK HERE!" is like the web version of the radioactive symbol.

  12. Re:Nicely done by certsoft · · Score: 2, Funny

    Enlarge <!-- elephant --> penis [etc..]

    I think most elephants have a large enough penis already.

  13. Re:spamassasin by Anonymous Coward · · Score: 1, Funny

    Actually, I'd recommend a combination between a nasty spam filter that kills off close to anything that might conceivably be spam and white-lists of senders who are automatically cleared.

    Great idea! Where'd you get it? The article?