Two Spam Filters 10 Times As Accurate As Humans

← Back to Stories (view on slashdot.org)

Two Spam Filters 10 Times As Accurate As Humans

Posted by timothy on Monday February 23, 2004 @01:13PM from the dev/null-is-getting-fatter dept.

Nuclear Elephant writes "The authors of two spam filters, CRM114 and DSPAM, announced recently that their filters have achieved accuracy rates ten times better than a human is capable of. Based on a study by Bill Yerazunis of CRM114, the average human is only 99.84% accurate. Both filters are reporting to have reached accuracy levels between 99.983% and 99.984% (1 misclassification in 6250 messages) using completely different approaches (CRM114 touts Markovan, while DSPAM implements a Dolby-type noise reduction algorithm called Dobly). If you're looking for a way to rid spam from your inbox, roll on over to one of these authors' websites."

29 of 487 comments (clear)

Outclassed... by Klatoo55 · 2004-02-23 13:14 · Score: 5, Funny

I'm sorry, Dave... That Nigerian guy looks suspicious and I can't let you send him money.

--
------- "A true friend stabs you in the front." -Eliot
Comment removed by account_deleted · 2004-02-23 13:15 · Score: 5, Insightful

Comment removed based on user account deletion
IM Spam by jeffskyrunner · 2004-02-23 13:15 · Score: 5, Interesting

Once Email Spam is eliminated, then IM spam will begin...

--
Jeff
wait, WTF? by PedanticSpellingTrol · 2004-02-23 13:15 · Score: 5, Insightful

I presume they mean more accurate than a human that was only looking at the subject line? I fail to see how someone could misclassify an email after they'd already opened it unless it was some kind of marathon testing, which would be totally unrepresentative of any real life situation. Once you're getting 6,000 messages, it's time to reach for "Delete All" and change your address, methinks
Who is sending that one? by ObviousGuy · 2004-02-23 13:16 · Score: 5, Funny

If your email is indistuinguishable from spam by a human, perhaps the problem isn't the receiver. It's the sender.

Forgive me if I don't feel any pity that some moron's email gets filtered to the junk bin because I couldn't discern it from spam.

--
I have been pwned because my /. password was too easy to guess.
To get this new spam filter... by Anonymous Coward · 2004-02-23 13:16 · Score: 5, Funny

Just enter a valid email address, and hit submit!
Re:Huh? Aren't humans 100%? by MarkJensen · 2004-02-23 13:17 · Score: 5, Informative

I haven't been 100% accurate.

I received an email from my sister-in-law from her work, and the address looked suspicious (one of those weird-looking "letter and number" jumbles.

I deleted it. It happens.
Re:Huh? Aren't humans 100%? by gid13 · 2004-02-23 13:20 · Score: 5, Insightful

If you read the post, it quotes a study and says humans are only accurate 99.84% of the time.

Kinda makes you wonder how they can know the filters are right though. :)

(please don't reply telling me how)
Hmmmm by Anonymous Coward · 2004-02-23 13:20 · Score: 5, Funny

Probably used those same people who open viruses as test subjects.
Obligatory Q... When will mozilla/TB have them? by sisukapalli1 · 2004-02-23 13:21 · Score: 5, Interesting

I reached the conclusion of "two filters better than humans" by using two sequential filters:
server side spamassassin, and a couple of simple procmail recipes. They have kept almost all the SPAM away.

However, it is good to see such good techniques becoming available and we can hope to see them as straight forward usable tools.

So, when will mozilla/TB (or your favourite server side or client side filter) get them?

S
actually by Digitus1337 · 2004-02-23 13:22 · Score: 5, Funny

it's not that humans are not as accurate, it's that 1 in X times we really do want a mini camera or free porn. It is what seperates us from those cold, heartless machines.... mini cameras and porn....
Re:Huh? Aren't humans 100%? by mattkime · 2004-02-23 13:23 · Score: 5, Insightful

Obviously you've never seen someone new to the internet sit in front of their computer. Lots of people don't know what popups are. Lots of people read some spam not knowing what it is. To these people, a computer is merely an interesting string of sensations.

--
Know what I like about atheists? I've yet to meet one that believes God is on their side.
*slams head against wall* by Faust7 · 2004-02-23 13:24 · Score: 5, Funny

I received an email from my sister-in-law from her work

Yeah, so did I. The subject line was "I want you so bad."

I deleted it. Turned out the message was genuine. I'll never forgive myself...

--
The coolest voice ever.
1. Re:*slams head against wall* by maddskillz · 2004-02-23 15:09 · Score: 5, Funny
  
  If it was your sister-in-law sending you that subject line, you probably did the right thing and deleted it
I'm sure they're great, but... by LesPaul75 · 2004-02-23 13:24 · Score: 5, Insightful

I'm also sure that Yahoo's "SpamGuard" was great when they first introduced it. Now, It catches roughly half of all the spam I get. Why? Because people have figured out how it works and taken advantage of it. The same will happen with any content-recognition-based spam software. In the extreme case, even if a piece of software were 100% accurate at saying "This piece of e-mail looks like spam," then spammers would just make their e-mails look exactly like e-mail from one of your buddies. How could software ever tell the difference between:

Hey, dude, check out this website I found. There are some hot naked chicks and stuff. Sweet.
Signed,
Your Buddy

and

Hey, dude, check out this website I found. There are some hot naked chicks and stuff. Sweet.
Signed,
SpamKiddy

Even a human can't tell the difference. The only real difference is who they're from.
Adaptive adversaries by Pendersempai · 2004-02-23 13:28 · Score: 5, Insightful

It's really easy to design an effective solution when the problem is purely mechanical or natural. As long as you're working with spammers who don't adapt, you can slice through their shitstorms very effectively.

But when a single solution becomes mainstream, spammers will adapt to it. Bayesian filters tend to work very well, but now spammers are adding sprawls of randomly generated green-light text to offset the filter's score.

Google found an excellent way to rank websites, but then it became widespread enough that webmasters began to game the system it had created. It's been playing catch-up ever since.

Once the adversary begins to adapt, we lapse into the same cat-and-mouse game of technological barriers and counter-barriers that we've seen so many times before.
Re:Huh? Aren't humans 100%? by evilmrhenry · 2004-02-23 13:29 · Score: 5, Insightful

Quite simple:
With 10 messages (after automatic spam detection) humans are 100% accurate.

With 1,000 messages, (before automatic spam detection)
humans are less than 100% accurate.

The experiment was done on 5849 messages.

Remember; one thing computers are good at is doing boring things repeatedly.
Re:Huh? Aren't humans 100%? by Elwood+P+Dowd · 2004-02-23 13:31 · Score: 5, Interesting

No, humans are not 100%.

If you see a strange name in your inbox with an odd title, that might be a Nigerian businessman, or it might be your long lost Nigerian brother.

I recently tried to order a t-shirt from this guy for a band he used to be in. I found his band because we have the same (semi-uncommon) name. So, he got an email From: himself. I had to send him two emails because he deleted the first one assuming it was spam.

I ordered some RAM for my dad a while back. He gets 200 spam emails a day (email addy in resume & web page), and he deleted the confirmation email from the RAM vendor. The RAM never shipped, and it took us a week to figure out that there was a problem.

People make mistakes all the time. Why is this an unexpected result? People are jackasses. This should be obvious.

--

There are no trails. There are no trees out here.
Could somebody explain this to me... by heldlikesound · 2004-02-23 13:32 · Score: 5, Interesting

I order all kinds of stuff online, wouldn't the receipt emails look like spam? My current spam solution is very simple:

1. display my email online as little as possible

2. use a number of addresses that all filter into one account, then filter by the sent-to address... this has turned up some VERY interesting results, for instance. I used dellorders@mydomain.com for an order from Dell, and NEVER used it or even typed it anywhere again, and started get spam about 6 months later, and I mean the nasty stuff, no just innocent stuff from Dell resellers...

3. i built a rudementary filter that looks for viagra,free,debt,enlarge, etc... if the sender is not in my address book, and the email contains these words, it is sent to a "check these out" folder...

How might a spam filter help me out without zapping confirmation type emails?

--

Cloud City Digital: DVD Production at its cheapest/finest
The true test of a spam filter... by GrpA · 2004-02-23 13:39 · Score: 5, Insightful

Results of new spam filters cannot help but to be bogus... The true test of a filter is how well it works *after* all the spammers know how it works and try to circumvent it.

--
Enjoy science fiction? "Turing Evolved" - AI, Mecha, Androids and rail-gun battles. What more could you want?
Re:can it be used with SA? -yes by wideangle · 2004-02-23 13:41 · Score: 5, Informative

A CRM114 plugin for SA is available, thanks to Devin Nate:

http://bugzilla.spamassassin.org/show_bug.cgi?id =2 301
How not to evaluate filters by Daniel+Quinlan · 2004-02-23 13:55 · Score: 5, Insightful
The study referenced is:
- On the author's mail (where all he does is probably talk about CRM114 and probably does not subscribe to many newsletters or non-technical mailing lists).
- A pre-trained filter. It can't be compared apples-to-apples with any filter that doesn't require training.
- Using his own filter on his own mail! Of course it does well.
... to mention a few of the problems. The statistics and methodology behind these claims are really questionable. I think both Consumer Reports and PC Magazine have both done better evaluations of spam filters (read that however you want).
Also, I wonder how many people have actually looked at CRM114 and tried to use it.
The really interesting thing about CRM114 is the windowed polynomial hashing technique used although there's some evidence that it can work just as well (if not better) on a much smaller window of only two tokens. I'm hoping someone will do a full exploration of the idea for SpamAssassin's Bayes module.
Human accuracy doesn't scale linearly by Kaboom13 · 2004-02-23 14:14 · Score: 5, Insightful

I'm not surprised a filter beat the human, considering the study used a sample of 5849 messages. As the sample size increases, the filter's accuray will increase, and the human's will decrease. Furthermore the higher the spam/real ration, the better the filter will do in comparison to a human trying to sort at a reasonable speed. The reason being humans tend to skim, and rairly actually read entire subjects, much less messages. Give a human 5000 messages and an hour and he will probably make some mistakes. On the other hand, in 10 messages, the human will probably be 100% correct. Most email filters rely on this already, as they tend to err on the side of caution. With the bulk of the spam taken out, it is not a burden to have the human check the iffy bits. Furthermore the type of email can mislead humans. A business-type email sent to someone's personal email is much more likely to be mistaken as spam, and vice versa. The main disadvantage of automated filtering is people generally have an idea of when a really important e-mail is going to come (the type that false positives are completely unacceptable) and who it will be from.
Re:Huh? Aren't humans 100%? by Anonymous Coward · 2004-02-23 14:31 · Score: 5, Insightful

The post quotes "a study" which gives the 99.84% figure. In fact, the 99.84% figure is mentioned in the one paper as "the human author's measured accuracy as an antispam filter...on the first pass". This is what we who understand statistics call "nonsense". An individual human had an estimated accuracy of 99.84% when looking at one particular sample set of data, once. This is not a meaningful number, and it sure as heck ain't "a study".
Re:Huh? Aren't humans 100%? by ergean · 2004-02-23 15:11 · Score: 5, Funny

There goes my bussines idea. I wanted to start a bussines that puts humans in an eastern europe contry to sort corporate e-mail.

Now I have to think again about putting humans to decorticate sunflower seeds, it's cheper than all those machines.
Spot the reference... by Maj.+Kong · 2004-02-23 16:09 · Score: 5, Informative

CRM114 was a piece of encryption gear in Major Kong's...err, my B-52 in the movie Dr. Strangelove . It allowed only properly coded messages to be received by the crew. When the Soviet SAM detonated near the airframe, the CRM114 was damaged and the crew could not get the recall order.

Kong: (announcing through headset intercom )

This is your attack profile: to insure that the enemy cannot monitor voice transmission or plant false transmission, the CRM114 is to be switched into all the receiver circuits. Emergency phase code prefix is to be set on the dials of the CRM. This'll block any transmission other than those preceded by code prefix. Stand by to set code prefix.

ObKubrick: In 2001: A Space Odyssey, one of the pods was marked with the designation CRM-114. And in Clockwork Orange, Alex is injected with serum 114. I suppose CRM-114 is to Kubrick as THX1138 is to Lucas.

Dobly, on the other hand, is from This is Spinal Tap , a mispronounciation of "Dolby" by David St. Hubbins's girlfriend:

Jeanine Pettibone: You don't do heavy metal in Dobly, you know.

Not to mention that it probably avoids trademark infringement (though I wouldn't put it past Dolby Labs or Thomas Dolby to raise a stink).

Maj. Kong

--

Shoot, a fella' could have a pretty good weekend in Vegas with all that stuff.
Re:Huh? Aren't humans 100%? by Harinezumi · 2004-02-23 16:29 · Score: 5, Informative

Computers are neither lazy nor pressed for time, and therefore can afford to read and evaluate every single line of every single message. Humans generally can't be bothered to be so diligent, and while they have the ability to get a 100% rate, in most cases they devote so little attention to the task of filtering email that the success rate drops.
When these factors are considered, I think it's quite possible to write software that in the long run has a higher success rate than a human who has better things to do than filter his mail all day.
Not the best idea by Vainglorious+Coward · 2004-02-23 17:20 · Score: 5, Insightful

What you're planning has already been done, it's called TMDA, and it's not such a good idea. You're going to send out 800 "challenge" emails per day - have you given any thought to how many of those will be genuine addresses, but have nothing to do with the spam you receive because they just happen to be the joe-job victim? These kind of challenge/response systems may slighlty alleviate your own suffering through spam, but at a cost to all those unfortunate enough to have had their email addresses faked. And if the sheer impoliteness of such net behaviour doesn't put you off, note that you're using up more of your own bandwidth to send out such challenges

If any of the smtp exchange or address lookup fails, just forget it, they're probably not real anyway

It would make a lot more sense to make these kind of checks when you're receiving the email in the first place. Reject at the SMTP level - you never accept and process the spam in the first place

--
My next sig will be ready soon, but subscribers can beat the rush
Share the luxury by bigberk · 2004-02-23 18:40 · Score: 5, Interesting

Having such a powerful statistical spam filter is definitely a luxury. I have no difficulty believing the accuracy values presented here. I have had experience with spamprobe, CRM114, bogofilter, spambayes, and spamassassin and all of these do an amazing job to the point where spam no longer exists (for you).

Which leads to me plug a little project called WPBL that uses exactly these types of statistical spam filters to spot spam sources in a distributed fashion. Each project member uploads hourly the IPs they see relaying spam and non-spam, where the 'decision' is made by these extremely reliable filters. This effectively converts your regular mail account into an intelligent spam-trap that feeds a central blocklist.

The more members we get, the better we can identify active spam sources around the world. This information is then used by some sites for quite large-scale blocking. Since you're doing all this filtering processing anyway, why not also share "what you learn" (the IPs that are spamming you)?

If this grabs your interest, read up on the reporting scripts or alternatively, the open WPBL data upload protocol if you want to code your own report generator. Bandwidth usage is minimal.