Paul Graham on Fighting Spam
Ramakrishnan M writes "Paul Graham, the Lisp Guru is back with a great technique to fight spam. It is based on trust matric, and he claims, only 5 out of 1000 spams got leaked out of this system with 0 false positives. Worth looking at."
I got an email last night about this! Also, it asked me to help out his Nigerian cousin...
that means CmdrTaco reduces his spam intake to around 500/day.
The One Rule Of Chess You'll Ever Need: Don't play someone who carries a kit in their bookbag.
From the article:
.97 probability of the containing email being a spam, whereas "sexy" indicates .99 probability. And Bayes' Rule, equally unambiguous, says that an email containing both words would, in the (unlikely) absence of any other evidence, have a 99.97% chance of being a spam.
/dev/null immediately without as much as a second glance... :-)
Based on my corpus, "sex" indicates a
Hmm.... take an average adult geek and yes, an email mentioning sex or sexy can go to
On the other hand if you run the statistics on email of an average horny teenager, the probabilities might get a bit different.
Kaa
Kaa's Law: In any sufficiently large group of people most are idiots.
(insert (lisp joke (here)))
"verified that they are a real person and not a spammer."
:-)
Heh, spammers are people too, you know
From the article:
.97 probability of the containing email being a spam, whereas "sexy" indicates .99 probability...an email containing both words would have a 99.97% chance of being a spam.
In the spam filtering business, false positives are your biggest worry...Based on my corpus, "sex" indicates a
False positives could be a HUGE problem in this case...imagine the agony if you missed this email from your wife: "I'm feeling REALLY sexy today - meet me at the motel off 12th street at noon for some lunch-hour sex!"
I wonder what Bernard Shifman would make of this article?
What is our 'CS Consultant' up to these days?
Are you local? There's nothing for you here!
It was with the help of spam that with just a simple herbal supplement I was able to add three inches to my penis (an increase of over 20%). I had assumed it was just a scam, and nobody was more suprised than me that it worked.
Well, except my wife.
Phallic Symbols in LOTR
Based on my corpus, "sex" indicates a .97 probability of the containing email being a spam, whereas "sexy" indicates .99 probability. And Bayes' Rule, equally unambiguous, says that an email containing both words would, in the (unlikely) absence of any other evidence, have a 99.97% chance of being a spam.
Obviously, the author just isn't sexy.
m00.
This is the best paragraph of the whole article:
So as spammers start using "c0ck" instead of "cock" to evade simple-minded spam filters based on individual words, Bayesian filters automatically notice. Indeed, "c0ck" is far more damning evidence than "cock", and Bayesian filters know precisely how much more.
The Bayesian filter. You can run, but you can't hide!
+1 Insightful, -1 Troll. What can I say, I'm an Insightful Troll.
if you like, can put things like "don't use me" in the ALT attribute of the image to avoid curious people that browse in text/disable graphics mode.
Better yet, use the alt text "CLICK HERE!" and everyone will assume it's some sort of ad and they will refuse to touch it with a ten foot pole. "CLICK HERE!" is like the web version of the radioactive symbol.
I think most elephants have a large enough penis already.
Actually, I'd recommend a combination between a nasty spam filter that kills off close to anything that might conceivably be spam and white-lists of senders who are automatically cleared.
Great idea! Where'd you get it? The article?