Using Email Networks as P2P Spam Filters

Secure? by geomon · 2005-05-12 07:03 · Score: 4, Interesting

The authors propose that their system have access to inbound and outbound contacts. For trusted email accounts, that might work. But what about email accounts that people may want to creat to sheild their identity (political dissidents, whistleblowers). They would have to live outside of the spam protection network and would, I assume, be seed accounts for spammers.

Am I missing something in this analysis?

--
"Rocky Rococo, at your cervix!"

Re:Secure? by seoYak · 2005-05-12 07:08 · Score: 5, Interesting

I don't think that i'll trade my privacy for a reduction in spam.
Re:Secure? by Mad_Rain · 2005-05-12 07:16 · Score: 1

Wouldn't the whistleblowers and political dissedents just have to email and be read by a few trustable sources to become trusted themselves? I guess the hard part would be to get past the filters of those first few, but it would be possible after that.

--
"What do you think?" "I think 'What, do you think?!'"
Re:Secure? by Anonymous Coward · 2005-05-12 07:17 · Score: 4, Funny

"Those who would trade privacy for a reduction in spam deserve neither." Benjamin Franklin.
Re:Secure? by geomon · 2005-05-12 07:18 · Score: 1

I guess the hard part would be to get past the filters of those first few, but it would be possible after that.

True. I guess my concern would have been whether their proposed system could be mined for information regarding frequency of connection between two emailers.

--
"Rocky Rococo, at your cervix!"
Re:Secure? by dextroz · 2005-05-12 08:45 · Score: 0

I didn't know they had e-mail back then - I thought he just meant "spam" cans

--
Where's my free iPod!? Until then, I'll settle for a kiss...
Re:Secure? by rescendent · 2005-05-12 10:00 · Score: 2, Interesting

Also you would not be able to be emailed by people who you haven't already approved the email address; would they have to phone you first?

For example:
People who change email address (Gmail, dropping a spammed email)
People legitimately contacting you (Old friend, people wanting to know more about your website etc.)
etc.

It would be like setting your telephone to only accept certain phone numbers and scrapping the phonebook. Bad for people, terrible for business.

Though I suppose spam is worse because it requires less effort and cost to contact 1000 people than using the telephone would... I've only changed email address 8 times... LOL

Nice...but not necessary by PenguinBoyDave · 2005-05-12 07:03 · Score: 4, Insightful

Since switching to Thunderbird, I get nearly no spam...maybe one or two per day. I like fancy stuff, but when simple works, go with it!

--
I'm not a troll, but I play one on Slashdot.

Re:Nice...but not necessary by winkydink · 2005-05-12 07:07 · Score: 2, Interesting

One or two per day out of how many? 3? 5? 1000?

--
"I'd rather be a lightning rod than a seismometer." -Ken Kesey
Re:Nice...but not necessary by downsize · 2005-05-12 07:07 · Score: 1

and for webmail, shinyfeet has the best spam filtering I have experienced. fancy or not, it certainly works (and with their unlimited setup, you'd need something that works)

--
do you have shinyfeet?
Re:Nice...but not necessary by Dukael_Mikakis · 2005-05-12 07:15 · Score: 5, Insightful

I use gmail, which does an excellent job at filtering spam.

But I think this could even be a step back. Like the parent says, I think most informed people have solved the issue of filtering spam pretty effectively (Thunderbird, Yahoo, Gmail, Bayesian filters, etc.) and so we don't generally *see* much spam.

The *REAL* problem with spam is traffic and network pollution. Spam wastes a ridiculous amount of bandwidth and (through spyware) hijacks our systems' cycles to do something that is (with filters) ultimately to no end. This seemingly won't solve the bandwidth consumption issue and might worsen the problem by polling all your friends over the network and then using your personal cycles to scan said email against all the known spam on your friends' computers.

People forget that the true detriment of spam these days is the traffic it causes, not cluttering your inbox (if you're smart).
Re:Nice...but not necessary by Crabbyass · 2005-05-12 07:31 · Score: 1

How does Thunderbird provide any protection? When I was using it it didn't seem to have any spam filters.
Re:Nice...but not necessary by Anonymous Coward · 2005-05-12 07:38 · Score: 1, Insightful

I agree with the main point of your post, but wanted to address something you said.

I use gmail, which does an excellent job at filtering spam.

I see this stated here on /. a lot. While gmail seem to be improving in this regard, I haven't had the experience of excellence.

A couple months ago the majority of my spam was actually legitimate email from my mailing lists. As of this moment, I don't see any legitimate mail in my Spam folder.

However, about 20% of the actual spam I get ends up in my Inbox.

I agree this is better than getting 0% spam in my Inbox and a ton of false positives, but my idea of excellent is for me not to even really think about spam at all. In other words, if a spam shows up in my Inbox, I want it to be a rare occurance. I want to be genuinely surprised.
Re:Nice...but not necessary by anti-trojan · 2005-05-12 07:38 · Score: 1

It does have a built-in bayesian filter for a long time now. When were you using it?

--
Virus infects both Windows and Linux!
Re:Nice...but not necessary by PenguinBoyDave · 2005-05-12 07:41 · Score: 1

1.0.2 has a *junk* filtering system. When you turn it on it will go through your inbox and mark what it THINKS is junk. It them moves the *junk* to the junk folder where you can review it. If you find something that is not junk, you can click "not junk" and move it back to the inbox. Once I "trained" it for about a week, I have never found anything I wanted in the junk folder, and like I said, I only get about one or two spam messages in my inbox per day, compared to the 150+ that end up in the junk folder.

--
I'm not a troll, but I play one on Slashdot.
Re:Nice...but not necessary by McGiraf · 2005-05-12 07:42 · Score: 1

The Real problem of bandwith usage will go away by itself when enuf people filter properly. If sent spam is not seen by anybody it will cease to be effective and will become worthless to advertisers.
if nobody pays for spam, nobody sends spam.
Re:Nice...but not necessary by geoffspear · 2005-05-12 07:48 · Score: 2, Interesting

That's a nice theory, but it seems more likely that the more effective spam filtering gets, the more spam will be sent. If it takes 100x more messages to get the same results, the spammers will just send 100x more messages. And they'll need to turn even more machines into zombies to do it.

--
Don't blame me; I'm never given mod points.
Re:Nice...but not necessary by Anonymous Coward · 2005-05-12 07:50 · Score: 0

You obviously don't use secure IMAP. If you did, Thunderbird would be the last client you'd use... its buggy implementation of IMAP is so horrible as to be unusable. POP3 works, though.
Re:Nice...but not necessary by fireduck · 2005-05-12 07:53 · Score: 1

I've been using thunderbird since 0.7 or so, and I'm not entirely in love with it's spam filtering. I've had it turned on since the feature was introduced, so it's had a long time to learn what is spam (95% of my email) and what's not (the other 5%). My most recent download of my earthlink email (which i've just about given up on), had about 60 messages, only 1 of which wasn't spam. But of the 59 spam, 17 weren't marked as spam (despite me junking very similar emails a few days earlier). Perhaps the spam database it's been compiling has been corrupted and I need to start over. I don't know.
Re:Nice...but not necessary by conteXXt · 2005-05-12 08:01 · Score: 1, Interesting

glad you told me. I was happily using it.

gentoo implimentation of courier-imap in ssl mode.

I have had no problems with it.

NOTE: 3 users, all me. NOT in production environs

Any suggestions for a better replacement?

--
The truth about Led Zep should never be told on /. (Karma suicide ensues)
Re:Nice...but not necessary by Linux_ho · 2005-05-12 08:02 · Score: 1

I'm using Thunderbird with IMAP over SSL and it works great, I've had no problems at all. What server software are you using?

--
include $sig;
1;
Re:Nice...but not necessary by nokiator · 2005-05-12 08:12 · Score: 1

I am not keeping my hopes high on this. The response rate for the regular snail mail is about 1%, that is 99% of the junk mail people receive basically goes to trash. Still, corporate America is willing to spend more than $46 billion on direct mail marketing every year. Considering that postal junk mail is a lot more expensive to send than e-mail spam, a 0.1 or even 0.01% response rate for e-mail spam may be sufficient to sustain the current spam rate, if not continued growth.
Re:Nice...but not necessary by DeafByBeheading · 2005-05-12 08:23 · Score: 1

I had massive issues with its IMAP at 0.9, buy since 1.0, that seems to have been fixed (at least for me)...

--
Telltale Games: Bone, Sam and Max
Re:Nice...but not necessary by Dukael_Mikakis · 2005-05-12 08:45 · Score: 1

I would agree but the incremental cost of sending spam is basically free ... especially if people let spyware hijack their computers to do the heavy lifting.
Re:Nice...but not necessary by Anonymous Coward · 2005-05-12 09:11 · Score: 0

I'm using Thunderbird with IMAP (for a long time, since 0.7 at least), and it's been a total and complete non-issue. It just works.
Re:Nice...but not necessary by irq255 · 2005-05-12 09:19 · Score: 1

Does the junk filter in Thunderbird also work on imap accounts? Or is it for pop3 only?
Re:Nice...but not necessary by Anonymous Coward · 2005-05-12 10:04 · Score: 0

I get no mail either. Looks like spam was the only mail I got so far....
Re:Nice...but not necessary by Anonymous Coward · 2005-05-12 10:21 · Score: 0

... recent download of my earthlink email (which i've just about given up on), had about 60 messages, only 1 of which wasn't ...

are you a non-subscriber using their free email service or a subscriber who just has their spamblocker turned off?

if spamblocker is letting that many through, maybe you need to find a new email provider with better filters.

my office uses services from http://www.postini.com/ and pretty effective. between it and a few predefined rules in the mail client (this is work, so it's just oe since i have to support others using it), and in my "INBOX" folder i get maybe one spam a week out of about 2500 incoming mails.

my home accounts rarely see any spam at all because i am careful who the addresses are given to (and they have uncommon, most not in google, usernames).. the only home accounts that get any are the two i use for debian mailing lists and bug reports.
Re:Nice...but not necessary by psetzer · 2005-05-12 15:34 · Score: 1

Better filtering has to be on the network level, so that individuals don't have to opt in. The people who know how to set up effective filtering won't likely buy anything from spamvertisers in the first place, meaning that, while it makes those peoples' lives easier, it doesn't affect the spammers' bottom line much.

--
"Anyone who attempts to generate random numbers by deterministic means is living in a state of sin." -- John von Neumann
Re:Nice...but not necessary by Doc+Ruby · 2005-05-14 13:10 · Score: 1

If all they do is send more spam that the filters trap, then all that will happen is spam traps will be more full. Winning such an intractable arms race, where the defense scales so much better than the offense, requires the attacker to get much smarter. While evolutionary selection might suggest they will both get smarter, in some predator/prey equilibrium, the economics suggest the defense has the advantage. It's cheaper in the aggregate to defend from multiple spam attacks with cheaply distributed filters than it is to develop new attacks. And the value of such advanced intelligence in the software development market (including spam filtering), where legitimate development is usually very lucrative without the other risks, suggests a winning advantage for the filters over the spammers.

--
--
make install -not war

Great... by yotto · 2005-05-12 07:06 · Score: 4, Funny

...Now the RIAA's going to sue me for getting spam.

--
Pulp Audio Weekly - Geek News and Reviews

Re:Great... by tandr · 2005-05-12 08:19 · Score: 1

Well, actually for NOT getting it!

Potential for harm by davidwr · 2005-05-12 07:07 · Score: 4, Insightful

Imagine the potential for harm if I infiltrated a social network and then identified my enemies as spammers, either deliberately or because I or the software agent I use was somehow tricked into doing so.

Social network-based spam-detection is a part of, not a total, solution, and its limits need to be recognized.

--
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.

Re:Potential for harm by Dukael_Mikakis · 2005-05-12 07:31 · Score: 1

This isn't spam-detection on a user basis (e.g. a network with "these email addresses are trustworthy and these email addresses are spammers") it's spam detection based on a spam database.

The potentials for abuse that I see are if you don't keep a spam database at all, and so you will not flag any query as spam (even if it clearly is), or if you try to keep an "Anti-Spam" database (I dunno, a database of legitimate emails. Only problems with these abuses are if you have no database it shouldn't matter because hopefully several others that have been polled will recognize it as spam, and for the "Anti-Spam" database, where you try to flag real emails as spam, there's really no way you could reasonably predict the format of all the legitimate emails that would be received.
Re:Potential for harm by pocketfullofshells · 2005-05-12 07:32 · Score: 1

Yes this is a very dangerous tool in the hands of a spammer or just about anyone with a little brains. But just about any communication tool anti-spammers create to be used on a wide public basis, can and will be used by spammers against those who created it and use it.
Re:Potential for harm by Anonymous Coward · 2005-05-12 07:53 · Score: 0

QUIET YOU!

i have long awated the day where i can apply the blanket DENY *@aol.* rule to the whole world!

Wondering if this works for mailinglists by jurt1235 · 2005-05-12 07:10 · Score: 1

We do a mailing once to twice a week and there are hundreds of wrong/fake mailaddresses on it. We do not filter this, the mailserver dumps the returns. However now we are already marked sometimes as spam, while we craft the messages in such a way that at least spamassassin does not mark it as spam. It is a user subscription list, certainly not spam. This kind of solution will make this situation worse.

--

My wife's sketchblog Blob[p]: Gastrono-me

Re:Wondering if this works for mailinglists by geoffspear · 2005-05-12 07:52 · Score: 3, Informative

If you're sending messages to email addresses that didn't actually subscribe then yes, you're a spammer and you should be blocked.
A well-designed opt-in list won't have any fake addresses on it (although it may have messages to invalid addresses bounce is once-valid accounts stop working), because anyone with half a brain designing an opt-in list would require the addresses it's mailing to be validated by the recipients of the messages before sending them anything.

--
Don't blame me; I'm never given mod points.
Re:Wondering if this works for mailinglists by jms1 · 2005-05-12 10:08 · Score: 1
It sounds like you want people to think you are a "responsible" sender of email, but then you admit the following:
- Your mail server throws away bounce messages, rather than routing them to a human being who can remove the bad addresses from your list. Every bounce message you ignore is essentially stolen bandwidth and CPU on somebody else's server- if you were truly a responsible mailer, you would take steps to minimize your impact on others' systems.
- You have to "craft the messages" to get around spamassassin. This tells me that YOU KNOW whatever you're sending is likely to be considered spam, and yet you send it anyway- ESPECIALLY where you know that your list has "hundreds of wrong/fake" email addresses. How fscking rude.
- The fact that you even HAVE wrong or fake email addresses on your list is an indicator that your signup method is flawed. You should be using a double opt-in mechanism, if for no other reason than when somebody calls one of your messages spam, you can forward them back a copy of the message you received which agreed to receive it.
All of these are common traits of the spammer's trade. Do us all a favor- post the IP address of the machine which sends this list, so that we can blacklist you now.
Re:Wondering if this works for mailinglists by Anonymous Coward · 2005-05-12 11:27 · Score: 0

"It is a user subscription list, certainly not spam."

Which sounds to me like he means that people sign up and then use the spam filiters as an "unsubscribe"
Re:Wondering if this works for mailinglists by jurt1235 · 2005-05-12 19:35 · Score: 2, Informative

Taking texts out of context is your hobby I guess, anyway a reply:
If you get enough trash back because of users, the nicest way is to let the mailserver handle it. A CPU can do the dumping a lot faster than a person can lookup an account, and take the person of the mailinglist

The spamassassin side of the story: We do not like to send out a plain text message, but nice HTML formatted messages. We take care that this requested e-mail is not mistaken for spam by already routing it through a filter to prevent our users who request this mail do not accidentily put us in a spambox. Since we send it from the same address all the time, they can or go to our side and login with their own account and disable, or use a filterrule to dump it in the thrash anyway.

3th point: We send everybody a welcome message with a login. So they need to be active to get started. There is however an very high rate of AOL/Hotmail addresses which do not live very long, resulting in a lot of trouble.

And no, sending a normal mailinglist with limitted resources is not like being a spammer, it is more like being spammed because you have to get rid of all the trash expiring e-mail accounts cause.

--

My wife's sketchblog Blob[p]: Gastrono-me
Re:Wondering if this works for mailinglists by jms1 · 2005-05-13 06:24 · Score: 1

If you get enough trash back because of users, the nicest way is to let the mailserver handle it. A CPU can do the dumping a lot faster than a person can lookup an account, and take the person of the mailinglist
Again you state that your mail server is "dumping" the bounces as they come in. Does your definition of "dumping" mean it's removing the bad email addresses from your list?
If you are removing the bad addresses when bounces come in, whether manually or using an automatic process, then you need to explain what "dumping" means when you describe this process to other people. Most of us think of "dumping" as meaning "throwing away".

However, if your server is ignoring and/or throwing away these bounce messages, then in my book you are a spammer.

We send everybody a welcome message with a login. So they need to be active to get started.
How do they become "active" in the first place? Do they just visit your web site and put in an email address? What would prevent me from visiting your web site and putting in a hundred random email addresses, causing you to send your messages to those people? Is there a second verification step (known as "double verified opt-in") which happens afterwards, or do you just add them to the list and hope that the person will unsubscribe if the signup was bogus?
I'm not trying to sound like a jerk, but in ten years of running ISP's I have seen these same arguments before. Unless you are doing a double verified opt-in, you don't know for sure that the email address you have actually represents a person who wants to receive your messages. Sending a verification request to the email address, and receiving a positive response to that message, is the only way to be sure that the request is genuine.

Isn't this basically how Razor works? by forevermore · 2005-05-12 07:10 · Score: 4, Insightful

Granted, I just skimmed the article, but isn't this exactly how Razor works? (simplified) Communities of people flag messages, senders, etc. as spam, and the mail server (or in my case, spamassassin) compares the messages to the community spam archive for matches before delivery.

--
Do you really need reason for beer? Wingman Brewers

Re:Isn't this basically how Razor works? by HermanAB · 2005-05-12 09:06 · Score: 1

Razor, Razor2, Pyzor, DCC...

--
Oh well, what the hell...

Hate to burst your bubble.. by Anonymous Coward · 2005-05-12 07:11 · Score: 0

..but most of the large spam companies do exactly that, in fact their selling point to company A is that they have thousands of other customers, so if customer B gets a spam or virus, customer A is protected even though they never received it.

many many of the hosted and appliance based commercial spam filters do exactly that, they report up to the home base.

Re:Hate to burst your bubble.. by pocketfullofshells · 2005-05-12 07:47 · Score: 1

It never quite makes it to the top quick enough.
FTA: "our large-scale simulations show that the system achieves a spam detection rate close to 100%"

That must have been some easy spam to stop because in the real world, its more complex. Random generators changing subjects, spoofing random senders addresses, and changing content.

How can it stop the spam from getting to inboxes when it HAS TO GET TO AN INBOX TO EVEN BE MARKED AS SPAM!!! Lets say SPAM A hits server B at 11:00pm. Server then relay's it to block of address's, and spam reaches users by 11:02

User checks mail at 11:05 and is queried by social filter and indicates message is spam.

by this time, the spam is already in as many inboxes as the server was responsible for, and in about 5 minutes SPAM A will come back with a funny mustache and thick glasses disguised as SPAM B, and circumvent the whole process, unless the filters can ignore the slight changes and random content....

I'll beleive this thing works when they try it out on some REAL spam, not that store bought stuff.

Isn't this how Yahoo works by CrazyJim1 · 2005-05-12 07:11 · Score: 3, Interesting

You click a multi-user message as marked as spam, then it turns into spam for everyone else too.

--
God spoke to me.

Reduces to a standard spam filter by tdvaughan · 2005-05-12 07:13 · Score: 3, Insightful

According to the article the method works by asking its network of email users if they've seen the spam before:

Similar software on each computer that receives the query would then check the message against its own spam database, and so on, until a match is found, or the message is deemed original.

So it can't deal with spam that includes a unique random ID and would tag emails from a mailing list as spam. Once more: nice try, but it won't work in the real world.

Re:Reduces to a standard spam filter by Anonymous Coward · 2005-05-12 07:56 · Score: 0

I agree, this is a stupid idea. It's solving a problem that doesn't exist. The criticism of existing systems is:
Therefore, they cannot pick up new spam messages that are unlike any received before.
But in reality today's spam is like yesterday's spam, and that's why Bayesian filtering works so well.
Re:Reduces to a standard spam filter by p2sam · 2005-05-12 08:22 · Score: 1

see dcc, razor, and pyzor for details. but there is such a thing as fuzzy checksum that hashes two similar input to the same output.

Ob by lheal · 2005-05-12 07:13 · Score: 4, Interesting

In Korea, only old people get P2P spam.

Actually, I think we should find a way to attach the same stigma to spam customers that we do to the spammers. Why do spam customers not have to go to jail? They're as much the problem as the spammers.

I can see something like having all the spam customers' names published online, so you google for "spam" and "lheal" and up pops my list of purchases. The other spammers then get a very clean list of people to spam. Over time, the net would be segregated into those who like spam and those who don't.

Yeah, unworkable idea, but so are all the others.

--
Raise your children as if you were teaching them to raise your grandchildren, because you are.

Re:Ob by bill_kress · 2005-05-12 08:31 · Score: 1

I used to keep a list of spam customers and refuse to patronize them. I still avoid them when I can, but have totally lost the ability to keep the list in my head. In my old age, about all I remember for sure is I'll never buy an x-10 camera thingy.
Re:Ob by Anonymous Coward · 2005-05-12 10:34 · Score: 0

Most people buy stuff like viagra from spam to avoid being identified.

Hmmm... by __aaclcg7560 · 2005-05-12 07:15 · Score: 2, Funny

When you thought it was safe to use email again...

Re:Hmmm... by Anonymous Coward · 2005-05-12 08:30 · Score: 0

wow ur fat
Re:Hmmm... by __aaclcg7560 · 2005-05-12 09:12 · Score: 1

wow u noticed

Percolation search by digitaldc · 2005-05-12 07:15 · Score: 0

Man, and I thought a Percolation Search was what I did last weekend in backwoods North Dakota when I couldn't find a local Starbucks.

--
He who knows best knows how little he knows. - Thomas Jefferson

Re:Percolation search by Anonymous Coward · 2005-05-12 07:18 · Score: 1

holy crap that is the most unfunny thing i have ever heard. ever. you have reached the top of the mountain of not funny.
Re:Percolation search by Anonymous Coward · 2005-05-12 08:15 · Score: 0

backwoods North Dakota
Don't be silly, there aren't any trees in North Dakota.

YahooMail, GMail and Hotmail Do This Already by osewa77 · 2005-05-12 07:19 · Score: 3, Informative

What strikes me is that the idea of "pooling information" isn't really new. When one yahoo-mail/HotMail/Gmail user marks a particular mailing as spam, it affects the likelihood that the same email would be marked as spam for other yahoo users. So, the idea of "pooling information about spam" (from article) is already in use! However, it would be nice to create explicit protocols to allow such data (what mailings I have marked as spam) to be made public so that people using other email providers or their own mail servers can share in this pool of knowledge. Of course, the big three email providers (yahoo mail, hotmail, and gmail) will be foolish to make this information public: the spam filtering is one thing that makes a yahoo/gmail account more attractive to potential users! Good idea in theory, but bad business prospects. To add insult to injury, there is no way for the researchers to profit from the arrangement.

Re:YahooMail, GMail and Hotmail Do This Already by lheal · 2005-05-12 07:31 · Score: 1

... (yahoo mail, hotmail, and gmail) will be foolish to make this information public: the spam filtering is one thing that makes a yahoo/gmail account more attractive ...

Don't reject that idea so quickly, as I think you're on to something. The protocol would encapsulate the information that "userx@foo.com marked this message as spam". What the email provider does with that information is something else.

Not only that, but Google and Yahoo! could team up against Hotmail, or AOL, or whoever. Maybe they all could realize that it's in everyone's interest to stop spam and start cooperating.

Right.

--
Raise your children as if you were teaching them to raise your grandchildren, because you are.

NEWS BULLETIN by pocketfullofshells · 2005-05-12 07:20 · Score: 1

This just in, Spammers now use virtual social engineering to clog new P2P network spam filtering, Inboxes bulging with "extra inches"

would that really be good? by overbom · 2005-05-12 07:21 · Score: 2, Insightful

If I were a spammer:

I'd change an email client to respond with any message from certain folks I don't like to report all of their messages as spam to poison the social network. a couple of clients out there saying "yup, I've already got a message like that here, and my user marked it as spam".

think globally, act locally, right?

Re:would that really be good? by taustin · 2005-05-12 10:11 · Score: 1

I'd send myself a bunch of spam complaints, mark them as spam, and see if the abuse desk at my ISP was using this filter system.

Not a particularly new idea... by Otto · 2005-05-12 07:23 · Score: 3, Insightful

This isn't a new idea... except that they propose to integrate it into the mail client and have everybody you've ever sent mail to or received mail from be a potential contact, weighted by frequency that you email them. That's a bit new, but not as effective as it seems.

For one thing, it would block mailing list messages, which are messages that you probably do share with your contacts.

For another, it does not consider that most spam has random keywords seeding into every copy sent, so those would have to be ignored somehow, which introduces a fuzzy match algorithim, which means the possibility of false matches exists, and since you're asking others (probably all using the same algorithim against their databases) you have increased the chances of a false match being found.

In any case, collaborative networks already exist in a better form. Users mark messages as spam when they get them, a flag is created and sent to some central place that all users check against for matches. The algorithim for fuzzy matching resides in one place and is only used as an indicator in spam assassin in any case, not as the sole indicator..

Large scale systems like Google's GMail can use people flagging messages as spam to filter similar enough messages from other users, sort of thing. I'm pretty sure they do something like this, in fact, as my GMail account has *never* made a mistake in it's spam detection.

And so forth. There's better ways than relying on a random query of your contacts to see what they think.

--
- Give a man a fire and he's warm for a day, but set him on fire and he's warm for the rest of his life.

Mmmm, buzzwordie by cloudmaster · 2005-05-12 07:24 · Score: 2, Funny

Sorry, I can't read the article. There were too many buzzwords in the post.

Re:Mmmm, buzzwordie by Anonymous Coward · 2005-05-12 09:19 · Score: 1, Funny

people read the articles?

Darn... by John+Betonschaar · 2005-05-12 07:25 · Score: 1

I've already had this idea for years, seriously, I almost made it my graduation project.

Create a distributed spam filter that fingerprints incoming mail based on a number of criteria, have the user mark spam with a certain 'undesiredness factor', blacklist email fingerprinted as spam and propagate this information to other people using the same system... This way it should be possible to create some kind of 'network' that classifies email much more reliable than a simple content filter or address blacklist.

I even thought about how to integrate something like this in existing mail transport agents like exim and into email clients to have them forward/download the fingerprints and scores of incoming email before the email itself is sent on, and how you could create some kind of hierarchy of 'filter authorities' that collect all fingerprints and their scores from lower levels, possibly going up from the user to the ISP to the backbone provider... The user could have a special inbox that shows the fingerprints/subjects of messages on the mail server that are marked as 'undesired' by other users of the network, with the option to re-rate them or to download them anyway...

Anyways, the way it is put in this article it will not be of much help to the internet in general, because the spam messages are only scored by the 'social network', but they will still be delivered to their destination, and thus wasting bandwidth...

Re:Secure?partially... by spectrokid · 2005-05-12 07:26 · Score: 3, Interesting

You could collect email adresses in a hashed form, just like passwords are stored on a server. You would be able to check if the sender is in the list, but not be able to "un-hash" the list back into real adresses. The way to get around it would be for spammers to attach their sender adresses to these funny mails people do like to forward to their friends.

--

10 ?"Hello World" life was simple then

spam filters should reduce network load by sPaKr · 2005-05-12 07:26 · Score: 2, Insightful

Skipping past the security issues. One of the goals of spam filters should be reducing network load not increasing it. If we have to send our spam to several differnt peers to be scored this would compound the network load problems. Mostly this is a bad idea(tm) from the get go. I think the only thing that will really stop spam is to force something like pgp(gpg) signatures on all mail. Here's hoping the new national ID cards will have public certs encoded on them. It would be cool if someone would step in and get PKI working for the rest of us. Also we should drag the boddies of spammers through major cities behind a horse, while allowing victums to beat the spammer with large sticks like golf clubs.

Re:spam filters should reduce network load by AnotherBlackHat · 2005-05-12 07:41 · Score: 1

One of the goals of spam filters should be reducing network load not increasing it. If we have to send our spam to several differnt peers to be scored this would compound the network load problems.

All other things being equal, reducing network load is better than not reducing it.

But all other things are NOT equal.
If it's a tradeoff between reducing the load on the human (by reducing the amount of spam they must deal with) and reducing the load on the network, I'll pick "reducing the load on the human" everytime.

-- Should you believe authority without question?
Re:spam filters should reduce network load by sPaKr · 2005-05-12 07:58 · Score: 1

You would have a point if this was the only solution to the spam problem. Fortunately it isn't.

Solution!!!! by Anonymous Coward · 2005-05-12 07:31 · Score: 0

I have 2 email accounts;
bogus@cox.net and real@cox.net

bogus is NEVER used for real communication but freely accessible on different web-sites and sent to a couple of "relevant" news servers ;-)

bogus emails are always fetched first and kept 1 week.

real@cox.net has a filter comparing properties from incoming emails to emails in bogus - if there's a match:
emailmessage >> trash!!

That's iT!!!!

Probably easy to bypass by Anonymous+Writer · 2005-05-12 07:32 · Score: 1

Similar software on each computer that receives the query would then check the message against its own spam database, and so on, until a match is found, or the message is deemed original.

Spammers can ranodomly generate content for their spam to bypass this. Have the actual spam message text as a JPEG image followed by random, gibberish text in both the same background and foreground colour so it is invisible. If the system looks for messages that are identical by comparing the text, then spam messages would appear different from the computer's point of view.

This will just be an ongoing thing. Spammers will figure a way around this, then people will get on to what the spammers are doing and fix things so they block the spam again (like filter out text with the same foreground and background, compare JPEGs), but then the spammers come up with new methods... Besides, it seems like in order to compare emails, it will have to distribute more, so this method will cause Spam to take up even more network traffic.

I think a real simple solution would be to have an email client allow only emails from addresses in the address book or from websites that have been bookmarked. Other emails should be kept separate and dealt with as potential spam.

Re:Probably easy to bypass by hedora · 2005-05-12 09:35 · Score: 1

Generating random text, and attaching a jpeg doesn't work for two reasons.

First, the random text doesn't look like legitimate e-mail, so it will be completely ignored by most spam filters. That leaves a jpeg attachment and bogus headers, which out to look pretty 'spammy' to a mail filter. How often do total strangers send you legitimate email that contains nothing but a JPEG and text that doesn't have any 'important' words in common with your other mail?

Second, if you're like me, then your mail client is set not to render HTML, or display attachments by default, so the only think that is displayed is some random text. (This isn't very effective marketing...) I suspect most people eventually set their mail clients this way, at least if they get penis/porn spam. It usually only takes one JPEG/HTML file to remind me to turn off images in a new copy of thunderbird.

Of course, that's ignoring the fact opening a spam in many mail clients inadvertantly updates spammer databases by default, and disabling HTML fixes that. (The html in the messages load a remote image, and the remote image's filename contains your email address, or a hash of it.)

Sounds Like SpamNet by MBCook · 2005-05-12 07:36 · Score: 2, Informative

That sounds like Cloudmark SpamNet (I think that was what it was called). I used it a few years ago when it was in beta and it worked great. The idea is people marked mail they got as spam if it was. When they did that, a hash of the message (or title, or something like that) was sent to their server. When your mail came in, it was hashed and checked to see if it was spam. It was VERY accurate. It had only one problem:

Cloudmark.

I signed up for the free beta and was told that it would be free forever (they were going to charge businesses, IIRC). Then they chagned their mind but said that early adpoters/beta users would get it free for life. Then it left beta and they offered me a $5 discout (one time) for their subscription service (or some other pointless trinket offer like that). As far as I'm concerned they ripped me off.

That set me off trying other things, and I eventually found POPFile, which I use to this day (great software). I've posted this to Slashdot before (a long time ago). Some nice guy from a anti-spam company gave me a code for a free version of their product to be nice (I never used it, I had found something by then and didn't feel like switching again).

The point of all this is that it is a nice method that really works. If there was an open source project that did the same thing, I would use it. Untill then, I've got a solution that works fine.

But this isn't new (if I'm right about what it is, the article is down).

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.

Vipul's razor, anyone? by Anonymous Coward · 2005-05-12 07:50 · Score: 0

This sounds a lot like Vipul's Razor.

Yet Another Insane Proposal by abb3w · 2005-05-12 07:54 · Score: 1

I think we should find a way to attach the same stigma to spam customers that we do to the spammers.

Nah... Congress can't make stupidity illegal; they'd lose too many votes. The universe, not being elected, can... but tends to be in favor of capital punishment as a way of preventing repeated behavior.

An utterly illegal and unethical solution would be to start up a V1AGRA spam outfit, and taint the supply so that one pill in twenty was actually a disguised lethal dose of cyanide. This would cut into demand sharply, and possibly decrease average human gulibility. Of course, when you got caught, the electric chair might be the only thing that saved you from the lynch mobs.

A marginally less illegal and unethical approach would be starting an urban legend that in fact evil fanatic group was doing exactly that, and that some number of people are confirmed dead from it, with minor male Hollywood sleaze in the hospital having barely survived due to semi-plausible escape. I hypothesize there is a large overlap between those susceptible to spam purchasing and those who believe urban legends without checking to provide effective innoculation.

Lessee, how about "Al Qaeda, 6, Cliff Robertson, alcohol induced vomiting preventing the full dose from being absorbed" for the first round?

--
//Information does not want to be free; it wants to breed.

Re:Yet Another Insane Proposal by The_Wilschon · 2005-05-12 08:42 · Score: 1

Let's do it.

--
SIGSEGV caught, terminating

wait... not that kind of sig.
Re:Yet Another Insane Proposal by Anonymous Coward · 2005-05-12 10:30 · Score: 0

I heard SCO was strapped for cash and to pay their legal bills they were selling bad Cialis. 699 people are confirmed dead from it, with Rod Stewart in the hospital having barely survived due to having his stomach pumped to disgorge two quarts of undigested spunk.

Standard Form Letter by Golthur · 2005-05-12 07:56 · Score: 4, Funny

Your post advocates a (X) technical ( ) legislative ( ) market-based ( ) vigilante approach to fighting spam. Your idea will not work. Here is why it won't work. (One or more of the following may apply to your particular idea, and it may have other flaws which used to vary from state to state before a bad federal law was passed.) ( ) Spammers can easily use it to harvest email addresses (X) Mailing lists and other legitimate email uses would be affected ( ) No one will be able to find the guy or collect the money ( ) It is defenseless against brute force attacks (X) It will stop spam for two weeks and then we'll be stuck with it ( ) Users of email will not put up with it ( ) Microsoft will not put up with it ( ) The police will not put up with it ( ) Requires too much cooperation from spammers ( ) Requires immediate total cooperation from everybody at once ( ) Many email users cannot afford to lose business or alienate potential employers ( ) Spammers don't care about invalid addresses in their lists (X) Anyone could anonymously destroy anyone else's career or business Specifically, your plan fails to account for ( ) Laws expressly prohibiting it ( ) Lack of centrally controlling authority for email ( ) Open relays in foreign countries ( ) Ease of searching tiny alphanumeric address space of all email addresses ( ) Asshats ( ) Jurisdictional problems ( ) Unpopularity of weird new taxes ( ) Public reluctance to accept weird new forms of money ( ) Huge existing software investment in SMTP ( ) Susceptibility of protocols other than SMTP to attack ( ) Willingness of users to install OS patches received by email ( ) Armies of worm riddled broadband-connected Windows boxes (X) Eternal arms race involved in all filtering approaches ( ) Extreme profitability of spam (X) Joe jobs and/or identity theft ( ) Technically illiterate politicians ( ) Extreme stupidity on the part of people who do business with spammers ( ) Dishonesty on the part of spammers themselves (X) Bandwidth costs that are unaffected by client filtering ( ) Outlook and the following philosophical objections may also apply: ( ) Ideas similar to yours are easy to come up with, yet none have ever been shown practical ( ) Any scheme based on opt-out is unacceptable ( ) SMTP headers should not be the subject of legislation ( ) Blacklists suck ( ) Whitelists suck ( ) We should be able to talk about Viagra without being censored ( ) Countermeasures should not involve wire fraud or credit card fraud ( ) Countermeasures should not involve sabotage of public networks ( ) Countermeasures must work if phased in gradually ( ) Sending email should be free ( ) Why should we have to trust you and your servers? ( ) Incompatiblity with open source or open source licenses (X) Feel-good measures do nothing to solve the problem ( ) Temporary/one-time email addresses are cumbersome ( ) I don't want the government reading my email ( ) Killing them that way is not slow and painful enough Furthermore, this is what I think about you: (X) Sorry dude, but I don't think it would work. ( ) This is a stupid idea, and you're a stupid person for suggesting it. ( ) Nice try, assh0le! I'm going to find out where you live and burn your house down!

--
Hofstadter's Law: It always takes longer than you expect, even when you take into account Hofstadter's Law.

Re:Standard Form Letter by Linux_ho · 2005-05-12 08:27 · Score: 2, Insightful

I'll add my own here:

(X) Similar to DCC and Razor, but far less bandwidth efficient than either

You should also have checked:

(X) Users of email will not put up with it
(X) Requires immediate total cooperation from everybody at once

--
include $sig;
1;
Re:Standard Form Letter by Golthur · 2005-05-12 08:58 · Score: 1

Yeah, that's what you get when you try to fill out the form and have it posted before someone else does :)

I didn't know if users of email would put up with it or not, so I didn't check that one.

I definitely should have added my own option for consuming tons of extra bandwidth per spam, though - this thing would make the existing bandwidth use of spam look like a raindrop compared to the ocean...

--
Hofstadter's Law: It always takes longer than you expect, even when you take into account Hofstadter's Law.
Re:Standard Form Letter by Anonymous Coward · 2005-05-12 09:16 · Score: 0

I know this is slashdot, but can't you spell check your standard form letter? I mean if it's a copy paste kinda deal, shouldn't one proof read it before littering the web with the same spelling errors 20 million times?
Re:Standard Form Letter by Anonymous Coward · 2005-05-12 09:19 · Score: 0

You must be new here...

Bigger problem... by Not_Wiggins · 2005-05-12 08:05 · Score: 2, Insightful

What is one person's spam is another person's desired mail. I'm not talking about advertising, either. For example, I know for a fact that there are a lot of people out there that "knee-jerk" react to service messages from their bank, credit card, whatever... stuff they even signed up for that they mark as spam. Since I want to get my "your payment has posted" email, do I want to rely on the network of people around me that signed up for the same thing with the same company and report it as spam because they're too lazy to just unsubscribe?

--
Diplomacy is the art of saying, "Nice doggie!" until you can find a rock.

Re:Bigger problem... by Anonymous Coward · 2005-05-12 14:13 · Score: 0

Hi. I work for Cloudmark which makes the commercial product SpamNet and maintains the free Vipul's Razor, two (similar) services which do exactly what this article describes.

What you describes turns out not to be a real problem. The percent of email that the community can't decide whether it's spammy or not is very small indeed! Though even if that weren't the case it wouldn't invalidate the premise - it would just mean the system would have to be a little more sophisticated and the user would have to train it on which sketchy messages you want.
Re:Bigger problem... by Anonymous Coward · 2005-05-13 07:45 · Score: 0

I work for a creditcard company and the grandparent is spot-on. We often have a difficult time dealing with large ISPs because they're so spam-complaint sensitive; if we are trying to deliver service messages to 30000 users and 100 complain, then we have to deal with spam filters labeling all our messages as spam.

Perhaps your Cloudmark service is more sophisticated; but how many people need to mark something as spam within a community to get the rest of the community to label that same message as spam? What happens if you get agents for spammers inside the community who are trying to get "real spam" whitelisted by always accepting it or fighting the spam designation? And doesn't your scheme require that most of the community actively participate in marking messages as Spam/Ham?

Community-based might seem like a good idea, but it has holes. Just like individual spam heuristics (ala, Bayesian filters) can't take advantage of community knowledge is a weak point.

Ultimately, the answer will be to have something other than SMTP as a protocol where sender can always be traced/verified; then spamming will lose its advantage as an anonymous activity. People will be able to track them down and stop them.

There is a DCC filter that does this. by ChadL · 2005-05-12 08:28 · Score: 1

I have been using Spamihilator for a while now, with the DCC plug-in activated, that checks a fuzzy check-sum of the message with servers that hold a list of other users who use this filter. I have found that it does block a number of newsletters that large numbers of people receive, however a simple list of newsletter definitions do a good job of preventing this problem. I just put this filter with a DNSBL filter that checks Spamcop and other blacklists, and a learning filter, with no spam reaching my inbox, and all real messages getting in fine. More Information: http://www.rhyolite.com/anti-spam/dcc/ http://www.spamihilator.com/

I have no friends.. by Harald+Paulsen · 2005-05-12 08:44 · Score: 1

..you insensitive clod!

--
Harald

Now you won't be able to make new friends by WillAffleckUW · 2005-05-12 08:50 · Score: 0, Troll

Because they won't be part of your trusted network.

Especially hot babes or hunks you meet at parties.

Next!

--
-- Tigger warning: This post may contain tiggers! --

Betcha GMail is doing something like this already by wsanders · 2005-05-12 09:20 · Score: 1

In the last few months, as gmail's customer base has grown, their spam capturing capabilities have reached about 99.5% with a 0% false positive rate. And I get about 100 spams per day. It has been weeks since Gmail last falsely identified an incoming spam for me.

This type of searching (i.e efficiently searching through a long-tailed distribution) my contacts and archived mail is probbaly just one part of the equation - only about 25% of my email is from other gmail users. But nearly all of my legit email is from people I have emailed to or from before. I am a perfect candidate for this kind of protection.

--
Give a man a fish and you have fed him for today. Teach a man to fish, and he'll say "WHERE'S MY FISH, YOU IDIOT?"

Spam degrades the reliability of the medium by erice · 2005-05-12 11:10 · Score: 1

People forget that the true detriment of spam these days is the traffic it causes, not cluttering your inbox (if you're smart)

You've got to be kidding. Spam is text, or very nearly so (HTML). Unless you are using floppies and a 2400bps modem, the bandwidth/storage costs are irrelevent.

What is relevent is that it forces people to either use spam filters that randomly throw good messages away or they miss good messages becuase they can't be seen among all that spam. In either case, the loss rate goes from negligable to very noticeable and that makes email a whole lot less useful than it should be.

*Worms*, on the other hand are a storage problem, since they include largish binaries. But the methods for dealing with worms is rather different than for spam.

Re:Spam degrades the reliability of the medium by Anonymous Coward · 2005-05-12 11:58 · Score: 0

You're both right and you're both wrong, you're just identifying the main problem from your respective ends. For people that have to pay the costs associated with building out network capacity and maintaining it, spam's bulk is indeed the main problem.

An end user could care less about that though, they don't pay for those costs directly. What spam does cost them though is time and mental effort. Also having to deal with the effects of malicious content. Just because you can see your own problems clearly enough doesn't obligate you to willfully ignore the problems other people face in the same situation.

Exactly by Anonymous Coward · 2005-05-12 12:03 · Score: 0

Thunderbird spam filter chokes once the number of email goes over few thousands. It's not grinding to the halt, it just hits the wall per se - no spam is getting marked at all ...

2004 spam for one user costs $0.55 by erice · 2005-05-12 13:20 · Score: 1

For people that have to pay the costs associated with building out network capacity and maintaining it, spam's bulk is indeed the main problem.

No, actually it isn't. I run my own mail server. I keep all mail, including spam. I get something like 200 spams/day. All spam for 2004 amounts to a bit more than 100MB. At the somewhat inflated price of $0.50/GB, that is about 5 cents to store all the spam for 2004. You may quibble over the exact number but you would be hard pressed to come up with storage costs over $1.

My ISP charges $5/GB for excess bandwith when web hosting. That means a bandwidth charge for all of 2004 spam of about $0.50

That means the total is still under a buck.

Re:2004 spam for one user costs $0.55 by iamcf13 · 2005-05-12 15:18 · Score: 1

Multiplied by the approximately 934 million people online and your $0.55 figure becomes $513,700,000.00

over $500 million dollars!

Spam worldwide still wastes $$$....

There are alternatives... by greppy · 2005-05-12 13:30 · Score: 0

As you point out, the big three provide a massive surface area to do mass mailing tests on but have no commercial interest in sharing this information.

A company called Cloudmark has, for several years now, been running a similar system for cross-provider spam pooling. It's an outlook plugin that weights user opinions against email voted as desirable or otherwise; pretty much exactly as the article describes only on a more centralised (commercial) basis. A friend who ran it a few years back reliably informed me it was very effective.

http://www.cloudmark.com/products/safetybar/howitw orks/

Spam Archive by certel · 2005-05-12 14:19 · Score: 1

Here is something I thought you spamassassin users would find useful. Teaching the bayesian filter is difficult when just starting a new mail server, so here is a couple years of archived spam mail for you. Run these through 'sa-learn' and you'll have reduced your spam quite a bit: Spam Archive

--
[%] Cingular Ringtones

Re:Spam Archive by Anonymous Coward · 2005-05-12 14:41 · Score: 0

i have been using spamgourmet, http://spamgourmet.com/

so i can sign up for stuff and give my mail address out to computer forms etc.

you can't believe the spam i get through here from sites i thought were trust worthy

however you can set a limit to how many emails you want to receive from 1 to 20

does not help with the traffic but keeps my box clean.

missing the point by cahiha · 2005-05-12 16:00 · Score: 1

Social filtering of email is already widely implemented. It's not implemented at the level of the end user email client, but at the level of email servers, which compare messages to users and blackhole addresses.

A P2P approach and querying of other people's address books has huge privacy and compatibility problems without any obvious advantages.

Re:spam filters should reduce network ld-mine does by iamcf13 · 2005-05-12 16:11 · Score: 1

This is an on-topic, one line ad for a software program I wrote. If you hate ads then READ NO FURTHER!

My mailserver does everything it can to prevent spammers from using the SMTP DATA command to send their spam.

Since you use gmale by Anonymous Coward · 2005-05-12 16:44 · Score: 0

we must assume you value convenience above privacy.

What about an auto bounce back for unknowns? by Anonymous Coward · 2005-05-12 19:39 · Score: 0

I was thinking spam could be solved by having any new senders have to click a confirm link to get the mail in the inbox. once added to trusted lists or 2nd email onwards it could skip that step.

this would waste the spammers time and anyone with anything important to say WILL click the confirm link. or those letters you gotta type in for getting a free account on sites.

hope this idea is useful to one of you amazing /. people. if yer not smart coming in here you are by the time you read all these +5 comments.

bluetigerbc at gmail

and???? by Anonymous Coward · 2005-05-13 00:29 · Score: 0

I always assume this is what Yahoo!, Gmail, and Hotmail were doing anyways. Why wouldn't they? They have all your emails.....

NEW(?) PHISHING TRICK TO AVOID: PLEASE READ! by iamcf13 · 2005-05-13 19:10 · Score: 1

Yet another fake eBay site.... (>_<);

I entered a bogus but properly formatted CC# but it appeared to reject it. Oh well. Enjoy the relevant information and use it to avoid being duped....

The phish was sent from 80.247.227.76 in France through a redirect page at href=http://projekt.ig-immobilien.com/signin.html in Germany to the phish site itself at:

href=http://61.190.66.139/ws/index2.php?MfcISAPI Co mmand=SignInFPP

in China via the phish email link:

href=https://signin.ebay.com/ws/eBayISAPI.dll?Si gn In&UsingSSL=1&pUserId=&ru=http%3A%2F%2Fservlet%2Ee bay%2Ecom%2FForumController%3Fdest%3Dhttp%3A%2F%2F projekt%2Eig%2Dimmobilien%2Ecom%2Fsignin%2Ehtml

Amazing.... Credit card fraud 'courtesy' of miscreants using online resources in three countries using USA e-commerce giant eBay to do their dirty work. Truly an international effort if I ever saw one.... :p

P.S. It looks like eBay closed up this security hole. Good for them. :)

Slashdot Mirror

Using Email Networks as P2P Spam Filters

108 comments