Stopping Spam Before It Hits the Mail Server
Al writes "A team of researchers at the Georgia Institute for Technology say they have developed a way to catch spam before it even arrives on the mail server. Instead of bothering to analyze the contents of a spam message, their software, called SNARE (Spatio-temporal Network-level Automatic Reputation Engine), examines key aspects of individual packets of data to determine whether it might be spam. The team, led by assistant professor Nick Feamster, analyzed 2.5 million emails collected by McAfee in order to determine the key packet characteristics of spam. These include the geodesic proximity of end mail servers and the number of ports open on the sending machine. The approach catches spam 70 percent of the time, with a 0.3 false positive rate. Of course, revealing these characteristics could also allow spammers to fake their packets to avoid filtering."
I'll go first.
All spammers have to do is change the characteristics of the message. It's always going to be a cat and mouse game, just like antivirus and antispyware, so saying that they've found THE solution to blocking spam from hitting the server is slightly irresponsible.
Problem already solved back in 2003, I don't get any spam now.
Why do we need a crazily complex scheme like this when a simple entry in your router's 'Deny' list (for the source IP of the spam) has the same end effect?
Given the spew pouring out of the IP space of China, LACNIC, and Russia, blocking in such a manner appears to be near-lossless compression.
Bruce Lane, KC7GR,
Blue Feather Technologies
Just like other criminals, spammers must quickly respond to what actually works. In essence this is the flaw in any "security by obscurity" scheme, the bad guys simply respond to whatever works. If you get to try several billion times a day then you can try a whole lot of combinations.
That means that in my office of 50 people, with an average of 50 emails per day (a very very low estimate), we'd get 7-8 false positives daily. I'd hear bloody murder if that was the case.
We get a lot more mail than that per day, and our spamassassin without autolearning (simply flag anything higher than 5.0) does a hell of a lot better job than that... down in the range of 1-2 false positives a month. Assuming a low daily average of emails (like my example), that's .002% false positives.
I'm out of my mind right now, but feel free to leave a message.....
The original is "The end result was a system capable of detecting spam 70 percent of the time, with a 0.3 percent false positive rate."
Dave Barnes 9 breweries within walking distance of my house
0.3 would be terrible - three out of ten false positives. 0.3 percent - what the article actually says - is not too bad. But current techniques allow me to check the spam bin for such messages. This technique would pretty much preclude that capability, since the mail would never arrive at the server. I'm not sure that a rate of 0.003 would be acceptable under those circumstances.
Floating face-down in a river of regret...and thoughts of you...
IP addresses, he notes, are easy to fake.
Sure, you can fake your IP address so you get past this filtering, because it just looks at the first packet. It won't help you though, because you can't complete a TCP 3-way handshake from a fake address, and without doing that you can't actually send spam.
Finally! A year of moderation! Ready for 2019?
Isn't this just pushing the processing back a level, but still arriving at its destination? I guess you could implement bandwidth-provider-level (i.e. before the customer even gets their packets) spam filtering this way, but I'm sure most organizations would prefer to retain control by doing their own filtering.
So this software functions in both space AND time? Fascinating.
It's good that they specified that in the name, to avoid questions such as "Will this software work in the universe which we inhabit?"
a baseball glove.
But I'd first have to question why somebody is throwing spam at my mail server in the first place?
what happens when someone tries to contact me out of the blue before I have a chance to white list them?
Absolutely nothing happens ... at least from your perspective.
If libertarians are so opposed to effective government, why don't they all move to Somalia?
I've got a device in front the mail server, many people do. These and others work fine. Sorry for folks that don't have one. As long as it is free, it will be abused. Someone already said it was cat and mouse.
ceci n'est pas un sig
I hear this suggestion a lot. However, many of us work for global companies that deal with legitimate email from these countries. We can't just reject IP blocks for countries when we have dealings in them. China and Russia are huge for international companies.
It sounds like this approach would be fairly CPU intensive; analyzing the characteristics of packets, comparing them to other packets, looking for information on their originating systems, etc... It seems like they are throwing a non-trivial amount of computational time at the problem in order to spare the storage space that would be otherwise taken up by spam.
And of course as others have already pointed out, this just starts another round of whac-a-mole by pursuing this avenue.
Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
Regardless of how complex you make it, someone will always eventually figure out a way around it.
The fundamental property of spam is that it involves many similar messages going to a large number of destinations. That's what to look for. Google can do that, because they manage a very large number of mailboxes with a single system. SpamCop used to do that, but they had to be in the mail-forwarding business to do it and that was too expensive.
Trying to detect spam by looking only at the mail for a single account is inherently a form of guessing. The existing technologies are reasonably good, but not good enough that the spammers give up.
Oh yeah. I was thinking a rate of 0.3 was huge. 0.3 percent is much better but still not acceptable.
Taiwan is not a third world country, and depending on who you ask not even a country. Anyway blocking out all of asia is probably a bad idea for many businesses.
Your post advocates a
(x) technical ( ) legislative ( ) market-based ( ) vigilante
approach to fighting spam. Your idea will not work. Here is why it won't work. (One or more of the following may apply to your particular idea, and it may have other flaws which used to vary from state to state before a bad federal law was passed.)
( ) Spammers can easily use it to harvest email addresses
( ) Mailing lists and other legitimate email uses would be affected
( ) No one will be able to find the guy or collect the money
( ) It is defenseless against brute force attacks
(x) It will stop spam for two weeks and then we'll be stuck with it
(x) Users of email will not put up with it
( ) Microsoft will not put up with it
( ) The police will not put up with it
( ) Requires too much cooperation from spammers
( ) Requires immediate total cooperation from everybody at once
(x) Many email users cannot afford to lose business or alienate potential employers
( ) Spammers don't care about invalid addresses in their lists
( ) Anyone could anonymously destroy anyone else's career or business
Specifically, your plan fails to account for
( ) Laws expressly prohibiting it
( ) Lack of centrally controlling authority for email
( ) Open relays in foreign countries
( ) Ease of searching tiny alphanumeric address space of all email addresses
( ) Asshats
( ) Jurisdictional problems
( ) Unpopularity of weird new taxes
( ) Public reluctance to accept weird new forms of money
( ) Huge existing software investment in SMTP
( ) Susceptibility of protocols other than SMTP to attack
( ) Willingness of users to install OS patches received by email
(x) Armies of worm riddled broadband-connected Windows boxes
(x) Eternal arms race involved in all filtering approaches
( ) Extreme profitability of spam
( ) Joe jobs and/or identity theft
( ) Technically illiterate politicians
( ) Extreme stupidity on the part of people who do business with spammers
( ) Dishonesty on the part of spammers themselves
( ) Bandwidth costs that are unaffected by client filtering
( ) Outlook
and the following philosophical objections may also apply:
( ) Ideas similar to yours are easy to come up with, yet none have ever
been shown practical
( ) Any scheme based on opt-out is unacceptable
( ) SMTP headers should not be the subject of legislation
( ) Blacklists suck
( ) Whitelists suck
(x) We should be able to talk about Viagra without being censored
( ) Countermeasures should not involve wire fraud or credit card fraud
( ) Countermeasures should not involve sabotage of public networks
( ) Countermeasures must work if phased in gradually
( ) Sending email should be free
(x) Why should we have to trust you and your servers?
( ) Incompatiblity with open source or open source licenses
( ) Feel-good measures do nothing to solve the problem
( ) Temporary/one-time email addresses are cumbersome
( ) I don't want the government reading my email
( ) Killing them that way is not slow and painful enough
Furthermore, this is what I think about you:
(x) Sorry dude, but I don't think it would work.
( ) This is a stupid idea, and you're a stupid person for suggesting it.
( ) Nice try, assh0le! I'm going to find out where you live and burn your
house down!
It should be illegal to say that freedom of speech should be limited.
Your post advocates a
( X ) technical ( ) legislative ( ) market-based ( ) vigilante
approach to fighting spam. Your idea will not work. Here is why it won't work. (One or more of the following may apply to your particular idea, and it may have other flaws which used to vary from state to state before a bad federal law was passed.)
( ) Spammers can easily use it to harvest email addresses
( ) Mailing lists and other legitimate email uses would be affected
( ) No one will be able to find the guy or collect the money
( ) It is defenseless against brute force attacks
( X ) It will stop spam for two weeks and then we'll be stuck with it
( ) Users of email will not put up with it
( ) Microsoft will not put up with it
( ) The police will not put up with it
( ) Requires too much cooperation from spammers
( ) Requires immediate total cooperation from everybody at once
( ) Many email users cannot afford to lose business or alienate potential employers
( ) Spammers don't care about invalid addresses in their lists
( ) Anyone could anonymously destroy anyone else's career or business
Specifically, your plan fails to account for
( ) Laws expressly prohibiting it
( ) Lack of centrally controlling authority for email
( ) Open relays in foreign countries
( ) Ease of searching tiny alphanumeric address space of all email addresses
( X ) Asshats
( ) Jurisdictional problems
( ) Unpopularity of weird new taxes
( ) Public reluctance to accept weird new forms of money
( ) Huge existing software investment in SMTP
( ) Susceptibility of protocols other than SMTP to attack
( ) Willingness of users to install OS patches received by email
( ) Armies of worm riddled broadband-connected Windows boxes
( X ) Eternal arms race involved in all filtering approaches
( ) Extreme profitability of spam
( ) Joe jobs and/or identity theft
( ) Technically illiterate politicians
( ) Extreme stupidity on the part of people who do business with spammers
( X ) Dishonesty on the part of spammers themselves
( ) Bandwidth costs that are unaffected by client filtering
( ) Outlook
and the following philosophical objections may also apply:
( X ) Ideas similar to yours are easy to come up with, yet none have ever
been shown practical
( ) Any scheme based on opt-out is unacceptable
( ) SMTP headers should not be the subject of legislation
( ) Blacklists suck
( ) Whitelists suck
( ) We should be able to talk about Viagra without being censored
( ) Countermeasures should not involve wire fraud or credit card fraud
( ) Countermeasures should not involve sabotage of public networks
( ) Countermeasures must work if phased in gradually
( ) Sending email should be free
( ) Why should we have to trust you and your servers?
( ) Incompatiblity with open source or open source licenses
( ) Feel-good measures do nothing to solve the problem
( ) Temporary/one-time email addresses are cumbersome
( ) I don't want the government reading my email ( X ) Killing them that way is not slow and painful enough Furthermore, this is what I think about you: ( X ) Sorry dude, but I don't think it would work. ( ) This is a stupid idea, and you're a stupid person for suggesting it. ( ) Nice try, assh0le! I'm going to find out where you live and burn your house down!
- My uid ends in 69...
First: I do not want others to decide what's spam for me.
Second: I got graylisting, amavisd with spamd & co, and more. Why exactly would I put such a system on every other node of the net too? To throw away resources?
Any sufficiently advanced intelligence is indistinguishable from stupidity.
Slightly off-topic, sorry, but I think it's abysmal enough to post and interest a few (or just make you thankful you're not here.)
"Absolutely nothing" is my company's solution to filtering out large or suspect attachments. If somebody sends me an attachment and my company's filters don't like it, the e-mail is dropped. I don't get a notice saying, "This e-mail contains suspicious attachments and has been removed." My customer doesn't get a reply saying, "This e-mail could not be delivered to the recipient because it contained suspicious attachments." Nothing - Zip, zero, nothing. My customer thinks it went through and it's invisible to me.
After numerous complaints to IT, the response was that I need to contact each of my customers and any of their contacts that may be sending e-mail that I may be copied on with an attachment and have them call me on the phone any time they send on so that I know to expect it.
Beautiful, huh?
He's getting rather old, but he's a good mouse.
What exactly does this mean? A rate is usually a comparison of two values. What two values were compared to get 0.3?
Big whoop. All it does is block email with IP addresses from France, Belgium, Russia, Italy, and Argentina.
I want to try to keep this as non-spam as possible, but Symantec acquired a company about 5 years ago called TurnTide that did almost *exactly* that. Take the reputation of the sending address, and shape the TCP/IP packets to slow down the rate of mail into the system. Symantec touts a 70% reduction in mail volume and an 80% reduction in the amount of spam that hits a mail server. I've had it in production in one environment where the customer went from approximately 5 million messages/day to 500,000 messages/day.
YOU REMEMBER WHEN SEX WAS THE LAST TIME? REFRESH THE MEMORY OF VIA GRA!
No more hair Rogaining medicine.
GIRLS DO ANYTHING FOR A BIG HOSE
It boosts your rod!
Make two days nailing marathon
for your delicate advantage
And all that is just from the most recent page in my spam folder.
http://twitter.com/OLDTELEGRAM
Except that the sender's source domain is almost always forged in spam. So, as an owner of several domains, I get hundreds of bounce messages per day in response to spams sent with my one of domains as the source domain. The "Undeliverable Mail" messages I receive have become, for me, almost worse than the spam.
-- "Most people prefer a popular myth to an unpopular truth"
The spam filter we use already does a geography-based check. Even allows you to choose how to classify it based on geography....as well as IP, grey listing, header, recipient, sender, subject, content, bayesian pattern, honeypot.... My complaint anymore is not about the amount of spam making it in (we have had 1 that slipped thru the filters in the last 6 months, but that is because it actually spoofed an email address on our whitelist (we don't whitelist domains at all). It's the amount of traffic created, and how huge our log files get, and even if we set the amount of logging to limit log file size, then we end up with HUNDREDS of smaller log files.
But do those 2500 messages include spam or are they just the mails that get through the existing spam filters?
Otherwise my understanding of the 0.3% false positive is where 100% = the total number of emails.
Which is rather unacceptable given the handling of false positives, and the total number of emails could be very high when you include spam.
Oh, and I don't use LinuxMagic LOL...
Technology which is already here for a long time has now a buzzword - SNARE. For example OpenBSD spamd doing the same based on blacklists, greylists and even on Operating System fingerprints. Wheel is reinvented again... ;-)
Since 1996...
A few years ago the company I worked for came under an email DOS attack that bogged down our Exchange server to the point that it took about 10 hours for a legitimate email to get through. The Windows admins tried all 10 spam settings with no affect. I put a Linux box running SpamAssassin in front of the Exchange server and within a couple of hours the delivery time dropped to about 10 seconds. Products like SpamAssassin are essentially dynamic filters that can and do get fresh filter information as often as you like. This case was a dictionary attack and we got rid of the vast majority of the spam by the simple expedient of deleting anything that wasn't addressed to a legitimate account. As another poster noted, most spam filtering methods are just educated guessing. Rely on one that is educable.
-- Consensus - 50% probability that the majority are wrong.
0.3% FP on the total mail input, but 90% is spam anyway.. so that means 3% of legit mail is dropped.
3$ is way too high.
Why does it seem everyone ignores the real source of the majority of spam: Microsoft windows computers infected by viruses running botnets that send spam. Yes, is generated by other systems, but not nearly the amount that is being generated by MS based botnets.
How about everyone just send their frigen spam bill to MS. How about a class action for everyone to collect for the damage that MS does to networks around the World. Better yet lets just forward all the spam we get to MS. Let them sort it out.
Living in Chile
So 70% of the time it works every time. Sold.
Wouldn't it be cheaper in the long run to simply design a new mail protocol from the ground up, with security and spam prevention as the main focuses? It seems to me that when you need to implement solutions which are as complex as this one to keep the system running as intended it is more or less a failure.
From now on, whenever you complain to IT, do it in writing, and send them a telegram first, telling them to expect a letter with your complaint. Hopefully they will soon see sense.
www.Buy-Proxy.com - A "buyer-driven" global marketplace.
And follow up with a phone call to make sure they got the letter.
Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
Not even remotely. At best this system could only be used as input to a secondary system that then uses this information along with other sources. See, e.g., SpamAssassin's scoring approach.
... like network-based virus blockers bring several good things:
* an entirely different set of algorithms can be used, leveraging data and traffic patterns not specific to the message contents
* a team of engineers not tied to a single enterprise
And, indeed, major network operators like to do stuff like this - takes traffic off the network, and relieves enterprises of evil traffic forms (including DDOS)
BUT then, net neutrality purists, like 4chan, despise this and fight back, as recently when AT&T worked to thwart a large-scale DDOS attack.
I still don't understand why they don't regulate SMTP servers on the net just like other business areas. These have a real financial impact on other's operating costs. If they required all SMTP servers on the net to be closed and regulated, I think it would be a good start.
I'm talking fines and the ability to cut off any rogue SMTP servers. They also need a better method to validate connecting servers and it needs to be an industry wide adopted standard, whether that is done via certificate authority or some other 'secure IP' method.
Actually, here's a good idea: put it in a greylist and let it hit once and see what the user chooses: whitelist or blacklist. And we're back at square one. GP should be modded Insightful because a lost email may be the perfect business opportunity, therefore not all spam must be blocked.
I have a 100% guaranteed way to stop spam from reaching the mail sever.
Unplug the dam thing!
An SQL query goes to a bar, walks up to a table and asks, "Mind if I join you?"
That's what happens when companies hire incompetent IT staff.
An SQL query goes to a bar, walks up to a table and asks, "Mind if I join you?"
It's probably due to how your company's mail filtering is setup.
In an ideal world, you do your filtering during the SMTP session and give either a 4xx or 5xx code to the originating server if you are blocking the message due to filters. This puts the burden of notifying the original sender on the originating server and does not put your SMTP server at the mercy of relying on a (usually) forged return sender address to notify the original sender.
However, a lot of shops do their filtering after accepting a message for local delivery. This causes problems because there's no way of reliably informing the originator of the message that their message was blocked or bounced. Sender addresses are almost always forged in spam runs, and relying on that to notify the sender causes "backscatter" spam. A lot of email anti-virus tools cause this backscatter because they make the naive assumption that the sender address is not forged.
Some systems do a quarantine style setup, where the message is placed in a queue, and the recipient is informed of the sender / subject and allowed to retrieve it.
Unfortunately, filtering during the SMTP transaction (a.k.a. "pre-queue filtering") has its downsides (performance, CPU usage, lack of integration with 3rd party tools). But it's still best to reject as much mail as possible during the SMTP transaction.
Wolde you bothe eate your cake, and have your cake?
Although it is not 100% effective, having a spam filter in front of the email server is the best solution IMHO. Solutions like this let traffic hit the mail server before stopping it as spam. Other than it being annoying to users, the big issue with spam is lots of small connections slowing down the system. Letting a EHLO for each of the spam hits despite filtering it away before completion is not helpful. But then, it might depend on if your an end user that hates getting spam or an admin that hates what spam actually does to your mail server.
Having to work for a living is the root of all evil.
Are they kidding? 70% and 0.3 % false positives? I employ a simple GreyList which catches 90% of spam and 0 false positives short of a misconfigured sending email server that does not adhere to RFC. Couple this with user-configured Spam Assassin, and my clients see maybe 1 (generally 0) spam email in their inbox a day, with around 10-20 ending up in the spam due to SA. This is down from hundreds in the spam folder and 20-25 in the Inbox before implementing this solution. At least if we're going to pretend something is newsworthy, make it better than what already exists.
Call me old school, but I think the best way to keep spam from getting to the servers would be for there to be a spirited geek vigilante initiative for a couple of years where guys with pocket protectors and baseball bats would show up on the doorsteps of spammers and break their kneecaps. I think there was a Russian spammer who got harsher treatment than this a year or two ago, but I think broken kneecaps would suffice. Just saying...