Slashdot Mirror


The Next Step in Fighting Spam: Greylisting

Evan Harris writes "I've just published a paper on a new and unique spam blocking method called "Greylisting". The best thing about it other than achieving better than 97% effectiveness in blocking spam, is that it practically eliminates the main problem of other solutions: the false-positive. There's even source code for an example implementation written as a perl filter for sendmail, along with instructions for installing, so you can get up and running quickly."

99 of 481 comments (clear)

  1. your first mistake by frieked · · Score: 4, Insightful

    I'm going to try to say this as nicely as possible and without trolling:
    You have just rendered Greylisting pretty useless by making it open source. Spammers are much smarter than you think and what you have basically done is shown them what they need to do in order to get around Greylisting. That's just my take on the issue, maybe I'm wrong but I doubt it.

    --

    I have often regretted my speech, never my silence.
    -Xenocrates
    1. Re:your first mistake by Soko · · Score: 4, Insightful

      I'm going to try to say this as nicely as possible and without trolling:

      Not trolling at all - you have a legitimate (though perhaps misguided) problem with this method.

      You have just rendered Greylisting pretty useless by making it open source. Spammers are much smarter than you think and what you have basically done is shown them what they need to do in order to get around Greylisting. That's just my take on the issue, maybe I'm wrong but I doubt it.

      So, the spammers themselves will be of significant help in debugging and helping to fix the code so they can't circumvent it, won't they? OSS means anyone who finds how the greylist script is beaten can figure out a fix and post it. Sounds like the best thing to do IMHO.

      Soko

      --
      "Depression is merely anger without enthusiasm." - Anonymous
    2. Re:your first mistake by Schnapple · · Score: 5, Funny
      You have just rendered Greylisting pretty useless by making it open source.
      You're assuming the spammers can read source code.
    3. Re:your first mistake by L.+VeGas · · Score: 4, Funny

      That's just my take on the issue, maybe I'm wrong but I doubt it.

      That's what I like to see. Someone with strong opinions. Or maybe not.

    4. Re:your first mistake by tomstdenis · · Score: 4, Informative

      You're missing a big part of it though. If you have to try say 3 times to send a message [over a 5 day period or so] you're ability to mass send 100million emails is really squashed.

      Legitimate people first time sending won't really mind the few day wait and most MTAs will try for upto a month.

      Tom

      --
      Someday, I'll have a real sig.
    5. Re:your first mistake by TheCarp · · Score: 5, Informative

      not at all

      Read the paper. Spammers would figure it out eventually. What it buys is what they have to do to get around it.

      It means they have to do retrys...that means spam runs take longer, especially since they have to run...then wait for a locally defined timeout, and run all those addresses again

      AND they have to do it from the same IP.

      This raises their bandwidth profile. It wastes their time... all in all... it raises their cost of doing buisness and cuts into their profit margins.

      It means they will have to upgrade their tools again. It means they get headaches. And of course, the next step is to impliment spam traps that watch activity and see that a spammer is spamming, and promotes them to a blacklist before they can even retry. (oh gee 1000 new greylist triplets from 1 IP in under 5 mins? Set the timeouts for that IP to 12 hours)

      -Steve

      --
      "I opened my eyes, and everything went dark again"
    6. Re:your first mistake by JJAnon · · Score: 2, Interesting

      I don't think the mistake has anything to do with it being open source. It could be closed source, and would still fail because the basic premise is so simple - it relies on spammers sending spam to your inbox and not bothering to resend it if an error code is returned. So all a spammer has to do is just resend the message a couple of times to get around the spam 'filter'.

    7. Re:your first mistake by Horny+Smurf · · Score: 2, Funny
      So, the spammers themselves will be of significant help in debugging and helping to fix the code so they can't circumvent it, won't they? OSS means anyone who finds how the greylist script is beaten can figure out a fix and post it. Sounds like the best thing to do IMHO. Soko

      So, once the spammers find how to get around the greylist, they'll submit patches to the spam blocking software?

    8. Re:your first mistake by Henry+Stern · · Score: 4, Interesting

      It means they have to do retrys...that means spam runs take longer, especially since they have to run...then wait for a locally defined timeout, and run all those addresses again

      AND they have to do it from the same IP.

      Not to mention that if this is used in conjunction with other collaborative tools (i.e. RBL, checksums), by the time that the spamming MTA can return its IP address will have been submitted to MAPS/etc. and the contents of the message will have been submitted to Razor/Pyzor/DCC.

      I think that this greylisting idea will be pretty hard to beat by Joe spammer. Since the game of spam detection is pretty much an arms race, slowing him down will probably be enough to turn the battle in your favour.

    9. Re:your first mistake by Ross+C.+Brackett · · Score: 4, Funny

      You're assuming the spammers can read.

    10. Re:your first mistake by autopr0n · · Score: 4, Funny

      You're assuming the spammers can read source code.

      Who do you think writes spamming software?

      --
      autopr0n is like, down and stuff.
    11. Re:your first mistake by Flwyd · · Score: 2, Insightful

      If you have to try say 3 times to send a message [over a 5 day period or so] you're ability to mass send 100million emails is really squashed.

      It's hardly squashed, it just takes a little longer.

      Legitimate people first time sending won't really mind the few day wait and most MTAs will try for upto a month.

      I don't think I've seen a bounce that tried for more than five days. And while instantaneous may be a poor expectation, I often legitimately expect within-an-hour times. The most obvious case of this is automatic responses. A less obvious case is when you know the recipient will be available.

      For instance, a web visitor submits a form. I happen to be at my computer, and ask for more info. The visitor is still available, so responds quickly. I now have to dink around online for an hour to get a piece of mail that shouldn't take more than a few minutes.

      Another case: I phone a friend and ask him to look at the source code I'm having trouble with. He has to wait an hour to receive it, then I have to wait an hour to see his response. Perhaps this could be better solved by instant messaging, but I don't have an IM client at work.

      Finally, the timing of carbon copied messages could get wonky. If several people are conversing via email, each sender/recipient pair creates a new timeout. If some of the participants have previously corresponded, their messages will arrive quickly, and they may respond immediately. Others wait for more than an hour for the original message, but may receive some replies before the original. This becomes vastly confusing to follow and significantly inconvenient.

      I like the Greylisting approach, and these issues could be circumvented by delivering all mail to a user when the MTA thinks she's reading her email (when she's POPed recently or is logged on to a system with low idle time, etc.).

      One other note -- 4 hours seems too small of an initial window, especially if there's some sort of attack.

      --
      Ceci n'est pas une signature.
  2. Questions by Traa · · Score: 2, Insightful
    Some questions about this method:
    • It delays all incoming emails for a certain amount of time. Unfortunate side effect of the algorithm. Can anyone tell me what the average extra time is?
    • I am not convinced that most of the spam comes from specialized email applications that can be fooled with a temporarily failure. Can anyone provide numbers on this?
    • How does the algorithm adapt when aforementioned email applications adapt to 'greylisting'?
    • I see a lot of spam that was probably produced by applications that use an automated signup to yahoo/hotmail/etc. to obtain a temporary email address and leave the actual emailing to those services which will circumvent 'greylisting'.
    • How much of the total internet traffic is made up of email? What happends of we all install 'greylisting' filters and each email has to be resent several times? Is doubling/tripling the amount of email traffic going to be noticable?


    I like the idea though. Since SMTP is broken anyway, why not use another of it's features in a new way to help filter unwanted email. Keep up the good work!
    1. Re:Questions by sulli · · Score: 3, Insightful
      1 hour is the time proposed. Completely unacceptable unless the whitelist works.

      Since most personal users are on dialup or dynamic IPs, unless the mail client can upload the whitelist in a trusted fashion (or the MTA remembers what users the client sent messages to!), this won't work.

      Do any mail clients include whitelist-collection? Mail.app for OS X does collect all addresses you've sent to, but I've never seen any tool to upload it somewhere.

      --

      sulli
      RTFJ.
    2. Re:Questions by sharlskdy · · Score: 2, Informative

      Retry is configurable, and it depends on the MTA. Qmail has a default retry of 400 seconds (6 minutes, 40 seconds).

      Much of my e-mail comes through within seconds - I'm not sure I want that delayed too much. Although, this delay is on the first matching triplet.

      Server disk space requirements for major providers would climb considerably, I would expect. Legitimate mass-mail programs, and mailing list services would have a problem, tho.

      The algorithm takes advantage of the lazyness of spammers, which is not a bad idea.

    3. Re:Questions by eh1001 · · Score: 2, Informative


      The initial time delay depends on the configuration. The default is one hour.

      For better numbers, try it yourself, and report back. That's the best way to validate it.

      As spammers, adapt, the system can be adapted as needed, but for right now, it makes spammers stay at an IP for some measurable amount of time. That time gives other methods of blacklisting and spam blocking time to work.

      Note that every email will NOT have to be sent multiple times. Only those emails that aren't part of an established "relationship" will.

    4. Re:Questions by JaredOfEuropa · · Score: 3, Informative

      It's not the sender's mail client that connects to the server that runs the greylist system, it's the sender's SMTP server as provided by their company or ISP. Its IP address will not change regardless of the sender's connection or dynamic IP.

      --
      If construction was anything like programming, an incorrectly fitted lock would bring down the entire building...
  3. can't believe their numbers by sqrt529 · · Score: 5, Informative

    most spam today is sent through open relays. Those relays will simply retry the delivery no matter which software the spammer uses, so the method won't work.

    1. Re:can't believe their numbers by McDutchie · · Score: 5, Informative

      Eh, open relays are soooo 20th century. :) Actually most open relays today are either blocked or closed, and newly installed MTAs are secure against third-party relaying by default, so this spam method is dying out. Most spam today is sent either directly to the receiving MTA, through open proxies, or through formmail.pl and similar exploits.

    2. Re:can't believe their numbers by grub · · Score: 3, Interesting
      Open proxies get most of my rejects, here's a paste from "spamstat" (a quick script I did that cron's me the output once a day). The logs rotated not quite 2 hours ago.
      Open Relay: 1
      Dialup Spam Source: 0
      Confirmed Spam Source: 2
      Smart Host: 0
      Spamware Developer or Spamvertized site: 0
      Unconfirmed Opt-In List Server: 0
      Insecure formmail.cgi: 0
      Open Proxy Server:8
      --
      Trolling is a art,
  4. In case of /.'ing by Anonymous Coward · · Score: 4, Informative

    The Next Step in the Spam Control War: Greylisting
    By Evan Harris
    Copyright 2003, all rights reserved.

    Introduction
    This paper proposes a new and currently very effective method of enhancing the abilities of mail systems to limit the amount of spam that they recieve and deliver to their users. For the purposes of this paper, we will call this new method "Greylisting". The reason for choosing this name should become obvious as we progress.

    Greylisting has been designed from the start to satisfy certain criteria:

    1. Have minimal impact on users
    2. Limit spammers ability to circumvent the blocking
    3. Require minimal maintenance at both the user and administrator level

    User-level spam blocking, while somewhat effective has a few key drawbacks that make its use in the continuing spam war undesirable. A few of these are:

    1. It provides no notice to the senders of legitimate email that is falsely identified as spam.
    2. It places most of the costs of processing the spam on the receivers side rather than the spammers side.
    3. It provides no real disincentive to spammers to stop wasting our time and resources.

    As a result, Greylisting is designed to be implemented at the MTA level, where we can cause the spammers the most amount of grief.

    For the purposes of evaluating and testing Greylisting, an example implementation has been written of a filter that runs at the MTA (Message Transfer Agent) level. The source for this example implementation is available as a link below, and as other implementations or additional utility code become available, they will also be linked.

    Greylisting has been tested on a few small scale mail hosts (less than 100 users, though with a fairly diverse set of senders from all over the world, and volumes over 10,000 email attempts a day), however it is designed to be scalable, as well as low impact to both administrators and users, and should be acceptable for use on a wide range of systems, including those of very large scale. Of course, performance issues are very dependent on implementation details.

    The Greylisting method proposed in this paper is a complimentary method to other existing and yet-to-be-designed spam control systems, and is not intended as a replacement for those other methods. In fact, it is expected that spammers will eventually try to minimise the effectiveness of this method of blocking, and Greylisting is designed to limit options available to the spammer when attempting to do so.

    The great thing about Greylisting is that the only methods of circumventing it will only make other spam control techniques just that much more effective (primarily DNS and other methods of blacklisting based on IP address) even after this adaptation by the spammers has occurred.

    The Greylisting Method
    High Level Overview
    Greylisting got it's name because it is kind of a cross between black- and white-listing, with mostly automatic maintenance. A key element of the Greylisting method is this automatic maintenance.

    The Greylisting method is very simple. It only looks at three pieces of information (which we will refer to as a "triplet" from now on) about any particular mail delivery attempt:

    1. The IP address of the host attempting the delivery
    2. The envelope sender address
    3. The envelope recipient address

    From this, we now have a unique triplet for identifying a mail "relationship". With this data, we simply follow a basic rule, which is:

    If we have never seen this triplet before, then refuse this delivery and any others that may come within a certain period of time with a temporary failure.

    Since SMTP is considered an unreliable transport, the possibility of temporary failures is built into the core spec (see RFC 821). As such, any well behaved message transfer agent (MTA) should attempt retries if given an appropriate temporary failure code for a delivery attempt (see below for discussion of issues concerning non-conforming MTA's)

  5. Tempfailing is not new and unique by HiKarma · · Score: 5, Informative
    This idea isn't so new or unique. It's been discussed a fair bit on the ASRG mailing list under the name "tempfailing".

    First I heard of it was from Landon Noll and Mel Pleasant. It is noted in brief as one of the techniques in this plan to end spam (though their plan, which did include the triplets, is not laid out in full there.)

    It is a worthwhile technique for a little while, and if spammers were rational, would be worthwhile for some time to come. But spammers are not rational, and already this technique is not as useful as would be hoped.

    Do a Google Search for Tempfailing especially in ASRG to see statistics etc.

  6. Easy way to stop spam... by Anonymous Coward · · Score: 3, Informative

    Just encode your e-mail address on web pages & don't sign up to any dubious mailing lists.

    I haven't received 1 single spam in recent months from doing this!

    1. Re:Easy way to stop spam... by seangw · · Score: 2, Informative

      It isn't always possible to never publish your email address.

      You can, however, establish classes of emails. Most people don't like this however, because you have to check multiple accounts, and it really doesn't stop the spam.

      In order to sign up to certain services / sites you need to provide a valid email address.

      While that email address can be a secondary email, if anything important is going to come in on the email (such as domain information via network solutions) you will still want to use your real email address.

      It's a very difficult issue.

  7. 1 false positive is not acceptable. by Pop+n'+Fresh · · Score: 3, Insightful

    This isn't very reassuring:

    "it practically eliminates the main problem of other solutions: the false-positive."

    What does 'practically eliminates' mean? If it gives false positives at all, it is just as useless as all those 'other solutions'.

    --
    *This page intentionally left pointless*
    1. Re:1 false positive is not acceptable. by pclminion · · Score: 5, Interesting
      Wrong. 1 false positive can be acceptable, and in fact is probably better than how things are now.

      At USENIX '03 there was a paper presented on artificial intelligence techniques for spam detection. I can't provide a link since only USENIX members can download the paper (at this point, at least). I was a coauthor of that paper.

      One of the things we've discovered in our research is that some classes of filters (most notably, the one I have been developing along with a few other individuals) are actually more effective at correctly classifying email than humans are. That is to say, you can train the learning algorithm on mostly-correctly-classified data, then re-run it over the training data, and almost miraculously, it discovers all kinds of email in the training set that was incorrectly classified.

      I.e., this filter has discovered mail that I myself incorrectly thought was spam. It's scary, because there's a lot of it.

      To assume that a human will always be 100% accurate at classifying their own email isn't just arrogant, it's plain wrong. Newer filters that will be introduced in the near future might possibly be more accurate than you, a frail human, could ever be.

    2. Re:1 false positive is not acceptable. by dasmegabyte · · Score: 3, Insightful

      Maybe we don't want them to be so accurate.

      I get these chain emails from my brother. They are always some funky scheme to get money that won't work. I'd love to just delete them...but if I do this, he tells my mom I don't answer his email.

      She then laces into me like you would not believe...blah blah blah he's your brother and you should love him. I don't need that grief...so instead I respond with a "not interested, no cash right now." Keeps the family happy.

      I could see it being more important than this, though. Your boss sends you direct mail HE received and appends a "Should we do this" to the bottom. Or, worse, your marketting team constructs a direct mailing that fails your spam filter (no comments from the peanut gallery...obviously this is a good thing to find out, but this is not the way to find it out). Missing that one email could make somebody VERY angry and put you in danger. I have had messages from my boss/CEO/etc go into my junk folder and found them when cleaning it out.

      It is correct for the spam engine to label these as spam email. It would be incorrect for it to delete them before they got to you. And so I subscribe to the school of thought that a single false positive makes any spam filter absolutely worthless. It is very easy to delete a message that gets through the filter. It is impossible to resurrect a mailing you never even knew you got.

      --
      Hey freaks: now you're ju
  8. Time critical by Synithium · · Score: 5, Insightful

    Time critical mailing will go out the window. I can see how this might make any corporate user irate. The same thing goes for challenge-response, the time delay in the business world is unacceptable.

    This would be great for personal mail, but that's about it. ISPs would have the same problems with it because their business-class users most likely use the same servers as their consumer-class users.

    1. Re:Time critical by eGabriel · · Score: 4, Informative

      This isn't true, actually. Once one mail gets through, the system lets in subsequent mails from that sender. So there is only the initial delay, after that CEO Joe can use his email as a fat instant messenger per usual.

    2. Re:Time critical by SuiteSisterMary · · Score: 2, Insightful

      Besides, if you're using SMTP for time-critical things, you have a problem, as SMTP is NOT a guarenteed delivery system.

      --
      Vintage computer games and RPG books available. Email me if you're interested.
    3. Re:Time critical by Sturm · · Score: 2, Insightful

      As an e-mail admin, I would definitely advise someone against using e-mail for any type of communication that involves either "time" or "critical". There are just too many things that can go wrong. Mail queues fill up because of DNS failures, domain names expire, disks fill up... These are just a few of the "normal" bad things that can happen with e-mail systems.
      Private instant messaging, IMHO, is much better for "time-critical" communication. Of course, it depends on what type of data you are sending and what the transmission medium is.
      I heard a rumor that people once used to use phones and faxes for communicating, But I haven't been able to confirm it :)

    4. Re:Time critical by SuiteSisterMary · · Score: 2, Insightful

      Oh, I've seen it, and I warn people against it all the time.

      But, hey, most companies are schizo when it comes to IT. You wouldn't let your accounts recieveable rely on random people who may or may not pass on your records unchanged and unread; so why do you trust your business communications to SMTP email?

      --
      Vintage computer games and RPG books available. Email me if you're interested.
    5. Re:Time critical by IncohereD · · Score: 4, Insightful

      How often do you get time critical e-mail from someone you've never recieved e-mail from before?

      some guy telling you to BUY THIS NOW != time critical.

      your wife telling you to BUY THIS NOW == time critical, and in theory, your wife == whitelisted (or blacklisted, depending on personal preference).

  9. security through obscurity, again? by dh003i · · Score: 4, Insightful

    If they can get around it by looking at the source, then something was wrong with it, waiting to be exploited. Might as well fix it.

    1. Re:security through obscurity, again? by blakestah · · Score: 4, Interesting

      The thing that is wrong is the SMTP protocol, and most people's conception of a spammer. Once you see a few "confessions of ex-spammers", everything changes.

      There are people out there who pay $10000 in startup costs, and then make $2000/week for spamming. The $10000 gets them software written by knowledgable internet security experts. This software finds any and every way to anonymify the email spam, and finds lists of people to spam.

      As long as knowledgable internet security experts are getting paid good cash to enable spammers, and SMTP doesn't change, spam will only continue to get worse. There needs to be a fundamental change in SMTP protocols. It oughta take the spammers about 2 days to fix their MTA bug to get around greylisting.

    2. Re:security through obscurity, again? by SuiteSisterMary · · Score: 3, Insightful

      The way to get around this, of course, being that you send each email twice. In other words, run through your database, then run through your database. Same IP addy, same sender, same recipient. As far as the MTA's concerned, it's retrying. Boom.

      --
      Vintage computer games and RPG books available. Email me if you're interested.
    3. Re:security through obscurity, again? by SillySlashdotName · · Score: 4, Insightful

      I see that, in fine /. tradition, you didn't RTFA.

      From the article: If we have never seen this triplet before, then refuse this delivery and any others that may come within a certain period of time with a temporary failure. (emphasis addded)

      Later in the article it goes into much more detail about the delay, how long to delay if the triplet has not been seen before, life time of the whitelist, etc.

      It also talks about configuring the times - they mention the default delay is 1 hour, but that their records suggest that 1 minute would have caught 99% of the same spam messages - "The data collected during testing showed that more than 99% of the mail that was blocked with the tested setting of 1 hour would still have been blocked with a delay setting of only 1 minute. At that point, having a larger initial delay will definitely help, as it gives time for other blocking methods to act. For this reason, it is suggested that at least a one hour delay value be kept as a default, since spammers will start adapting as soon as this method becomes known and starts being used. (again, emphasis added)

      RTFA!

      --
      Acts of massive stupidity are almost never covered by warranty. --me.
    4. Re:security through obscurity, again? by pjrc · · Score: 2, Insightful
      No, simply sending the message twice does not defeat it. Retries are rejected for 1 hour (default setting). The paper specifically talks about how 1 minute will block virtually all spam today, but such a short timeout will allow spammers to defeat greylisting exactly as you have described.

      Quoting from the paper:

      The initial delay of 1 hour was picked for several reasons:

      1. An hour is short enough that in most cases, users will not notice the delay.
      2. It is long enough to give time for administrators on a possibly compromised or abused mail server to discover the problem and hopefully correct it, before any of the offending email is able to be delivered.
      3. It is long enough to provide a good chance that if the sending host is in fact a spammer, they will be listed in other IP-based blacklists that may be used in conjunction with Greylisting, so that even if a spamming relay later attempts a redelivery that would no longer be delayed by Greylisting, it may still be blocked by other methods.
      4. It is also long enough that other types of traffic analysis could be designed and implemented such that spamming IP's could be easily identified and blocked by other methods, in such a way that even the first recipients (before a spamming pattern starts to emerge) would still not be bothered by the spam email.

      The data collected during testing showed that more than 99% of the mail that was blocked with the tested setting of 1 hour would still have been blocked with a delay setting of only 1 minute. However, it is expected that as spammers become aware of this blocking method, they will change their software to retry failed deliveries. At that point, having a larger initial delay will definitely help, as it gives time for other blocking methods to act. For this reason, it is suggested that at least a one hour delay value be kept as a default, since spammers will start adapting as soon as this method becomes known and starts being used.

      Personally, I disagree with item #1. A one hour delay in first-contact email is not acceptable... at least for me.

    5. Re:security through obscurity, again? by blakestah · · Score: 4, Insightful

      RTFA!

      There is no magical waiting period or re-try period that cannot be trivially coded around. And, with good money on the line, will be trivially coded around.

      You don't get it. Really smart people are getting paid a whole lot of money to make programs to exploit every possible crack in the way we send email. There is no general rule to spammers, except that it is a lot of money and they are very clever. Little bandaids are not going to stop this one - there needs to be a much more fundamental change. And I am not talking about laws against spam - I am talking about changes in the protocols we use to send email.

    6. Re:security through obscurity, again? by Torp · · Score: 2, Informative

      What he wants to do, instead of rejecting the mail immediately, is reject the mails on the greylist after holding the connection for, say, 10 minutes. That will help deter spamming software, since it will slow down the rate at which mail goes out.

      --
      I apologize for the lack of a signature.
    7. Re:security through obscurity, again? by dasuridai · · Score: 2

      It seems to me that we should start suing the people that pay the spammers. They should be relatively easy to find, since they are apparently making money off of the advertising that they purchased, after all. If you stop the flow of money to spammers and make the cost / risk of funding spam great, then you would inevitably reduce the spam that gets put out.

    8. Re:security through obscurity, again? by SacredNaCl · · Score: 3, Insightful

      As stated, the only reason the hour works right now is because the spammers don't see this in the wild. Re-running your database script an hour later isn't a big deal.

      I disagree. When you are sending 250,000,000 emails a day -- restarting that script IS a big deal. It would, in effect, make them have to do the entire thing twice. That's a pretty big hit on their resources.

      --
      Freedom is merely privilege extended unless enjoyed by one and all.
    9. Re:security through obscurity, again? by letxa2000 · · Score: 5, Interesting
      is reject the mails on the greylist after holding the connection for, say, 10 minutes. That will help deter spamming software,

      I doubt it. I would assume the spam software would have a timeout, and I doubt it's ten minutes. If they want to hit-and-run and aren't even willing to make a second delivery attempt when an error code is returned, I doubt they're going to wait 10 minutes. I'm sure that within 30 seconds or less they'll consider it a dead connection and hang up.

      Problem is, I used to have my sendmail HANG UP in real-time on an incoming connection as soon as it realized a message was spam. I.e., the incoming message was filtered in the DATA phase and if it was spam I hung up immediately. It worked great and it felt good, but there were many spam programs that took the disconnection as some kind of TCP/IP failure and immediatelty tried again. So I had one day where a single message was attempted to be delivered about 30,000 times as the spammer connected, I hung up, spammer software said "Oops, let me try again!" About one delivery attempt every second or so.

      I'd be willing to bet if you put a 10 minute timeout in sendmail you'll see lots of spammer software disconnecting sooner and just trying again. It takes more of their resources, but takes more of yours, too.

    10. Re:security through obscurity, again? by SillySlashdotName · · Score: 2, Informative

      I agree that one of us doesn't get it. :)

      I agree that there is no "magical waiting period or re-try time period". However, by forcing the spammer to re-run through their spam list, their life has been made a little harder, they have been forced to be a little more visible, we have pushed them to use more resources (hopefully hitting them in the wallet), and we have forced them to do something that, BY ITSELF, can be used as a spam indicator. As I mentioned in another post, I rarely get duplicate emails from people - so getting duplicates within 4 hours - as spammers try to get past the greylist - would be a (one) possible signature for spam.

      Spammers are generally (or so I understand) using a 'fire-and-forget' method of spam sending, which is why/how they can send millions of emails a day. Responding to the greylist method takes that away from them or forces them to double their resource usage, their bandwidth, their exposure on the Internet. Resources are not free, bandwidth is not free and most spammers are exposure adverse.

      Either they work a way around the problem - the only way I currently see is to behave more like a legitimate emailer which reduces the number of addresses they can reach in a time period and so, for the same response rate, reduces their income - or they don't bother and the greylist reduces network traffic by refusing the email BEFORE IT IS EVEN SENT.

      I agree the greylist is not a cure - but I never said it was, and it seems to me to be a win-win situation to use it.

      Until there is a fundamental change in the protocols I see this, if adopted widely enough, as a viable way to reduce bandwidth usage and spam. I don't see a change in the fundamental protocols happening quickly (if at all), I do see the greylist here today.

      --
      Acts of massive stupidity are almost never covered by warranty. --me.
    11. Re:security through obscurity, again? by SillySlashdotName · · Score: 2, Interesting

      Spammers rely on a tiny hit rate to a huge number of emails. If they sent more emails, they would make more money. I assumed (dangerous, I know) that they would therefore be sending the maximum number of emails they could based on their bandwidth limitations as well as their resourse limitations - or as many as they thought they could get away with without being noticed/blacklisted/shutdown. I.e., they are either running flat out, or trying to stay under somebodys' radar.

      If they are sending 250,000,000 emails a day, then that must be all they CAN (or think they can - amounts to the same thing) send or they would be sending more. If they have to send everything twice, then they have just dropped the number of emails to 125,000,000 - and cut their income from their activity in half.

      Another possibility is that they double the size of their email farm and the width of their pipeline - but that also takes $$$, time, and resourses - my computer requires electricity or it refuses to work.

      I am for anything that hits spammers in the pocketbook.

      --
      Acts of massive stupidity are almost never covered by warranty. --me.
    12. Re:security through obscurity, again? by volkerdi · · Score: 3, Interesting

      There is no magical waiting period or re-try period that cannot be trivially coded around. And, with good money on the line, will be trivially coded around. You don't get it. Really smart people are getting paid a whole lot of money to make programs to exploit every possible crack in the way we send email.

      Yeah, spammers are so clever. Well, the fact is if for every one of these "smart" (yeah, right) spammers who has the help of a network consultant that will work around greylisting there are 5 dumbasses who don't (and I think I'm being generous there), then if I greylist I'd think over 80% of my spam problem would be eliminated. What's wrong with that? What's to "get"? Looking through headers I see the same bulk mailers used over the years, probably passed around as warez in spammer circles.

    13. Re:security through obscurity, again? by Fulcrum+of+Evil · · Score: 2, Informative

      Well, the fact is if for every one of these "smart" (yeah, right) spammers who has the help of a network consultant that will work around greylisting there are 5 dumbasses who don't

      This does fuck all when your one spamking is responsible for 80% of the SPAM (by volume.

      --
      "We returned the General to El Salvador, or maybe Guatemala, it's difficult to tell from 10,000 feet"
  10. spam.....hrmmm by chef_raekwon · · Score: 5, Insightful

    with all of these solutions to spam..and all of the spam now flooding mail servers...

    isn't it time to change the specification (RFC) and possibly the manner in which our current system works? i haven't come up with anything yet, but surely there must be some sort of handshaking/secure type connection that could be used - - some sort of postage (free) that is encrypted into the mail, that states that it is genuine....kind of like the hologram on those windows cds...

    i dunno. file this story under redundant.

    --
    We're like rats, in some experiment! -- George Costanza
  11. I'm not sure about this... by BiteMeFanboy · · Score: 3, Insightful
    These applications appear to adopt the "fire-and-forget" methodology

    I thought it was generally understood that most spam was sent by abusing open relays, thus hiding it's origin. This could be wrong. However if it's not, those figures aren't appllicable. Nor is spam going to be diverted since an open relay is generally running a regular mta and will attempt a retry. For instance, if qmail were running on an open relay and was abused by a spammer it would try again and again with an increasing delay (calculated logarithmically if memory serves) between attempts. So the mail will still get through.

    When you further consider that if a spammer hits an open relay and hammers your mailserver from it and all of the "triplet's" are new, you're increasing your traffic, because all of that mail will be attempted again.

  12. Bayesian Filtering by Dr+Rick · · Score: 3, Interesting
    I'm finding that use of the Outclass interface to POPfile is surprisingly effective at dealing with my spam problem (and I get a lot of it) - since training POPfile I haven't had a single spam message get into my inbox no false positives. Of course I could just be very, very lucky and with this post the email gods will punish me...

    How does the effectiveness of Greylisting compare with what others are seeing with existing techniques (such as Bayesian filtering)? Is it a false positives problem, such as digests and opt-in mailing lists getting incorrectly tagged as spam?

    --

    Dr. Rick
    - "It's such a fine line between clever and stupid" (Nigel Tufnel)
    - Zort! (Pinky)
    1. Re:Bayesian Filtering by anti$pam · · Score: 4, Insightful

      The key is to make spammers not make money!

      If people start adopting anti-spam technologies we would reduce the return spammers get from sending spam. Reduce this enough and the spamming business will no longer be profitable.

      POPFile is great. I've also used SAProxy (http://saproxy.bloomba.com/) under windows and it works great too.

      Again, the idea is not to eliminate all spam, but to reduce the return rate, and therefore the money made by spammers.

    2. Re:Bayesian Filtering by Admiral+Burrito · · Score: 2

      Mod parent up!

      There is a "Let's replace SMTP to stop spammers!" meme floating around. I haven't seen a single example of a new SMTP protocol that will actually stop spamming - they only make it marginally harder. Replace SMTP and you've gone to a whole lot of effort, and the spammers will still find a way to spam.

      People shouldn't dismiss client-side filtering on the grounds that spammers are still wasting our resources. That's a temporary situation! Right now most people don't have good client-side filtering - most people are using Outlook(| Express) without any of the Bayesian tools. Once that changes spamming will be futile, and the spammers will go away and stop wasting our resources. Spammers are not script kiddies trying to DoS the system, they are sleazy business people trying to make a buck. Eliminate the profit from the sleazy business and the sleazy business people will go away.

      That said, SMTP will probably go the way of the dinosaur anyway, replaced by whatever instant-messaging standard we eventually end up with. Better support open standards unless you want a single company controlling your communications.

  13. I think not by Monoman · · Score: 5, Interesting

    Doels this mean all public crypto algorithims are useless?

    --
    Keep the Classic Slashdot.
  14. How about Habeas' haiku method? by siskbc · · Score: 3, Interesting

    The best idea I've seen in YEARS was to have people start using a specific, original poem as their signatures. Then, the author granted license to anyone who WASN'T sending spam. Therefore, they could sue any spammer for copyright infringement if they used it, and you could train your mail filter to look for the signature. Once spamassassin took it up, it pretty much snowballed. See story here

    --

    -Looking for a job as a materials chemist or multivariat

  15. Copy of spam logged? by spuke4000 · · Score: 2, Insightful

    Question about this system: if it sends a temporary unavailable message or whatever it does, does it log the original message? Where I'm going with this is what happens if a legitimate message is blocked but never resent? Most anti-spam software allows you to view the spam folder, or something equivalet, to check for false positives. How are false positives handled here?

    --
    This post cannot be rebroadcast without the express written constent of Major League Baseball.
  16. RFC 3514 by pizen · · Score: 4, Funny

    How about in the spirit of RFC 3514 (the evil bit) we create a spam header in email. Spam will set this header so we can easily filter it out.

  17. Published a paper? by Call+Me+Black+Cloud · · Score: 4, Informative

    Where? To me, publishing a paper means your writing appeared in some peer-reviewed journal (where the "peers" are acknowledged as domain experts). What you did was put up a web page. With a donation link at the bottom.

    For others looking for a solution, try POPFile. Open source, cross platform, gives me 96% accuracy.

    One more thing: "practically eliminates" is not the same as "eliminates".

    1. Re:Published a paper? by vidarh · · Score: 4, Insightful
      To me publishing a paper in a peer reviewed journal instead of on the web would mean that I'd expect audience to be reduced to a ridiculously small fraction of people that might be interested. If I wanted to publish something I'd do it on the web first, and if it stacks up people I respect would start talking about it and link to it.

      Yes, I realize that for "serious" science still expect things to be published in peer reviewed journals, but in most cases I can't help but think that getting the article out there would be more useful. Sure, peer review is important, and somewhere to look for some kind of verification of the value of a paper is useful. But I much prefer the Research Index way, where I can get a good indication of the value of a paper by looking at how many people have cited a paper and WHO have cited a paper.

      Anyway, pretending that putting up a document on a website is somehow less publishing a paper than having it printed in a journal, is just plain elitist. You should propably be a bit more critical to papers that are published that you don't know have been through a proper review, especially if you're not a domain expert yourself, but being aware of the source is something that you always need to be.

    2. Re:Published a paper? by FattMattP · · Score: 2, Insightful
      One more thing: "practically eliminates" is not the same as "eliminates".
      And "publishing a paper" isn't the same thing as "publishing a paper in some peer-reviewed journal."
      --
      Prevent email address forgery. Publish SPF records for y
  18. Poor use of statistics by GGardner · · Score: 4, Insightful
    The data in this article claims that 1% of all corporate mail servers in the UK allow open relaying, down from 91% in 1997. For all we know, the total number of corporate e-mail servers has grown by a factor of 100 (or more) in the last six year, meaning that perhaps there are more open relays now.

    The article also doesn't measure the amount of spam coming through those relays. Even if there are only 10 open relays in the UK at any one time, it still might be possible for all of the spam to be coming through them.

    Certainly, closing down open relays is a good thing, but lowering the percentage of open relays doesn't prove anything about the source of spam

    1. Re:Poor use of statistics by StringBlade · · Score: 2, Insightful
      Realize that the article doesn't claim that Greylisting alone will stop all spam, but Greylisting in conjunction with blacklisting and other anti-spam techniques can make open relays less of a problem.

      Let's just take the scenario where a major spammer has decided to route his spam through an open relay in the UK. The network admin in charge of email security at BigSoftware Corp. has implemented Greylisting in addition to all his anti-spam measures previously existing including blacklisting. According to the article it is possible to delay incoming mail from that relay long enough to set up a blacklist for that entire domain or perhaps a subnet of that domain depending on where the flood of mail is coming from. If the UK relay has a complaint about mail not making it to BigSoftare Corp., the admin can politely tell him he's got a spammer molesting his relay and will gladly remove his domain from the blacklist once the relay is closed.

      --
      ...and that's the way the cookie crumbles.
  19. Easy for end-users, sure. by Medievalist · · Score: 5, Insightful
    Just encode your e-mail address on web pages & don't sign up to any dubious mailing lists.
    Many of us must maintain contact addresses in the global whois database - so that people can contact us when something is broken.

    Look at it this way: you can stop crank calls by unlisting your phone numbers. But you can't unlist the hospital, the ambulance service, the fire department, etc.

    We're not all end-users. Some of us are the plumbers.
  20. Re:My spam filtering by McDutchie · · Score: 2, Funny
    If a message contains the word "offer" it's toast.
    Remind me never to offer you any friendly help with anything. :)
  21. Waiting for Article Title by notque · · Score: 4, Funny

    The Next Step in Fighting Spam: Death Penalty

    --
    http://use.perl.org
  22. Re:I have my own algorithm by dprovine · · Score: 2, Insightful

    But what happens when you try to have an email
    discussion about stopping spam, and someone in
    the discussion says "Well, I filter out any
    message with the words viagra or penis..."?

    Does that get flagged as spam and discarded too?

  23. Wouldn't it be nice if by abe_is_fun · · Score: 2, Informative

    Spammers are notoriously resiliant. Within a few days/weeks/nanoseconds the spammers would realize they need to retry after a delay, and they would stop with the fire-forget mentality.

    I wish your plan would work but I just don't think it will.

    Plus the spammers can get their viagra at wholesale cost!

    --
    I don't want to be here.
  24. Delaying email by one hour! by pjrc · · Score: 5, Insightful
    From the linked paper:

    An hour is short enough that in most cases, users will not notice the delay.

    I'm wondering how I'm going to explain that to a new customer over the phone who says "I'll just email that file right now so we can go over it together".

    1. Re:Delaying email by one hour! by vidarh · · Score: 4, Insightful
      Agreed. I've been involed in operating a larger (hundreds of thousands of active users) mail system a couple of years ago, and users would complain if their mail took more than seconds. We had to upgrade our system at one point because rapid growth had made mail delivery take a couple of minutes on average, and it caused bad publicity - a lot of users had a clear expectation that e-mail should be delivered in a few seconds and that if it didn't something was wrong.

      I think changing that perception of e-mail as near instant will be incredibly hard. And if you succeed it will just move even more traffic over to the IM networks and cause spamming of IM networks to escalate instead.

    2. Re:Delaying email by one hour! by Binestar · · Score: 2, Insightful

      This doesn't stop the attachments from going through. This only delays them. For those gotta be there now attachments you should be using something that is meant to be more reliable than SMTP anyways.

      Just because themajority of people does something incorrectly doesn't mean it's suddenly the correct way to do it.

      --
      Do you Gentoo!?
    3. Re:Delaying email by one hour! by pjrc · · Score: 3, Insightful
      Saddly, you have missed the central point about the necessity of timeliness of email delivery and instead focused on using FTP rather than attachments.

      Even if FTP were a solution, it does nothing to answer a new customer who says "I just heard about you and I'm excited about your products. Wanted to call and ask you some questions. I sent an email about 10 minutes ago with an outline of the project we're doing were you guys could really help out, have you had a chance to look it over yet".

      There's a limitless number of these important common customer relationship scenarios, where the expectation of all parties involved is that email is delivered in under 1 minute and typically 5-10 seconds. And there are an infinite number of scenarios other than sales and customer service/relations where people quite reasonably expect email to be delivered in seconds.

      Focusing on using FTP isn't just the wrong answer, it's not even an answer at all to the problem of email delivery taking an order of magnitude longer than users expect and depend upon.

      But as others have pointed out, most users don't have access to FTP servers to receive files. Most corporate firewalls would prohibit users from setting up a FTP server. I would guess that almost any employee behind a corporate firewall wanting to somehow receive a file from a new customer via FTP who attempted to ask a sysadmin would get the answer "just have them send it as an attachment". FTP is simply not a viable protocol for customers and salespeople (or most others) to use to pass files back and forth.

      Aside from not solving the unacceptable delay and the inappropriateness of using FTP, there is the problem of bad attitude. Specifically:

      Explain this to your user. You can just tell them that... [snip]

      Where did "new customer" turn into "user". The word "user" in this context is often spoken in the tone of an overworked, grumpy sysadmin who's personal view of his priorities are decoupled from the larger organization's mission (usually taking care of customers, selling products, operating efficiently, and so on).

      In this particular example, what is important is that the new customer whats to talk with someone about solving his problems. That someone is me, and I want to impress him, sell him something that will truely meet his needs, and hopefully turn him from "new customer" into "repeat customer" or even "loyal customer". THAT is what is important, and getting the customer's file quickly and easily with minimal hassle is merely a tool that enables the truely important work to happen.

      Not having the email for 1 hour means I'll either have to call him back in an hour, while he probably calls some competitors and shops around. Often times people will buy from the first friendly, knowledgable person who goes to some effort to help them.... searching until they find that person/company. Delaying response to a new customer by 1 hours would put me at a competitive disadvantage.

      Or we'll have to proceed without it (FTP is not an option), leading to frustration as he explains material that would have been much better delivered as a file. Maybe it would go ok, maybe not. But it's starting the whole process "on the wrong foot".

      Then again, if your business is being a grumpy sysadmin where you have (captive) "users" rather than "customers", maybe delaying new email conversations is a big advantage which is not offset by any impact in "responsiveness" because it's already intentionally low.

  25. Open Relays a smaller problem? Viruses instead? by garyebickford · · Score: 2, Informative

    According to this article (June 12), open relays at least in the corporate environment are becoming hard to find, requiring spammers to find new ways. In 1997, 91% of mail servers tested were open; as of a year ago only 1%. ISP and home machines apparently weren't tested.

    This doesn't really say what's actually being used by spammers, but it's a sign of improvement. At the least, it narrows the pool of available relays. Continuing progress will increase the spam pressure on those remaining, which will in turn make it more likely that they'll be fixed.

    The article also doesn't say what spammers might use as an alternative. From something else I read recently (don't recall where), mail viruses that take over users' machines are rapidly becoming the tool of choice. There are a lot more of them than mail servers, so it makes sense for the spammers. It does put them in a more dangerous position WRT the law. IMHO (IANAL), using a virus to exploit someone's machine for profit is almost certainly illegal under existing law.

    --
    It's easier to be a result of the past, but more fun to be a cause of the future! http://www.spacefinancegroup.com/
  26. One good point about this proposal by Anonymous Coward · · Score: 5, Insightful

    It deals with spam at the server level. All the wonderful user-level solutions don't do jack to stop spam from being sent. Look at the numbers the spammers show for return rate, and look at how fast spam programs can go, and you'll see that the only solutions that will work are those that make it expensive to send spam. Anything else will just make the spammers send more spam to try and get the hit rate they need.

  27. clever hack for WHOIS contact addresses by phr1 · · Score: 5, Interesting

    The registrar I use (jumpdomain.com) has a clever hack for despamming WHOIS contact email. Basically they change your published contact address once a week. The published address i automatically generated, looks like gibberish, and forwards to your real address. If someone wants to contact you by looking up your address by WHOIS and writing to you, it works fine. But if they add the address to a mailing list, it stops working in a week. That has eliminated almost all my WHOIS spam. Good scheme.

  28. The real solution by mrseigen · · Score: 4, Funny

    We should grab some of the guys who get 1000+ spams per day, point them to the physical location of the spammers, and then step back. I can guarantee you that vigilante justice is entirely appropriate here, considering we want the gov to step back from the 'net instead of entering new "secret arrests of spammers"(?) laws.

  29. Re:Sarxpam by selfabuse · · Score: 3, Funny

    find the people who sent it, and send them a message saying "I'm Ron Jeremy... I don't think you *want* me to have another 3 inches"

  30. Bogofilter does pretty well for a client filter by lxdbxr · · Score: 4, Interesting
    The summary does not seem completely accurate; since the greylisting MTA sends an SMTP temp failure there should never be any false positives as long as the sending MTA is vaguely RFC-compliant (sadly not true I suspect). Or at least that was my reading of the paper...

    I'm currently using Bogofilter (and looking into CRM114) and getting better than 99% accuracy (about 1 in 200 false negatives at the moment) and very very few false positives (maybe 2 in 5000 messages).

    Of course these are MUA level filters (and yes, I know, I've already "paid" with bandwidth to download the spam) - however since the proposed "greylister" would have to be installed as the MTA at major ISPs (as the authors note) I'm not convinced that is more likely to get widespread adoption than the various sorts of adaptive client-based filtering now available, particularly as it requires a database to back the method up.

    As far as I am concerned the major factor in a spam filter should be zero false positives - personally I don't mind reviewing one or two spams a week but I get really annoyed if I were to lose a real message (note the two false positives I have sent to date with bogofilter contained forwarded sales pitches along with a message).

    --
    -- Nothing unusual happened today
  31. co-evolution by 73939133 · · Score: 3, Insightful

    During the initial testing of Greylisting, it was observed that the vast majority of spam appears to be sent from applications designed specifically for spamming. These applications appear to adopt the "fire-and-forget" methodology.

    Spam guards and spam co-evolve. Since greylisting is easy to get around by spammers, if it becomes widespread, spammers will take measures to avoid it, and the net result will be a lot of extra traffic.

    In fact, the impact of this kind of system on mail could be pretty bad if widely adopted: large amounts of mail may end up being held up in delivering servers, and "informative" messages sent by helpful mail systems (about "temporary failures") may end up creating more junk mail than they avoid.

  32. 97%? not impressive. It's POPfile for me by YE · · Score: 3, Informative

    I get 98-98.5% accuracy with POPfile. I get about 200 mails a day, of which around 30% spam. I get about 1 false negative a day, and maybe 2 or 3 false positives a month. It's a personal solution and as such is much more attractive to me than something server-based which has to be installed by a [typically VERY uncooperative] BOFH.

    I use it experimentally for general mail classification (business/personal/a variety of mailing lists etc., all in all 7 buckets) on my home machine, and it works fine in these conditions too, although the accuracy is a bit lower (around 95%).

  33. anyone@domain by autopr0n · · Score: 2, Informative

    What I do is set my MTA (well, actualy someone I'm using someone else's mail server) to forward all the mail sent to my domain to my main account. That way, whenever I sign up for anything or give away any email address I use a unique address.

    Oddly enough, I've been totaly promiscuous with these throw-away email addresses, but I've only got one SPAM from a company I actualy bought stuff for. So far no one has sold any of the address, and all the websites I've posted too either arn't scanned or are protecting the address well.

    OTOH, my 'main' address are spammed constantly. rrr.

    --
    autopr0n is like, down and stuff.
  34. Blah! Just another fancy waste of time.... by steveit_is · · Score: 2, Informative

    You know... All of these fancy 'spam-fighting' methods are just a waste of time, when all you really need is a properly configured smtp server and some good free realtime blackhole lists. After making simple changes, I went from receiving 5-6 spams a day down to 1 in a year and a half. With no complaints of 'false-positives'. There have been instances where the senders mail server was misconfigured, but after contacting them and explaining the situation they were invariably helpful. All I did was make sure that only mail sent from fqdn to valid local accounts, and such were allowed there is an ok tutorial on basic psotfix configurattion herehere.... and a great one Here.

  35. missed the point by eLoco · · Score: 4, Informative

    I've seen some comments that say (paraphrasing) "For real SPAM filtering use <POPFile|Spamassassin|...>", but these missed the point (or perhaps didn't read the paper?). This method is a "first-pass" filter, getting rid of e-mails for which no redelivery attempt will be made. The second-pass filter should still be implemented for everything that gets through the first pass. From the paper:

    "The Greylisting method proposed in this paper is a complimentary method to other existing and yet-to-be-designed spam control systems, and is not intended as a replacement for those other methods. In fact, it is expected that spammers will eventually try to minimise the effectiveness of this method of blocking, and Greylisting is designed to limit options available to the spammer when attempting to do so."

    --
    sig != null
  36. The Ironic Spam Solution by robby_slaughter · · Score: 2, Interesting

    Personally, I only accept messages that are 10 MB in size or larger. If you want to email me, please be sure to include a huge block of random text at the end of your email or else I'll never see it.

    I don't get any spam using this approach, because the spammers don't send big messages. And if *everyone* ignored small messages, spammers would have to close up shop because they could not afford to send millions of big messages.

    (This is a joke. But you could do this at the SMTP level, by automatically replying to any sender who is not on your personal whitelist with the response: "Hey you, if you're real, send me back a HUGE reply!" And the SMTP server could cheerfully delete the last 99% of the first-time oversized email you get. I should write my own anti-spam paper and get mad Slashdot cred. Nah, I'm too lazy.)

  37. Re:here are the stats by tomhudson · · Score: 2, Interesting
    Open Relay Database Stats by Country

    You'll notice that the US is the #1 country Top 3 are:

    1. The United States, with over 80,000 open relays
    2. Korea and Japan pretty much tied at +15,000 each
    3. Japan, at just under 10,000
    That's more than everyone else combined!
  38. E-Mail Secrets by Nakoruru · · Score: 2, Interesting

    I have written an essay on ending spam. The idea is to associate a second piece of information that goes along with your e-mail address. This 'secret' can be used for anything you want, such as blocking anyone who does not get the secret right.

  39. EOL SMTP by satyap · · Score: 3, Funny

    Or we could all abandon SMTP and move to my jabber-based email "solution".

    What? Where is it? Oh, I'm still working on it. You can send and receive, but buddy lists are not implemented yet.

  40. How about DSPAM ? by MeerCat · · Score: 2, Interesting

    I quite like the idea of this greylisting, but it seems a lot of spam is nowadays being sent as DSPAM (cf DDOS) or Distributed Spam. A spammer infects a load of broadband machines with a simple trojan, and then calls upon a number of the trojans to send an email spam via that machines normal MTA (ie for most windows machines it uploads to the ISPs mail servers).

    I know this is happening as some complete bastard seems to be doing this using my domain as a "From:" address (well, [random-word]@schmerg.com), meaning that I'm been getting about 30 or 40 bounce messages a day for the last 2 or 3 months now. And although the odd sending IP is repeated, mostly they're all from different IP addresses. And of course I'm getting perfectly valid looking bounce messages from perfectly reasonable companies (and only a couple of abusive replies so far).

    Now the problem is that the email is being uploaded to thier (non-open-relay) ISP's mailserver that will retry properly, and anyone else looking at the IP address will see a perfectly reasonable IP (the spammer seems to gave infected a lot of AT+T customers, ComCast customers, etc.). So short of blocking spam on subject, this spam is harder to prevent in the first place.

    I've semi-automated a process to report the infected machines (that provoked a bounce message) to the appropriate ISP, and seem to havign some success in getting the ISPs to contact their customers, but I think this new form of spamming using a distributed attack will be particularly hard to block.

    Anyone with a great idea (or who knows more about this scheme, or the identity of the twat behind it) I'd love to hear from you...

    --
    T

    --
    I spent a lot of money on booze, birds and fast cars. The rest I just squandered. - George Best
  41. Spammers don't care about defeating the top 5%. by KFury · · Score: 2, Interesting

    Evidence has shown that email harvesters don't even *try* to parse even the simplest email obfuscation techniques (kevin at fury dot com, or changing the @ to an html entity).

    These are widespread practices and yet spammers don't care about spending the effort to reach that 5% who so adamantly don't want to be reached.

    Unless greymail is used by more than a quarter of *all* email readers, history has shown that the spammers just won't care.

  42. Carnivore? by dsilver · · Score: 2, Insightful

    It seems one side effect of this approach is that it records the sender and recipient of every email sent through a particular mail server.

    Sound familiar to anyone?

  43. That's a good point by SuperKendall · · Score: 3, Interesting

    Anything that makes spammers do extra work seems to make it more likley that they will have patterns that can be observed and then blocked before mail really goes anywhere...

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
  44. SpamAssassin by ajs · · Score: 3, Insightful

    The comments in this paper about other systems ignore one of the oldest and largest SPAM filters: SpamAssassin.

    SpamAssassin can also be used at the MTA-level, and while this tool might be an interesting test to integrate with SA, its claims that other systems cannot feed back to the sender that their mail has been blocked is flat-out wrong.

    Most people do not do this because you are almost certainly getting this mail through a relay, and that relay is going to get the SMTP temporary error and try to send a warning to the user who sent it. Spammers regularly slam my home mail server by using my address as the "From" in an entire batch of spam. It's pretty seriously annoying to get that deluge of junk, and it's not really necessary. If your spam system just identifies spam and lets the user (or sysadmin) decide how to deal with it based on how "spamish" it is, you get a much more reasonable behavior.

    I junk thousands of pieces of spam every week, and I *never* junk valid mail. Yes, I do have some spam in my inbox. Most of it is tagged as potential spam, and I delete that after cursory inspection of the from addresses. Some of it is missed, and the overhead that I suffer having to identify that myself is amazingly low compared to not being able to read my mail prior to SA.

    Check out SA. The latest version is pretty impressive, and if this "new" technique (I don't think the idea of tracking connection quality is very new, it's certainly done in SA to some extent) turns out to be useful... well SA works on much the same principal as Perl: There's More Than One Way To Do It. Bayes, Blacklists, Whitelists, Obfuscation detection, Checksum trackers, you name it, SA uses it. None of these techniques gets to say "this is spam", they all just get to poke a message in the direction of being spam or non-spam. This leads to something far more reliable than any one techniqe.

  45. Limit Criminal Penalties by Lafe · · Score: 2, Interesting

    I once heard a story once that is probably false (you never know), but contains an interesting idea on how to end spam.

    It says that Mississippi tried to outlaw "flag burning", and the law was struck down as unconstitutional. So the Mississippi legislature responded by limiting the maximum penalty for assaulting a person who is engaged in flag burning to a $25 fine.

    This sounds like a fine idea on how to handle the spam problem.

    Create a law that states that the maximum penalty for physically assaulting a spammer is a $100 fine. I know more than a few people who'd be willing to pony up and take a whack at them.

    Though this law would probably be far more satisfying than sensible.

  46. HOW SPAMMERS WILL DEFEAT THIS: by hoggoth · · Score: 3, Informative

    I really like the idea, and I think it will work wonders (if you are willing to accept your nearly-instant email now having approx. 1 hour delay).

    However, here is how the spammers will adapt:
    MailBlast 2.0 will send each mail TWICE. The first time to get the triplet on the greylist, the second time to actually send it. Or a little more sophisticated, it will run in one-hour blocks, and at the end of the hour, re-run the previous hours emails. No queue or other real-SMTP server functionality necessary.

    --
    - For the complete works of Shakespeare: cat /dev/random (may take some time)
  47. Re:Reference for that paper by pclminion · · Score: 2, Informative

    I found a copy of the final draft online: Learning Spam: Simple techniques for freely-available software. The paper covers several machine learning techniques. The particular one I'm talking about here is the information-theoretic clustering and neural network approach.

  48. Wild idea, maybe ISP' should do something at by dh003i · · Score: 2, Insightful

    the sender level.

    Like, say, putting a maximum limit on the number and size of e-mails that can be sent out a day.

    Gather studies on how many people send 1 to n e-mails a day, and how many people send out e-mails of 1 byte to n bytes in size.

    My guess is, it's a pretty distorted curve, with maybe a few thousand people -- of all those online -- sending out millions of e-mails a day. The maximum most "normal" people will send a day is probably 100 (and that's a large over-estimate).

  49. My Anti-Spam Idea by _iris · · Score: 2, Interesting

    Here are my latest thoughts on winning the spam war.

    I've submitted it to Slashdot. They rejected it. Tell me what you think. I'd like reactive approaches to get discussed a bit more. If you do too, submit this to Slashdot :]

  50. Even if something like this could work... by thgreatoz · · Score: 2, Interesting

    ...which it couldn't, IMHO, due to the ... tenacity...of the spam community, I still think people would take issues with the proposed "1 hour delay". There are plenty of times when a company will send you an e-mail while you're on the phone with them (for example, RMA requests for damaged equipment), or perhaps you've forgotten the password to a particular news site and need to have it sent to you. Having to wait an hour for something that used to be near instantaneous is a less than ideal solution. My vote's for an overhaul of STMP. We'll be better off in the long run.

    --
    When their numbers dwindled from 50 to 8, the dwarves began to suspect Hungry.
  51. Listservs now well served by greylisting by LandGator · · Score: 2, Interesting

    Pardon me for being clueless, but I don't see in this concept a description of what happens when a posting from a listserv or other e-mail list goes to a new subscriber. Mail bounces back to the listserv, right?

    Well, the first e-mail to a news subscriber is often the e-mail required to confirm subscription. No reply, and the subscriber is plonked.

    Sounds suboptimal to me.

    So, you whitelist the listserv machines... until one of them has to change IP addresses. Whoops! No umpteen bazillions of e-mail messages go no where.

    I'm sure that listserv admin would find the idea suboptimal about this time.

    Nice idea, but No Slack, as far as I can see.

    --
    There is nothing wrong with yr Internet. Do not attempt to adjust the picture. We are controlling the transmission - NSA
  52. Anti-Spam Techniques: Honeypot spam detection! by mabu · · Score: 4, Informative

    Aside from the obvious of getting the authorities to crack down on the existing illegal activities (relay hijacking, violation of TOS of ISPs, header forging, etc.) which is the only true solution, I think there are much better approaches than this "greylisting" method.

    The problem with the greylist method is it still slows down mail service, and potentially more than the relay blacklist features. The objective here is that end-user/networks should not be penalized in the fight against spam. We already waste too many resources, and according to my latest mail server stats, more than 65% of our inbound mail is UCE. I'm fed up with more than half my e-mail bandwidth being crap my users didn't request so more resource allocation on a local level in the fight against spam is counterproductive!

    Here's a very clever, much more practical method I cound recently.

    A company is Canada has set up what it calls SORBS: Spam and Open Relay Blocking System.

    What's different from their blacklist is that they maintain "honeypots" strategically located around the Internet. These are servers they specifically set up as inbound mail relays, but never for legitimate purposes. If the servers get [select] mail activity, it's assumed to not be legitimate and it flags the source as a potential spammer... it makes a lot of sense. You create a domain name, but don't promote it in any legitimate manner, and/or you seed spam lists with these e-mail addresses and then let the spammers send to your key systems around the internet and *bam*, they're identified in real time, and then added to a blacklist.

    I really like this idea. Like any other system, it has the potential for abuse but the beauty is the identity of the honeypot systems is kept secret, so it's very difficult for anyone other than spammers to exploit the network.

  53. Secret algorithms vs. secret keys by Max+Threshold · · Score: 2, Informative

    Open-source crypto works because the secret isn't the algorithm, it's the keys. In this case, the secret is the algorithm. The entire scheme can be circumvented by someone who knows how it works.