Slashdot Mirror


The Growing Field Guide To Spam Techniques

Aneusomy writes "From Activestate: 'Compiled by Dr. John Graham-Cumming, a leading anti-spam researcher and member of the ActiveState Anti-Spam Task Force, the ActiveState Field Guide to Spam is a selection of the tricks spammers use to hide their messages from filters, providing examples taken from real-world spam messages.' The hope is that Activestate and others can contribute to continually expand this guide, so that anti-spam filters improve."

40 of 321 comments (clear)

  1. "Tricks?" by agent+dero · · Score: 2, Interesting

    I also thought it was pretty easy to spot and eliminate SPAM offering my mom to "Add 3inches to your penis today_________________12312vxas"

    Or to eliminate javascript enabled e-mail.

    SPAM is not quite a science. It's skript kiddie stuff, meaning it's not too hard to do just some open relays, and mass e-mail lists you can buy from AOL.

    --
    Error 407 - No creative sig found
    1. Re:"Tricks?" by wiggys · · Score: 5, Interesting
      You miss the point comletely. Any reasonably normal intelligent human being can spot and delete spam - that's never been the issue. The point is that spam is annoying and can be very time consuming for a human to deal with, which is why computerised spam filters were created.

      The first generation of spam filters were crude and simplistic - they would delete an email based on the sender, or maybe one or two key words. This isn't effective because spammers rarely use their own email addresses in the "Reply to" field, and deleting all email which contains the words "marketing" or "investment opportunity" is likely to delete legitimate email. Besides, spammers can easily get around this by altering words in such a way as to delete filters (V*I*A*G*R*A is easily read by a human but a computer looking for "viagra" and "viagara" would not stop it)

      The best spam filters today use Bayesian filtering to eliminate spam: you train the filter by giving it a pile of email and telling it these are genuine, and another pile and saying these are spam. The filter then looks through the mail and gives certain words a weighting - if most spam contains big red letting with words like "investment", "click here to be removed" and "penis enlargement" then it would score highly and be given a higher probability of being marked spam. Email containing words with your name in it, or words relating to your life or work, would be given a higher probability of being called spam.

      And for crying out loud, "spam" is not an acronym so stop writing it in upper case!

      --

      Sorry, but my karma just ran over your dogma.

  2. ActiveSpam? Real world spam? by jkrise · · Score: 2, Interesting

    From the article:
    the ActiveState Field Guide to Spam is a selection of the tricks

    The words Active, Smart, Rich etc. are part of MSspeak - leave a bad taste..

    providing examples taken from real-world spam messages.

    Why not fictional world spam messages? You mean, all those enlargers I got over mail weren't real-world! Boo-hoo....

    -

    --
    If you keep throwing chairs, one day you'll break windows....
  3. Does not explain purpose of trick by PCP · · Score: 1, Interesting

    Many of these description shows how spammers try to hide text. Why would they do that? Isn't the whole point that we should read the spam?

    I assume spam-filters reads the whole e-mail anyway, so trying to hide text in a would not accomplish anything.

    Or are spammer just stupid?

    1. Re:Does not explain purpose of trick by BFKrew · · Score: 3, Interesting

      From what I gathered, it demonstrates two things:

      Firstly, the techniques spammers will use to display the text in the email so that the end user will be able to view the text in the email.

      Secondly, it demonstrates how using the above approach they are trying to trick spam stopping techniques from working. For example, instead of having a email titled "Free viagra" you could write it as "F*r*e*e V*i*a*g*a*r*a" in an attempt to stop a spam stopper from spotting Viagara as easily in the title. In the body of the email you could write the html in such a way that decifering any words is quite tricky, eg writing Viagara as (font size="2")V(font size="2")iaga(font size="2")ra(/font) etc. Certainly to say spotting all variants of 'hiding' such words is not as simple as you might first think.

      It certainly gave me an interesting insight into the problem that it is, and how the spammers are trying and continually evolving their techniques to ensure they can carry on.

  4. Getting worse by BenjyD · · Score: 5, Interesting

    I've definitely noticed that my spamassassin filters are getting less effective. Six months ago, it was rare to see a spam that didn't get caught. Now maybe 10-20% get through.

    As I use a sensible email client that doesn't render HTML by default, I can't even read the text of the spams anyway.

    1. Re:Getting worse by Ed+Avis · · Score: 4, Interesting

      Yes - it looks like the majority of the 'spammers' tricks' listed are silly HTML tricks. From the messages I receive, a good rule of thumb is that HTML format implies spamminess. It might be different if you regularly have to communicate with Outhouse users.

      HTML rendering was added to Pine only fairly recently. Given the quantity of HTML spam out there, it might have been a mistake.

      --
      -- Ed Avis ed@membled.com
  5. My approach by gowen · · Score: 5, Interesting

    Bayesian filters are all well and good, and are -- for now -- effective. But given these tricks, the only really reliable approach I've found is IP blacklists for repeat offenders. If your machine is used to spam me, and my complaint letter is not answered in a satisfactory way (i.e. an email saying "We are sorry. The spammer has been cut off") I don't accept mail from you any more.

    And if you're on ATTBI, or Comcast, or PBI.net, or BT Openworld, or Chello, or any number of large ISPs with too much tolerance for spammers, and you're not on my whitelist, I can't read your emails.

    And I don't care. Get a ISP who don't shelter spammers.

    --
    Athletic Scholarships to universities make as much sense as academic scholarships to sports teams.
  6. Render the HTML then use OCR by thelandp · · Score: 5, Interesting
    Here's a crazy idea... (but is it crazy enough?)

    All of these spamming techniques seem to involve visual tricks, because the rendered HTML is viewed in a very different way to a human than the plain text would be seen by the filter. Things like zero-height fonts, or white-on-white text, or just using one big image etc. etc.

    So how about this: I think every single one of these tricks would be defeated by using this process for filtering spam:

    1. Render the html to an image (not on the screen, just behind the scenes)
    2. Feed the image into OCR
    3. Then scan the OCR text for spam

    Sure OCR is not perfect, but since these techniques are imprecise already, maybe it would work well.

    Although I guess processing power is a limiting factor, but maybe someday this will be worth doing.

    --

    -- the only thing we have to fear is really scary things
    1. Re:Render the HTML then use OCR by hacker · · Score: 4, Interesting
      You could also just take the HTML, run it through a series of Perl modules (XML::LibXML, HTML::Lint, HTML::Clean, HTML::FormatText, etc.) and return just the textual representation of the content itself, and then scan/score that.

      Doing so would then compress whitespace, remove colors, and basically un-SPAM the SPAM. I do this for web content, which I need re-rendered as text-based articles before they are sent to the client. It's about 12 lines of Perl, and can be easily stuffed into a SpamAssassin milter. If you want some working code, feel free to contact me (I'm also for hire, so I can do this as c custom gig for you or your company).

      In fact, you could probably put a small function in your milter to just strip all HTML entirely, before the client ever sees it. There's no need to use OCR (and the overhead associated with it) to handle this, just turn the HTML back into text. It works with foreign, encoded, obfuscated entities, and should be no problem to correct before scoring.

    2. Re:Render the HTML then use OCR by Ben+Hutchings · · Score: 2, Interesting
      Doing so would then compress whitespace, remove colors, and basically un-SPAM the SPAM.

      That would defeat obfuscation of spam keywords. However, many of the tricks (such as using identical or similar colours for text and background) are ways to include un-spammy text that the filter will see but the human recipient won't. Converting to plain text leaves them in, but they should actually be ignored.

  7. insider help is the key. by professorhojo · · Score: 5, Interesting

    i had a friend who recently turned to the dark side and now boasts that his circle of friends include the biggest spammers in the world.

    and believe it or not, the biggest break these guys have had in the past year has been help from people on the "inside".

    to give you an example, an ex-AOL employer has written them a little proggy for these guys to send messages that makes the AOL mailservers think that the mail originated on the inside of the network (which means that none of it is spam checked or filtered.)

    their usual 10% deliverability to AOL.com suddenly went to 100%. make no mistake -- that was worth millions to 'em.

  8. Easy Solution by grennis · · Score: 3, Interesting
    If you try to keep up with HTML tag tricks, you will always be one step behind.

    Why not have your spam filter render the HTML in an offscreen buffer (using existing browser/plugin API's), than pull the straight text out of the rendered document and run the filter on that?

    1. Re:Easy Solution by iapetus · · Score: 3, Interesting

      Why not just ditch the whole sorry concept of HTML e-mails? Seems like a better solution to me. Can't quite do that yet, but as a bare minimum HTML image tags (and anything else that makes a request automatically to a remote server, thus confirming the validity of your e-mail address) should be ignored.

      --
      ++ Say to Elrond "Hello.".
      Elrond says "No.". Elrond gives you some lunch.
  9. What a waste of effort by Zog+The+Undeniable · · Score: 3, Interesting
    If spammers have to go to such great lengths - and some of this stuff is admittedly clever - to get spam through, has it not dawned on them that 99.9% of people don't want to receive it? Perhaps we should ignore the spammers and target the 0.1% of idiots who actually reply and end up buying "generic Viagra" and septic tank cleaner. It reminds me of that Simpsons Hallowe'en episode with the giant advertising figures destroying Springfield. If everyone ignores them, they will die.

    I still favour going after the people paying the spammers rather than the spammers themselves...unlike the big spam rings, they at least have to be locatable, otherwise they'd never be able to sell you stuff.

    --
    When I am king, you will be first against the wall.
  10. Spammers using the anti-spam tools by dimer0 · · Score: 4, Interesting

    I helped this lady out who had a 100% opt-in mailing list, but some people weren't getting their mailings... We came to find out the emails were being flagged as spam, so, I set up a dummy email account for her than took every inbound message, sent it through spamassassin (with verbose reports, etc) - and then sent the email back to her.

    Now she can see if there's a problem with the headers, the content of the email, etc - so she tunes the email to get the lowest spamassassin score. (You know, the last major version of spamassassin took off points if you put your email client header as being Mozilla! Hah.. That one is gone now)..

    This lady definitely isn't a spammer tho, just someone with a small mailing list of 100% opted-in people.

    I'm sure spammers do the same thing. I would.

  11. Re:HTML mail is evil by Quixote · · Score: 2, Interesting
    I don't remember ever receiving an e-mail that actually had any content requiring it to be HTML.

    Until recently, I thought so too, till I ordered a laptop from HP. Their ordering system sends all the notices (order being processed, shipped, etc. etc.) in only HTML.

    One would think that a company like HP with its resources would know better, but... <sigh>

  12. Re:Does making this public help spammers? by dillkvast · · Score: 2, Interesting

    Don't agree. This is sorta the same as the idea behind "full disclosure" of security issues. The underground know all the tricks, and thus it is better that the sysadmins out there also have some idea of whats going on. This keeps us (the filtermakers more exactly) one step closer. Alot of these filters are OSS anyway. So the spammers can design there spam to circumvent the filters. They can even buy properitary filters and just test against them when designing spam.

    --
    Scitne aliquis remedium potimum crapulae?
  13. Use NOT for a filter by TheVampire · · Score: 2, Interesting

    My filter works 100% of the time. If the mail does NOT include a certain series of letters and numbers, then the mail is deleted. The people that e-mail me know to include that in the mail, so their stuff gets through. Of course, if you want to subscribe to lists, then this sort of thing won't work.

    1. Re:Use NOT for a filter by Fuzzums · · Score: 2, Interesting

      also this will only work for private mail.
      i can imagine a (not-spamming) commercial website telling people to put "qwerty" in their e-mail. not.

      but the idea is whitelisting. only allow a selected group of people to send you mail.

      for a company i can imagine the use of a html-form to "send" mail. for spammers it would be too much trouble to find a lot of those forms and write scripts ao spam them.

      --
      Privacy is terrorism.
  14. I noticed a new one recently by AssFace · · Score: 5, Interesting

    It isn't that this new one that I saw was all that amazing an idea, I just hadn't seen it until recently. It is such an obvious idea that I don't know why I haven't seen it until more recently.

    They send the mail as you. Fake the headers and make it look like it is from you. To you. From you.

    I had our local setup here allowing in anything that was from our domain. Now I have to stop that.

    I suppose the spammers saw that people were allowing their own domains and set it up that way.

    On a side note and not all that related, I've noticed that I am getting (about once a week) an e-mail from a bank - citibank, or wells fargo, telling me that my loan application has not been approved, see details attached.
    Now, I haven't been applying for loans, and the file attached is a *.pif file... which are notorious for being viruses, and not a format that a bank will send you.
    Not to mention that looking at the headers, they usually come from attbi.com which is cable modems, and I have seen through Compuserve as well - which aren't exactly how banks usually do business.

    --

    There are some odd things afoot now, in the Villa Straylight.
    1. Re:I noticed a new one recently by realdpk · · Score: 2, Interesting

      What's most impressive about those .pif spams from "Wells Fargo" and "Citibank" is that the spammer uses good grammar and spelling. This is an incredible leap in spammer technique that I'm surprised has not received more attention.

  15. Follow the money by SirLanse · · Score: 3, Interesting

    Someone is paying the spammers to spam. They usually have a URL in the email. Set up a screen saver to DDOS the payer. FOLLOW THE MONEY, make it bad to buy spam.

  16. SPAM filtering by ajs318 · · Score: 2, Interesting
    By cunning use of procmail recipes and ten-minute perl hacks, we can implement a spam filter as follows.
    1. Check headers for signs of relay-misuse.
    2. Strip out anything between <mustang> signs; s/(\<.*\>)//g;
    3. Strip out all remaining punctuation.
    4. Use a tr/// to convert accented characters to unaccented.
    5. Recall that when used in a scalar context, s/// and tr/// return a count of successful changes made.
    6. Check for certain words in the munged text.
    We can assign messages a score based on how many "nasties" were removed as compared to how many would be in a legitimate e-mail. Then despatch to one of three mailboxes: one for stuff we are sure is legit, one for stuff we are sure is spam, and one for stuff where we aren't sure. If we wanted to be really paranoid, we would strip out image links and JavaScript from HTML e-mails. It's not inconceivable that an image link could actually be a link to a CGI script with a unique identifier embedded into it, for the purpose of alerting the spammer that copy # 31337 {faute de mieux} of the message went to a working e-mail address. {Possibility for mischief?}

    And if we were an ISP, doing this on a public server, we would allow our customers to send abuse notifications to the appropriate server owners {for all the good it's likely to do} with just a few clicks.
    --
    Je fume. Tu fumes. Nous fûmes!
  17. Why do they try to trick the filters? by fungai · · Score: 3, Interesting

    Someone please explain. People who have spam filters on don't want receive spam, and will most likely just ignore/delete any spam that does get through. Why do the spammers waste so much time trying to get past the filters? Is it to reach the unwashed masses behind ISP filters?

  18. Re:Block spam by halr9000 · · Score: 3, Interesting

    I would try harder on POPfile. No offense, but you probably did not train it very well. I'm up to greater than 97.7% correct filtering with POPfile.

    Besides, who wants to switch mailers to block spam? That's kinda drastic. You can use POPfile with any mailer. (Haven't tried TB, but I'm a big fan of FB.)

  19. But does it need to be perfect? by JanneM · · Score: 5, Interesting

    I have on occasion misclassified mail myself, both ways. A few spams (uncolicited bulk emails) have been full enough of content that I have found interesting that I only after reading it realized this was not from anybody I knew. Conversely, I have a couple of times received mail which was for me , and was genuine, but so poorly formatted (lots of obnoxious html, strange subject and so on) that I deleted it as spam and only later came to understand it was a serious message.

    The point is, not even I can do spam classification 100% correctly. It would be a tall order indeed to have an automated tool do it. But does this matter? There are two issues: discarded genuine mail, and non-caught spam.

    Discarded genuine mail is not really as big a problem as people make it out to be. Mail is inherently not guaranteed; messages do fall between the cracks now and again. Swallowed by a buggy server, lost in limbo as a network connection goes down, never having a chance due to a misspelt or obsolete address, sent on a wild goose chase due to a temporary DNS error. Mail do disappear. Everybody knows that - or should know. Mistaking a mail for spam is just another crack for it to fall into. As long as the rate is low there really is no problem. And those doing mail that can easily be mistaken for spam will wise up eventually, as they see a disproprtionate amount of their email get lost in the ether.

    Missing spam is no real problem either. The big issue is having fifty spam in your inbox every morning, with another fifty arriving during the day. Having one or two a day, on the other hand, is not that painful.

    The point is, it is not a binary system: A spam system that misses two spams a day is better than one that misses five, and vastly better than having no system at all. Similarily, one that classifies one genuine message out of a thousand as spam is no disaster. Not good, but not a reason to shut it all down either. If reliability is _that_ important, what are you doing using email in the first place?

    Filtering isn't perfect. It won't ever be perfect. That's quite alright. Saying a technique is worthless because it makes an occasional mistake is throwing out the baby with the bathwater.

    --
    Trust the Computer. The Computer is your friend.
  20. TMDA by TheSync · · Score: 4, Interesting

    After a while, SpamAssasin's false negatives and positives drove me to the Tagged Message Delivery Agent (TMDA).

    TMDA has flexible whitelist and blacklist capabilities. But the big win is that it can be set to autoreply to anyone not on the whitelist, and require them to reply back before allowing the email to get through. Of course, very few spammers have valid return email addresses...

    This may seem drastic, but in fact it has made life soooo much easier. It also helps you to "automagically" get off those email lists you signed up for a long time ago, don't really care about, and are too lazy (or lost the info) to sign yourself off ;)

    The only sad thing is that no longer do Russian women want to extend my length or give me free money or viagra, and I am no longer in contact with Ms. Sesse Seiko from Uganda...

  21. The key difference. by alistair · · Score: 4, Interesting

    The key difference is that KMail does this on a per message basis, whereas in Mozilla this is set once in Preferences and I suspect the same is true in Evolution. Thus looking at a HTML message I just received I get the following in a box at the top of the message;

    "Note: This is an HTML message. For security reasons, only the raw HTML code is shown. If you trust the sender of this message then you can activate formatted HTML display for this message by clicking here."

    The HTML code follows and a single click turns it into a fully rendered message, or an alternate click consignes it to the trash can.

    It may be possible to add this as a mozilla mail / thunderbird toolbar, and as Thunderbird takes off I hope we will see this type of quick prefs bar develop to the same extent they have been developed for the mozilla browser component.

  22. Re:Intresting article by DukeyToo · · Score: 2, Interesting

    Actually, your last statement (or is it a tagline?) has been shown to be incorrect! Bayesian filters can actually be better at sorting mail than a live person. Probably because they do not use a fixed set of rules.

    A while ago when I was researching mail classification techniques, I saw a study that compared the accuracy of some classification techniques. The study took mail that had been manually classified, and compared that to how a several trained filters classified the mail.

    They found, as a side-note, that the filter actually did a better job than the people they got to manually sort the mail!

    I'm not much for the details, so no URL for the study, sorry :(

    In any case, reading emails manually defeats the point, especially for my poor mom who is horrified by some of the messages she recieves.

    --
    Most writers regard truth as their most valuable possession, and therefore are most economical in its use - Mark Twain
  23. New tech by JMP3 · · Score: 3, Interesting

    Some time ago a new way for filtering spam has been discovered. Solution is simple, yet brilliant - we already have those "To confirm you're not a script, please type the text shown in this image" at various websites to guard against form-submitting bots. Apply this to email (bounce back all emails with image attached) and all the spam is gone! Not that it is a perfect solution (I wish there was...) as I see 2 minor flaws in this system :
    1. It introduces a delay in communication - confirmation letter has to be sent and reply received.
    2. Not all recepients at the other end are *that smart* to understand "what the hell this image means and what am I supposed to do with it?"
    From the other side it can serve as lameness filter ;)

    But still a promising technology. I've searched the web and came with both subscription services Mailblocks and client-side apps Icemile. The last one is free and I think I'll stick with it.

  24. PopFile by MrEnigma · · Score: 3, Interesting

    What's awesome about the author (Dr. John Graham-Cumming) is that he not only knows his stuff, but he puts it out in his open source software called PopFile written in Python.

    PopFile can be located at http://popfile.sourceforge.net.

    I am currently using PopFile, with an accuracy of 98.26% from nearly 8,000 messages. It's the best I've ever used, and it's free!

    --
    GeekWares - Buy and Download Today!
  25. That would require we be able to find them by Moryath · · Score: 2, Interesting

    The trick is: the Spammer, him/her/itself (well he/she WILL be an "it" if I ever find them), wants to be completely transparent.

    They send mail. You see mail. In their depraved mind, you then deal with company that commissioned mail.

    First of all, I want to strangle the people who commissioned said mail, especially mr. "Free golf wedge, best in world" and the fuck from K-Mart marketing who bought a cd full of email addresses and added them to K-Mart's bluelight email list.

    However, that's not the point.

    Think about how we filter. In order to have a realistic opt-out sequence, we have to be able to reach the spammer back. Either by email, or clicking a link, or something of that sort.

    The MOMENT something that static is in the email, however, ISP filters will catch it and promptly ban any email that they send with that indicator tag in it.

    See the trick? It's all based on evading filters. You can't legitimately provide an opt-out solution, because then that becomes an identifying tag for people to filter you away.

    And the last thing spammers want to see is people actually opt out anyways, because if they WERE honoring it, they couldn't claim to be mailing to 50 million people. They make their cash partially on the claim that they reach a huge number of people in order to get responses from a smaller number, just as TV shows do with ratings and ads.

  26. Where's the profit in hiding? by netringer · · Score: 2, Interesting
    One thing I gotta know: If the spammer knows I have no interest in the say, "Herbal Viagra" prodct he's pitching, why does he think that if he says he's selling "V A 1 G R A" it'll be different? Am I supposed to go for that message and BUY THE PRODUCT now?

    I'll answer my own question a bit: After seeing one of these scumbags on TV it's obvious they get off just watching the counter increment saying that he just sent 4,123,456.890 more messages while he watched. They don't really want you buy or do anything. They just want to send the garbage.

    --
    Ever dream you could fly? Get up from the Flight Sim. I Fly
  27. Re:IP Blacklists are the way to go... by acceleriter · · Score: 2, Interesting

    spamcop.net was pretty cool, before Julian got in bed with Cyveillance. Now I wouldn't touch them with a ten foot pole.

    --

    CEE5210S The signal SIGHUP was received.

  28. metaphone mapping text by joeldg · · Score: 5, Interesting

    You can use the metaphone algorithm (I use PHP so, http://us3.php.net/manual/en/function.metaphone.ph p) which has come in handy.. Just strip all HTML and de-urlencode then run this on the msg, it totally ignores numbers and punctuation and any letters that are not in (a-z A-Z). You will need to have a database pre-made full of metaphone values from a dictionary then start a comparision and you can get a general feel for the msg.

    I took all the words used in a product called spamassassin and used that to do a comparison.. Coupled with bayes filtering I imagine this would be pretty much the best way to filter mail.

    It is kind of an interesting approach based on what mail "sounds" like vs what it actually contains.. If you filter on the straight contents these guys will just keep coming up with different ways of encoding and generally being twitchy.

    However, their mail will *always* have that "buy this!" kind of sound.

    I built a system a while back that was processing all double bounces from three servers and handled around 50k/day spams and came up with some interesting results.

    If anyone is interested I'll dig up the code and place it on my site with the rest of the stuff there.

  29. Re:No, no, no... look at this another way by Urchlay · · Score: 4, Interesting

    > One final piece to the solution is to get ISPs to act responsibly, and block egress traffic on port 25 for dynamic IP addresses

    Some ISPs do this already.

    <rant topicality="50%">
    That'd be fine, if said ISPs would allow their users to relay mail from addresses other than $user@isp.com... but for various reasons (commercial? political?), they don't.

    In other words, I can't send mail via my $50/mo. cable modem at all, unless I want to use the account assigned to me by my ISP (and sold to spammers, no doubt). I prefer to use an address at a domain I personally have registered and for which I personally control the SMTP server. For one thing, my ISP may change: I may decide to get DSL instead of cable, or I may move to an area served by a different cable ISP, or (this has happened to me recently) my cable provider may get bought out by another company, and change the domain name... or any number of other things... but my domain and my SMTP server won't change, so nobody even has to care what ISP I use, and I don't lose legitimate mail due to the address changing.

    Unfortunately, my ISP, in its attempt to stop me from sending spam, has restricted me to using only their SMTP server (blocked egress on TCP port 25, as suggested by the parent), but will not allow me to send mail via their own SMTP server using my own (valid) email address (which I do not wish to use for reasons already explained)...

    The only solutions here are some sort of VPN to the network where my SMTP server lives (at work), or else ssh to the SMTP server (which is what I actually do, but it's inconvenient).

    I've offered to pay my ISP for `business class' cable service, but they *don't offer it*. I've attempted to get DSL, but am too far away from the CO. I'd love to have a choice of ISPs in my area, but cable companies are local monopolies in the country where I live... and thanks to the shakedown in the market, they're getting to be multi-state monopolies. I'd have to move *many* miles before I could get cable internet service from a different provider.

    I'm not claiming anyone's deliberately conspiring to limit my (or anyone else's) freedoms. I guess what this boils down to is that so many people have pissed in the pool that we've now got on-duty cops as lifeguards... sorry, that's a rotten analogy, best I can do at the moment.
    </rant>

    OK, I feel better now, sorry about that.

  30. Re:HTML mail is evil by hacker · · Score: 3, Interesting
    Funny. A couple posts up in this very thread you posted a couple of lines of sendmail config to do exactly this, bounce HTML mail. So which is it?

    As you know, blocking mail at the MTA is not a bounce. "A couple of posts up", I posted a bit of a sendmail hook that blocks (i.e. rejects before receipt) mail with the Content-Type of text/html. That is not a bounce. I am not regenerating an additional email, which would be sent to an incorrect (in most cases, innocent) recipient.

    Starting yesterday, my mail server has been thwarting an attack from 2,734 separate external machines, all trying to send a message to 3 non-existant users on 1 domain that I host which has 0 mail accounts, no website, and no users behind it. It's a registered domain pointed to my IP address, nothing more.

    So far today, we've received 15,833 separate attempts to send mail from these 2,734 hosts that my server has blocked (with a quick virtusertable hook to send them 'nouser'). The number of unique external hosts has been slowly increasing. It was 1,633 at the end of yesterday, and has now grown to 75% more than that number, up to 2,734 as I type this.

    THESE are bounces. Clearly someone has sparked off a trojan somewhere that was lurking inside a LOT of companies in a lot of machines (some of the domains are worldbank, dell.com, aol.com, etc., CLEARLY not spammers inside these companies, not THIS many of them) who are now trying to send this one message to these same 3 non-existant users at this 1 domain.

    I just checked again, from the time I started typing this reply, and we're up to 2,746 hosts trying to send this 1 spam message to these 3 non-existant users.

    So trust me, I'm well aware of the difference between blocking a message and bouncing a message.

    Are you?

  31. Re:HTML mail is evil by dwsauder · · Score: 2, Interesting
    Oh, I have to agree with you.

    Now, here's a funny story. I was at the FTC Spam Forum a while back. There were some of the more responsible email marketers there -- you know, the ones that send out regular newsletters for opt-in subscribers -- and they were whining and complaining because spammers have spoiled "rich" email for them. Just a few years back they had visions of eventually being able to send email with flash, animated graphics, fancy styles, and so forth. And now they realize that people don't want to receive those kinds of emails because of spam (and to some extent viruses). So they whined about it. I guess for them email is "push" marketing, while for the rest of us, email is a way to communicate with co-workers and friends. Who needs HTML to say "wanna go get some lunch?"

  32. False Positives by Arpie · · Score: 2, Interesting

    Yeah, I hate spam as much as the next geek. However most people don't stop to think about the black side of spam filters: false positives.

    I use spamassassin and Mozilla's bayesian filters, they do get rid of a lot of spam, but they also do get some false positives. This means I have to check my spam folder every so often, which kind of defeats the purpose, doesn't it?

    Moreover, email is not only a personal communication tool anymore. Do you buy on-line? Do you expect an order confirmation, or a shipping confirmation? Well, it's quite likely that those could be flagged as spam by spam filters. It just happened to me yesterday on an ebay winning bid notice, because the subject had an exclamation mark. Businesses -- you know, the kind of organization that usually pays the sallaries of us working geeks, or the sallaries of the parents of student geeks -- need to get through to comunicate with their customers. Spam and spam filters are both getting in the way.

    How bad is that? IMHO pretty bad. Spam is killing half of the advantages of using email. Filters, with the pretty much unavoidable false positives in this cat and mouse game are killing another quarter, at least. I don't know what will happen, but it's a pretty sad situation.

    --
    /* TAANSTAFL */