Slashdot Mirror


How Good is Gmail's Spam Filter?

juglugs asks: "I've been using Gmail since the first round of invites on Blogger. Tonight I received my very first spam email. It was one of the ones offering me some product (I didn't read it too much) that would increase my manhood. It didn't trouble me too much as I just had to hit the 'Report Spam' button and off it went. But how good is their spam filter? Does anyone else get much spam? Why didn't this get recognized as spam - it had all the usual 'keywords' that you'd normally associate with it."

73 comments

  1. Ask Google by Anonymous Coward · · Score: 5, Funny

    Not Ask Slashdot.

    1. Re:Ask Google by Anonymous Coward · · Score: 5, Funny

      our spam filter is second to none.

      trust us.

      -google

    2. Re:Ask Google by Anonymous Coward · · Score: 0

      with the amount of time spent on trying to fix a broken system, why not just make a new one?

      email sucks, it's easy to forge headers and all that technical crap.

      why not make another protocol other than email (maybe email2 or whatnot) and have authorization as part of it's system

      mail sends to me, i auto CAN'T accept and it bounces back. it's the first mail from that person. the person then clicks a link to get it to me.

      have it so no message CAN go through w/o being authorized to do so. ya it's a bit more time taken, but then there is no spam.

      im prob wrong b/c im not a super smart /. person. someone please show how it wouldn't work. i always thought auth would fix it. (like icq not being able to send to people not on my list kinda thing).

  2. You'll never really know by orthogonal · · Score: 4, Insightful

    How Good is Gmail's Spam Filter?

    Who knows?

    Will it get better? Will it get worse?

    Who knows?

    Part of the convenience of using GMail -- or any other email service -- is that that service filters spam for you.

    But that means you don't get to do your own filtering.

    Which is why I don't rely on my ISP to filer, and indeed, asked them explicitly to not filter my mail when they began to do so.

    I'm too paranoid about false positives causing me to miss an important email (eventually all those girls who dumped me will wise up and beg my forgiveness, right? Right?), and I figure I can do a better filtering job on the client side. And indeed, I can even use a chain of multiple filters, or roll my own filter.

    Currently I'm using SpamBayes, and it works well enough. Could it be better? Sure, it does miss several spams a day. But, it also filters many more than it misses, I'm not worried about false positives, and I can always hack the source if I need to (already did so to work around some MS Outlook stupidity, in fact).

    1. Re:You'll never really know by 00420 · · Score: 2, Informative

      GMail allows you to implement your own filters, and if you're really worried about false positives you can always browse your junk folder.

      Although, since it sounds like you're all situated anyways, switching to gmail probably wouldn't be a good idea in your situation.

    2. Re:You'll never really know by Romeozulu · · Score: 2, Informative

      I have had a GMail account for a couple of weeks, and the Spam filter is not up to the level of SpamAssassin running on my server.

      I get around 2000 Spams a day (due to many of my email addresses being on websites), and using SpamAssassin, I get 1 or 2 Spams a week in my INBOX and an equal number of False-Positives.

      I forwarded all my email to my GMail account to see how it would do. I can't really tell if it's getting any better from the first day. I get 10 or 20 Spams a day that are not caught by GMail. This number does not seem to be going down.

      When I first installed and trained SpamAssassin, it only took a few days before it was catching just about everything, and a few weeks of training got it to the current level.

    3. Re:You'll never really know by Anonymous Coward · · Score: 0

      I've been using something called the Active Spam Killer (ASK) and it's been working quite well for me, actually. The premise behind it is that it sits on the mailserver machine (meaning that you need at least shell access to put it on, and I think it uses Python), and effectively blocks any e-mail that it doesn't know should go to you, sending out a "Confirmation message". If nobody replies to the confirm message, then it eventually gets chucked. The idea behind it is that if you are spam, you're using a spoofed e-mail address and thus you will not be able to reply.

      I have actually recieved (in my inbox) only *ONE* spam message, from a person who was stupid enough to use their own e-mail address to send it through and replied to it. It was one of those stupid little "rich man seeks investment" type things... *shrugs*

      Anyway, that's all.

    4. Re:You'll never really know by Romeozulu · · Score: 1

      I forward all my regular email to GMail, to test it out under heavy load.

  3. buddy Report by Deliveranc3 · · Score: 2, Interesting

    Friend who sent me an invite reported a 100% drop in spam from 50 a day to 0.
    I have a hotmail account for spam and it goes through just fine thank you.

    1. Re:buddy Report by MindStalker · · Score: 2, Insightful

      Problem is, most people experience a significant drop in spam when they switch ISPs, atleast when that ISP isn't selling your account.

    2. Re:buddy Report by JamesDotCom · · Score: 1

      Could that be because he got a new email address? *;D~

    3. Re:buddy Report by Deliveranc3 · · Score: 1

      All mail is forwarded. in this case.

    4. Re:buddy Report by Deliveranc3 · · Score: 1

      Um he switched web based e-mail which um has nothing to do with ISPs.

      Who told you about slasdot?

  4. You are not using a finished service by GrandCow · · Score: 4, Insightful

    Gmail is still in it's beta form. The company is still working out all the kinks in it. That's why you have to go through the process of getting invited before you can set up an account. Spam is going to fall through the cracks for a while until they finish fine-tuning the filter.

    As long as it still says beta at the top of the screen when you log in, you should expect things like that. Click report spam and do your part to help the filter get finished.

    --
    "Well kids, you tried your best, and you failed. The lesson is, never try." -Homer Simpson
    1. Re:You are not using a finished service by mmonkey · · Score: 1

      Also, I guess it's possible that every so often during the beta Google will drop the threshold on their spam filter temporarily to get people to manually train their filters. There's no substitute for human-based sorting.

  5. How much personal info on signup? by Anonymous Coward · · Score: 1, Funny

    Perhaps when you signed up you gave something away and the Gmail filter is that smart that it knew to consider penis enlargement emails as legitimate?

  6. It's terrible by Richard5mith · · Score: 2, Interesting

    In a word, it's terrible.

    I forward all my email from my existing account (which I've been using for 5 years and gets a ton of spam) to my gmail account and spam always slips through. I've been using gmail since shortly after it was announced and I have seen it improve, but I'm still getting the same format spam slip through every day.

    I pick up the same mail in Mac OS X Mail, and the combination of POPfile and the Mail spam filter gets it all.

    1. Re:It's terrible by kinema · · Score: 3, Interesting
      I forward all my email from my existing account (which I've been using for 5 years and gets a ton of spam) to my gmail account and spam always slips through.
      Maybe Gmail's filters are factoring in the fact that the mail is being forwarded from an address that it knows you recieve lots of valid email from. I'm sure that the filters use the origin as part of the decision on wether or not a particular message is spam or not.
    2. Re:It's terrible by burns210 · · Score: 1

      maybe some of it slips through because of the sender's email address... if you send 100 emails from one of your old address to gmail, then accept 20 of them as good emails, and the other 80 are spam, google might be more likely to accept some iffy-spam as legit, since it comes from the same email that a lot of your legit email comes from....

      It makes sense that email it is not 100% sure is spam, may possibly slip through due to that.

    3. Re:It's terrible by irokitt · · Score: 1

      Are you *really* sure? Because not every program thinks like a human. Chances are the filter doesn't take origin into account, and that the problems occur because features like that haven't been coded into GMail (which is still beta, so it isn't complete and it's a little early to start questioning the performance of its spam filter).

      --
      If my answers frighten you, stop asking scary questions.
  7. I've gotten no spam on that account by Anonymous Coward · · Score: 0

    Of course, the only email I've gotten is the welcoming message. I suppose I should sign up for a few mailing lists. Easier than getting actual friends.

  8. Group heuristics vs. Individual heuristics by foniksonik · · Score: 3, Insightful

    Would you rather have an ISP do group heuristics, potentially marking email you want as spam, or individual heuristics, forcing you to identify email spam yourself?

    It's a trade-off... on the one hand you get much quicker and more compehensive spam detection by using a group level rule but then you have to check your spam folder to see if it incorrectly marked good email as spam and on the other hand you have individual rules which must be generated for each account based on individual opinions.

    Neither is perfect.

    Bottom line is that you're using a free service, if you don't like it you can move on w/o expense incurred.

    --
    A fool throws a stone into a well and a thousand sages can not remove it.
  9. Perhaps the problem is... by Fished · · Score: 2, Funny
    Perhaps the problem is what Gmail has seen of your tastes.

    That's sick dude.

    --
    "He who would learn astronomy, and other recondite arts, let him go elsewhere. " -- John Calvin, commenting on Genesis 1
  10. Re:I don't know. by Quill_28 · · Score: 1

    Well give me your e-mail address and i'll send you an invite.

  11. I know, and it's HORRIBLE by Anonymous Coward · · Score: 5, Interesting

    Well, actually - it's HORRIBLE.

    So far, no spam whatsoever has found its way into my inbox. However, the amount of false positives filtered into the spam folder is overwhelming.

    For a while I wondered why I only got reports by email about 30-40% of my finished online auctions (link omitted, no free advertising here). Last week I accidentaly clicked on the spam folder, and there it was, dozens of FALSE POSITIVES.

    And yeah, there is NO INDICATION AT ALL of mail in the spam folder, one have to explicitly look in it to see if there are any e-mails there...

    Sure, mod me troll if you like. I've been using gmail since the first blogger.com-invitations, and am very happy with it (and have more invites to give out than people to give them to. I tried gmailswap for a while but soon got bored).

    Still, far to many false positives. I have no idea why some auction-results were treated as spam, and others not. They're almost identical. Or perhaps it was exactly that which caused the problem, several near identical mails in a short period of time...?

    1. Re:I know, and it's HORRIBLE by Gangis · · Score: 2, Informative

      Holy crap! You're right!

      After reading your post, I decided to look into the Spam folder of my gmail account, and there they were: the 2 emails I've been waiting for for a few days. Now I can finally set up my IPv6 tunnel...

      --
      "Black holes are where God divided by zero." - Steve Wright
    2. Re:I know, and it's HORRIBLE by Anonymous Coward · · Score: 0

      Quite frankly, I don't understand what Google have been smoking. If a new mail has arrived, the inbox is relabeled as "Inbox (1)" or something similar.

      One certainly would be justified to expect "Spam (1)" to indicate that unread messages had found their way into the spam folder.

      Well, happy tunneling. :-)

    3. Re:I know, and it's HORRIBLE by driptray · · Score: 4, Informative

      And yeah, there is NO INDICATION AT ALL of mail in the spam folder

      Create a label called "spam". Anything that gmail filters into the Spam folder will also automatically appear under the "spam" label, which also shows the number of unread messages.

      I don't know why this works, but it does.

    4. Re:I know, and it's HORRIBLE by jamessan · · Score: 1

      How did you go about creating this filter? There is no way to setup a filter based on which folder gmail puts it into, yet you have to tell it to match something.

    5. Re:I know, and it's HORRIBLE by spectral · · Score: 1

      I don't think it matters. Match on from: president@whitehouse.gov. Just so long as a label exists with that name, it'll show up in your 'Labels' category. The 'Spam' folder on the left is just a special label (just like Inbox, Starred, etc.). So, if it finds something as spam, it puts it under the 'Spam' label for you, automatically, even though you don't realize you HAVE a spam label. So make one, and bam, it works.

      I never thought of doing this, but I DID notice the Spam label on all its false positives. grr.

    6. Re:I know, and it's HORRIBLE by spectral · · Score: 1

      I thought I sent this, but apparently never clicked submit. ignore my other comment, just make a label (see upper right of the inbox window), and call it spam. Doesn't need to be a filter, just to have something to associate spam with. GMail alaready assigns everything the spam label, you just don't have a way to view it yourself. I wonder if it works for the other ones? (Inbox, Starred are both just labels to GMail, I think). I'm going to go play :)

    7. Re:I know, and it's HORRIBLE by driptray · · Score: 1

      I didn't create a filter, I created a label (called spam). Gmail automagically applied this label to the mail it put in the spam folder. No filters are required.

      There's no reason why this should happen, but it does.

    8. Re:I know, and it's HORRIBLE by MooseGuy529 · · Score: 1

      It happens because "Spam", along with "Inbox" and "Starred", are real labels that Gmail uses internally to organize those folders. It's nice to see that they use a standard way of storing that kind of information.

      Basically all incoming mail gets either the "Inbox" or the "Spam" label, and starred messages have a "Starred" label. Adding them by hand is useful to see the number of messages, and you can also add manual spam filters.

      --

      Tired of free iPod sigs? Subscribe to my blacklist

    9. Re:I know, and it's HORRIBLE by driptray · · Score: 1

      Ah, thanks for this info.

      But I have noticed that if I set up a filter to apply the "spam" label to a message, that message still appears in the Inbox and does not appear in the Spam folder, although it does of course appear under the Spam label.

    10. Re:I know, and it's HORRIBLE by chaoaretasty · · Score: 1

      Gmail is still in beta, you have to expect some sort bugs for now.

    11. Re:I know, and it's HORRIBLE by irokitt · · Score: 1

      Questions are, will this kluge work once GMail goes beyond beta, or will the underlying problem (i.e. too many false negatives) be fixed?

      --
      If my answers frighten you, stop asking scary questions.
    12. Re:I know, and it's HORRIBLE by Jucius+Maximus · · Score: 1
      "Create a label called "spam". Anything that gmail filters into the Spam folder will also automatically appear under the "spam" label, which also shows the number of unread messages."

      "How did you go about creating this filter? There is no way to setup a filter based on which folder gmail puts it into, yet you have to tell it to match something."

      Labels and filters are different. He said to create a label. You don't need to set up any matching rules for labels.

  12. About 80% by driptray · · Score: 4, Informative

    I get over 800 spam a day to my domain, which I now have forwarded to my gmail account.

    The gmail spam filter is knocking out about 80% of it. I haven't bothered to check for false positives as it's enough of a hassle getting rid of the approximately 160 spams that get through to my inbox.

    1. Re:About 80% by burns210 · · Score: 1

      why don't you setup some spamassassin setup for your domain, to knock out another huge chunk of emails, as well?

  13. Not Good by willll · · Score: 1

    Gmail has only caught about 5 of the 30 spams I've gotten so far. Although I haven't seen any false positives yet.

  14. Re:How much personal info on search? by ftvcs · · Score: 1

    It was a pub based on a previous search query.

  15. Really really bad by Anonymous Coward · · Score: 1, Informative

    After reading this thread I look at my spam folder in my gmail account. Argh !! What a surprise 6 conversations from different developer mailing list where marked as SPAM. In this particular case their filter did a really poor job : 100% of the mail was valid and should not have been considered as spam.
    What a disapointment !!!!!

  16. Spam filter got all my mails! by Sediyama · · Score: 1, Informative

    I received about 100 e-mails in my gmail account, and almost all mails were marked as spam. And I didn`t received a single spam in this account yet.

  17. Wow, Such A Range by CMiYC · · Score: 1

    I'm surprised at the range of people saying its good to the people saying its terrible. I was going to say its pretty darn excellent. I use a catch-all address for my domain which I forward to wherever. Filtering on X-Forward I use to be able to catch most of the spam. However, some were so badly formatted that the headers weren't processed properly.

    When I switched to using GMail I didn't realize it at first. I was still getting 2-3 spams a day (which is was what I use to get.) Then I realized I never set up any filters or anything! On the 2nd day using it I opened the Spam "folder" and found over 80.

    I've checked the folder every so often and haven't found any false positives.

    Personally, I'm impressed with GMail's spam filter capability.

  18. why not by Anonymous Coward · · Score: 0
  19. gmail's spam filtering by Anonymous Coward · · Score: 0

    It is horrible. I've had my email account since 1995 and get a lot of spam. Gmail catches the minority of it.

    Another problem is that gmail rejects incoming email virii when receiving over smtp. Normally great (who wants virus mail) but when you are forwarding to gmail from another account, it causes a bounce back to your forwarding account, then your forwarding account tries to forward again, and.... you get an email loop between smtp servers. I had one piece of email with megabytes of received headers before I realized what was going on. Now I fetch locally, filter using clamav, then forward to gmail. I'm considering also filtering for spam to get a result that is closer to spam free.

  20. 66% falso positives by molo · · Score: 1

    I signed my gmail account up for some mailing lists. The other day, I had 12 messages marked as spam. 8 of them were legitimate messages. That gives a 66% false positive rate. There have also been a bunch of false negatives (but I don't have numbers for that). So, gmail's spam filters need a lot of work.

    A bunch of stuff is also filtered at the SMTP level. Anything with an executable attachment is dropped before you ever see it.. this is not so good for some of the security mailing lists like full-disclosure and others discussing trojans and worms.

    -molo

    --
    Using your sig line to advertise for friends is lame.
  21. their filtering sucks by Anonymous Coward · · Score: 0

    i forwarded gmail -every- email that i received for several weeks. they caught ~50% of the spam. spamassassin catches 90-95%. how they can be so bad when they have 10,000 machines on which they could run spamassassin on all incoming mail is sad.

    1. Re:their filtering sucks by burns210 · · Score: 1

      they would rather fall short on the side of too much spam getting through, then too much legit email getting marked as spam....

      The problem is, the group heuristics they use allow for a lot of people to build the filter on what is/isn't spam, which is great, until you start getting jerks who mark tons of legit email as spam, and you start to break the filters usefullness.

      Google needs, all email services need, a 2 or 3 pronged aproach... For the length of the beta, use the group heursitics filter... When it goes gold, make each member have an idividual filter, that starts preconfigured to the beta's heursitics settings.

      It also needs just a plain keyword/dumb spam search, befor it hits any bayesian mail filter, to weed out the OBVIOUS spam, and lighten the load on the more complex bayesian-style filtering system.

  22. Re:I don't know. by rk87 · · Score: 1

    in my header :) (don't forget to unspam)

    --
    I'M NOT ANGRY!
  23. My experience by fulldecent · · Score: 1
    4 Comcast accounts that I've never told anyone about are forwarded to gmail:
    • ~400 spam detected per day
    • ~3 spam undetected per day
    • 0 false-positives

    Fuck you Comcast stop selling email addresses

    --

    -- I was raised on the command line, bitch

    1. Re:My experience by Anonymous Coward · · Score: 0
      4 Comcast accounts that I've never told anyone about are forwarded to gmail:

      If you've never told anyone about them, why not forward them to abuse@comcast.net? :-)

    2. Re:My experience by sasami · · Score: 1

      I had the same experience and reaction when I saw my unused Comcast accounts get spam.

      Then I realized that all my accounts use common words as usernames.

      These days, it's probably safe to assume that any-dictionary-word@any-large-ISP is going to receive spam...

      ...where "any dictionary word" does not mean only the English dictionary, but a lexicon containing the dictionaries of several languages, plus names, proper nouns, slang, and common misspellings.

      ---
      Dum de dum.

      --
      Freedom is not the license to do what we like, it is the power to do what we ought.
    3. Re:My experience by Anonymous Coward · · Score: 0

      i have one like 34q8fhzz498@comcast.net

  24. I don't know... by Lazyhound · · Score: 1
    ...but I'll be sure to look into it for you.

    Now, about that invite...

  25. It's google by Ruis · · Score: 1

    If there's anyone that I think can create some software to intelligently determine if an email is spam or not, my first guess would be google. They seem to have a good track record of being able to sort the mess we call the Internet. Just give them some time. It's still beta after all.

  26. Gmail's spam filter depends on the user input by Jugalator · · Score: 1

    Therefore, the amount of active users should affect the precision of the filtering, since it's the users who report the spam. Since Gmail has relatively few users right now, the filtering might be relatively poor in comparison to what it will become. At least that's what my sense of logic tells me now.

    For the record... Currently, I have 0 ham reported as spam, 7 spam reported as spam, and 1 spam reported as ham. Not many know of my Gmail address at all yet.

    --
    Beware: In C++, your friends can see your privates!
    1. Re:Gmail's spam filter depends on the user input by benna · · Score: 1

      Also the fact that there are so few users probobly means spammers aren't really targetting gmail as of yet.

      --
      "It is not how things are in the world that is mystical, but that it exists." -Ludwig Wittgenstein
  27. I hope it's good... by eeg3 · · Score: 3, Informative

    Google's spam filtering better be good. Do a search on google with site:google.com inurl:gmail +"a-". Guess what you get? That's right, a huge list of gmail invites for easy spam harvesting. Way to go google.

    1. Re:I hope it's good... by Anonymous Coward · · Score: 0

      I hope that someone at Google will read the parent post because this is indeed very stupid. I hope they didn't hire too many people without a clue in their quest to beat Microsoft and Yahoo.

    2. Re:I hope it's good... by CvD · · Score: 1

      That is pretty scary. Hope not too many people read your post... did you write Google about it?

      Cheers

    3. Re:I hope it's good... by eeg3 · · Score: 1

      Indeed I did. No response yet, after almost a week. Unfortunately.

    4. Re:I hope it's good... by John+Allsup · · Score: 1

      site:google.com inurl:gmail +"@gmail.com"
      does a slightly better job

      --
      John_Chalisque
  28. Spam Filter by Anonymous Coward · · Score: 0

    It seems good. One question that's been on my mind since I started using Gmail is: "Where are the relevant text ads I've been hearing about?"

    Since I got Gmail I've been emailing my programmer friends about projects like Quake 2, Blender 3D, GTK Radiant and the like... one email we were even joking about Viagra and Penis Enlargement pills setting off spam alerts. Those keywords, I would think, would have sparked a shitload of text ads. None, though. I have yet to see a text based ad in my Gmail portal.

    Not even in my signature is there a "This Email Brought To You By The Friendly People At Google."

  29. I'd test it, if I had an invite =) by Cave+Dweller · · Score: 1

    I have several e-mail accounts that receive roughly 2,500 messages a day, each. I could forward them to GMail and tell you -- if you'd invite me =)

    mark (at) signal42 (dot) com

    TIA.

  30. Comment removed by account_deleted · · Score: 1

    Comment removed based on user account deletion

  31. Not Too Good by _Neurotic · · Score: 1

    I will second what Richard5mith said. I also forward a copy of my mail to GMail to test the spam filtering and labeling features.

    What I have found is that the spam filtering is less effective than Outlook Spam Filter, often letting more 30-50% of spam through to my inbox.

  32. Re:I don't know. by stryc9 · · Score: 1

    I must be the only geek without a gmail account. Hook me up? stryc9_AT_shaw.ca

    --
    www.madeofwinandawesome.com
  33. My personal experience by Wise+Dragon · · Score: 1

    I forwarded my email frrom my usual forwarder to gmail. I have gotten two false positives from commercial email that I subscribe to, and several spams have slipped into my inbox. All the missed spams are very similar, having a random dollar amount in the subject line, offering to refinance my house, then wrapping up with a few lines of madlib-like text.

    I have to say, some of these madlib-like phrases are quite amusing. "Japanese requester non-proprietary doesn't" anyone?

    I also got a casino offer signed:

    Best regards,
    Jamie Zawinsky

    Oh the irony.

  34. Hmmm... by zonker · · Score: 0

    Well, I have over 16,000 (yes, you read that right) spam messages in my spam folder. Why so many? Because I can't 'empty spam folder'. I've used about 70M of space on spam alone. The filter is *okay* compared to how much spam is being received each day but there are about 25-70 spams that get through on average and I receive about 500+ spams a day. It isn't great but hopefully it will get better.

    BTW, I redirect all unrouted mail from my domain to my gmail account, which is where I get all the spam from...