Slashdot Mirror


Gmail Spam Filter Changes Bite Linus Torvalds

An anonymous reader points out The Register's story that recent changes to the spam filters that Google uses to pare down junk in gmail evidently are a bit overzealous. Linus Torvalds, who famously likes to manage by email, and whose email flow includes a lot of mailing lists, isn't happy with it. Ironically perhaps, it was only last week that the Gmail team blogged that its spam filter's rate of false positives is down to less than 0.05 per cent. In his post, Torvalds said his own experience belies that claim, and that around 30 per cent of the mail in his spam box turned out not to be spam. "It's actually at the point where I'm noticing missing messages in the email conversations I see, because Gmail has been marking emails in the middle of the conversation as spam. Things that people replied to and that contained patches and problem descriptions," Torvalds wrote.

18 of 136 comments (clear)

  1. Not sure if this is first comment by Mouldy · · Score: 4, Funny

    Or if the other comment's got hit by spam filters

    1. Re:Not sure if this is first comment by Anonymous Coward · · Score: 4, Funny

      No, you weren't the first as I posted a reply to this thread
      two days ago and it wound up in Linus' inbox as a kernel bug-fix.

  2. This Just In by Anonymous Coward · · Score: 5, Insightful

    Individual that differs more than 6-sigma from the population's mean has trouble with automated tools designed for the average person.

    Gmail's spam filter is why email is still useful.

    1. Re:This Just In by Anonymous+Brave+Guy · · Score: 4, Informative

      Gmail's spam filter is why email is still useful.

      I might not be six sigmas from the population mean, but the aggressive filtering of Google's mail service is annoying me more and more. I don't use it myself, but quite a few of my recurring professional contacts do, often behind their own domains so there's no way to know until it breaks. Aside from the privacy implications of that, I'm getting awfully bored of finishing a day's work, e-mailing the results to wherever they need to go, and getting in the next morning to find a nasty note from Google that was sent back after I'd left saying my mail had been blocked because they considered something in the attached file a security risk. This is particularly infuriating if I'm working in the UK and sending the results to a contact on US time, because it costs between half a day and an entire day to catch up.

      --
      If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
    2. Re:This Just In by Carewolf · · Score: 4, Interesting

      Individual that differs more than 6-sigma from the population's mean has trouble with automated tools designed for the average person.

      Gmail's spam filter is why email is still useful.

      In my experience it is crap. Not as bad as Linus experience, but it stil mistook on 1 in 200 emails just like google says and that is COMPLETELY UNACCEPTABLE. Having to find important emails in the thousands of spam emails is a problem, and haven't seen any other spam filter with that many false positives.

    3. Re:This Just In by Solandri · · Score: 5, Informative

      Not sure why Linus and you are complaining. Gmail already has a tool for eliminating false positives. You set up a filter to automatically give any email from a particular mailing list a label for that list. It's actually a great tool for auto-organizing your email if you subscribe to multiple lists like he does.

      When setting up the filter, you make sure to check the "Never send it to spam" option.

  3. State the Obvious by freeze128 · · Score: 5, Funny

    Well, Linus doesn't *HAVE* to use gmail. There are other email providers.

  4. Re:Filters will do this by dpidcoe · · Score: 3, Interesting

    One of the ways of combating it economically is to make it require more effort to successfully deliver spam to the target recipients. i.e. using a filter.

  5. Works for me - whatever that is worth by sjbe · · Score: 5, Insightful

    Individual that differs more than 6-sigma from the population's mean has trouble with automated tools designed for the average person.

    Exactly. I use Gmail and I honestly haven't had a false positive (flagged as spam when it isn't) in over two years. I still get the occasional false negative (spam that isn't flagged) at a frequency of a few per week. It's good enough that I don't even bother to routinely check my spam filter. It also is pretty good on the training - once you've spent a little time telling it what is spam and what isn't for you in my experience it is pretty good after that. Frankly if you have to check your spam filter often it isn't a very good spam filter.

    I suspect Linus has rather unusual email requirements. Perhaps Gmail isn't the ideal solution for him. Very few tools are perfect for everyone. I'm a little surprised he's having that much trouble but stranger things have happened.

    1. Re:Works for me - whatever that is worth by Tx · · Score: 5, Interesting

      I read about this a few days ago on The Register, according to one user there, this particular issue is to do with DKIM and mailing lists (the stuff Linus had issues with was all Linux kernel mailing list messages);

      bhtooefr "Basically, Google's enforcing DKIM from certain domains, and if a message is "from" someone whose e-mail host provides proper DKIM, but it's missing it, Google (and Yahoo) servers reject it. Mailing lists aren't usually set up to properly handle DKIM (being, effectively, a relay), and therefore get rejected.

      The workaround that I saw one mailing list use was to resend the e-mail from the mailing list's address, append "via (mailing list name)" to the name on the from field, and just have both the mailing list and the original author in reply-to."

      Seems like people running mailing lists need to take a look at how spam filters work, rather than mail providers changing anything. If I understand correctly, the policy is sensible and blocks a likely spam vector, and legit mailing lists could easily be set up to not fail that particular check.

      For regular mail, I'm like you guys, Google's spam filtering does a fantastic job. I never check my spam folder any more, unless I'm expecting an email and it doesn't arrive, but it's been ages since I had a false positive.

      --
      Oh no... it's the future.
    2. Re:Works for me - whatever that is worth by war4peace · · Score: 4, Informative

      GMail started flagging Youtube newsletters as SPAM but gets confused by another filter I added manually to all e-mails from Youtube. I had created a filter which adds a label to that type of e-mail, and now GMail says "This message was not sent to Spam because of a filter you created." every time I am getting an e-mail from Youtube.

      Funny, 'cause Youtube is owned by Google.

      --
      ...gis sdrawkcab (usually not responding to ACs; don't bother posting as AC)
    3. Re: Works for me - whatever that is worth by AvitarX · · Score: 3, Interesting

      I've had this problem with small websites I run. A lot of contact forms default to using the submitter as from still, I have to edit the code that sends the mail in the module to be from the site's domain and use the reply-to.

      I started having to do this year's ago, yet very few modules let you take advantage of reply-to still. Very annoying.

      --
      Wow, sent an e-mail as suggested when clicking on "use classic" banner, and got a fast response that addressed my msg
  6. Poor Linus needs a gofundme by RonVNX · · Score: 4, Funny

    Apparently he can't afford to purchase decent email service. Maybe someday he'll create something important and then he can get off the crap freemail.

  7. Re:Boolean filters are wrong by pem · · Score: 3, Insightful

    at some time it is going to be illegal [to throw away spam]

    WTF are you smoking, and can I haz some?

    No amendment, not even the first, makes it illegal for me to throw away shit that people decide to send to me.

    Spam is a valuable resource.

    Pigshit is a valuable resource. Spam is spam. The fact that you can look for similarities in it in order to trash more of it doesn't make it a valuable resource.

  8. The bitching is slower than the fixing by Anonymous Coward · · Score: 4, Informative

    He posted about it in G+, a googler noticed and offered to look into it. One day later The Register is feeding off the echoes and the story is slashdotted.

    "Much better now.

    Of the 100+ messages caught as spam over-night, only two were false positives (and I reported them). My email is getting back to normal."

    https://plus.google.com/+LinusTorvalds/posts/dJdkRxUCRmK

  9. Already Solved by Anonymous Coward · · Score: 4, Informative

    On the next day, Linus wrote "My email is getting back to normal."
    https://plus.google.com/+LinusTorvalds/posts/dJdkRxUCRmK

  10. he's using gmail? by hymie! · · Score: 3, Funny

    Somebody should tell Linus about this great new operating system I run at home. I have sendmail running on my machine, and it lets me control my spam filters and everything.

    It's called "Linux". I highly recommend it.

  11. domain issues by v(*_*)vvvv · · Score: 4, Interesting

    From his original post, there is a clear date he claims the FP rate to have gone up... so this isn't a blanket Gmail FP rate issue, but rather a Gmail or spam blacklist incident, which is quite different from what the summary would suggest. As of right now:
    http://mxtoolbox.com/SuperTool.aspx?action=blacklist%3aLKML.ORG&run=toolpage

    lkml.org Added to UCEPROTECTL2

    Uceprotectl2 Automatically Delists Entries

    This blacklist does not offer any form of manual request to delist. Your IP Address will either automatically expire from listing after a given timeframe, or after time expires from the last receipt of spam into their spamtraps from your IP Address.

    Uceprotectl2 Accepts Payments Or Donations

    This blacklist does support a manual request to remove, delist, or expedite your IP Address from their database upon Payment or Donation of fees to their organization. Please note the following; 1) MxToolBox does not in any way advocate the paying of removal from any blacklists. 2) Removal requests that are submitted without addressing the core problem will likely result in your IP Address being relisted in the database which can cause subsequent problems and extended listing periods without release.

    More information about UCEPROTECTL2 can be found at their website: http://www.uceprotect.net/

    Reason for listing - Net 146.185.176.0/21 is UCEPROTECT-Level2 listed because 36 abusers are hosted by RCN-ASN - Reality Check Network Corp./AS46652 there. See: http://www.uceprotect.net/rblc...

    UCEPROTECTL2 seems a bit shady, but I am not blacklist expert.

    Also as a side note, any spam filter that attempts context evaluation has a tendency to mark emails with code or special character formatting as spam. Even emails with links. So for someone like Linus to have a higher blanket spam FP rate is also not surprising.

    The best gmail feature is the "never treat as spam" filter.