Gmail Spam Filter Changes Bite Linus Torvalds
An anonymous reader points out The Register's story that recent changes to the spam filters that Google uses to pare down junk in gmail evidently are a bit overzealous. Linus Torvalds, who famously likes to manage by email, and whose email flow includes a lot of mailing lists, isn't happy with it.
Ironically perhaps, it was only last week that the Gmail team blogged that its spam filter's rate of false positives is down to less than 0.05 per cent.
In his post, Torvalds said his own experience belies that claim, and that around 30 per cent of the mail in his spam box turned out not to be spam.
"It's actually at the point where I'm noticing missing messages in the email conversations I see, because Gmail has been marking emails in the middle of the conversation as spam. Things that people replied to and that contained patches and problem descriptions," Torvalds wrote.
Or if the other comment's got hit by spam filters
Individual that differs more than 6-sigma from the population's mean has trouble with automated tools designed for the average person.
Gmail's spam filter is why email is still useful.
Well, Linus doesn't *HAVE* to use gmail. There are other email providers.
It is pretty much a given that the FP rate would go up as filters become more ubiquitous. This is how the spammers are winning the spam battles when people place too much faith in filters.
As I have said before, spam is an economic problem. We won't solve the problem with filters, or with any kind of punishment (legal or otherwise); we need to look at this rationally as an economic problem.
Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
I've noted an increment of spam on my Gmail account.
From one email every few week to a couple per day.
Individual that differs more than 6-sigma from the population's mean has trouble with automated tools designed for the average person.
Exactly. I use Gmail and I honestly haven't had a false positive (flagged as spam when it isn't) in over two years. I still get the occasional false negative (spam that isn't flagged) at a frequency of a few per week. It's good enough that I don't even bother to routinely check my spam filter. It also is pretty good on the training - once you've spent a little time telling it what is spam and what isn't for you in my experience it is pretty good after that. Frankly if you have to check your spam filter often it isn't a very good spam filter.
I suspect Linus has rather unusual email requirements. Perhaps Gmail isn't the ideal solution for him. Very few tools are perfect for everyone. I'm a little surprised he's having that much trouble but stranger things have happened.
more at 10
Apparently he can't afford to purchase decent email service. Maybe someday he'll create something important and then he can get off the crap freemail.
Every sane spam filter returns three possible results:
When you have three categories it will reduce the FP rate very hugely. And the most important fact is that a spam filter should never throw away spam. It can be illegal to do so, or at some time it is going to be illegal. You should keep all your spam for documentation, for this reason. Also, you can initialize (learn) your filters very quickly when migrating the system. Spam is a valuable resource.
You can tell this is old news given that it was this past weekend the Linus posted an update on Google Plus stating that false positive rates were back down to normal for him.
https://plus.google.com/+LinusTorvalds/posts/dJdkRxUCRmK
you really understood the problem..... just as well you posted as an AC
"The hands that help are better far than lips that pray." - Robert Ingersoll (1833-1899)
you must be the only one, i've not seen anyone else complain about git in that fashion - did you email your bug report to Linus (or did it get put in the spam folder by google)?
"The hands that help are better far than lips that pray." - Robert Ingersoll (1833-1899)
He posted about it in G+, a googler noticed and offered to look into it. One day later The Register is feeding off the echoes and the story is slashdotted.
"Much better now.
Of the 100+ messages caught as spam over-night, only two were false positives (and I reported them). My email is getting back to normal."
https://plus.google.com/+LinusTorvalds/posts/dJdkRxUCRmK
On the next day, Linus wrote "My email is getting back to normal."
https://plus.google.com/+LinusTorvalds/posts/dJdkRxUCRmK
And what if I want to bring up two separate branches side-by-side to do some copying? Can't fucking do it in Git.
For side by side comparisions you can always just do a lightweight clone which pretty much should happen automatically if you clone to another directory within the same filesystem, i.e.
git clone -b branch orig_repo branch_repo
Git as source control = shit
Git as a tool for Linus to merge kernel patch sets = amazing.
Git is fantastic at what it was designed for.
Maybe someday he'll create something important and then he can get off the crap freemail.
Yup. Given his past success with both Linux kernel and with GIT distributed source management, I too think that out of anyone Linus Torvalds might be the only guy able to effectively solve the SPAM problem.
"Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
I'm sure NSA has a copy. All you need is to fill out a FIOA request and interrogate Michael Hayden until he admits it
>80 column hard wrapped e-mail is not a sign of intelligent
>life
Somebody should tell Linus about this great new operating system I run at home. I have sendmail running on my machine, and it lets me control my spam filters and everything.
It's called "Linux". I highly recommend it.
I for one am extremely shocked that the above post ('use other providers') be flagged as funny.
Torvalds is the last person I'd imagine registering an email address @ Google.
Wise as he may have been, sorry, but to me he's a moron now just because of this.
I just hope I won't evolve his way when getting older.
Herve S.
From his original post, there is a clear date he claims the FP rate to have gone up... so this isn't a blanket Gmail FP rate issue, but rather a Gmail or spam blacklist incident, which is quite different from what the summary would suggest. As of right now:
http://mxtoolbox.com/SuperTool.aspx?action=blacklist%3aLKML.ORG&run=toolpage
lkml.org Added to UCEPROTECTL2
Uceprotectl2 Automatically Delists Entries
This blacklist does not offer any form of manual request to delist. Your IP Address will either automatically expire from listing after a given timeframe, or after time expires from the last receipt of spam into their spamtraps from your IP Address.
Uceprotectl2 Accepts Payments Or Donations
This blacklist does support a manual request to remove, delist, or expedite your IP Address from their database upon Payment or Donation of fees to their organization. Please note the following; 1) MxToolBox does not in any way advocate the paying of removal from any blacklists. 2) Removal requests that are submitted without addressing the core problem will likely result in your IP Address being relisted in the database which can cause subsequent problems and extended listing periods without release.
More information about UCEPROTECTL2 can be found at their website: http://www.uceprotect.net/
Reason for listing - Net 146.185.176.0/21 is UCEPROTECT-Level2 listed because 36 abusers are hosted by RCN-ASN - Reality Check Network Corp./AS46652 there. See: http://www.uceprotect.net/rblc...
UCEPROTECTL2 seems a bit shady, but I am not blacklist expert.
Also as a side note, any spam filter that attempts context evaluation has a tendency to mark emails with code or special character formatting as spam. Even emails with links. So for someone like Linus to have a higher blanket spam FP rate is also not surprising.
The best gmail feature is the "never treat as spam" filter.
I just checked my spam folder again. I reserved a U-Haul, and the email confirming that reservation went to spam. One of the few false positives I get, but there are others.
What's interesting is all the political fund-raising emails. Only the conservative ones end up in Spam (Campaign for Liberty, Conservative Senate committee, Scott Walker, Rand Paul, Jeb Bush, Mark Mix, etc.). That's fine - I don't really want to see those emails anyway. Yet, I get lots of the same emails from Hillary Clinton, Obama, Harry Reid, common good VA, Virginia Dems, Webb 2016, etc, and NONE of those left-leaning groups or politicians get sent to Spam. 0. That seems a little odd - I didn't do any filter training based on that. So does Google's spam filter have a political leaning?
"Somebody has to do something. It's just incredibly pathetic it has to be us."
--- Jerry Garcia
Moreover, any time I access gmail via ipv6 (I have dual-stack) my messages are marked as spam and I receive terrible warnings about someone else trying to break into my e-mail, and that despite using the same system and browser and the IPv6 whois records pointing to the very same ISP and town as the IPv4 ones. (The funny thing is that they don't tell me that, but that someone is trying to log into my email using "Linux" (as if that wasn't the case everytime).
It seems that despite their pretensions of achieving consciousness and singularity with their AI and stuff, their day-to-day operations are based on 'trusty' cargo-cult more typical of some neighbourhood network run by highschool dropouts.
I have the same experience. For many months now, Gmail has been overzealous in marking stuff as spam. Stuff like daily emails from servers I manage with log digests. Emails about pending security package upgrades. Even when I specifically say that a certain subject string (e.g. "logwatch") is to be excluded, Gmail ignores that rule. It has been very frustrating trying to exclude stuff via filters in Gmail.
2bits.com, Inc: Drupal, WordPress, and LAMP performance tuning.
Oh come now. I quite like git (it's my go-to choice these days for everything---Darcs, I hardly knew ye) and I have occasionally had to reset--hard after fucking something up. I've found fuckups a little easier than I'd prefer.
Besides generalised strostrup is right: there are two kinds of tools, those people complain about and those people use. I would therefore expect many complaints about git.
(and many are not unjustified either)
SJW n. One who posts facts.
automatically reject email failing its SPF or DKIM checks. If it's forged, by definition it's spam.
What is probably causing Linus's problem is the open subscription nature of the LKML. I'm guessing that a lot of people are subscribing, then flagging messages as spam rather than deleting them. Once a certain threshold is hit, that and similar messages are then flagged as spam.
Was helping a guy having trouble posting to our LUG list the other day.
He had DSN's being delivered to his GMail spam folder. I thought, "golly, how does Gmail figure those could be spam?" Nobody is going to sneak a Viagra ad through underneath a 550 report.
Of course his other problem was he was using Comcast as an outbound relay. Their new relay retries a message once every second five times and then gives up forever. Totally breaks greylisting, or even temporary outages. Didn't even try my backup MX.
But I'm running RFC-complaint secure servers, so those two big bozos can go pound sand. Eventually they'll fix their game. If a LUG member, of all people, can't be bothered to spend $2 a month for quality email hosting (assuming they just don't want to do their own), I'm not going to cry about it.
Free services are only guaranteed to be worth just as much as you pay for them. This falls squarely into the "not my circus, not my monkeys" category.
My God, it's Full of Source!
OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
First up lkml.org is a third party site that hosts Linux kernel mailing list archives on a website. Regular Linux kernel mail isn't actually sent from it (I believe that's done by vger) so we're looking up the email reputation for the wrong IP...
Secondly UCEPROTECT is a very aggressive blacklist which states upfront they will block people who they believe are in the vicinity of people who the judge to be sending them spam. It's not the be and end all though and on one server I looked some time ago it's effectiveness was surpassed by other blacklists (here's someone else's old DNS blacklist comparison for 2014). In general I prefer more conservative tools like senderbase when trying to work out an IPs mail reputation.
For what it's worth I've also seen GMail incorrectly mark mails sent to the fio mailing list (which is also managed by vger) as spam and in that case it was purely down to mail being proxied through the list which was a place that didn't match the sender's DMARC records. Most of the time GMail was getting the marking of spam right though (even for mailing list mails)...
But nobody could every remember the name of the command.
And the Gnome guys promised to make a userfriendly GUI to help against that.
But they are still arguing about how to make it follow HIG, and how to adapt to the upcoming GTK4.
"Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
I totally agree. Git is way too focused on the repository maintainer than the day to day developer that just wants to check in code and not deal with the esoterica of the source control system. Even with Sourcetree it's just weird.
Interesting, thanks!