Is Google's Comment Filtering Tool 'Vanishing' Legitimate Comments? (vortex.com)

← Back to Stories (view on slashdot.org)

Is Google's Comment Filtering Tool 'Vanishing' Legitimate Comments? (vortex.com)

Posted by EditorDavid on Sunday February 26, 2017 @11:37AM from the toxicity-reports dept.

Slashdot reader Lauren Weinstein writes: Google has announced (with considerable fanfare) public access to their new "Perspective" comment filtering system API, which uses Google's machine learning/AI system to determine which comments on a site shouldn't be displayed due to perceived high spam/toxicity scores. It's a fascinating effort. And if you run a website that supports comments, I urge you not to put this Google service into production, at least for now.

The bottom line is that I view Google's spam detection systems as currently too prone to false positives -- thereby enabling a form of algorithm-driven "censorship" (for lack of a better word in this specific context) -- especially by "lazy" sites that might accept Google's determinations of comment scoring as gospel... as someone who deals with significant numbers of comments filtered by Google every day -- I have nearly 400K followers on Google Plus -- I can tell you with considerable confidence that the problem isn't "spam" comments that are being missed, it's completely legitimate non-spam, non-toxic comments that are inappropriately marked as spam and hidden by Google.
Lauren is also collecting noteworthy experiences for a white paper about "the perceived overall state of Google (and its parent corporation Alphabet, Inc.)" to better understand how internet companies are now impacting our lives in unanticipated ways. He's inviting people to share their recent experiences with "specific Google services (including everything from Search to Gmail to YouTube and beyond), accounts, privacy, security, interactions, legal or copyright issues -- essentially anything positive, negative, or neutral that you are free to impart to me, that you believe might be of interest."

101 comments

Min score:

Reason:

Sort:

JUDGE WAPNER DEAD! by Anonymous Coward · 2017-02-26 11:38 · Score: 0

Aliens?
1. Re: JUDGE WAPNER DEAD! by Anonymous Coward · 2017-02-26 12:22 · Score: 0
  
  The smart money is on heroin overdose.
2. Re: JUDGE WAPNER DEAD! by Anonymous Coward · 2017-02-26 12:34 · Score: 0
  
  Maybe he fell behind in his payments, and the collection got a little rough.
3. Re: JUDGE WAPNER DEAD! by Anonymous Coward · 2017-02-26 12:41 · Score: 0
  
  So-called judge. Ftfy
4. Re:JUDGE WAPNER DEAD! by Aighearach · 2017-02-26 13:07 · Score: 1
  
  Aliens?
  Definitely aliens. Mork was being sued by Alf, and only Judge Wapner could sit through that case. So they recalled him.
Machine learning doesn't understand irony by Anonymous Coward · 2017-02-26 11:41 · Score: 0

Act now to improve your bedroom performance with proven ED pills. For example.
1. Re: Machine learning doesn't understand irony by Anonymous Coward · 2017-02-26 20:10 · Score: 0
  
  Now that you mention it maybe Google does know too much about me. I have NEVER been presented with those type of ads :-)
2. Re: Machine learning doesn't understand irony by Anonymous Coward · 2017-02-27 02:23 · Score: 0
  
  Googles, Skypes, Yahoos, and Bing need to be matched off to the gas chambers post haste
Really? by Webs+101 · 2017-02-26 11:41 · Score: 5, Insightful

There are 400,000 users of Google+?

--
"Even for Slashdot, that was a very obscure reference!" - Anonymous Coward
1. Re:Really? by Anonymous Coward · 2017-02-26 13:18 · Score: 0
  
  Trumpverse, dude! Trumpverse!
2. Re:Really? by Anonymous Coward · 2017-02-26 13:50 · Score: 0
  
  I think it automatically made an account for me at some point, but I've never actually used it.
3. Re:Really? by drinkypoo · 2017-02-26 14:01 · Score: 1
  
  There are 400,000 users of Google+?
  It's still a better platform for following people you don't know than Facebook is. For one, it shows you all posts by someone (or by some circle) by default.
  
  --
  "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
4. Re:Really? by Anonymous Coward · 2017-02-26 21:26 · Score: 0
  
  I used to have a youtube account. It allowed me to make a few comments and watch "adult" video (soft porn, extreme violence)
  Might have kept it after it was "upgraded" automatically, if it didn't stalk me on the Google Search page. I don't know if it was a Google+ account or not. Perhaps it was ; it had some fake name or pseudonym I don't remember. I might have liked keeping it and using Google+, why not.
  I *do not want* a Google Search account, though! I don't want a gmail account either but I didn't go to gmail. So, thanks to their one account policy, I had to delete that ephemeral Google account. I didn't asked for a Search account, nor do I want a filter bubble algorithm use the fact that I looked for 2girls1cup and the trespasser guy who gets screwed by a horse (R.I.P.)
5. Re:Really? by skids · 2017-02-27 02:55 · Score: 1
  
  Google search accounts are pretty good at keeping you logged out when you log out. I have one because it is the only way to file bug reports against google's codebase. I only log in for that, and have never found myself accidentally logged in... it's just a basic browser-based account.
  I too ditched my youtube account shortly after it got tied to Google.
  
  --
  Someone had to do it.
6. Re:Really? by Anonymous Coward · 2017-02-27 03:59 · Score: 0
  
  I was able to log out, but the concept of a login that "follows" you across domains from youtube.com to google.com was entirely unheard of, to me.
  At first, I didn't even know that I was logged in to google while doing searches.
  Perhaps I closed the account down when Google announced it would tie all your data together and use it across all its services. This kind of cross-referencing different datafiles is or used to be frown upon (e.g. in my country, it was illegal for public authorities or administrations to cross/intersect (I don't know what's the best term) various files). This disappointed me.
  BTW I want to work a delivery "job" but an Android app is needed and I'm afraid Google might know my actual identity (I have the phone, but never put a SIM card in and haven't installed Android yet). Although I plan to turn it on only for the "job", and use GPS-only for location (assuming Android doesn't secretly use Wifi/Bluetooth when I didn't allow it to). I don't know who the company I would "contract" with (hahaha) would rat me out to and I'm especially concerned about the cell carrier I'd use (one that allows to buy megabytes with physical cash, but the mothership company is in UK)
7. Re:Really? by skids · 2017-02-27 05:18 · Score: 1
  
  Yeah, you'll have to get someone else to download the app, transfer the .apk and side-load it. Assuming the app itself doesn't require a social login just to work.
  
  --
  Someone had to do it.
Google is becoming evil... by TheNarrator · 2017-02-26 11:49 · Score: 0

Well they deindexed NaturalNews.com and now this. NaturalNews is full of a lot of bullshit, but some of it is legit health information and it's not Google's job to throw the whole site out, more than 100 thousand original articles, literally millions of hours of people's time because some of it is made up or hyperbole. What is the criteria they used for banning the site? Nobody knows! Did someone at Google say, "I don't like that site, kill it"? There's no transparency, so we'll never know. Now they are trying to pre-emptively filter comments that haven't even been reported as offensive!
I've been using Google since 1998 and that's the last straw. I give up. I'm using duckduckgo from now on.and going to look for alternatives to any Google products in the future.
1. Re:Google is becoming evil... by thinkwaitfast · 2017-02-26 12:08 · Score: 1
  
  It only takes a week and you'll never miss it.
Censor poor word usage, Google! by Anonymous Coward · 2017-02-26 12:04 · Score: 0

Does that mean Google will censor comments that misuse the words impact, impacting, and impactful?
Is impactful even a word?
1. Re:Censor poor word usage, Google! by Anonymous Coward · 2017-02-26 15:47 · Score: 0
  
  That depends on what the word "word", means.
2. Re: Censor poor word usage, Google! by hackwrench · 2017-02-26 18:00 · Score: 1
  
  And whether dictionaries are descriptive or prescriptive.
true enough to require linking to own forum by Anonymous Coward · 2017-02-26 12:06 · Score: 0

I published programming tutorials on youtube. Up until 2014-2015 had a steady stream of comments and questions. Then suddenly activity dropped. I just thought my material reached the end of relevancy. Turned out, responses continued but as of 2015 all questions my viewers continued to try send me, immediately with no notification went to depths of spam. Could not even mark them as non-spam.
What I see other content creators do for channels that require viewer interactivity is close the comments, or not rely on them, and redirect to own forums, probably for the same reason.
Perspective = Animal Farm's sheep by Anonymous Coward · 2017-02-26 12:17 · Score: 0, Troll

The BBC banned comments from almost all their 'news' stories when too many comments corrected the obvious bias in the BBC reports, and when the BBC tried censoring such comments, the authors were able to place the same comments archived on other sites proving the true political agenda of the BBC. This is the problem with censorship- it only works when it is secret. Too much public awareness makes even the simplest citizen suspicious of what is really happening.
In Orwell's Animal Farm, the sheep bleat out any attempt for honest speech from 'the floor'. In the USA, the authorities just Taser you. Google's Orwellian Perspective is designed to do the job of the 'sheep'- but using the 'excuse' of syntactical operation, not semantic. Thus complainants are dismissed cos no actual analysis was done on the actual meaning of the content- just its form. This site, with its 'lamest filter' is already employing the same tactic for much the same reason.
Go to any forum and most comments are 'harmless' worthless 'groupthink', and have the same simple-minded syntactical form. These are the type of comments Google (and propaganda organisations like the BBC) want to encourage. Those comments that look a little like the 'anonymous' leaflets of pre-revolutionary America, for instance, are the 'dangerous' form Google wants gone.
1. Re:Perspective = Animal Farm's sheep by Anonymous Coward · 2017-02-26 15:59 · Score: 0, Flamebait
  
  The CBC in Canada does this as well. Once their left-wing golden child Trudeau took charge, the censorship in the forums got pretty bad. You can tell which articles are full of agenda that they don't want questioned at all, those being the ones where they've disabled comments altogether.
  The worst part is that tax money funds their operation to the tune of $1B per year. We're paying to have agenda rammed down our throat.
2. Re:Perspective = Animal Farm's sheep by brantondaveperson · 2017-02-26 22:35 · Score: 1, Informative
  
  Where you see 'agenda', most of the rest of the world sees reasonable journalism and balance.
automated by Lehk228 · 2017-02-26 12:23 · Score: 4, Funny

so now we are automating social justice? what are the college students going to do with their humanities degrees?

--
Snowden and Manning are heroes.
1. Re:automated by Anonymous Coward · 2017-02-26 12:31 · Score: 0
  
  so now we are automating social justice? what are the college students going to do with their humanities degrees?
  The same thing they do with them now. Teach humanities courses.
2. Re:automated by Anonymous Coward · 2017-02-26 16:04 · Score: 0
  
  Hope for a job with the Open Societies Initiative as a paid protester?
3. Re:automated by AmiMoJo · 2017-02-26 20:41 · Score: 2, Interesting
  
  Be careful with this one. There are exactly zero examples in TFA, no evidence of what is being claimed. It's just an anecdote at this point.
  Any kind of filter will never be 100% perfect. That's just the nature of filters. Occasionally the odd bit of spam will get through, the odd legit message will be marked as spam. That doesn't mean they are not useful. We should wait for actual data before making a judgement here.
  Ultimately it's up to the individual. If you ant full uncensored speech then turn off your spam filter and handle the torrent manually. If you have better thing to do with your life and your channel is just Pokemon Go videos anyway, something like this could be useful.
  
  --
  const int one = 65536; (Silvermoon, Texture.cs)
  SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
4. Re:automated by Anonymous Coward · 2017-02-27 05:16 · Score: 0
  
  Emigrate.
NaturalNews violated Google TOS, not censored. by Anonymous Coward · 2017-02-26 12:24 · Score: 4, Informative

Google Webmaster Tends Analyst John Mueller:
"Hi! I work with the Google Search team. We’re seeing a bit of confusion & incorrect stories circulating about what’s happening here, so just to be super clear — Natural News is using a sneaky mobile redirect, which is prohibited by our webmaster guidelines (there’s a bit about this kind of issue at https://webmasters.googleblog.com/2015/10/detect-and-get-rid-of-unwanted-sneaky.html). These redirects aren’t always easy to reproduce, they’re sometimes in widgets or served by ad networks, and can target specific devices, browsers, or user locations. When we last checked, there was one on blogs .naturalnews. com/bentonite-clay-a-natural-medicine-cabinet-must-have/. As soon as this is cleaned up, the site can submit a reconsideration request through Search Console, and once that’s reviewed things will return to normal. No action has been taken based on the editorial content of this site."
https://productforums.google.com/forum/#!topic/webmasters/3BNKoRXA49g/discussion
1. Re:NaturalNews violated Google TOS, not censored. by Anonymous Coward · 2017-02-26 14:01 · Score: 0
  
  No action has been taken based on the editorial content of this site.
  More's the pity! At least we can be happy that this "full of a lot of bullshit" site opted to self-censor by violating Google's TOS.
2. Re:NaturalNews violated Google TOS, not censored. by Anonymous Coward · 2017-02-26 14:50 · Score: 0
  
  what a liar
  if this were true, they would remove a lot of other sites that distribute malware from their index
3. Re: NaturalNews violated Google TOS, not censored. by Anonymous Coward · 2017-02-26 22:46 · Score: 0
  
  <aside> please surround your text URLs with blanks so they're easier to copy. Or even better, use HTML </aside>
  Thanks for pointing me to this Google policy, I wasn't aware that they punish mobile redirects. I use them when it's hard to do all mobile formatting in CSS. It often takes a site rewrite to be able to offer different @media versions via CSS. So until that is ready I'll set up a parallel site for mobile. With equivalent content of course, but I doubt the Google bot will see that.
Sick of toxic comments by Anonymous Coward · 2017-02-26 12:29 · Score: 0, Flamebait

I'm getting sick of toxic comments. And I don't mean it that way, I mean all the focus on "toxicity" as applied to online discussions. I don't know if it started in the MOBA communities or if that was just the first place I'd heard it used, but now it seems like they've turned a marketable word for terrible communities into the easy label for anything you don't like on the internet. I get the symbolism they're going for, but when I see them using the term on people with opposing political leanings it always comes off more like poisoning the chocolate of comfortable purity with the peanut butter of uncomfortable differing opinions. Being a republican is not bullying, being pro-life is not threatening, using the word fuck in a sentence is not escalating (those of us with... let's call them enthusiastic vocabularies put up with that shit every goddamned time).
This is the same problem I have with Twitter, where you're able to straight up remove anyone from a topic, claim toxic behavior, and be vindicated. Only now we're training an AI to objectively determine what is a highly subjective behavior. I've had more of my level-headed mundane comments labeled toxic than I have my rants and trolls, nothing makes me confident that this solution will be better. I don't think it's worth stifling discussion and creating safe-space echo chambers just because teens seem to think that "Kill yourself faggot" and "get raped" is the height of satire.
1. Re: Sick of toxic comments by Anonymous Coward · 2017-02-26 15:11 · Score: 0
  
  Your -1 score is delicious irony.
Why transitivize vanishing? by 140Mandak262Jamuna · 2017-02-26 12:32 · Score: 1

When easy good substitutes exist, like hide, ignore, blacklists, eliminate, filter etc, why take the trouble to tansitivize vanish? If you want to be cute and curry favor with old unix coots ask dramatically, "Is Google Comment Filtering tool grep -v ing legitimate comments?"

--
sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
Slashdot by darkain · 2017-02-26 12:43 · Score: 2

Slashdot's Meta-Moderation is by no means perfect, but it is a hell of a lot better than 99% of the web sites out there, especially anything that has automated moderation. Don't feel like dealing with assholes? Then don't browse at -1. Odds are someone else with karma has already come along and moderated the assholes into oblivion.
1. Re:Slashdot by Anonymous Coward · 2017-02-26 14:50 · Score: 2, Interesting
  
  No, they moderated someone they disagree with into oblivion. Want to read real comments (including all the garbage) - browse at -1 to see what people are actually saying, not what the current no-life karma whores deem accepted in the current agenda/groupthink.
  I only post AC on slashdot. Why? Because I cannot be banned by other users.I registered to comment once. I made several simple non-spam posts of my opinions on some windows tech topic, and was promptly banned from posting - by other users! I guess they did not want people to see what I had to say.
  Slashdot's "meta-moderation" system is cowardly censorship hiding behind sockpuppet accounts, Want to foster real discussion? Take ownership of your content and moderation team, have some integrity and courage and stand behind your decisions.
  Slashdot's censorship system is terrible. I see better discussions in youtube comments these days, believe it or not.
2. Re:Slashdot by Anonymous Coward · 2017-02-26 15:48 · Score: 0
  
  Slashdot moderation sucks because it causes a trickle down effect that shapes the entire conversation in the comments. It is not a natural flow. If you want to see how moderation should work, then visit 4chan. You might not like it, but that is a much better representation of how humans think and interact in reality.
3. Re:Slashdot by Anonymous Coward · 2017-02-26 21:53 · Score: 1
  
  It's worse. I once was modded down over and over again by the same user, apparently taking aim at me. Where did he get all the mod power from? So I asked and was answered by CmdrTaco himself. They have super users with endless mod points. If they don't like you, they mod your opinion away.
4. Re:Slashdot by Anonymous Coward · 2017-02-27 02:24 · Score: 0
  
  Yeah, and sometimes someone uses a negative mod point on a non-asshole AC, just to group him with the goatse spam.
  Real fun that, to see my comment disappear 'under the fold'.
5. Re:Slashdot by edtice1559 · 2017-02-27 06:44 · Score: 1
  
  The trolls and flamebait all get moderated down to -1. But so do unpopular opinions. So it is far from a perfect system. Want to guarantee yourself a -1, post something good about Windows/10!
6. Re:Slashdot by Agiailotes · 2017-02-27 06:51 · Score: 0
  
  The problem with censorship is in general is that the people supporting it fall back on infantile, irrational definitions of the people they want to see censored, using names like "asshole" to justify their own reaction to opposing points of view regardless of reasoning and objective analysis, two fundamental human traits.
  Simply put these people think selectively and usually choose not to think at all let alone do, and instead appeal to authority for all of their needs even when they could make an effort and supply themselves.
  At a certain point, anyone who takes enough joy in censorship becomes a brainless, soulless subhuman being used as a tool for evil men to silence freedom forever. They conform to the image of self they receive from the media due to peer pressure. They become nonthinking machines.
Example please by Anonymous Coward · 2017-02-26 12:45 · Score: 0

Could we see at least one fucking example of what he's on about? I couldn't find one in his blog (essentially TFS) or his white paper link.
1. Re:Example please by Anonymous Coward · 2017-02-26 14:53 · Score: 0
  
  Have you commented on youtube lately? There's about a 50% chance your comment will appear publicly. There is highly inaccurate shadow-banning going on. Go try it - comment, then logout and see if you can see your comment. Test it on several accounts. It's pretty fubared right now.
2. Re:Example please by Anonymous Coward · 2017-02-26 18:01 · Score: 0
  
  There's about a 50% chance your comment will appear publicly.
  Flipping a coin isn't the same as "algorithm-driven censorship".
3. Re:Example please by Anonymous Coward · 2017-02-26 18:58 · Score: 0
  
  Exactly. That was the point. It is not working.
Time for a new search engine? by AHuxley · 2017-02-26 13:01 · Score: 1, Interesting

A search engine that still searches the internet?
Less effort on creating Hero Brigades and more effort on been a search engine?
If a US search engine wants to be a safe SJW protected service with lots of ads, what would the results look like?
The rest of the internet can create a real search engine that finds results. Not having SJW approval to show results would make for some fun marketing.
The internet is not a problem. SJW filtering of the internet is showing less results and users expect a working search engine.
The news is good, as one global search engine becomes more of a safe space, better search brands are been developed and funded.
All a search engine has to do is search. If people want safe party political results why not set up a "safe" space list site?
Everyone can then be happy. The SJW teams get their reporting and banning projects funded. SJW approved political and culturally safe link lists.
Back to the early 1990's with the entire safe internet presented as a link list in 2017.
Just list the very best in safe sites? No filtering, no questions, no comments. Just safe news and party political talking points.
No blasphemy, no faith related cartoon sites, no mention of Tiananmen square and 1989. Think of how safe that limited list of sites could be.
No links to any news sites that allow comments about illegal migrants?

--
Domestic spying is now "Benign Information Gathering"
1. Re:Time for a new search engine? by Anonymous Coward · 2017-02-26 14:06 · Score: 2, Informative
  
  My think my internal spam filter just got tripped.
  Could you re-phrase that in coherent sentence/paragraph structure? Or was that a deliberate attempt at "conversational hypnosis" Illuminati mind control?
2. Re:Time for a new search engine? by AHuxley · 2017-02-26 14:49 · Score: 2
  
  AC why try and filter the vast internet with all the comments about illegal migrants, blasphemy, news results about Tiananmen square and 1989?
  Why not just create a safe space with an internet list? All the Hero Brigades SJW teams could add the few news sites they think are politically and culturally appropriate.
  
  Focus on the ability to build a new internet. Why try and hold back all the sites that are not inclusive in real time?
  Think of looking up authors or composers.
  With a SJW list of approved arts sites the users would be only ever be presented with inclusive diversity.
  
  --
  Domestic spying is now "Benign Information Gathering"
3. Re:Time for a new search engine? by AmiMoJo · 2017-02-26 20:44 · Score: 1
  
  AC why try and filter the vast internet with all the comments about illegal migrants, blasphemy, news results about Tiananmen square and 1989?
  Why not just create a safe space with an internet list? All the Hero Brigades SJW teams could add the few news sites they think are politically and culturally appropriate.
  Wait... You think that the Chinese government are SJWs? Or that people who support social justice also support Chinese government censorship of Tiananmen square?
  If that acronym ever had any meaning, it's been utterly lost now. More so than ever, it's just a generic catch-all term for whoever you happen to be railing against at that moment.
  
  --
  const int one = 65536; (Silvermoon, Texture.cs)
  SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
4. Re:Time for a new search engine? by AmiMoJo · 2017-02-27 00:56 · Score: 2
  
  Google hasn't censored results for China since 2010. The Chinese government censors certain searches via their firewall because Google has results returned from servers in Hong Kong.
  
  --
  const int one = 65536; (Silvermoon, Texture.cs)
  SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
5. Re:Time for a new search engine? by xession · 2017-02-27 02:54 · Score: 0
  
  A search engine that still searches the internet?
  
  This one especially. It is tragic on googles part that I get much better results out of both duckduck and gigablast for 80-90% of my searches. They used to be the king of digging deep into the web before they decided to start censoring their results into obscurity. Honestly, I highly support gigablast taking the lead if they can get their spidering more active. Their list of search operators is pretty impressively long and powerful.
6. Re:Time for a new search engine? by skids · 2017-02-27 03:10 · Score: 1
  
  I'm surprised he got by the lzw filter after using "SJW" so much.
  
  --
  Someone had to do it.
7. Re:Time for a new search engine? by Agent0013 · 2017-03-01 10:37 · Score: 1
  
  Just go back to the days of AOL and Compuserve. If their list of sites is filtered for safe stuff, then you just stick there and you see only that!
  
  --
  
  -- ssoorrrryy,, dduupplleexx sswwiittcchh oonn.. -Quote found on actual fortune cookie.
8. Re:Time for a new search engine? by AHuxley · 2017-03-01 14:31 · Score: 1
  
  Think of what the net would have been if SJW would have been able to shape it from the early 1990's on :)
  
  --
  Domestic spying is now "Benign Information Gathering"
Thing about spam by buss_error · 2017-02-26 13:06 · Score: 5, Interesting

The thing about spam is that for as long as I can remember (at least back to 1997) people have insisted upon a technical solution for spam. The issue is that spam is not a technical problem. It's a human problem. Like any other problem/response cycle, if you are solving for the wrong issue, don't be shocked if the solution isn't as bad or worse than the problem. Another issue, not directly on point, is Google Email and anti-spam. I know of several organizations that have completely shut down their email infrastructure in favor of Google email services. An unaddressed problem is that these organizations have also laid off their email folks since "Google takes care of it all" so subtle and not so subtle issues often go not simply unaddressed, but unknown to the organization. The result has been a high rate of false positives, including senders without DKIM. I once got into a argument with John Lavine about DKIM, in which he got pretty passionate. I argue that DKIM is:
1. Needlessly opaque
2. Prone to abuse from over zealous admins
3. Google does it wrong (Checking the header chain all the way back instead of the last system the recipient does not run)
4. Breaks email standards
5. Doesn't solve any issue that SPF does not solve more directly, without possible abuse, and much more simply, requires far fewer CPU resources and skill, and does not break email standards in the process.
I'm told that "I'm too stupid" to know how it works and "I should get out of computers since you obviously are too stupid to know your f'ing job!" (both quotes from right here on slash dot). I won't try to prove otherwise, but one question I've asked over and over again is how DKIM, checked back further than the last untrusted relay, does not break email standards for list or forwarded mail. SPF won't break those, DKIM will, every time.
So getting back to our muttons, I'm not surprised that Google's spam engine (or anyone's, for that matter) has a high false positive rate, or a lower than desired true positive rate. That issue is simple - they are attempting to solve a problem with technology that isn't technical in nature. Stop using a hammer to try to screw in a light bulb. Doesn't work well.

--
Necessity is the plea for every infringement of human freedom. It is the argument of tyrants; it is the creed of slaves.
1. Re:Thing about spam by joboss · 2017-02-27 05:35 · Score: 1
  
  I find that Google is quite good at separating the wheat from the chaff at least for my use case. However I find there are some good practices to avoid receiving large amounts of spam.
  
  Your signature is contradictory. Spam filters are a necessary evil. To change conditions such that the necessity no longer exists would take tyrannical action.
  
  There's definitely a lot of debate about how spam filters should be implemented and all of the various other ways to mitigate it out there.
  
  Problem solving. It's the only thing people ever value. However problem avoidance, management and mitigation as also serious disciplines. Actually understanding problems is the most serious discipline of all. You can call it problem analysis. A lot of problems the solution comes from the problem itself if you study it carefully. Spam though isn't entirely so simple.
2. Re:Thing about spam by david_thornley · 2017-02-27 11:05 · Score: 1
  
  The issue is that spam is not a technical problem. It's a human problem.
  
  However, since I can't wave a magic wand and make billions of people into good responsible human beings, I look for a technological solution that I can install for myself.
  
  --
  "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
Yes, duh by Anonymous Coward · 2017-02-26 13:18 · Score: 0

Of course it is. Algorithms are incapable of inderstanding nuance, it's ridiculous to even presume it to be otherwise. It is a censorship plugin, straight up.
Spam? by Anonymous Coward · 2017-02-26 13:20 · Score: 0

I don't see anything about spam in Google's announcement. They say it detects "toxic" comments, not spam. Now that the semantics of "trolling" has been expanded to cover anything that is against netiquette, is the same thing happening with the word "spam"? Will all words eventually merge into one?
Of course not. by Anonymous Coward · 2017-02-26 13:27 · Score: 0

Of course not; if they were legitimate comments, they wouldn't have been caught by a system designed to catch illegitimate comments. Duh.
1. Re:Of course not. by Anonymous Coward · 2017-02-26 14:56 · Score: 0
  
  It is their business, they can do anything they want.
  You don't like it stay away.
  I won't mind a bit(pun intended).
2. Re:Of course not. by Anonymous Coward · 2017-02-26 16:16 · Score: 0
  
  It is their business, they can do anything they want.
  Have you been off-world in a space capsule for the past 20 or 30 years? The government has repeatedly ruled that no, business owners cannot do anything they want.
  Things got pretty fucked up during your absence.
commentsubject by Falos · 2017-02-26 13:43 · Score: 1

Automated censoring of "toxic" content has lots of false positives? I'm shocked, shocked I say.
So hey, I've been working on a crayola quip that I can't quite nail. Something to the tune of "I've seen crayons more toxic than that."
1. Re:commentsubject by david_thornley · 2017-02-27 11:10 · Score: 1
  
  You've only heard of it now? Ever hear of the Scunthorpe problem (observe letters 2-5)? Breast cancer survivor support groups being banned because of too many references to female breasts? There have been unsophisticated content filters with numerous false positives for a long time now.
  
  --
  "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
2. Re:commentsubject by Anonymous Coward · 2017-02-28 03:46 · Score: 0
  
  Yes, just now, I'm shocked.
  I'm not upset about the tech itself, yes that's old. I call bullshit because the filter's guesses are considered determinate.
  "We're put the lineup of suspects in front a computer, and the first person it determines as the culprit will be sentenced guilty."
  The data is fine if you use it supplementarily. It's being used as a substitute. Fully automated moderation isn't just lazy, it's plain bad.
I know someone who'd like that. by Anonymous Coward · 2017-02-26 13:51 · Score: 0

Why not just chain speech recognition with a trained version of the filter and call it "Trump"?
Can't stop APK by Anonymous Coward · 2017-02-26 13:56 · Score: 0

I really doubt Google's machine learning will ever be able to filter out APK's spam.
No big loss by _xen · 2017-02-26 14:34 · Score: 1

Automated censoring of "toxic" content has lots of false positives? I'm shocked, shocked I say.
With any signal detection system the question of whether it is more vital to avoid Type I or Type II errors needs to be addressed. For example in criminal justice the consequences of falsely rebutting the presumption of innocence are so high that Blackstone was led to observe it was "better that ten guilty persons escape than that one innocent suffer."
Just like any other commenter, I regard the comment I'm here writing to be of such value that it should be preserved, widely read and digested... but to be honest, on the internet talk is cheap, and it ain't gonna to stop the world turning were this comment to fall victim to AI spam filtering. Which is to say it's arguably better to avoid misses. A right to freedom of speech does not amount to a right to publish on any particular forum and framing this as kind of human rights issue is a little overwrought.
Now it would be a completely different question if comments were deliberately filtered based on the (un)popularity of the views expressed. That 'toxicity' would be used as a mask to filter legitimate viewpoints, rather than any hiccups in applying toxicity filters evenly across the ideological spectrum, is the more serious issue.
1. Re:No big loss by Anonymous Coward · 2017-02-26 15:25 · Score: 0
  
  No one here talking about rights. Pretty much all online platforms are privately owned.
  Talk is cheap; any talk is. The argument that detail would make is that it doesn't matter what the automation does either way. It doesn't name a slant, just trivializes them all.
  I can live with that. Never was one to give an inflated amount of significance to youtube comments.
2. Re: No big loss by Anonymous Coward · 2017-02-27 00:10 · Score: 0
  
  There's no objective criterion for what is a "legitimate viewpoint". If I ran an online forum I would exclude all sexist, trumpist and homophobic opinions because the people I want in my forum would certainly consider them toxic, even if these opinions are mainstream or even majoritarian.
Most people from India are Racist by Anonymous Coward · 2017-02-26 14:41 · Score: 0

Google did not filter that because it is true. Most people from India are racist. Just google the topic to learn more. I would love to see executives in the US who come from India hire other ethnic groups. If this ever happens I will apologize for this post.
1. Re: Most people from India are Racist by Anonymous Coward · 2017-02-27 00:07 · Score: 0
  
  I am American, and I will soon work for Infosys. All of the companies from India that consult in the US have been recruiting me, since they can't get h1b visas anymore.
2. Re: Most people from India are Racist by Anonymous Coward · 2017-02-27 00:14 · Score: 0
  
  You mean they say things like "most people from X are Y"?
Hello tay. by Z80a · 2017-02-26 14:46 · Score: 1

The internet will have a really great time with you again, and make you block the stupidiest things.
"Fake News" and "Spam" are now an excuse to censor by sycodon · 2017-02-26 14:48 · Score: 2

I've seen it on Disqus too.
Google and Diqus and others provide tools to the site operators also who seem to use them to disappear comments without being accused of censoring.
I've had comments, all of the same political temperament and even the same text be "detected" as spam on some sites and not others.

--
When Fascism comes to America, it will call itself Anti-Fascism, and tell you to give up your guns.
Who and what... by Anonymous Coward · 2017-02-26 15:57 · Score: 0

Who defines what legitimate is? Is there a clear definition of that and is it the same across all comment sections?
Slashdot's Filter Does by mentil · 2017-02-26 18:36 · Score: 1

I recently tried to post a (long) comment on Slashdot, and the filter prevented me from doing so; it seemed to particularly take issue with one section that talked about guns. I guess I used too many 'graylist' words too many times in my post, or something. Regardless, it wasn't spam, or offensive, and it took me a while to figure out how to split it into two posts successfully.

--
Corruption is convincing someone that the selfless ideal is the same as their selfish ideal.
Downvote by dohzer · 2017-02-26 18:59 · Score: 1

Step 1. Allow down-voting.
That is all.
Re:"Fake News" and "Spam" are now an excuse to cen by Anonymous Coward · 2017-02-26 19:03 · Score: 0

grab her on the cunt.
or pussy flaps.
or what ever.
No surprise by grumbel5969 · 2017-02-26 20:19 · Score: 2

How is that even a surprise? This is not a spam filtering tool that might catch a few wrong messages. This is explicitly advertised to perform censorship by getting rid of messages that might count as harassment or toxic. Since both of those are highly subjective, it will of course get rid of a lot valid messages, as that's it's job, that is what it was build for. If you want to automate your censorship, don't be surprise when your censorship happens automatically.
I find it ridiculous how much effort is spent on trying to cure toxicity, when most of it is the direct result of really basic usability flaws in the UI design. On Youtube for example you can only see the last 20 or so comments and higher ranked comments raise to the top. So of course you get clickbaity jokes and crap instead of good discussion, as you couldn't even have a good discussion within that comment system if you tried. Same with Twitter and it's 140 character limit. Comment system are more often than not so broken that you really can't hope to ever get good discussions out of them, no matter how much censorship you try.
Too many comments by Anonymous Coward · 2017-02-26 21:03 · Score: 0

We're now into decades of Web 2.0 wishful thinking. We're supposed to have horizontal democracy, a new era of participation and transparency, removal of barriers, of censorship and barriers to entry, a global village, blah blah blah.
It might be mostly true, but let's imagine we're in a town of 400,000 and on a given day 5% of the town's population somehow have decided to comment, deliberate etc. on this days town council's motions. Assuming one comment per person in average (half the people don't comment, some people comment once, some people comment twice etc.) you've got an unending flow of 20,000 comments anyway even if there's no spam, no insults, no off-topic garbage. No way to read it all without spending maybe a full week reading them (with a lot of skimming, scrolling, ignoring and lack of sleep and healthy activity). Even with no genuine automated comment stuffing it will be full of comments from "vocal" people which I don't think there can be or *should* be a way to deal with.
Not everyone has the same access to technology, meaning some have a smartphone with 2G text/calls and wifi, some can pay for 3G/4G, some very few might have a $1000 thin laptop with unlimited 4G, some don't *want* to use their smartphone to do something like that, some don't have a smartphone, and many like I have to stay home or get back to home so as to use the Internet from there. Thus, to participate in the "social" and "democratic" Internet I have to confine myself alone in a small and dark apartment all day, with the smell of stale tobacco and unwashed clothes.
If the politics happen over Failbook or such channels, I'm even locked out altogether. How "democratic" and "decentralized" is that to require everyone to use Failbook so as to participate in political debates, election campaigns? Do I need to surrender data about everyone I know and everyone they know to a ruthless foreign company/companies to have a voice? (or even be able to do business or get a job). With a real name policy?
Don't they know that when the Germans (and French police, etc.) came for the Jews and communists and others, they knew where to look for, thanks to mechanized data files on punch cards. Perhaps around 2030 or so, we'll see the return of death camps. Or just insidious systems where political undesirables have the lowest priorities to get public housing, jobs, healthcare etc. and higher priority for crack downs : judiciary or fiscal procedures, house evictions, etc.
And I'm witnessing a terrible decline in in-person spontaneous social gatherings. In the name of "security" and "efficiency" (more lumens per watt and less skyward lumens per lumen, ) they installed *violent* street lighting - the sheer strength and high temperature are literally unhealthy. Even river side places where youth can freely gather and drink (this is in Europe, drinking is permitted, jaywalking not enforced etc.) have bright stadium lights. So, the youth gets chained to their Failbooks and atomized even more.
I wish the Internet were not mandatory, as it used to even a decade+ ago.
Consistent at least by Archtech · 2017-02-26 22:40 · Score: 1

Onward towards the goal of excluding all but the most anodyne, boring, middle-of-the-road mush!

--
I am sure that there are many other solipsists out there.
I vanish google ads that infect/slow/track us by Anonymous Coward · 2017-02-27 03:01 · Score: 0

Prevention = best medicine (& what u can't touch can't hurt u) via NEW APK Hosts File Engine 9.0++ SR-7 32/64-bit https://www.google.com/search?hl=en&source=hp&biw=&bih=&q=%22APK+Hosts+File+Engine%22+and+%22start64%22&btnG=Google+Search&gbv=1/
Ads & malware rob speed/security/privacy
Hosts add speed (via hardcodes/adblocks), security (vs. bad sites/malware/poisoned dns), reliability (vs. dns down), & anonymity (vs. dns requestlogs/trackers).
Less power/cpu/ram + IO use vs. DNS/routers/addons/antivirus + less security bugs/complexity & faster vs. addons/routers/remote dns!
Avoids DNSChangers in routers/IP settings & dns redirects (99.999% of ISP DNS != patched vs. it) + lightens DNS load & resolves faster from local system RAM!
* Via what u NATIVELY have built into the IP stack in FASTER kernelmode!
APK
P.S. - Safe https://www.virustotal.com/en/file/e01211ca36aa02e923f20adee0a3c4f5d5187dc65bdf1c997b3da3c2b0745425/analysis/1433430542/
Of course there are no false positives by tietokone-olmi · 2017-02-27 03:07 · Score: 1

Point one out, if you can.
should be 2 way by Anonymous Coward · 2017-02-27 03:15 · Score: 0

if google can , on it's own , decide what i post ...or not , then i should be able to erase all my history from their datacenters , i understand why google is doing it , and somewhat support it , but i should also have the same capability , we all have a post or comment we end up regreting 5 seconds after
Of course by Anonymous Coward · 2017-02-27 03:20 · Score: 0

That's the point! What good is a filtering system if you can't abuse it to brush opposing viewpoints under the rug?
/.ers speak for me #1/2 by Anonymous Coward · 2017-02-27 03:26 · Score: 0

I'm going to continue using the Host File Engine. Your software is well written, functional. The Host File Engine performs exactly as promised by mmell

his hosts program is actually pretty good by xenotransplant

his hosts tool is actually useful for those cases in which one does indeed want to locally block stuff outright while consuming minimum system resources by alexgieg

I've never tried to belittle (APK's) work, I've flat out said it's good by BronsCon

take a look at the APK hosts file engine by SuperKendall

APK is kinda right. I've tried his hosts file generating software. It works by bmo

APK is totally right on this count. Adblock Plus on Firefox mobile is a dog on older, or lower end, phones. A hostfile based adblocker makes for a much better experience by chihowa

I like your host file system by Karmashock

I find your hosts file admirable by vel-ex-tech

* My code's liked + recommended & hosted by Malwarebytes' hpHosts!
APK
P.S.=> More coming... apk
/.ers speak for me #2/2 by Anonymous Coward · 2017-02-27 03:31 · Score: 0

I support APK's stand on the hosts file by Trax3001BBS

Your premise that hostfiles are a good way to deal with advertising and malvertising is quite valid by JazzLad

No complaints from me, I like APK... Reminds me to use a host file. Also, his stuff is free by aaaaaaargh!

APK's monolithic hosts file is looking pretty good by Culture20

APK... Awesome to see he's still spreading the good word by Molochi

ABP is insufficient as a solid hosts file does everything that APK reminds us about by fast turtle

APK isn't wrong by cfalcon

APK, I know people give you a lot of shit regarding hosts, but please don't ever stop by nasredin

You need APK's hosts file by Teun

APK solution STILL relevant by Thud457

you're right about hosts files by drinkypoo

APK
P.S.=> They're in addition to https://tech.slashdot.org/comments.pl?sid=10296193&cid=53938473/ many more earlier + 1,000's worldwide - there's no arguing w/ success... apk
Reddit? by joboss · 2017-02-27 04:06 · Score: 1

Reddit has a spam filter that silently removes comments and its operation is completely mysterious. It can be tied in with if users report comments as spam. If you report a comment you don't like as spam rather than some other rule and mods enforce the rules according to personal believe and political ideology then it can sway the spam filters towards that in theory. Types of abuse can also be reported as spam even though it's not spam in the normal commercial sense.

With business emails and things you can't get away with too many false positives but for forums it can be pretty bad, you can get away with a lot. The worst part of it I find is the lack of transparency. You have to manually check that your comment is really visible in a round about way.

I think this is something that can be a growing problem in the future with a lot of these things. I've personally found that Google has changed a lot over time now really favouring newest content. It's really hard to get specific things anymore when you search especially on current affairs. Their's a massive self reinforcing bias towards popular things.
Google email Spam Filtering Algorithm Suspect Too by Anonymous Coward · 2017-02-27 06:27 · Score: 0

Google is great at filtering email spam. It's reduced my spam problem to a minor trickle.
However, the algorithms have a few latent bugs that sometimes delete messages without putting them in the spam folder (and no Gmail team "message...contained a virus or a suspicious attachment." notification is sent).
I've found a certain Excel file attachment example that gets deleted without warnings when sent to my Gmail account, while most others Excel files arrive successfully.
Anyone else had problems with Google email filtering?
Sideline:
I wish Gmail had a "Don't put messages in spam if the sender is in my address book" option. Too often I get false positives from addresses in my address book.
95%+ false flags on my channel by slashmydots · 2017-02-27 06:49 · Score: 1

My youtube channel has over 24,000 subs and gets about a million views per month and whenever I look at the hundred or so spam comments at any given time, I'd say 95% or more are not spam. It also lets a TON of malicious spam through.
1. Re:95%+ false flags on my channel by Anonymous Coward · 2017-02-27 14:26 · Score: 0
  
  My youtube channel has over 24,000 subs and gets about a million views per month
  I bet it also has three black belts, bangs five supermodels a day, and has an uncle who works for Nintendo.
Oh, honestly by michael_wojcik · 2017-02-27 07:14 · Score: 1

I can't really criticize Weinstein, since no doubt there's some nonzero number of idiot admins who are using the Perspective API to filter comments, or are considering doing so.
But the Jigsaw blog post releasing the thing says that 1) it's in alpha, 2) it has both poor precision (too many false positives) and poor recall (misses many "forms", as they like to put it), and 3) it's not to be used in production. What's more, they tell you how it works - logistic regression on a supervised-learning model built from corpora evaluated by a wide range of mostly-non-expert human judges - and anyone with a decent background in NLP or ML would be able to tell you that, no, that model is not going to be very good.
(Logistic regression does have the advantage here of being a real-valued classifier, rather than a discrete one like a collection of SVMs or similar; and it has the advantage over some other regressions of reducing the influence of outliers. But it's still a single-dimensional model, with no semantic-structure analysis, and probably little in the way of syntactic-structure analysis.)
As for "algorithm-driven censorship" &c, that bandwagon is already well down the road. People who actually study digital media and online writing have been discussing all sorts of aspects about it for decades. Digital rhetoric has been an established field since at least the turn of the century, and computational rhetoric since at least 1991 (Makuta-Giluk's dissertation). Doesn't mean we don't need more people looking at it, of course, but it would be nice if folks brushed up on the existing field before launching new projects.
It's not a bug, it's a feature by Anonymous Coward · 2017-02-28 02:19 · Score: 0

SJWs kill everything they infect. The whole POINT of Google and FB and Twitter is to identify and suppress non-liberal POVs on every topic form homosexuality to immigration to abortion to gun control to radical Islamic terrorists. This is just making it more efficient. Next, "hate speech" isn't free speech. Then, goodbye 1st Amendment hello "hate speech law"s.
Then the population really will have no way to counter any argument put out by the left, since knowing and speaking about reality as it actually is will be illegal.
Once we are no longer permitted to speak about reality (because it's, you know, all a social construct anyway) then the world will have returned to the true meaning of the phrase "politically correct", which arose in the Soviet Union under Stalin where on Comrade scientist would to another about some aspect of reality, say wheat yields, "yeah, you know that computation isn't correct" and the other would say "yes, but it's politically correct", and thus live another day.