Slashdot Mirror


Google's Sentiment Analyzer Thinks Being Gay Is Bad (vice.com)

gooddogsgotoheaven shares a report from Motherboard: In July 2016, Google announced the public beta launch of a new machine learning application program interface (API), called the Cloud Natural Language API. It allows developers to incorporate Google's deep learning models into their own applications. As the company said in its announcement of the API, it lets you "easily reveal the structure and meaning of your text in a variety of languages." In addition to entity recognition (deciphering what's being talked about in a text) and syntax analysis (parsing the structure of that text), the API included a sentiment analyzer to allow programs to determine the degree to which sentences expressed a negative or positive sentiment, on a scale of -1 to 1. The problem is the API labels sentences about religious and ethnic minorities as negative -- indicating it's inherently biased. For example, it labels both being a Jew and being a homosexual as negative. A Google spokesperson issued the following statement in response to Motherboard's request for comment: "We dedicate a lot of efforts to making sure the NLP API avoids bias, but we don't always get it right. This is an example of one of those times, and we are sorry. We take this seriously and are working on improving our models. We will correct this specific case, and, more broadly, building more inclusive algorithms is crucial to bringing the benefits of machine learning to everyone."

42 of 453 comments (clear)

  1. See below by temcat · · Score: 4, Interesting

    If you've already decided that being homosexual (or a Jew, or a redhead, or a lefty, you name it) must not be deemed negative, why do you need analysis at all?
    Anyway, here is a constructive suggestion: define more precisely what "negative" is. (Things like this are sorely needed IRL too, where people e.g. routinely use statistical norm as an argument in ethical discussions.)

    1. Re:See below by TWX · · Score: 5, Insightful

      If the algorithms that have come to these conclusions are based on analyzing public data from the Internet, then if an AI decides that any particular characteristic is negative, it's because it reflects the sentiments of those who bother to post opinions.

      Most people that do not themselves exhibit the trait that's being argued-against by the noisy minority don't usually express opinions on it, so they're a hole in the data that needs to be accounted for. Unfortunately it's a lot easier to interpret based on what has been said than what has not been said.

      --
      Do not look into laser with remaining eye.
    2. Re:See below by temcat · · Score: 2, Insightful

      I just don't understand why Google should apologize for a fact (namely, for the sentiments of some strangers). Okay, so it turns out there are sufficiently many people who think that homosexuality is bad—but why then pretend they don't? It's not as if it's some kind of an objective measure or an ethical judgment of divine origin. They got the answer to the exact question they asked. If they wanted a different reply, maybe they should've asked a different question.

    3. Re:See below by doctorvo · · Score: 3, Insightful

      On the one hand we have tons of actual data from probably hundreds of millions of people. Sure, it's a biased sample, but it's still a very large sample.

      On the other hand, we have your ideologically motivated handwaving and reinterpretation.

      Which of those do you think is more reliable?

    4. Re:See below by pots · · Score: 2

      It seems likely that if some people think that a trait is bad, and other people think that a trait is neutral, then on balance a simple average would be somewhat negative.

      A more interesting analysis might look at whether expressing an opinion that being gay/jewish is bad, is itself bad (i.e.: are racists bad?). In that case you certainly have some people who think that this is bad, but the question is: do racists think that being racist is good? Or neutral? Even if they think it's good they probably aren't going to balance out the anti-racists, but is being anti-gay better or worse than being gay, on average?

    5. Re:See below by Rockoon · · Score: 2

      Contrary to what the media will tell you, racists are proud of being racist.

      The media will tell you that a guy who is repeating that he isnt racist, is racist, because of this one single thing that this evil racist did (that went "viral.")

      Its a social justice warrior world, and warriors need enemies.

      --
      "His name was James Damore."
  2. Funny how this always happens. by Anonymous Coward · · Score: 4, Funny

    It's almost like objective, quantifiable reality and feel-good political correctness are fundamentally at odds with each other.

  3. It only reflects common usage by DavenH · · Score: 2

    Nobody at Google goes through a dictionary choosing the sentiment of words; it's the context of word usage out in the world that trains these models. So it's not Google's fault, it's our fault, if blame is to be laid.

  4. Re:Comments by stephanruby · · Score: 5, Insightful

    Their api is trying to be too universal and monolithic.

    Gay is neither good or bad. Gay is just a label. Google should be able to recognize that 'gay' means one thing when a bigot uses it, but it should also recognize that 'gay' means something else when others use it.

    In other words, the meaning of words should be heavily weighed on the training data of the individual listening/reading and on the training data of the individual speaking/writing.

  5. Look in the mirror by JOstrow · · Score: 5, Informative

    So many of us already use "gay" and "jew" as derogatory terms. Is it any wonder that Google's NLP picked up on that? What source do you think it learned from?

  6. Re:That's because... by Jeremi · · Score: 5, Insightful

    Biologically speaking homosexuality is bad for the replication of any species in it's most basic form

    Well, no.

    Being homosexual might be bad for the replication of the individual (since it reduces the individual's chances of having offspring), but it doesn't necessarily follow that it's therefore also bad for the species.

    Consider that childless homosexuals, not being burdened by the task of having to care for children of their own, will likely therefore have extra time and resources available to help protect and care for the children of child-bearing couples in their family or community, thus improving the children's (and therefore the species') chances of surviving. Given the huge time investment that human children require, increasing a child's odds of survival to adulthood is a huge win.

    And if you think that sort of dynamic is uncommon in nature, consider that fact that 99% of bees are sterile and play no role in the reproduction of their hive, except to gather food and protect the queen and the drones who do handle the reproduction.

    Specialization and division of labor is one of the things human civilization is based on; there's no reason it can't be applied to reproduction as well.

    --


    I don't care if it's 90,000 hectares. That lake was not my doing.
  7. AI is not politically correct by Anonymous Coward · · Score: 2, Insightful

    Reading the ridiculous comment from the google spokesperson falling all over themselves to apologize and prattle on with all the talking points of every fake corporate "diversity" statement ever made is pretty hilarious. And pathetic.

    The system came up with its conclusion on its own. It wasn't the desired conclusion by some people's standards. It's a machine that isn't real. Who cares what it "thinks"? Why apologize? Once they program enough biases of all the things it is NOT allowed to consider "bad" the whole point of this application is gone.

    I bet it if it decided that white straight christian/atheist/agnostic males were the scourge of the earth then everybody would celebrate.

  8. Re:That's because... by dfghjk · · Score: 2

    "Biologically speaking homosexuality is bad for the replication of any species in it's most basic form..."

    There is utterly no evidence to support this assertion, and there is evidence that homosexuality is, in fact, a side effect of an adaptation that improves reproduction in a species.

    "This is a scientific certitude that only delusion will argue with."

    And this says a lot about your intellectual capacity, or lack thereof, not that your bogus claim matters in any way. It is what it is.

  9. Re:Comments by Anonymous Coward · · Score: 3, Insightful

    Actually this pokes a bigger hole in their algorithm. EVERYTHING has context. To gun nuts, democrats are going to seem bad. To democrats, democrats are going to seem good. EVERYTHING HAS BIASES. Kind of a huge oversight with Google, you need the context of the reader's biases to even begin to predict positive or negative rating of any text.

  10. Well, duh! by GerryGilmore · · Score: 4, Insightful

    Even though we like to think that everyone is enlightened, etc as us, in very broad swaths of American society being gay or jewish (or Muslim or...) is very much perceived as a negative out of the starting gate. File under: Sad-But-true.

  11. Probably just from what it was fed by burtosis · · Score: 2

    These require material to train them and the responses tend to reflect the participants nuanced behavior. I mean, what do people think is going to happen when you force feed it, eyes taped open, to 47 million social media feeds? Seems to be some kind of fine line between an algorithm and portal to hell. Well, at least they did better than Microsoft

  12. Re:Comments by rtb61 · · Score: 2, Funny

    Well, to be fair, if you are a women, looking to get married and pregnant, homosexual men are evil. They are inherently prejudiced against women, misogynists of the worst order, only willing to share their bodies, cooking abilities, cleaning skills and their wallets with other men, the horror for fat lazy ugly heterosexual women (even though I do recognise that it has been proven, that perception of appearance is not driven by exceptional appearance but by super average appearance, you are not exceptionally attractive, just exceptionally average, you face lacking all character), especially when they are feminists wanting to dominate effeminate men, the hen pecked (it would be funnier if it weren't true, believe me, well if you are hen pecked not so funny but for us slackers super funny) ;D.

    --
    Chaos - everything, everywhere, everywhen
  13. Re:Comments by Anonymous Coward · · Score: 3, Interesting

    I'm gay and I think being gay is "bad". I wouldn't wish it on anyone growing up. It makes life more difficult. Dating pool is much smaller. There's still homophobia everywhere. If I had a kid I wouldn't want them to be gay, I'd want them to have a chance at a "normal" life. I'd be absolutely loving and accepting if they were gay but I don't want them to have to go through any of the shit I did.

    I can see why gay would end up with a -.

  14. Re: Comments by Anonymous Coward · · Score: 2, Interesting

    Also, some people hate being exposed to facts that interfere with their beliefs, such as black people commit 217 times more violent crimes, gay men are 12 times more likely to be pedophiles, etc.

    You are guilty of the same behavior you're pretending to be smugly self-righteous about.

  15. Probably just the zeitgeist by Okian+Warrior · · Score: 4, Insightful

    Gay is neither good or bad. Gay is just a label.

    While true, one can easily see the reason for the problem.

    Google API is trained to look for correlations in written text. If it sees a lot of negative text about something, then that's what it will believe.

    I note that there's a lot of text that condemn jews for one reason or another, but there's not a lot that *praises* jews. We hear all the time about Christian charities, for example, but not a lot for the jewish ones(*).

    There's also a lot of negative statements about gays, and although there's *some* text praising gayness, it's mostly either personal ("good on you for having the courage to believe in yourself") or neutral ("it's OK to be gay, it's normal"). I've never seen writing that *praises* gayness as a concept.

    Compare with Christianity, where there is endless adulation of the Christian way of life. Democracy is probably the same way.

    I'd be interested to see what the API thinks about Islam, or Trump, or Clinton, or a host of other controversial political subjects.

    Google API is probably just giving us a reflection of the zeitgeist.

    (*) Don't read anything into this, I'm only saying that Christians get better press.

  16. Re:Comments by ShanghaiBill · · Score: 5, Interesting

    I'm gay and I think being gay is "bad".

    I am straight and I think gays are "good". Gay tend to be DINKs (double income no kids), so they have high household incomes and can afford nice houses. They pay property taxes to support the schools, but don't have kids, leading to higher per student educational spending. They contribute to a thriving urban culture and nightlife.

    I live in the SF Bay Area, and gays are a big part of the reason this is a nice place to live, with great schools, creative jobs, safe neighborhoods, and interesting culture. Thank you for being gay.

  17. It's just doing its job by barbariccow · · Score: 2, Insightful

    It's just doing its job. Obviously on the data it was trained, "gay" and "jew" were used as derogatory terms. And they're apologizing for that instead of explaining it? Wouldn't you rather have a system that didn't have injected bias, like injecting an override such that "gay" and "jew" receive a sentiment boost despite that being contrary to its training? Total crap. Someone probably ran through every SJW term and happened to find two that didn't have the results they wished it had on usage amongst speech.

  18. Re:Comments by Dutch+Gun · · Score: 4, Informative

    In fairness, Google is marketing this API for a somewhat narrow purpose: Determining whether a customer left a *positive* or *negative* reaction to a customer's comment in your company support forums, for example, or attempting to determine customer reaction from support interactions.

    This little fact is somewhat obfuscated in the summary, in which it seems to be billing it as a more general-purpose system that's making sweeping value judgements about society. Within this actual context, let's be honest, if you see those terms in your company's customer support forums, what do you think of the likelihood is of them being part of positive or negative comments? Yeah, exactly.

    The big mistake that Google made is not putting a politically correct filter on their API to make sure controversial words had a neutral value, even if that wasn't really the case. Otherwise, you generate flamebait headlines like we see here, wherein highly limited "AI" algorithms simply regurgitates the training material it was fed without any deep or sophisticated understanding.

    --
    Irony: Agile development has too much intertia to be abandoned now.
  19. Re:That's because... by clonehappy · · Score: 2

    Yes, I think being celibate is bad for replication. By definition of replication and by definition of celibate. In fact, any activity which does NOT result in replication will not positively affect replication.

  20. Re:Comments by barbariccow · · Score: 3, Insightful

    That brings up a point I was going to raise: it's odd that this algorithm would be summative of values for individual words rather than trying to derive a context. Given that would be a really stupid design, my guess (if you think I read TFA you best check the url you're currently on) is that they tested this by taking a sentence like "You're so happy", got a score, replaced it with "you're so gay!" or "you're such a jew!" and got a negative sentiment whereas things like "you're so funny" generally had a positive context. I.e. biased testing.

  21. What the app thinks of Engineering Mechanics by Latent+Heat · · Score: 3, Interesting

    I ran a Google Scholar search that turned up a paper "Jerk Influence Coefficients, Through Screw Theory, of Closed Chains." The title is not a vulgar joke as each of the phrases is a term-of-art on the topic of rigid-body (yes, another domain-specific term) mechanics.

    Let's just say Google served me a large number of adds on the assumption that I belong to what is only a narrow subculture of a broader and more diverse community.

  22. Logically speaking... by oic0 · · Score: 4, Insightful

    It's not a judgement by the AI against the people, it's a judgement about how others react and behave towards those them. Stop apologizing for your AI being able to perceive human behavior. "I'm so sorry my AI figured out you have a flat tire".

  23. Re:Comments by h33t+l4x0r · · Score: 4, Insightful

    How often are parents disappointed that their kid is straight? Being gay is subjectively more often bad than good and everyone knows it.

  24. Re:That's because... by kamapuaa · · Score: 4, Insightful

    But from a biological standpoint, no one can say that it can scientifically be positively correlated to the replication of the species

    You repeatedly speak in absolutes ("noone can say" - "biologically homosexuality is bad") when there isn't cause for it and there certainly isn't a scientific consensus. You imagine a PC agenda propagating homosexuality as good for the species, when that really makes no sense.

    Homosexuality is common throughout nature. While biological dead ends do make it through the process of evolution for a long time, homosexuality is so endemic that it seems that it must have some positive impact on the species, or it would have evolved out. The theory isn't there, but empirically nature has shown it to be true.

    --
    Slashdot: providing anti-social weirdos a soapbox, since 1997.
  25. why is API bad? by superwiz · · Score: 2

    If it just allows for text analysis and the text, as a total body of statements, uses "Jew" or "gay" as insults, wouldn't high fidelity API reflect that? It seems more like a statement about the text being analyzed rather than about the processing. If the analysis didn't have high fidelity to the text, wouldn't it then be biased? Imagine the analysis which corrects for biases against historically-oppressed minorities. Now imagine this black box is fed Mein Kampf and other Nazi works. Should it correct for the biases? That would mean not detecting antisemitic biases in the Nazi propaganda. Wouldn't that make it a bad analysis?

    --
    Any guest worker system is indistinguishable from indentured servitude.
  26. Re:Comments by war4peace · · Score: 5, Insightful

    Being bisexual myself, obviously I have nothing against fellow gay people. Quite the contrary. However, I think Google's API is spot-on. Even in TFS they say "The problem is the API labels sentences about religious and ethnic minorities as negative" - well thanks, Captain Obvious, the majority will always see the minority in negative ways - because they're different.

    What we need to focus on is what the cause for negative perception is and fix things there, not replacing a correct algorithm with a lying one.

    --
    ...gis sdrawkcab (usually not responding to ACs; don't bother posting as AC)
  27. Re:Comments by serviscope_minor · · Score: 4, Funny

    Well, to be fair, if you are a women, looking to get married and pregnant, homosexual men are evil.

    I would say this is about the stupidest thing I've heard today.

    --
    SJW n. One who posts facts.
  28. Junk in Junk out by tomxor · · Score: 2

    It is not an "artificial intelligence", it is a tool built to analyse text, trained on a mass of text written by biased humans. Now if it had something resembling a conscience or critical thinking it might have a chance at identifying and balancing prejudice to defend it's "sentiment" weighting, but right now it's a "dumb" (in the AI sense) tool... So if it's biased against "Jews" and "gays", all it tells us is on average the humans who wrote the training data are biased against "gays" and "Jews".

  29. Re: Comments by king+neckbeard · · Score: 2

    Why would everyone need to be gay for being gay to be okay? Sexuality appears to have a bimodal distribution on the Kinsey scale. There may be sociological factors involved in those numbers as well, but it's pretty well tuned for a relatively stable population.

    Also, you can't apply categorical imperative that broadly. Humanity would die out faster if everyone belonged to any one particular occupation, because our current carrying levels depend upon extensive separation of labor. However, working any particular job, such as doctor, farmer, or scientist, is obviously not immoral or impractical in and of itself.

    --
    This is my signature. There are many like it, but this one is mine.
  30. Re: What about Gay AND Jewish? by lucasnate1 · · Score: 3, Insightful

    I am Jewish and living in Israel and what you are saying is a lie that some of our religious people spread in order to pretend to integrate with western culture.

    "If a man lies with a man as one lies with a woman, both of them have done what is detestable. They must be put to death; their blood will be on their own heads." - This is Leviticus 20:13, part of the Jewish bible.

    Furthermore, you should ask yourself, if Jewism has nothing against homosexuality, how come in Israel gay marrige are prohibited while they are legal in countries that have a high percentage of Christians?

    The conservative and reform Jewism are not homophobic, but the orthodox Jews, those who represented jewism for the last few hundred years, those who are the majority here in Israel, they are homophobic just like Christian and Muslim friends.

  31. Re:That's because... by TheRaven64 · · Score: 2

    Examples: Ancient Greece / Macedonia, Roman Empire, Vikings. Lots of examples of highly successful civilisations that had high instances of homosexuality (Alexander the Great's wife complained that he'd rather sleep with his generals than with her) and seem to have done very well out of it.

    --
    I am TheRaven on Soylent News
  32. Re: Comments by Bing+Tsher+E · · Score: 2, Interesting

    Quibbling bullshit like you just engaged in is why regular people settle for a guy like Trump who calls you on your bullshit language soup. (Trump isn't much better, if at all, but people want a way to respond to the psuedo-intellectual drivel you promulgate.)

    Simply put: quit being such a fucking prat. I know it's difficult, but 'clever' will be the death of you.

  33. Re: Comments by Timothy2.0 · · Score: 4, Interesting

    I would suggest a non-trivial subset of the training dataset encountered words like "homo" and "gay" in negative, colloquial contexts and likely couldn't discern the idiomatic usage. For example, if gaming chat windows provided any input for the training set, it wouldn't be unreasonable to see something like "stop spawn-camping you fucking fag" and the like wend its way into the dataset.

    The sentiment analyzer reflects how average people use language, so the perception of the sentiment analyzer will be through that average user's use of language. If the average person is a bigoted idiot, the sentiment analyzer's interpretations will be those of a bigoted idiot.

  34. Re:Comments by Austerity+Empowers · · Score: 2

    Gay is neither good or bad. Gay is just a label. Google should be able to recognize that 'gay' means one thing when a bigot uses it, but it should also recognize that 'gay' means something else when others use it.

    Homosexuality in the majority of the world is anywhere from merely tolerated to a capital offense. It's not hard to see why their AI gets it wrong. It's only in the west where we've taken measures to try to change it, and even then it's something we want to say we support, but not something we want to happen to us.

    I suspect even in the western world, people are disappointed when their kids come out as gay. They may try to accept and move past that disappointment, but it's there. So I'm not sure how they're going to program this AI without making it insane.

  35. Re: Comments by JackieBrown · · Score: 3, Insightful

    Good and evil are binary categories for the uneducated--it's really all relative to an individual perspective and it's never binary

    It's this kind of thinking that has allowed a generation or two of self important assholes who think that whatever they do is right as long as they can justify their actions to themselves. (Casting couch and degrading woman is ok because Wiesntein is making movies that uplift woman - so for 1 person hurt, millions helped.) This idea is destroying our society's honor and soul.

    Once you find good and evil always relative, anything is acceptable.

  36. Re: Comments by sound+vision · · Score: 3, Insightful

    It's a bit of a problem with the entire machine learning methodology. Presumably, these kind of bug-features could be worked out, in the manner you describe, given enough time and resources... The problem is that when the banks, police, HR departments etc. start making extensive use of these things. Are they going to be looking for an algorithm that's fair, or an algorithm that's profitable? To be sure, there is probably an algorithm that's both. Human civilization could reap a wicked ROI on this stuff, but in today's dark world, that is seen as infringing on someone's right to make a profit. My guess is that AI starts getting widespread deployment and the problems manifest quickly... But the problems look so similar to the ones we already have (the AI, after all, is trained for that) that there is no public outcry. It might even serve to reinforce those preconceptions. "The algorithm never tells me to hire a gay, they must be really bad." The idea that "Because things are bad now, there is no other way they could be" has a lot of traction these days. It's caused a lot of good people to maintain horrible status quos.

  37. Re: Comments by werepants · · Score: 2

    Once you find good and evil always relative, anything is acceptable.

    And people who are full of absolutist certitude can often justify anything in the pursuit of their goal. Intellectual humility fosters doubt, though, and doubt causes scrutiny, and scrutiny means that you're thinking about your actions.

    People like Weinstein are good at rationalizing - and you can rationalize from a perspective of moral absolutism (it's ok to murder him because he's a heathen) or from a perspective of relativism (if it feels good, do it). And the cold, hard truth is, people often do the wrong thing. Including movie execs, preachers, and every other profession under the sun.