Slashdot Mirror


Google's Sentiment Analyzer Thinks Being Gay Is Bad (vice.com)

gooddogsgotoheaven shares a report from Motherboard: In July 2016, Google announced the public beta launch of a new machine learning application program interface (API), called the Cloud Natural Language API. It allows developers to incorporate Google's deep learning models into their own applications. As the company said in its announcement of the API, it lets you "easily reveal the structure and meaning of your text in a variety of languages." In addition to entity recognition (deciphering what's being talked about in a text) and syntax analysis (parsing the structure of that text), the API included a sentiment analyzer to allow programs to determine the degree to which sentences expressed a negative or positive sentiment, on a scale of -1 to 1. The problem is the API labels sentences about religious and ethnic minorities as negative -- indicating it's inherently biased. For example, it labels both being a Jew and being a homosexual as negative. A Google spokesperson issued the following statement in response to Motherboard's request for comment: "We dedicate a lot of efforts to making sure the NLP API avoids bias, but we don't always get it right. This is an example of one of those times, and we are sorry. We take this seriously and are working on improving our models. We will correct this specific case, and, more broadly, building more inclusive algorithms is crucial to bringing the benefits of machine learning to everyone."

16 of 453 comments (clear)

  1. See below by temcat · · Score: 4, Interesting

    If you've already decided that being homosexual (or a Jew, or a redhead, or a lefty, you name it) must not be deemed negative, why do you need analysis at all?
    Anyway, here is a constructive suggestion: define more precisely what "negative" is. (Things like this are sorely needed IRL too, where people e.g. routinely use statistical norm as an argument in ethical discussions.)

    1. Re:See below by TWX · · Score: 5, Insightful

      If the algorithms that have come to these conclusions are based on analyzing public data from the Internet, then if an AI decides that any particular characteristic is negative, it's because it reflects the sentiments of those who bother to post opinions.

      Most people that do not themselves exhibit the trait that's being argued-against by the noisy minority don't usually express opinions on it, so they're a hole in the data that needs to be accounted for. Unfortunately it's a lot easier to interpret based on what has been said than what has not been said.

      --
      Do not look into laser with remaining eye.
  2. Funny how this always happens. by Anonymous Coward · · Score: 4, Funny

    It's almost like objective, quantifiable reality and feel-good political correctness are fundamentally at odds with each other.

  3. Re:Comments by stephanruby · · Score: 5, Insightful

    Their api is trying to be too universal and monolithic.

    Gay is neither good or bad. Gay is just a label. Google should be able to recognize that 'gay' means one thing when a bigot uses it, but it should also recognize that 'gay' means something else when others use it.

    In other words, the meaning of words should be heavily weighed on the training data of the individual listening/reading and on the training data of the individual speaking/writing.

  4. Look in the mirror by JOstrow · · Score: 5, Informative

    So many of us already use "gay" and "jew" as derogatory terms. Is it any wonder that Google's NLP picked up on that? What source do you think it learned from?

  5. Re:That's because... by Jeremi · · Score: 5, Insightful

    Biologically speaking homosexuality is bad for the replication of any species in it's most basic form

    Well, no.

    Being homosexual might be bad for the replication of the individual (since it reduces the individual's chances of having offspring), but it doesn't necessarily follow that it's therefore also bad for the species.

    Consider that childless homosexuals, not being burdened by the task of having to care for children of their own, will likely therefore have extra time and resources available to help protect and care for the children of child-bearing couples in their family or community, thus improving the children's (and therefore the species') chances of surviving. Given the huge time investment that human children require, increasing a child's odds of survival to adulthood is a huge win.

    And if you think that sort of dynamic is uncommon in nature, consider that fact that 99% of bees are sterile and play no role in the reproduction of their hive, except to gather food and protect the queen and the drones who do handle the reproduction.

    Specialization and division of labor is one of the things human civilization is based on; there's no reason it can't be applied to reproduction as well.

    --


    I don't care if it's 90,000 hectares. That lake was not my doing.
  6. Well, duh! by GerryGilmore · · Score: 4, Insightful

    Even though we like to think that everyone is enlightened, etc as us, in very broad swaths of American society being gay or jewish (or Muslim or...) is very much perceived as a negative out of the starting gate. File under: Sad-But-true.

  7. Probably just the zeitgeist by Okian+Warrior · · Score: 4, Insightful

    Gay is neither good or bad. Gay is just a label.

    While true, one can easily see the reason for the problem.

    Google API is trained to look for correlations in written text. If it sees a lot of negative text about something, then that's what it will believe.

    I note that there's a lot of text that condemn jews for one reason or another, but there's not a lot that *praises* jews. We hear all the time about Christian charities, for example, but not a lot for the jewish ones(*).

    There's also a lot of negative statements about gays, and although there's *some* text praising gayness, it's mostly either personal ("good on you for having the courage to believe in yourself") or neutral ("it's OK to be gay, it's normal"). I've never seen writing that *praises* gayness as a concept.

    Compare with Christianity, where there is endless adulation of the Christian way of life. Democracy is probably the same way.

    I'd be interested to see what the API thinks about Islam, or Trump, or Clinton, or a host of other controversial political subjects.

    Google API is probably just giving us a reflection of the zeitgeist.

    (*) Don't read anything into this, I'm only saying that Christians get better press.

  8. Re:Comments by ShanghaiBill · · Score: 5, Interesting

    I'm gay and I think being gay is "bad".

    I am straight and I think gays are "good". Gay tend to be DINKs (double income no kids), so they have high household incomes and can afford nice houses. They pay property taxes to support the schools, but don't have kids, leading to higher per student educational spending. They contribute to a thriving urban culture and nightlife.

    I live in the SF Bay Area, and gays are a big part of the reason this is a nice place to live, with great schools, creative jobs, safe neighborhoods, and interesting culture. Thank you for being gay.

  9. Re:Comments by Dutch+Gun · · Score: 4, Informative

    In fairness, Google is marketing this API for a somewhat narrow purpose: Determining whether a customer left a *positive* or *negative* reaction to a customer's comment in your company support forums, for example, or attempting to determine customer reaction from support interactions.

    This little fact is somewhat obfuscated in the summary, in which it seems to be billing it as a more general-purpose system that's making sweeping value judgements about society. Within this actual context, let's be honest, if you see those terms in your company's customer support forums, what do you think of the likelihood is of them being part of positive or negative comments? Yeah, exactly.

    The big mistake that Google made is not putting a politically correct filter on their API to make sure controversial words had a neutral value, even if that wasn't really the case. Otherwise, you generate flamebait headlines like we see here, wherein highly limited "AI" algorithms simply regurgitates the training material it was fed without any deep or sophisticated understanding.

    --
    Irony: Agile development has too much intertia to be abandoned now.
  10. Logically speaking... by oic0 · · Score: 4, Insightful

    It's not a judgement by the AI against the people, it's a judgement about how others react and behave towards those them. Stop apologizing for your AI being able to perceive human behavior. "I'm so sorry my AI figured out you have a flat tire".

  11. Re:Comments by h33t+l4x0r · · Score: 4, Insightful

    How often are parents disappointed that their kid is straight? Being gay is subjectively more often bad than good and everyone knows it.

  12. Re:That's because... by kamapuaa · · Score: 4, Insightful

    But from a biological standpoint, no one can say that it can scientifically be positively correlated to the replication of the species

    You repeatedly speak in absolutes ("noone can say" - "biologically homosexuality is bad") when there isn't cause for it and there certainly isn't a scientific consensus. You imagine a PC agenda propagating homosexuality as good for the species, when that really makes no sense.

    Homosexuality is common throughout nature. While biological dead ends do make it through the process of evolution for a long time, homosexuality is so endemic that it seems that it must have some positive impact on the species, or it would have evolved out. The theory isn't there, but empirically nature has shown it to be true.

    --
    Slashdot: providing anti-social weirdos a soapbox, since 1997.
  13. Re:Comments by war4peace · · Score: 5, Insightful

    Being bisexual myself, obviously I have nothing against fellow gay people. Quite the contrary. However, I think Google's API is spot-on. Even in TFS they say "The problem is the API labels sentences about religious and ethnic minorities as negative" - well thanks, Captain Obvious, the majority will always see the minority in negative ways - because they're different.

    What we need to focus on is what the cause for negative perception is and fix things there, not replacing a correct algorithm with a lying one.

    --
    ...gis sdrawkcab (usually not responding to ACs; don't bother posting as AC)
  14. Re:Comments by serviscope_minor · · Score: 4, Funny

    Well, to be fair, if you are a women, looking to get married and pregnant, homosexual men are evil.

    I would say this is about the stupidest thing I've heard today.

    --
    SJW n. One who posts facts.
  15. Re: Comments by Timothy2.0 · · Score: 4, Interesting

    I would suggest a non-trivial subset of the training dataset encountered words like "homo" and "gay" in negative, colloquial contexts and likely couldn't discern the idiomatic usage. For example, if gaming chat windows provided any input for the training set, it wouldn't be unreasonable to see something like "stop spawn-camping you fucking fag" and the like wend its way into the dataset.

    The sentiment analyzer reflects how average people use language, so the perception of the sentiment analyzer will be through that average user's use of language. If the average person is a bigoted idiot, the sentiment analyzer's interpretations will be those of a bigoted idiot.