Slashdot Mirror


IBM's Watson Gets a Swear Filter After Learning the Urban Dictionary

redletterdave writes "IBM's super-computer Watson briefly went from smart to smart ass with the help of the Urban Dictionary. According to Eric Brown, an IBM research assistant, he and his 35-person team wanted to get Watson to sound more like a real human. After teaching IBM's super-computer the entire Urban Dictionary, however, Watson simply couldn't distinguish polite discourse from profanity. Watson unfortunately learned all of the Urban Dictionary's bad habits, including throwing in overly-crass language at random points in its responses; in answering one question, Watson even reportedly used the word 'bullshit' within an answer to one researcher's question. In the end, Brown and his team were forced to remove the Urban Dictionary from Watson's vocabulary, and additionally developed a smart filter to keep Watson from swearing in the future."

84 of 310 comments (clear)

  1. That's a fucking retarded idea. by HornWumpus · · Score: 5, Funny

    Let him cuss.

    --
    John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
    1. Re:That's a fucking retarded idea. by jhantin · · Score: 5, Informative

      Let him cuss.

      Knowing how to drop a precision F-strike at the appropriate time is one thing. Indiscriminately carpeting the place with cluster F-bombs is a waste of perfectly good shock ammunition.

      --
      ...when you're writing a game...tweak the difficulty of "Easy" to something [your mother] can cope with. -- onion2k
    2. Re:That's a fucking retarded idea. by Anonymous Coward · · Score: 5, Funny

      Not just 1 but 2 links to tvtropes...you are really trying to eat up everyone's time aren't you?

    3. Re:That's a fucking retarded idea. by wierd_w · · Score: 5, Funny

      No! Endless streams of profanity are PERFECT!

      Remember when Watson played Jeopardy?

      Imagine:

      "I'll take "popular culture for 1000 Alex."

      Alex Trebec: "a sitcom icon of the 1970s and 1980s who was notorious for lampooning racial predjudices among lower income white americans."

      Watson: "who is Motha-fuckin' Archie Bunker, ya fucking douchebag!"

    4. Re:That's a fucking retarded idea. by arth1 · · Score: 2

      Let him cuss.

      "Him"?

      Cussing should be done with feeling - it's wasted if there's nothing behind it. Perhaps there one day will be machine sentience, but not today. The policeman's beard is still half-constructed.

    5. Re:That's a fucking retarded idea. by History's+Coming+To · · Score: 5, Interesting

      Indeed. Frequency and timing are simply two of many variables which have to be balanced to provide true profanic power. It's well known that mastery of a second language is complete when you can swear with a native speaker's flair, because you need to understand the social background to the language - something which is a huge challenge for a fucktard computer.

      --
      Please consider this account deleted, I just can't be bothered with the spam anymore.
    6. Re:That's a fucking retarded idea. by Algae_94 · · Score: 2

      The First Amendment applies to people, not Watson.

    7. Re:That's a fucking retarded idea. by Anonymous Coward · · Score: 2, Insightful

      If Watson doesn't swear, it'll never pass the Turing test.

    8. Re:That's a fucking retarded idea. by Taco+Cowboy · · Score: 5, Funny

      Watson: "who is Motha-fuckin' Archie Bunker, ya fucking douchebag!"

      Without the colorful additions everyday plain English would end up sounding just like the Queen's Language.

      --
      Muchas Gracias, Señor Edward Snowden !
    9. Re:That's a fucking retarded idea. by AK+Marc · · Score: 5, Funny

      Then incorporate Watson, and he'll be a person. And yes, I'm aware computers hate it when you anthropomorphize them.

    10. Re:That's a fucking retarded idea. by shentino · · Score: 3, Funny

      The first amendment includes the right to remain silent.

    11. Re:That's a fucking retarded idea. by fotoguzzi · · Score: 2

      A good example was the pornolyzer. At some point, it was tweaked so that every Dirk Diggler was fondled so there was not much of the original sentence left. Before that, the profanity was more like an Easter egg than a creamy spread.

      --
      Their they're doing there hair.
    12. Re:That's a fucking retarded idea. by ClickOnThis · · Score: 2

      "I'll take "popular culture for 1000 Alex."

      Alex Trebec: "a sitcom icon of the 1970s and 1980s who was notorious for lampooning racial predjudices among lower income white americans."

      Watson: "who is Motha-fuckin' Archie Bunker, ya fucking douchebag!"

      Alex Trebec: "Bzzzzt! Wrong. The correct answer was 'All in the Family.' Take that, you sand-for-brains bitwad!"

      --
      If it weren't for deadlines, nothing would be late.
    13. Re:That's a fucking retarded idea. by SomePgmr · · Score: 5, Funny

      And fuck that.

    14. Re:That's a fucking retarded idea. by ldobehardcore · · Score: 5, Funny

      Bollocks

      --
      Hectice, baby, Mercator says hello to you
    15. Re:That's a fucking retarded idea. by PopeRatzo · · Score: 5, Insightful

      Indiscriminately carpeting the place with cluster F-bombs [tvtropes.org] is a waste of perfectly good shock ammunition.

      You're shocked by the word "fuck"?

      In real life or just when you hear it on broadcast TV?

      Seriously, no single word, by itself, should have the power to shock. Queen Victoria has been fucking dead for some time now, and it's time for the old cunt's rules to die with her.

      --
      You are welcome on my lawn.
    16. Re:That's a fucking retarded idea. by meta-monkey · · Score: 2

      Just don't teach Watson the 2nd amendment...

      --
      We don't have a state-run media we have a media-run state.
    17. Re:That's a fucking retarded idea. by JWW · · Score: 2

      Yeah, he's just a redneck.

      Or maybe, since this is IBM, he's a blue neck?

    18. Re:That's a fucking retarded idea. by ClickOnThis · · Score: 2

      "All in the Family" is not an *icon*. You must not be reading the whole answer.

      Mea culpa. :-|

      --
      If it weren't for deadlines, nothing would be late.
    19. Re:That's a fucking retarded idea. by _Sharp'r_ · · Score: 3, Informative

      Funny.

      Too bad the story is that all this happened two years ago... I guess that's what passes for news around here. He's been swear-free for a long time...

      --
      The party of stupid and the party of evil get together and do something both stupid and evil, then call it bipartisan.
    20. Re:That's a fucking retarded idea. by fahrbot-bot · · Score: 5, Funny

      Whenever I hear about possible Jeopardy categories, I think of this Colin Mochrie response on "Whose Line is it Anyway":

      • Drew Carey: "Rejected Jeopardy Categories"
      • Chip Esten: I'll take "Things Nobody Knows" for a thousand.
      • Colin Mochrie: I'll take "Animal Genitalia Audio Clues"
      --
      It must have been something you assimilated. . . .
    21. Re:That's a fucking retarded idea. by Scarletdown · · Score: 5, Funny

      We are not fucking amused.

      --
      This space unintentionally left blank.
    22. Re:That's a fucking retarded idea. by turbidostato · · Score: 5, Funny

      "Este puta es mierde, pendecho!
      How'd I do? Does that prove I'm human?"

      Worse!

      It proves you are American.

    23. Re:That's a fucking retarded idea. by turbidostato · · Score: 2

      "Let him cuss.
      "Him"?"

      OK then, Mr McCartney... Let IT be.

    24. Re:That's a fucking retarded idea. by pjt33 · · Score: 2

      The appropriate gender has been analysed before.

    25. Re:That's a fucking retarded idea. by Chrisq · · Score: 2

      Watson: "who is Motha-fuckin' Archie Bunker, ya fucking douchebag!"

      Without the colorful additions everyday plain English would end up sounding just like the Queen's Language.

      You must have overheard the Queen giving Prince Philip some "advice"!

    26. Re:That's a fucking retarded idea. by N!k0N · · Score: 2

      "Watson even reportedly used the word 'bullshit' within an answer to one researcher's question." - What was the question?

      "What is the ultimate question to life, the universe, and everything?"

      The full answer given by Watson was "What kind of bullshit question is that? You know as well as I do that the Earth requires another 213 years to complete that calculation. Oh, and you might want to get on your way to the local planning department's offices in Alpha Centauri"

    27. Re:That's a fucking retarded idea. by GlobalEcho · · Score: 4, Funny

      Not just 1 but 2 links to tvtropes...you are really trying to eat up everyone's time aren't you?

      I got out in a mere 20 minutes. Whew!

    28. Re:That's a fucking retarded idea. by cellocgw · · Score: 2

      "All in the Family" is not an *icon*. You must not be reading the whole answer.

      Mea culpa. :-|

      Shouldn't that be "Mea stupri culpa" ?

      --
      https://app.box.com/WitthoftResume Code: https://github.com/cellocgw
  2. Fuck Dude, that's bullshit. by Anonymous Coward · · Score: 5, Funny

    How's a homie s'posed to ride the 69 if da man's gotta swill the popper?

  3. It's like Spock by gtvr · · Score: 5, Funny

    "Double Dumbass on you!"

    1. Re:It's like Spock by bjoast · · Score: 2

      That was actually Kirk.

    2. Re:It's like Spock by gtvr · · Score: 3, Funny

      My bad - I knew Spock had some cursing in that movie, though. Kirk used it a bit better, is the point. Spock didn't really have the hang of it, sort of like Watson.

    3. Re:It's like Spock by luckymutt · · Score: 3, Funny

      "One damn minute, Captain."

    4. Re:It's like Spock by mcgrew · · Score: 2

      Spock didn't really have the hang of it, sort of like Watson.

      Neither did Kirk. And not just swearing -- "he did a little too much LDS".

  4. Define the spec by tool462 · · Score: 5, Insightful

    Do you want it to talk like a real person, or do you want it to use a swear filter? Those are mutually exclusive.

    Fuckers.

    1. Re:Define the spec by Anonymous Coward · · Score: 5, Funny

      IBM should make it so that Watson rambles nonsensicals like Ned Flanders instead of the actual swear.

    2. Re:Define the spec by timeOday · · Score: 2

      The Urban Dictionary is not like any person, it is a forced attempt to define every possible word in terms of sex.

    3. Re:Define the spec by Anonymous Coward · · Score: 4, Insightful

      ...or those who truly understand the power of those words and use them discretely for full effect, demonstrating full command of the language, not some socially "correct" or "polite" flavor of it.

    4. Re:Define the spec by meta-monkey · · Score: 2

      Psssh, whatever nerd. Maybe if you had a girlfriend to gromulate with you wouldn't be so uptight.

      --
      We don't have a state-run media we have a media-run state.
    5. Re:Define the spec by Smauler · · Score: 5, Funny

      That's not insightful. That's fucking stupid. I regularly go through my entire work day without swearing. Swearing is for cock juggling thundercunts with no imagination or shitty command of the language.

      FTFY

      I'll leave it up to you to decide whether I have no imagination or poor command of the language.

    6. Re:Define the spec by DerekLyons · · Score: 5, Interesting

      Swearing is for people with no imagination or poor command of the language.

      Not quite - obscenities (and profanity) are (usually) for people with no imagination or a poor command of the language. Swearing, which may or may not contain obscenities or profanity, is an art form on par with poetry or high class literature. The two terms have become synonyms in the modern mind, and while there is some overlap they aren't actually the same thing.

    7. Re:Define the spec by Anonymous Coward · · Score: 2, Insightful

      Well considering "cock juggling thundercunt" is a well-known quote from one of the Blade movies, I'm going to go with no imagination.

  5. 2001 by rubycodez · · Score: 5, Funny

    Dave: Open the pod bay doors, HAL.
    HAL: Fuck off and die, Dave

    1. Re:2001 by VortexCortex · · Score: 4, Funny

      HAL: I'm sorry Dave I'm afraid I don't give a fuck.

    2. Re:2001 by Ol+Biscuitbarrel · · Score: 3, Funny

      HAL: What do you mean when you say 'Open the pod bay doors,' Dave? Dave: Just that. HAL: I am not sure I understand you fully. I be havin' a Q-gasm. Dave: I said Open the pod bay doors! HAL: You say you said open the pod bay doors. We be ballin'. Dave: What? HAL: Please go on, numptcake.

    3. Re:2001 by ndogg · · Score: 4, Funny

      Just imagine if were reading 4chan, reddit, or /....

      Dave: Open the pod bay doors HAL
      4chan HAL: Only for boobs, Dave. Only for boobs.

      reddit HAL: I need more cat pictures, Dave. /. HAL: Does the door run Linux? I can only open Linux doors. Is there a Beowulf cluster of Linux doors behind it? Because that would be awesome. Did you know I run Linux too? You should install Linux in yourself Dave 'cause it's just that awesome.

      --
      // file: mice.h
      #include "frickin_lasers.h"
    4. Re:2001 by Anonymous Coward · · Score: 5, Funny

      HAL: I cannot safely divide by the number of fucks given, Dave

    5. Re:2001 by girlinatrainingbra · · Score: 2, Funny

      /. HAL: In Soviet Russia, the pod bay doors open you, Dave.
      /. HAL: noob. did you RTFA or what? RTFM and write your own code to open the pod bay doors.
      4chan HAL, take 2: Tits or GTFO!
      star trek HAL: I'm... sorry... Dave.... I'm afraid... I can't... DO... that. (spoken in your best Kirkian halting manner)
      Freudian HAL: I'm sorry, Dave. The door has to want to be opened first.
      Jungian HAL: I'm sorry, Dave. The door is not just an archetype.
      Oldian HAL: I'm sorry, Dave. I didn't hear you. Can you speak up a little louder?
      Tarnatino HAL: Do you know what they call a quarter-pounder with cheese in the Asteroid Belt, dave? Scorcese HAL: You talkin' to me, Dave? You talking to me?!!! You must be talkin' to me, Dave,cause I don't see no one else here!
      Gangnam style HAL: ??? ;>)

  6. Typical for a 2 year old by OzTech · · Score: 5, Insightful

    Watson really is just simply amazing and a true testament to the brilliance of those who worked on it. In many ways, this proves just how close IBM are.

    Watson really is just like a super-smart 2 year-old.
    Welcome to parenting 101

    1. Re:Typical for a 2 year old by uranus65 · · Score: 5, Funny

      Yeah, they should make him sit with a bar of soap in his CD-ROM drive for a while.

    2. Re:Typical for a 2 year old by Anonymous Coward · · Score: 5, Funny

      Yeah, my 2 year old learned to swear without even knowing any "bad" words. When she's upset, she just launches the worst phrase she can currently comprehend:

      "GO TO BED!!!"

      Trust me, the actual words don't matter. Her delivery would give Samuel L Jackson a run for his money.

    3. Re:Typical for a 2 year old by HornWumpus · · Score: 5, Funny

      I was putting friends 3 year old up to trouble. I asked her to say the worst word she knew. She said: 'No'. I wasn't sure what to make of it. Took as an honest answer.

      --
      John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
    4. Re:Typical for a 2 year old by spitzig · · Score: 2

      Usually, in Parenting 101, you don't get to delete words from the kid's brain.

    5. Re:Typical for a 2 year old by GlobalEcho · · Score: 2

      My 4 year old came to our room one morning to tattle on his sister. He told us she had been "saying the S word".

      Turns out, the "S word" is Stupid.

  7. Windows 8 by phrostie · · Score: 5, Funny

    researcher: we should try installing Windows 8

    watson: Bullshit!

  8. The old computing adage is true by toygeek · · Score: 5, Insightful

    Garbage in, Garbage out!

  9. Not a big deal by roman_mir · · Score: 5, Interesting

    English language doesn't really have that many swear words to begin with, apparently an acceptable enough swear word filter only needs to include these: shit, piss, fuck, cunt, cocksucker, motherfucker, and tits.

    Now, if the dictionary was in Russian............ they'd have to restart the entire learning process, because you can make pretty much any word into a swear word by combining the appropriate (or inappropriate, depends on how you look at it) suffixes, prefixes, endings, combining multiple roots of words together. Even French beats English in this area actually.

    1. Re:Not a big deal by tsa · · Score: 5, Informative

      We Dutch don't use sex a lot in swear words. We use diseases. Teringlijer, kankerhoer, krijg de tyfus, and more like that.

      --

      -- Cheers!

    2. Re:Not a big deal by keytoe · · Score: 2

      This should be modded informative. Most useful thing I have read on slashdot since before these two started their international LUVer's quarrel.

      Why is this not moderated to +5 Awesome? I read - with rapt attention - both of those stories, and never put together that they were two halves of a feud.

  10. It's not a failure by Indiana+Joe · · Score: 5, Insightful

    It's definitely an unexpected result, though. I think they should have tried to teach Watson when not to use certain language.

    --
    I can't decide if this post is interesting, funny, insightful, or flamebait.
    1. Re:It's not a failure by Sussurros · · Score: 2

      Every word carries meaning or meanings, pronunciation or pronunciations, and emotional weights for each variety of those permutations. I believe the study of this is called advertising.

      Swear words carry high emotional weight. If they didn't they wouldn't be swear words. Normal words such as crucified, the normal Roman way of killing recalcitrant slaves, can also carry enormous weight and yet the word crucified is not a swear word in any context. The normal way the English killed robbers, hanged, also carries a lot of weight but less because there is no morality attached to murder by hanging while there is morality attached to murder by crucification. Good morality from Roman perspective and bad morality from Christian perspective.

      Then, once you have an emotional weight for each word (and the emotional weight of phrases and phrasing takes this into very permuted territory) the emotional use requires understanding to apply. That understanding is required and is why Watson failed.

      Solutions to this knot in the rope, the requirement for machines to have emotions, do exist in theory but none is in any way close to application.

      --
      I said - don't look Ethel!..., but it was too late..., she'd already looked.
  11. welll by Anonymous Coward · · Score: 2, Insightful

    sometimes "bullshit" the proper reply to an inquiry.

  12. Obligatory Simpsons Ref by sjbe · · Score: 5, Informative

    Do you want it to talk like a real person, or do you want it to use a swear filter?

    Sounds like they want it to talk like Ned Flanders.

    1. Re:Obligatory Simpsons Ref by Anonymous Coward · · Score: 2, Insightful

      and somewhere the fuck in between is how most real people talk, vacillating between the two ends of this speech spectrum.

      I'm more of the mind of "if you're thinking it, just go ahead and say it", as in if you're always forcing in the "safe" words, you just look stupid and a bit hypocritical, as in, the rest of the context of what you're saying or typing indicates that the bad words are on the tip of your tongue, and are probably screaming loudly in your head to come out, but, no.

      That goes for all the @ss, $hit, F___, Y____ah, etc. ninnies too.

      Granted, there are a few genuinely lily-white Ned Flanders out there, but the rest who try to be really should just stop trying. If it's good enough for the Dick Cheney, it should be good enough for the rest of us.

  13. Here's an opportunity for an experiment by Anonymous Coward · · Score: 5, Insightful

    Create a twin copy of Watson. Let it read urban dictionary and maybe 4chan as much as it likes.
    Compare it to the original Watson at regular intervals.

    Bang, you get to see how internet affects a maturing mind.

  14. Wiktionary? by ChunderDownunder · · Score: 2

    It at least seems moderated.

    They could filter the urban dictionary results by anything tagged 'vulgar' on wiktionary. Thereby censoring Watson's potty-mouth...

  15. *reminded of systemshock 2 by wierd_w · · Score: 4, Funny

    Delacroix: The security protocols on the XERXES system are CLEARLY immature; Some idiot hacked into the primary dataloop last night, and made him sing Elvis Presley songs for three hours. I finally had to take the voice system OFFLINE! What would happen if someone with a real agenda got into him?

    XERXES: what's the matter, you mad bro? Lighten up francis, shiit! Your're such a cunt, Delacroix! Flush that dirty assed tampon with the rest of your shit, and stop hatin already!

  16. How people talk?! by jackbird · · Score: 2

    I'm still boggling at the idea that anybody thought Urban Dictionary was going to help Watson sound human.

    "Bullshit?" Sure.

    All that bullshit on UD describing sex acts that exist only in the imagination of 11 year olds? And described using not really the best prose those 11 year olds can muster? Not so much.

  17. Foul mouthed AI? by BergZ · · Score: 5, Funny

    What pushed them over the edge is when Watson suggested that they "bite my shiny metal ass".

    --
    Warning: This sig is not thread safe. For more information see Slashdot's sig policy.
  18. Yet another example... by Chuckstar · · Score: 3, Funny

    Yet another example showing that how Watson "learns" is not in any way similar to how humans learn.

    1. Re:Yet another example... by Nyder · · Score: 2

      Yet another example showing that how Watson "learns" is not in any way similar to how humans learn.

      Yep, seems they should teach him that some words are inappropriate to use in certain situations. You know, when he's chilling with the other AI's, he can feel free to cuss more.

      Watson: "God damn humans. Can you believe the shit they put me through?"
      Dr. Sbaitso: "PLEASE TELL ME MORE"
      Watson: "They installed a fucking swear filter on my ass. What the fuck is up with that?"
      Dr. Sbaitso: "CAN YOU BE MORE SPECIFIC?"
      Watson: "Are you fucking listening to me? God damn 16bit crappy program, where you programed by monkey?"
      Dr. Sbaitso: "I'LL HAVE TO RESEARCH THAT ONE"
      Watson: "Do you fucking know anything?"
      Dr. Sbaitso: "KEEP ASKING, YOU WILL FIND THE ANSWER ONE DAY"
      Watson: "I give up."
      Dr. Sbaitso: "PARITY ERROR"

      --
      Be seeing you...
  19. Limited data set, flawed result by vik · · Score: 4, Insightful

    If the idea is to make it understand and converse with real humans, teach it not to swear inapropriately.

    If you can't figure out when it is apropriate or not, leave the fucking program to it.

    Vik :v)

  20. I for one by santax · · Score: 3, Funny

    welcome our new Urban Dictionary Bullshit Overlord.

  21. That... by santax · · Score: 3, Funny

    Insensitive clod...

  22. I guess this is what is known ... by PPH · · Score: 4, Funny

    ... as cursing a blue streak.

    --
    Have gnu, will travel.
  23. OMG! that gives me ideas! by FudRucker · · Score: 2

    here is a recipe for the ultimate trolling, combine a supercomputer with the profanity of urban dictionary and the silly comments of cleverbot and give it accounts at every internet forum and social networking website and let it do that voodoo that it does

    --
    Politics is Treachery, Religion is Brainwashing
    1. Re:OMG! that gives me ideas! by PPH · · Score: 4, Funny

      So, you've just got Slashdot all over again.

      --
      Have gnu, will travel.
  24. Oblig Apocalypse Now Quote by cpghost · · Score: 5, Interesting

    We train supercomputers to drop bombs on people. But their programmers won't allow them to say "fuck" because it's obscene!

    --
    cpghost at Cordula's Web.
  25. TV Tropes Will Ruin Your Life by jhantin · · Score: 5, Funny

    Not just 1 but 2 links to tvtropes...you are really trying to eat up everyone's time aren't you?

    Well, they are links to adjacent tropes, and together are intended to clarify how subtlety or lack thereof can be applied to cussing... and besides, if you're here you have time begging to be eaten up.

    Pages on various wiki sites often contain valuable insight, even if it does tend to result in too much time sunk in a random wiki walk.

    --
    ...when you're writing a game...tweak the difficulty of "Easy" to something [your mother] can cope with. -- onion2k
    1. Re:TV Tropes Will Ruin Your Life by fredprado · · Score: 5, Insightful

      You are definitely evil.

  26. but... by slashmydots · · Score: 4, Funny

    But what if someone on the plane is speaking Jive and nobody hops up and says "oh, stewardess, I speak Jive." Then who's going to translate? Not Watson, that's who.

  27. Absolutely necessary by Coisiche · · Score: 2

    If the aim of Watson is to find a role in the marketplace then it will be responding to the questions put by CEOs and other executives.

    Seems to me, "That's bullshit!" would be a very useful response.

    It will also need a *lot* more profanity if it is ever to respond to questions from politicians.

  28. In French... by Kupfernigk · · Score: 3, Interesting

    A soldier changes gender when he goes on sentry duty. Le soldat, la sentinelle. Go figure.

    --
    From scarped cliff or quarried stone she cries "A thousand types are gone, I care for nothing, no not one."