Jeopardy-Playing Supercomputer Beats Humans
An anonymous reader writes "Ok, this was just a practice round. But in a short demonstration today IBM's Jeopardy-playing supercomputer, a whiz by the name of Watson, thoroughly bested two talented human contestants. IBM has been working on this artificial intelligence project for years to prove that a computer can be programmed to understand conversational speech and wordplay. In today's demo, Watson seems to have proved the point: it started out on a roll in the category 'Chicks Dig Me,' about women and archaeology. The real man versus machine face-off (in which the same contestants compete for a $1 million prize) will be taped tomorrow, and aired in February."
"What is the mission is too important for me to allow you to jeopardize it?"
There's no -1 for "I don't get it."
They will give the AI a heavy Indian accent, because it's what callers expect.
Call center employees aren't allowed to be smart. They have scripts that they must follow. They're reduced to a very simple algorithm, executed by human beings only because there are still people who prefer talking to a other people over interacting with a machine, and because speech recognition software is still not ready to deal with what some people call speech.
Best quote from the article:
Truly. Although it sounds threatening to some, the practical applications of the natural language parsing technology will ultimately benefit everyone.
Until, that is, you dial your bank's customer service number from a noisy restaurant, and try to talk to Watson to ask him why your Visa was denied.
(Rutter's quote was a nifty Skynet allusion, but its syntax was mangled by the reporter/editor, so it comes in second best.)
I can see the fnords!
So the AI triumphed, and they're calling it a huge success? I wonder how the programmers feel about this. Pretty satisfied, I'd imagine.
I worked Comcast call center for a while in '09. No scripts at all, when I started work and asked for something to follow, to figure out what I was expected to say, I was told with a smile "we don't do any scripts here, good luck!".
The only guideline I had was "Get their name and phone number, don't trust the system to give you an accurate phone number."
So my neighbor works at the IBM facility where this is taking place, but in a completely unrelated function(it's a huge complex with a lot of people). He said that everyone is taking a forced day off on Friday when they will be taping the actual show. There's only going to be a small amount of the very top IBM brass there (supposedly even the head of this facility won't be allowed in). And that this is a HUGE secrecy issue (I'm guessing so that the results aren't leaked before the broadcast date).
My neighbor works with semiconductors and so works with a lot of dangerous chemicals and stuff. According to him, they've all been told to make sure that all their hazardous materials have been safely stored, and that (I have trouble believing this) even the IBM emergency response/hazmat teams have been told that they aren't allowed onsite and not to respond to any alarms that may be issued. That's a fairly dangerous decision if true, I'm doubtful but my neighbor stands by his statement.
Anyhoo, this is a pretty big deal apparently. More so from the Jeapordy people's end I'd guess since I don't think IBM has anything related to this project that they'd be that paranoid about keeping secret.
-"Those who fought today will die tommorow."-
In the article, they mention that the computer gets the question as text. Does anyone know exactly when the computer receives the question? Does it receive when the human host starts talking or when the human host completes the question? If it is when the host starts speaking, the computer is getting at least several second head-start on the humans.
maybe you should RTFA.
the unique challenges it poses to its contestants: the breadth of topics; the puns, metaphors, and slang in the questions; the speed it takes to buzz and answer.
Speech processing that can deal with the context heavy language of Jeopardy is a pretty big test and I think means we're just a little bit closer to a general purpose natural language speech recognition system.
“Common sense is not so common.” — Voltaire
The ability to handle Jeopardy's style of word play is very impressive. I have to wonder if Watson can handle it in all the varieties that is is used on the show and whether the categories are cherry picked to match its abilities. Ideally the writers won't know that their answers are going to be used for the big game and the categories will be picked at random from a pool (minus audio and video clues).
I am becoming gerund, destroyer of verbs.
Just buy some of those "how to win the lottery" books, and program in those rules. You should be able to win every single one. Once you determine the winnig number, make sure you buy hundreds of winning tickets! Mortgage your house so you can buy as many as possible!
Who would win this election: Andrew Weiner vs Andrew Weiner's weiner.
Really?
Quiz shows are designed with a targeted IQ in mind.
The people who write and select the questions know the percentages of how many people in their player population will know the answer.
Winning the game is dependent on the random distribution of the selected set of questions falling within the percentage of things you know.
Which means that it's not just as skill or talent game. You also have to be asked the right questions, and since you don't control that, it's luck.
(Shitty) video at http://www.zdnet.com/blog/btl/ibms-watson-wins-jeopardy-practice-round-can-humans-hang/43601?tag=content;selector-blogs
09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
Actually, it doesn't do any speech processing - it does text processing - a HUGE difference. This is a shortcoming in my opinion. It should have to process Alex's voice, not clear text, though I guess it could do OCR on the screen instead. I bet they didn't have any Audio or Video Daily Doubles. In the show, audio and video clues are not uncommon.
Watson only buzzes in when he is confident that he knows the answer, which is apparently about 50% of the time. Of the 50% that he does buzz in, he answers correctly 80-90% of the time. If how the measure the confidence is accurate (and doesn't produce a lot of false negatives) it's likely that Watson would end the game in the red if he just buzzed in every time before he was sure of the answer. And according to the article, Watson has to physically push the button on the buzzer to buzz in. That probably doesn't delay him by much, but you're already talking about man vs machine when it comes to answering the questions, you might as well say that the humans have an unfair advantage because they speak English fluently.
Except that 1) Before And After is also a category on Jeopardy and 2) it was a category in run-throughs for Watson:
In a test tourney, Watson hit the bullseye on a question about clothing a young girl might wear on an operatic ship. The answer, pinafore, is also found in the title of the Gilbert & Sullivan opera H.M.S. Pinafore. And the computer was also successful with a before-and-after Jeopardy question about a candy bar and a Supreme Court justice, Baby Ruth Bader-Ginsberg. But earlier in its career, when asked, "What does a grasshopper eat?", it responded, "Kosher."
Source: http://www.philly.com/philly/blogs/storm/Jeopardy_A_Computer_Takes_on_Ken_Jennings_and_Brad_Rutter_.html
"You're never ready, just less unprepared."
This is very exciting news and for now, forget replacement of call center peoples, this machine is composed of 2800 Power7 cores, which renders it very expensive compare to the typical call center person. But this accomplishement, is a major step in the AI field and open it to many many exciting applications in the future which is not now too far.
Achille Talon
Hop!
Engadget has a video of the match that includes the visualization of the response... In several cases it did not have an answer by the time the other contestant buzzed in and in some cases it was close to a tie.
In an interview it was stated it takes about 3 seconds on average to answer a question, which is actually kinda long when some of the humans will be able to predict the answer as soon as they have read or heard only part of the clue.
EA David Gardner -"... but the consumers have proven that actually what they want is fun."
It sounds like soon we will be able to put another "a computer will never be able to..." on the scrapheap.
"A computer will never be able to beat a human at Jeopardy!"
But tell me it's not entertaining to listen to someone else dealing with an interactive voice menu. It's a great one-sided conversation:
"Yes"
"Yes"
"Billing"
"I said BILLING!"
"Question about charge."
"Question about charge"
"Problem with charge"
"Jesus Christ. YOU CHARGED ME FOR SOMETHING I DIDN'T BUY!"
"No."
"No.
"NO NO NO NO! YOU FUCKING MACHINE! GIVE ME A HUMAN BEING!" (Followed by insane button mashing)
It's like an old Bob Newhart routine, if Bob Newhart had Tourette's Syndrome and anger management issues.
You are welcome on my lawn.
Think for a moment about all the stuff you just hand-waved away, and you will begin to appreciate the problem. Let's take your "chicks dig me" example. You correctly identified that "chick" could mean baby chickens or females (it can also mean any young bird or a small child). "Dig" has a bunch of meanings, both as a noun and a verb. "Me" you just brushed off as "no relevance", but there are two problems with that: first, how is a computer supposed to know the word has no relevance, and secondly, it is VERY relevant to the category. Because of the wordplay, "me" is not referring to a person, it is referring to a LOCATION that was EXCAVATED by a FEMALE, so your answer had better either be an archeological site or a female archeologist. And it just keeps getting harder from there.
"TELL US HOW YOU CHEATED." "Okay fine, there's a chess-playing Turk concealed inside the cabinet."
DRM: Terminator crops for your mind!
You misunderstand in two ways. First, I wasn't hypothesizing. I was actually going from an earlier article about how the machine works. Second, this is a real computer, not a magical computer. It is doing something very hard and it takes many seconds to process a question and come up with an answer. When I say it is possible for a human to flat-out beat the computer, I mean that Alex can read the question, the lights can flash to let a human buzz in, and a human can have the correct answer ready while the computer is still churning away and doesn't even have a hypothesis ready. In fact the human is faster than even that implies because typically a Jeopardy contestant, especially at the championship skill level, already has the answer long before Alex has stopped speaking and now the competition is about who can buzz in correctly; Alex is reading for the benefit of the folks at home.
This computer is state of the art, and consequently and quite expectedly, it is barely capable of playing Jeopardy, basically. Next year, of course, for the same money you could build a machine 50% better for the same price, and next year, 50% better again for the same price, and so on... but this year it's just barely able to compete.
To be fair, I get the same with people, the machines aren't really any dumber.
...
Hold music fades...
Me: OK, I have a problem with my application.
Them: I can see your application.
M: Can you see how my name is misspelled.
T: Yes.
M: Can you fix it, it's just two transposed letters
T: I'll need you to send in your original birth certificate with a copy of your credit card, with passport, drivers license or other photo ID in addition to a urine sample, proof of noble linage, papal bull, letter from your doctor and bill less then three months old with your current address on it.
M: OK, can you cancel my application.
T: I'll need you to send in your original birth certificate
M: No I want to...
T: with a copy of your credit card, with passport, drivers license
M: No.
T: or other photo
M: Stop.
T: in addition to a urine sample, proof of noble linage, papal
M: SILENCE
T:
M: I would like to cancel my application.
T: I'll need to transfer you to another department.
Cue the hold music.
Calling someone a "hater" only means you can not rationally rebut their argument.