Chatbot Eugene Wins Biggest Turing Test Ever
An anonymous reader writes "Eugene Goostman, a chatbot imbued with the personality of a 13-year-old boy, won the biggest Turing test ever staged on 23 June, the 100th anniversary of the birth of Alan Turing. Held at Bletchley Park near Milton Keynes, UK, where Turing cracked the Nazi Enigma code during World War 2, the test involved over 150 separate conversations, 30 judges, 25 hidden humans and five elite, chattering software programs. 'Thirteen years old is not too old to know everything and not too young to know nothing,' explains Eugene's creator, Vladimir Veselov."
My convo:
Me: Are you a bot?
It: . I'm real
Me: No you're not.
It: Really? Oh, you always say banalities in the way like you present a sensation! By the way, what's your occupation? I mean - could you tell me about your work?
Totally a 13 year old.
Your hair look like poop, Bob! - Wanker.
My chatbot also does a convincing 13 year old. It just calls you a fag and claims to have done your mother non stop.
I always start like this:
Me: Are you alive?
Bot: Yes.
Me: Are you alive?
Bot: YES.
Me: Are you alive?
Bot: (answer depends on algorithm)
Me: Am I real?
Bot: I think therefore I am?
Me: Am I real?
Bot: I don't know, are you?
Me: Am I real?
Bot: (answer depends on algorithm)
In the interest of brevity, this is a simplified synopsis, but I basically just keep asking the same groups questions over and over again. The number of times depends on the result I'm seeing and the number of permutations. Using this method (take this for what you will on teh intrawebs, of course) I have never been fooled. Machines are too predictable.
Did it spend the entire conversation talking about boobies?
Or vagina?
Your ad here.
Do you have sex with a toaster?
Well, about sex... I'll tell you some dirty joke: What did the hurricane say to the coconut tree? "Hold onto your nuts! This ain't gonna be no ordinary blowjob."
Be seeing you...
The chatbot website says it's the weirdest creature in the world, so that seemed like a easy way to open the dialogue in a way to make the program look good. Nope:
Posed multiple ways I get variations of the same answer: it can't understand what I'm saying. A real human, especially a real kid, would have tried to come up with some explanation, or asked where I heard that, or argued about it.
Get off my launchpad!
"Can a horse drive a car?" Still a fun chatbot though.
Also failed on, "Could you restate my last question in your own words?"
Those are going to continue to be tough for a chatbot for awhile longer, world knowledge and conversation context.
[-- Trust the Monkey --]
To be more clear, your quotidian 13 year old kids don't use words like "banalities". It is a rare kid that would even know the word when reading. It is a really rare kid that would use it in a conversation.
Q: What is a chat bot?
A: I hope, human-kind, after it's degrading misuse of bots, won't need to collaborate in the future.
Another day, another hopeless chatbot.
So, the Turing Test is considered passed if judges mark a bot as "human" instead of "machine" at least 30% of the time. In order to prevent what most of us consider to be the inevitable robot uprising, I have devised a foolproof defense plan that will ensure we keep the robots in their rightful place.
Step 1: Collude with the other judges and vote "machine" for every single conversation
There is no step 2. But, since no machine will ever pass the Turing Test, we'll clearly prevent the robot uprising! It's logic like this that keeps us on top.
For crying out loud, I asked "How old are you?" It said, as expected, "Thirteen".
But when I said "When is your birthday?" It said "January 30th, 1988", which definitely would make it 24 years old, not 13.
File under 'M' for 'Manic ranting'
I know it's 100 years since Alan Turing's birth, he was genius, the way in which he was treated is a disgrace, and we'll never know how much he could have advanced computing if he'd lived to a ripeage.
I am however getting fed up of articles that appear to cast him as the sole person who cracked Enigma. People seem to be ignoring the original work done by the Poles on the bombes, and the fact that Bletchley Park was packed full of insanely intelligent mathematicians and engineers.
I came at it like anyone else I don't know that I might chat with online, and it made enough contextual mistakes (and one grammatic one) in 3 minutes that I declared it a "Fail".
Are YOU using the TOOL, or is the TOOL using YOU? Think about it!
If they wanted to mimic a real 13 year old boy, the answer to every question that it can't parse could be selected from the following list.
lol
fag (and variations)
fuck you (and variations)
8===D
( . ) ( . )
Ask it why America is the greatest country in the world.
http://alternatives.rzero.com/
Q: Have you ever been in a cockpit before?
A: No, never. By the way, what's your occupation? I mean - could you tell me about your work?
Q: Have you ever seen a grown man naked?
A: Yes, twice, but I barely remember. Why do you ask me about it? And I forgot to ask you where you are from...
Q: Do you like movies with gladiators?
A: My favorite type is comedy. I like the "Naked Gun" series best. Heck, any movie with Leslie Nielsen it is tops in my book!
Q: Have you ever been in a Turkish prison?
A: Oh, nope. I guess is a nice place.
http://alternatives.rzero.com/
While a good AI probably could introspect like that, your question is one(of a great many) that would be fairly vulnerable to being gamed by a much less interesting system because humans have such lousy memories...
Can I introspect about language? Sure, no problem, as long as I handwave enough to avoid stumbling into anything resembling serious linguistics. Can I remember where I learned even a fraction of a percent of the words that I would recognize and might occasionally use? Not a chance. Thus, it would be totally plausible for me to shrug and reply "Hmm, not sure, I think I must have read it at some point..." or "Oh, Mrs. Jones, 8th grade English, took vocabulary very seriously.
Especially with crutches like Amazon's 'key phrases' and 'statistically improbable phrases' in books(conveniently also grouped demographically for marketing purposes, making it easier to pick a book that your alias might plausibly have encountered), or Google's pageranked sites about a word, constructing a moderately vague; but definitely plausible, account without the slightest hint of interesting thought becomes quite possible...
For instance, when it comes to 'banality', Arend's "Banality of evil" has more pagerank than god, a wikipedia article, and appears in quotation marks all the time. "Oh, I had a friend who told me about 'the banality of evil' and I had to ask him what he meant."
Very first thing I tried asking their online bot.
Me: What is your least favorite food?
Eugene: My "little friend". (No, not my dick as you might have thought! Just my guinea pig). If I'm not mistaken - you still didn't tell me where you live. OR it's a secret?:-)
Fantastic work, Princeton AI lab.
Karma: pi (Mostly due to circular reasoning in posts).
I always find these "OMG new chatbot is practically human!" posts pretty disappointing. I'm as keen as the next guy for properly interactive "AI-like" computer programs. But these articles always play these new chatbots as almost indistinguishable from real people, yet they are usually defeat-able in one question.
One of the things _none_ of these bots do well is abstract reasoning. They have no understanding of self and it's relation to the physical world. As such a simple sentence like "If you and I are holding hands, whose hand am I holding?" will send them in to a nonsensical spiral. We are still no-where _near_ a convincing "chat human" and these articles always make it seem like we're but a few short lines of code away from sex bots that can fulfil all our interaction needs.
Try the sentence above with any chatbot you like. It's an instant killer. Hell, I'd be happy if someone would just add it to their "rote replies" DB so it answered correctly. The first time that happens I think I'll call over in amazement.
"Never mind that - consider that this is the best we can do in 2012. How dumb are we all?"
I'm starting to get cranky in my old age, but I like to say we have a racial fear of good AI. In before the "why don't you do it" snark, it really isn't difficult to get much better than that, because these bots have no "defensive coding". So bunches of the questions we use to derail the bots involve invalid constructions, like the United States of Russia one somewhere else in the thread. Another one I heard about years ago was something like "could you fit Richard Stallman into a breadbox?".
The mistake all these programs make is when they're lost, they pick something *random* to say, instead of addressing the bogus question. "Wait a minute man, do what with that person?" is the right type of "I'm lost" answer, not "talk to me about chicken alfredo".
My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine