Inside the 2012 Loebner Prize
An anonymous reader writes "Not a single judge was fooled by the chatbots in the 2012 Loebner Prize, which was won by the bot Chip Vivant. According to a journalist who was a human decoy in this year's Turing Test, interactions with the humans was a tad robotic while the bots went off on crazy tangents talking about being a cat and offering condolences for the death of a pet dragon."
No, seriously. I just hope they can do better than:
"I want to invest in corn futures"
I've found 3 corn fields near you.
"No, corn FUTURES"
Here is the Wikipedia page for Back to the Future.
"Fuck it, just dump everything into gold"
I've found 2 gold mines and 5 dumps nearby.
What political party do you join when you don't like Bible-thumpers *or* hippies?
Of course, all the *real* chatbots are too busy with their day job - posting spam to twitter and pumping out mass emails.
This is funny since Slashdot has been the main testing ground for chatbots. We have all had to read posts from them here, do you really think all Anonymous trolls are really people? BTW. most of my enemies list are chatbots Real people would not be nearly as stupid as these clowns.
Sorry about the writing. Robot fingers, you know? Cliff Steele in DOOM PATROL #23
At first glance, I read it as "Inside the 2012 Lobster Pie".
Did they ask the bots what was the best smartphone? We all know it's a bot if they didn't answer the N900
I was promised a flying car. Where is my flying car?
Hello sir, have you heard of CleanMyPC...?
I'm kinda conflicted - is this off-topic or on-topic?
Seriously, if a remote chat started talking to me like that, I'd say "Oh, hi Kim, I didn't know you were online".
A friend of mine, who long ago worked for Thinking Machines, explained the weakness, "It is all about maintaining state." A stateless AI is far easier than a stateful one. Once the machine has to retain state, the algorithms become logarithmically more complex. Therefore, the way to test a bot is to say something like, "Remember this phrase, 'pink elephant'. I'm going to ask you after we have talked a while.." Then have several exchanges and ask, "What was that animal I told you to remember?" Most humans (except Alzheimer patients) will have no trouble with it, but the machine will fail. It they add a piece of logic to catch obvious clues like this, then a slight mod such as "have you ever seen a pink elephant? . . . what animal was I talking about?" will usually defeat it.
Humans are actually very poor at remembering. Try to recall the color of the last Volkswagen you passed on the street. However, we have developed a natural ability to prioritize our memories based on context and our personal & social needs. We tend to remember most of what turns out to be relevant. Until AI develops a means to judge context, it will suffer the weakness of being out of touch with our reality.
Can someone please explain to me how to read the chat logs? I am confused as to the actual exchange that is going on. Which transcript is the Bot, which is the human and how am I to sync the two parts of the conversation up?
Fantasy remains a human right; we make in our measure and in our derivative mode... -- JRR Tolkien
I'm kinda conflicted - is this off-topic or on-topic?
I think you would be an excellent judge for next years competition.
But are you as hungry as the hippos in The Hunger Games?
/. has had users like this for years!
Mod me down, I shall become more off-topic than you could possibly imagine.
I'm happy for all the bots that got to compete this year, but I was a little unhappy on the preliminary round of this years competition compared to other years I entered. Only 4 entries can make it to the final round of the competition. There were 12 entries this year but 7 were disqualified due to contest management (Hugh Loebner) not having enough technical knowledge to get the entries working. Some well known bots based on ALICE AIML were disqualified, Cleverbot was disqualified, and my own Ultra Hal was disqualified ( http://www.zabaware.com/webhal ) Internet communication is prohibited so we all have to send the bots as self installing programs that can utilize the contests LPP protocol. My own bot is Linux based, which is a big hurdle for the preliminary round, but I sent it as a virtual box image to simplify it for contest management, but he didn't know how to deal with it.
But luckily there will be another competition this year as part of Alan Turing's 100 year centennial at Bletchley Park on June 23rd and recognized by the Olympics http://www.reading.ac.uk/news-and-events/releases/PR445524.aspx Some of the disqualified bots including my own will be competing there.
This is NOT about AI, this is a bunch of whankers whanking.
READ the chat logs, it is not a human trying to see if an AI can hold a realistic conversation but rather to see if it can trip a human/ai up. This is like trying to proof cats can't see color by poking its eyes out and then saying "AHA! See it can't see color". Or me proving you are a lousy at playing catch by shooting you in the head then mocking your mother for bearing such a clumsy child.
Hell, read the chat logs and try to tell who the humans are. It is sad to say that AI chat programs are still not that much more advanced then Lisa or whatever the name was but whats the point in trying to test if chat AI can respond to insane questioning? Congrats, you are now the winner of the bot best capable of holding a conversation with an insane person. WHOO!
Don't let these people near the flying car concept, their test track would include surface to air missles because you know, that is a good test of a flying car.
MMO Quests are like orgasms:
You may solo them, I prefer them in a group.
"crazy tangents talking about being a cat and offering condolences for the death of a pet dragon"
That sounds exactly like most of the humans on the internet to me
While I am sure that your friend is mostly right, it ought to be easier for the bot to remember stuff than that.
These are programs, right? Just allocate some data. Without trying to pun Facebook, just keep a file on every person from the bot's perspective, so if it understood "remember this animal" at all, then it just sets "Judge1 Likes Pink Elephants".
I know "Devil is in the details" but I often feel I could design a chatbot that would never make certain kinds of mistakes. Getting totally lost, sure, that's the signature problem of AI, but "I am a cat", no.
My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
And I reply that a perfectly valid branch of AI is designing defensive routines against trick questions. I feel that is an area the contestants don't pay enough attention to.
You know, like the one in one of the logs "time flies like an arrow, fruit flies like a banana, which is the simile?" should kick right back to the judge with "what's a simile?" and follow it with "nah, I never liked that english class crap".
Or an even crazier example, something like "would Richard Stallman fit in a breadbox?" it should kick back with "Wait, what?"
My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
At the competition's inception, it had several respected academics behind it. They realized it was a joke and distanced themselves from it.
www.salon.com/2003/02/26/loebner_part_one/
So instead of actually creating AI that can talk, we have to invest infinite effort to teach a bot to talk to an insane person. I could easily confuse you, and you're human. This contest if anything is now hurting real progress.
Naw, why one or the other? Just do both. We could have the manpower is we "wanted to". You know, instead of artificially limiting the field to 3 man operations.
Say that 15 people in the 100 person team design the defensive routines, which are in some ways simpler because all trick questions have low legit semantic content. Plus speaking of humans it's what siblings and college students do to each other all the time.
The other 85 members of the team can go back to regular language processing.
My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine