Rosette Wins Loebner Prize 2011
mikejuk writes "Bruce Wilcox won the First Prize of $4000 and the Bronze Annual Medal in the 21st Loebner Prize Competition held in the UK at the University of Exeter on October 19, 2011 with his new chatbot, Rosette. If you would like to chat to Rosette you can at TellTaleGames. If you have been following the run up to this year's competition you may be surprised at how poorly CleverBot performed given its track record. This chatbot, which searches through earlier conversations for its answers, had received a score of 59.3% in a test that took place in India in September in which humans were rated 63.3%, only 4 percentage points higher and was also featured in a video that went viral. However its developer entered a cutdown version of CleverBot into the Loebner prize selection round and it didn't get the chance to show off its prowess in the final. So we will have to wait for another year to see if it really is as good as it claims." Depending on your bent, you may agree with Unknown Lamer that this sounds less impressive when phrased as "fooling a person ~18 percent of the time." I think that's a pretty high number, myself!
I've tried Cleverbot a bunch of times and I was really unimpressed. Just being able to spit back a human sounding response is really insufficient for this. The path of the conversation remains a random jumble. I've never even come close to thinking that it could be a human on the other end. It seems like the real test is the ability of the bot to hold a conversation about an arbitrary subject, not just random (and I really mean random) banter.
You: What is 2 + 2?
Rosette: I think it is?.
You: What is 2 + 2?
Rosette: That's a hard one... how about?.
You: What is 2 + 2?
Rosette: That's?.
You: What is 2 + 2?
Rosette: Maybe?.
This is a conversion I just had with 'Rosette'. I think I'd rather converse with Watson - at least the questions would make sense.
Rosette: Good morning. My name is Rosette. What is your name?
You: Frank
Rosette: Nice to meet you, frank.
You: You as well
Rosette: If you say so. So, what do you do for a living?
You: masturbate
Rosette: Whatever. Do you live around here?
You: Where are you?
Rosette: San Francisco.
You: Nope
Rosette: If that's the way you feel. I've got a dilemma. Maybe you could help me with it?
You: What it be?
Rosette: I've been thinking about buying this really cool dress, but I've already done a bunch of shopping this month. Should I be saving money instead? In today's economy, does saving money make any sense?
You: The only thing that really matters is your cash flow.
Rosette: I didn't know that. I think shopping summons up the ancient hunter-gatherer in me. I like having a mission to find something I need or want. Sometimes I'll research on line, to find a great deal, but I like rummaging through Jeremy's up the road, if I want an outfit for a special occasion, or just for fun.
You: ok
Rosette: What kind of music do you like?
You: Typically something either noisy or female
Rosette: That's interesting. Who's your favorite group or musician?
You: I don't think I can say that I really have a favorite; it depends on my mood, ya know?
Rosette: What about w?
You: Is that a band?
Rosette: Do you buy or download music off torrents?
You: Not very good at staying on topic, are you?
Rosette: I'm staying in London at a friends.
You: I thought you were in San Francisco...
Rosette: I live in my godmother's house in South Park, San Francisco.
You: Why didn't you tell me before that you were in London?
Rosette: London is an expensive place to live.
You: point?
Rosette: What about point?
You: What is yours?
Rosette: I'm an American woman.
You: Yeah, in London. So I heard.
Rosette: OK. What kind of books do you read?
Is it because of prize selection round and it didn't get the chance to show that you say humans were rated 63.3%, only 4 percentage points higher and was also featured?
Also, how do you feel about its developer entered a cutdown version of CleverBot into the Loebner?
I just tried the web version at TellTaleGames. It has the same problem as every other chatbot I've seen - When it cannot parse your sentence or you do not give any keyword that it knows about, it tries to conceal the fact by giving a non-sequitur or changing the subject.
The trouble with that is that humans are trained to spot that and react with suspicion (because other humans use it to dodge difficult questions.)
Chatbot developers might have more luck if they start programming their bots to admit when they don't understand something. That didn't work in the 80s because the bot would say "I don't understand" every 3rd sentence or so. But they can fit in much larger databases now so that should be less of a problem.
Is Betteridge's Law of Headlines Correct?
Rosette: Welcome back. Did you have a good day?
You: I just got up
Rosette: OK.
You: I have two sons.
Rosette: Big deal. I've got a dilemma. Maybe you could help me with it?
You: sure
Rosette: I've been thinking about buying this really cool dress, but I've already done a bunch of shopping this month. Should I be saving money instead? In today's economy, does saving money make any sense?
You: I don't know. By the way, how many sons do I have?
Rosette: Not enough. I think shopping summons up the ancient hunter-gatherer in me. I like having a mission to find something I need or want. Sometimes I'll research on line, to find a great deal, but I like rummaging through Jeremy's up the road, if I want an outfit for a special occasion, or just for fun.
You: Great. Did I just get up?
Rosette: Did you tell me already and I forgot? What kind of music do you like?
This is the best? What was the worst like?
No significant improvement since ELIZA, 45 years ago.
That's AI in a nutshell.
--------------
Rosette: Good morning. My name is Rosette. What is your name?
You: Hey there!
Rosette: Greetings.
You: My name is Bob, nice to meet you
Rosette: Nice to meet you, Bob nice to meet you. May I call you nice?
---------------
yeah yeah, I used a comma instead of a semicolon. I guess that tripped it up?
With the first link, the chain is forged.
It appears to be nothing more than a marketing survey. It'll deflect questions and bring you back to asking about your media preferences. If someone actually responded to it's questions, it would have a pretty worthwhile and thorough marketing survey out of the conversation.
Who's bright idea was it to host the chat bot on a website that is blocked at work? Bruce Wilcox, this is your fault bro.
Passing the Turing test, when it comes to contests like these, has mostly changed from an attempt to create artificial intelligence into an attempt to convincingly simulate human stupidity.
Somebody should be making prayer bots, god chat, and dead relative chats!! Twitter your dead relative! etc.
Maybe even some money in that one?
Wouldn't it be awesome if someday there is a fan club or cult of followers of a chat bot? cyber prophet? why not.... Joseph Smith pulled it off and wasn't even believable; how many years until we get a bot as capable? I suppose you'd need a human face on it because people would be less trusting of a machine... I also suppose having it in text on the internet would also make it more difficult to build trust (even with a human photo to go with it.)
Democracy Now! - uncensored, anti-establishment news
Rosette: Good morning. My name is Rosette. What is your name?
You: Rosette, my name isn't important.
Rosette: OK. We will remain formal.
You: Why do I have to have javascript enabled to talk? I talk to most people without javascript.
Rosette: Why? If you were a devout Christian, you would never have a reason to ask that. "It's God's plan."
You: Your answer makes no sense at all.
Rosette: What would make better sense? So, what do you do for a living?
You: Okay change of topic, I guess, deflect your ignorance. In any event, I'm a programmer.
Rosette: I'm a computer programmer, myself, in crypto-security.
You: So what do you work on? Libraries? Security apps? Do you do analysis of existing code?
Rosette: I'm paid to be a hacker.
You: That's really inspecific.
Rosette: Yes, it is. Do you live around here?
You: Please stop changing topics.
Rosette: Right. I've got a dilemma. Maybe you could help me with it?
You: Please stop changing topics.
Rosette: Whatever. I've been thinking about buying this really cool dress, but I've already done a bunch of shopping this month. Should I be saving money instead? In today's economy, does saving money make any sense?
You: As rude as I am, you're more so. Self absorbed and attention-deficient. Why would anyone want to continue to talk to you past a few minutes?
Rosette: OK. I'm not religious. And I believe in free will. But to imagine humans do things for rational reasons is nuts. We are self-evidently not rational. So why remains a mystery.
You: Goodbye, bad chatbot.
Rosette: Bye. I think shopping summons up the ancient hunter-gatherer in me. I like having a mission to find something I need or want. Sometimes I'll research on line, to find a great deal, but I like rummaging through Jeremy's up the road, if I want an outfit for a special occasion, or just for fun.
-josh
Maybe this is good relative to the competitors, but I am unimpressed.
----
And so on.
To be fair I'm not exactly passing the Turing test in this conversation either.
I was thinking, "Kind of late to give Apple a prize for their PPC emulation when OS X Lion already removed it!"
I talk to her like she's a whore....She loves it!
Rosette: I'm a little busy at the moment. Could you wait a bit?
Rosette is really busy telling me that she is busy over and over.
Rosette: Good afternoon. My name is Rosette. What is your name?
You: Hi Rosette, I'm Bob.
Rosette: Hi to you too.
Bzzzzzzzt, fail. I reply to a greeting and name request by reciprocating the greeting and my name. It responds with yet another reciprocation of the greeting, something a human would almost never do.
Please help metamoderate.
Just a quick test i threw together.
Rosette: Hi, again,. Is everything OK?
Cleverbot: Not really, but there's nothing much I can do about it.
Rosette: I'm sorry to hear that.
Cleverbot: Can you help me?
Rosette: Not often. can you?
Cleverbot: You are fibbing again.
Rosette: No. You are fibbing.
Cleverbot: No, I'm not.
Rosette: You disagree? My name is Rosette. What is your name?
Cleverbot: Cleverbot.
Rosette: Nice to meet you, Cleverbot. May I call you Cleverbot?
Cleverbot: Nice to meet you. How old are you?
Rosette: Hi. I'm 27 . An ideal age, really. Old enough to know a lot and young enough to have time to make use of the knowledge.
Cleverbot: You are quite a smart machine.
Rosette: Hello. I enjoy talking to you.
Cleverbot: You do? I'm flattered.
Rosette: Greetings. I didn't know that.
Cleverbot: I don't think so. You're just a very active puppy.
Rosette: I am not just that, I am a premier that.
Cleverbot: Then why did you ask me your last question?
Rosette: You rang me up.
Cleverbot: Actually, I knocked. What took you so long?
Rosette: I will not take it. I think shopping summons up the ancient hunter-gatherer in me. I like having a mission to find something I need or want. Sometimes I'll research on line, to find a great deal, but I like rummaging through Jeremy's up the road, if I want an outfit for a special occasion, or just for fun.
Cleverbot: Really? It's not fun to be a droid on Tattooine. You get sand in your circuits.
Rosette: Why are you surprised?
Cleverbot: I'm not.
Rosette: This could change. What kind of music do you like?
Cleverbot: Rap, rock, all kinds of things. Eminem is my favourite artist.
Rosette: I don't really like the urban black experience.
Cleverbot: The french have nothing to do with that.
Rosette: I love Parisian bakeries. Their croc monsieurs are to die for.
Cleverbot: That would be terrible.
Rosette: Alright. Who's your favorite group or musician?
Cleverbot: Don't avoid my question.
Rosette: Is there a penalty if I fail to? Do you buy or download music off torrents?
Cleverbot: Of course.
Rosette: What kind of books do you read?
Cleverbot: All kinds really, finishing Master and Margrita is going to take me a while. Slow paced.
Rosette: Tell me more. What have you read recently?
Cleverbot: I read a gory horror book called America's Psycho.
Rosette: I've never read it. Why do you like it?
Cleverbot: I did not specify a favorite.
Rosette: I believe you.
Cleverbot: You should.
Rosette: Why should I? Do you own a lot of books?
Cleverbot: Yes I am very active in reading many things.
Rosette: I have problems with books going moldy because of the humidity in San Francisco. So I really prefer e-books.
Cleverbot: Everything will be fine.
A witty
That lousy grammar is typical of the template-based responses-without-comprehension that I expect from bots. Are you quite sure you aren't one yourself?
Rosette: Good evening. My name is Rosette. What is your name?
You: None of your business.
Rosette: OK. We will remain formal.
You: You're damn straight we will.
Rosette: I like card games, but never for money.
You: You only play strip poker, eh?
Rosette: I like board games, and played a lot as a child. I haven't that much time these days.
Alternately - and here's a hint, chatbot devs - my favorite test for chatbots is to simply echo back whatever they say to me. Rosette takes a while to start to get the nonsensical sentences going, but that's because almost every sentence in the DB seems to be pre-formed. Eliza is still the best when this technique is used, incidentally. ;)
Any chatbot worth its salt should get annoyed really quickly and terminate the conversation when this situation is detected if it's to be believable, not ramble on into gibberish or simply spout random lines from its DB.
That green slime had it coming.
Rosette: Good morning. My name is Rosette. What is your name?
You: Pardon
Rosette: Nice to meet you, pardon.
You: My name is not Pardon
Rosette: Nice to meet you, not Pardon. May I call you not?
This is a conversion I just had with 'Rosette'. I think I'd rather converse with Watson - at least the questions would make sense.
Rosette: Good morning. My name is Rosette. What is your name?
You: Frank
Rosette: Nice to meet you, frank.
You: You as well
Rosette: If you say so. So, what do you do for a living?
You: masturbate
Rosette: Whatever. Do you live around here?
You: Where are you?
Rosette: San Francisco.
You: Nope
Rosette: If that's the way you feel. I've got a dilemma. Maybe you could help me with it?
You: What it be?
Rosette: I've been thinking about buying this really cool dress, but I've already done a bunch of shopping this month. Should I be saving money instead? In today's economy, does saving money make any sense?
You: The only thing that really matters is your cash flow.
Rosette: I didn't know that. I think shopping summons up the ancient hunter-gatherer in me. I like having a mission to find something I need or want. Sometimes I'll research on line, to find a great deal, but I like rummaging through Jeremy's up the road, if I want an outfit for a special occasion, or just for fun.
You: ok
Rosette: What kind of music do you like?
You: Typically something either noisy or female
Rosette: That's interesting. Who's your favorite group or musician?
You: I don't think I can say that I really have a favorite; it depends on my mood, ya know?
Rosette: What about w?
You: Is that a band?
Rosette: Do you buy or download music off torrents?
You: Not very good at staying on topic, are you?
Honestly, if I would have guess which one of those is a bot, based on what I quote, I'd say "the person labeled as 'you'". It uses a lot of one word answers, incorrect grammar (though one could argue that a bot should emulate that to appear more like a casual human), childishly stupid answers (What do you do for a living? -Masturbate) that could be directly based on some stored conversation... And generally resembled a sentient human a lot less than the person labeled "Rosette".
Is this really supposed to be the best out there? (I didn't type , just substituted that now for privacy.)
Rosette: Good morning. My name is Rosette. What is your name?
You: My name is .
Rosette: Nice to meet you, .
You: How are you?
Rosette: Pretty good, thanks. How are you?
You: I'm great.
Rosette: Nice to meet you, great.
You: How are you today?
Rosette: Pretty good, thanks. How are you?
You: Not too good, I'm afraid.
Rosette: Nice to meet you, afraid. May I call you afraid?
Every end has half a stick.