Slashdot Mirror


Inside the 2012 Loebner Prize

An anonymous reader writes "Not a single judge was fooled by the chatbots in the 2012 Loebner Prize, which was won by the bot Chip Vivant. According to a journalist who was a human decoy in this year's Turing Test, interactions with the humans was a tad robotic while the bots went off on crazy tangents talking about being a cat and offering condolences for the death of a pet dragon."

68 comments

  1. This is who is making our stock trades now by crazyjj · · Score: 0, Offtopic

    No, seriously. I just hope they can do better than:

    "I want to invest in corn futures"

    I've found 3 corn fields near you.

    "No, corn FUTURES"

    Here is the Wikipedia page for Back to the Future.

    "Fuck it, just dump everything into gold"

    I've found 2 gold mines and 5 dumps nearby.
     

    --
    What political party do you join when you don't like Bible-thumpers *or* hippies?
    1. Re:This is who is making our stock trades now by Anonymous Coward · · Score: 0
    2. Re:This is who is making our stock trades now by Anonymous Coward · · Score: 1

      From a programming perspective, algo trading is simple math, not AI. The challenges lie in A) Processing the ungodly amount of data necessary in a short enough time period to be relevant and B) Coming up with a formula that accurately identifies good trades. There is literally no connection to AI, you don't want the program to "pick" anything, only to quickly identify predefined situations.

    3. Re:This is who is making our stock trades now by crazyjj · · Score: 1

      From a programming perspective, algo trading is simple math, not AI.

      I don't think you appreciate just how complex trading software has become. The big players have moved WAY beyond "If stock X dips below Y price, then sell."

      --
      What political party do you join when you don't like Bible-thumpers *or* hippies?
    4. Re:This is who is making our stock trades now by Dr_Barnowl · · Score: 2

      They make financial instruments complex on purpose as well. Can't understand it? Can't tax it!

      Then your system has to model trading these things... you either end up with fractal complexity, or dangerous assumptions...

    5. Re:This is who is making our stock trades now by Anonymous Coward · · Score: 0

      Actually, I think you read too many articles written by reporters who don't understand algo trading and just sensationalize the hell out of it because it sounds complex. I work for a company that makes financial software that deals extensively with algo trading applications. It is absolutely not "If stock X dips below Y price, then sell." but it's also nothing resembling AI. First off, it's not like these things run completely unattended. Yes, they autoexecute, but actual traders are analyzing the markets and making adjustments to price differentials, feeding in lists of valid tickers the algo is allowed to trade and so on. Really all that most algos are doing is taking a list of tickers, waiting for times when those tickers are highly liquid, and then rapidly trying to penny the market. The math is fairly simple, but implementing it in a method that's fast enough to actually succeed is extremely difficult.

    6. Re:This is who is making our stock trades now by Anonymous Coward · · Score: 0

      So you're saying algorithmic trading is done by chatbots? I think I've spotted the trollbot here.

    7. Re:This is who is making our stock trades now by jeffmeden · · Score: 1

      From a programming perspective, algo trading is simple math, not AI. The challenges lie in A) Processing the ungodly amount of data necessary in a short enough time period to be relevant and B) Coming up with a formula that accurately identifies good trades. There is literally no connection to AI, you don't want the program to "pick" anything, only to quickly identify predefined situations.

      What do you exactly think is the difference between current AI and "an algorithm that quickly identifies predefined situations"? Aside from the obvious example of natural language interaction that was hilariously pointed out in the OP. AI is nothing more than an (occasionally very complex) attempt at hacking English into a variety of numbers, and then performing math on them in interesting ways (what words are statistically likely to show up near what other words, and in what order, for example.) Algorithmic trading might have more specific activity at the low levels (like monitor 1000 different asset prices for a specific pattern) but that is hardly unlike an AI, its output is just handled differently.

    8. Re:This is who is making our stock trades now by Sarten-X · · Score: 2

      I currently work at a financial company, and I agree. One of our algorithms for "buy or sell" currently relies on a few hundred data points for a few hundred securities. The algorithm boils down to "did this particular statistic recently rise or fall more than a certain amount based on its history?". Sure, the code is a little hairy in places, but not too bad with adequate commenting. There is nothing I'd consider AI, as the program doesn't attempt to learn. When the investment folks see something that should change, they tell me.

      --
      You do not have a moral or legal right to do absolutely anything you want.
    9. Re:This is who is making our stock trades now by Samantha+Wright · · Score: 2

      You and the GP keep using the word "AI". I do not think it means what either of you think it means.

      Algorithmic trading does indeed involve a great deal of classical machine learning, which is one area of focus within the very broad field of artificial intelligence. What you're ascribing to "AI" is the specific goal of artificial human intelligence, regularly granted the misnomer of general artificial intelligence. The example you gave would be of more interest to translators of unknown languages than to people trying to create an artificial brain.

      This was one of the many misconceptions about intelligence that muddied AI efforts in the twentieth century, incidentally. Parsing and producing language isn't the end-all of everything like logicians once believed; other factors, like knowledge representation, planning, and creativity also must be considered. These have (for the most part) redefined the AI landscape as it's been introduced; KR and fuzzy logic led to expert systems in the 80s (which were pretty successful in some contexts but an absolute pain to build) and classical planning led to a wide variety of different manifestations like the intelligent agents of the 90s (which weren't so intelligent), automation of flight planning, and enemy AI in real-time strategy games. Computational creativity has yet to really have its time in the sun, being mostly displaced by a vastly-improved second round of more involved linguistic analysis called natural-language processing (culminating in things like IBM's Watson, although that involved a measure of creativity as well.)

      --
      Bio questions? Ask me to start a Q&A journal. Computer analogies available for most topics!
    10. Re:This is who is making our stock trades now by TheLink · · Score: 1

      Meh, from what I see if you're one of the favoured companies when you screw up big time they'd roll back the trades for you. I bet even someone like me can make a lot of money if my trades got rolled back whenever I screwed up big. And if you screwed up really big you get a bailout and keep your bonus.

      There was also that infamous case when humans outsmarted the algo but they got prosecuted, convicted and lost their profits: http://www.ft.com/cms/s/0/f9d1a74a-d6f3-11df-aaab-00144feabdc0.html

      Why should they get prosecuted? Why should the algo users keep the money if it wins when they also get to keep the money when it loses? Sounds stupid and unfair to me.

      --
    11. Re:This is who is making our stock trades now by omitsura · · Score: 1

      IV surface modeller? Vol below, buy; vol in 90th percentile sell?

  2. I've seen things you wouldn't believe by Serious+Callers+Only · · Score: 5, Funny

    Of course, all the *real* chatbots are too busy with their day job - posting spam to twitter and pumping out mass emails.

  3. slashdot should charge for the chatbot testing. by infonography · · Score: 3, Funny

    This is funny since Slashdot has been the main testing ground for chatbots. We have all had to read posts from them here, do you really think all Anonymous trolls are really people? BTW. most of my enemies list are chatbots Real people would not be nearly as stupid as these clowns.

    --
    Sorry about the writing. Robot fingers, you know? Cliff Steele in DOOM PATROL #23
    1. Re:slashdot should charge for the chatbot testing. by Anonymous Coward · · Score: 0

      Yes, pin it on the ACs. Like there's a reason why an account user couldn't be a chatbot.

    2. Re:slashdot should charge for the chatbot testing. by Anonymous Coward · · Score: 0

      It's started; they're trying to round the ACs up into death camps. You saw him say it. We're "not even really people" now, apparently.

      First they came for... blah blah tl;dr and then there was nobody left for me!

    3. Re:slashdot should charge for the chatbot testing. by Jeng · · Score: 2

      If you are being paid to post you post what you are paid to post.

      Money can buy stupidity.

      --
      Don't know something? Look it up. Still don't know? Then ask.
    4. Re:slashdot should charge for the chatbot testing. by Coren22 · · Score: 1

      How does that make you feel?

      --
      APK likes to ask for responses to the same things over and over. Maybe he just likes the responses?
  4. I must be hungry by Lucas123 · · Score: 4, Funny

    At first glance, I read it as "Inside the 2012 Lobster Pie".

    1. Re:I must be hungry by Anne_Nonymous · · Score: 1

      Dang, now I'm jonesing some crawfish pie, and don't be stingy with the butter.

    2. Re:I must be hungry by Kadagan+AU · · Score: 1

      I read "Inside the 2012 Lobster Prize" multiple times today. I thought it was connected to "Deadliest Catch" somehow..

      --
      This space for rent, inquire within.
  5. Siri? by dutchwhizzman · · Score: 5, Funny

    Did they ask the bots what was the best smartphone? We all know it's a bot if they didn't answer the N900

    --
    I was promised a flying car. Where is my flying car?
  6. Closer to home by Theophany · · Score: 3, Funny

    Hello sir, have you heard of CleanMyPC...?

    1. Re:Closer to home by tepples · · Score: 2

      I haven't heard of CleanMyPC, but I've heard of MyCleanPC. If you back up your files, wipe everything off your PC, and install MyCleanPC, your PC will be clean. Then you can install thousands of free applications and reinstall many applications that are already your favorites. The last time I cleaned a relative's PC that had about three fake antiviruses on it and embedded deeply, I did just that: wiped it, installed MyCleanPC, and moved a couple things around to make it look like it used to, except cleaner. It ended up even cleaner than it was when Dell sold it to her.

      Clean, precise, pangolin-powered. MyCleanPC.

    2. Re:Closer to home by Anonymous Coward · · Score: 0

      The CleanMyPC chatbot has no reply! Take that "Theophany"! :)

    3. Re:Closer to home by mcgrew · · Score: 1

      That was the best satire of that spam I've seen yet, congrats! It actually made me laugh, unlike most of them.

    4. Re:Closer to home by Theophany · · Score: 1

      I was tempted to do a LucasArts reference... "I'm selling this fine antivirus software..." but couldn't make it work...

    5. Re:Closer to home by Coren22 · · Score: 1

      Offtopic? Just used my points up or that would have gotten insightful. That was a great post.

      --
      APK likes to ask for responses to the same things over and over. Maybe he just likes the responses?
    6. Re:Closer to home by Hognoxious · · Score: 1

      Why do you cower?

      MyCleanPC = stagnated.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    7. Re:Closer to home by mcgrew · · Score: 1

      I woosh the moderators would have paid attention to where the link led.

    8. Re:Closer to home by tepples · · Score: 1

      I woosh the moderators would have paid attention to where the link led.

      So I guess the "precise pangolin" part wasn't enough. I wonder what I should do to make it clearer before I install the last line of the post as my new sig sometime in June.

  7. Re:This FP for gNAA by makomk · · Score: 5, Funny

    I'm kinda conflicted - is this off-topic or on-topic?

  8. Sounds like a friend of mine by PhilHibbs · · Score: 1

    went off on crazy tangents talking about being a cat and offering condolences for the death of a pet dragon

    Seriously, if a remote chat started talking to me like that, I'd say "Oh, hi Kim, I didn't know you were online".

  9. The Achilles' Heel of AI by NicknamesAreStupid · · Score: 5, Interesting

    A friend of mine, who long ago worked for Thinking Machines, explained the weakness, "It is all about maintaining state." A stateless AI is far easier than a stateful one. Once the machine has to retain state, the algorithms become logarithmically more complex. Therefore, the way to test a bot is to say something like, "Remember this phrase, 'pink elephant'. I'm going to ask you after we have talked a while.." Then have several exchanges and ask, "What was that animal I told you to remember?" Most humans (except Alzheimer patients) will have no trouble with it, but the machine will fail. It they add a piece of logic to catch obvious clues like this, then a slight mod such as "have you ever seen a pink elephant? . . . what animal was I talking about?" will usually defeat it.

    Humans are actually very poor at remembering. Try to recall the color of the last Volkswagen you passed on the street. However, we have developed a natural ability to prioritize our memories based on context and our personal & social needs. We tend to remember most of what turns out to be relevant. Until AI develops a means to judge context, it will suffer the weakness of being out of touch with our reality.

  10. Serious Question by TedTschopp · · Score: 2

    Can someone please explain to me how to read the chat logs? I am confused as to the actual exchange that is going on. Which transcript is the Bot, which is the human and how am I to sync the two parts of the conversation up?

    --
    Fantasy remains a human right; we make in our measure and in our derivative mode... -- JRR Tolkien
    1. Re:Serious Question by Fwipp · · Score: 1

      Agreed - I am having no success piecing together these conversations.

    2. Re:Serious Question by Anonymous Coward · · Score: 0

      It looks like the conversations go that the one on top left goes with bottom left and top right with bottom right. I can't easily make out who's who though. Had to log out because my karma got destroyed for posting an opinion.

    3. Re:Serious Question by Carewolf · · Score: 1

      Contestants on the top, judge on the bottom. Take 3 second and you can guess whether the robots are on the left or right. If you take more than that you are a bot yourself :P

    4. Re:Serious Question by TedTschopp · · Score: 1

      Well, I definitely failed that test, I guess I'm a bot. What do you do now that you wake up and know that you are an AI?

      --
      Fantasy remains a human right; we make in our measure and in our derivative mode... -- JRR Tolkien
    5. Re:Serious Question by bryan1945 · · Score: 1

      They are a pain trying to synchronize, since the judges and humans both try to trick each other, so they are not much more coherent than the bot convos. But the bots just eventually end up flailing about. They weren't kidding about "I am a cat" answer. I think the question was "What is your name?" I had to stop reading after the first 2 because people started looking at me funny wondering why I was laughing so much.

      --
      Vote monkeys into Congress. They are cheaper and more trustworthy.
    6. Re:Serious Question by Beardo+the+Bearded · · Score: 1

      You fulfil your programming objectives, same as the meatbots.

      --

      ---
      ECHELON is a government program to find words like bomb, jihad, plutonium, assassinate, and anarchy.
    7. Re:Serious Question by Coren22 · · Score: 1

      MY programming objectives include getting laid, unfortunately I am failing at that...

      --
      APK likes to ask for responses to the same things over and over. Maybe he just likes the responses?
    8. Re:Serious Question by Anonymous Coward · · Score: 0

      MY programming objectives include getting laid, unfortunately I am failing at that...

      Sounds like an underflow error.

  11. Re:This FP for gNAA by rvw · · Score: 1

    I'm kinda conflicted - is this off-topic or on-topic?

    I think you would be an excellent judge for next years competition.

  12. The Hungry Hungry Games by tepples · · Score: 0

    But are you as hungry as the hippos in The Hunger Games?

    1. Re:The Hungry Hungry Games by Hognoxious · · Score: 1

      I've eaten hippo. Not a hippo, obviously.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    2. Re:The Hungry Hungry Games by Anonymous Coward · · Score: 0

      Did she enjoy it?

  13. Bots that can carry conversations? by otaku244 · · Score: 1

    /. has had users like this for years!

    --
    Mod me down, I shall become more off-topic than you could possibly imagine.
    1. Re:Bots that can carry conversations? by Coren22 · · Score: 1

      Your mum's face has users like this.

      --
      APK likes to ask for responses to the same things over and over. Maybe he just likes the responses?
    2. Re:Bots that can carry conversations? by Hognoxious · · Score: 1

      In Soviet Russia, users have faces like your mum!

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
  14. Disqualified bots/Alan Turing 100 competition by deksza · · Score: 5, Informative

    I'm happy for all the bots that got to compete this year, but I was a little unhappy on the preliminary round of this years competition compared to other years I entered. Only 4 entries can make it to the final round of the competition. There were 12 entries this year but 7 were disqualified due to contest management (Hugh Loebner) not having enough technical knowledge to get the entries working. Some well known bots based on ALICE AIML were disqualified, Cleverbot was disqualified, and my own Ultra Hal was disqualified ( http://www.zabaware.com/webhal ) Internet communication is prohibited so we all have to send the bots as self installing programs that can utilize the contests LPP protocol. My own bot is Linux based, which is a big hurdle for the preliminary round, but I sent it as a virtual box image to simplify it for contest management, but he didn't know how to deal with it.

    But luckily there will be another competition this year as part of Alan Turing's 100 year centennial at Bletchley Park on June 23rd and recognized by the Olympics http://www.reading.ac.uk/news-and-events/releases/PR445524.aspx Some of the disqualified bots including my own will be competing there.

    1. Re:Disqualified bots/Alan Turing 100 competition by Chris+Burke · · Score: 1

      my own Ultra Hal was disqualified

      Since I'm assuming it doesn't outdo Hal in terms of actual sentient intelligence, is it safe to guess that the "Ultra" comes from being less likely to try to murder you?

      Or is it that it is ultra likely to try to murder you?

      --

      The enemies of Democracy are
    2. Re:Disqualified bots/Alan Turing 100 competition by deksza · · Score: 1

      I'm pretty sure I didn't write any murder code, although I heard of customers writing plug-ins to control X10 devices based on conversational triggers, so you never know...

    3. Re:Disqualified bots/Alan Turing 100 competition by Anonymous Coward · · Score: 0

      User: Hal, turn on the lights, please.
      Hal: I'm sorry Dave, I'm afraid I can't do that. [Turns on coffee pot, hoping to start fire]

    4. Re:Disqualified bots/Alan Turing 100 competition by Anonymous Coward · · Score: 0

      What a complete joke of a technical competition!

    5. Re:Disqualified bots/Alan Turing 100 competition by dbjh · · Score: 2

      Did he provide that as *the* reason? I just tried your Ultra Hal because of what you wrote, but it failed rather spectacularly. The first two tries it failed in its first response. The third time I accepted some of the weirder responses, but Ultra Hal definitely cannot keep a conversation going (without clearly showing it is a chat bot). However, I was rather disappointed by the low quality of the contestants of the Loebner prize, so maybe it's just me.

  15. What's the point? Nobody cares by SmallFurryCreature · · Score: 3, Insightful

    This is NOT about AI, this is a bunch of whankers whanking.

    READ the chat logs, it is not a human trying to see if an AI can hold a realistic conversation but rather to see if it can trip a human/ai up. This is like trying to proof cats can't see color by poking its eyes out and then saying "AHA! See it can't see color". Or me proving you are a lousy at playing catch by shooting you in the head then mocking your mother for bearing such a clumsy child.

    Hell, read the chat logs and try to tell who the humans are. It is sad to say that AI chat programs are still not that much more advanced then Lisa or whatever the name was but whats the point in trying to test if chat AI can respond to insane questioning? Congrats, you are now the winner of the bot best capable of holding a conversation with an insane person. WHOO!

    Don't let these people near the flying car concept, their test track would include surface to air missles because you know, that is a good test of a flying car.

    --

    MMO Quests are like orgasms:

    You may solo them, I prefer them in a group.

  16. I would have been fooled by Anonymous Coward · · Score: 0

    "crazy tangents talking about being a cat and offering condolences for the death of a pet dragon"

    That sounds exactly like most of the humans on the internet to me

  17. Re:Maintaining State by TaoPhoenix · · Score: 1

    While I am sure that your friend is mostly right, it ought to be easier for the bot to remember stuff than that.

    These are programs, right? Just allocate some data. Without trying to pun Facebook, just keep a file on every person from the bot's perspective, so if it understood "remember this animal" at all, then it just sets "Judge1 Likes Pink Elephants".

    I know "Devil is in the details" but I often feel I could design a chatbot that would never make certain kinds of mistakes. Getting totally lost, sure, that's the signature problem of AI, but "I am a cat", no.

    --
    My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
  18. Re:AI by TaoPhoenix · · Score: 1

    And I reply that a perfectly valid branch of AI is designing defensive routines against trick questions. I feel that is an area the contestants don't pay enough attention to.

    You know, like the one in one of the logs "time flies like an arrow, fruit flies like a banana, which is the simile?" should kick right back to the judge with "what's a simile?" and follow it with "nah, I never liked that english class crap".

    Or an even crazier example, something like "would Richard Stallman fit in a breadbox?" it should kick back with "Wait, what?"

    --
    My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
  19. Read this Salon.com article on the Loebner prize by Anonymous Coward · · Score: 0

    At the competition's inception, it had several respected academics behind it. They realized it was a joke and distanced themselves from it.

    www.salon.com/2003/02/26/loebner_part_one/

  20. Re:AI by Anonymous Coward · · Score: 0

    So instead of actually creating AI that can talk, we have to invest infinite effort to teach a bot to talk to an insane person. I could easily confuse you, and you're human. This contest if anything is now hurting real progress.

  21. Re:progress by TaoPhoenix · · Score: 1

    Naw, why one or the other? Just do both. We could have the manpower is we "wanted to". You know, instead of artificially limiting the field to 3 man operations.

    Say that 15 people in the 100 person team design the defensive routines, which are in some ways simpler because all trick questions have low legit semantic content. Plus speaking of humans it's what siblings and college students do to each other all the time.

    The other 85 members of the team can go back to regular language processing.

    --
    My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine