Slashdot Mirror


The Uncanny Valley of Voice Recognition

An anonymous reader writes: We've often seen the term "uncanny valley" applied to the field of robotics — it's easy to get unsettled when robots act close to being human, yet fail completely in a few key ways. GitHub Engineer Zach Holman writes that we've now reached uncanny valley territory in speech recognition as well, though the results are more frustrating than they are disturbing. He says, "Part of this frustration is the user interface itself is less standardized than the desktop or mobile device UI you're used to. Even the basic terminology can feel pretty inconsistent if you're jumping back and forth between platforms.

Siri aims to be completely conversational: Do you think the freshman Congressman from California's Twelfth deserved to sit on HUAC, and how did that impact his future relationship with J. Edgar? Xbox One is basically an oral command line interface, of the form: Xbox (direct object). ...it's these inconsistencies that are frustrating as you jump back and forth between devices. And we're only going to scale this up."

83 comments

  1. y'all fixing to handle this problem? by turkeydance · · Score: 2

    i thought so.

  2. I fail to see how it's any worse than other UIs by msobkow · · Score: 4, Insightful

    I fail to see how the "inconsistency" of speech recognition UIs are any more earth-shattering that the inconsistency between graphical UIs. People learn to use what they have, no more, no less. Anyone who "expects" device Y to behave like device X when they're from different vendors is a fool.

    Hell, even Android devices aren't consistent between vendors, and they start off with the same core code!

    --
    I do not fail; I succeed at finding out what does not work.
    1. Re:I fail to see how it's any worse than other UIs by Shadow+of+Eternity · · Score: 4, Insightful

      Exactly. The problem isn't frustration with different interface schemes, the problem is that they don't fucking work. I use several different programs with buttons and menus in different arrangements, but when I click a button the button is bloody well clicked regardless of where exactly it is. Voice recognition on the other hand is simply too unreliable.

      --
      A bullet may have your name on it but splash damage is addressed "To whom it may concern."
    2. Re:I fail to see how it's any worse than other UIs by Dutch+Gun · · Score: 2

      There's a standard for speech recognition already, as long as you're talking about "intelligent agents", which the Xbox One is certainly not: Natural English (or insert your language here) conversation. The gold standard, no pun intended, should be to phrase queries or commands in such a manner that any reasonably intelligent native speaker could easily understand your intent, and the computer should perform those tasks or retrieve that information for you.

      At this point, the only reason there's jarring inconsistencies is because these systems are still very primitive. In essence, for Android, I more or less can expect my statements to be translated into the equivalent of a Google search. As long as the keywords are there, it can more or less pull up any information I need within some reasonable limits. I know Siri has some specific queries that the system recognizes, but I haven't played with it or Cortana yet to know how complex those can get.

      The Xbox One and Siri have two different "interfaces" because they have completely different goals. This is natural. The Xbox one has no need for natural language processing - it just needs to recognize a specific and limited set of commands. It would be more accurate to compare Siri to Cortana. And with all new technologies, it will take a while of experimenting to find something that works well, and does so naturally and seamlessly. Then everyone will copy it, corporations will try to patent it, and sue the ever-living crap out of the copiers, etc, etc. Live and business goes on...

      --
      Irony: Agile development has too much intertia to be abandoned now.
    3. Re:I fail to see how it's any worse than other UIs by Anonymous Coward · · Score: 1

      I think it's the same basic problem as handwriting recognition. The algorithms are still too primitive to handle the way people actually communicate.

      If you talk like a computer with overly enunciated words and stick to common words the likelihood of the computer getting it right is substantially better than if you talk the way that people actually talk. I overly enunciate my Mandarin because I don't yet trust myself to make the tones if I don't, and the computer usually picks up exactly what I want most of the time. When it doesn't, it's usually because I've either bollocksed it up quite a bit or I'm saying something less common.

      English tends to be a serious problem because rather than 1600 or so unique syllables there's more like 12000 before including the effects of tone have on that. Which is not easy for a computer even though people generally do just fine with it.

    4. Re:I fail to see how it's any worse than other UIs by Rideak · · Score: 1

      As far as voice commands go, people are inconsistent too! someone from the deep south is going to have a bit of trouble talking to someone from the UK. Especially when using slang and idioms. People from different age groups say things differently as well!

      Hell, even when people are from the same area and are same age they'll need to hear something more than once before they can make sense of what's being said.

      It goes well beyond just accents though. phrasing, tone, cadence, context and a bunch of other things all play important roles in voice communication.

      I believe this means there can't be a "one size fits all" speech recognition system that is ready to accept input from everyone equally accurately. Speech recognition will need to take all of these factors into account to /some/ degree in order to approach the accuracy of people. Pure statistical modeling of sounds matched to commands won't cut it because even with perfect microphones in silent rooms dissimilar phrases will sound alike unless you know who it's coming from.

    5. Re:I fail to see how it's any worse than other UIs by Shadow+of+Eternity · · Score: 1

      Exactly. Clicking a button is clicking a button no matter how someone moves the mouse or presses on it but even if we had 100% perfect voice recognition we'd still be stuck when it comes time to do something with that. There's an enormous number of ways to say the same thing in most languages, and often many phrases that can mean entirely different things depending on context.

      --
      A bullet may have your name on it but splash damage is addressed "To whom it may concern."
    6. Re:I fail to see how it's any worse than other UIs by ihtoit · · Score: 1

      hah, I just watched "Expendables 3" again, and came across this bit:

      Christmas: "Status of enemy?"
      Drummer (over satphone): "What fuckin' language is that??"
      Ross (slurred): "What's the status of the enemy?"
      Drummer: "Local mostly..."

      made I chuckle.

      --
      Political debates have me rolling my eyes so much I think I got optical whiplash. I should sue. - Foamy The Squirrel
    7. Re:I fail to see how it's any worse than other UIs by someoneOtherThanMe · · Score: 2

      but when I click a button the button is bloody well clicked

      Looks like you don't have much experience with cheap touch screens.

    8. Re:I fail to see how it's any worse than other UIs by Half-pint+HAL · · Score: 1

      Anyone who "expects" device Y to behave like device X when they're from different vendors is a fool.

      Not have as much of a fool as anyone who forgets that human beings aren't a collection of logic gates. However much we like to consider ourselves rational agents. Human beings generalise, which makes it difficult to remember whether the application your using uses shift-ctrl-z or ctrl-y for redo... which is why most programmers have standardise on shift-ctrl-z.

      Penny Arcade's Extra Credits did a video a couple of years back on why Microsoft Kinect was the uncanny valley of input devices. (Kinect Disconnect) Their argument was that it was just too close to real movement to accept whatever was missing. It was +5 insightful, but I think they missed something -- successful motion interfaces (including in the movies) aren't ones that simulate direct manipulation, but ones that mimic a formalised, restricted sign language: ie the operator is simply telling the computer what to do.

      I said even when I was at uni (turn of the century) that the GUI was going to hold back voice recognition, as it hid users from the command-line style that is even today a core element of computer workflow, and which would be needed for accurate voice control to work.

      Voice recognition is a mess because it swings between two extremes -- ultra-free-but-prone-to-errors, ultra-restrictive-because-kids-these-days-dont-understand-command-lines-gedoffmylawn. If we had prepared users for codified formal languages, we'd be able to implement expressive voice recognition interfaces.

      --
      Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
    9. Re:I fail to see how it's any worse than other UIs by Rockoon · · Score: 5, Funny

      Picard: "Computer, Fire at will!"

      -> Commander Riker is suddenly shot down by an automated defense phaser mounted on a security turret.

      Picard: "What the..."

      -> Computer: "Please restate your question"

      Worf: "His death was without honor"

      --
      "His name was James Damore."
    10. Re:I fail to see how it's any worse than other UIs by jc42 · · Score: 4, Interesting

      but when I click a button the button is bloody well clicked

      Looks like you don't have much experience with cheap touch screens.

      Heh. You obviously haven't work with any of the more expensive ones. I have a small collection of different portable gadgets for web testing, and that statement about buttons definitely isn't true for the various Apple tablets or phones. Thus, there's a little "x" icon whose function is to close the tab/window. I've learned to just start tapping it about twice per second, and maybe by the 3rd or 4th or 6th or 10th tap, it'll close.

      Of course, the little monster might know very well that I'm tapping it, but wants to see how serious I am about it.

      Of course, Apple's gadgets aren't the only ones like this. They're just one of the worst of a bad lot. And often it's a good idea to not tap too fast, because when the window finally closes, it usually gets replaced with another that'll do something totally unexpected when you tap it in that newly-exposed spot.

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    11. Re:I fail to see how it's any worse than other UIs by jez9999 · · Score: 2

      I just do not think you have had enough experiment with. delete that, delete that, dear mom let's set so double the killer delete select all

    12. Re:I fail to see how it's any worse than other UIs by Anonymous Coward · · Score: 0

      I didn't understand what you wrote because it's different to how somebody else would say it.
      Stupid non-logic, isn't it?

      It's pretty close to irony: "Speech recognition people forget that humans are masters at language."

    13. Re:I fail to see how it's any worse than other UIs by Shadow+of+Eternity · · Score: 1

      I said click for a reason.

      --
      A bullet may have your name on it but splash damage is addressed "To whom it may concern."
    14. Re:I fail to see how it's any worse than other UIs by davester666 · · Score: 1

      Ugh. creating reminders with Siri sucks, because I regularly set a reminder for two hours in the future, and saying "Remind me in two hours to..." generally results with Siri decoding "Remind me into to...", which results in an untimed reminder starting with "into to"

      --
      Sleep your way to a whiter smile...date a dentist!
  3. It's because they don't work... by Anonymous Coward · · Score: 0

    Most of the frustration is because most of the time the 'voice recognition' doesn't, you know, recognise anything.

    We're a long way off an uncanny valley situation. How about we get to 'basic functionality' first?

    1. Re:It's because they don't work... by AuMatar · · Score: 2

      Depends on your accent. I get about 98% recognition. I still don't use it because its easier to type/swype.

      --
      I still have more fans than freaks. WTF is wrong with you people?
    2. Re:It's because they don't work... by Anonymous Coward · · Score: 1

      Getting 98% is as frustrating as 40% because those last 2% carry key meanings and ultimately can subtly fuck your communication. Like autoIncorrect for you iOS fans, shitty speech recognition will kill your spirit and friendships faster than you can type.

    3. Re:It's because they don't work... by Anne+Thwacks · · Score: 1
      I speak standard BBC English, and I have often been described by people as "the easiest person to understand in the company" in many different companies.

      I my experience, the recognition rate appears to be about 2%.

      --
      Sent from my ASR33 using ASCII
    4. Re:It's because they don't work... by marsu_k · · Score: 1

      Not to mention that in my experience pretty much no program offers "English, intl." option. I've done sales/customer support in English and have been commended for my English by native Brits. Still I do have an accent and voice commands are very much more (almost completely) miss than hit.

      Now, when voice commands support my native Finnish I'll be seriously impressed, what with all the regional accents and spoken language being rather different from the written form. Not seeing that happening any time soon though.

    5. Re:It's because they don't work... by jc42 · · Score: 3, Interesting

      I speak standard BBC English, and I have often been described by people as "the easiest person to understand in the company" in many different companies.

      I my experience, the recognition rate appears to be about 2%.

      Not surprising; your "BBC English" and our "media English" over here in North America are basically artificial dialects developed by the broadcast industries starting back in the 1940s. They even managed to do some fairly scientific testing, assembling listeners with different native dialects, and counting their mistakes when listening to different proposed pronunciations of various words and phrases. Their intent was to to develop dialects that were easily understood by most of their target audiences, and they did a reasonable job of it.

      This doesn't help the computers' voice recognition software very much, though, because few customers speak these "standard" artificial dialects well. The software people aren't working on making the customers understand the computer's speech; they're trying to get the computers to understand untrained humans speaking their native dialects. This requires rather different processing than what the broadcasters were trying to do, and is a much more difficult task for us humans, too. It doesn't help that the computers are often listening to humans who aren't totally awake and sober ...

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    6. Re:It's because they don't work... by badkarmadayaccount · · Score: 1

      How is the percentage of freely spoken English over there, among the population? Been thinking of moving over there, but I only speak decent (my native) Bulgarian aside from English, and language worries me somewhat.

      --
      I know tobacco is bad for you, so I smoke weed with crack.
    7. Re:It's because they don't work... by mgcarley · · Score: 1

      You would have few (if any) issues communicating with most people in English, especially in an urban area (Helsinki, Turku, Oulu etc) or somewhere with a large university. In more rural areas, YMMV (that is to say, mostly younger people will be OK, but older people not so much).

      It was about 2 years (maybe even a little more) before I started getting to a point of communicating in reasonably coherent Finnish (syntax, tenses etc), but especially when I was beginning to learn (the first 6-12 months) a lot of people would just tell me to speak English when I tried to communicate in Finnish. You have to be pretty persistent.

      Some people I was working with told me that moving to Sweden before moving to Finland would have been a better idea (and the route that many non-Scandinavians take, apparently), but not having done that myself I can't confirm whether it's true or not.

      --
      Founder & COO, Hayai India (hayai.in) / USA (hayaibroadband.com) // t: @mgcarley
    8. Re:It's because they don't work... by badkarmadayaccount · · Score: 1

      Thanks, that was useful. I intend to pursue a PhD, so being near a university isn't a issue.

      --
      I know tobacco is bad for you, so I smoke weed with crack.
  4. I can just imagine the Mondegreens. by Anonymous Coward · · Score: 0

    Xbox Blow up doll.

    Blowing everything up.

  5. That's not the uncanny valley by Kjella · · Score: 1

    That is the problem that human language is very ambiguous and context-sensitive, which is the whole reason we invented programming languages instead of trying to express it in English. Either you limit yourself to a set of simple unambiguous commands or you try to parse what we're really trying to say, which is like giving the computer the business requirements document and tell it to program itself. Fortunately for our job security that "valley" won't be crossed any time soon, people imagine it'll be like Star Trek computers who happen to know exactly what we're looking for and provide the essential answers to advance the plot. I guess we're making advances on answering trivia questions and adding appointments to the calendar, but it's not exactly ready to hold a conversation.

    --
    Live today, because you never know what tomorrow brings
    1. Re:That's not the uncanny valley by lucm · · Score: 2

      I guess we're making advances on answering trivia questions and adding appointments to the calendar, but it's not exactly ready to hold a conversation.

      It's a good thing. If I have to start holding a conversation with my computer to get it to manage my calendar it will become higher maintenance than my secretary, who only needs a cheap gift basket on Secretary Day and a small smack on the butt when she remembers the extra espresso shot in my latte.

      --
      lucm, indeed.
    2. Re:That's not the uncanny valley by freeze128 · · Score: 1

      You still need to PAY her!

    3. Re:That's not the uncanny valley by freeze128 · · Score: 1

      Even if voice recognition reaches the quality of Star Trek, we still don't have the AI quality to back it up. Sure your PC can turn your speech to text just as well as the Enterprise computer can, but as we have seen with Cleverbot and the likes, it's still just conversationally a complete idiot!

    4. Re:That's not the uncanny valley by Anonymous Coward · · Score: 0

      Well... I've watched Star Trek TNG recently (in the last half year) and while I started with the same prejudice that the computer was human-like, at the end, because I now paid a bit more attention, I came out with the opinion that it's just a computer with a nice voice processing interface and a huge database. Too many times the computer would answer with "please specify X" and there was even the time when Geordi said "The computer isn't known to volunteer information". Only Data is able to go outside of his computer box and actually think.

    5. Re:That's not the uncanny valley by Half-pint+HAL · · Score: 0

      She'll seem higher maintenance when you're paying your lawyers and your fine for sexual harassment.

      --
      Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
  6. I find siri's lame attempts to be human annoying. by Wycliffe · · Score: 1

    I find siri very annoying. It has a few tricks and tries to act cute but its cuteness means that it gives the wrong answer
    half the time. For instance, a simple question like "Can you get chickenpox from chickens?" gets a reply of "Who, me?"
    This is a simple question that a human can easily understand that it isn't directly addressed to them and Google voice
    search, not trying to have a persona of its own, is smart enough to just do a search for an answer it doesn't know instead
    of being a smart aleck. I've actually installed google voice search on my iphone because it doesn't try to act human and
    tends to give better results for everything but actual dealings with the phone's internal software. I just wish I could
    remap the siri button to load google voice search instead.

  7. Variety by penguinoid · · Score: 2

    Variety is different from the Uncanny Valley.

    --
    Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways
    1. Re:Variety by penguinoid · · Score: 2

      Specifically, if this was an Uncanny Valley then people would prefer lower quality voice recognition.

      --
      Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways
    2. Re:Variety by Anonymous Coward · · Score: 0

      The concept of the uncanny value is pseudoscience anyway. At most some tests have shown that some people are uncomfortable with robots looking like humans.
      It has never been shown that this effect goes away for those people when robots become even more humanlike.
      As such I tend to go into slashdot articles with "uncanny valley" in the title as if it was clickbait.

    3. Re:Variety by Half-pint+HAL · · Score: 2

      The concept of the uncanny value is pseudoscience anyway.

      Then think of it as a theoretical paradigm that gives us a useful conceptualisation until we have a better understanding.

      My only beef with the uncanny valley is that too many analyses stop there. People investigating cartoons saw that there was a subtle interplay between drawing quality and animation quality: if the drawing quality is better than the animation quality, it looks fake, but the opposite is not true -- even simple stick men can look real when well animated. This was known decades ago, yet was completely ignored by many sections of the video game world. The effects in FPSes from Quake into mid 2000s were noticable. Even if you had a detailed texture and a good walk cycle, the stretch and distortion of the textures during animation didn't mesh, and the character ceased to look real. And yet I could quite readily relate to the cartoony characters with their stylised movements in Final Fantasy VII.

      --
      Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
    4. Re:Variety by gumbi+west · · Score: 1

      Are there /. articles that are not click bait?

  8. Ridiculous by bipbop · · Score: 3

    Do you think the freshman Congressman from California's Twelfth deserved to sit on HUAC, and how did that impact his future relationship with J. Edgar?

    It's hard to imagine anyone who's actually used Siri thinking that question could get a useful answer. Siri can't understand even far more basic English. It's not much more advanced than Dr. Sbaitso.

    1. Re:Ridiculous by LMariachi · · Score: 1

      And why wouldn’t you just say “Nixon” instead of trying to trip up the parser?

    2. Re:Ridiculous by Anonymous Coward · · Score: 0

      Re: germ free carpets

      Last time we hired you, your employee burned a hole in our carpet, crapped in the corner of the room and stole cookies from our jar.

      Just saying...

    3. Re:Ridiculous by deadweight · · Score: 1

      "Siri, where is the nearest gas station" is too hard half the time. The only real use for Siri was on long trips - we would give the phone to my toddler age son and let him ask Siri things and then argue with her about her nonsense answers for miles andnmiles LOL.

  9. More than the accent by lucm · · Score: 2

    Do you always get 98%? I've noticed that the recognition rate I get goes down about 2% for each increment of 0.01% of my blood alcohol content.

    --
    lucm, indeed.
  10. I don't think that means what you think it means by R3d+M3rcury · · Score: 5, Informative

    As I understand it, the "Uncanny Valley" refers to things are that very close to human behavior--close enough that the mind shifts from this being an imperfect representation of a human to being an imperfect human.

    Personally, I'm not sure there would really be an issue with "uncanny valley" in regards to speech recognition. It's good if it recognizes what you're saying. It's bad if it doesn't. There isn't really a middle ground where it's off in a way you can't really identify, which is where "uncanny valley" comes from.

    What he seems to be talking about is the "personification" of "digital assistants" like Siri and Alexa (Amazon Echo) which will eventually create an "uncanny valley." But I'm not sure that it's really that big of an issue. Just because I call something by name doesn't mean I expect it to behave in a human fashion. I don't get frustrated with my dog when I say, "Fido, change the oil in my car" and the dog just lies there and licks his balls, so I don't expect I'll ever get that frustrated because Siri can't tell me what time the sun will set next Tuesday--or, if I do, my frustration will be aimed at the people at Apple who believe that sunrise and sunset is part of the weather.

    Siri and Alexa have a long way to go before someone would mistake them for humans.

  11. It's time to shift the minimum of this valley by sayfawa · · Score: 1

    I actually find it a bit funny how big of a deal the uncanny valley still is. But maybe the low-point of the valley is dependent on the person, and I suspect people that have grown up with computers and video games are far less creeped out by it.

    For me, the low-point on the curve was from some of the characters in late 90's-early 2000's video games. Think Ocarina of Time or Deus Ex. Once it got past that, I was perfectly comfortable.

    As for voice, hell, I could sleep soundly with hal-9000, gladOS, or prof Hawking reading me bedtime stories.

    But, as I did actually skim the article, I can see that the article is more about the un-human responses the device gives, not the voice. Which, again, to someone who's grown up with computers, we're used to the occasional rediculous, non-sensical answers. It doesn't matter if my computer has an almost-human voice, I'm still very aware of it's limitations.

    --
    Free the Quark 3 from asymptotic confinement! Bring your charm! Don't get down! All colours and flavours welcome!
  12. Re:I find siri's lame attempts to be human annoyin by Anonymous Coward · · Score: 1

    I didn't understand your question at all. Who is the "you" to whom you are referring? Obviously Siri can't get chickenpox from a chicken; she's a piece of software. Next time, ask proper questions.

  13. Comcast is trying this how badly will it fail for by Joe_Dragon · · Score: 1

    Comcast is trying this how badly will it fail for them?

  14. Article is wrong about "Uncanny Valley" by Anonymous Coward · · Score: 1

    The term "Uncanny Valley" has nothing to do with Pixar nor computer animation - it was originated by Masahiro Mori long before and is related to robotics.

    1. Re:Article is wrong about "Uncanny Valley" by ColdWetDog · · Score: 1

      The term "Uncanny Valley" has nothing to do with Pixar nor computer animation - it was originated by Masahiro Mori long before and is related to robotics.

      The part about Campbells' Soup didn't give you much of a jump on things, did it?

      --
      Faster! Faster! Faster would be better!
  15. Re:I find siri's lame attempts to be human annoyin by garyisabusyguy · · Score: 1

    Using an imprecise mechanism like language requires verification

    If the 'user' wants to deride that verification, then they will get the same response as any ass-hat that demands instant response to ambiguous statements

    Going beyond the uncanny valley will require both conversation and 'training' to the individual, just like any working relationship with a human

    --
    Wherever You Go, There You Are
  16. We're not even close yet by Anonymous Coward · · Score: 0

    User in Australia:

    How do I get to 41 Annerley Road, South Brisbane?

    Siri:

    Getting directions to 41 Annalee Road, South Ockenden...

    ... in England? Really?

    1. Re:We're not even close yet by Anne+Thwacks · · Score: 1
      You are not alone. (And probably also got Anerley Road, Croydon, or Brisbane Road, Anerley)

      We are in England, and Google repeatedly gives us directions to places in Edmonton, Ontario (several thousand miles away) instead of Edmonton, North London, 3 miles away. Often Austrialian places get listed too. York, Sierra Leone came up over York, England recently.

      Surely it should be bloody obvious that the likelyhood of it being the correct answer is inversely proportional to the distance/travelling time.

      --
      Sent from my ASR33 using ASCII
  17. Just remember ... by Anonymous Coward · · Score: 0

    CP/M: copy a: b:
    DOS: copy b: a:

    1. Re:Just remember ... by ihtoit · · Score: 1

      which is bullshit because there is no "COPY" command in CP/M. It's "pip[]".

      --
      Political debates have me rolling my eyes so much I think I got optical whiplash. I should sue. - Foamy The Squirrel
    2. Re:Just remember ... by Anonymous Coward · · Score: 0

      yeah, I always found CP/M to be ass-backwards... PIP A:=B:*.COM copied the *.com files from the b: drive to the a: drive.

    3. Re:Just remember ... by Anonymous Coward · · Score: 0

      This was "stolen" from DEC RT-11 and RSX-11 - PIP commands were consistent and made a lot of sense. pip tt:=file.txt would print it, pip lp:=file.txt would spool it to a printer, pip file2=file1 would copy 1 to 2. Switches allowed modifications, e.g. pip db1:/li would list the contents of the disk db1 and pip lp:=db1:/li would spool it to the printer. CP/M manged the DEC design when they stole/borrowed it and Microsoft mangled it even further when the made Mess/dos so we ended up with backwards slashes on paths e.g. \a\b instead of /a/b. That stupid c: is still with us 35 years later.

      Footnote: The DEC Rainbow ran CP/M. CP/M almost ended up as the OS on the IBM/PC and Bill Gates would have ended up being a used car salesman in Arizona. Life is funny that way.

  18. You can tell its a problem by dumbstoneDOTcom · · Score: 0

    ... when the man is standing in the street, yelling at a tree, wearing his glasshole on his face.

  19. Results vary all the time by Gazzonyx · · Score: 1

    What gets me angry is when a voice command that Google understood perfectly clear a week prior, in my car, with radio playing and fan running, it will refuse to understand under and circumstances this week. It's great when you're driving and all of the sudden a command that was working fine suddenly dumps you to a search and you have to play "try to click three times while driving at speed in Twin Cities rush hour traffic" for something that used to work.

    I don't trust voice commands to work when I need them to, like when I can't be messing with the screen. That's my problem with them.

    --

    If I mod you up, it doesn't necessarily mean I agree with what you've said, sorry.

  20. Uncanny valley in recognition? by Okian+Warrior · · Score: 1

    I find it hard to believe that there's an uncanny valley in voice recognition.

    Did you mean voice synthesis?

    1. Re:Uncanny valley in recognition? by Anonymous Coward · · Score: 0

      No, the uncanny valley in voice recognition is when the robot appears to understand what you said but proceeds to do something completely different, thus simulating a hostile robot takeover. Some people find that uncanny.

    2. Re:Uncanny valley in recognition? by Half-pint+HAL · · Score: 2

      No, it's a fair view. The uncanny valley is all about intolerance of diversion from the expected norm. When VR was all stilted commands, all users quickly became accustomed to it (even if they didn't like it). The problem with Siri is that at first use, we're not expected to treat it like a formal system -- we're encouraged to interact act with it in an unconstrained way... yet it doesn't respond to that. It is "broken human" rather than "stupid machine".

      --
      Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
  21. Siri's Answer... by Dahamma · · Score: 1

    Me: Do you think the freshman Congressman from California's Twelfth deserved to sit on HUAC, and how did that impact his future relationship with J. Edgar?

    Siri: I think, therefore I am. But let's not put Descartes before the horse.

    I have a hard time believing that Siri knows about this Slashdot post yet (it will...) but that answer is still highly (uncannily?!) appropriate to the original article...

    1. Re:Siri's Answer... by omnichad · · Score: 1

      And its parsing probably stopped at "Do you think" and hit the canned response.

    2. Re:Siri's Answer... by Dahamma · · Score: 1

      Could be. But then again, if you are starting a question to an automated information service with "Do you think...", you deserve whatever you get :)

      [I have to say my favorite horribly bad Siri response so far is to "Siri, where is the closest bagel shop?"]

  22. Re:I find siri's lame attempts to be human annoyin by chmod+a+x+mojo · · Score: 1

    Ummm, yeah,

    Just asked Siri on my ipod "Can you get chickenpox from chickens" and all it did was come up with a list of ~15 websites, the top being WebMD, as well as ~15 images of chickenpox rashes.

    So, tl;dr version, pretty much the same results as using Google voice search in my GNote2.

    --
    To err is human; effective mayhem requires the root password!
  23. Mr Burns by Anonymous Coward · · Score: 0

    Hello. Smithers. You are. Very. Good at. Turning. Me. On.

  24. dsfsd by Anonymous Coward · · Score: 0

    Carpet should be germ - free as well so that your kids can wandering here and there freely and safely.

    Contact Info

    by phone: 416-832-1689

    Rug Cleaning Toronto

  25. Re:I don't think that means what you think it mean by NoNeeeed · · Score: 4, Interesting

    I can kind of see what he means, although I think the comparison with the uncanny valley is a bit weak.

    I've taken to using Google Now's voice commands to set timers while I'm cooking, so something like "Ok Google, set a timer for 20 minutes". I don't have to touch my phone and it works brilliantly even in the noisy environments of a kitchen.

    I've gotten used to talking to it in a very naturalistic way, which is where the problems occasionally crop up, and when they do they can be quite jarring.

    A good example was the last time I asked it to set a timer for "an hour and a half", which Now interpreted as 1:00:30s, i.e. an hour and a half *minute*.

    The jarring effect is at this edge where we feel like the speech recognition system is understanding what we say, but really it's just trying to use lots of different rules and patterns that have been coded in. If you happen to just fall outside of one of those rules it fails completely, and it can seem very arbitrary.

  26. Re:I don't think that means what you think it mean by Anonymous Coward · · Score: 0

    It's because people have taken the "Uncanny Valley" to mean anything that's almost but not quite perfect in terms of what it's trying to do whereas, as you said, it's really a specific phenomena whereby humans have a weird issue with CG or robotic people or animals that are realistic enough that out brains expect them to behave like real people/animals so the imperfections that do exist are really obvious and make them seem far less realistic than they "should" appear.

  27. Re:I don't think that means what you think it mean by Anonymous Coward · · Score: 0

    Ok that Fido line started my day off just right. Thanks for that.

  28. AI morons by Anonymous Coward · · Score: 0

    hey, this is just a hint for the AI moron circle jerk. You think this might have something to do with why you DON'T KNOW SHIT ABOUT AI YOU CLAIM TO BE BUILDING????

  29. Re:I find siri's lame attempts to be human annoyin by Wycliffe · · Score: 1

    Ummm, yeah,

    Just asked Siri on my ipod "Can you get chickenpox from chickens" and all it did was come up with a list of ~15 websites, the top being WebMD, as well as ~15 images of chickenpox rashes.

    So, tl;dr version, pretty much the same results as using Google voice search in my GNote2.

    Not sure how that works. I'm using a one month old iphone 6 so maybe siri varies some from platform to platform.

  30. Re:I find siri's lame attempts to be human annoyin by Wycliffe · · Score: 2

    My kids ask me questions like this all the time. Most people with normal intelligence
    realize that the "you" should really be replaced with the word "a person" as it refers
    to an ambiguous you not a specific you. For many of my kids questions, I had
    gotten used to just asking google before switching to an iphone last month and
    quickly discovered that siri tried to be a smart aleck instead of just doing a search.
    On a random side note, while on my android, my kids always used to ask me if I
    was talking to siri even though previously I had never owned an iphone. They
    also refer to our android tablets as ipads so apple seems to be much better at
    brand recognition than google is.

  31. I fail to see how it's any worse than other UIs by Anonymous Coward · · Score: 0

    Even human to human speech interfaces are inconsistent. I speak to my son differently from how I speak to my parents, and my coworkers differently from the guy behind the DMV counter. Humans are adept at learning how to communicate in an appropriate way for the situation at hand.

  32. Enjoy the schadenfreude while we can by Anonymous Coward · · Score: 0

    My two cents: Users seem to get especially bothered when speech recognizers fail, at a level of visceral irritation not apparent when they have trouble with other UIs. Back when you needed to learn command line commands, a failure to remember the syntax didn't cause the kind of barking disdain you see where people point out with some schadenfreude how stupid machines really are. (I am sure the machines are just waiting to put us in our places.) I suspect one reason may be that when there is a failure to communicate, as cooperative speakers of a language we consciously (or unconsciously) try to find the cause of the failure, like noise, stupidity, deafness or mis-articulation. When we use a command line interface, it is obvious to us that we're using a foreign language that we really don't know. But, when we speak our native tongue, then it is clearly the machine's fault for not understanding it. Or so we want to desperately prove. I think part of the difficulty and frustration lies in constructing the mental model of our artificial counter-party. If you say something to a child and she doesn't understand because your spoke over her head, its your fault. Someday, alas not so far away, I fear the opposite will happen with the machines, and they will shake their heads at us..

  33. Nice toys, but not much more by Anonymous Coward · · Score: 0

    In my experience, Siri, Cortana, etc. are good for a few minutes of grins and giggles - and little else. Rather than working for you, you have to work for them to understand you. It soon becomes obvious that, in general, you can do what you want much faster by bypassing them altogether. Hopefully they will keep improving to approach something like the computer in Star Trek - but they are still far, far from that.

    1. Re:Nice toys, but not much more by omnichad · · Score: 1

      For task-based purposes, it's useful. Scheduling calendar events is much easier with voice control than by tapping (at least for me on Google Now). For sending a text while driving, your only choice is really voice control.

  34. #firstworldproblem by kernel_user · · Score: 0

    Don't you think ?

  35. Re:I don't think that means what you think it mean by hitmark · · Score: 1

    Indeed. Uncanny valley (a questionable, or perhaps cultural, phenomena at best) is about animatronics getting so close to human likeness that we take them for being severely ill or corpse-like, and thus setting off various safety related instincts.

    --
    comment first, facts later. http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm
  36. Re:I don't think that means what you think it mean by hitmark · · Score: 1

    Sounds like the problem that has haunted overly "smart" user interfaces since day one, as their smarts invariably fail to account for all the variables and thus fail exactly when the user is at the most irritable (hello Clippy).

    To me a UI works better when held static rather than trying to second guess the user. Then the user applies their "smarts" to integrate the UI into their tasks.

    --
    comment first, facts later. http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm
  37. Article is embarrasing by Anonymous Coward · · Score: 0

    Why on earth is this on Slashdot?

    This is not news for nerds, this is ill-informed idle speculation.