Slashdot Mirror


Vista Speech Recognition Goes Awry

An anonymous reader writes "It seems even MSNBC is willing to take a jab on those rare occasions when Microsoft products don't work. During a demo of Vista's speech recognition technology, Vista couldn't differentiate between mom and aunt, and all attempts to rectify the problem just made it worse. Wait until you see what it spat out, I think we have a new 'All your base.' Don't you just love Microsoft's live demonstrations?"

418 comments

  1. Roald Dahl by Ithika · · Score: 4, Funny

    Reminds me of the Roald Dahl short story about the ant-eater who ate someone's aunt because their accent rendered the two words the same.

    I can't remember what the story was called.

    1. Re:Roald Dahl by Kuxman · · Score: 1

      The tollbooth?

      --
      http://www.asti-usa.com
    2. Re:Roald Dahl by Anonymous Coward · · Score: 0

      Reminds me of another (off topic) joke:
      What do you call a fish without an eye?
      .
      .
      .
      .
      .
      .
      .
      .
      .
      Fsh

    3. Re:Roald Dahl by EddWo · · Score: 4, Informative

      From the book Dirty Beasts

      --
      "Taligent is still pure vapor. Maybe they'll be the last who jumps up on Openstep... "
    4. Re:Roald Dahl by dtobias · · Score: 0

      Are you perhaps thinking of The Phantom Tollbooth? That's by Norton Juster, not Roald Dahl, and (as far as I recall) does not feature any aunts being eaten, though various puns are used throughout the book.

      --
      --Dan
      Web Tips
    5. Re:Roald Dahl by madprogrammer · · Score: 1

      Maybe you should take that as a sign that you have got the wrong one.

      Roald Dahl definitely wrote a poem about an anteater that would rather eat people than ants, but I don't remember more than that.

    6. Re:Roald Dahl by palad1 · · Score: 2, Funny
    7. Re:Roald Dahl by Anonymous Coward · · Score: 0

      Awl ewer bays, bitches

    8. Re:Roald Dahl by Reverend528 · · Score: 2, Funny

      Eat up Martha?

    9. Re:Roald Dahl by arashi+no+garou · · Score: 1

      You know, I was an avid Dahl fan as a child and I also read "The Phantom Tollbooth" more than once. To this day I *still* catch myself associating Dahl with that book, when it was actually Norton Juster who penned it.

    10. Re:Roald Dahl by ronaldb64 · · Score: 1

      Reminds me more of Buttle vs. Tuttle.... with disastrous consequences...:)

      --
      There's no place like 127.0.0.1
  2. Sigh by Anonymous Coward · · Score: 0

    Yeah, well its MS. You get that.

  3. Awww...c'mon guys.... by JoeLinux · · Score: 5, Funny

    It's just a one-time thing.

    I mean, it's not like they have a reputation for releasing half-assed code that's been hyped up through marketing to the point that it will never perform as advertised.

    And it's not like this is a company that is having image problems due to its monopolistic nature.

    Or headed by an infamous ragaholic with a history of intolerance towards free standards.

    Nope, I'm sure that this is just an accident by a company that spends its off hours petting little baby chickens and bunnies.

    1. Re:Awww...c'mon guys.... by kripkenstein · · Score: 4, Insightful

      Nothing to worry about, I'm sure they'll get all the kinks out by the time Vista is released - sometime in 2008 or so, it seems, based on this video.

      This was really a dreadful presentation. There was no ambient noise (as the commentators say later, and despite what Microsoft says), and there was no echo as the demonstrator claims during the actual test. It seems to have been done under really good test conditions, but still it failed miserably.

    2. Re:Awww...c'mon guys.... by tomstdenis · · Score: 5, Insightful

      Most likely the system was trained by an engineer and handed off to the ass in marketting. He was probably supposed to train it to his voice too but decided to hit the bar instead.

      Voice recognition requires some training regardless of who provides it. We're not Star Trek here....Prep work and rehearsal people. If mr. sales guy had tried the demo before the presentation he would have noticed it wasn't working and avoided the embarassment.

      This is why sales people are asshats. They're unprofessional non-technical people who sap back the high life while the rest of us have to put up with the mess they create through their daily barrage of verbal diarhea.

      Tom

      --
      Someday, I'll have a real sig.
    3. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      Older generation systems do.

      Dragon System newer system does not. So, could it be that MS is behind the times with its newest unreleased OS and Office Suite??

    4. Re:Awww...c'mon guys.... by tomstdenis · · Score: 5, Insightful

      Generally, from what I've seen you need to train it a bit on the way you speak. There are thousands of distinct English accents and pronounciation variations.

      For instance, the word "patent" is pronounced differently in the UK from North America. In the UK it is "pay-tent" and over here it's "pah-tent". That's just one example.

      Point is [to paraphrase ballmer]:

      Preperation (clap), preperation (clap), preperation (clap), preperation (clap), preperation (clap), [pitch of voice higher], preperation (clap), preperation (clap), [wheeze out of breath, pitch even higher], preperation (clap), preperation (clap), yeah!!!

      Something tells me this sales guy will get neither punished nor lose their x-mas bonus. Some poor schmuck in engineering will take the fall for not making the demo "people ready".

      Tom

      --
      Someday, I'll have a real sig.
    5. Re:Awww...c'mon guys.... by jrmcferren · · Score: 0, Redundant

      Maybe the voice recognition software was not properly trained. I had shit like that when I played with voice recogniton software before.

      --
      sudo mod me up
    6. Re:Awww...c'mon guys.... by calculadoru · · Score: 5, Funny

      Preperation (clap), preperation (clap), preperation (clap), preperation (clap), preperation (clap), [pitch of voice higher], preperation (clap), preperation (clap), [wheeze out of breath, pitch even higher], preperation (clap), preperation (clap), yeah!!!

      One would be inclined to think that since you went and typed that word nine times, you would have managed to spell preparation correctly at least once...

      --
      The power of accurate observation is commonly called cynicism by those who have not got it. -- G.B. Shaw
    7. Re:Awww...c'mon guys.... by jc42 · · Score: 4, Insightful

      There are thousands of distinct English accents and pronounciation variations.

      Aw, c'mon; how many English dialects pronounce "mom" and "aunt" similarly?

      Even to someone who's worked with voice recognition, that mistake simply isn't credible. If the software were anywhere near usable, it wouldn't confuse those words from anyone, especially not in a low-noise, no-echo demo.

      This is a "No excuses" situation. That demo was simply a dismal failure due to some major bug(s).

      Of course, the speech recognition field has a long history of staying in such a state forever. It's hard to find a product that, even with extensive training, doesn't produce howlers like this.

      I did like the "killer" part ...

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    8. Re:Awww...c'mon guys.... by teridon · · Score: 2, Interesting

      FWIW, Nuance claims that their latest version of Dragon Naturally Speaking (v9) doesn't require training before use. But of course this is different software. But consider this -- aren't "Mom" and "Aunt" phonetically dissimilar enough that you should NOT need to train it?

      I'm not one to defend MS, but I speculate that the volume on his microphone was set too high, causing distortion and clipping. Look at the volume meter when he talks -- it goes all the way to the top.

      --
      I hold it, that a little rebellion, now and then, is a good thing. -- Thomas Jefferson
    9. Re:Awww...c'mon guys.... by Illserve · · Score: 3, Interesting

      Let's give this guy some credit. He clearly has some degree if competence if he's selected to showboat the app at a major presentation, at least enough to know that you need to train, or at least test, a voice recognition demo.

      A far more likely scenario, in my mind, is that he trained and tested it 100 times and got it working nearly flawlessly, but in a different room and with a different setup. In fact he may have overtrained it. Programs like this can behave very badly when they end up overfitting the data.

      On the day in question he may have had a different mic and the acoustics were certainly different and the program went whacko.

    10. Re:Awww...c'mon guys.... by tomstdenis · · Score: 4, Funny

      I used the Vista Beta Speech Recognition?

      [and I copy/pasted it. Yeah I know, I'm hardly literate. What you wanna fight about it?]

      Tom

      --
      Someday, I'll have a real sig.
    11. Re:Awww...c'mon guys.... by tomstdenis · · Score: 4, Insightful

      I never said training was the only cause of the failure. I said it's likely that he didn't train it. Because most high powered sales people are just cocaine snorting asshats that make peoples lives miserable.

      Chances are he never even did a walk through of the presentation before the press was there.

      Tom

      --
      Someday, I'll have a real sig.
    12. Re:Awww...c'mon guys.... by calculadoru · · Score: 1

      I used the Vista Beta Speech Recognition

      that figures then :)

      --
      The power of accurate observation is commonly called cynicism by those who have not got it. -- G.B. Shaw
    13. Re:Awww...c'mon guys.... by tomstdenis · · Score: 3, Insightful

      You clearly don't work for a large corporation. Sales people (who are not all bad) are typically the sort that don't really understand technology and are all juiced up to make sales. Around technology they often leap before actually finding out the facts which nets them in a world of trouble.

      I seriously doubt this presentation was rehearsed. At the very least, they should have tested it in that room with that mic, etc. But in all honesty, this is going to be used by millions of people in all sorts of rooms with all sorts of mics. That shouldn't matter anyways.

      Anyways, I doubt he prepared at all, that is, other than snorting cocaine off a mirror in the back room before the show.

      Tom

      --
      Someday, I'll have a real sig.
    14. Re:Awww...c'mon guys.... by midkay · · Score: 1

      Why would they show it if they didn't think it would work? Yes, it sucked, but it must have gone through *a lot* of testing before it got greenlit for a live demonstration. "Hey, Bill, the voice recognition app is ready for us to demonstrate." "Has it been tested in a wide variety of circumstances?" "Yes, but it failed miserably. Let's demo it anyways." "Sounds good!"

    15. Re:Awww...c'mon guys.... by BasilBrush · · Score: 5, Funny

      I have been using the Microsoft Vista speech recognition feature for a while now and I can assure you all that it worms delete no delete bastard undo select back 8 you fucking aunt no select all enter

    16. Re:Awww...c'mon guys.... by bozojoe · · Score: 1

      touche'

      --
      lick the cancle button (at least thats what our Chinese QA says)
    17. Re:Awww...c'mon guys.... by westlake · · Score: 1

      Now that you have that rant out of your system, let us know how your own pet project fares in its next live-on-stage public demo.

    18. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      God - as long as you vomit several paragraphs, you get positive moderation in Slashdot.

      You can't spell, phonetically or otherwise.

    19. Re:Awww...c'mon guys.... by Peyna · · Score: 1

      For instance, the word "patent" is pronounced differently in the UK from North America. In the UK it is "pay-tent" and over here it's "pah-tent".

      Actually, in the U.S., the word "patent" pronounced "pa-tent" and the word "patent" pronounced "pay-tent" have two very different meanings.

      "Pa-tent" can be a type of deed, a letters patent, etc. "Pay-tent" is the opposite of latent.

      --
      What?
    20. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      I can understand why such a world view may be comforting. I can also understand that junk food and "social networking" isn't nearly as sexy as coke but programmers can suck too.

    21. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0
      Most likely the system was trained by an engineer


      To me it seems that the system was trained by Bill Gates himself.
    22. Re:Awww...c'mon guys.... by KlaymenDK · · Score: 3, Interesting
      Aw, c'mon; how many English dialects pronounce "mom" and "aunt" similarly?


      Try this:
      Said: "How to recognize speech"
      Understood: "How to wreck a nice beach"

      No, it's not always easy to tell the difference...
    23. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      Sacrificing a marketing guy to save Microsoft?

      *Head explodes*

    24. Re:Awww...c'mon guys.... by samkass · · Score: 1

      Voice recognition requires some training regardless of who provides it. We're not Star Trek here...

      Not necessarily. Even in the early 90's Carnegie Mellon's sphinx2 project did a fairly reasonable job on a completely random sample of people, untrained. With modern semantic parsers and such, it should be even better. This isn't an insurmountable problem, it's just a matter of how much RAM and CPU you want to throw at the problem. In that respect, Dragon probably has an advantage, as a built-in OS feature probably can't suck up 100+MB off the top in an always-resident manner. (Incidentally, no computer or human will ever be 100% accurate at speech recognition, but the "edit" commands should probably be a LOT higher on the priority list for recognition than they appear to be in this demo.)

      Incidentally, the sphinx2 project tended to use early Alphas (with the beta of OSF/1) as servers, and NeXT workstations as clients. I think a lot of those folks went to either Apple or IBM, then later Dragon.

      --
      E pluribus unum
    25. Re:Awww...c'mon guys.... by tomstdenis · · Score: 1

      Um, no. The claim of an invention is pronounced "pay-tent" in the UK. I routinely talk with people from that part of the world.

      Also the phonetic pronounciation of the word is actually pay-tent. Us north american folks are just crazy.

      Tom

      --
      Someday, I'll have a real sig.
    26. Re:Awww...c'mon guys.... by cruff · · Score: 1

      Perhaps he really meant perpetration?

    27. Re:Awww...c'mon guys.... by Mignon · · Score: 1
      Remember how Palm dealt with the handwriting recognition problem on their low-powered devices? They invented a new way of writing and required the user learn to use it (with an on-screen keyboard as a backup.) It's close enough to "natural" writing that it's easy to learn. I think it was a good balance of giving good results and easy to learn.

      I wonder if this approach would work with computer speech recognition - invent a new pronunciation technique that is easy enough to learn but provides enough clues to the software that it can figure out the rest.

      Provided you don't sound like too much of a dork talking to your computer in this way, it might actually be a reasonable approach.

    28. Re:Awww...c'mon guys.... by WilliamSChips · · Score: 3, Funny
      Because most high powered sales people are just cocaine snorting asshats that make peoples lives miserable.
      And I think calling them that would be an insult to the cocaine-snorting asshat that makes people's lives miserable community.
      --
      Please, for the good of Humanity, vote Obama.
    29. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      I'm not surprised. Do you remember in 98 how MS had a "word of the day" feature? One day my word of the day was "fcuk" ... spelled correctly - god's honest truth.

    30. Re:Awww...c'mon guys.... by LnxAddct · · Score: 2, Interesting

      I guess you've never used the voice recognition in OS X. Out of the box, worked perfectly for me. I can speak very normally, sometimes even faster than I usually speak to people, and it works fine. I've never trained it (I don't even think you can). Microsoft is simply half-assing it again.
      Regards,
      Steve

    31. Re:Awww...c'mon guys.... by cyber-vandal · · Score: 1

      a built-in OS feature probably can't suck up 100+MB off the top in an always-resident manner.

      Can't it? I would have thought part of the OS would be able to do a great deal more than an app that can't access all areas.

    32. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      Or maybe he's enough of a twit that somebody decided to sabotage the presentation by mistraining the application.

    33. Re:Awww...c'mon guys.... by DavidD_CA · · Score: 1
      This is why sales people are asshats. They're unprofessional non-technical people who sap back the high life while the rest of us have to put up with the mess they create through their daily barrage of verbal diarhea.


      Wow. Tell us how you really feel.


      You know, a lot of us asshat sales people think that you technical people are a bunch of basement dwelling, Star Wars quoting, self-rightous, introverted, bad-heigene virgins who scoff at any question and make no attempt to educate your users because in your mind there is no possible way they could ever understand the complicated intricities of a computer.

      --
      -David
    34. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      Ya' know what? I want and re-read the guys posts, and there wasn't one thing which would indicate that programmers can't suck, nor was there anything to lead one to believe programmers can't suck.

      Sales people (sic), on the other hand.....

    35. Re:Awww...c'mon guys.... by jc42 · · Score: 1

      Said: "How to recognize speech"
      Understood: "How to wreck a nice beach"


      That's one of my favorites. It's right up there with the popular pair:

      Time flies like an arrow.
      Fruit flies like a banana.


      The first pair is a problem in phoneme recognition, of course, while the second is a problem in parsing. And there's also the observation that "Time flies like an arrow" has a valid second parsing, with "Time" the verb and the topic one of how to determine the velocity of travelling objects. So even if you've got all the phonemes right, you can still face some serious problem extracting the meaning. And we haven't even considered homonyms yet ...

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    36. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      When I read "preperation (clap)", I assumed he was talking about the famous ointment. I've never used the stuff, so it seemed plausible to me that it's spelled with an e instead of an a.

      I also just assumed that he was coughing and wheezing because his butt hurts. Preperation^H^H^H^H^H^H^H^Haration H.

    37. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      But we usually close our italics tag...

    38. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      Hi Tom,

      Hope you read this since I'm posting as AC and all, but the web site in your profile, imaluser.com, is a site selling off-brand sex drugs. Is this intentional, or are you linking to a domain name you forgot to renew?

      Thanks
      R.X. d C

    39. Re:Awww...c'mon guys.... by Hikaru79 · · Score: 2, Funny

      the speech recognition field has a long history of staying in such a state forever.

      Wow. I think my head exploded just trying to think about that.

    40. Re:Awww...c'mon guys.... by tengwar · · Score: 1

      Yes, but OS X is just recognising canned phrases - much easier (and far less useful) than recognising words in a stream of natural-language dictation. I like MacOS, but this is something I really miss.

    41. Re:Awww...c'mon guys.... by Weedlekin · · Score: 1

      Enunciating clearly with short pauses between each word generally helps quite a lot, just as it does when talking to foreigners with limited fluency in a language.

      --
      I'm not going to change your sheets again, Mr. Hastings.
    42. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      Typical country weather... Not Tickle your %&$* with a feather.

    43. Re:Awww...c'mon guys.... by B3ryllium · · Score: 1

      Yay Firefly! :) Shindig was a fun episode ... but I think Ariel had more speech recognition. It certainly didn't have more asshats, though. :)

    44. Re:Awww...c'mon guys.... by miskatonic+alumnus · · Score: 2, Funny

      Or the classic "Scuse me while I kiss this guy".

    45. Re:Awww...c'mon guys.... by EvilSporkMan · · Score: 1

      That's "can't" in the sense of "isn't allowed to, for pragmatic reasons". For example: "Your virus scanner can't run at real-time priority; the end user might have other things running!"

      --
      -insert a witty something-
    46. Re:Awww...c'mon guys.... by rfunches · · Score: 1

      According to this, not only do you not need to train Vista's speech recognition engine (if you can call it that after this fiasco), but it is also very "easy" to navigate the OS even if you don't know commands like "show numbers" for selecting graphics on a web page in IE (because that's a "duh" command) and "close that."

    47. Re:Awww...c'mon guys.... by dafing · · Score: 1

      Only one says "mom", the rest of us say "mum". I was about to comment on the "pay-tent"/"pah-tent" comment, but this one i just couldnt resist.

      Have a look at

      http://en.wikipedia.org/wiki/American_and_British_ English_differences

      --
      --- ...or a new slashdot signature. Dear aunt, let's set so double the killer delete select all
    48. Re:Awww...c'mon guys.... by Peyna · · Score: 1

      Um, no. The claim of an invention is pronounced "pay-tent" in the UK. I routinely talk with people from that part of the world.

      I said the US, not the UK. In the US "pay-tent" is the opposite of latent, and "pat-ent" is the claim of an invention.

      --
      What?
    49. Re:Awww...c'mon guys.... by jez9999 · · Score: 1

      Actually, the 'edit' commands did look like they were recognised fine. Word just wrote them into the document instead of obeying them. Any why wouldn't it? Would you want your document buggered up because you happened to want to say 'select all' or 'back' or 'delete' or 'fix'???

      The only sensible solution i can think of is to click a button or press a key before giving an edit command.

    50. Re:Awww...c'mon guys.... by FuturePastNow · · Score: 1

      That reminds me of the time I brought home a Tablet PC from work- my roomates and I "trained" the writing and speech recognition (which I never used, the computer was just a big wi-fi remote desktop display) to be highly inaccurate... profanely inaccurate. Good times- I wonder who is using that computer now?

      --
      Give a man fire, and you warm him for the night. Set a man on fire, and you warm him for the rest of his life.
    51. Re:Awww...c'mon guys.... by killjoe · · Score: 1

      So you are saying that the MS sales reps are just as incompetent as MS management and MS programmers.

      --
      evil is as evil does
    52. Re:Awww...c'mon guys.... by sim82 · · Score: 1

      That makes it even worse. Software like that has been around for years. It should be able to detect when it is overfitting during training. At least I would expect this from a new and especially user friendly program. Most of its users won't even know about special machine learning problems like overfitting.

    53. Re:Awww...c'mon guys.... by woobieman29 · · Score: 1
      Your views of salespeople are a bit narrow, I'm afraid.

      Yes, there are a number of asshats out there selling technical solutions, just about as many as there are slapping out crappy code and hardware in technical positions. I have had the displeasure of working with both sets of idiots, as I started my career in engineering and moved into a technical sales-support position (Sales Engineer) later.

      Your view is by no means unique amongst engineers, but it is still off-target. I have had the pleasure of working with incredibly talented sales and marketing people that perform an invaluable function, and make the rest of the company work that much better. It may be hard to see the difficulty of what a true sales professional adds to a company when you are focused on technical matters, but it may be an eye opening experience for you to tag along with a sales rep or sales engineer for a while sometime to see some of the shit they have to wade through in order to make a deal happen.

      In short, yes - there are some idiots in sales that have been able to make a living through lying, cheating etc and those people make it difficult to be successful in a company. The idea that all sales professionals are like that however is incorrect.

      Honestly, if the sales reps in your workplace are so bad, perhaps you should seek out a company that values hiring true professionals? Or even better - step up and give it a try yourself and show us all how it's done. Maybe you wouldn't want to give up your solid, predictable salary for the "high life" of the sales rep that involves performance-based commissions on top of a tiny base salary?

      BTW, I realize that this post has a high probability of being modded down into oblivion by the engineer-heavy slashdot crowd, but I'll take my chances. I just get a bit weary of seeing my entire profession constantly painted with this same inappropriate brush.

      --
      \/\/oobie
    54. Re:Awww...c'mon guys.... by Shanep · · Score: 1

      It's just a one-time thing.

      In the mid 90's I was at an Apple Newton product demo in Sydney, Australia.

      The Apple guy wrote something to the effect of: "Hello, my name is George."
      and the Newton converted that to the text: "Hello, any name is failure."

      My stomach muscles were killing me.

      The Newton came good though and I loved my MP 120.

      Can MS fix something like this or will they need to buy out the Dragon Dictate people? Naturally, Microsoft will hardwire this sofware to some very dangerous tools, because they are innovative and strive to provide their customers with options. Saying in notepad "The foremen see", will of course wipe your whole harddrive and "kitty lawn fun" will get you raped in prison.

      --
      War crimes, torture, lies, illegal spying... Would someone give Bush a blowjob, already, so he can be impeached?
    55. Re:Awww...c'mon guys.... by Homer's+Donuts · · Score: 1
      Enunciating clearly with short pauses between each word generally helps quite a lot, just as it does when talking to foreigners with limited fluency in a language.
      Yeah, that and just TALKING LOUDER!!!
    56. Re:Awww...c'mon guys.... by reanjr · · Score: 1

      Actually, some accents in America pronounce the two words fairly similarly. Aunt can be pronounces like the word "on" with a "t" at the end. The "t" could easily be lost in an echoing room, leaving it to differentiate between deermom and deeron. In addition, many people (especially among those who pronounce aunt as "ont") pronounce mom without the ending "m". So now, we're trying to get deermom to be interpreted as deermah, when an alternative is deeron. It's actually very likely that this was a voice training issue.

    57. Re:Awww...c'mon guys.... by reanjr · · Score: 1

      I am not sure about Dragon (last time I used it, I think it was worse than this demo), but I believe MS's works out of the box but still trains to your voice over time. This false sense of security in not requiring training, and then the software being trained by someone else inadvertently, could have something to do with it as well. I really find it hard to believ that the software legitimately would misconstrue aunt for mom under actual real-world conditions.

    58. Re:Awww...c'mon guys.... by WilliamSChips · · Score: 1

      KAYLEE: I'm not saying the Windows XP's hard to repair, it just ain't worth it.
      OLDER FARMER:It's a fine OS, keep it tuned--
      KAYLEE: ( No way ). [Tsai boo shr] Programs need Administrative access to run.
      MURPHY (re: older farmer): We've been tellin' him buy a Windows Vista for years!
      KAYLEE (overlapping Murphy): Spyware gets admin access, it don't matter how good the programs're runnin'.
      KAYLEE (to Murphy): By the way, the Windows Vista's the same OS. They changed the GUI, hoped no one'd notice!

      --
      Please, for the good of Humanity, vote Obama.
    59. Re:Awww...c'mon guys.... by ericbg05 · · Score: 1
      Aw, c'mon; how many English dialects pronounce "mom" and "aunt" similarly?

      Don't count dialects; count individuals. Determining what a "dialect" even is will get you into theoretical and field-method trouble relatively quickly.

      To answer your question: lots of them, at least in America. Many (if not most) Americans pronounce mom as [ma:m] and aunt as [a:nt'?]. The latter is pronounced in isolation with a word-final unreleased voiceless alveolar stop and a simultaneous glottal stop, a cluster which is hardly audible.

      And [m] and [n], both voiced front nasals, are quite similar as well -- even very decent DSP living behind a very decent microphone can have trouble distinguishing between the waveforms produced thereby. Have you ever tried to spell something with an "m" or "n" to someone over the phone? Even a human living behind a crappy microphone has trouble distinguishing [m] and [n].

      Certainly the midwestern, Kansas-type dialects have [a:nt'?]; and I would imagine, since those dialects form the basis of the American prestige dialect, that this is the prototypical pronunciation of the word for the purposes of the software.

      So I'd say in the universe of English lexemes, these two are pretty damn similar. Of course, you didn't define "similarly", so I guess there's not a whole lot to measure or argue about here.

    60. Re:Awww...c'mon guys.... by Weedlekin · · Score: 1

      Or, as often happens to disabled people, talk to whoever happens to be standing near the computer instead.

      --
      I'm not going to change your sheets again, Mr. Hastings.
    61. Re:Awww...c'mon guys.... by Atario · · Score: 2, Informative

      There were other problems that simply can't be put down to actual recognition problems; it clearly understood perfectly the pronunciation of "delete select all", yet didn't act on any of that as a command.

      --
      "A great democracy must be progressive or it will soon cease to be a great democracy." --Theodore Roosevelt
    62. Re:Awww...c'mon guys.... by belarm314 · · Score: 1

      Well, here goes some karma...

      I agree with you that many sales people are actually decent individuals who either know the products they sell or are willing to admit that they don't (the latter earns more points with me). One who stands out in my memory was the sales rep for HP who served the [Big box office supply store] I used to work at. The guy really did know his stuff, and would freely admit the limitations of the products he was bringing in, as he didn't want to see us sell "grandma" photo printers to amateur photographers, or anything of that sort.

      The problem being, of course, that this individual stands out. This should be commonplace, just as programmers who do bounds-checking should be. Reality differs from theory, and it's frustrating, but not surprising.

      Since humans have a tendancy to over-emphasize the bad, and tech guys do end up having to deal with clean-up from these guys, sales reps, marketters, and managers get used as punching bags in tech-dominated areas. It's gotten to the point (the comments on this story illustrate this vividly) where picking on these groups has entered the arena of dick and fart jokes in more general circles. You should either chuckle or groan at them, and then move on. If it gets under your skin that badly (which is understandable), slashdot simply may not be the place to get news...dem's da breaks, as they say.

      --
      When moderating, assume I have not yet had my coffee.
    63. Re:Awww...c'mon guys.... by CoolVibe · · Score: 1
      ... For example: "Your virus scanner can't run at real-time priority; the end user might have other things running!"
      You are being sarcastic, aren't you? ;)
    64. Re:Awww...c'mon guys.... by im_thatoneguy · · Score: 1

      Holy shit Tom, did some sales guy steal your wife? You must have set the longest running sales bashing marathon I've ever seen online.

    65. Re:Awww...c'mon guys.... by Schraegstrichpunkt · · Score: 1
      Aw, c'mon; how many English dialects pronounce "mom" and "aunt" similarly?

      At least one. A friend of mine from Nova Scotia pronounces "mom" and "aunt" with the same vowel sound. He pronounces "aunt" like "on" with a "t" at the end.

    66. Re:Awww...c'mon guys.... by Schraegstrichpunkt · · Score: 1
      You know, a lot of us asshat sales people think that you technical people are a bunch of basement dwelling, Star Wars quoting, self-rightous, introverted, bad-heigene virgins who scoff at any question and make no attempt to educate your users because in your mind there is no possible way they could ever understand the complicated intricities of a computer.

      We know. That's what makes you an asshat salesman.

    67. Re:Awww...c'mon guys.... by adavidw · · Score: 1

      What's a ragaholic? Someone who just can't stay away from ragahol?

      No, seriously, what is it?

    68. Re:Awww...c'mon guys.... by Tim+Browse · · Score: 1

      The latter is pronounced in isolation with a word-final unreleased voiceless alveolar stop and a simultaneous glottal stop

      Goddamn preverts.

    69. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      Yeah so that doesn't really speak for the program if it's so obviously wrong in slighlty different conditions... Anyway why forget that they probably used a mic that doesn't pick up ambient noises like they usually do at conferences? Who cares anyway?

    70. Re:Awww...c'mon guys.... by Cro+Magnon · · Score: 1

      I once read about someone who was dictating with speech software, and two of his co-workers came in. He said "Hi, Nick and Ben", and the software wrote "Hi, naked men".

      --
      Slow down, cowboy! It has been 4 hours since you last posted. You must wait another few hours.
    71. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      Don't know which corporations you have worked for, but that's not always the case. I work for a large software company and know that at least in our country (we have offices worldwide) the technical presentations / demonstrations are always given by pre-sales engineers or technical consultants who actually *do* understand the technical side of things. Sales people may hype things up with their Powerpoint presentations as much as they like, but the tech demos are given by tech pros.

    72. Re:Awww...c'mon guys.... by woobieman29 · · Score: 1
      Thanks for the post - your points are very accurate, and articulate.

      It does bother me once in a while when an outbreak of "slam the sales guy" pops up here, but it isn't anywhere near the level of making me want to move away from Slashdot. Reading back through my original post I guess I did come across a bit snippy, but my intention was really more to point out that true technical sales professionals exist, and add huge value to their organizations.

      On your very valid point about technical folks having to clean-up after bad sales activity; Yes, this is common among poor sales people. Sales Engineers like myself know this better than anyone, as we are the first line of defense against a salesperson's inaccurate technical expecations and promises to customers. When (not if) you get into a situation like this, there are two ways to handle it:

      1) Explain the issue to the sales rep, help them to find the best way to rectify this with the customer, and make sure that if the sales rep repeats the same incorrect action again that proper action is taken with management. or,

      2) Piss and moan with your coworkers about what idiots sales reps are.

      One of these works, the other doesn't.

      --
      \/\/oobie
    73. Re:Awww...c'mon guys.... by Anonymous Coward · · Score: 0

      "...how many English dialects pronounce "mom" and "aunt" similarly?"

      Ever been to Arkansas? :)

    74. Re:Awww...c'mon guys.... by crazed+gremlin · · Score: 1
  4. Hee hee by kefoo · · Score: 4, Funny

    Reminds me of the time when I worked at a computer store and we played with the voice recognition card in a PowerMac floor model. Somebody programmed it so that if someone said "Computer, bite me", it would respond with "Can't bite what's not there". Over time the accuracy of the recognition fell. One day as a salesman was talking to a customer about the computer it misinterpreted something he said and said "Can't bite what's not there". Needless to say that system was wiped and we weren't allowed to play with it anymore.

    1. Re:Hee hee by Elektroschock · · Score: 2, Interesting

      In fact voice recognition would be a great playground for non-profit open source software projects.

      Voice recognition means permanent beta. Voice recognition only slightly improved during the last ten years. One reason is that the VR market it a trivial patent minefield. The rest is just performance.

      Sure, we will get proper voice recognition some day. I would source it out to open source and integrate it back into my products once it will be ready.

    2. Re:Hee hee by marcello_dl · · Score: 2, Insightful

      Yes, we'll get good voice recognition one day. It'll be right after 99% of the world population have mastered mouse and keyboard interfaces.

      --
      ---- MISSING MISCELLANEOUS DATA SEGMENT --- [sigdash] trolololol
    3. Re:Hee hee by Shanep · · Score: 1

      Over time the accuracy of the recognition fell.

      I tried to tell a friend of mine years ago that the problem with a lot of voice recognition software (which I had found), was the more you teach it, the worse it gets, because it has a larger database to get confused with which just keep growing and thus keeps getting worse. He didn't beleive it. Strangely, the evidence which also showed this did not seem to convince him either.

      C?
      See?
      Sea?
      See men?
      Seamen?
      Semen?

      ; )

      --
      War crimes, torture, lies, illegal spying... Would someone give Bush a blowjob, already, so he can be impeached?
  5. No no no no no. by Anonymous Coward · · Score: 0

    It's not a bug. It's a feature. It reads your mind and finds what you are TRYING to say. In this case, he wanted to write a letter to his aunt about setting bombs so as to double the amount of serial killers at large (currently two) and then remove the Select All command from all programs ever made, and so it outputted "Dear Aunt, let's set so double the killer delete select all".

    1. Re:No no no no no. by Anonymous Coward · · Score: 0

      Dear slashdot, let's so establish the -1 "so unfunny it hurts" mod.

  6. Well by antifoidulus · · Score: 3, Funny

    it could lead to surprising porn....

    1. Re:Well by acariquara · · Score: 5, Funny

      ...or a new slashdot signature.

      --
      Dear aunt, let's set so double the killer delete select all
  7. I agree by Catastrophator · · Score: 1

    Let's set so double the killer! I guess all that left to do is point and go "Haa Haa!"

    1. Re:I agree by Brad+Mace · · Score: 1

      Let's set so double the killer! I guess all that left to do is point and go "Haa Haa!"

      I see you're also using the trial version

    2. Re:I agree by Tsiangkun · · Score: 1

      and develop algorithms to determine if an email is spam,
      or just written by someone using microsofts latest speach recognition technology.

  8. The Voice of Experience by dacap · · Score: 5, Insightful

    Yes, once again Microsoft S/W Engineers learn that the more public the demo or the more important the audience, the more likely some will go wrong. It's one of Murphy's laws. Been there. Did that. Barely survived.

    Experience is the human quality that enables you to recognize a mistake immediately when you make it again.

    Dacap

    --
    English -- gotta love it! / The engineers refuse to refuse the rocket until the refuse is removed from the launch pad.
    1. Re:The Voice of Experience by son+of+remo+williams · · Score: 2, Interesting

      My gf's brother works for the MS subsidiary that does network set up and tech support for MS trade shows. He's personally wired Gates and Ballmer and has admitted that many "live" demo's the head honchos have presented were actually canned like Ashlee Simpson lip synching on SNL. They don't trust their own products enough to put their execs in the same embarrassing position that this presenter got himself into.

    2. Re:The Voice of Experience by Anonymous Coward · · Score: 0

      Yes, once again Microsoft S/W Engineers learn that the more public the demo or the more important the audience, the more likely some will go wrong. It's one of Murphy's laws.

      I read your comment aloud into Vista's Speech Reckognition, it spat out:

      "Murphy, you are a elf. You are uncontrollably I think-- me yo. Me he yo ho mureheyo."

  9. Is SR ever going to be good enough? by TommyBear · · Score: 1

    There is Dragon Naturally speaking 9, which apparently is pretty good, but will SR ever really be the Star Trek kind? You can't really expect to use SR in a noisey environment. Has anyone here ever worked on a SR application?

    1. Re:Is SR ever going to be good enough? by joe+155 · · Score: 4, Informative

      I have used Naturally Speaking, it can take a bit of time to train it, but if only you use it then you can eventually get it to the point where you can talk at a normal speed (although it has to be clear) and it will get to approaching 90% accuracy, sometimes I had it higher. The point was that it couldn't be used as an alternative to typing for extended periods though because you had to check everything it wrote.

      One thing it did do which was good though is tried to understand sections of speech, rather than just each word, which did improve accuracy. Words often follow patters and there are few words that make sense after a word, so it was often right with "over there".

      SR tech will eventually be as good as on star trek as long as people work on it. I would give it 20 years if it is seen as something which could make a lot of money, 40 if you have to wait for interested people to do it for free on their own time

      --
      *''I can't believe it's not a hyperlink.''
    2. Re:Is SR ever going to be good enough? by Ougarou · · Score: 2, Interesting

      Well, I've taken a look at that (a while back): Dragon seems to be the leader, they get (with a month of traning) te best accurracy.
      However, sound recognition engineers are slowly realizing that the problem of recognising words is not just the algorithm's fault. Even people arn't able to understand all words from a taped conversation in a cafeteria.

      Dragon is currently the best, getting further will probably require more input, like a webcam to read your lips. This is just another Microsoft product where they read the wikipedia page on it, produced a flashy interface and packaged it with their OS. If you want sound recognition, don't go with Microsoft, they don't have the expertise.

      PS: Don't forget, that getting a good or even special microfoon can make all the difference.

    3. Re:Is SR ever going to be good enough? by CastrTroy · · Score: 3, Insightful

      Who cares if they ever get up to star trek level. The technology still sucks. It's much quicker and less annoying to the people around you to just type on your keyboard. Sure it has some uses such as those who don't have full use of their hands. We shouldn't abandon all research on the subject, becuase it does have it's uses, but I don't think it's something worth pushing on the general population, especially before the technology is actually ready. People already don't like their computers, pushing buggy technology like this out will just increase the problem.

      --

      Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
    4. Re:Is SR ever going to be good enough? by Skater · · Score: 4, Insightful

      The computer in Star Trek (at least in the Next Generation) was WAY too smart. For it to do what it supposedly did in the show, it would have to be sitting there, monitoring the conversation all the time, and be totally able to understand the context of what was being said to know what to do. Not only when people directly asked the computer a question, but also when people wanted to converse with someone.

      For example, how does the computer know that Picard wants to call Riker and isn't just talking about him? Oh and keep in mind the computer never misinterpreted something. In other examples, people would carry on intelligent conversations with the computer - all those holodeck scenes, Troi ordering chocolate, etc.

      Star Trek-style of SR I think would be the holy grail and is probably always going to be out of reach. Barring some amazing breakthrough in AI algorithms, the computer power required just for the situations above would be incredible - and that's computer time that probably could be put to better use elsewhere, even if it was found to be possible.

      I think the computer in the original Star Trek was more realistic - but even there the voice-recognition was far beyond what we're capable of today, as Microsoft has demonstrated so well. Plus all the blinkenlights that seemed to have no useful purpose were cool. ;)

    5. Re:Is SR ever going to be good enough? by tomstdenis · · Score: 1

      People who can't type [e.g. immobile, or totally lacking hands] may think otherwise.

      The problem though, this is YET ANOTHER THING that should not be bundled with Windows. It should be sold as a third-party add-on. Even if MSFT produces it, it shouldn't be part of Windows unless a customer CHOOSES to buy and use it.

      MSFT obviously has learned shit all from their anti-trust LOSS, and to them they can do whatever the hell they want.

      Frankly, I look forward to the day where MSFT is just a footnote in a history text. It may not happen within the next decade but I see enough chips falling off the bastard.

      Tom

      Footnote: Microsoft was a monopolistic, backwards company that started the PC revolution. As a result of their shady business practices and poor quality control, Microsoft was solely responsible for the mass adoption of open source software. Ironically, the company that took choice away from the people, ensured that the people would still have choice. Today you can see the remains of Microsoft on display at the Smithsonian.

      --
      Someday, I'll have a real sig.
    6. Re:Is SR ever going to be good enough? by Anonymous Coward · · Score: 0

      What the hell is a microfoon? Is is possibly a fork-spoon that is incredibly small? Perhaps you make calls with a foon. Perhaps you're using Vista's voice recognition technology to formulate your post. Perhaps you're mouthing your words as you read this.

    7. Re:Is SR ever going to be good enough? by neoform · · Score: 1

      You don't think that we'll have more advanced technologies in.... 300 years?

      --
      MABASPLOOM!
    8. Re:Is SR ever going to be good enough? by wkitchen · · Score: 2, Insightful
      There is Dragon Naturally speaking 9, which apparently is pretty good, but will SR ever really be the Star Trek kind?
      Probably. But it will have to get much better at using context. They're already using grammar as a cue, but it's going to take much more than that. Humans draw on memories of previous conversations, knowlege about the interests and mannerisms of the person speaking, and knowlege of the situation at hand. Even just knowing what's big in the news can help.

      As for ambient noise, there's often useful contextual information there too. Ambient noise can give information about where the speech is occurring and about what is happening at that location. In some rare cases the ambient noise might even be responsive to the speech itself. The audience laughing in the example was a clue that a) an audience is watching, and b) the system made a mistake. A human would have recognized that and used it to advantage. For a speech recognition system to work as well as a human, it will not only have to get better at separating speech from ambient noise, but it will need to be able to recognize the ambient noise.
    9. Re:Is SR ever going to be good enough? by TheRaven64 · · Score: 1
      it will get to approaching 90% accuracy

      90% accuracy is nowhere near enough for voice recognition in a dictation context. 90% accuracy means one word in every ten will be wrong. In this post, so far, that works out to more than one erroneous word per sentence. Factoring in the time taken to correct all of these, it is much slower than typing; especially if you need to interrupt flow to do the corrections.

      --
      I am TheRaven on Soylent News
    10. Re:Is SR ever going to be good enough? by Anonymous Coward · · Score: 0

      You know, I'm into computers in a big way and I read slashdot, so sometimes I worry that I'm too much of a nerd. But then dudes like you post stuff like that, and I realise I'm actually pretty well balanced. Cheers!

    11. Re:Is SR ever going to be good enough? by wonkobeeblebrox · · Score: 1

      I'm faily good with computer programming....
      BS from Carnegie-Mellon in Physics and CS and a Master's degree on top of that.

      Give me until the year 2400 and I bet I'd get that "Star Trek NG" computer for ya.

      Of course, someone needs to develop an "eternal-life" pill first...

    12. Re:Is SR ever going to be good enough? by dosius · · Score: 1

      And yet hasn't OS/2 come with this functionality for years? Once again M$ is just playing catchup to IBM.

      -uso.

      --
      What you hear in the ear, preach from the rooftop Matthew 10.27b
    13. Re:Is SR ever going to be good enough? by AndroidCat · · Score: 1

      Once, for laughs, I tried feeding a CBC Radio newscast into MS SAPI 5.1's dictation and it didn't do too badly. (For odd values of too.) On the one hand, the radio was picking up computer RFI, on the other, announcers (on CBC radio) are hired for their speaking voices and enunciation. The gripping hand is that MS hasn't changed their voice technology since 1998 and I hope the Vista stuff is an improvement.

      --
      One line blog. I hear that they're called Twitters now.
    14. Re:Is SR ever going to be good enough? by dingen · · Score: 1

      Bah, Star Trek, who needs that passive bloated stuff where you have to say "Computer," before the computer actually responds to your request to have some coffee replicated, or press a button on your chest to get someone on the phone.

      What we should really be after, is the kind of SR in the Knight's Industries TwoThousand. Yes, K.I.T.T., now there's a computer you can have a nice conversation with. And can pick out some nice '80ies tunes for you as well, while you're at it.

      --
      Pretty good is actually pretty bad.
    15. Re:Is SR ever going to be good enough? by Shihar · · Score: 1

      I think you over estimate the challenge of the problem. Good SR is indeed hard, I will give you that. That said, I expect in 300+ years when Star Trek is set, our AI will beat the piss out of Star Trek AI. Hell, the computer has been around for only a little over 50 years. A little over 100 years ago we had just first discovered electricity and flight.

      If anything, Star Trek is a massive under estimate as to our technological prowess in 300 year. In 300 years I do not expect humans to be running around in massive ships that take hundreds of people to crew because computers are incapable of doing it. If anything will be floating around in space, it will be ships run purely by strong AI doing whatever it is strong AI would want to do in space.

      As far as us feeble humans, I imagine that in 300 years we are going to try our hand at living like near gods being served by strong AI, will have merged with strong AI and become something entirely different then what we are today, or will have been stomped out by strong AI.

    16. Re:Is SR ever going to be good enough? by erotic+piebald · · Score: 1
      It's much quicker and less annoying to the people around you to just type on your keyboard.
      Hmmm, I have to disagree with the first part: "much quicker" and totally agree with the second part.

      In many office situations that I've observed it's already annoying and distracting with folks YELLING into their cell phones. Having them all talking to their computers will be insane (why do people do that with a cell phone but (usually) not with a land line? I'm guessing sound quality.)

      At any rate, most people speak at over 100 words a minute (my teenage daughters: 200 wpm), even a really good typist does, what? 40, 50 wpm?
    17. Re:Is SR ever going to be good enough? by NormalVisual · · Score: 4, Funny

      all those holodeck scenes, Troi ordering chocolate, etc.

      Hmmm - holodeck, Troi, chocolate.....the combination of those three items is something that gives one pause to ponder.

      Um, I'll be back in a little bit.

      --
      Please stand clear of the doors, por favor mantenganse alejado de las puertas
    18. Re:Is SR ever going to be good enough? by Wdomburg · · Score: 1

      So Apple should be allowed to bundle voice recognition (not to mention handwriting recognition, calendering, mail, chat, photo management, video authoring, web development, and audio editing) but Microsoft shouldn't?

      Linux distributions are even more aggressive in the application bundling (though there's a conspicuous lack of decent voice recognition).

    19. Re:Is SR ever going to be good enough? by jc42 · · Score: 3, Interesting

      Footnote: Microsoft was a monopolistic, backwards company that started the PC revolution.

      They don't deserve credit for starting the "PC revolution". The credit properly belongs to the hundreds of little startups and hobbyists, the whole CP/M crowd and others like Amiga. Microsoft was a subcontractor to a giant monopoly (IBM) that stepped in after the little guys demoed there was a market, and took over that market. They succeeded mostly because of a marketing budget greater than the budgets of all the little companies combined.

      And there's a good argument that, by marketing PC/DOS rather than CP/M, they set back the PC revolution by 5 to 10 years, the time it took for PC/DOS to match the capabilities of CP/M when IBM started their PC marketing campaign.

      Sorry; that's the way "the Market" works in the computer field. Small, independent developers make something new and start selling it; the big companies then step in and take over the market through traditional monopoly strategies.

      It's likely that we're now going to hear people crediting Microsoft for starting the "voice recognition" revolution by inventing the new idea that computers can understand speech. Marketing can redefine history like that.

      (Whereas we computer geeks know that Al Gore invented speech recognition. ;-)

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    20. Re:Is SR ever going to be good enough? by Anonymous Coward · · Score: 0

      Nice. "gripping hand" usage really needs to work its way into everyday speech.

    21. Re:Is SR ever going to be good enough? by tomstdenis · · Score: 1

      I don't get your reply. I never said MacOS was in the clear. I don't harp on MacOS because I'm not personally forced to use it. I'm stuck using Windows for work [*] and it drives me stir crazy.

      Linux distributions are free to install and the software contained in them is portable. I can use Gnome in the BSD or Linux distributions with free will. I'm not stuck to using OpenOffice [for instance] ONLY in FreeBSD. Furthermore, with the good distributions I can remove and change competing programs. Don't like Gnome? Put KDE on, or IceWM, or WindowMaker or ..., Don't like bash? use tcsh or ksh or ..., don't like mplayer, use totem or xine or ..., etc.

      Try telling Windows that you dislike IE and want all traces of it removed from the box. Try buying a Windows install that didn't bundle IE or WMP. Try asking for a discount because you don't plan on using them.

      I'm sorry, but your examples are just weak. You are far more capable of making decisions in the OSS world than in Windows.

      Tom

      [*] To those that reply "oh you don't have to work there", well just send me rent and food cheques and I'll agree with you.

      --
      Someday, I'll have a real sig.
    22. Re:Is SR ever going to be good enough? by jc42 · · Score: 1

      If history is any guide, in 300 years, we'll still be reading predictions that accurate speech recognition is 10-20 years away.

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    23. Re:Is SR ever going to be good enough? by mobby_6kl · · Score: 1

      Oh come on. Apple bundles all their shit, including speech recognition, with MacOS. And don't try this bullshit about how they're not a monopoly: they are, on apple hardware. Even the commercial Linux distributions often come with several DVDs full of useless shit (no SR, but that's because the OS movement can't produce a solution).

    24. Re:Is SR ever going to be good enough? by Anonymous Coward · · Score: 1

      Or perhaps you are a chauvinistic idiot who can't deduct from the ".nl" country code of his web site that he is not a native English speaker?

    25. Re:Is SR ever going to be good enough? by tomstdenis · · Score: 1

      I already replied to this earlier in the thread [you posted essentially what someone else did...]

      Anyways, I never said MacOS wasn't. Stop putting words into my posts. I said Microsoft was.

      As for the OS world, they're not a monopoly because you can still choose between them. Sure distros like those from Redhat and Suse can try some lock-in tricks (Redhat is most famous for renaming symbols in the C and C++ libs) but the tools themselves are almost always platform agnostic.

      Don't like Redhat? Try Gentoo. It gives you complete control over what applications go on our box. None of this "Redhat is a Gnome shop" bullshit.

      Tom

      --
      Someday, I'll have a real sig.
    26. Re:Is SR ever going to be good enough? by asuffield · · Score: 1
      So Apple should be allowed to bundle voice recognition (not to mention handwriting recognition, calendering, mail, chat, photo management, video authoring, web development, and audio editing) but Microsoft shouldn't?


      Correct, that is what antitrust law is all about. If you don't like it, feel free to campaign for antitrust law to be revoked. I'm sure a lot of megacorps would like that.
    27. Re:Is SR ever going to be good enough? by jc42 · · Score: 3, Interesting

      I expect in 300+ years when Star Trek is set, our AI will beat the piss out of Star Trek AI. Hell, the computer has been around for only a little over 50 years. A little over 100 years ago we had just first discovered electricity and flight.

      Well, maybe. But we invented microscopes around 300 years ago, and discovered microorganisms immediately thereafter. The understanding that some bacteria were involved in diseases followed quickly. But it was nearly 300 years before we successfully eradicated a disease (smallpox). Today, we're still battling new diseases, and we don't have anything like a general solution to all diseases. We have a few antibiotics that effect more than one disease, but we haven't made much progress in solving the problem of the development of resistance to our antibiotics. Hell, we can't even convince the general public that it's the evolutionary process at work here, and we've understood that for around 150 years.

      I wouldn't predict any general solution to a complex problem like voice recognition in a mere 300 years. Maybe we will. But our history of general solutions to other complex biological problems is not encouraging. Neither is the history of our first 50 years of AI, despite the constant hype and Hollywood movies claiming that AI is just around the corner.

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    28. Re:Is SR ever going to be good enough? by giorgosts · · Score: 1

      The problem with these technologies can lie on the human side also. Can you speak error-free, like on the 9 o'clock news? How can a machine hear one thing and reason that it must be another?

    29. Re:Is SR ever going to be good enough? by asuffield · · Score: 1

      Voice recognition is not a viable dictation method.

      Yes, I really mean that. Typing is faster than talking, for anybody who's any good at it. And not by a little - by a lot. If you want to get a document into a computer quickly, use a keyboard.

      There are two practical applications for voice recognition that currently exist in the real world. The first is scenarios where it's not practical to reach a keyboard. Home automation, portable devices, stuff like that. Accuracy is important here, but it's also easy - there will be a small set of allowed phrases. That's much easier than full-English input. The second is probably the most important one - input for people who cannot use a keyboard, due to some form of disability. Accuracy here is considered a luxury. Nice to have but not really necessary.

      So, it's not really a big deal. But you'll note that nothing here suggests that non-disabled people will have any use for voice input to Windows.

      Realistically, this is the new minesweeper for Vista. It's a toy. Each new version of Windows has a few of these, because these toys are what sell non-OEM copies of the thing. People who go out and buy the new upgrade in a store, for however many thousands of dollars, are mostly doing it to get the new toys. Microsoft is aware of this and puts a lot of effort into making each successive version have new shiny toys; the rest is largely unimportant.

    30. Re:Is SR ever going to be good enough? by asuffield · · Score: 1
      At any rate, most people speak at over 100 words a minute (my teenage daughters: 200 wpm), even a really good typist does, what? 40, 50 wpm?


      Only in the conversational mode, with its sloppy, incomplete grammar and total absence of punctuation. Spoken English is not like written English. Speaking the written form is very slow. You either need a secretary to interpret, or you have to think ahead and plan your sentences. Typing's faster, largely because it offloads work from the part of your brain that handles speech to the part that handles motor control.
    31. Re:Is SR ever going to be good enough? by westlake · · Score: 2, Insightful
      For example, how does the computer know that Picard wants to call Riker and isn't just talking about him? Oh and keep in mind the computer never misinterpreted something. In other examples, people would carry on intelligent conversations with the computer - all those holodeck scenes, Troi ordering chocolate, etc.

      The fleet's computers have "known" Picard since he entered the service. They should be pretty well trained .

      The communicator badges in TNG could be transmitting supplementary biometric data and non-verbal commands, which by now have become almost automatic: "Watson. I need you!"

    32. Re:Is SR ever going to be good enough? by Yvan256 · · Score: 2, Insightful
      The communicator badges in TNG could be transmitting supplementary biometric data and non-verbal commands, which by now have become almost automatic: "Watson. I need you!"


      The badge also indicates the location of the person. So if Picard says "Will" (or "number one", which is simply an alias that Picard made for "Riker, William T.") and the computer sees that Will isn't in the same room as Picard (or isn't within normal hearing distance), it simply connects the two via a communication channel.
    33. Re:Is SR ever going to be good enough? by DarkOx · · Score: 2, Interesting

      No 90% is no place near good enough for dictation but it sure might be good enough for some applications.

      Think "computer lights", if it gets it wrong you just try again. All those media PC would be good candidates as well. If I say "change to channel six" and thing swiches to sixty 1/10th of the time well I could repeat myself that often in that application anyway; and still be pretty satisfied.

      --
      Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html
    34. Re:Is SR ever going to be good enough? by SleepyHappyDoc · · Score: 2, Insightful

      90% accuracy is nowhere near enough for voice recognition in a dictation context

      Depends on your own context...I deployed (admittedly, an older version) of Dragon NaturallySpeaking in an office full of mobility-impaired employees. They found it much easier to spend 10% of their writing time fixing errors than 100% of it trying to, for example, type with the onscreen keyboard. If you can't use a keyboard, even crappy voice recognition is a godsend.

      --
      Stasis is death. Embrace change.
    35. Re:Is SR ever going to be good enough? by colmore · · Score: 1

      What century is Star Trek set in? The 25th? (Deep Space 9 was the only series I liked at all, but I've been told I haven't seen the "right" episodes of TNG)

      Given what computers were like just 50 years ago, I don't think conversational fluency in spoken language (and notice that nobody in Star Trek ever mumbles) is too far fetched to imagine in several hundred years. It's not like they've solved Go or anything else actually impossible.

      --
      In Capitalist America, bank robs you!
    36. Re:Is SR ever going to be good enough? by tomhudson · · Score: 1

      Fair enough, but it wasn't Microsoft who started the PC revolution. There were a plethora of different boxes well before the IBM PC. They were all Personal Computers in the real sense ... since there weren't too many programs out there, they'd only run what you wrote :-)

      Those were fun days. Today's boxes are a million times more capable, but not as "interesting".

    37. Re:Is SR ever going to be good enough? by Skater · · Score: 1

      I think we'll have more advanced technologies, sure. I just think this one technology won't come to pass. Human speech is far too varied, with far too many idiomatic expressions (well, in English at least), that a computer will never be able to figure it out.

      For example, for a computer to grasp sarcasm, it'll have to understand emotions and rolled eyes and other visual cues. And even the bleeding-edge Star Trek android didn't grasp emotions (well, the one that did was insane). Yet there are plenty of people out there that don't get sarcasm now - and they have emotions.

      How many times do even people misinterpret what is said? It happens all the time. Yet a computer is somehow going to do better than that? Also, people on Star Trek always talked perfectly (as another poster pointed out) - but in real life people stutter, mispronounce words, use incorrect words, get interrupted, toss in some "ums" and "ahs", etc.

      I really think that people are underestimating the magnitude of the problem. It's not just speech recognition - it's emotion recognition and situational awareness as well. Sure computers will get more powerful, but to pull of Star Trek-style speech recognition, we'll need some damn impressive algorithms for them, and they'll probably require top of the line computers even then.

    38. Re:Is SR ever going to be good enough? by Comatosis · · Score: 1

      Patters of little feet! You need a voice tune up :).

      --
      When expecting to find intelligence in a person, do not look at their age but instead look at their IQ and maturity firs
    39. Re:Is SR ever going to be good enough? by Anonymous Coward · · Score: 0

      "Pretty well balanced" includes sniping at people that put thought into what they say while having nothing useful to contribute yourself?

    40. Re:Is SR ever going to be good enough? by Anonymous Coward · · Score: 0

      Everyone does this from time to time, especially on slashdot. So yep, still pretty well balanced.

    41. Re:Is SR ever going to be good enough? by Wdomburg · · Score: 1

      You can install all the competitive software you want on Windows or MacOS or Linux rather than use what's bundled. And bundling has the same stifling effect on competetion in all three cases - third part vendors of CD burning software, for example, are going to have an uphill battle on all three platforms. Likewise with mail clients, video players, et cetera, et cetera.

      And yes, on Linux you can replace whatever the default GUI for your distribution is. You know what? On Windows you can install Litestep or LDE or Talisman or any of a half dozen others to replace Explorer. There have even been proof of concept ports of KDE and Gnome to Windows.

      Of course, they may not get a lot of traction since most people who don't want to run Windows... don't run Windows. Funny that.

      And while you can technically remove every trace of some components from a Linux distribution, try removing the Gecko engine from Gnome or KHTML from KDE or WebCore from MacOS/X. Like it or not, HTML rendering is a core feature of a desktop environment. I suppose we could always go back to the good old days where everyone reimplemented basic functionality themselves or people dealt with running applications linked against a variety of slightly incompatible third-party libraries which may or may not get any active maintainance.

      (For the record, I don't particularly like Windows myself. The only reason I have a Windows partition available at all is for access to specific applications I need professionally.)

    42. Re:Is SR ever going to be good enough? by tomstdenis · · Score: 1

      You're confusing choosing a bundle over being forced to use a bundle.

      For instance, suppose you're an author, and your publisher demands you have to use Word. (hmm sounds familiar...). Now suppose you agree and go to use Word. Oh wait, Word only works in Windows. Not because technically it will only work there but because MSFT stiffles it to only work there (hint: if OpenOffice can work anywhere why can't Office?). Ok fine, so you buy a copy of windows for hundreds of dollars. Oh but wait, it comes with IE and WMP and a dozen other dumb applications you don't want or need. So now you're paying MSFT to support the IE, WMP, etc teams for things you don't actually want or need. In fact you didn't even want Windows in the first place.

      The problem here is MSFT is using its otherwise independent shops to prop up the sales of Windows to lock people into a given platform. For "MS Office" to be truly competitive and offer the customers the best choices it should really be ported to Linux and MacOS [timely]. That gives us, the customers, the ability to choose tools in a free market.

      Then you say, oh but you don't need to use Windows. Just use OSS. Except that MSFT locks in the file formats and APIs of its tools. They don't document the file formats used by programs YOU BOUGHT. So now if you want to compete with company A you have to use what they use, in this case Word or Excel or PP. Company A wants to work with Company B, who uses Office. so A moves to Office too. B wants to work with C, so B moved to Office forcing A to move to office forcing you to move to office, etc... It's totally viral and it's totally intentional.

      MSFT is not popular because it's technically superior or just the better choice. It's popular because they have done everything in their power to lock you into their platform. secret APIs and file formats are just two of the tricks they use. They also make illegal deals with companies to bundle Windows with their hardware whether you like it or not. While in recent years it has been possible to get naked systems it's not usually straightfoward and customers are encouraged to buy the bundle whether it's in their best interests or not.

      OSS is not like this. My OpenOffice documents made in Redhat are readable in Fedora, FreeBSD, WindowsXP, Gentoo Linux, etc. The C, Motif, GTK, etc APIs are all [to varying degrees] documented and consistent. My GTK+ application for FreeBSD will very likely build and run out of the box on a Gentoo box, etc, etc.

      OSS gives you the power to choose what YOU want to solve your problems. MSFT strives to simplify that for you. Use Windows, use Office, use VisualStudio. What else is there?

      Tom

      --
      Someday, I'll have a real sig.
    43. Re:Is SR ever going to be good enough? by teslatug · · Score: 1

      Not only was the computer too smart, it was smarter than any single human or betazoid could be. Imagine instead of a computer you had an assistant. No one would believe that that assistant could pick up every single time whether he was being addressed or they were talking to each other. Once in a while he'd have to say, "were you talking to me?" but the computer somehow manages every time. It's not possible through AI, you need direct access to the thought patterns of the humans. That's the only way it could be that good, and if they'd just said that I'd be Ok with it because I don't see outside the realm of possibility where you could detect some brainwave signature that matches commanding the computer.

    44. Re:Is SR ever going to be good enough? by Kjella · · Score: 1

      I wouldn't predict any general solution to a complex problem like voice recognition in a mere 300 years. Maybe we will. But our history of general solutions to other complex biological problems is not encouraging. Neither is the history of our first 50 years of AI, despite the constant hype and Hollywood movies claiming that AI is just around the corner.

      The real issue with speech recognition is presumably just as much AI as speech recognition itself. Without realizing it, we're using an awful lot of context and logic to deduct what it is a person really said. For example, if I'm stopped on the street and asked for directions, implictly I narrow the search down to nearby streets and landsmarks which it would be natural that a random passer-by would know the directions to.

      If I and a friend have a conversation on politics, "Bush always arrives too late" has a different meaning than if I'm talking about getting to town and say "bus always arrives too late". Half of what's being heard at any loud party is guesswork, and still for the most part it works out because people have the same understanding of the options, usually involving "dance", "drink", "toilet" and "step out", even though a computer would make no sense of it.

      I'm sure you've noticed it yourself, whenever someone says something surprising but valid sentence you have a tendency to go "What?" despite actually being quite clear. You're simply convinced that your brain has misfired and identified the words as something else. The computer knows which words are common and which go togeter, but it has no such sense of context or of reference to past events, or understand the objects. "My vacation is in pain" makes no sense, but "My vacation is in Spain" does.

      That is why you can create a voice recoginition that can extract all the possible information out of the sound itself, but still wouldn't hold a candle to human voice recognition. For that you require real AI.

      --
      Live today, because you never know what tomorrow brings
    45. Re:Is SR ever going to be good enough? by Anonymous Coward · · Score: 0

      If we can get a Universal Translator working like what ST had, all this other stuff is peanuts.

    46. Re:Is SR ever going to be good enough? by Anonymous Coward · · Score: 0

      > 90% accuracy is nowhere near enough for voice recognition in a dictation context. 90% accuracy means one word in every ten will be wrong. In this post, so far, that works out to more than one erroneous word per sentence.

      So you're saying it's good enough for a slashdot editor?

    47. Re:Is SR ever going to be good enough? by Anonymous Coward · · Score: 0

      It's been over an hour and you'r still not back. Aren't you done yet?:-)

    48. Re:Is SR ever going to be good enough? by Eideewt · · Score: 1

      "Yes, I really mean that. Typing is faster than talking, for anybody who's any good at it. And not by a little - by a lot. If you want to get a document into a computer quickly, use a keyboard."

      I don't think that's correct. According to Wikipedia, Barbara Blackburn, the world's fastest typist, has hit a peak speed of 212 wpm, with a sustained speed of 170 wpm. Conversations happen at about 200 wpm.

    49. Re:Is SR ever going to be good enough? by Just+Some+Guy · · Score: 1
      It's much quicker and less annoying to the people around you to just type on your keyboard.

      My wife, a doctor, pays $0.11 per line for someone to transcribe her patient notes from a tape recorder. She's from New York and speaks quickly normally, but sounds like the fast talker from the old Fed Ex commercials when she's dictating. We'd cheerfully pay $1,000 for software that could accurately recognize her speech, and gladly double that if it could automatically apply minimal formatting to the results.

      --
      Dewey, what part of this looks like authorities should be involved?
    50. Re:Is SR ever going to be good enough? by Eideewt · · Score: 1

      Even if you figure a person can only speak proper English at half the speed of a regular conversation (200 wpm, acc. to Wikipedia), that's still 100 wpm, which is way faster than most of us type.

    51. Re:Is SR ever going to be good enough? by Bing+Tsher+E · · Score: 1

      I'm stuck using Windows for work [*] and it drives me stir crazy.


      I have a stapler on my desk at work that occasionally jams.

      You don't find me ranting and badmouthing Swingline and claiming my rants are justified.

    52. Re:Is SR ever going to be good enough? by Bing+Tsher+E · · Score: 1

      You're implying that Apple should not be allowed to bundle iTunes with their Macintosh computers.

      Sounds good to me.

    53. Re:Is SR ever going to be good enough? by Bing+Tsher+E · · Score: 1

      Buck Rodgers was set in the 25th Century.

      And it's all been downhill since then.

    54. Re:Is SR ever going to be good enough? by Phisbut · · Score: 1
      I don't think that's correct. According to Wikipedia, Barbara Blackburn, the world's fastest typist, has hit a peak speed of 212 wpm, with a sustained speed of 170 wpm. Conversations happen at about 200 wpm.

      I really doubt any speech-recognition application will ever be able to deal with 200 wpm. With all these softwares, even the best ones, you have to articulate every word and speak much slower than a regular conversation. Try having a conversation at the pace to which you talk to a computer, you'll look like a retard.

      --
      After 3 days without programming, life becomes meaningless
      - The Tao of Programming
    55. Re:Is SR ever going to be good enough? by mrcaseyj · · Score: 1
      >[Speech Recognition] tech will eventually be as good as on star trek as long as people work on it.

      I think there are many areas of Artificial Intelligence, such as speech recognition, that will require full human level intelligence and knowledge in order to perform at an acceptable level. If you think about it, even humans don't recognize speech all that well. How often do people have to ask others to repeat themselves? Imagine how poorly a human would take dictation, if the human had as much as only one tenth of normal brain power. Furthermore many tasks require a general knowledge of the world to resolve ambiguities. The same applies to other areas. A robot that drives as poorly as the typical human driver, will never be allowed to control a car in public.

      However I expect human level computer intelligence in only 20 or 30 years. But then the explosion of intelligence will render voice recognition irrelevant.

    56. Re:Is SR ever going to be good enough? by Eideewt · · Score: 1

      I wouldn't say never, although I was talking more about the viability of speech as an input method from a hypothetical standpoint rather than a practical one. From a practical standpoint, it wouldn't even have to be that fast to be competetive. An advanced typist can do around 60 wpm, and the average typist can only do half that. Speech recognition doesn't have to beat the holder of the world record, just the rest of us. If it could do 50 wpm, the speed gains would be well worth it. 100, and all but the best typists could be faster. Of course, it would still be noisy and wear out your voice, but I think it could be fast enough for a lot of uses.

    57. Re:Is SR ever going to be good enough? by asuffield · · Score: 1

      Apple don't have a monopoly on desktop operating systems (as defined in antitrust law), so they can bundle whatever they like with theirs. Microsoft do have a monopoly so they are restricted. That's what antitrust law is all about.

      Apple might qualify as a monopoly for the iPod, which would mean they are restricted in what they can bundle with iPods. But they can do what they like with OSX.

  10. Funniest shit ever by tomstdenis · · Score: 0, Troll

    And totally the quality I'd come to expect from Microsoft.

    you say:

    Dear Mom,
    Vista is gonna suck. Enjoy this powerbook instead.
    Sincerely,
    Little Girl

    It hears:

    Dear Aunt,
    The explosives are deleted in the moon star night.
    Buy more Vista.
    Bite me,
    Bitten World

    Tom

    --
    Someday, I'll have a real sig.
    1. Re:Funniest shit ever by Anonymous Coward · · Score: 0

      you post too much.

  11. Re:next features to be cut from vista... by Anonymous Coward · · Score: 0

    Well selling 'i's worked for Apple. (iPod, iSight, iMac, etc.)

  12. I use this software right now by Anonymous Coward · · Score: 0

    It is not as bad as you dog to the car in double tuesdays golf.

  13. So? by Klaidas · · Score: 5, Informative

    This isn't the first presentation went wrong, isn't it?
    Win98 gone wild: http://www.youtube.com/watch?v=Hrbx9_AY720
    Media Center Edition gone wild http://www.youtube.com/watch?v=j7EEbokKLHI
    We can add this one to the list too ;)

    1. Re:So? by TheRaven64 · · Score: 1

      The XBox also crashed during its first public demo. Rumour has it that the developers were told that they had to make sure it never blue-screened in public again - and that's why when the XBox crashes you get a green screen of death.

      --
      I am TheRaven on Soylent News
    2. Re:So? by tomstdenis · · Score: 1

      Those are painful to watch. As much as I hate MSFT I still feel some empathy for them... Damn, poor gates playing with the remote...

      Tom

      --
      Someday, I'll have a real sig.
    3. Re:So? by anothy · · Score: 1

      that's the point. this isn't really about the technology in these demos. as someone who's worked building speech systems, i'll tell you that type of demo was a big risk. sure, ambient noise seemed low to us, but who knows what it was like on the stage? did he have a monitor speaker up there? and echo - at least enough to throw off the mic used for recognition - seems entirely possible. had they run the demo in exactly that sort of environment? with a simulated audience? had the presenter trained the system himself? speech recognition's hard, especially dictation-type systems.
      still, i have to agree with the anchor at the end: "live television's rough; welcome to our world." if you aren't sure you can pull it off, it's generally foolish to take that kind of risk. we in the tech world generally agree that MS hasn't gotten where they are because of superior technology; i've often heard them referred to (and referred to them myself) as a marketing company rather than a technology company. so why do they get this stuff wrong so often? where are the high-profile botches from, say, Apple?

      --

      i speak for myself and those who like what i say.
    4. Re:So? by mgabrys_sf · · Score: 1

      OH! There's plenty of those too - here's a highlight reel all in one file!

      http://youtube.com/watch?v=jAsxzwHaGjk&search=appl e%20bloopers

      (note: there's also a ton of general human-performance blunders as well - I particularly like the moments
      when tech buzzword piles defy explanation - even by Jobs)

    5. Re:So? by shadowbearer · · Score: 1


        I wonder if someone forgot to put fresh batteries in the remote? *g*

      SB

      --
      It's old. The more humans I meet, the more I like my cats. At least they are honest.
    6. Re:So? by anothy · · Score: 1

      heh. that was great. the "massive branch prediction logic" bit was my favorite.

      --

      i speak for myself and those who like what i say.
    7. Re:So? by l33t+gambler · · Score: 0

      Theres nothing "poor" about Mr. Gates or Microsoft. They are rich and they forced upon us Windows 9x with its BSODs and endless chkdsks. Did you know most people don't trust their computer to run more then 3 programs at once?

      then again Bill Gates and his wife do give lots of money to charity.

      --
      Teasing the nobles, and rightfully so!
  14. Dear aunt by linvir · · Score: 4, Informative
    For the flashless. Here's the format:
    Microsoftie says this
    Speech recogniser hears this


    Dear mom
    comma
    Dear aunt,
    [laughter]
    Fix aunt
    Let's set
    Delete that
    Delete that
    Delete that
    so
    I think it's picking up a little bit of echo here
    Delete... select all
    double the killer delete select all
    [laughter]


    Final text:
    Dear aunt, let's set so double the killer delete select all
    1. Re:Dear aunt by James+Manning · · Score: 5, Informative

      For the curious, it was an audio gain issue. Details on Rob Chambers' blog:

      http://blogs.msdn.com/robch/archive/2006/07/29/682 479.aspx

    2. Re:Dear aunt by Anonymous Coward · · Score: 0

      Unfortunately, the nature of this specific audio sub-system bug is that it's intermittent. It worked great every single time. Right up until that one live demonstration -- the one that counted. ;-)

      Symptomatic of badly designed software. The people who write it often incorrectly refer to this as "demo effect", usually unaware that it occurs at all regular user's desks and makes them angry at the product.

      Compare the Windows 98 demo. Also worked great every time except when you actually used it.

    3. Re:Dear aunt by Roblimo · · Score: 1

      Advice for Microsoft in future speech recognition demos: Use a high-quality USB headset with a noise-canceling mic.

      In fact, this is good advice for *anyone* messing with speech recognition software...

      - Robin

    4. Re:Dear aunt by KlaymenDK · · Score: 2, Funny

      Wait a sec ..... I think [MS] marketing folks can't use noise-cancelling mics. Nothing would be recorded!

      Badabing! Thanks folks, I'll be here all week.

    5. Re:Dear aunt by tpv · · Score: 1
      As Rob says
      when Vista ships later this year, this audio gain issue will be a thing of the past.

      I'm sure he's right - they'll have replaced with with worse bugs by then.

      --
      Read more of this story at Slashdot.Read more of this story at Slashdot.Read more of this story at Slashdot.
  15. Remember the Win98 BSOD? by Enderandrew · · Score: 4, Funny

    Not quite as embarrassing as the Windows 98 BSOD, but more entertaining than the Ballmer developer's video.

    http://www.ntk.net/media/developers.mpg

    --
    http://blindscribblings.com - Tasty pop-culture in conceptual fashion.
    1. Re:Remember the Win98 BSOD? by csplinter · · Score: 1

      More entertaining than both is the Developers Music Video

  16. What is there to differentiate if mom=aunt? by Browzer · · Score: 0, Offtopic

    ...

  17. are u serious? by CDPatten · · Score: 1, Insightful

    "Don't you just love Microsoft's live demonstrations?"

    As if MS is the only one who has problems with demonstrations. This is the problem with anti-ms guys... its not that MS is perfect, its just that the zelots are blinded by hate.

    1. Re:are u serious? by tomstdenis · · Score: 4, Insightful

      Microsoft routinely puts out their excellence over everyone else including OSS. Hear them talk about Office w.r.t. OpenOffice. They talk down about it, mock it, dismiss it, etc...

      It's called modesty. If MSFT had any [and some humility] they wouldn't get laughed at so hard for this. I mean look at Linux. Find a bug in the Kernel, fix it, post notices that its. You don't see anyone saying "Oh hahaha, Linus is at it again!" That's because you also don't see Linus on CNN mocking the rest of the world.

      Microsoft deserves all the negative press and humilitation they get because they are shameless, deceitful, greedy monopolistic bastards.

      Tom

      --
      Someday, I'll have a real sig.
    2. Re:are u serious? by NixLuver · · Score: 2, Insightful

      Nah, that's not it. I don't hate "Microsoft"; that's just a name on a door somewhere. I don't hate 'the corporation'; corporations are not individuals, no matter what the law would have us believe.

      The reason I find this eminently amusing is that Microsoft is a company built on marketing. At no particular point has Microsoft had "The Superior Technical Solution"; they have always had luck and better marketing. Since DOS 3.3 there have frequently been products that were more stable, faster, easier to use - you name it. And Microsoft's captains have beaten them in two ways: Marketing and Money.

      So of course when a company who has built their foundation on marketing flubs it, it's more amusing than when a company who has built their foundation on performance of one kind or another flubs it. It's inescapable that the Bg Dog gets more scrutiny than the Contender, anyway. And Microsoft apologists should understand that.

    3. Re:are u serious? by Mathiasdm · · Score: 1

      *points*
      Look, ma, no humour!

      Speech recognition version:
      Cook your aunt in liquor!

      --
      Join the anonymous, help develop the network: http://www.i2p2.de
    4. Re:are u serious? by pallmall1 · · Score: 1

      This is the problem with anti-ms guys... its not that MS is perfect, its just that the zelots are blinded by hate.

      Dear aunt, let's set so double the killer delete select all

      --
      3 things about computers: they're alive, they're self-aware, and they hate your guts.
    5. Re:are u serious? by Udo+Schmitz · · Score: 4, Insightful
      "As if MS is the only one who has problems with demonstrations."

      Hmmm, no. Maybe it's the way they deal with failures. Remember Bill gates trying hard to demonstrate the Media Center? Some time after that Steve Jobs gave his regular Macworld keynote when his Mac didn't respond anymore. He moved a monitor switch to continue the presentation on another Mac and said: "Well, that's why we have backup systems here."

    6. Re:are u serious? by Anonymous Coward · · Score: 0

      The reason I find this eminently amusing is that Microsoft is a company built on marketing. At no particular point has Microsoft had "The Superior Technical Solution"; they have always had luck and better marketing.

      I agree that Microsoft has never or at most rarely had a superior technical solution to anything, but I'm puzzled by the number of people who parrot this notion that they have had either luck or marketing.

      They defeated IBM in negotiating terms twice, giving them a monopoly. Period. They have been smart enough to maintain this monopoly. Period. They are unusually aggressive businessmen who had enough technical savvy to know that something new would matter in their contract with IBM. That could be luck or foresight. But there is no marketing in there anywhere. They established a monopoly and they work VERY hard to maintain it.

      Why is it so hard for geeks to understand that this is the sole reason that anybody even knows the name of Microsoft?
    7. Re:are u serious? by ThinkFr33ly · · Score: 1

      Or maybe people don't bash Steve Jobs and Apple after failures like they do Bill Gates and Microsoft.

    8. Re:are u serious? by Yvan256 · · Score: 1
      Some time after that Steve Jobs gave his regular Macworld keynote when his Mac didn't respond anymore. He moved a monitor switch to continue the presentation on another Mac and said: "Well, that's why we have backup systems here."

      You know, I've always thought it was all staged, just to make fun of Bill's mistake earlier that week.
    9. Re:are u serious? by westlake · · Score: 1
      It's called modesty. If MSFT had any [and some humility] they wouldn't get laughed at so hard for this.

      In Seattle they can hear you laughing all the way to the bank.

    10. Re:are u serious? by TomHandy · · Score: 1

      People don't bash Steve Jobs and Apple after failures? Are you joking? There are tons of Apple-haters out there (just like there are tons of MS haters) and they jump on every little chance they get. As the original poster said though, Apple has genuinely had less of these colossal failures, and when they do, they often are either minor (i.e. some glitch with a particular app or something) or they have backups ready to go so there isn't that terrible awkward time period where nothing is working and the person presenting doesn't have any plan for what to do.

    11. Re:are u serious? by Anonymous Coward · · Score: 0

      Oh yeah. It was totally staged. Just like all these were.

    12. Re:are u serious? by Zann · · Score: 1

      Apple presentation bloopers.

      Not only MS makes mistakes, of course.

      --
      Feeling a bit scared? Afraid? That's just death lurking around.
    13. Re:are u serious? by Anonymous Coward · · Score: 0

      You poor demented Linus lovechild. Are you going to actually keep a straight face and claim that Linus isn't an arrogant elitist prick? Example of a Linus quote:

      "I claim that Mach people (and apparently FreeBSD) are incompetent idiots"

      MS has never, ever insulted the intellegence of their competition. They just break out a list of bullet points and explain why (in their opinion) MS software is better than the competition. How are they wrong in doing that?

    14. Re:are u serious? by Anonymous Coward · · Score: 0

      MS has never, ever insulted the intellegence of their competition. They just break out a list of bullet points and explain why (in their opinion) MS software is better than the competition. How are they wrong in doing that?

      WTF, are you joking? Have you ever heard Steve Ballmer? He calls his competition "communists" and worse, and throws chairs around like a latter-day Randall Flagg.

    15. Re:are u serious? by Anonymous Coward · · Score: 0

      To be fair, Office does indeed beat the hell out of OpenOffice. I should not have to deal with lag in a goddamn word processor. Yes, Microsoft is setting itself up for very public failure with every gloat. Yes, they deserve the ridicule they get when they BSOD during a presentation, or type things only remotely connected to the incoming speech. But the real problem is just in their arrogance, not in their technology. The products are fine, they just can't keep up with marketing.

    16. Re:are u serious? by Mattintosh · · Score: 1

      Perhaps you missed the time when Steve Jobs threw a camera at a stage flunkie when things refused to work. Or when Quake 3 locked up during a demo on the keynote.

      There are several more in this video.

  18. Just from MicroSoft Insider by Seiruu · · Score: 4, Funny

    Steve Ballmers accidently send an e-mail while diligently testing the software. The e-mail says:

    "Sir put down the chair, then we'll talk"
    "No Steve wait up, don't do that"
    "BOOM CRASH BOOM CRASH BOOM CRAASH WAAAH NOOO STOOOOP"
    "DUDE, THE COMP HAS A BSOD! WAAH!"

  19. Show me yours. by Anonymous Coward · · Score: 0

    Ok, all you anonymous cowards. Show me your speech recognition engine. And show me your demo.

    1. Re:Show me yours. by Anonymous Coward · · Score: 0
    2. Re:Show me yours. by ZXSpectrum42 · · Score: 1

      3 Years ago with a friend we investigated the potential of writing our own voice recognition engine. Till then we were working on signal analysis on data aquired from the Ulysses spacecraft, so we had a good experience on signals analysis techniques. For 2 years this friend worked and finally created a voice recognition engine with a vocabulary of about 100 words. The engine had to be trained because it needed to be speaker independant , and of course it could recognise one word at a time (The engine had a succes of 95-98% ).Natural speach was out of our league and i believe that it is out of anyone's reach (if we are talking about a software that will not fail in any given presentation :-) see MS). After all, even a human needs a lot of training (years actually starting from being a baby where only listens untill the moment that says 'mom' :-)) and even then, the phrase "can you repeat that ?" comes up all the time. From what i have seen in the voice recognition discipline the moto is : "we are ten years from natural speech recognition" and they said that in 85 , 95, and 2005. And thats because unless we find something radically different we will always be 10 years from that goal. So for the time being, the only usefull applications for voice recognition (meaning software that actually works), given the technology status, is for menu driven applications for eg handicaped people (applications in elevators eg "Please select a floor").

      PS1 My english are bad
      PS2 We need to be more realistic about Voice Recognition as for AI in general

      --
      2+2 = 5 (for very large values of 2)
    3. Re:Show me yours. by Anonymous Coward · · Score: 0

      I worked on a speech recognition system for a tradeshow demo nearly 40 years ago (the 1969 FCJJ). We only wanted it to be able to recognize a couple of dozen words. We worked with a well-known university speech recognition PhD for clues on how to do it, used a rack of (then) state-of-the-art dedicated signal processing hardware and minicomputer, and trained the system using a couple of dozen of co-workers. It seemed to work almost decently at least some of the time so we shipped it to the show. What we found on the show floor was that, probably because of regional accents, that it didn't work nearly as well because there were larger variations between individual speakers than between the words they spoke. We thought the problem would probably be licked in another 10 years.

    4. Re:Show me yours. by Anonymous Coward · · Score: 0

      OOPS! That should be "FJCC", not "FCJJ". Should have previewed before submitting.

    5. Re:Show me yours. by Anonymous Coward · · Score: 0

      Why should I, it's not me who decided to make a public demo.

  20. Follow Up Another Intercepted Mail! by Seiruu · · Score: 1

    "No seriously, stop throwing that chair!"
    "What the hell is wrong with your comp Steve!?"
    "Is that even a BSOD? It's just skewed now!"
    "How the fuck do you turn this thing off!"
    "What does this button do?"
    "It's doing crazy things man, maybe we should just pull the plug!"
    "No Steve, throwing a chair at it doesn't work!"
    "CRASH BOOM CRASH"

  21. Re:I'd love to see what it 'spat out'... by Anonymous Coward · · Score: 1, Funny

    that comment is as sad as the video.

  22. Re:Did anybody try... by pallmall1 · · Score: 1

    I'll bet someone is Redmond is asking the demo organizer:

    "Can you say -- canned like a tuna?"

    --
    3 things about computers: they're alive, they're self-aware, and they hate your guts.
  23. Windows Redneck by Anonymous Coward · · Score: 0

    Does this mean there's a lot of inbreeding going in Microsoft?

  24. A little early for voice recognition by Xiph · · Score: 1

    Having seen some of the research that goes on, voice recognition is still far from good.
    Yes it works in some contexts, especially if it's been trained with the person speaking, and the language is limited, such as in a professional environment.

    but for home computers, it's not only overkill it's also inadequate and non-functional.
    I say COOL feature, but hopeless waste of time and money, which in the end will be paid by you-know-who (not ms)

    on another topic can someone please ask ms to stop the increasing and excessive stream of bundling?

    --
    Blah blah sig blah blah blah irony blah blah
    1. Re:A little early for voice recognition by techman2 · · Score: 1

      I agree, after spending a fair amount of time with the Tablet PC we have at work I am not very impressed with the voice recognition. It takes many hours of training to even get it to be anything near useable. It's certainly not ready for prime time in my opinion.

  25. Re:A modest voice. by Anonymous Coward · · Score: 0

    Open Source Speech Recognition?

    I believe one such system is called Sphinx.

  26. It's hard by wootest · · Score: 3, Funny

    It's hard to wreck a nice beach. :)

    1. Re:It's hard by Teresita · · Score: 0

      wootest wrote:
      It's hard to wreck a nice beach. :)

      Teddy Kennedy with his shirt off would certainly do it.

  27. Why it sucks .... by Spiked_Three · · Score: 0, Flamebait

    Here is a different perspective on why speech recognition STILL sucks;

    Natural language interaction is one of BillG's hot buttons. Back in 95 he used to love demonstrating with poly the parrot. Polly the parrot could recognize speech and react to it - like "play miles davis". He demo'd it many times, and yes, it occasionally glitched but the potential was pretty cool. When he built his house, he put speech recognition technology all through it, thinking that it would be perfected very soon.

    WTF happened? Well along came this distraction called 'the internet' and 'netscape'. And then another distraction called 'open source' and 'linux'. As a result of those distractions it set natural language recognition back 10 years. Yep, this is case where competition has stifled a particular innovation. I'm not saying that it's a bad thing, maybe competition encouraged 5 other innovations, but I am positive it stifled this one in particular.

    Just a different thing to think about...

    --
    slashdot troll = you make a compelling argument I do not like the implications of.
    1. Re:Why it sucks .... by Kent+Recal · · Score: 1
      Well along came this distraction called 'the internet' and 'netscape'. And then another distraction called 'open source' and 'linux'. As a result of those distractions it set natural language recognition back 10 years. Yep, this is case where competition has stifled a particular innovation.

      Nonsense.

      Just in case you have been living under a rock for the last 15 years I have some news for you:
      That kind of innovation doesn't come out of the microsoft campus. It comes out of universities.
      And those couldn't care less about the NASDAX buzzword of the week...
    2. Re:Why it sucks .... by Spiked_Three · · Score: 1

      apparently the rock i'm under has more light than yours. Have you never heard of Micosoft Research? http://research.microsoft.com/ There is a lot of technology that comes from there, asshole.

      --
      slashdot troll = you make a compelling argument I do not like the implications of.
    3. Re:Why it sucks .... by Kent+Recal · · Score: 1
      There is a lot of technology that comes from there, asshole.

      Name one, beauty.
    4. Re:Why it sucks .... by thunderpaws · · Score: 1
      Natural language interaction is one of BillG's hot buttons. Back in 95 he used to love demonstrating with poly the parrot. Polly the parrot could recognize speech and react to it - like "play miles davis". He demo'd it many times, and yes, it occasionally glitched but the potential was pretty cool. When he built his house, he put speech recognition technology all through it, thinking that it would be perfected very soon.


      BonziBuddy!

      Thanks again Bill.
    5. Re:Why it sucks .... by ClassMyAss · · Score: 1

      What we have is not an issue of competition crushing innovation; rather, it is the lack of competition in the OS field that has left Microsoft with the energy to expand into new sectors and use Windows to help it crush competitors elsewhere. If there had been a stronger operating system to challenge Windows all these years, MS would have had to substantially improve their product with each iteration rather than merely fix the heinous bugs left in the prior installment. Instead they have done nothing but tread water for the past 10 years, complacent in their domination of the OS market.

      And don't anybody give me this crap about Linux being a strong competitor to Windows! Don't get me wrong - I love my Ubuntu, but the day that Linux becomes a competitor to Windows will be the day that Linux actually refers to a single operating system rather than a mess of crappy forks. Take a lesson from Firefox: open source projects thrive when there is a clear winner. A monopoly may be a bad thing in commercial development, but in open source it is actually somewhat desirable, as it means less duplicated effort. Nobody is going to develop non open-source software for an operating system that is merely one distro among a thousand. And until commercial applications are widely available for an OS, it can never be a competitor to Windows. Choice isn't bad, but unfortunately having too much of it limits adoption and interoperability. [/rant]

      Back on topic: regarding speech recognition, I don't think it has much of a place in an operating system, anyways, at least today. It's a cute trick, but really, until some form of semi-strong AI is achieved, speech recognition is going to be a tough nut to crack. Humans use contextual clues to decipher what we hear, and they are very important - a computer will only be able to perform robust speech recognition when it can tell whether its attempted translation actually means anything or not, and this is a long way off. Here's hoping I see the day...

    6. Re:Why it sucks .... by Spiked_Three · · Score: 1

      Why, you cant read a web page on your own? Try that link, there is a menu item on the left (ask your mommy if you dont know left from right) for press releases. There are quite a few mentioned there. Oh look, some of the research is done with universities! Isn't that exciting?

      --
      slashdot troll = you make a compelling argument I do not like the implications of.
    7. Re:Why it sucks .... by anothy · · Score: 1

      it's unfair that you've been mod'd flamebait. i even have mod point, but i commented elsewhere on this story before seeing your comment. sorry about that.
      no, you're not flamebait; it would even be insightful - if it were true. microsoft, at any point, did very little real research on speech recognition systems. the innovation in that field has always come from other places. by your reasoning, Microsoft should've been the front-runner on speech systems until, say, 1996. they weren't. until around 1996 or 1997, the stuff coming out of Bell Labs probably was. around 1997, they stagnated and IBM made some very interesting advances. in terms of actual products, the two of them were the leaders until around 2001, when i lost track of such things. various universities also have produced very promising research and prototypes. but MS has never been out front on that.

      --

      i speak for myself and those who like what i say.
    8. Re:Why it sucks .... by Spiked_Three · · Score: 1

      I wouldnt argue with any of that. I don't think Microsoft was out front but they wanted to be and had to re-arrange the priorities because of the things i mentioned.

      I have tried the various things from IBM and they suck as bad as anything else i've seen. Until some company with billions of R&D funds can prioritize this, we're stuck with double deletes. I guess we can only hope it is one of Ozzy's passions as well.

      --
      slashdot troll = you make a compelling argument I do not like the implications of.
    9. Re:Why it sucks .... by QRDeNameland · · Score: 1

      Until some company with billions of R&D funds can prioritize this, we're stuck with double deletes. I guess we can only hope it is one of Ozzy's passions as well.

      While I'm fairly certain you were referring to Ray Ozzie, I think I've stumbled across the test that will be to voice recognition what the Turing test is to AI.

      The Ozzy Test. When a voice recognition program can decipher the casual speech of Ozzy Osbourne, then we can call it true voice recognition. I guess the bigger question is...how would we know when the software gets it right?

      --
      Momentarily, the need for the construction of new light will no longer exist.
    10. Re:Why it sucks .... by kalidasa · · Score: 1

      Yeah, because the Lernout & Hauspie mess had nothing to do with setting back speech recognition.

    11. Re:Why it sucks .... by Anonymous Coward · · Score: 0

      Wake up from your dream. Here are some speech recognition papers from MS.

      Typically has more papers in SIGGRAPH each year than any university.

      MS Research is one of the only institutions investing serious effort into developing an operating system in a safe language. MS Research is also a very big player in the field of programming language research, and the maintainers of the GHC Haskell compiler. I'm a particular fan of their work on Software Transactional Memory.

      I could go on, but any short list I could make of cool stuff MS Research has done wouldn't do justice to the amount of great stuff they've done. They are as involved in computer science research as any of the major CS-strong universities.

    12. Re:Why it sucks .... by lullabud · · Score: 1

      Competition? I thot u wer gonna say dat da net iz y txt rec duzznt werk cuz sum peeps cant evn speek rite n e more LOLOL

  28. Come on Microsoft... by Danga · · Score: 1, Redundant

    My guess is that the marketer "showing off" the voice recognition didn't properly train the software before the demonstration. If he did do that then he obviously did not pick and test something that was at least known to work which is not a bad idea when you are doing product demos. The software obviously has much work left since it interpreted the two syllable sentence "select all." as 13 syllables "so double the killer delete select all" (while it did finally get "select all" where the hell did the rest of that come from?). I am suprised that Microsoft had so much confidence they would go on live TV with it.

    This is one reason that I believe until software is done being fully tested and is fully released that someone technical and who knows the software inside and out (one of the developers) should be demonstrating the products. Leave the non technical marketers to demonstrate products that they can't mess up, kind of like the classes they had to take in college to get a marketing degree.

    --
    Hey, there is only one Return and it's not of the King, it's of the Jedi.
  29. New e-mail intercepted! by Seiruu · · Score: 1

    "Okay, Steve, I think it's back to normal again."
    "What? Taking money out of the Xmas gift to pay for your chair and office damage? Well, wouldn't be the first time."
    "But seriously man, just between you and me, but what's with all this MILF porn man?"
    "Bill told you not to keep this shit on your protected network drive."
    "Aw come on! Everybody knows your password is 'MacroBig'!"
    "Imagine if the company digged deep enough?"
    "Just in case this doesn't work out, we'll fire a bunch and write it off as "underperformed".
    "Hey, what's that weird thing on your screen? OMG IT'S SENDING AN EMAIL, QUICK CUT IT O.F....."

    Eek, I'm getting carried away here.

  30. Vista couldn't differentiate between mom and aunt by jcraveiro · · Score: 3, Funny

    Maybe they're twin sisters... ;)

  31. Sequence of events by daveschroeder · · Score: 2, Funny

    "Dear mom comma"

    Dear aunt,

    "Fix aunt"

    Dear aunt, let's set

    "Delete that"

    Dear aunt, let's set

    "Delete that"

    Dear aunt, let's set

    "Delete that"

    Dear aunt, let's set so

    "I think it's picking up a little bit of echo here...delete - select all"

    Dear aunt, let's set so double the killer delete select all

    *Manually selects all and deletes*

    "Okay, I'm glad you're enjoying this"

    *Laughter*

    1. Re:Sequence of events by lurker412 · · Score: 4, Funny

      Got ham pizza ship

  32. Man, that brings back memories!!! by smchris · · Score: 1, Informative

    Does Microsoft have to copy EVERYTHING??? I used OS/2 Warp for the second half of the 90s but my experience with _its_ built-in speech recognition was pretty much identical to that demo.

    1. Re:Man, that brings back memories!!! by TheRaven64 · · Score: 3, Funny

      I recall a friend of mine demoing the MacOS speech recognition engine a decade or so ago. The dialogue went something like this: User: Open SimpleText. Computer: Are you sure you wish to shut down. User: No. Computer: Shutting down... As you can imagine, he didn't leave it switched on all that often...

      --
      I am TheRaven on Soylent News
    2. Re:Man, that brings back memories!!! by CaptnMArk · · Score: 1

      I thought of that too.

      I took me a lot of time to train it to somewhat understand me (I'm not a native english speaker, and my PC was not exactly fast)

      So, almost a decade later, MS stuff is no better.

    3. Re:Man, that brings back memories!!! by smchris · · Score: 1

      Thank you. I think it was ridiculous for me to be rated 100% troll.

      1) Roughly before OS/2 Warp, Microsoft had an important role in OS/2.
      2) _Now_, a decade later Microsoft wants to imitate OS/2 Warp with an included speech recognition.
      3) It apparently works identically badly a decade later.
      4) Considering the number of things that have appeared elsewhere before appearing in Microsoft products I find that an historically amusing irony and pretty germane to Micrsoft standard operating procedure. And rather _telling_ that in 10 years, the Microsoft product apparently is as bad as the OS/2 Warp version of 1995.

  33. OS/2 Still Kicking Microsoft's Ass by Greyfox · · Score: 4, Funny
    C'mon! IBM put on a great speech reco demo at the '95 Atlanta COMDEX. Their product worked flawlessly! Well... Except the guy fired it up and started talking and the little text editor was picking up the words when someone in the back of the audience yells "FORMAT C!" The crowd went wild and the guy doing the demo cracked up too, which caused the speech engine to freak out a bit. He had to delete a bunch of junk out of his text editor once things settled down.

    Speech recognition is still just a gimmick anyway. We still have a LONG way to go before it gets to the point that Joe Average User imagines it should be. Joe average user wants his computer to respond like the one in Star Trek. I still want to set up my Asterisk server with speech recognition, though, so that people can either dial or say the extension they want. It'd also be neat to pick up the phone, say "Call Mom" to the dial tone and have it call my aunt for me.

    --

    I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

    1. Re:OS/2 Still Kicking Microsoft's Ass by maxwell+demon · · Score: 3, Funny
      It'd also be neat to pick up the phone, say "Call Mom" to the dial tone and have it call my aunt for me.

      Seems Microsoft's speech recognicion is just right for you :-)
      --
      The Tao of math: The numbers you can count are not the real numbers.
    2. Re:OS/2 Still Kicking Microsoft's Ass by linj · · Score: 1

      I still want to set up my Asterisk server with speech recognition, though, so that people can either dial or say the extension they want. It'd also be neat to pick up the phone, say "Call Mom" to the dial tone and have it call my aunt for me.


      You should try a Microsoft Pocket PC Phone with Voice Command, then. It seems that voice-to-speech synthesis is much harder than if you have a set of expected voice commands to work with already. In a good environment, I can almost always get the right result with "call foo", "open foo-application", or "play foo-song". When problems crop up, it's generally because living in Asia causes one to have to work with some pretty funky names.

    3. Re:OS/2 Still Kicking Microsoft's Ass by Anonymous Coward · · Score: 0

      I still want to set up my Asterisk server with speech recognition, though, so that people can either dial or say the extension they want.

      You can give this a test drive by calling T Mobile support with their virtual assistant. I guaren-damn-tee you that after the 3rd time of repeating a simple command you'll think twice about implementing this.

    4. Re:OS/2 Still Kicking Microsoft's Ass by foniksonik · · Score: 1

      Hey you insensitive clod.... maybe his Mom IS his Aunt... I mean he could be from Alabama or something...

      --
      A fool throws a stone into a well and a thousand sages can not remove it.
    5. Re:OS/2 Still Kicking Microsoft's Ass by rritterson · · Score: 1

      WHOOOOOOOOOOOOOOSH

      (I think the rest of us know what that sound was)

      --
      -Ryan
      AUWYHSTOT (Acronyms are Useless When You Have to Spell Them Out Too)
  34. Mature Porn Spam. by Anonymous Coward · · Score: 0

    "Vista couldn't differentiate between mom and aunt"
    Seems that the problem originated from spammers messing up the keyword-based research features. After cleaning up the Porn db Vista correctly differentiated all the different kinds of mature porn pictures.

  35. Re: Speech Recognition vs. Fiction? by TaoPhoenix · · Score: 1

    I'll second the previous comment. Star Trek grade SR is just another fiction device, present for the very reasons we read fiction: we DON'T want realism in our fiction! Reality is far too full of lost days of work because you discovered the approach you took crashed hard into some flaw.

    I am an enthusiastic fan of Speech Production, because computers understand source text just fine. Hoping for miracles in Speech Recognition is at best 50 years away, and simply "wishing" at best. (If you insist loud enough, can you retroactively re-create the past?)

    Even with better accuracy in their limited environments, I still find voice prompts irritating, because people around you have to put up with "... yes ... representative ... hardware ... other issue ... connect me." Then the computer misses one of the words, and you get "Sorry, I didn't get that."

    --
    My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
  36. removing ambient noise by sh0rtie · · Score: 4, Insightful



    why not just use two mics, one to record the ambient noise (positioned away from the voice mic) the other to record the voice (headset) then as you have two signals just subtract the ambient noise signal from the heaset signal , voila clean headset mic audio

    works for music too, you could control your music player by voice even when its playing loud (at a party) by removing the music signal from the mic signal

    -AJS

    1. Re:removing ambient noise by clem.dickey · · Score: 1

      > just subtract the ambient noise signal from the heaset signal

      Noise canceling microphones do try to do that. I think it's just a mechanical cancelation accomplished by leaving the back of the microphone open; no fancy electrical processing or dual microphones. But it is common for voice recognition mics.

    2. Re:removing ambient noise by anothy · · Score: 1

      yes, this works, at least partly. speech recognition is very tricky stuff, and i doubt even this would solve all the problems, but it would help. but there's a more fundamental problem: in theory, he's demo'ing the real product. most people's computers don't have any provision for plugging in two mics to do that sort of noise cancelation; that sort of processing is non-trivial - i wouldn't want it chewing up CPU cycles generally, and most computers won't have special hardware for it; and most people don't have two mics. to overcome these problems, MS would have to make the demo differ significantly from the actual product.

      --

      i speak for myself and those who like what i say.
    3. Re:removing ambient noise by Anonymous Coward · · Score: 5, Informative

      For those interested, merely subtracting the two signals doesn't work. The signal at the microphone is not just the music signal (called far-end signal) plus the mic signal (near-end signal). The music signal has travelled across the room before it reaches the microphone, giving it some reverberations (echo). If you simply subtract the two signals, you will still hear the music signal quite loudly.

      What is done in practice and works extremely good, is modelling that "echo" as a filter (a FIR transversal filter, which is simply a delay line). You estimate the coefficients of the filter and use the music signal after the "room filter" has been applied to substract from the microphone signal. You then have the voice-only signal left.

      This is setup is called AEC or Acoustic Noise Cancellation. It is used in every telephone and mobile phone there is and is crucial to ADSL. If an ADSL modem would not cancel out its own sent signal at its receiver, the attainable speed would be several times less. AEC is also the reason why talking immediately when you pick up a mobile phone leaves an audible echo of your own voice: estimating the coefficients of the filter is still taking place at that point.

      See http://www.dspalgorithms.com/products/echo.html for a diagram of the AEC or read Haykin's Adaptive Filter Theory if you're looking for a decent book on the subject.

    4. Re:removing ambient noise by hankwang · · Score: 1
      why not just use two mics, one to record the ambient noise (positioned away from the voice mic) the other to record the voice (headset) then as you have two signals just subtract the ambient noise signal from the heaset signal
      That won't work. For recognizable speech, you need frequencies up to 4 kHz, that is wavelengths down to 8.5 cm (3.3 inch for the metrically challenged). If the mics are just half that distance apart, then the phase difference will turn your substraction into an addition for that particular frequency.

      Of course, you could use a direction-sensitive microphone, which basically contains a noise cancellation mechanism built into the cartridge which filters out sounds that come from behind and amplifies frontal sounds. But most simple headset microphones are not directional, because they have other disadvantages: they are slightly more expensive, usually bulkier, highly sensitive to contact vibrations from cable handling, they boost bass frequencies when placed close to the sound source, you need to align them correctly, and so on.

    5. Re:removing ambient noise by Anonymous Coward · · Score: 0

      Where do you asshats come from? At least research something before spouting your big mouth off, hm?

  37. A new "all your base"? by Anonymous Coward · · Score: 0

    Yeah, this was just as funny. Right... "Dear aunt, let's set so double the killer delete select all" HAHAH LOLOL!!1!1! I'm peeing my pants it's so funny.

  38. Re: Speech Recognition or Thought Prediction? by TaoPhoenix · · Score: 1

    Related in there is the urge to have a computer "perfectly understand" everything, so they can indulge in unclear thinking. Not a day goes by without my supervisor saying, "So, how is that progressing?", referring to something about seven topics back. When I haven't a clue, I continually reply: "I don't understand the question."

    --
    My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
  39. Re:Demo Rules by arthurpaliden · · Score: 1

    Demo Rules: 1) You never demo with out a script. 2) You never demo on any system that you did not use to create or at least test the script. 3) You never deviate from the script. 4) If someone wants to see something not in the script, show it to them later, in private.

  40. 20 years times the number of steps by tepples · · Score: 1
    I would give it 20 years if it is seen as something which could make a lot of money

    I would give it 20 years times the number of steps, with waiting at each step for the patents to run out, if Elektroschock is correct in his comment that it is a patent minefield.

  41. Mr. Pogue begs to differ: by Udo+Schmitz · · Score: 2, Interesting
    He writes: "The software I'm using is Dragon NaturallySpeaking 9.0, the latest version of the best-selling speech-recognition software for Windows. This software, which made its debut Tuesday, is remarkable for two reasons.

    Reason 1: You don't have to train this software. That's when you have to read aloud a canned piece of prose that it displays on the screen -- a standard ritual that has begun the speech-recognition adventure for thousands of people.

    I can remember, in the early days, having to read 45 minutes' worth of these scripts for the software's benefit. [...] NatSpeak 9 requires no training at all."

    1. Re:Mr. Pogue begs to differ: by MindStalker · · Score: 1, Flamebait

      Umm are you just selling Dragon now? I'm quite sure the demonstration video wasn't using Dragon. So thanks for the info but he Vista obviously still needs training.

    2. Re:Mr. Pogue begs to differ: by Bob+Wehadababyitsabo · · Score: 1

      Um, Udo was showing that modern speech recognition software (i.e. Dragon) do not require training. This either disproves the GP's argument that Vista's engine requires training or shows how half-baked Vista's much promoted speech engine really is.

      --
      fsck -u
    3. Re:Mr. Pogue begs to differ: by Udo+Schmitz · · Score: 1

      I haven't tried speech rec. software for years, although I found IBM ViaVoice on the Mac was working quite good after some training. But tomstdenis wrote: "Voice recognition requires some training regardless of who provides it. We're not Star Trek here". So I have to assume he's talking about speech recognition in general. I know that David Pogue used that kind of software years ago on windows, so I trust his article somewhat and have to assume that Microsofts solution isn't as good as waht's on the market right now. How shocking.

  42. Aunts? by Anonymous Coward · · Score: 0

    All your aunt are belong to Microsoft...

  43. Accessibility? by tepples · · Score: 1
    The problem though, this is YET ANOTHER THING that should not be bundled with Windows.

    Should a driver for the keyboard be bundled? To somebody who does not have use of hands, speech recognition is as indispensable as a keyboard driver. This is important when trying to get your product certified as disability-safe for use by agencies of governments.

    1. Re:Accessibility? by tomstdenis · · Score: 1

      Keyboard drivers are decidedly different from speech recognition software. For one, the competition on keyboard drivers is non-existent. Most keyboards are PNP compatible.

      However, there IS competition in the voice recognition market. Microsoft bundling their stuff in Vista is an anti-trust violation just like bundling IE with Win98. Your government example is also moot. The IT people could install A DIFFERENT CERTIFIED PROGRAM on their Vista images. I seriously doubt the IT people will be the ones needing speech recognition software themselves.

      In short, there is very little good motivation for limiting competition in this venue. There are many financial incentives in doing so, but fortunately they are not legal.

      Tom

      --
      Someday, I'll have a real sig.
  44. dreadful by Marin3 · · Score: 1

    omg...at least i had a good laugh xD

  45. Re:Vista couldn't differentiate between mom and au by Anonymous Coward · · Score: 0

    Maybe his mom is his aunt.

  46. Dragon Naturally Speaking by Anonymous Coward · · Score: 0
  47. china, google video not available by Anonymous Coward · · Score: 0

    argh, I hate living in china, ...google video is not available. Other things which are banned here are: bbc.com and most other western news sources (cnn.com is not banned strangely enough) google.com and google.cn chache wikipedia.com I have to resort to using an anonymiser for most of my surfing (www.anonymouse.com) I wonder how long untill they ban the whole internet, and replace it with a huge jpeg of Mao. (ps I'm a westerner, most chinese don't notice)

  48. Wait . . did they say by Slithe · · Score: 1

    rare occasions?

    --
    ---- "XML is like violence. If it doesn't fix the problem, you aren't using enough."
  49. Microsoft Innovation by SCHecklerX · · Score: 2, Insightful

    OS/2 Warp had speech recognition in 1994 with OS/2 Warp. Better yet, the OS/2 version of netscape at the time was speech enabled (browse simply by speaking the link). Even cooler was that the netscape developers actually listened to the OS/2 community with that version (I remember them implementing something that I had asked for...very cool). Keep in mind that the average system of that time was a pentium 133 with 100MB of ram. And here we are at 2006, With GHz processors and GBytes of RAM dirt cheap, and M$ is just now starting to experiment with this? By now this technology should be damned near perfectly integrated across the board! Thanks for abusing your monopoly power to destroy all of the competition and REAL innovation, Microsoft!

    1. Re:Microsoft Innovation by realmolo · · Score: 0, Flamebait

      How on Earth is it MS's fault that speech recognition isn't better? They don't even compete in that market.

      You are an idiot.

    2. Re:Microsoft Innovation by linguae · · Score: 1
      Keep in mind that the average system of that time was a pentium 133 with 100MB of ram

      IIRC, the average amount of RAM at the time (circa 1995-96) was more like 4MB or 8MB per PC, maybe 16MB or 32MB on high end systems. Users didn't start getting about 100MB RAM until about 1999 or 2000, maybe later when XP was released.

    3. Re:Microsoft Innovation by Anonymous Coward · · Score: 0

      In 1994 Warp didn't have software only speech recognition (it was added years later with version 4), you had to buy a separate software-hardware product to enable it. I attended one demo of that product and it worked flawlessly, but keep in mind that it required a dsp board. the average PC at that time (P75 with 8 MB RAM, where did you get the 100 MB figure?) wasnt able to do all the calculations in software.

    4. Re:Microsoft Innovation by adpowers · · Score: 1

      My family bought a computer in late 1997 which had 128 megs of RAM. This wasn't standard for many years later, but it wasn't a super high end machine, and it was by no means unheard of.

    5. Re:Microsoft Innovation by generic-man · · Score: 1

      Begin dictation.

      Yes. (pause) I. (pause) Remember. (pause) Using. (pause) That. (pause) I. (pause) Even. (pause) Bought. (pause) Begin number. One-six. End number. Begin spelling. M. B. End spelling. Of. (pause) Memory. (pause) To. (pause) Use. (pause) It. (pause) Period. (pause) It. (pause) Was. (pause) Useful. (pause) (right-click, correct mis-recognized "useful") When. (pause) I. (pause) Hurt. (pause) My. (pause) Wrist. (pause) And. (pause) Couldn't. (pause) Type.

      End dictation.

      With. Robust. Performance. Like. That. It's. A. Wonder. That. People. Besides. William. Begin spelling. S-H-A-T-N-E-R. End Spelling. Could. Use. It.

      --
      For more information, click here.
    6. Re:Microsoft Innovation by phasm42 · · Score: 1

      Destroyed the competition? You do realize that there are other speech recognition programs out there, like Dragon Naturally Speaking that work quite well? I'm sure they're glad that MS hasn't paid attention to this until now.

      --
      "No one likes working in a hamster wheel, and your shop smells of cedar shavings from here." - TaleSpinner
    7. Re:Microsoft Innovation by WuphonsReach · · Score: 1

      p133s were more likely to ship with 16-32MB of RAM. Rarely did you see one with 64MB or even 128MB. The 64MB machines were common in the late 90s and the 128MB machines were common just before Win2000 shipped (so 1998-1999). IIRC, I had a p90 with 16MB, but I might've upgraded it to 32MB or 64MB using EDO memory. My developer machine in 1999 came with 64MB stock and I convinced the IT group to up it to 192MB.

      (Currently retiring a bunch of late 90s Dells that have survived all previous attempts. Most came with 32MB or 64MB and were 200-450MHz machines.)

      A lot of the Dells made in the mid-late 90s (that I'm dealing with) could only support 32MB modules and only had 2 slots. A few rarer ones had 3 slots and could accept 64MB or 128MB chips. But those are 1999 models (give or take a year).

      --
      Wolde you bothe eate your cake, and have your cake?
  50. Which Vista promptly translates as: by ChePibe · · Score: 1

    double the niece's killers select all delete DIE!

  51. MS must have outsourced SR dept to India by Anonymous Coward · · Score: 0

    They hired the outsourcing tech support firm that just splitted with Dell. Vista probably understands Dell tech support lackeys just fine.

  52. Voice recognition requires some training regardles by glrotate · · Score: 2, Interesting

    That's so last century. NPR did a bit on the new Dragon Dictate 9. The NPR reporter got 100% accuracy out of the box, no training.

    Dictation Software Improves Usability, Accuracy

  53. It's CNBC... not MSNBC by Anonymous Coward · · Score: 0

    On the Money = CNBC

  54. This sounds so much like Microsoft by SmallFurryCreature · · Score: 3, Insightful
    How the fuck did this bug go unfixed for so long? It to me sounds to much like the old MS sale strategy of saying "Just wait please, do not buy X now, our product will be much better in the future."

    Yes bugs happen, yes vista is still in beta but rather then just admit "vista is still a buggy piece of crap software that can't even be used properly by its own engineers" they tell us to sit and wait because we can trust them to fix it.

    To MS credit, it is a strategy that works.

    --

    MMO Quests are like orgasms:

    You may solo them, I prefer them in a group.

    1. Re:This sounds so much like Microsoft by iceanfire · · Score: 1

      "This is a known bug in current builds, and has already been fixed by the audio team in their private builds in preparation for RTM."

    2. Re:This sounds so much like Microsoft by RzUpAnmsCwrds · · Score: 1

      How the fuck did this bug go unfixed for so long?

      Well, it depends on the bug. According to the article, the bug was intermittent, they knew that it existed, but they hadn't tracked it down and squashed it. Perhaps the bug was fixed in a newer release, but they were running an older release for the demo (would you do a demo with a largely untested nightly build?).

      Bugs are not always immediately apparent and are rarely as easy to fix as we like to assume.

  55. For those of us without sound... by wjcofkc · · Score: 1

    I have no speakers at the moment; what did the machine end up spitting out that was so funny?

    --
    Brought to you by Carl's Junior.
  56. I don't think so. by astrosmash · · Score: 1

    Yes, sales and marketing guys are asshats, but they're also professionals who usually spend a lot of time with the software rehearsing and choreographing their demos, because they're the ones who have to deal with the immediate fallout, after all.

    This demo didn't just drop a couple of words, or misinterpret an ambiguous sounding phrase, this was a complete melt down. A more plausible explanation is that the guy's voice was also amplified through the PA system in the room, and the computer's microphone was therefore picking up both the original voice and the delayed, amplified voice coming out of the PA.

    --
    ENDUT! HOCH HECH!
  57. You know... by JJJJust · · Score: 1

    He did say delete select all... *shrugs* As for that killer stuff... Vista knows when it's dead.

    1. Re:You know... by Emetophobe · · Score: 1
      He did say delete select all...
      That is a good point, so how does the software recognize the difference between a command like "delete select all" and someone wanting to write "delete select all" in the editor? I don't know how MS handles this, but if I were to program it, I would have a user-defined key that you have to press down whenever you want to issue a command, that way the program could recognize the difference.
  58. Oh my God! It's iListen! by Anonymous Coward · · Score: 0

    They must have licensed the technology from the same people iListen did. It was acting *exactly* the same as my experience with that useless piece of shit Mac software, including having a terribly hard time recognizing special commands it should be super good at, like "delete that."

  59. Loved the End of the Video by segedunum · · Score: 1

    It was funny, but the end of the video was funnier. Apparently Microsoft sent the heavies round and made it clear they weren't happy about the video being shown, and that the problem was down to background noise. However, CNN obviously found it funny, and the newsreader pointed out that it was a very quiet room until it started going wrong and people started laughing.

    "Live television is rough. Welcome to our world." she said. Ooooooh. Nice kick below the belt. Sounds like they're not keen on Microsoft at CNN.

    1. Re:Loved the End of the Video by tomstdenis · · Score: 1

      They have had the [very rare] crash at CNN before. Back around 2000 or so I remember seeing a few weathermaps totally lock up. And iirc I have seen a BSOD on a monitor in the background of a CNN broadcast.

      That said, they are very much a MSFT-shop.

      Tom

      --
      Someday, I'll have a real sig.
  60. On MSNBC's front page - for about 30 minutes.... by wowbagger · · Score: 5, Informative

    A friend of mine called me at work (since he knew that to access MSNBC's videos requires Internet Explorer, Windows Media 9 or better, and Flash, and I have neither IE nor WMP at home) and told me about this.

    I went to msnbc.com - and there it was, third on the list of videos on the main page.

    I called this to the attention of two of my coworkers, and we viewed the video - total elapsed time, maybe twenty minutes.

    Then I went to call it to the attention of a third coworker - and the video was no longer on the front page of MSNBC. OK, so maybe they've moved it off the front page, but it should still be on the Technology subsection, right?

    Wrong.

    Nor was it under Videos, nor anywhere else I could find it easily.

    Perhaps this was just a normal rotation of a video. Perhaps not. But no matter what the real cause, there is the appearance that it was removed from the page because it was too embarrassing. Not good for Microsoft.

    However, I will give MSNBC this - they didn't give Microsoft a free ride on this, they ribbed them pretty hard.

    However, I knew that this would be appearing on other sources as a video that could be viewed outside of Windows. Actually, I am rather surprised that it took this long.

    Now, as to the demonstration itself - it looks to me (a person who does signal processing and analysis for a living) like the presenter had the mike gain too high - every time he spoke he maxed out the bar graph on the display. *IF* he had the gain too high, and the audio was clipping significantly, that could make "mom" have enough of a pop to maybe sound like AUNT - especially if the software is using context to try to reduce the search-space for the words. Of course, that's why I would have a monitoring routine in the system, and if any of the samples are at 100% full scale, or if many of the samples are over 90% full scale, or the signal power is too high, I'd have my software adjust the mike gain down *and* flag an alert to the user. I'd also try to look for the mike element itself being overloaded.

  61. Oh Please by segedunum · · Score: 1

    Since when did Mum sound anything like Aunt?

    1. Re:Oh Please by tomstdenis · · Score: 4, Insightful

      Who knows how the algorithm they implemented works. Chances are the computer scientists behind it are not total asshats and they assumed the sales guy would follow the same procedure they did [e.g. to train it].

      Point is, if the sales guy had tried the system out beforehand he would have noticed it not working.

      That is, suppose the code is total shit [I know, big stretch for MSFT]. Then isn't it likely it would have failed during the preparation stage? If you are saying "mom" and it always comes back "aunt" you may want to cancel the presentation.

      That's why I think he didn't do any prep work for the presentation.

      Tom

      --
      Someday, I'll have a real sig.
    2. Re:Oh Please by cduffy · · Score: 1

      My understanding is that they use a neural network to determine what the most likely interpretation is. So, if its training data included "dear aunt" but nothing starting out "dear mom"...

    3. Re:Oh Please by NaugaHunter · · Score: 2, Insightful

      If you are saying "mom" and it always comes back "aunt" you may want to cancel the presentation.

      Or, in the very least, don't say 'mom'! I've had plenty of times where a salesperson tests something the day before a demo (usually after a week of knowing he had a demo, but that's a different rant) and finding something. Our usual response with that short of notice is 'well, don't show that' since we didn't have enough time to fix it. At best, we could send a version that had the error suppressed but not truly fixed.

      --
      R: That voice. Where have I heard that voice before? B: In about 365 other episodes. But I don't know who it is either.
    4. Re:Oh Please by Bing+Tsher+E · · Score: 3, Funny


      Who knows how the algorithm they implemented works.


      Probably nobody at Microsoft. . .

    5. Re:Oh Please by Lars+T. · · Score: 1

      Do you mean you have to train it all you dears so you can write a letter?

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    6. Re:Oh Please by m874t232 · · Score: 1

      Who knows how the algorithm they implemented works.

      Pretty much everybody knows how the algorithm works because pretty much everybody is using the same speech recognition algorithms. There are even some open source implementations that you can get that use roughly the same algorithms.

    7. Re:Oh Please by cduffy · · Score: 1
      Do you mean you have to train it all you dears so you can write a letter?
      No. All I mean is that it's likely to be weighted aganst things it hasn't seen in its training corpus (most of which would have happened before shipment, not after -- indeed, not all neural networks adjust their training during operation). Being weighted against something doesn't mean it's not possible -- just that it'll lend more weight to things that are in the corpus.
    8. Re:Oh Please by Peaker · · Score: 1

      That is, suppose the code is total shit [I know, big stretch for MSFT].

      Its not that big a stretch.
      Sure, they usually make utterly and horribly unusable piles of crap that they call code, but sometimes they manage to get to the level of "total shit" too. Hell, sometimes their code just "sucks", which is a relatively high level of quality in the software industry.

    9. Re:Oh Please by Anonymous Coward · · Score: 0
      Who knows how the algorithm they implemented works. Chances are the computer scientists behind it are not total asshats and they assumed the sales guy would follow the same procedure they did [e.g. to train it].

      Hahahahahahaha. Ahh-hahahahahah. Haha. Ahhhhh-hahahahaha. Yee, shit -- that's funny. Hahahahahahaha.

    10. Re:Oh Please by object88 · · Score: 1

      He didn't say "Mum", he said "Mom". In some localities, the "o" in mom is pronounced the same as the "au" in aunt... sort of an "aw" sound. And in each, the "aw" is the loudest part of the word, burying the "m"s in mom and the "nt" in aunt.

    11. Re:Oh Please by Anonymous Coward · · Score: 0

      Simmer down bitch - they sound exactly alike for some of us!

  62. Try calling the USPS by Anonymous Coward · · Score: 0

    Try calling the post office. Their entire menu is voice activated. "If you want to track a package say Track Package". Worked fine for me the other day. Guess they're not using Windows huh?

  63. microphones by Anonymous Coward · · Score: 0

    If the computer was listening to the same microphone that the sales guy was using to address the crowd, it seems like ambient noise and echo was low. If it was using another microphone, there's no telling what it heard.

    The infamous Howard Dean scream was not loud compared to the room of screaming people and he didn't look like a lunatic until later when people heard him recorded through the mic he was holding, without the crowd noise. A person (or computer) in the room would have heard something completely different.

    What I heard on the video is only evidence of that particular microphone, not of what the computer was hearing. Broadcast media is almost as arrogant as Microsoft.

  64. I think it was command mode by Anonymous Coward · · Score: 0

    It was an audio gain issue

    I don't know. I have a laptop with the tablet PC edition of windows. It has voice recognition. When you mess up and want to delete something, you have to say "command mode" first. If you just say, "delete that" on my laptop, then it will type out the words "delete that" But if you say, "command mode" it will beep. then you can say delete, select all, whatever.

    That's what I was thinking when I saw the video.

  65. James Joyce by skinfaxi · · Score: 1

    I tested a tablet a couple of years ago that had MS handwriting and voice recognition built in. You have to "train" the voice recognition by reading pre-made texts out loud (strangely, they were mostly about how great Microsoft is). Even after the training, the results were like the computer was channeling James Joyce. It especially had trouble with short words like "our." If the presenter had said, "Dear Mother, please rendezvous with me in Luxembourg" it would have been just fine.

  66. That's REALLY sad... by Khyber · · Score: 1

    My Cell Phone's voice recognition hasn't given me a problem, EVER. Perhaps Microsoft could learn a thing or two from the mobile industry?

    --
    Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
    1. Re:That's REALLY sad... by Ant+P. · · Score: 3, Interesting

      Is this "voice recognition" as in "understand English vocabulary and grammar rules and differentiate between speech and commands" or as in "match this sound up with one of 10 prerecorded ones to autodial a number"?

    2. Re:That's REALLY sad... by Repugnant_Shit · · Score: 1

      My Moto 710 figured out what my contacts' names sounded like without me having to train it like on an LG phone.

    3. Re:That's REALLY sad... by Khyber · · Score: 1

      More like "Try to match over three-hundred different names, some of them sounding the same, and asking you to say it in specific tones so it can understand which one you meant," kinda like how voice recognition is SUPPOSED to work, you know, by using inflections, variations in tone or enunciation? Ain't no ten prerecorded sounds in this day and age, and hasn't been since about 5-6 years ago when Sprint started advertising it as a feature of their new phones.

      --
      Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
  67. well gee I didn't expect this by Gno · · Score: 0

    Im sorry but it seems like this has happend before. A half-assed operating system that doesn't do everything they planned. Lets jsut watch all the pieces fall off vista until it becomes a switch similar from 98 to ME. Hell, im still waiting for 95 to do what they said it would.

    --
    It's not -1 Flamebait! It's +5 Funny. You just didn't get the joke...
  68. Audio Gain Settings Caused the Problem by ThinkFr33ly · · Score: 2, Informative
    As much as many of you would like to believe that the reason this demo failed was because Microsoft code is horribly designed and implemented, and that they are completely incompetent, there just might be a slightly more realistic explanation for the demo's abject failure.

    According to Rob Chambers, a developer on the Vista speech recognition team, the failures during the demo were caused by audio gain issues.

    From his blog:

    If you watch the video clip on MSN Video you can see in the speech user interface that the microphone "volume" is very high. It pushes up into the red frequently while Shanen is speaking to the computer. That's caused by the fact that the audio sub-system wasn't respecting the audio gain settings we've asked it to use.

    This is a known bug in current builds, and has already been fixed by the audio team in their private builds in preparation for RTM.


    Read the entire blog post for a more complete explanation of what happened... one that's just slightly more plausible than most of the explanations proffer by your fellow Slashdotters.
    1. Re:Audio Gain Settings Caused the Problem by Anonymous Coward · · Score: 0

      That's caused by the fact that the audio sub-system wasn't respecting the audio gain settings we've asked it to use.

      To rephrase that: "it wasn't a bug in OUR code. It was some other Microsoft code that we had no control over." As the CNN reporter said, "Welcome to our world."

    2. Re:Audio Gain Settings Caused the Problem by NtroP · · Score: 1

      He also said it worked flawlessly in all the dry-runs. I've been through situations like this before and I've seen other professionals in binds like this before. Here's what you do: You hit the little switch that activates the backup system, grab the other headset, make a joke like "now that you've seen what our top competitor can do... let me show you how our product works!", and pray to God the backup system comes through.

      Stuff like this happens all the time. I've even seen it happen to His Steveness(tm). The difference is, Steve ALWAYS has a hot-standby ready to go. I think he even has someone following along on it in the background so that it's sitting at a point in the demo where he needs to take off from (although I can't be sure, it happens so rarely). The point isn't necessarily which software is better. The point is that Steve is the consummate professional when he's doing important presentations and leaves nothing to chance.

      After seeing the amount of damage a single screw-up like this can do to a company's reputation, it shouldn't take too long before they realize how important going the extra mile and having redundancy built into the demo is.

      --
      "terrorism" and "pedophilia" are the root passwords to the Constitution
    3. Re:Audio Gain Settings Caused the Problem by Minwee · · Score: 1

      So it's not that there were unexpected hardware problems or any lack or preparation, it's just that they really were demonstrating a bug-ridden piece of crap that should never have been let out of its cage.

      I suppose that some times the simplest explanation really is the right one.

    4. Re:Audio Gain Settings Caused the Problem by __aaluww2118 · · Score: 1

      But wait, even if the whole "the gain was too high so it was distorting" is true it makes the whole fiasco even worse. You mean speech recognition DOES NOT test to see whether the audio is horribly distorted before it feeds it into the recognition engine? I mean come on, just a simple test for clip and too low a level would be pretty basic.

    5. Re:Audio Gain Settings Caused the Problem by IdntUnknwn · · Score: 1

      I'd like for you to point out *any* speech recognition systems that perform such a test. Are you saying that they're all stupid?

  69. MOD PARENT UP! by JoeLinux · · Score: 1

    I'm the original poster...This guy put coffee all over my LCD. (bastard)

  70. Re:Is SR ever going to be good enough? -- Yes! by oblique303 · · Score: 5, Informative

    I use Dragon NaturallySpeaking every day (carpal tunnel syndrome), and version 9 has around 99% accuracy, with around 98% out-of-the-box with no training. This means 10 or so errors out of a 1000 word dictation.

    I didn't believe it either, until I actually tried it. Dragon is the first worthwhile speech recognition solution I've seen that's practical for general use (Though I'd love if they'd release a "programmers" version to compliment the Medial/Legal versions). I get about 99% accuracy (a decent microphone is *very* important!)

    Dragon 9 also doesn't "technically" need training, but accuracy further improves if you do bother to train it a bit. The NYT reviewer was able to get 99.6% accuracy after a short training session.

    Here's a few reviews of version 9:

    http://www.nytimes.com/2006/07/20/technology/20pog ue.html?ex=1154318400&en=6fd795114b3f72ea&ei=5070

    http://www.npr.org/templates/story/story.php?story Id=5577523

  71. Sphinx by muyuubyou · · Score: 1

    http://cmusphinx.sourceforge.net/html/cmusphinx.ph p
    http://sourceforge.net/projects/cmusphinx/

    It's been around for a while. I think it's pretty good, though quite resource-demanding. The peeps at the tech-report usually benchmark it.

    Here for instance:

    http://techreport.com/reviews/2006q3/core2/index.x ?pg=12 -core 2 duo

    http://techreport.com/reviews/2002q1/athlonxp-2100 /index.x?pg=6 -early Athlon XP vs P4
        (middle of the page)

    As you can see, we can do real-time sphinx now.

    1. Re:Sphinx by Millenniumman · · Score: 1

      Is there an OS X frontend?

      --
      Stupidity is like nuclear power, it can be used for good or evil. And you don't want to get any on you.
    2. Re:Sphinx by muyuubyou · · Score: 1

      Last time I checked, I think there were no binaries. Just compile it, I believe macOS X is supported.

  72. Actually, they are very similar by Anonymous Coward · · Score: 0, Interesting

    One pronunciation of 'aunt' (the less common in US) uses the exact same vowel sound, and 'm' and 'n' are very similar. If the salesgenius had trained it to his (probably the more common) pronunciation of 'aunt', he wouldn't have had the problem. I suspect MS's program figures that in the salutation, 'aunt' is one of the more common words. That's why it got 'dear' correct, else it might've said something like 'tear'.

  73. Developers now test.. by novus+ordo · · Score: 1

    ..chair recognition

    --
    "You're everywhere. You're omnivorous."
  74. All Your Base? by BumpyCarrot · · Score: 0, Offtopic

    C'mon, peeps, the year is 2006. AYBABTU is dead and buried.

    --
    Do you see what I did there?
  75. Live demos are good things. by dpbsmith · · Score: 1

    If a product is ready for the world, it absolutely should be ready for a live demo. It shouldn't be just barely ready to work, under perfect conditions, minutes after careful testing by technicians.

    In fact, they should have called up a volunteer from the audience... preferably a member of the press so you'd know it wasn't a confederate... to do the talking.

    We realize things can go wrong under the best of circumstances. But it is still a completely valid test of the company's confidence in their product. In the real world, conditions will be far worse, there will be no technicians around, the software will be running in an environment that's had three security patches and two other major products installed on top of it, and so forth.

    In the days of live TV there was a program, it might have been Ed Sullivan, in which they regularly ran commercials for Timex watches, which, they said, "takes a licking and keeps on ticking," in which they put a Timex through various torture tests. It didn't always survive: the one I remember was the one where they buckled the watch onto the blade of an outboard motor's propellor and ran the motor in a tank of water, onstage, live. When they finished, at first they couldn't find the watch: the strap had broken. The propellor had flung the watch into a corner of the tank. And it wasn't ticking. But mostly, the Timex demos worked.

    On the Steve Allen show, they would regularly demonstrate Polaroid cameras. This was in the days before the cameras were motorized and the processing operation was tricky: you had to pull a long strip of paper-film sandwich firmly against fairly stiff resistance, wait ninety seconds, open the camera back, get a fingernail into a slit, and peel the perforated picture base away from the backing. Occasionally they had problems. As with the Timex demos, you got a completely convincing picture of the product's reliability and usability, and the company's confidence in their products.

    In Jack London's "The Sea-Wolf," a character asks "Do you know Dr. Jordan's final test of truth?" and answers: "Can we make it work? Can we trust our lives to it? is the test."

    I don't say Microsoft should wait to ship until they are ready to trust their lives to Vista voice recognition, but they should darn well be prepared to demonstrate it, live, in public.

    1. Re:Live demos are good things. by LarryWake · · Score: 1
      In fact, they should have called up a volunteer from the audience... preferably a member of the press so you'd know it wasn't a confederate
      Because that's how we can absolutely guarantee that there's no hanky panky involved.
    2. Re:Live demos are good things. by dcam · · Score: 1

      Live demos are fantastic.

      I work on a web application and occasically I train people in using it or demo it to people as part of a sales pitch. Without fail I find bugs, visual inconsistancies or I think of ways that I can make significant improvements. It is something about the change in environment that makes you look at things through new eyes.

      I used to think it sucked. Now I am wondering whether we should make it part of the testing cycle :).

      --
      meh
  76. WWDC by Coppit · · Score: 1

    What would be awesome is if Steve Jobs used the Mac's voice recognition to have it type out "Dear Aunt double the killer delete select all"

  77. Where was this done? by houghi · · Score: 1

    In some states your mom IS your aunt.

    --
    Don't fight for your country, if your country does not fight for you.
  78. This quote sums up the problem: by dudeman2 · · Score: 1

    "It's easy to wreck a nice beach"

  79. Download?! by Yvan256 · · Score: 1

    Google Video used to have a "save file" feature (even had PSP/iPod/etc. choice). Where is it?

    1. Re:Download?! by tlacuache · · Score: 1

      I was looking for this feature as well. After a while, a little googling revealed that there are some great Greasemonkey scripts that will allow you to do this from Google, YouTube, iFilm, and others. I don't have the link, but you should be able to find it on userscripts.org.

  80. Since when did Mum sound anything like Aunt? by tomhudson · · Score: 5, Funny

    Since when did Mum sound anything like Aunt?

    ... probably in the same jurisdictions where "wife" is spelled "first cousin" or "sister".

    1. Re:Since when did Mum sound anything like Aunt? by gstovall · · Score: 1

      Mum? We're not in the UK or Australia...try "Mom"...

      While there are some jurisdictions that allow first cousins, I'm not aware of any that allow marriage to sisters.

      Most ofolks I know pronounce "aunt" as "ant"
      Some of my easter ozark relatives pronounce "aunt" as "ain't"
      Some of my northern relatives pronounce "aunt" as "awnt"

      So, with my northern relatives, "mom" and "aunt" have the same vowel sound.

      And of course, "soft drink" is variously pronounced "pop", "soda", "sodie", and "coke" (in Texas, every soft drink is pronounced "coke")

    2. Re:Since when did Mum sound anything like Aunt? by tomhudson · · Score: 1

      Okay, a little bit of word play here ...

      Bet you I can prove to you that most jurisdictions allow marriage to sisters.

      See if you can figure it out before I post the answer.

    3. Re:Since when did Mum sound anything like Aunt? by Anonymous Coward · · Score: 0

      Mum? We're not in the UK or Australia...

      Speak for yourself. Or if you don't have anything constructive to say, stay mum.

    4. Re:Since when did Mum sound anything like Aunt? by acariquara · · Score: 1

      Simple, you can always marry someone's sister, just not yours.

      --
      Dear aunt, let's set so double the killer delete select all
    5. Re:Since when did Mum sound anything like Aunt? by penix1 · · Score: 1

      "Bet you I can prove to you that most jurisdictions allow marriage to sisters."

      Sure they do....As long as it is someone else's sister. OTOH, I think the Catholic faith may have an issue if you try to marry one of their sisters....

      B.

      --
      This is a sig. This is only a sig. Had this been an actual sig you would have been informed where to tune for more sigs.
    6. Re:Since when did Mum sound anything like Aunt? by tomhudson · · Score: 1

      "Simple, you can always marry someone's sister, just not yours."

      Bingo! Now for the bonus round ... is it legal for a widow's husband to marry his former wife's sister?

    7. Re:Since when did Mum sound anything like Aunt? by WilliamSChips · · Score: 1

      Only if he has a sarcophagus.

      --
      Please, for the good of Humanity, vote Obama.
    8. Re:Since when did Mum sound anything like Aunt? by Planesdragon · · Score: 1

      Bet you I can prove to you that most jurisdictions allow marriage to sisters.

      You mean step-sisters? Yeah, they do. I think you can even marry your adopted sibling, if you really push at it.

      (Sheesh. What morons we have on /.)

    9. Re:Since when did Mum sound anything like Aunt? by cyber-vandal · · Score: 1

      A widow's husband is dead, can I have my free iPod now please

    10. Re:Since when did Mum sound anything like Aunt? by Nossie · · Score: 1

      and if we are in the uk here ? just because its an american site doesnt mean everyone has to speak american :P

      repeat after me "mum" ... try again "mum" ... once more "mummy"

      got it ?

      well done!

    11. Re:Since when did Mum sound anything like Aunt? by Firehed · · Score: 1

      We're far enough OT, but might as well say that the answer's no because dead people aren't allowed to wed.

      --
      How are sites slashdotted when nobody reads TFAs?
    12. Re:Since when did Mum sound anything like Aunt? by tomhudson · · Score: 1

      I think you missed the joke ... anyone can marry a sister anywhere - just not their own sister. Read the rest of the comments, and note the reply to the question of whether its legal for a man to marry his widow's sister - his former sister-in-law :-)

    13. Re:Since when did Mum sound anything like Aunt? by tomhudson · · Score: 1

      This is true, but you'd be surprised at how, if you get a group of people together, they'll invariably miss that point.

      And when you try to explain it to them, there are those who "get it", and those who go "that doesn't make sense! why can't he still marry his former sister-in-law".

      People in the latter group also don't grok indirection, pointers to pointers, array math, or recursion, at least not very well.

    14. Re:Since when did Mum sound anything like Aunt? by gstovall · · Score: 1

      I was speaking for myself, and for the interests of clarification in the comparison...I was trying for a humorous way to bring us back to the point that the comparison was between "Mom" and "Aunt", not "Mum" and "Aunt".

      Mum is a (chiefly British) abbreviation of "Mummy", like someone else said. The primary definition of Mummy relates to dead Egyptians, while there are a whole host of definitions for "mum" that are more used than "abbreviation of Mummy (Mommy).

      In response to another poster, I did not presume we all were in the USA. However, the whole point of the article is that the software confused "Mom" with "Aunt". Nowhere does "Mum" come into this demonstration.

      Got it? :)

    15. Re:Since when did Mum sound anything like Aunt? by Anonymous Coward · · Score: 0
      ... probably in the same jurisdictions where "wife" is spelled "first cousin" or "sister".

      Only in Arkansas.
    16. Re:Since when did Mum sound anything like Aunt? by ozbird · · Score: 1

      ... probably in the same jurisdictions where "wife" is spelled "first cousin" or "sister".

      The banjos interfere with the speech recognition - where else do you think they got the Crazy Frog lyrics from? ("Di-di-ding ding ding ding ding ding ding.")

    17. Re:Since when did Mum sound anything like Aunt? by tomhudson · · Score: 1

      For those who don't get the references

      Awsome quotes ...

    18. Re:Since when did Mum sound anything like Aunt? by CrankyOldBastard · · Score: 1

      I at least am in Australia, so keep your "we're not in the UK or Australia...try "Mom"..." comment back in your 3rd world ghetto where it belongs.

      Seriously, you have to start to realize that people outside the USA do own and use computers, and that we can (and do) read and post to /.

    19. Re:Since when did Mum sound anything like Aunt? by Anonymous Coward · · Score: 0
      Bingo! Now for the bonus round ... is it legal for a widow's husband to marry his former wife's sister?

      Yes. In all states which approve of necrophilia.

    20. Re:Since when did Mum sound anything like Aunt? by tomhudson · · Score: 2, Funny

      I think the government has a monopoly on f*cking the dead.

    21. Re:Since when did Mum sound anything like Aunt? by cyber-vandal · · Score: 1

      I'm in both groups then since I don't understand about half the last sentence :P

    22. Re:Since when did Mum sound anything like Aunt? by gstovall · · Score: 1

      Where does all this "US citizens are idiots" sentiment come from?

      Hmmm...I work for a large multinational corporation, and I daily work with people in Ireland, Canada, Turkey, India, China, and Australia. I think I realize people in other countries do own and use computers...

      Let's bring it back to the issue at hand instead of stretching for reasons to do more USA bashing.

      The word was "Mom", and it was mis-interpreted as "Aunt". Someone went off talking about "Mum", and I was trying to use mild humor to bring it back to "Mom" from "Mum", and I start getting all this anti-USA garbage.

      Did I say ANYTHING ANYWHERE about other countries being bad or non-existent? I don't think so.

      With all the USA bashing I read in here daily, I wonder how it would come across if we really did start saying bad things about other countries, rather than it all being in fevered imaginations...

    23. Re:Since when did Mum sound anything like Aunt? by greginnj · · Score: 1
      People in the latter group also don't grok indirection, pointers to pointers, array math, or recursion, at least not very well.
      God, you must be really fun at parties...
      --
      Read the best of all of Slash: seenonslash.com
    24. Re:Since when did Mum sound anything like Aunt? by tftp · · Score: 1
      The question is unclear, that's why you are getting garbage out.

      The question does not specify which of widow's husbands it is talking about. The husband #1 is dead, so he can't marry anyone. The husband #2 - if the widow remarried - is free to divorce her and marry anyone else in general; relation to his former wife is irrelevant.

      This is a tricky question in part because a widow can be called a widow even if she marries someone else.

    25. Re:Since when did Mum sound anything like Aunt? by tomhudson · · Score: 1

      If she remarried, she is no longer a widow - she's a married woman (thugh she will still be husband #1's widow).

      Same as if you divorce someone, then remarry. You're not divorced, you're married (though you are still your previous spouse's ex).

    26. Re:Since when did Mum sound anything like Aunt? by tomhudson · · Score: 1

      People in the latter group also don't grok indirection, pointers to pointers, array math, or recursion, at least not very well.
      God, you must be really fun at parties...

      Well, I do have a large collection of jokes, puns, etc., and if you wait long enough, or drink enough, one of them is bound to be funny.

  81. Hax0ring by peterfa · · Score: 1

    With this new technology, hacking Vista is going to be pathetically easy.

  82. MOD UP... by Anonymous Coward · · Score: 0

    Wish I had mod points...

  83. Speech Recognition... by EddyPearson · · Score: 1

    This isn't a new technology (as Microsoft would like you to believe), speech recognition has been around for well over 5 years (innovation???)

    I had a copy of DragonSpeak about 3 years ago, it never made such simple mistakes (actually, i would recommend it!) and the computing world has come on leaps and since then.

    To me, at the moment, Microsoft's (specifically Vista) latest dev model seems to resemble a badly organised, feature saturated, open sourced effort. Think of all those bloated applications, with thousands of features, none of which work properly, and the aim of the project (those KEY features) have been so buried and sidelined, that they are buggy as well.

    Can we not persuade them to use the best of of OS and (dare i say it) the best bits of closed source to actaully come up with a product that people will be happy to upgrade to, as opposed to a usergroup who don't like the product but are forced to upgrade anyway.

    --
    You feel sleepy. Close your eyes. The opinions stated above are yours. You cannot imagine why you ever felt otherwise.
  84. "A keyboard? How quaint." by scottv67 · · Score: 1

    Has Microsoft taken into account how this small change will affect the transparent aluminum industry?

  85. That's no fun... by jrothwell97 · · Score: 1

    ...there is some REALLY funny stuff that comes out ot speech-to-text. For example:

    On a mobile phone that was being demonstrated in Italy just after the Pope had been elected, an Italian spoke into it with a strong accent that the machine wasn't accustommed to. The speaker said "Congratulations to our new Pope."
    The phone repeated: "Congratulation to our new poke."

    Who says computers don't have a sense of humour?

    --
    Those using pirated Tinysoft signatures(TM) are a real threat to society and should all be thrown in jail.
  86. Probably a bad Mic. by jcr · · Score: 2, Informative

    When I was last involved in adding speech control to an app, I attended a developer workshop at Apple, and found out much to my surprise that my mic wasn't any good. It sounded fine when I used it for voice recording, but for recognition the gain curve was all wrong. When I tried one of the mics that the speech team from Apple provided, the hit rate went from under 20% to well over 90%.

    When Kim Silverman demos Apple's speech recognition, he uses a high-quality noise cancelling mic. It makes all the difference.

    -jcr

    --
    The only title of honor that a tyrant can grant is "Enemy of the State."
  87. Re: Speech Recognition vs. Fiction? by llthomps · · Score: 1

    Each command was prefaced with "Computer", and seemed to have a common vocabulary to it.

    They never said "Computer - what's up with that?" It was always "Computer - Find all data pertaining to [a specific event]". It seem what they had in fact mastered in the future was not only voice recognition, but also a common language programming and database system.

  88. Bad News by PixelScuba · · Score: 2, Funny

    My mom is in a coma.

    My aunt is in a,

  89. C'mon guys, get going by Mateo_LeFou · · Score: 1

    Google search for "Let's set so double the killer" I want to see 95 youtube remixes by monday.

    --
    My turnips listen for the soft cry of your love
  90. Computers and Language by CarbonRing · · Score: 1

    In the 90's, a bit after Apple released the Newton, I saw a demo by the handwriting recognition team at Microsoft. They were very proud of their work, which they promised would be much better than Apple's. As part of the demo, the presenter wrote "Windows Rules" on the tablet, which the software recognized as "Windows Pales". That got a pretty embarrassing round of laughter considering it was an internal demo.

    To be fair, I'll throw in a more recent Apple story. I ordered a Mac Book Pro the day they were announced, and when it arrived it had the now famous squeal. I called Apple's 800 number and endured the automated voice menu:

    Apple Robot: "Say the name of the product you are calling about."
    Me: "Mac Book Pro"
    Apple Robot: "Did you say Apple Cinema Display?"
    Me: "No"
    Apple Robot: "I'm sorry. Please say the name of the product you are calling about"
    Me: "Mac Book Pro"
    Apple Robot: "Did you say Unix?"

    I got one of the very first shipping units, so they obviously hadn't added "Mac Book Pro" to the known list of products yet, but it was hysterical how far away the nearest matches were.

  91. Re:So? These two worked by danigiri · · Score: 1

    These two demos worked perfectly:

    - http://video.google.com/videoplay?docid=-413444611 2378047444&q=Motorrider&pl=true

    - http://maclive.net/sid/135

    Wonder what internal build of Vista they used...

  92. I've also noticed this as a beta tester by Lonath · · Score: 2, Funny

    Every time I say the word "Linux" it gets typed out as "Windows". Go figure.

  93. Actually, not really by jpardey · · Score: 2, Funny

    The PR guy was demoted. Microsoft put him in a box with a keyboard and called him "Vista Speech Recognition." See, such technology is in reach!

    --
    I have freaks! I did something right...
  94. Vista Speech by JerryLs · · Score: 1

    The video is funny, but I've tried the Vista public beta, and I loved the speech recognition. Outside of minor beta bugs, it worked flawlessly. I only wish it were available now.

    --
    Ad Astra Per Asper
  95. For the biologist... by Anonymous Coward · · Score: 0

    I can hardly wait for the days when "Deoxyribonucleic acid" is synonymous with "dead ox E ride O nuclear lick ass lid aunt."

  96. Tom by DRetta · · Score: 1

    http://slashdot.org/~tomstdenis http://slashdot.org/~tomstdenis Last 16 comments. You must really hate MS. I think you're winning.

    --
    I hear there's rumors on the Internets that we're going to have a draft.
  97. Re:Is SR ever going to be good enough? -- Yes! by sunwukong · · Score: 1

    Ok -- might as well finish the ad. What do you recommend for microphones?

  98. Already exists. by PxM · · Score: 1

    However, it's done passively (e.g. without electronics) in system using a single mic with two openings (pretty flash demo of the tech at: http://www.theboom.com/technology.html ). It was actually designed for voice recognition on the Wall Street trading floor but is now used in Black Hawk helicopters. List the to demo sound files: http://www.theboom.com/theboomO.html

  99. HURRRR they should have open sourced it by Anonymous Coward · · Score: 0

    then The Free Open Source Community could have improved it and made it better!!!!

    Linux!

  100. Voice recognition is overhyped as a concept by zielaj · · Score: 1

    To start with, I've never used any voice recognition software. Nevertheless, I think voice recognition is way overhyped as a concept for word processing, even assuming it works flawlessly. Here is why.

    First of all, spoken English is very different from written English. Try to record a typical conversation and then write it down exactly as you hear it. You'll notice almost no structure, very short phrases, no full stops, no commas, etc. You can't send such a "document" to anybody.

    So, if want your text to have any quality, you'll spend much more time editing it than actually writing new words. Now try to edit your document efficiently using voice recognition software. Marketing voice-recognition as a keyboard replacement for producing documents ("it's so fast and easy, you don't even have to learn how to type") is just bullshit.

    Sometimes, you don't need quality, for example when writing a short note to a friend. But isn't it then easier just to record yourself and let the receipient listen to your message directly (like voicemail)? What's the point of having a computer do voice recognition if humans can do it much better?

    Finally, voice recognition defies privacy. Do you seriously want to dictate an intimate email/letter to your girlfriend? Why do you think text messaging got so popular?

    I see voice recognition useful in two cases: (1) if the final receipient is a computer system, for example, star-trekish "computer, dim the light in this room", (2) the author is disabled so that he/she can't use the standard keyboard.

    1. Re:Voice recognition is overhyped as a concept by Bohemoth2 · · Score: 1
  101. Not the first MS demo embarrassment. by MsGeek · · Score: 3, Informative

    Actually the last MS demo flame out of this magnitude came when the beta of Windows 98 was being demoed. They wanted to show a scanner "just working" with Windows 98 and USB. W98 hit the blue screen of death when the USB scanner was plugged in.

    There, I found it. The file is an old QuickTime movie. I'm going to put this up on YouTube. There, that's done. Have at it.

    --
    Knowledge is power. Knowledge shared is power multiplied.
    1. Re:Not the first MS demo embarrassment. by Schraegstrichpunkt · · Score: 1

      To be fair to MS, voice recognition is hard to get right, and even theoretically is basically impossible to get 100% right. Writing a working scanner driver is vastly easier, and much more embarassing when screwed up, IMHO.

    2. Re:Not the first MS demo embarrassment. by l33t+gambler · · Score: 0
      To be fair to MS, voice recognition is hard to get right, and even theoretically is basically impossible to get 100% right. Writing a working scanner driver is vastly easier, and much more embarassing when screwed up, IMHO.

      Not when you write for Windows 98 and its VXD driver subsystem and try to make it "plug and play." Windows 2000 brought us WDM and Windows Vista (NT 6.0?) we have LDDM. When will they do it right? When they let each driver have its on folder with its files and settings and executable (i.e control panel). Just delete the folder and your PC is once again clean. Now we have "driver cleaner" programs and advices to completely reinstall Windows if you have uncommon problems after a driver upgrade.
      --
      Teasing the nobles, and rightfully so!
  102. WTF??? by soft_guy · · Score: 1

    Dragon Systems had better speech recognition than that years ago. Plus, they declared bancruptcy several times? or something since the late 90s. Why the hell wouldn't Microsoft just buy them instead of trying to roll their own (and doing their usual suckage job of it.)

    --
    Avoid Missing Ball for High Score
    1. Re:WTF??? by Anonymous Coward · · Score: 0


      Well, Dragon Systems was purchased by L&H, which was then purchased by ScanSoft, what has recently become Nuance. The current value of Nuance is $1.87B.

      What's the usual size (or even what has been the max size) of companies that MS has purchased in the past?

      Todd

  103. Background noise? by Cr0t · · Score: 0

    Background noise? I think I can hear chairs flying!

  104. Here's mine... by lullabud · · Score: 1

    Blonde Speech Recognition Engine

    Now where's your demo?

  105. and in other news... Larry Flint Joins Microsoft by theshowmecanuck · · Score: 1
    Nope, I'm sure that this is just an accident by a company that spends its off hours petting little baby chickens and bunnies.

    Just announced, Microsoft, a known petter of baby chickens and bunnies has just announced the signing if Larry Flint. Said a Microsoft spokesperson, as a lover of chickens, we thought Larry might be a good, errr, fit.

    --
    -- I ignore anonymous replies to my comments and postings.
  106. C'mon guys... grow up will ya? by Anonymous Coward · · Score: 0

    It seems that all of you are just a bunch of geeks trying to kill Money$oft, you take every mistake of microsoft as a capital sin... but you forget one thing, our beloved Open Source is way toooo buggy, from the freezing of a day to day application as gaim to the problems of Flex. So, if it's not about antitrust, can you please Microsoft alone and grow up at least a little?

  107. Re:Voice recognition requires some training regard by Reaperducer · · Score: 1

    Regular people who speak at work like NPR reporters and anchors usually end up getting sent off for drug tests.

    --
    -- I'm old enough to have lived through six different meanings of the word "hacker."
  108. One word sums up the future of Vista by Mr.+Cancelled · · Score: 1

    OSX

    Flame away gentlemen...

  109. Someone forgot to proofread before submitting by Baloo+Ursidae · · Score: 1
    It seems even MSNBC is willing to take a jab on those rare occasions when Microsoft products don't work.

    s/rare/frequent

    --
    Help us build a better map!
  110. Re:On MSNBC's front page - for about 30 minutes... by Anonymous Coward · · Score: 0

    Yes, because when you saw it first was obviously when it was put up. You are the worst troll on Slashdot, but keep trying!

  111. Speech recognition may not be the problem by Anonymous Coward · · Score: 0

    I think the main problem may not be the program, it's probably the developer who doesn't realize that mom and aunt should be different people, which makes me worry about the implications of microsoft trusting people that inbred

  112. MSNBC = .82NBC + .18MSN by BMIComp · · Score: 1

    I think the concept behind MSNBC was originally to be a sort of tech TV channel, as it emerged right around when the internet was coming into the mainsteam But that whole concept flopped.. and eventually they just became a 24 hours news network.

    Originally, the ownership of MSNBC was 50% microsoft and 50% NBC, but back in late 2005 NBC bought 32% of microsoft's share in the company. So, Microsoft really doesn't have a controlling stake in the company.

    Although... NBC has always said that Microsoft doesn't have editorial control.

  113. Training? Latest Dragon s/w: maybe not! by ansak · · Score: 1
    I was going to include a link but now I can't find it...

    But I know I heard an interview on radio regarding Dragon Naturally Speaking version 9 which didn't require any training at all. The only way to lose it was to have an interview and expect it to render both people's speech in real time. That was a mistake.

    cheers...ank

    --
    Still hoping for Gentle Treatment...
  114. Re:Is SR ever going to be good enough? -- Yes! by Phisbut · · Score: 1

    I have rarely used a speech-recognition software before, but all the experience I've had thus far is that one needs to speak in a very artificial way in order to use those. I've heard about this Dragon thing being very good now, but does it allow you to speak naturally (as you would to your buddies), or do you still need to adapt your speech to the machine and sound like a retard when you dictate?

    --
    After 3 days without programming, life becomes meaningless
    - The Tao of Programming
  115. Hear, hear! by Anonymous Coward · · Score: 0
    This is why sales people are asshats. They're unprofessional non-technical people who sap back the high life while the rest of us have to put up with the mess they create through their daily barrage of verbal diarhea.
    To say nothing of self-appointed gearheads who take breaks from putting up with aforementioned mess to pout on the Internet using banal tropes they learned from Dilbert. select all delete that Ah yes, quite.
  116. Good thing NASA never used this by IlliniECE · · Score: 0

    Just imagine.. "Houston, we are go for launch" --> "Houston, we are gone for lunch"

  117. O S / 2 did it right over 10 years ago!!! by cyberspittle · · Score: 0

    Reminds me of a tv show called the "Computer Chronicles" with Stewart Cheifet. Don't take my word for it, see for yourself at the Internet Archive:

    http://www.archive.org/details/OS2Warp

    The voice dictation is at the end. Of course, this was an add in product. Later in 1996, with OS/2 Warp version 4, it was already included. Too bad IBM doesn't sell OS/2, you'll have to buy eComstation from Serenity Systems instead.

  118. You say pay-tent I say pa-tent by pbhj · · Score: 1

    >>> For instance, the word "patent" is pronounced differently in the UK from North America. In the UK it is "pay-tent" and over here it's "pah-tent". That's just one example.

    I'm in the UK, from North Nottinghamshire (for my first 18 years). I prononce the word "patent" as pa-tent (with a short a as in "apple" [a-pul]).

    Perhaps I'm wrong. I was a Patent Examiner for several years though - people did correct my pronounciation when I told them where I worked (!) ... "oh you mean the pay-tent office" ... from what I recall most Examiners used pa-tent.

  119. I have a better solution by syukton · · Score: 1

    I have a better solution, and I'm surprised that it's taken so long for nobody else to come up with it. If I weren't so poor and unemployed right now, I'd patent this idea because it's a real winner. Are you ready? Wireless in-ear cranial vibration sensing microphones. The only ambient noise that will be picked up is the sound of you chewing some gum, so don't chew when you're talking to HAL.

    The coolest thing about this idea (I think) is that each mic could be coded to a specific user, and then fifty people could all be talking to the same supercomputer at the same time, and the computer could distinguish based on their user ID and not have to distinguish voices at the audio level at all. I've talked about this idea elsewhere in the past, and I'm still pretty surprised that nobody is doing it.

    Speech recognition (speech-to-text) is pretty trivial without training as the newest release of Dragon Naturally Speaking, version 9, demonstrates. With training, it can become nearly flawless. Speech-to-text is another area where AT&T's research lab has made some pretty good progress. I'm looking forward to the future of voice-interaction with my computer with a certain optimism, especially if somebody will just (please) do the cranial-mic thing. (Contact me for licensing! My idea! heh.)

    --
    Reinvent the wheel only at either a lower cost, greater effectiveness, or your own personal enrichment and satisfaction.
  120. lettuce by mennucc1 · · Score: 1

    truth is, it is almost impossible for a computer program to understand English just using phonemes: some phrases are indeed pronounced in the same exact way, the most famous example being "lettuce spray" and "let us pray". Some state of the art research is indeed trying to peruse accents and intonation, to distinguish such phrases.

  121. Not possible for computers to recognize speech by master_p · · Score: 1

    The brain works differently than the computer (no news, but it needs to be said once in a while).

    First of all, speech is context-dependent. The computer, in order to reliable understand speech, it has to use a context-sensitive grammar, which is a major nightmare for computer linguistics. And since certain words sound the same, the only possible solution for reliable speech recognition is context-sensitive parsing.

    Secondly, recognizing different accents requires experience, something that algorithms don't have. Speech recognition will not be useful unless Chinese, Hindu, Arab, Mediterranean, Russian accents (when speaking English) can be successfully parsed.

    My intuition says that in order to have reliable speech recognition, the computer shall work like the brain, i.e. like a neural network. Of course computers are not that powerful yet, so I think we should not blast Microsoft yet.

  122. dictation programs are region biased by fusion9290991 · · Score: 1

    I had a similar problem with Dragon Dictate a few years ago. It refused to accept anything I said, even in the training phase, unless I spoke with an American accent (and I can only do a lousy American accent).

    --
    remember to loot and pillage before you burn!
  123. Microsoft Demo Bugs Actually Work For Them by bandmassa · · Score: 1

    When a microsoft "new" technology fails amusingly or embarrassingly at a demo, they benefit from a) all the free publicity of people like us laughing, and b) the failures create a mindset in non-technical user markets that "gee, it must be knew stuff, even the demo went buggy." Apple had working, reliable speech recognition built-in to the OS way back in the early 90s with System 7.5. It didn't do dictation out of the box, but it provided alternative access methods to the OS, and could easily distinguish between words as different as "aunt" and "mother." People will believe Microsoft are pushing the boundaries, when in fact they're just bloating Doze even more by shoe-horning computer lab curios into an already overloaded OS. Then again, by even commenting, I guess I'm adding to the buzz (I roll my eyes as much at myself.)

    --
    "I hope you like Guinness, Sir. I find it a refreshing substitute for, er... food." Col. Jack O'Neil, SG-1
  124. Very funny and all by Bertie · · Score: 1

    But unfortunately I work in the speech industry, and it's hard enough selling stuff at the best of times, without these idiots pitching up and convincing a watching world that this is the state of the art...

  125. Re:On MSNBC's front page - for about 30 minutes... by Anonymous Coward · · Score: 0

    There's an explanation of this bug and yes, it's gain levels related.

  126. Glitch wasn't that bad in context. by DougieVan · · Score: 1

    I was at this meeting and wrote up an account of what happened that counters the newscast here.