Slashdot Mirror


Remote Exploit of Vista Speech Control

An anonymous reader writes "George Ou writes in his blog that he found a remote exploit for the new and shiny Vista Speech Control. Specifically, websites playing soundfiles can trigger arbitrary commands. Ou reports that Microsoft confirmed the bug and suggested as workarounds that either 'A user can turn off their computer speakers and/or microphone'; or, 'If a user does run an audio file that attempts to execute commands on their system, they should close the Windows Media Player, turn off speech recognition, and restart their computer.' Well, who didn't see that coming?"

372 comments

  1. Most Important Part of the Announcement by eldavojohn · · Score: 5, Funny

    Microsoft cautioned everyone not to play the song "Hit Me Baby One More Time" by Britney Spears on or near your computer while the mic is on.

    Several lawsuits already involve brutal crimes by computers against annoying young teeny bopper women. Although we can't act like we didn't see this coming, tension has been steadily rising.

    --
    My work here is dung.
    1. Re:Most Important Part of the Announcement by kannibal_klown · · Score: 5, Funny
      Worse yet!!!

      One of the computer geeks at the Pentagon better not be watching any Star Trek episodes.

      Computer. Initiate self destruct sequence. Authorization 1A 2B 3C
    2. Re:Most Important Part of the Announcement by Linker3000 · · Score: 1

      I'd be more worried about watching episodes of Dr Who online when the cybermen are involved.."delete..delete..delete".

      There's also a joke about talking too loud near your computer to a colleague named Colin who's asked you which of several designs for a new brochure you prefer - but I can't be bothered to set the scene:

      "Format C, Colin"

      --
      AT&ROFLMAO
    3. Re:Most Important Part of the Announcement by asmiller1950 · · Score: 1

      "Insert disk labeled 'debian-31r4-i386-netinst' now."

    4. Re:Most Important Part of the Announcement by Hoi+Polloi · · Score: 1

      Don't play the audio file that repeats "Buy v1gr1a! Buy v1gr1a!..."

      --
      It is by the juice of the coffee bean that thoughts acquire speed, the teeth acquire stains. The stains become a warning
    5. Re:Most Important Part of the Announcement by joshetc · · Score: 5, Funny

      Microsoft cautioned everyone not to play the song "Hit Me Baby One More Time" by Britney Spears on or near your computer while the mic is on.

      Several lawsuits already involve brutal crimes by computers against annoying young teeny bopper women. Although we can't act like we didn't see this coming, tension has been steadily rising [theonion.com]. You should see what happened to the guy who played the Nirvana song "Rape Me".
    6. Re:Most Important Part of the Announcement by Anonymous Coward · · Score: 5, Funny

      Authorization 1A 2B 3C
      Hey! That's the authorization code on my luggage!
    7. Re:Most Important Part of the Announcement by BrokenHalo · · Score: 1, Insightful

      "Format C, Colin"

      Probably a good idea, though. And while we're at it, since Microsoft recommends rebooting (again, sigh), perhaps it is wise to do so with an installation CD of [linux distro of choice] in the drive. Seriously, who wants Vista? More trouble than it's worth.

    8. Re:Most Important Part of the Announcement by plopez · · Score: 2, Funny

      who wants Vista?
      billg, ballmer, hardware manufactures, virus writers, anti-virus vendors, spam bot operators, antispam software writers.... oh, you meant *humans*... in that case, none.

      --
      putting the 'B' in LGBTQ+
    9. Re:Most Important Part of the Announcement by darthnoodles · · Score: 3, Funny

      I'm guessing they were already raped when Vista was installed.

    10. Re:Most Important Part of the Announcement by Opportunist · · Score: 1

      Actually, IMO, one deserves corporal punishment for playing this (or any BS) song.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    11. Re:Most Important Part of the Announcement by Opportunist · · Score: 2, Funny

      Anti-Virus vendors certainly don't want Vista. You have NO idea what headache that system means to you if you have to include anything remotely resembling a driver in your product.

      Personally, I'd be VERY happy if it vanished faster than it appeared. Erh... ok, considering the development time that isn't such a strong statement, but I'd be happy if it vanished faster than it installs. Erh... if it vanished faster than it boots. Erh...

      Damn, can someone come up with a suitable analogy?

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    12. Re:Most Important Part of the Announcement by Anonymous Coward · · Score: 0

      In what experts are calling "actually pretty funny" a hacker has intentionally made a fool of himself in front of a webcam and gotten onto the top videoes on youtube. He is doing an odd dance with a very odd custom soundtrack that sounds suspiciously like a series of commands that would download spyware or format a hard drive... It has 10 million unique views, but only actually zero comments, and zero people have viewed it twice.

      OR....

      In what experts are calling "pretty twisted" someone seems to have added a custom soundtrack of carefully articulated words to a popular p0rn video and suddenly all internet activity ceased for the first time since it started. Nobody seems to recall hearing the words spoken until all the sudden their computer turned off and the video ended right before the good part...

    13. Re:Most Important Part of the Announcement by asCii88 · · Score: 2, Funny

      You meant, if it vanished fasther than its first bug is found?

    14. Re:Most Important Part of the Announcement by Anonymous Coward · · Score: 0

      Playing _anything_ by Britney Spears on your computer, whether the mic is on or off, is liable to cause severe injury to you and/or your computer from your coworkers.

    15. Re:Most Important Part of the Announcement by Anonymous Coward · · Score: 0

      You better not tell the airport security personnel that your luggage can self-destruct

    16. Re:Most Important Part of the Announcement by seann · · Score: 1

      almost choked on my sub.

      thanks

      --
      I'm a big retard who forgot to log out of Slashdot on Mike's computer! LOOK AT ME.
    17. Re:Most Important Part of the Announcement by DrSkwid · · Score: 1

      Corporal punishment while listening to Britney ? I'd cum in my pants

      --
      There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
    18. Re:Most Important Part of the Announcement by Anonymous Coward · · Score: 0

      OOHH I know!

      Did he get a pie in the face?

    19. Re:Most Important Part of the Announcement by dgatwood · · Score: 1

      Bender's head explodes.

      Bender: Great. Now everybody knows!

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    20. Re:Most Important Part of the Announcement by misleb · · Score: 1

      Microsoft cautioned everyone not to play [a] song... by Britney Spears
      Well that just goes without saying, doesn't it?

      -matthew
      --
      "THERE IS NO JUSTICE, THERE IS ONLY ME." -Death
    21. Re:Most Important Part of the Announcement by xianfox · · Score: 1

      Faster than it crashes?

    22. Re:Most Important Part of the Announcement by Zonnald · · Score: 2, Funny
      And of course you proceed that with.

      "Hey, Colin, check out my new 'Start, Run, CMD, Enter'" (wtf) "Oh, I like Format C, Colin." (turns to the doorway where Bob has just arrived) 'Enter, Yes, Bob'.

      Really it would seem a little bit more complicated then just throwing (or Squirting (tm)) a random phrase at the computer. I would imagine that the application with focus has to be able to interpret the phrase.

    23. Re:Most Important Part of the Announcement by acherusia · · Score: 1

      You should see what happened to the guy who played the Nirvana song "Rape Me". You mean the computer could actually understand what that song was saying? Damn, maybe Vista really is advanced after all.
    24. Re:Most Important Part of the Announcement by Teddy+Beartuzzi · · Score: 1

      Demon Seed redux?

    25. Re:Most Important Part of the Announcement by StikyPad · · Score: 1

      I'm pretty sure teenyboppers, by definition, don't listen to music that's almost 10 years old.

      Yes, you're getting old.

    26. Re:Most Important Part of the Announcement by netsharc · · Score: 4, Funny

      Anyway, typing "format C:" in a running Windows doesn't work, because it will say "The volume is in use." (assuming Windows is on C:)...

      Don't believe me? Try it yourself. ;-)

      --
      What time is it/will be over there? Check with my iPhone app!
    27. Re:Most Important Part of the Announcement by iamroot · · Score: 1

      The first thing that came to mind was:
      <Dilbert>Well look who just got fancy new voice recognition software. Well, at least I won't work all day and then DELETE a FILE!</Dilbert>

    28. Re:Most Important Part of the Announcement by Varun+Soundararajan · · Score: 1

      Thats not possible, coz it usually runs between crashes :)

  2. Simpler "remote exploit" by Anonymous Coward · · Score: 1, Funny

    Shout.

  3. Suckage. by The+Neck · · Score: 0

    Sometimes you just can't come up, even if you try, with a better way to show how much something suck than to tell the public how you can fix it.

    The Neck.
    .

  4. Yell Commands Across the Room by ehaggis · · Score: 5, Funny

    Is that a remote exploit?

    --
    One ring to bind them - should probably have more fiber and less rings in their diet.
    1. Re:Yell Commands Across the Room by kalirion · · Score: 1

      Is it a remote exploit if someone hacks into your bluetoothed laptop from five feet away?

    2. Re:Yell Commands Across the Room by morcheeba · · Score: 1

      My bank had one of those hp printers in the window with an IR port on it... It wouldn't take too much to freak them out with some weird documents that looked like they were printed by an insider.

      But, a soundbug on the window might now actually be dangerous.

    3. Re:Yell Commands Across the Room by volpe · · Score: 1

      File!!... Exit!!!... No!!!

  5. That's hardly an exploit by kahei · · Score: 4, Insightful


    Taking a computer that obeys audio instructions, and playing it some audio instructions, is more of a 'duh' than an 'exploit'. But this problem is a very Good Thing. It can only mean:

    -- EITHER people stop yakking on about voice computing, which has been the Way Of The Future since about 1935 or something
    -- OR pressure is exerted on web designers to NOT make sites that start making noise the moment the page appears!

    Either of these, but especially the latter, would be a big win. So here's to you, Mr. Exploit Finding Man!

    --
    Whence? Hence. Whither? Thither.
    1. Re:That's hardly an exploit by just_another_sean · · Score: 5, Funny

      So here's to you, Mr. Exploit Finding Man!

      Now there's a Bud commercial I'd like to hear.

      --
      Creationist Textbook Stickers Declared Unconstitutional by CowboyNeal
    2. Re:That's hardly an exploit by jimstapleton · · Score: 1

      the probelm his, the exploiters/hackers will do this kind of thing even if told not to. Makes me think of the "_required" input names in cold-fusion forms *shudder*.

      No, what we need is browsers that will let us force-mute things. What needs this more than anything else? Flash Player., since I think most browser has a "no audio" option anyway.

      --
      34486853790
      Connection too slow for X forwarding? Try "ssh -CX user@host"
    3. Re:That's hardly an exploit by Anonymous Coward · · Score: 1

      or default to If playing audio then audio instructions listener = off

      mind you this is such an advert for vista in disguise - what the computer understands you? wow! 'hello computer' isn't just a dream.

      FFS

    4. Re:That's hardly an exploit by Anonymous Coward · · Score: 2, Insightful

      Even so, with Vista's new software audio stack, this is inexcusable. It should have been trivial to compare the input and output signals and filter out most of this automatically.

    5. Re:That's hardly an exploit by gstoddart · · Score: 4, Insightful

      -- EITHER people stop yakking on about voice computing, which has been the Way Of The Future since about 1935 or something
      -- OR pressure is exerted on web designers to NOT make sites that start making noise the moment the page appears!
      Or, we make browsers so they don't run every damned audio file, flash frigging plugin, executable, movie, or whatever that the idiot who made the site thinks I should hear/see/play with/click/download/execute or whatever.

      There has never been any sound from a webpage that didn't make me want to immediately beat the person who wrote it with his own leg. I don't want to listen to your stupid MIDI file of whatever the fsck you think is cool on your web page.

      There was never any good reason to embed sounds in web pages unless you have to click a button to specifically play it.

      Cheers
      --
      Lost at C:>. Found at C.
    6. Re:That's hardly an exploit by Anonymous Coward · · Score: 0

      So either blind people are screwed or deaf people are screwed. Nice.

      Explanation on the deaf people being screwed. HP business computers have "business sound" which is basically a mono speaker built into the case. The default is that the sound is on without hooking up extra speakers. I don't know if other lines/brands have the same thing or not, but HP definitely does. Unless they think to turn it off (maybe an office mate complains) they're going to get sound, and not even know it. I don't have Vista yet, so I don't know how it's set up, but if the voice command is on automatically or some other person turns it on...

      Blind people? Voice recognition is a good thing. Also, turning down the sound is a bad thing.

    7. Re:That's hardly an exploit by morgan_greywolf · · Score: 2, Insightful

      or default to If playing audio then audio instructions listener = off
      Yes: for all of you fanbois out there saying "Oh, that's not an exploit!" pay attention to what the parent is saying! You gotta admit, it was huge oversight on Microsoft's part to not include any mechanism for turning off the accepting of audio instructions while playing audio, or at least to have a user-configurable option for protection against this exploit, defaulted to "On".

      This is yet another case of Microsoft putting ease-of-use ahead of security and reliablity. We've all heard this song before. Same story, different Windows version.

    8. Re:That's hardly an exploit by VertigoAce · · Score: 5, Informative

      The audio mixer in Vista is no longer based on different audio types (MIDI, CD Audio, WAV, etc). Instead, there is a volume slider and mute button for each application that makes sounds. So you can mute IE, AIM (those annoying video ads), and Windows itself, while still playing your music in WinAmp or WMP.

    9. Re:That's hardly an exploit by GreyPoopon · · Score: 1

      Probably the best thing to do is to program the computer to recognize the speaker by their voice pattern, and only respond to commands from "registered" speakers. Note that this is not the same thing as training a speech recognition system in that you aren't teaching the computer to understand your words. Instead you are teaching the computer to distinguish your voice from others.

      --

      GreyPoopon
      --
      Why is it I can write insightful comments but can't come up with a clever signature?

    10. Re:That's hardly an exploit by jimstapleton · · Score: 0, Redundant

      ooh, that's a nice change.

      --
      34486853790
      Connection too slow for X forwarding? Try "ssh -CX user@host"
    11. Re:That's hardly an exploit by bicho · · Score: 1

      Now *That* is something!

      --

      errera hunamum ets
    12. Re:That's hardly an exploit by Anonymous Coward · · Score: 0

      "Computer, start program."

      "Computer, start program."

      "Computer! Start program!"

      "....damn this cold."

    13. Re:That's hardly an exploit by bloobloo · · Score: 4, Funny

      Never? Not even Bananaphone?

    14. Re:That's hardly an exploit by pato101 · · Score: 1

      -- OR pressure is exerted on web designers to NOT make sites that start making noise the moment the page appears!
      I smell^H^H^H^H^H hear the next goatse joke

    15. Re:That's hardly an exploit by VagaStorm · · Score: 1

      This has got to be one of the most user friendly features in vista, even tho it will kill your pornotube experience :p :p

    16. Re:That's hardly an exploit by HTH+NE1 · · Score: 1

      You gotta admit, it was huge oversight on Microsoft's part to not include any mechanism for turning off the accepting of audio instructions while playing audio

      You know you've just replaced a remote exploit with a denial of service attack, yes?

      "Computer, stop playback. Computer, mute! Computer, reduce volume!! Computer, end program!!! Computer, shut the f@#k up and do what I tell you!!!!"

      --
      Oh, say does that Star-Spangled Banner entwine / The myrtle of Venus with Bacchus's vine?
    17. Re:That's hardly an exploit by Falladir · · Score: 1

      One time I looked up the lyrics to a scottish folk song that my mother used to sing, and one of the hits auto-launched a midi file. I didn't really mind. If you hate the sounds so much, disable them in your browser. Opera can do it. Tools > Preferences > Content (tab).

    18. Re:That's hardly an exploit by peepleperson · · Score: 1

      There was never any good reason to embed sounds in web pages... I heartily disagree. http://www.zombo.com/
    19. Re:That's hardly an exploit by xero314 · · Score: 3, Interesting

      Couldn't the system simply have a filter that removes the wave signature of what it is outputting before processing input as a command? This is relatively simple technology, as compared to voice recognition itself. You might have to re-calibrate if you move your speakers but I would think that is a small price to pay to not leave open the ability for a web site to control your system through an auto-playing wave file.

      Mind you this won't stop your roommate from yelling "Shut Down...Yes" just to piss you off. Or worse yet the guy you just fired yelling something more destructive on his way out of the office.

    20. Re:That's hardly an exploit by Teun · · Score: 1

      Yeah we need to DRM these words like Format and Delete.
      That'll stop the abuse!

      --
      "The likes of Facebook and WhatsApp are free to those whose privacy is of zero value."
    21. Re:That's hardly an exploit by Anonymous Coward · · Score: 0

      How many languages does Vista's voice command feature support? Are they all 'on' at the same time? I could see a website exploiting this by recognizing the visitor's preferred language, and playing the commands in one they're not likely to recognize.

      If you're a Spanish speaker, and German commands start playing on a website, are you likely to realize what they're doing?

      That'd be great fun!

    22. Re:That's hardly an exploit by BlueWaldo · · Score: 1

      This is what they should be advertising. I will now be buying Vista.

    23. Re:That's hardly an exploit by Dan100 · · Score: 1

      "Or, we make browsers so they don't run every damned audio file, flash frigging plugin, executable, movie, or whatever that the idiot who made the site thinks I should hear/see/play with/click/download/execute or whatever." Most browsers can be configured to do that. Opera even allows you to drag the options (e.g. "Enable sound in web pages") to the toolbar, so if you use sites that utilise say GIF animation you can easily turn that on when visiting them.

    24. Re:That's hardly an exploit by AeroIllini · · Score: 1

      IE, AIM (those annoying video ads)
      You must be new here.
      --
      For security, the MD5 hash of this message and sig is 09f911029d74e35bd84156c5635688c0.
    25. Re:That's hardly an exploit by haddieman · · Score: 1

      www.clearification.com(fancy flash site). The first page is one of the best listening experiences ever!

    26. Re:That's hardly an exploit by Wannabe+Code+Monkey · · Score: 2, Interesting

      The audio mixer in Vista is no longer based on different audio types (MIDI, CD Audio, WAV, etc). Instead, there is a volume slider and mute button for each application that makes sounds. So you can mute IE, AIM (those annoying video ads), and Windows itself, while still playing your music in WinAmp or WMP.

      If that's true, then that's awesome. I remember a couple years ago reading a story on slashdot about various experimental usability projects going on at Microsoft and this was one of them. I think they even put together a mock desktop in flash where they implemented this volume system that you could play with. From a usability standpoint it was way better. I had assumed that this was something that just got lost along the way, but I'm glad to see they went through with it.

      --
      We always knew Comcast was corrupt, here's the proof: http://tech.slashdot.org/comments.pl?sid=1909890&cid=34545432
    27. Re:That's hardly an exploit by Anonymous Coward · · Score: 0

      Unfortunately, they broke DirectSound and EAX to do that.

      Fancy new audio card with a much higher precision mixer and oversampler than Vista's software one? Tough.

      It's a bit of a problem that hasn't really been resolved yet. Better drivers which replaced all the relevant parts would probably do the trick, but of course they're still fairly new.

    28. Re:That's hardly an exploit by swarsron · · Score: 1

      This is very neat. Is there a way to get the same functionality in linux?

    29. Re:That's hardly an exploit by AndersOSU · · Score: 1

      That is not true

    30. Re:That's hardly an exploit by Lanoitarus · · Score: 5, Funny

      Bud Light Presents...
      Real American Heroes (reaaalllll american heroooessss...)
      Today we salute you, Mr Computer Software Exploit Finder (computer software exploit fii-inder)
      While others are wasting away their lives drinking, dating, and and having fun, you're hunched over a screen, plowing through code.(hunch plow hunchie plow)
      You may not have seen the sun in days, but thats ok- you do this for the greater good.(greaaater goooo-ooodd)
      Only YOU could realize that a carefully crafted web favorites icon could potentially bring the world to its knees.(Down on its kneeee--eesss)
      So crack open an Ice Cold Bud Light, Oh Overload of Overflow, because without you, CmdrTaco would have to get a real job.

    31. Re:That's hardly an exploit by Anonymous Coward · · Score: 0

      There was never any good reason to embed sounds in web pages unless you have to click a button to specifically play it.


      Au contraire, Millions of MySpace.com users can't be wrong.

    32. Re:That's hardly an exploit by nacturation · · Score: 1, Flamebait

      This is very neat. Is there a way to get the same functionality in linux? No doubt someone will take these features and clone them while Linux fanboys everywhere complain that Microsoft never innovates.
      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    33. Re:That's hardly an exploit by Mortanius · · Score: 1

      In defense of ColdFusion (been developing with it since version 4.5 back in 2000), people who use the tag (which is where you find the "_required" suffix on field names, barring the random oddball person who implements that manually) are just plain lazy and asking for trouble anyway. It adds a whole pile of extra Javascript to the page and, as you hint at, it's insanely simple to bypass and it's just plain bad practice not to be defensive about even those variables.

      Blame the lazy i-just-got-a-college-degree-so-i-know-everything-a bout-everything students who learned how to write web scripts through Dreamweaver and have never actually seen the resulting ColdFusion code in their life. :-P (Bitter? Nah.)

    34. Re:That's hardly an exploit by jasen666 · · Score: 1

      haha...
      And just wait until millions of myspace pages have Vista voice commands embedded into them.

    35. Re:That's hardly an exploit by DrSkwid · · Score: 3, Informative

      here's how to do something similar in plan9

      mkdir /n/mute
      bind /n/mute /dev/audio
      run_noisy_application

      --
      There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
    36. Re:That's hardly an exploit by ednopantz · · Score: 1

      The audio mixer in Vista...

      This is Slashdot, you can't admit the existence of *features* in Vista.

    37. Re:That's hardly an exploit by just_another_sean · · Score: 1

      Thank you. That is perfect! :-)

      --
      Creationist Textbook Stickers Declared Unconstitutional by CowboyNeal
    38. Re:That's hardly an exploit by jimstapleton · · Score: 1

      My problem is that every pice of "official" documentation for CF that I read suggested that was a good idea, and gave *NO* warnings of the problems... Just suggested it was all you'd ever need for data verification.

      I just can't trust the security of the internals of a program where they suggest users do taht for security... It makes me wonder how the rest works...

      --
      34486853790
      Connection too slow for X forwarding? Try "ssh -CX user@host"
    39. Re:That's hardly an exploit by pluther · · Score: 1

      Wouldn't it make more sense just to have the computer not respond to voice commands that it, itself, is issuing?

      Can't the OS determine what's playing on output audio channels and just ignore it?

      --
      If the masses can keep you down, you're not the Ubermensch.
    40. Re:That's hardly an exploit by neuro.slug · · Score: 1

      What? You don't love the moon!? Blasphemy..

    41. Re:That's hardly an exploit by filterchild · · Score: 1

      Ever heard of the JACK audio server?

    42. Re:That's hardly an exploit by Anonymous Coward · · Score: 0

      Yeah, somebody was working on a project like this...I'm trying to remember the name. I think it might have been Pulse Audio? http://pulseaudio.org/

    43. Re:That's hardly an exploit by metroplex · · Score: 1

      The firefox plugin Noscript (direct install here) lets you block almost all unwanted web "features", including sounds. Very handy!

      --
      "Words of wisdom: drop that zero and get with the hero" -- Vanilla Ice
    44. Re:That's hardly an exploit by ScislaC · · Score: 1

      Perhaps using Jack? I don't know if there is a way to route sound to a "null" destination, but it seems like it could work (at least theoretically).

    45. Re:That's hardly an exploit by Talchas · · Score: 1

      You can do even crazier stuff through artsd for programs that use arts (the kde soundserver) by piping output from a program though filters made in artsbuilder.

      --
      As the Americans learned so painfully in Earth's final century,free flow of information is the only safeguard against...
    46. Re:That's hardly an exploit by the_womble · · Score: 1

      Pulse does a lot more stuff, moving audio from one machine to another etc.

      However, does muting by application have any use other than muting flash? Anything else will let you use the volume control in the app itself surely?

      I deal with flash by using Noscript which has a option to block plugins untill you click on a placeholder. Very little flash actually gets played that way.

    47. Re:That's hardly an exploit by Anonymous Coward · · Score: 0

      Pornotube is mostly for gay content. Try youporn.com instead.

    48. Re:That's hardly an exploit by complete+loony · · Score: 2, Funny

      badger badger badger ...

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    49. Re:That's hardly an exploit by complete+loony · · Score: 1

      Hear Hear! Plus it would make talking in skype / team speak etc much more pleasant.

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    50. Re:That's hardly an exploit by pclminion · · Score: 2, Informative

      Couldn't the system simply have a filter that removes the wave signature of what it is outputting before processing input as a command? This is relatively simple technology, as compared to voice recognition itself. You might have to re-calibrate if you move your speakers but I would think that is a small price to pay to not leave open the ability for a web site to control your system through an auto-playing wave file.

      The quick answer is "no." Even though the computer knows what waveform it is playing, it has no idea what waveform will actually emerge from the speakers, or arrive at the microphone.

      The problem is that the audio system taken as a whole (Sound card DAC -> speaker wire -> speaker driver -> air in the room -> microphone pickup -> microphone wire -> sound card ADC) introduces small but significant spectral distortion into the sound by the time it runs through the entire system. Even if we ignore the nonlinearities of the amplifiers, the finite resolution of the digital-to-analog converters, and everything else, we still run into the problem of objects MOVING in the room (like you, leaning 2 inches forward in your chair), which changes the impulse response of the system and therefore changes the spectrum of the received signal.

      Even if we consider only two elements, the speaker cone and the air in the room, it is fairly easy to see that the sound wave generated is NOT equivalent to the wave being sent to the speaker cone. Imagine a step signal (e.g. a Heaviside function) where the speaker deflection instantaneously goes from 0 to 1, then stays there. What does the AIR PRESSURE right next to the speaker cone do? Does it instantaneously jump from 0 to x and then stay there? No, of course not -- a WAVE propagates from the speaker into the air of the room. So the signal applied to the speaker and the signal in the room are not the same signal.

      Now in theory, if all of these effects are linear, then the total impulse response can be computed. This is the "calibration" you mention. The problem, though, is that the system is not TIME INVARIANT, meaning its impulse response changes with time simply because of all the variables which affect the system.

      So it's not only a matter of "recalibrating when you move your speakers." You have to recalibrate when the speakers move, when the temperature changes, when the air pressure changes, when the microphone moves, when the microphone has dust on it interfering with pickup, when anything at all in the room moves, when there is a draft in the room, etc etc.

      This would not be simple technology at all. Not impossible, but probably extremely expensive and unreliable.

    51. Re:That's hardly an exploit by swarsron · · Score: 1

      My problem is that there is a flash game (dice wars, simple but nice) which has a real annoying sound and i can't find a way to just quiet flash and/or firefox.

    52. Re:That's hardly an exploit by Mortanius · · Score: 1

      Yeah, their documentation (excepting certain editions of the reference manuals, they tend to be pretty neutral) does tend to lean towards the fast development / lower security side of things a little too much. Because that is a great function for beginners, or people who are deploying to trusted intranets, it saves development time and just works.

      But like most things, once you've got a solid enough knowledge of it, in my opinion, if you sick with the quick and easy pre-made libraries and such without considering the security implications, you're setting yourself up for disaster. It amazes me how little people consider security in web applications, even today I noticed a glaring flaw (to me anyway) in a client's single signin system that they've been using for years, I explained it to their lead programmer and just got a blank stare.

      Ten minutes later, I had access to all of his email, files, databases and had almost gotten into their master password store. I like to think he's a changed man now.

      Though I know better than to think like that.

    53. Re:That's hardly an exploit by Anonymous Coward · · Score: 0

      ALSA has software volume control, which gives you a seperate slider for the application in your mixer application. The application has to be configured to use it though, which is like putting a flag on, it's not enforced on all applications like in that fascist operating system.

    54. Re:That's hardly an exploit by Trogre · · Score: 1

      That's nice, but I for one am grateful for web pages that allow me to listen to how foreign words are pronounced, or to a segment of music based on a particular key.

      I wonder if you also kicked up a stink when everyone shifted away from Gopher. And you can't believe that a grown man would click on small coloured pictures to operate a computer, am I right?

      --
      "Nine times out of ten, starting a fire is not the best way to solve the problem." - my wife
    55. Re:That's hardly an exploit by OriginalArlen · · Score: 1

      Or, we make browsers so they don't run every damned audio file, flash frigging plugin, executable, movie, or whatever that the idiot who made the site thinks I should hear/see/play with/click/download/execute or whatever. Good heavens, man, if you don't like your browser autoplaying music, why don't you just disable music autoplaying?! It's trivial in Firefox, haven't used Konqueror enough to know if it autoplays or how to configure that (tho the prefs dialog is big and juicy with many many categories fo stuff to fiddle with) -- Firefox gives you about:config, of course, and linx would probably pop up a standalone audio player if you choose to accept the download (it prompts for all that sort of crap before downloading it!) Dunno about MSIE (thank god!) but it must be possible to turn that sort of stuff off there, too. If there's no control knob in "internet settings" or whatever the IS prefs are called these days, perhaps there's some obscure DWORD to set or unset in the recursing bowels of hell that is the Registry.
      --

      Everything I needed to know about life, I learnt from Blake's Seven
    56. Re:That's hardly an exploit by Slur · · Score: 1

      Couldn't the system simply have a filter that removes the wave signature of what it is outputting before processing input as a command? This is relatively simple technology, as compared to voice recognition itself. In fact, Apple's iChat software does just this. You can hear a little bit of aliasing in the sound where the computer's own output waveforms have been subtracted, but it's pretty thorough at removing the noise. Also, Karaoke machines have been able to do this for a long while with a combination of dynamic EQ and fuzzy logic. There's nothing to prevent Microsoft from incorporating these and other methods to improve its interpretation of sound, assuming it has a single unified API for sound that ensures all output goes through a single bottleneck.

      --
      -- thinkyhead software and media
    57. Re:That's hardly an exploit by gregmac · · Score: 1

      Not to mention it won't stop it from affecting other computers in the same room / listening range.

      --
      Speak before you think
    58. Re:That's hardly an exploit by notnAP · · Score: 1
      You are correct, right from the subject line you gave this thread. The truth is, this is just way too stupid a "feature problem" to be called an actual exploit.

      The problem is, the IT world doesn't really have a another word that sinks down to the level of ineptitude this "exploit" shows.

      I mean... come on. Was I the only one who rechecked his calendar to make sure this wasn't some kind of April Fool's joke? This is real?!?

    59. Re:That's hardly an exploit by Deadplant · · Score: 1

      Echo cancellation technology has been available in pro-audio equipment for many years and in consumer electronics for quite some time.
      It is cheap and effective.

      I don't personally know how it works but I use it all the time.
      You can put a mic on a stage, send the output from that mic through lots of cables, mixers and amps then blast it out through 15 different speakers at ear-damaging volume levels all in a tiny crowded bar and still not have a feedback loop problem.

    60. Re:That's hardly an exploit by fbjon · · Score: 1
      Not quite. The software only needs to estimate the delay between speaker(s) and microphone, because successfully removing the original sound will very likely make that component of the signal unintelligible. There's no need to perfectly excise it without leaving noise behind.

      There's an alternate solution, though: perform speech recognition on the mix of all playback sources. Any commands recognized there are disabled for the next second or so.

      --
      True confidence comes not from realising you are as good as your peers, but that your peers are as bad as you are.
    61. Re:That's hardly an exploit by the_womble · · Score: 1
      Ah,

      Then Firefox should have a volume control. There is a FF enhancement request about it:

      https://bugzilla.mozilla.org/show_bug.cgi?id=33320 8

      But noone is working on it.

      One of the comments says it needs to be done at the plugin level, but I know (at least on Linux) that it is possible to launch apps with various wrappers that do things with their sound output (e.g. to record it), which might provide a workaround.

      Of course it is a shortcoming in the plugins as well.

    62. Re:That's hardly an exploit by the_womble · · Score: 1

      Yes, MS innovates by copying MacOS freeware.

    63. Re:That's hardly an exploit by Anonymous Coward · · Score: 0

      Here are a few more potential bug vectors:
      0) Turning on the remote camera - he he already done ;-).
      1) Turning on microphone remotely and recording the sound of keystrokes - particularly password/banking ones.
      2) Either re-routing the sound/ voice commands to a non-visible command window - just like DVD audio snagging.
      3) Applying some filter to XOR commands on the back of real music.
      4) like any multimedia engine, there will be buffer overflow exploits for this 'new' code. Entirely expected and predictable.
      5) Hook the voice engine with a morse-code like front end.
      6) Settings. However you change them, malware can get in and un-change them
      7) Modify the dictionary or macros, so that a command like shutdown does a whole lot more.
      8) Privilege escalation is foreseeable

    64. Re:That's hardly an exploit by GreyPoopon · · Score: 1

      Wouldn't it make more sense just to have the computer not respond to voice commands that it, itself, is issuing?
      No because that won't stop somebody from using one of those 3D sonic beams to focus sound through a window directly on the microphone of your computer. Nor will it prevent the same "hack" from taking place in an office where you have two computers running.
      --

      GreyPoopon
      --
      Why is it I can write insightful comments but can't come up with a clever signature?

  6. I tried to replicate the bug, but all I got was by knightmad · · Score: 5, Funny

    c:> Dear aunt, let's set so double the killer delete select all: Command not found

    1. Re:I tried to replicate the bug, but all I got was by teslar · · Score: 5, Funny

      Lucky you. I was watching Star Trek First Contact in the living room and fifteen minutes after Picard told the Enterprise computer to initiate the self-destruct protocol, my laptop exploded!

    2. Re:I tried to replicate the bug, but all I got was by Yabol · · Score: 1

      I wonder how it responds to http://www.zombo.com/ ?

    3. Re:I tried to replicate the bug, but all I got was by Overzeetop · · Score: 4, Funny

      It's not Vista's fault your laptop uses a Sony battery. MS can't be blamed for everything, you know.

      --
      Is it just my observation, or are there way too many stupid people in the world?
    4. Re:I tried to replicate the bug, but all I got was by An+ominous+Cow+art · · Score: 1

      Be sure not to watch TOS in hearing range of your laptop. Seemed like just about every other original episode, Cpt Kirk talked a computer to death: Landru, the M5, Nomad, various androids...

    5. Re:I tried to replicate the bug, but all I got was by Mark+Hood · · Score: 1

      MS can't be blamed for everything, you know.

      They can't? Damn. I thought I was reading Slashdot, I shouldn't have taken that left turn at digg.com.

      Mark

      --
      Liked this comment? Why not buy me something nice
    6. Re:I tried to replicate the bug, but all I got was by Trogre · · Score: 1

      Eventually I just got Cod Ham Pizza Ship.

      --
      "Nine times out of ten, starting a fire is not the best way to solve the problem." - my wife
    7. Re:I tried to replicate the bug, but all I got was by Overzeetop · · Score: 1

      Well, I suppose we could blame everything on MS, but then there woudn't be time to complain about Sony, the RI/MPAA, or the fact that roland has gotten another story accepted.

      --
      Is it just my observation, or are there way too many stupid people in the world?
  7. amusing, but not much else by Thansal · · Score: 2, Insightful

    If you computer starts spitting out voice commands, just create another sound that will interupt it.

    Admitedly all I can think of is the Dilbert cartoon with Wally getting ticked at Dilbert having voice driven software.

    --
    Do Or Do Not, There Is No Spoon, There Is Only Zuul. Everything in the above post is probably opinion.
    1. Re:amusing, but not much else by TheNetAvenger · · Score: 1

      If you computer starts spitting out voice commands, just create another sound that will interupt it.


      And oddly Win98 was one of the first commerical OSes to have realtime Audio mixing even between applications with a single sound processor.

      Has anyone here even seen a Windows computer since it was DOS based and Clinton was president?

  8. Bug? by drinkypoo · · Score: 3, Insightful

    I wouldn't call it a bug. I'd call it a very bad idea to use a microphone without a switch for voice recognition. Your television could theoretically do things on your computer. Does that sound like a possibility you want to entertain? Get a mic with a switch, or get rooted.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    1. Re:Bug? by ScentCone · · Score: 1

      Your television could theoretically do things on your computer

      Yes, but luckily all television-based computer use consists of intense leadership figures telling glamourously dressed nerd hotties sitting in front of obviously faked up Flash-animation-running workstations to, "Zoom in on the license plate reflection in the sequin on that crossdressing supermodel's dress and get me all information related to everyone who knows the person who last changed the oil in that terrorist's Land Rover by cross-checking the microtags in the oil against all known bank robbery plots in the same latitude."

      --
      Don't disappoint your bird dog. Go to the range.
  9. The Real Agenda of this Article? by ksalter · · Score: 4, Insightful

    All voice recognition software, no matter what platform, would suffer from this supposed "exploit". So why this article on Vista specifically? What is the real agenda here? Also, if the voice recognition software is trained for a specific user's voice, the chances of an exploit are reduced.

    1. Re:The Real Agenda of this Article? by Thansal · · Score: 1

      If I remember, a large point is that the vista one does NOT need training (this is not actualy all that new, there have been voice recognition things that don't need training for a little while now).

      After all, training voice recognition software is long, tedius, and often pointless. The best thing I ever did with one was intentionaly training it horribly (readign something else instead of the text they gave me), and then seeing what would happen. (It wasn't all that interesting in all honesty)

      --
      Do Or Do Not, There Is No Spoon, There Is Only Zuul. Everything in the above post is probably opinion.
    2. Re:The Real Agenda of this Article? by Bertie · · Score: 1

      Well, speaker verification is more than 99% reliable if you first get the user to say specific utterances a number of time so that you can build up a model of their voice patterns (such as ask them to count from one to ten three times or so). But most of this stuff's speaker-independent.

      Anyway, the problem's not with the recogniser so much as how Microsoft's integrated it into the OS. You'd think they would have learned by now, but it seems they're still putting the user's convenience before sensible security precautions.

    3. Re:The Real Agenda of this Article? by shark72 · · Score: 2, Insightful

      "All voice recognition software, no matter what platform, would suffer from this supposed "exploit". So why this article on Vista specifically? What is the real agenda here? Also, if the voice recognition software is trained for a specific user's voice, the chances of an exploit are reduced."

      Yup, this is an old one. There's an apocryphal tale of a user group meeting from long ago of a vendor demonstrating voice-control software and a smart aleck in the back of the room yelling "DEL *.*!" (or whatever the MS-DOS command was).

      As you implied, the agenda is, of course, to have a laugh at Microsoft's expense. If they hadn't included voice control software, the opportunity would have been to point out that Microsoft spent $BIGNUM person-years working on Vista and didn't even include that feature. OSX's easy access to a shell prompt with root access is about as relevant an exploit as the voice control exploit, and the odds of a cat wandering into my house and walking on the keys in such a way to generate the wrong "rm" command are about the same as this Vista "exploit" happening to me. But, it's aways fun to have a laugh at Microsoft's expense, isn't it?

      --
      Sitting in my day care, the art is decopainted.
    4. Re:The Real Agenda of this Article? by ksalter · · Score: 1

      FWIW, if I remember from my beta testing, you could train the Vista recognizer for better performance.

    5. Re:The Real Agenda of this Article? by 99BottlesOfBeerInMyF · · Score: 4, Informative

      All voice recognition software, no matter what platform, would suffer from this supposed "exploit". So why this article on Vista specifically?

      This is untrue. Speech recognition software can be made to filter out anything coming in the mic that matches something going out the speaker channel. More simply, you can simply require all commands be preceded with an arbitrary word (like the computer's name). Call you computer "George" and then issue the command "George, kill dash nine star dot star." As opposed to "kill dash nine star dot star." Since the exploit writer won't know to include "George" their exploit fails almost all the time. This was a feature of MacOS 7, more than a decade ago, as I mentioned elsewhere.

      Also, if the voice recognition software is trained for a specific user's voice, the chances of an exploit are reduced.

      Depending upon the tolerance, this is entirely possible, but I don't see it as being as important or versatile as the other two methods I listed above. MS should have learned from the example of others.

    6. Re:The Real Agenda of this Article? by ksalter · · Score: 1

      I would assume that most people would call their computer "Computer" and you could still exploit. Just like most people run .exe in their email attachments, etc. Social engineering could still exploit that technique, though I agree using an unknown prefix word would help. As to the "filtering out of anyting coming in the mic that matches something going out the speaker channel", basically you are talking about echo cancellation. That seems plausible to me, and on modern computers the hit on peformance would be fairly low. Do you know of any packages which incoporate this?

    7. Re:The Real Agenda of this Article? by billcopc · · Score: 4, Funny

      Voice control is fine, but having the computer react to its own output is ludicrous! You'd think Vista would be smart enough to recognize feedback... It's like having a retard talking into a mic that's hooked up to his own headphones.

      Bob: "Bob go jump off a bridge"
      Bob: "Who said that ?"
      Bob: "I said that. Now jump!"
      Bob: "Ok.. Aaaaaaaagh!"

      Stupid.

      --
      -Billco, Fnarg.com
    8. Re:The Real Agenda of this Article? by ksalter · · Score: 1

      As a follow-up, my experience with echo cancellation with telephony equipment (I write software professionally using them) tells me that there are some definite limitations to echo cancellation algorithms, and that I think that could be broken also at some point. The type of algorithm and the cost in performance would have to considered.

    9. Re:The Real Agenda of this Article? by stewbacca · · Score: 1

      Maybe the real agenda is to point out a brand new problem for Vista that is yet another 10-year old copied Macintosh feature?

    10. Re:The Real Agenda of this Article? by xoyoyo · · Score: 2, Interesting

      True, all speech recognition software *would* suffer from this exploit if the application designers hadn't thought about the likely scenarios in advance. I just checked the situation with my Mac, which comes with speech recognition built in (and has done since what, Mac OS 9?)

      Nothing destructive is enabled by default: the worst you can do on a Mac is log yourself out, but that will keep everything running as it was before.

      If you go to the Speech control panel you can, after putting your admin password in, enable Menu Bar actions which allow you to do things like trash files and restart the computer.

      So by default the computer will just do helpful stuff, but if you really need full control over the OS through speech recognition (eg, you are disabled) you can enable it.

      It's a good indicator of the different philosophies between the two OS vendors we also see in their approach to networking (this may have changed with Vista, I've not really been following it): Apple shut down everything by default and requires the user to open ports; windows boxes, on the other hand are wide open from first boot, have to have their ports shut down by a knowledgable user.

    11. Re:The Real Agenda of this Article? by 99BottlesOfBeerInMyF · · Score: 1

      I would assume that most people would call their computer "Computer" and you could still exploit.

      Make naming the computer part of the install process and provide no default. This is useful for other things as well.

      ...basically you are talking about echo cancellation. That seems plausible to me, and on modern computers the hit on peformance would be fairly low. Do you know of any packages which incoporate this?

      My experience with voice recognition was ages ago. I wrote an article in the 90s saying this was a must have feature to make voice recognition workable, and I'm sure someone else must have realized the same thing by now. Maybe the commercial dictation software uses this by now? I don't know.

    12. Re:The Real Agenda of this Article? by ILikeRed · · Score: 1

      Once again they prove they should really be called MyopicSoft....

      If you get a chance check out the co-creator of COM apologizing for his mess in creating the registry - with no idea that many of the problems he still struggles with have been solved by others - it's on the University of Washington channel I think.

      Which problems of CS would you like to revisit today?

      --
      I have come to a conclusion that one useless man is a shame, two is a law firm, and three or more is a congress -J Adams
    13. Re:The Real Agenda of this Article? by Kopretinka · · Score: 1

      All voice recognition software, no matter what platform, would suffer from this supposed "exploit".

      That's incorrect. I don't know about such a software, but the voice-recognition program should filter out what it captures as going to the output of the computer. For example, Skype, too, should be able to filter out the annoying echo if you have the mike in range of the speakers. This is a feature waiting to be implemented, not an inevitable characteristic. And the feature could, in fact, be in the OS, somewhere around the sound drivers.

      --
      Yesterday was the time to do it right. Are we having a REVOLUTION yet?
    14. Re:The Real Agenda of this Article? by Khabok · · Score: 2, Informative

      My Mac requires a keyword before accepting voice commands. Does Vista do this? If not, I'd call it a vulnerability, albeit a minor one.

      Maybe they should ask the user for a keyword without offering a default? But how many people would use "computer" anyway?

    15. Re:The Real Agenda of this Article? by krakelohm · · Score: 2

      "Hi, my name is Werner Brandes. My voice is my passport. Verify Me."

      --
      You are all a bunch of idots.
    16. Re:The Real Agenda of this Article? by mrbcs · · Score: 2, Funny
      I had this problem years ago. I was playing with something called verbex.. talk to your computer... it does stuff. You had to train it. It worked fairly well and freaked out visitors. I had it set up so that if I swore at a program (it was windows 95 after all) the computer would do an alt+F4. Funny stuff... until one day I'm on the phone and getting a bit... ummm.. upset. Apparently I was cursing a lot cause when I turned around.. my computer was off.

      I removed the software after that.

      --
      I'm not anti-social, I'm anti-idiot.
    17. Re:The Real Agenda of this Article? by ksalter · · Score: 1

      I agree, it should, but which one do echo cancellation (if any)?

    18. Re:The Real Agenda of this Article? by pkulak · · Score: 1

      "OSX's easy access to a shell prompt with root access"

      Really? How do I get a shell prompt on a Mac with root access without typing my password?

    19. Re:The Real Agenda of this Article? by Instine · · Score: 0

      Not so. The solution is relatively simple. You take the mixer audio and remove it from the mic audio stream (which should be done anyway to stop music fucking with your system) and then only let mic minus wav trigger recognition.

      Done.

      Next...

      --
      Because you can - or because you should?
    20. Re:The Real Agenda of this Article? by ghqman · · Score: 1

      You don't really need to do echo cancellation, just voice recognition on the output stream as well, and if it matches the input stream ignore.

    21. Re:The Real Agenda of this Article? by man_of_mr_e · · Score: 2, Informative

      You do realize that it's a bit more complicated than that. Depending on where the speakers were from the microphone, any reflective surfaces that might bounce the sound back, etc... it can all fuck up a noise cancelation circuit, which is what we're basically talking about.

      I've seen EXPENSIVE noise canceling speakerphones screw this up.

    22. Re:The Real Agenda of this Article? by djh101010 · · Score: 2, Informative

      "OSX's easy access to a shell prompt with root access"

      Really? How do I get a shell prompt on a Mac with root access without typing my password?
      I notice he hasn't responded to this. I'm thinking it's because, well, there isn't an easy way to do it. In fact, I can't think of a _hard_ way to do it. Maybe an SUID script to open it as root, but then you have the display thing to deal with. Hm... more likely he was just talking out his arse.
    23. Re:The Real Agenda of this Article? by planetmn · · Score: 2, Insightful

      Except that it will never match. You are basically doing a D/A conversion to output the sound via the speakers, and then A/D when using the mic for input. Both of these stages will cause some distortion (lots of distortion with crappy speakers and microphones). Furthermore, the acoustical environment is going to affect different frequencies to different extents.

      For instance, the mic may not pick up any of the low frequencies due to location of a subwoofer, quality of speakers, sound absorbers (carpet, etc.). So in order to match the output to the input, you need to allow for these factors and by the time that you give yourself enough of a margin, you've in effect taken out all functionality.

      Sure, it's fun to bash MS here on slashdot. Just don't let reality get it the way.

      -dave

      --
      /., where "Apple and Google provide Iran with nukes" will be refuted with "But Microsoft is a convicted monopolist"
    24. Re:The Real Agenda of this Article? by Anonymous Coward · · Score: 0

      do you have a link to this?

    25. Re:The Real Agenda of this Article? by moofo · · Score: 2, Interesting

      It worked pretty well in Mac OS 9. You could login to the machine by selecting your username and then saying a passphrase. The default was: "My Voice is my password"

      Thing is, it was local accounts only, no directory system at this point, much less for voiceprints !

      --
      "I've heard nonsense, compared with which that would be as sensible as a dictionary." Through the looking glass and what
    26. Re:The Real Agenda of this Article? by ILikeRed · · Score: 1

      Actually, I just found it - interview with Tony Williams of Microsoft. He would have saved himself a lot of trouble by just reading ESR's free book....

      --
      I have come to a conclusion that one useless man is a shame, two is a law firm, and three or more is a congress -J Adams
    27. Re:The Real Agenda of this Article? by 99BottlesOfBeerInMyF · · Score: 1

      Except that it will never match.

      Then the exploit won't work either, so you have no problem.

      Think of it this way:

      1. Process audio to speakers and try to recognize commands.
      2. Make a list of commands as they appear.
      3. Process audio coming in microphone and try to recognize commands.
      4. Compare commands in the two lists, if they match, drop them, otherwise execute.

      For instance, the mic may not pick up any of the low frequencies due to location of a subwoofer, quality of speakers, sound absorbers (carpet, etc.). So in order to match the output to the input, you need to allow for these factors and by the time that you give yourself enough of a margin, you've in effect taken out all functionality.

      I takes much, much, much less than a second for any effective reflection of sound to occur in a normal setting where the sound is still going to recognizable to the mic.

      Sure, it's fun to bash MS here on slashdot. Just don't let reality get it the way.

      Reality? MS's solution doesn't even have the basic security feature my Mac OS 7 box did over a decade ago. They deserve a good thrashing for this, even if it is not a serious security issue for most people.

    28. Re:The Real Agenda of this Article? by adolf · · Score: 2, Insightful

      All true.

      However, this should be a solvable problem with current DSP technology.

      If my cellular telephone can perform realtime echo cancellation, and subtract its own speakerphone audio from the microphone audio, and do it for several hours at a time on a battery the size of a matchbook, then I can only fucking hope that a modern dual-core machine would be able to tackle the task handily.

      Even after the variables are all multiplied by some factor because the speakers might move relative to the microphone, there seems to be plenty of horsepower available to throw at the problem. The fundamentals have all been solved by folks like Bell Labs, US Robotics, and Polycom a long fucking time ago, with less DSP power than my $20 optical mouse, using the widely variable POTS network as a testbed, where even the -remote- handset affects the quality of your own voice on the line.

      Just because there's layers of distortion, band limiting, spurious external noises, with dynamics and delay possibly being anywhere on the map and an echo signature that changes as people move around the room, does not mean that it's not all measurable, quantifiable, and possible to reduce it to acceptable levels.

      Remember, you don't have to get rid of all the feedback, and it doesn't have to be perfect. We're talking about a limiting computer's ability to hear itself, which is a far easier task than anything involving a human being. You only have to get rid of enough that the computer does not respond to its own voice. And also, remember that the resultant quality of the recorded microphone audio need not be production-grade, but only good enough for the computer to understand human-generated voice commands.

    29. Re:The Real Agenda of this Article? by Anonymous Coward · · Score: 0

      Except that it will never match. You are basically doing a D/A conversion to output the sound via the speakers, and then A/D when using the mic for input. Both of these stages will cause some distortion (lots of distortion with crappy speakers and microphones). Furthermore, the acoustical environment is going to affect different frequencies to different extents.

      For instance, the mic may not pick up any of the low frequencies due to location of a subwoofer, quality of speakers, sound absorbers (carpet, etc.). So in order to match the output to the input, you need to allow for these factors and by the time that you give yourself enough of a margin, you've in effect taken out all functionality.
      That's why you're doing voice recognition on the output stream, and not simple echo cancellation. If the output stream evaluates to "FORMAT C:", and the input stream from the same timeframe also evaluates to "FORMAT C:", then Vista ignores the input.
    30. Re:The Real Agenda of this Article? by Torvaun · · Score: 1

      I'd name mine HAL, but that might cause even worse trouble.

      --
      I see your informative link, and raise you a pithy comment.
    31. Re:The Real Agenda of this Article? by 99BottlesOfBeerInMyF · · Score: 1

      I'd name mine HAL, but that might cause even worse trouble.

      Amusingly enough my old computer had the audio command, "computer shutdown" aliased to play a sound byte, "I'm afraid that's something I cannot allow to happen, dave" from the movie. Inevitably someone would be smart ass and try it while we were hanging out. The only real problem I had was that the speech recognition would mistake a lot of different commands for "start mozilla" for some reason, and on a 66Mhz machine with 16 meg of ram, that took a minute and a half.

    32. Re:The Real Agenda of this Article? by pclminion · · Score: 1

      You do realize that it's a bit more complicated than that. Depending on where the speakers were from the microphone, any reflective surfaces that might bounce the sound back, etc... it can all fuck up a noise cancelation circuit, which is what we're basically talking about.

      It's all doable in principle. If the complete system is LTI (linear and time-invariant) then performance can be perfect. The problem is in the TI part of LTI. Most real acoustical system are not time-invariant.

      Ever played with microphone feedback? Set up a sound recorder program to record and play the sound back through the speakers. Set the mic close to the speaker and fiddle with the volume until it JUST BARELY starts to feed back on itself. Now, wave your hand around in the vicinity of the microphone. Notice that even the SMALL effect of your hand moving influences the feedback characteristic.

      Now imagine a very sensitive echo cancellation system trying to operate in a room where the acoustic properties are changing with time. Good luck making that work.

    33. Re:The Real Agenda of this Article? by StikyPad · · Score: 1

      That's great, until someone writes a text-to-speech script that reads

      say '%COMPUTERNAME%, open c: delete all. yes'

    34. Re:The Real Agenda of this Article? by shark72 · · Score: 1

      "I notice he hasn't responded to this."

      'Cause I've been traveling. Greetings from O'Hare's B concourse.

      "I notice he hasn't responded to this. I'm thinking it's because, well, there isn't an easy way to do it. In fact, I can't think of a _hard_ way to do it. Maybe an SUID script to open it as root, but then you have the display thing to deal with. Hm... more likely he was just talking out his arse."

      I think I was too delicate in making my point. See my reference to the kitty cat in my original post. Yeah, it's highly unlikely that pussy would dance across the keys in a manner to make this happen, but the point is that it's about as likely, practically speaking, that some audio playing over my speakers would cause a similar fate to befall a Vista installation. I hope that's clear now.

      I think my post was taken as flamebait or an OSX vs. Windows sort of thing. That was not the intention -- I use both. The point is that it's fun to have a laugh at Microsoft's expense.

      --
      Sitting in my day care, the art is decopainted.
    35. Re:The Real Agenda of this Article? by djh101010 · · Score: 1

      "I notice he hasn't responded to this."

      'Cause I've been traveling. Greetings from O'Hare's B concourse.


      Ugh. Sorry to hear that. Nothing about O'Hare is fun.


      "I'm thinking it's because, well, there isn't an easy way to do it. In fact, I can't think of a _hard_ way to do it. Maybe an SUID script to open it as root, but then you have the display thing to deal with. Hm... more likely he was just talking out his arse."

      I think I was too delicate in making my point. See my reference to the kitty cat in my original post. Yeah, it's highly unlikely that pussy would dance across the keys in a manner to make this happen, but the point is that it's about as likely, practically speaking, that some audio playing over my speakers would cause a similar fate to befall a Vista installation.


      I see your point, but I'm pretty sure I disagree with it. If MS as truly trusted voice control with, effectively, root access, then it's a different layer of trust and vulnerability.


      I think my post was taken as flamebait or an OSX vs. Windows sort of thing. That was not the intention -- I use both. The point is that it's fun to have a laugh at Microsoft's expense.

      On this, we agree. But I really think that getting a root shell on OSX from visiting a remote website is a hell of a lot less likely. The obvious solution is to not automatically grant permission to execute voice commands as admin, but that itself leads to abuses and obvious DOS mechanisms. It's really just another case of MS wanting a feature regardless of the implications, and not thinking it through or not caring.
    36. Re:The Real Agenda of this Article? by Anonymous Coward · · Score: 0
      Because configuring dangerous actions into a computer which can be triggered through voice recognition is a dumb idea?

      This exploit is such a classic Microsoft blunder, where one seemingly innocuous function (playing sound from a webpage) interacts with a system capability (responding to sounds) and creates a vulnerability.

      The analogy with HTML email is:

      1. HTML email looks nice (to some) - that's the innocuous function
      2. To render HTML requires retrieving documents from the internet (thus creating web bugs), execution of javascript and activex controls (user's system becomes compromised) and display of images (making it possible to exploit bugs in the image decoding logic)
      3. As these problems are discovered and exploited, Microsoft patches and patches the code to try to close all the holes.

      More examples: Word document viruses (the convenient macro feature which drives a too-powerful backend), attacks based on hiding the URL or making it look like some other URL (permitting domain names in non-ASCII character sets and/or allowing obfuscation of domain names and hostnames through %-encoding).

      Microsoft touts Vista as "armed with enhanced protections designed specifically for Internet use. Dynamic security protection in Windows Internet Explorer 7 helps guard your computer and your privacy against threats like malware, fraudulent websites, and online phishing scams". But here we are, just a day or two after release, and there's a known exploit for Vista.

    37. Re:The Real Agenda of this Article? by adolf · · Score: 1

      Not to reply to myself, but there's another possibility which I had just thought of:

      It's not really even necessary to do all of that DSP work.

      Just run the speech-to-text engine on the -output- of the sound card as well as the microphone input. Use the output as a filter for the input, such that any commands which exist at the output are automatically ignored at the input.

      Problem fucking solved. :)

  10. Voice controlled video player. Echo cancellation? by Anonymous Coward · · Score: 1, Insightful

    I remember someone once announcing a voice controlled video player, and wondered what would happen when it played a video in which someone shouted "Stop!"

    Microsoft's comments on the BBC site are poor. What microphone feedback? If it's not howling now it's not going to suddenly howl when someone tries this exploit. Clear dictation - but the attacker will make the dictation as clear as possible, and the consolation that the user will likely be in the room to hear it happening - what consolation is that?

    A solution would be to use echo cancellation as used in phone systems to prevent output from the speaker being used on the microphone.

      - Richard

  11. Format by jlebrech · · Score: 3, Funny

    "Open Terminal For Matt See Yes Im sure Reice Tart!!"

  12. I'm waiting for the audio exploit that responds to by StressGuy · · Score: 2, Funny

    the phrase "Simon Says"

    --
    A goal is a dream with a deadline
  13. A few ideas. by tg2k · · Score: 1

    Presuming the device drivers know what is being played, the system could try to detect that and mask it out if it comes back through the speakers. Or just disable speech recognition whenever audio is playing. An easy mute control for the browser would be nice as well. And, maybe security privileges even to play music? I'm sick of random websites that have to play ridiculous music when you visit them.

    1. Re:A few ideas. by Doctor+Crumb · · Score: 1

      Actually, Vista's DRM prevents such "Echo cancellation" from being implemented. See the "Indirect Disabling of Functionality" section in this document:
      http://www.cs.auckland.ac.nz/~pgut001/pubs/vista_c ost.html

  14. lip reading by Anonymous Coward · · Score: 0

    You can switch the speakers off, but what if the crackers' webcam can still see their lips moving?


    [Isert space oddysey 2001 music here]

    1. Re:lip reading by Anonymous Coward · · Score: 0

      You can switch the speakers off, but what if the crackers' webcam can still see their lips moving?

      Microsoft has already acknowledged this bug and has suggested as work arounds that 'A user can close their eyes while using a webcam'; or, 'if a user is unable to close their eyes, that they simply switch off the computer.'

  15. Restart? Really? by ignavusinfo · · Score: 1
    Why is it necessary to "restart [the] computer" to turn off speech recognition?

    As for the "exploit" ... windows will cause your computer to explode if you douse it with gasoline and set it alight too. Should there be a warning label and slashdot story to point that out?

  16. A Solution by Constantine+XVI · · Score: 0

    A good way to fix this would be to make the user hold down a button or buttons (like maybe WinKey+Space or both mouse buttons). Then it doesn't work without you meaning to put in a command.

    --
    "I think an etch-a-sketch with an ethernet port would beat IE7 in web standards compliance."
    1. Re:A Solution by 99BottlesOfBeerInMyF · · Score: 1

      A good way to fix this would be to make the user hold down a button or buttons (like maybe WinKey+Space or both mouse buttons). Then it doesn't work without you meaning to put in a command.

      If you're pushing a button, you can just type the command and it will be faster. No, this problem was solved long ago. In MacOS 7, the built in speech recognition had the option that it only listened to commands preceded by an arbitrary word. I believe the default was, "computer, open slashdot" as opposed to "open slashdot." (It was cool, like Star Trek.) Since most geeks named their computers some obscure hostname you had commands like "cheesemonkey, open slashdot." Since the creator of a Web page exploit won't know that name, they have a hard time initiating commands.

      Even better though is to filter out all output sounds from the input stream before processing. That way playing MP3s while working won't necessarily make the system nonfunctional, and you get the security benefit for free.

    2. Re:A Solution by alshithead · · Score: 1

      "A good way to fix this would be to make the user hold down a button or buttons (like maybe WinKey+Space or both mouse buttons). Then it doesn't work without you meaning to put in a command."

      Kind of defeats the purpose of voice command though doesn't it? :)

      We WANT those who are physically handicapped to be able to use the intarntet, eh?

      --
      I reserve the right to think for myself. Others' opinions are optional. Puppy on lap = typos...not illiteracy.
  17. Oh no... by ifknot · · Score: 0, Troll

    I forsee an exponential rise in loud annoying web sites shouting commands :( A n other reason (if more were needed) not to vi$tarize

    --
    we are all cosmic nuclear waste
  18. A Whole Decade of Nothing by 99BottlesOfBeerInMyF · · Score: 4, Interesting

    More than ten years ago I was playing with the speech recognition software that shipped with MacOS 7 or something and I though being able to check my e-mail without getting out of bed was pretty cool. At the time I wrote something about the technology and predicted that speech activated commands would never take off until: 1, most audio people listened to was controlled by the computer, and 2, the computer was smart enough to filter out the sounds it was emitting before processing commands. At the time a lot of people listened to music from their computer and I imagine many still do. Why can't the computer ignore all that sound? It knows it is outputting it so why not filter it? It is sad that the same missing feature is still a problem, so many years later.

    1. Re:A Whole Decade of Nothing by stewbacca · · Score: 1

      More sad than your story is the fact that speech recognition is being touted as a shiny new Vista feature.

    2. Re:A Whole Decade of Nothing by xappax · · Score: 4, Insightful

      Why can't the computer ignore all that sound? It knows it is outputting it so why not filter it?

      The sound that is output by the computer sounds similar to us when re-received through the mic and played back, but to the computer it's a totally alien waveform. A lot of distortion happens between when the computer sends a digital signal to the sound card and when it receives an analog signal from your microphone - so basically, the computer may know what it's playing, but it has very little idea how it'll sound when it reaches the mic.

      There are advanced filters and algorithms that can try to match and isolate particular patterns and "sounds" within a waveform, but they're not nearly as powerful as CSI would have us believe, and they also require far too much computing power to be run in realtime.

      Of course, the obvious low-tech solution to this issue is to wear headphones, as people in recording studios have for decades.

    3. Re:A Whole Decade of Nothing by Jerf · · Score: 4, Insightful

      The easiest answer to this question is, try it.

      Most simple schemes people come up with to address this are perfectly doable with a free sound program. Play some music, record the area while you're playing the music, then try your great idea. Like, you might think you can start out with inverting the source file and feeding it into the recording with a delay and modified amplitude. If you're really curious about this problem, this is a better way to learn about the difficulties then reading people on the internet, as, in my experience, you're quite likely to be skeptical about the explanations anyhow. The best (and in some sense, only true) explanations involve a lot of math.

      I can offer you this meta-rule, though: If it were so easy, it would already have been done. Many things that I see people posting on Slashdot about "Why don't they just do this thing?" are covered by this rule.

    4. Re:A Whole Decade of Nothing by 99BottlesOfBeerInMyF · · Score: 1

      The sound that is output by the computer sounds similar to us when re-received through the mic and played back, but to the computer it's a totally alien waveform. A lot of distortion happens between when the computer sends a digital signal to the sound card and when it receives an analog signal from your microphone - so basically, the computer may know what it's playing, but it has very little idea how it'll sound when it reaches the mic.

      If the computer can predict how various phonetics will sound coming from me, it should be able to filter phonetics from someone else issuing commands with reasonable tolerance. It should either fail to recognize the voice from the exploit, or be able to filter the voice from the exploit, one way or another. More generically, I've used noise cancellation technologies and they certainly can filter 75% of a musical performance or some-such, which is enough to greatly improve the accuracy of identifying the remaining input and probably make this exploit fail.

      Of course, the obvious low-tech solution to this issue is to wear headphones, as people in recording studios have for decades.

      If I'm wearing headphones (which I hate doing) I'm in front of my computer. If I'm in front of my computer, I'll type it as that is faster and more accurate.

      A simpler solution to mitigate the problem is to require the user to address the computer specifically. On Star Trek they say, "computer, what time is it." On my ancient Mac OS 7 box I issued the exact same command until I changed the string "computer" to be the hostname, which was more specific. If you say, "Bob, what time is it." Your computer can answer and it doesn't accidentally open a bunch of Web pages when you answer the phone (unless someone called "Bob" called you). When an exploit writer is embedding a malicious command, he doesn't know my computer's name and the chances it will execute right after I say the computer's name, but while I'm not making it unrecognizable by saying some other command, is pretty small.

    5. Re:A Whole Decade of Nothing by Al+Al+Cool+J · · Score: 1

      Fine. If raw acoustic filtering it is too difficult, then run any outbound audio stream through the same speech recognition software, and filter out any duplicate commands that result.

      Assuming of course that Microsoft's legions of highly skilled programmers can figure out a way to compare two text strings.

    6. Re:A Whole Decade of Nothing by 99BottlesOfBeerInMyF · · Score: 1

      I can offer you this meta-rule, though: If it were so easy, it would already have been done. Many things that I see people posting on Slashdot about "Why don't they just do this thing?" are covered by this rule.

      It's entirely possible this has been done. I haven't used voice recognition software in many, many years. Theoretically it is possible, and it was needed a decade ago. I don't think your rule applies to Microsoft, by the way. They frequently ignore the state of the art in an industry, especially where security and usability are concerned, and just go ahead an made a 1.0 version that is useless crap. This could easily be one of these cases. It is also possible that since voice recognition is useful for three areas (control method, authentication, and dictation) and noise cancellation is mostly important for the first one, that no research has really gone into this, like many other areas of OS research. If there is only one company that can bring their OS to market, and they don't research an area, it probably doesn't get done.

    7. Re:A Whole Decade of Nothing by xappax · · Score: 1

      That's a really good idea, I'm surprised nobody thought of it!
      Well, I guess if we're talking about Microsoft I'm not that surprised. But still, good point.

    8. Re:A Whole Decade of Nothing by Anonymous Coward · · Score: 0

      Though echo cancellation has been used in phone circuits for a while. I don't think it works perfectly well, or even has to work perfectly well to be effective enough. I think the simpler circuits were designed to deal with only electrical feedback over phone wires, which is easier than dealing with microphone feedback.

      A lot of mobile phones use noise gating though, which is what has been described here as "turn off voice commands whenever you're outputting sound". It's annoying when someone's phone does that as whenever you speak you can no longer hear them at all. It would be annoying to have to turn off the music to use voice commands.

    9. Re:A Whole Decade of Nothing by Anonymous Coward · · Score: 0

      "Assuming of course that Microsoft's legions of highly skilled programmers can figure out a way to compare two text strings."

      What are you implying by this? Are you saying that Microsoft's programmers are less skilled than yours or anyone elses? The programmers hired by various companies (and those stupid enough to work for free) all are trained by the same universities. There's no evidence that you are more skilled than an MS programmer. You sound like a tool that failed to make it through MS's interview process and are still quite bitter.

    10. Re:A Whole Decade of Nothing by fwr · · Score: 2, Insightful

      I call bull. What about that "echo cancellation" feature you find on all the popular web cam software? What about all the collaboration software out there that has echo cancellation? The basic premise is that if you don't use headphones and instead the computer speakers then the mic will pick up the sounds that the computer is transmitting from the other side, and you'll get an echo. Saying that it requires far too much computing power is incorrect. While it probably won't make it totally disappear, it will reduce the incoming signal from the mic to a level such that the voice processing feature on the computer won't be able to make out any of the commands. "totally alien waveform" right. Tell that to Sony and their noise cancellation headphones. If they can fit the technology in a headphone then a modern computer capable of running Vista certainly has enough horsepower.

    11. Re:A Whole Decade of Nothing by yerM)M · · Score: 1

      Just because you can't think of a solution doesn't mean there isn't one. The solution is to solve an easier problem. If an application is outputting sounds, automatically feed them into the the voice recognition software without executing the command. If commands are found, disable the recognition software with a big fat notification.

    12. Re:A Whole Decade of Nothing by Joelfabulous · · Score: 1

      I find it somewhat horrifying that it's even possible that CSI can be considered a standard to measure real-world applications' effectiveness. Seriously, with CSI, it's almost like Arthur C. Clarke's third law applies by default.

      --
      Sometimes I wonder if I think too much.
    13. Re:A Whole Decade of Nothing by Jerf · · Score: 1

      Who says I don't think there's a solution?

      What I am saying is that it's harder than you think.

      For instance, your proposed solution has serious problems with false positives that would render it useless in practice. Unfortunately, you can't "just try" that solution so easily.

    14. Re:A Whole Decade of Nothing by 14CharUsername · · Score: 1

      MS's programmers are skilled just unmotivated. I'm sure they could solve this problem, but they won't bother to unless a manager specifically tells them to do it. Otherwise they're just putting in time until they've worked at MS enough years that it looks good onthier resume so they can get a job somewhere decent.

    15. Re:A Whole Decade of Nothing by fencehouse · · Score: 1

      Couldn't they just run the voice recognition on the output sound and see if any commands were there, then compare them with any commands coming through the mic? It would then just disregard the those that matched. This would be much easier then trying to compare the actual sound. Although I do agree - headphones would be easiest and work best. Or better yet, just turn off voice recognition.

    16. Re:A Whole Decade of Nothing by *weasel · · Score: 1

      "For instance, your proposed solution has serious problems with false positives that would render it useless in practice."

      As he literally described it, sure.
      But his concept sounds quite workable to me, even if the implementation is flawed.

      Why not:
      . Run the output audio through the interpreter
      . any potential commands detected in the output go onto a 'watch list' for a second or two
      . if commands come in from the mic that match those on the 'watch list' - ignore them or optionally ask for confirmation

      if the user isn't in the middle of a voice 'session' the computer could even squelch all the watch list matches automatically. A 'session' being determined easily enough as starting with a 'go' word and kept alive for X time after a successfully processed command.

      E.g.

      If the algorithm detects 'delete star dot star' in the outgoing audio stream, it would simply ignore (or require confirmation for) any detected 'delete star dot star' from the mic for the next second or so.

      The big downside here, is running all outgoing audio through the speech filter. That may ultimately be more resource intensive than a true mathematical solution. However, it's a reasonable quick-n-easy hack that can be implemented today and is pushed off to under-utilized cores easily enough.

      At any rate: potentially nasty voice commands (anything resulting in changes or deletions) should require password verification and all commands should have several levels of 'undo'.

      --
      // "Can't clowns and pirates just -try- to get along?"
    17. Re:A Whole Decade of Nothing by xappax · · Score: 1

      Tell that to Sony and their noise cancellation headphones.

      Noise cancellation is a different game. Detecting and filtering out a constant background hum is much easier than removing a dynamic waveform. Noise cancellation headphones can filter out the sound of your air conditioner perfectly, but probably not a conversation next to you nearly as well.

      What about that "echo cancellation" feature you find on all the popular web cam software?

      You're right that there are echo cancellation features for many VOIP/teleconferencing type apps. which use the exact principle which was suggested here. However, the sound quality for these apps is usually pretty miserable, because the echo cancellation tends to cut out either too much or too little of the waveform, causing significant distortion. I mean, it's more than good enough to have a conversation with a human - you can understand them fine, but voice recognition software needs a much "cleaner signal" than a human does to extract meaning - it's crappy enough at understanding what's spoken into the mic, let alone an audio stream that's been mangled with noise reduction processing.

      I suspect that the type of echo cancellation used for teleconferencing would make the voice recognition accuracy significantly lower. But since someone else pointed out a much more elegant solution (do voice recognition on the outgoing audio stream and filter any commands which match on both outgoing and incoming audio), there's no reason to worry about it anyway :)

    18. Re:A Whole Decade of Nothing by kruhft · · Score: 1

      Run a couple of 'beep' tests to find the latency and attenuation of the output sound as compared to it's return into the system through the microphone. Scale the output waveform by the attenuation and offset by the latency time and subtract that from the incoming waveform and that should cancel a lot of the outgoing signal enough to disable the speech recognition. Not too much computation, but then again I haven't tested and it's off the top of my head....

    19. Re:A Whole Decade of Nothing by Uberleet+Superninja · · Score: 1

      If Vista dedicates an audio path to each and every application running on your system, why not create a subroutine that listens for commands on the output of these applications? If it receives a command from any other path than the mic input it puts that command into a list of temporarily ignored commands so that if it receives the same command from the mic input within a certain time frame it will ignore it. Under normal circumstances you should never see a voice command originate from any other source than the mic, and the time frame would be incredibly short as the latency involved is very low. This would eliminate the problem entirely and would not require nearly as much processing power as running real time wave form mapping and inverse transfer functions for noise cancelation.

    20. Re:A Whole Decade of Nothing by Anarchitect_in_oz · · Score: 1

      Could you also play on the shortcoming of voice rec system needing a clean stream
      and add mild distrotion to the speaker output to render any outgoing speach useless.
      Say in the frequency range of the Concentates which is the part of speach that carries the most meaning.

      With all the tools in the box surely the best one is to have a trigger switch, like the computer name in OS X or the comm badge in Star trek, that can turn on the system and the performance hit that it involves.

      --
      "Call us when the New age is old enough to drink" Beck
    21. Re:A Whole Decade of Nothing by zimm0who0net · · Score: 1

      Even if you fixed this problem and were able to effectively remove the outgoing audio from the incoming this would still be an exploit for any computer near the computer making the sounds. Your cube-mates computer has no idea if the playback is coming from your computer or from your cube-mate himself. I can imagine someone in a crowded office going to a site with the audio "Shutdown Now" embedded and suddenly the whole office shuts down..

    22. Re:A Whole Decade of Nothing by Jester99 · · Score: 1

      "Totally alien waveform"? I admit it will be a different waveform than the one it sent to the sound card, but to call it "totally alien" implies that there is absolutely no correlation. I actually bet it'd probably be very highly correlated.

      I bet that a simple convolution filter would be able to cancel out an overwhelming percentage of the output sound.

    23. Re:A Whole Decade of Nothing by Deadplant · · Score: 1

      Ok, so voip software echo-cancellation may be clunky but what about pro-audio echo cancellation?
      I'm thinking here of the rock-concert stuff. The audio output is of high quality and yet the mics pick up a heck of alot of output from the speakers. Somehow the manufacturers of this equipment manage to eliminate the echo (usually)completely and in real-time without computers.
      I have no idea what technique is used but I DO know that it works very very well.

    24. Re:A Whole Decade of Nothing by SillyNickName4me · · Score: 1

      Are you saying that Microsoft's programmers are less skilled than yours or anyone elses?

      That is the impression they give from the products they created.

      The programmers hired by various companies (and those stupid enough to work for free) all are trained by the same universities.

      Ah, you are one of those. You come from university X so you MUST be smart.... If you didn't find out yet that that has absolutely nothing to do whatsoever with being smart or not then you have a long way to go still.

      There's no evidence that you are more skilled than an MS programmer. You sound like a tool that failed to make it through MS's interview process and are still quite bitter.

      Rather, you sound like a Microsoft programmer feeling personally attacked by his statements.

    25. Re:A Whole Decade of Nothing by 99BottlesOfBeerInMyF · · Score: 1

      Even if you fixed this problem and were able to effectively remove the outgoing audio from the incoming this would still be an exploit for any computer near the computer making the sounds.

      I don't think using voice recognition in a location where your computer can hear other people's computers would work anyway. I mean, if you have 20 people all within hearing range of each other all using voice commands, the resulting clamor would likely make performance uncertain and drive everyone insane with the constant babbling. I just don't see this as a problem. I have an office and maybe four people can hear me if I talk in it. If I take a phone call on my cell, I go to a conference room so as not to disturb anyone. Using voice recognition and and talking all day where other people can hear would be annoying as all hell and very rude. Finally, if I'm in a cube, I'm not likely to be very far from my computer in the first place. The keyboard is a faster, more accurate input mechanism for commands. Voice recognized commands are great for when I'm doing something else with my hands, or don't want to get up from the couch. That is not a cubicle type situation.

  19. So.... by Vokkyt · · Score: 1

    suppose you write an executable that displays a simple image (let's assume everyone is thinking of goatse) and gives the executable a common title that the Voice Control may pick up; is this the new spam/spyware? Companies send out spyware that activate on common words that Vista picks up? Incidentally, initially I was reminded of Futurama: Farnsworth: "Shut up, friends. My Internet browser heard us saying the word "Fry" and it found a movie about Philip J. Fry for us. [The staff gather around.] It also opened my calendar to Friday and ordered me some French fries."

  20. OS X? by 3.14159265 · · Score: 1

    I wonder how Apple goes around this problem...

    1. Re:OS X? by Anonymous Coward · · Score: 0

      1. You can set it so you have to say the computer's "name" before any command -- sort of like a password
      2. Or you have to hold down ESC to say anything

    2. Re:OS X? by Anonymous Coward · · Score: 0

      You need to hit a key (configurable), or speak a name first.

      "(esc) Tell me a joke"
      'Knock Knock'

      "Computer, tell me a joke"
      'Knock Knock'

    3. Re:OS X? by gkearney · · Score: 2, Interesting

      I tried this on MacOS X version 10.4.8 (the latest version) I could not make the mac respond to voice commands being played from the speakers or from patching the sound out into a iMic. Here is what I did.

      1. Ran the voice command option and configured it as apple suggests.
      2. Made sure that the voice command understood my command by issuing several and getting the correct replys back from the system.
      3. Recorded the command "What time is it?"
      4. Played back the command with voice commands on.

      The mac did not respond. I then tried the same thing with a patch cable between the output and a iMic USB audio adapter. It still would not respond from the recording bout will respond to my voice. I have no idea how Apple is able to distinguish where the voice is coming from.

  21. In One Ear and Out the Other by Doc+Ruby · · Score: 1, Interesting

    The damn OS is playing the audio. The damn speech-rec software is doing echo cancellation. Vista should be testing its incoming audio to detect whether it matches any outgoing audio that Vista is playing. What an incredible load of bullshit.

    The quality of MS security analysts working on Vista is revealed to be very dim by this explot. This kind of exploit and defect in the Vista multimedia architecture speaks very badly of the prospect for the next 5 years of MS operating systems. They're a plague.

    --

    --
    make install -not war

    1. Re:In One Ear and Out the Other by itsme1234 · · Score: 2, Insightful

      "Vista should be testing its incoming audio to detect whether it matches any outgoing audio that Vista is playing."

      I guess you never saw a room with more than one computer in it.

    2. Re:In One Ear and Out the Other by Doc+Ruby · · Score: 1

      Of course I have, but it's very unusual (despite Slashdotters' geeky preferences). And most of those computer rooms have admins who can control the audio and security, even if they're just the kind of savvy user with multiple computers for their own use.

      Mostly this will attack single users of single computers in their homes and offices. That threat can be mitigated in the single OS instance. There are other threats like the one you imply, but that's no reason not to fix the biggest one that is easier to deal with.

      --

      --
      make install -not war

    3. Re:In One Ear and Out the Other by Jah-Wren+Ryel · · Score: 1

      Wasn't this exact problem in the list of DRM-created pointless sucks from the "Vista - Longest Suicide Note in History" paper? I seem to recall craptastic echo-cancellation was expected due to the secure-path audio drivers not allowing any other software to listen to the outgoing sounds. MS's rebuttal said that they thought the API provided enough information to do echo cancellation anyway. I guess they were wrong. Surprise!

      --
      When information is power, privacy is freedom.
    4. Re:In One Ear and Out the Other by SillyNickName4me · · Score: 1

      Of course I have, but it's very unusual (despite Slashdotters' geeky preferences). And most of those computer rooms have admins who can control the audio and security, even if they're just the kind of savvy user with multiple computers for their own use.

      Having an office where there are multiple people sharing the same room is very exceptional?

      With all respect, in approx any company with more then one office worker, it is the norm, not the exception.

  22. this makes for some fun sound files by SashaMan · · Score: 2, Funny

    website sound: "All your base are belong to us"
    Vista: "Do you want to reformat your hard drive?"
    website sound: "All your base are belong to us"
    Vista: "Are you sure you want to reformat?"
    website sound: "All your base are belong to us"
    Vista: "Reformatting.........."

  23. Shit... by thousandinone · · Score: 5, Funny

    I just watched 2001: A Space Odyssey on my machine... this may be my last post.

    1. Re:Shit... by VGPowerlord · · Score: 1

      Windows, open Pod Bay Doors.doc.
      "I'm sorry Dave, I'm afraid I can't do that."

      --
      GLaDOS for President 2016! "Well here we are again. It's always such a pleasure." -- GLaDOS, 2011
    2. Re:Shit... by Archangel+Michael · · Score: 1

      "Windows, open iPod Bay City Rollers"

      "I'm sorry Dave, I'm afraid I can't do that."

      --
      Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
    3. Re:Shit... by VGPowerlord · · Score: 1

      I like yours better. It makes more sense. :D

      --
      GLaDOS for President 2016! "Well here we are again. It's always such a pleasure." -- GLaDOS, 2011
    4. Re:Shit... by kubrick · · Score: 1

      My God, it's full of bugs!

      --
      deus does not exist but if he does
  24. Nothing new here by Ruprecht+the+Monkeyb · · Score: 5, Funny

    Years ago when I worked in a shop that used OS/2 (one late version of which included speech recognition), we used to play pranks on each other all the time using that 'feature'. Things like changing a startup sound to be two minutes of silence followed by a verbal shutdown command, or changing confirmation prompt sounds to be 'cancel'. Good fun. The random 'select all / delete / yes' was the best, though.

  25. or by www.sorehands.com · · Score: 3, Informative

    The geek watching Andromeda. "Fire all missles"

  26. Hey, no need to panic... by Bertie · · Score: 3, Informative

    I mean, look:

    "Microsoft has said that even if the machine was primed to accept voice commands it would be unlikely the user would not be in the room to hear the file with malicious instructions being played."

    Yeah, nobody ever leaves their computer unattended.

    And of course, it would be completely impossible for a Trojan to pipe appropriate sounds directly to the input buffer of the sound hardware, thus negating the need for it to be played through your speakers at all. As we all know, Windows is completely watertight against that sort of thing.

    This raises an interesting possibility, though - what if you could confuse the recogniser itself into making false positives? You could, for example, persuade it to recognise silence as a command of your choosing.

    Best way round this is probably to prevent people doing potentially destructive operations via voice commands. But if this isn't suitable, you could employ clever confirmation strategies, like "If you're sure you want to delete c:\windows, please say the following words..." with the words in question being drawn from a dictionary. No malware could anticipate the sequence (although I suppose you could set the recogniser to work against itself, by playing the text-to-speech engine's own output back to it and triggering recognition).

    Hmm. Promises to be quite fun, this.

    1. Re:Hey, no need to panic... by Yetihehe · · Score: 1

      If you already have trojan on machine, why try to crack it with sounds?

      --
      Extreme Programming - Redundant Array of Inexpensive Developers
    2. Re:Hey, no need to panic... by Bertie · · Score: 1

      Any hole's a goal, right?

      By which I mean, if there's a vulnerability, exploiting it's as good as exploiting any other. Microsoft have spent a lot of time closing a lot of back doors in Vista, but this just opens up a great big new one.

    3. Re:Hey, no need to panic... by Anonymous Coward · · Score: 0

      If your program is already running on the computer why not just delete the files directly?

    4. Re:Hey, no need to panic... by xappax · · Score: 1

      I think what Yetihehe meant was that if you've got a trojan running on a machine, you already have the ability to run arbitrary code with that user's privileges. Using your trojan to execute voice commands would allow you to...run arbitrary code with that user's privileges - it doesn't grant any further access, and therefore can't be considered an exploit, and it would be unreasonable to think that it could be prevented.

      Trojans are usually the end result of the exploit. A security hole is used to run arbitrary commands which download and install the trojan. So the real threat is that an attacker will use voice commands to gain a trojan-based foothold in your system, and then do the rest of their dirty work using conventional means.

    5. Re:Hey, no need to panic... by Bertie · · Score: 1

      Oh, I see. So I suppose this isn't any more dangerous than any other trouble that could be caused by a Trojan. Fair enough.

  27. howto for Mac users by sootman · · Score: 4, Informative

    to create malicious audio files with OS X (10.3 or later), fire up Terminal and use 'say':
    $ echo "format sea slash you" | say -o evil.aiff
    This makes your messages with a nice, clear, even voice--wouldn't want a bunch of 'um's and 'ah's borking up your exploit, now would you. :-)
    `man say` for more options.

    --
    Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.
    1. Re:howto for Mac users by Dan100 · · Score: 1

      Vista doesn't allow drive formatting by voice instruction.

    2. Re:howto for Mac users by powerlord · · Score: 1

      Ah ... that should be the first thing the voice instruction should patch :)

      --
      This space for rent. All reasonable inquiries will be entertained at proprietors discretion.
  28. Fraternity Fun by Zerth · · Score: 5, Funny

    If they don't prevent them from running arbitrary commands, you know 5 years in the future that every time term end comes around there will be some naked freshman running through the uni library/labs shouting "quit without saving! yes! reboot! yes! shutdown -h now!"

  29. Pretty Silly by ThinkFr33ly · · Score: 1

    There are so many mitigating factors with this that a successful exploit of this "bug" is extremely unlikely.

    First of all, as was mentioned in the article, voice recognition cannot bypass User Account Control. So that immediately limits damage to the local profile.

    Second, the user would see all of this happening and would have to remain silent for this to work. It's not like a piece of code executing. The commands are not particularly speedy. They would see dialogs flashing, hear the commands being spoken, and decide not to do anything about it. All it would take is the user saying something or turning down their speakers and it would likely be enough to stop things from proceeding.

    The danger with this is extremely limited and unlikely. It certainly has some novelty value, though.

  30. Time for that hit song, "Format Sea" by 192939495969798999 · · Score: 1

    Sailing, sailing over the format sea: /yes!

    --
    stuff |
  31. Every should know. by www.sorehands.com · · Score: 1

    It was in Dilbert years ago. Can't remember which characters, but it had one showing the other their speech recognition system, and the other said what would happen if I said "DELETE ALL FILES"?

  32. Yakking by SilverJets · · Score: 2

    As my coworker said when I told him about this, "That's not hacking it's....yakking!"

    (Or yacking for those who prefer the alternate spelling)

  33. We've been waiting for this (and joking about it) by Qbertino · · Score: 5, Funny

    Me and my friends have been waiting for this and joking about it since IBM Via Voice and Dragon Speak. A whole new era of IT pranks and cyberterrorisim awaits us. Imagine bursting into a room full of PCs and yelling

    "FORMAT DRIVE C! CONFIRM!".

    Instant fun.
    Makes me feel all soft and gooshy inside just thinking of it. :-)

    --
    We suffer more in our imagination than in reality. - Seneca
  34. Predictions from the past ... by Gopal.V · · Score: 4, Funny

    Userfriendly had predicted the fate of voice recognition six years ago - rm -rf / and yet again !.

    1. Re:Predictions from the past ... by itsme1234 · · Score: 1

      This is EXACTLY what I had in mind - I even looked for those 2 stripes myself. After six years somebody else goes to the same semi-obscure reference. Spooky.

    2. Re:Predictions from the past ... by Trogre · · Score: 1

      You missed my favourite.

      --
      "Nine times out of ten, starting a fire is not the best way to solve the problem." - my wife
  35. What about UAC by mobby_6kl · · Score: 1

    Ok, I think the "exploit" is ridiculous, but what I do find interesting is how would it deal with UAC? If the commands ask the computer to do something dangerous, the system should prompt the user with the privilege elevation dialog which is on a separate secure desktop and so shouldn't react to anything but direct user input. Anybody tried that?

    1. Re:What about UAC by rednuhter · · Score: 1

      If the commands ask the computer to do something dangerous, the system should ...
      Dial a predesignated phone number which must be for someone other than the registered user who can speak the secret pass phrase which in turn can only be discovered by said non-registered user by visiting a website via WAP noted by the automated caller and entering their mothers PIN.
      Reminds me of the support call I took from a woman complaining about the lack of features to recover items that had been deleted from the "Deleted Items" folder after she clicked OK to the message informing her that "Items would be permanently removed".
      I think she got escolated to a supervisor to complain.

      --
      ERR 411[Max number of witty sigs reached]
    2. Re:What about UAC by Anonymous Coward · · Score: 0

      Mod parent up. This is the real funny bit behind targeted negative impact reporting... this "exploit" would not work on vista exactly because of UAC.

      I have to admit that I love UAC - not for myself, but I have my parents running vista, and this is the first OS where they run locked down. No installing on their part, they can't actively damage the computer. Loving it. Also: the repair and reinstall features on the DVD rock.

  36. All right! by manifoldronin · · Score: 1
    I for one welcome our new shout-format-c:\-across-the-room overlords.

    Man, now I can't wait for the wide business adoption of vista. That would be the beginning of a new era in the history of office spanking.

    --
    Tyranny isn't the worst enemy of a democracy. Cynicism is.
    1. Re:All right! by manifoldronin · · Score: 1

      Man, now I can't wait for the wide business adoption of vista. That would be the beginning of a new era in the history of office spanking.
      And apparently, improper sexual conduct in a modern office would be deemed far worse than any attempt to format others c:\, so I'll just settle with "office pranking".
      --
      Tyranny isn't the worst enemy of a democracy. Cynicism is.
  37. Re:I'm waiting for the audio exploit that responds by bazorg · · Score: 1
    On OS X you can set "simon says" or anything you please as the trigger for the computer to accept what is said next as a voice command. "Beeatch" and "arrr!" are popular choices.

  38. this is HYSTERICAL by sootman · · Score: 1

    So, the "solution" is to turn of speakers and/or microphone. This is the same MS whose solution to a recent Office exploit was "don't use Office for a couple days."

    It's been said that the only secure computer is one that has been unplugged, encased in cement, and thrown in the ocean. I didn't know MS was planning to make this their official support policy. "Security flaws? No problem. Just DON'T USE IT AT ALL."

    Wow, they're good.

    --
    Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.
    1. Re:this is HYSTERICAL by UnknowingFool · · Score: 1

      That remind of that old joke:

      How many Microsoft employees does it take to change a lightbulb?

      None. Microsoft will redefine darkness as the standard.

      --
      Well, there's spam egg sausage and spam, that's not got much spam in it.
  39. And today's message by Frozen+Void · · Score: 1

    A:\Format C: /autotest

  40. Maybe a good start, but not that easy by mopslik · · Score: 2, Insightful

    Vista should be testing its incoming audio to detect whether it matches any outgoing audio that Vista is playing.

    I imagine it's not quite so straightforward. You'd need to take into account room acoustics, hardware effects, generic ambient noises, or even other interfering sounds in the same room that could all interfere with a comparison of outgoing sound to incoming sound. It's very rare that you'd ever have a time where your outgoing sound file exactly matches one that is sensed coming from the speakers.

    1. Re:Maybe a good start, but not that easy by MindStalker · · Score: 1

      Doesn't matter just install the same speech recognition on the outgoing sounds. If the outgoing matches the incoming ignore it. Hell you don't even need speech recognition just classic sound pattern matching will do, a slight time delay and some small environmental changes won't fool most programs that do pattern matching.

    2. Re:Maybe a good start, but not that easy by Doc+Ruby · · Score: 1

      The other reply has got it right. Just make Vista recognize the speech Vista is emitting as audio, and compare those recognized symbols to the recognized speech symbols coming through the microphone. It would be easy to tell that the same symbols are coming out the speakers as coming in the mic. In fact that should also increase the recognition of the speech actually originating in the room.

      --

      --
      make install -not war

    3. Re:Maybe a good start, but not that easy by flibuste · · Score: 1

      You have no idea how complex pattern matching is in sound analysis. People complain that Vista is slow. Add this kind of "feedback cancellation" and you won't be able to display more than one icon per hour.
      Is it worth it? NO.

  41. Shocked! by Andrei+D · · Score: 3, Funny

    I am shocked! Damn you Bill, I really believed you when you said Vista is "dramatically more secure than any other operating system released". My world view is turned upside down now :(

    --
    We often refuse to accept an idea merely because the tone of voice in which it has been expressed is unsympathetic to us
    1. Re:Shocked! by Barlo_Mung_42 · · Score: 1

      I'm not sure how much less secure we are because of this "exploit" but the fact that the potential for it has been around for years and we are just now talking about it must mean that Vista's voice recognition is much improved, and that IS a good thing.

  42. Best. Prank. Ever. by copponex · · Score: 3, Funny

    Find office with 10 or 15 stations with shiny new copies of Vista. Verify through other means that mics and voice commands are on. Run in, and yell as loud as you can the commands that will shut down the machines. Don't run out yet!

    Watch people panic at their keyboards. Listen to their gasps as the hard disk spins down and their monitors cut off, at which point they all stare at you. Wave. And then run.

    1. Re:Best. Prank. Ever. by SuiteSisterMary · · Score: 1

      In the novel Head Crash, by Bruce Bethke, the main character mentions how he password protects the shutdown feature of his portable computer, rather than the startup. The reason is something like 'a freshman prank during the week before the final papers were due, involving runing through the library shouting 'SHUTDOWN!'

      --
      Vintage computer games and RPG books available. Email me if you're interested.
  43. Re:Restart? Really? by ultranova · · Score: 1

    As for the "exploit" ... windows will cause your computer to explode if you douse it with gasoline and set it alight too. Should there be a warning label and slashdot story to point that out?

    If Windows came with a canister of gasoline and a lighter, then yes, there should.

    It's a bit like with ActiveX: letting any website to execute arbitrary code in your machine is a bad idea, no matter what the underlaying OS. But only Windows does it by default.

    --

    Forget magic. Any technology distinguishable from divine power is insufficiently advanced.

  44. auto refresh by cdn-programmer · · Score: 1

    All a website needs to do is set autorefresh and load the exploit page x minutes after the innocent page and only once.

    Many users open a web page and walk away.

  45. Vista tech support by Anonymous Coward · · Score: 0

    MS Staffer: Yeah, uh, just hold your phone up to the microphone.
    Customer: OK. Hold on.
    MS Staffer: <whispers>erase all files.
    Customer: Oh my god! I think it's getting worse...did you do something?
    MS Staffer: Uh, no. By the way, you have a bad hard drive...you're going to need to call Dell tech support to resolve this issue. Thanks for calling and have a nice day.

    *click*

    MS Staffer: Hey Jack! How many calls do I have to handle a day to get that juicy bonus?

  46. Hmm.... by Hosehead17 · · Score: 1

    Having played around with the speech recognition in Vista, it is very easy to turn the speech recognition off when you are not using it. I don't remember the precise phrase, but you just say something like "stop listening". A kid in one of my classes had a Mac and used speech recognition on it. When he was using speech recognition we would shout different phrases in an attempt to shut his machine down, among other things. I think we only got it to shutdown once, most of the time it didn't work b/c we were too far away, or he turned that feature off. I've had the same experience with Vista. This sounds like FUD to me.....

  47. Correct me if I am wrong by Karem+Lore · · Score: 1

    Wouldn't you get feedback through your speakers before being able to do anything possibly damaging?

    --
    When all is said and done, nothing changes...
    1. Re:Correct me if I am wrong by Bob-taro · · Score: 1

      Wouldn't you get feedback through your speakers before being able to do anything possibly damaging?
      I wouldn't think so because the sounds picked up by the mic aren't typically played back out through the speakers, so there's no feedback loop. I have a home studio and record to a computer and feedback (due to the computer's mic, anyway) has never been a problem.
      --
      Prov 9:8 Do not rebuke mockers or they will hate you; rebuke the wise and they will love you.
    2. Re:Correct me if I am wrong by Anonymous Coward · · Score: 0

      Only if you have the sound driver set to output the sound input from the microphone, which most people do not.

  48. Oh the irony... by stewbacca · · Score: 1
    If you search Google for "Microsoft Vista speech control" the first hit gives you this gem:

    "Windows Speech Recognition was built using the latest Microsoft speech ... you are always in control;"

  49. meme crashes vista roll-out by Anonymous Coward · · Score: 0

    what a superb meme this is I just knew vi$ta would have an exploit, I just didn't realise there wouldn't be any programming involved "the meme is mightier than the code" hype it up & shout it out!

  50. Another Apple first! by Anonymous Coward · · Score: 0

    Sigh. Microsoft's bug is just mimicing OSX, which recently had the same problem.

    The Month of Apple Bugs: showing all the ways Apple continues to innovate!

  51. Simple Fixes by Anonymous Coward · · Score: 0

    1. Expect keyword before commands
    2. If voice pattern is unknown await confirmation via ui dialog
    3. Don't execute voice commands while outputting sound

  52. I was EXPLOITED! by Anonymous Coward · · Score: 0

    I had this all set up to test when the local walmart ad started playing on the TV.

    As soon as the TV announced kid's pants were half off, Windows Media player came on playing the Michael Jackson MP3 Pretty Young Thing

  53. I'm feeling anal today, so ... by spellraiser · · Score: 4, Insightful

    An exploit is, by definition, a successful manipulation of a bug/omission/hole/whatever in a computer system to make it perform something that it was not designed to do. Usually this term is only applied when said action is harmful or potentially harmful.

    What is being described here is the possibility of controlling the voice recognition system in Vista remotely to make it perform potentially harmful tasks. Furthermore, this functionality is not something that said system was designed to do; it was only designed to accept commands via microphone.

    Therefore, what is being described here is an exploit.

    Q.E.D.

    --
    I hear there's rumors on the Slashdots
    1. Re:I'm feeling anal today, so ... by Actually,+I+do+RTFA · · Score: 1

      The exploit operates through the microphone, so that's not the exploit. Maybe the fact that there was an 'omission' (from your criteria of an exploit) that the speech recogntion software doesn't have any speaker identification, but I doubt that was accidentally left out. It probably wasn't in the spec.

      --
      Your ad here. Ask me how!
  54. Oh, dear. by jonadab · · Score: 1

    Speech control is on by default? That shouldn't be. Quite aside from any internet-related remote exploit issue, it's going to create problems if there's more than one person in the room with the computer. Granted, most computers don't have a mic, so for them it won't be an issue, but still.

    There's also the question of why we would want our web browsers to play sounds, but I think we've lost that batte.

    --
    Cut that out, or I will ship you to Norilsk in a box.
    1. Re:Oh, dear. by Anonymous Coward · · Score: 0

      Speech control is off by default and has to be explicitly activated.

    2. Re:Oh, dear. by jonadab · · Score: 1

      > Speech control is off by default and has to be explicitly activated.

      Oh. Good. That's as it should be. The article summary seemed to imply otherwise.

      I still don't want my web browser playing sounds, but I seem to be in the minority on that one.

      --
      Cut that out, or I will ship you to Norilsk in a box.
  55. "Hi, I'm a Mac..." by starglider29a · · Score: 2, Funny

    The Vista replies, "And I'm a PC."

    1. Re:"Hi, I'm a Mac..." by StikyPad · · Score: 1

      The Vista replies, "'Hiymamac' is not recognized as an internal or external command, operable program or batch file."

  56. Bah... by eno2001 · · Score: 5, Funny

    I expect someone to come up with a site that says:

    "Start Internet Explorer"
    "Go aytch tee tee pee colon slash slash gee oh ay tee ess ee dot see ex"

    Brrr...

    --
    -"...bad old ideas look confusingly fresh when they are packaged as technology" - Jaron Lanier (Digital Maoism on Edge.o
    1. Re:Bah... by Terrasque · · Score: 1

      Basically a Rube Goldberg redirect? :)

      --
      It's The Golden Rule: "He who has the gold makes the rules."
    2. Re:Bah... by Anonymous Coward · · Score: 0

      I'm visually impaired, so I browse Slashdot with a screen reader you insensitive clod!

      My guide dog just yelped and ran away...

  57. That reminds me of a story about a Mac user... by pestie · · Score: 1

    This reminds me of a story I heard years ago about one of those smug, irritating Mac users who thought he was just so cool 'cause his computer did stupid little tricks like voice recognition. He had some sort of voice-command software installed on his Mac and he bragged about it (and other Mac things) to his fellow cube-dwellers constantly. At some point, though, someone figured out that one of the commands was "shut down," at which point his co-workers would regularly walk by his cubicle and yell "shut down!" at the computer, which would immediately and happily comply. Mac Boy uninstalled his voice-recognition software shortly thereafter.

  58. MS Security Response Blog: Adrian responds by davidwr · · Score: 4, Informative

    Adrian responded to this on the Microsoft Security Response Blog.

    Issue regarding Windows Vista Speech Recognition

    Hey everyone this is Adrian and I am writing to try and clear up some concerns regarding a recently reported vulnerability in the Speech Recognition feature of Windows Vista. An issue has been identified publicly where an attacker could use the speech recognition capability of Windows Vista to cause the system to take undesired actions. While it is technically possible, there are some things that should be considered when trying to determine what the threat of exposure is to your Windows Vista system.


    He goes on to list reasons why this is not a major issue. The first being that voice commands have to be turned on and configured for this to work.

    He ends with

    While we are taking the reports seriously and investigating them accordingly I am confident in saying that there is little if any need to worry about the effects of this issue on your new Windows Vista installation.

    I think he's right. If this was a serious problem, the MacOS and OS/2 "exploits" mentioned above would've received a lot more press. Still, I expect in a future version, the voice software will be smart enough to ignore the computer's own output.

    Personally, I don't like voice commands. They are necessary for users with certain impairments and useful for certain applications such as kiosks, but they are counterproductive in a shared-office environment and just plain weird on my desktop. Even on Star Trek - The Next Generation much of the computer input was via control consoles not voice.

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
    1. Re:MS Security Response Blog: Adrian responds by Anonymous Coward · · Score: 0
      Yeah, sure - but to turn it on you can use:

      - another exploit fiddling with settings at the user level

      - social engineering (i.e. to configure SuperGameX, please make sure you have, blah, blah)

      - statistics (i.e. a large target base will ensure that SOME will have it on)

      And one might also be able to re-route other audio input into the recognition engine. So, the question really is, how secure is the recognition engine's "system(text_from_speech)" command ...

    2. Re:MS Security Response Blog: Adrian responds by Anonymous Coward · · Score: 0

      "Even on Star Trek - The Next Generation much of the computer input was via control consoles not voice."

      Now that I think of it, I think on Star Trek, computer voice commands are only for queries.
      "Computer, what is the location of Riker?"
      "Computer, what are the chemicals that do blah blah blah?"

      Whenever a computer is asked to actually *do* something, it's done through those wierd illogical keyboards. :p

  59. Really easy solution by nmg196 · · Score: 1

    All they need to do is disable audio commands (other than "stop" or "pause") whenever any sound is being played. That way - skype etc would continue to work normally (full duplex) and the speech recognition system wouldn't even be listening if a web page or MP3 or any other application was making noises.

  60. Seriously? by Anonymous Coward · · Score: 0

    Well. I'm sure that effects all nine of you who actually use speech control.

  61. Haha by BlenderFX · · Score: 1

    lol, this made my day :)

  62. Re:Restart? Really? by inquisitor · · Score: 2, Insightful

    It's not necessary to restart the PC to turn off speech recognition - just say "stop listening" or click on the always visible recognition toolbar to turn the microphone off. It's also not on by default either, and only those interested in it will find it anyway. Not really an "exploit" that's actually exploitable.

  63. Um by yesthatmcgurk · · Score: 1

    How is this a Vista vulnerability? Any speech recognition program is vulnerable to this. Also, its vulnerable if somebody walks by and talks into your microphone. Or if you have the phone on speaker. Glad the release of Vista brought this kind of voice recognition vulnerability to the forefront.

  64. Microsoft confirms this... by Anonymous Coward · · Score: 0

    ...in a special press conference they held today. The whole conference video can be seen online here [microsoft.com]. (Note to Windows Vista users: turn off your microphone and speakers at the time 2:35 into the video where the spokesperson says "...this vulnerability can be used to execute dangerous commands, such as: "DELETE C:\*.* /S /Q /F" by playing special wave file...").

  65. Brilliant! by Kozar_The_Malignant · · Score: 2, Insightful

    The security advice is "A user can turn off their computer speakers..." before playing an audio file. We can also solve the problem of porn getting into our school network by unplugging the monitors. I didn't realize this security stuff was so easy.

    --
    Some mornings it's hardly worth chewing through the restraints to get out of bed.
  66. DRM Biting them in the ass? by nobodyman · · Score: 1
    I'm sure that they are aware of the importance of noice cancellation. However, what do you want to bet that their own DRM hand-wringing is the culprit? My theories:
    • They *can't* do proper noise cancellation, because you does not allow you to set up a pass-through loop. This is to prevent you from re-encoding your DRM music in an unprotected format, but it's also necessary for noise cancellation.
    • They try to noise cancel as best they can, but external microphone inputs are purposely degraded so as to prevent you from recording copyrighted music. Maybe noise cancellation is hamstrung because the degraded signal doesn't match the playback source?


    I'm pretty ignorant of the science behind all this but I'm certain that DRM concerns are a big factor in why this exploit is even possible. I expected this DRM business to bite them in the ass, but I didn't expect it to bite them on day one. ouch.
    1. Re:DRM Biting them in the ass? by Doc+Ruby · · Score: 1

      I think they're doing noise cancellation on the mic. But your point about MS DRM possibly preventing them from decoding between the network and the speaker, because that conflicts with the MS "end to end DRM" model, is probably right on. Or, at least, if they can decode before the speaker for "preemptive speech recognition", that's a major hole in their multimedia DRM "between the ends".

      Either way, they can't do both. And "biting itself in the ass" is a great way to describe the loop MS is now caught in. Monopolists never learn that you can't have it all: where would you put it?

      --

      --
      make install -not war

  67. In Vista no one can hear you scream by Anonymous Coward · · Score: 0

    In Vista no one can hear you scream, except for your computer which shuts down promptly.

  68. MacOS 9 used to recognize users by voice by Lord+Satri · · Score: 1

    All voice recognition software, no matter what platform, would suffer from this supposed "exploit".

    Some time ago already, MacOS 9 used a voice recognition authentication to login (voiceprint passwords). Your mac recognized you. Such a exploit would not have worked.

    This feature was dropped from OSX. But the principle remains. I'm not sure about identifying the user for all voice commands, but why not?

  69. Re:Voice controlled video player. Echo cancellatio by Code+Master · · Score: 1

    Echo cancellation on a PC is very difficult. Many systems have slightly different sampling rates for output compared to input. A difference of hundredth of Hz is enough to screw them up. The computation is also nothing to ignore at high sampling rates, but that problem is secondary. The solution for most companies wanting to do echo cancelling on a PC is to use their own hardware for the sampling so that they can be in sync with the same clock. BTW, MS used to have an R&D project on PC based echo cancelling. They had no further description and no publications and I can't find the site anymore, so I don't know if it went anywhere. Now, a solution that ran the speech synthesis on the speaker output and the mic input could be used to correlate the two and prevent the 'exploit', but that is not echo cancellation. Code Master

    --
    The Code Master
  70. What is the Vista Equivlent by VEGETA_GT · · Score: 1

    Of rm -rf /

    I have a web site to make :P

    1. Re:What is the Vista Equivlent by Viceroy+Potatohead · · Score: 2, Funny

      Nobody's really sure, but it happens with surprising regularity.

  71. Startup Sound by EricJ2190 · · Score: 5, Interesting

    Now I see why Microsoft doesn't want you to change the Vista startup sound.

  72. Favorite exploit EVAR by idontgno · · Score: 0, Troll

    "My name is Leenus Torvalts and I pronounce Leenux 'echo wy pipe format c colon slash you'"

    --
    Welcome to the Panopticon. Used to be a prison, now it's your home.
  73. Prior art by hweimer · · Score: 5, Funny

    Time to quote a usenet classic:

    Last year, out in California, at a PC users group, there was a demo of
    smart speech recognition software.

    Before the demonstrator could begin his demo, a voice called out from the
    audience:

    "Format c, return."
    "Yes, return."

    Damned short demo, it was.

    --
    OS Reviews: Free and Open Source Software
  74. You'll know your company is now a botnet... by sprior · · Score: 4, Funny

    When your machine room starts doing a gregorian chant...

  75. More difficult but better... by Mateo_LeFou · · Score: 1

    Am I the only one who thought "Nam-shub of Enki" when I read this?

    --
    My turnips listen for the soft cry of your love
    1. Re:More difficult but better... by wirelessbuzzers · · Score: 3, Funny

      Am I the only one who thought "Nam-shub of Enki" when I read this?

      Yes.

      --
      I hereby place the above post in the public domain.
  76. Filtering? by phorm · · Score: 2, Informative

    No, actually it isn't really agendized.

    Ever used a program such as skype or other voice-chat software? Notice when you have speakers and microphone on, you generally don't hear your voice constantly repeating into echoes (if echo-cancel is on, of course). Notice that you don't with the speakerphone on your cell either? That's because the software/hardware is smart enough to take the audio output and subtract/prevent it from entering the audio input (avoiding feedback loops etc). If used properly with voice-recognition software, it would defeat programs on a webpage from sending output to be re-picked up from your input system. Since MS assumedly has control over the audio subsystem of the operating system, it should be able to snag the master combined output and filter it in this way.

    Now that doesn't preclude some annoying twit from walking by and telling your computer to do things it shouldn't. However, that issue could be prevented by engraining an element of "speaker recognition" (the person speaking, not the ones on your computer) to the machine. Further, it could require a user-defined prefix or suffix to the command, such as "Computer, earl grey tea, hot!" or "Open the doors, Hal!"

  77. Re:I'm waiting for the audio exploit that responds by Anonymous Coward · · Score: 0

    Eh. NeXTstep circa 93, had a product name "Simon Says" that did voice commands. Each commands started by "Simon":

    "Simon Says
    Lavin: I happen to be an amateur expert on the phenomenon of computer-to-human voice communication: I have an '84 Le Baron convertible. It was a luxury auto during that brief shining moment when cars talked to humans. It tells me, due to a faulty sensor, that my washer fluid is low every time I turn on the car. It also tells me, due to a faulty operator, that my seat belt is unfastened and that I have failed to turn my lights off.

    On the surface, machines talking to operators have little to do with Simon Says, in which the user talks to the computer. Still, first-time visitors to both my computer and my car are always amused with this spoken communication. A steady diet of either, however, does not even remotely measure up to initial expectations.

    Within ten minutes of using Simon Says to voice-control my NeXT, I thought it was the coolest thing ever invented. Within an hour, I had discovered that it was also the first decent macro generator for the platform, and I was even happier. Within three hours, I had an axe through my face from aggravated office neighbors not clued into the joys of loud, mono-tonous voice commands.

    But where the talking car concept was totally discredited as a human-interface solution, dying a quick and merciful death, Simon Says is in fact genuinely useful over the long haul.

    You start by training Simon with the words you want it to recognize. This involves many repetitions of these words. Since inflection matters in Simon's voice recognition and remembering exactly how you said, "Show Ruler," is next to impossible, you need to develop a consistent style to make the thing work.

    Then you can choose which apps you want to have under some voice control. If you are like the rest of us, you'll go hog wild, training everything from PrintManager to BlastApp and all commands from "Insert Soft Hyphen" to "Make Spline."

    If you do this, though, you'll quickly find out why applications have menus. A better strategy is to limit the number of trained words to a manageable list covering only key commands in key apps. Once you get the right mix, you'll find that using your voice is truly like having a third hand.

    Amazingly enough, Simon also has a powerful macro generator built in. You can construct voice-command macros that will perform keystrokes or primitive mouse events, paste text, run UNIX commands, play a sound, send mail, or do a combination of the above. The resulting macros are so powerful that I wanted them available with keyboard equivalents in addition to voice control.

    Like any 1.0 product I like, there are a large number of features I would add or fix. Since voice is a whole new means of interfacing with my computer, deficiencies are magnified. But none of this takes away from the fact that the voice-recognition engine is a thing of marvel. There's no doubt that I'll keep using Simon, though that should come as no surprise. I still drive the Le Baron."

    Holy sh!t, am I that old ?

  78. Way to go, Slashdot by eck011219 · · Score: 1, Troll

    Yet another batch of truly astounding BS about Vista topped by a misleading headline. This is not a Vista-specific defect, this is a characteristic of voice commands (but granted, a very valid reason why it's an unreliable way to try to run a computer under many circumstances).

    And more to the point, if I have a Mac with voice recognition software installed, is it somehow NOT prone to this?

    There are several things wrong with Vista, but this isn't one of them. Are we all so hungry for security-related dirt about Vista that THIS is what constitutes "news for nerds" or "stuff that matters"? Good gravy.

    --
    It is pitch black. You are likely to be eaten by a grue.
  79. Loop-de-loop by Anonymous Coward · · Score: 0

    I wonder if you could setup a sound to create a loop? Such as "play sound file 'play_sound_file.wav".

  80. easier solution: by Valar · · Score: 1

    There have been a multitude of posts suggesting 'easy solutions' to this problem (most of which involve doing some sort of automagical signal processing on the signal received based on the audio being played out). Most of these suggestions just plain won't work. Why? Because every set of speakers is different, every room is different. Therefore, the signal will be mangled in some unknown (to the computer) way, before being received again.

    The EASY solution? A voice password required before every voice command. The user sets a short phrase that they prefix voice commands with. Any commands lacking that prefix are ignored. Yes, it makes using voice slightly more inconvenient. On the up side, you can tell users they are giving their computer a name. Instead of 'open internet explorer', it is 'arglebargle, open internet explorer'.

  81. Hmm.... by Opportunist · · Score: 1

    Considering the amount of swearing and other drivel that will be spoken in the vicinity of a Vista-infected computer, I'd think that turning off the mike is generally a very good idea. Not only because of voice commands, but also 'cause someone might just contact you through your IM and the first thing he hears is you cursing at him.

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  82. Speech Researcher Here Confirms It by virtigex · · Score: 3, Interesting

    I have worked on both at Apple on PlainTalk and at MS Research on speech. When I was at Apple (around 1996) I poked my head into a co-worker's office who was testing PlainTalk and said loudly "Computer Shut Down". His computer then started shutting down. This "exploit" has been on the Mac since 1996 and nobody seems to have complained about it. I don't think it's a big deal.

  83. Holy Fuck! by Cervantes · · Score: 1

    Holy fuck... if you tell a computer to obey voice commands, IT DOES!!!
    News at 11.

    Seriously, what's next, breaking news about how you can record commands to tape and play them on the stereo as a "hackers remote exploit"?

    Yes, I'll agree, it would be nice if voice rec software would filter out itself. But that doesn't seem to be mainstream yet. So just do what everyone does... and turn off voice rec.

    --
    If I knew the wedgies I gave you back in 6th grade would have resulted in this . . . I might have taken a moments pause.
  84. rm -rf / by MadnessASAP · · Score: 1

    I can't wait to walk into computer labs at school and yell out "rm -rf /" and then walk away whistling.

    --
    I may agree with what you say, but I will defend to the death your right to face the consequences of saying it.
    1. Re:rm -rf / by Kalriath · · Score: 1

      I can't wait to walk into computer labs at school and yell out "rm -rf /" and then walk away whistling. Yes, I'm sure the sudden appearance of "'rm' is not recognised as an internal or external command, operable program, or batch file" on everyone's screen will be SO funny, not.
      --
      For a site about things like basic rights, Slashdot users sure do like to censor "dissent".
    2. Re:rm -rf / by MadnessASAP · · Score: 1

      fine "deltree c:" then. Happy now

      --
      I may agree with what you say, but I will defend to the death your right to face the consequences of saying it.
    3. Re:rm -rf / by Kalriath · · Score: 1

      fine "deltree c:" then. Happy now "'deltree' is not recognized as an internal or external command, operable program or batch file." Thanks Windows.
      --
      For a site about things like basic rights, Slashdot users sure do like to censor "dissent".
    4. Re:rm -rf / by MadnessASAP · · Score: 1

      Alright the for those of you not cool enough to be using 95/98 "rmdir /s C:"

      --
      I may agree with what you say, but I will defend to the death your right to face the consequences of saying it.
    5. Re:rm -rf / by Kalriath · · Score: 1

      "C:, are you sure?" Since noone will be entirely sure what that's supposed to MEAN, I'm sure everyone will say "no" or close the Command Prompt.

      --
      For a site about things like basic rights, Slashdot users sure do like to censor "dissent".
  85. bye windows! by magicrobotmonkey · · Score: 1
  86. voice control feedback by Anonymous Coward · · Score: 0

    Bob: "Bob go jump off a bridge"
    Bob: "Who said that ?"
    Bob: "I said that. Now jump!"
    Bob: "Ok.. Aaaaaaaagh!"


    I think we've found the reason why Silent Bob is silent!

  87. Push to talk by tepples · · Score: 1

    Computer, stop playback.

    Your posed denial of service: a button or foot pedal that mutes audio playback when depressed and unmutes audio playback when released. I seem to remember that amateur radio, CB radio, and mobile phones with the "nationwide walkie-talkie" feature have something similar.

  88. There's An Even Simpler Solution by sweatyboatman · · Score: 2, Interesting

    If the computer thinks you're saying a command, it should disable output to the speakers. If I am talking to my computer then it should stop making its own noises. Otherwise, that's just rude.

    --
    It breaks my pluginses, my precious!
    1. Re:There's An Even Simpler Solution by r3m0t · · Score: 1

      "If the computer thinks you're saying a command, it should disable output to the speakers. If I am talking to my computer then it should stop making its own noises. Otherwise, that's just rude."

      The computer hasn't realised whether you are commanding it until about a second after you stop speaking. (Your silence is a trigger to process whatever has just been recorded.)

  89. Saying it is unfixable is a copout by mattr · · Score: 2, Interesting

    Detection of whether a given sound is what was just emitted from the speaker may be very difficult, but it is relatively easy in terms of timing. So long as the system knows how much lag time is present in the system, it should be possible to disable detection of all sound that is being played at the same time (i.e. basically turn off the mic then). Nobody expects voice recognition to work when music or other sounds are playing, and the system, whether Vista or OS X, ought to be able to disable voice recognition instantaneously when sound output is generated.

    The problem of course is that the computer next to you might suffer from the exploit since it doesn't know what sound your computer is generating, though this might be diminished by subtracting other sound to some extent via sidepointing mics or even better by just refusing to do dangerous commands like format or delete via voice recognition in the first place. There are gray areas that probably make total safety impossible but some common sense things including disabling all recognition during sound generation from explorer and wmp sound like a good place to start.

    1. Re:Saying it is unfixable is a copout by mariushm · · Score: 1

      Right, So.. let's pretend I'm talking on Skype with someone while having the FM tuner running at low volume in the background..News starts on radio, I'm talking, my friend is talking .. what happens now ? There are 3 voices in total. According to your solution, Vista should mute the tuner and my friend on Skype because they may issue commands to my computer. Stupid.. This solution shouldn't have been implemented as long as the recognition software can not make a difference between MY voice and other voices/music.

    2. Re:Saying it is unfixable is a copout by mattr · · Score: 1

      No, that's not what I am saying at all. I didn't say it should mute anything.I am saying Vista should disable voice recognition when specified dangerous applications are outputting sound.

      IE would be one such app. It would be more secure to do so when Skype is outputting sound too but that's up to you maybe. Heck someone could hijack your FM band and tell the entire office's Vista computers to delete their files I suppose (assuming each person is listening only to their computer and not to a centralized radio).

      All I'm saying is that the voice recognition program should just reset itself when the OS is making other sounds, with the ability to specify only certain apps to ignore. As for the tuner, it still might trigger the recognition if it is playing from the same machine but that's up to you.

      The OS can know *when* it outputs sound and *what* application is doing it (in collusion with the sound server on linux I suppose). Differentiation between voices is possible especially in terms of positioning sound sources but this may get expensive or require a cpu core just for that.

      Voice recognition I think really has limited use but in particular it can be vital to handicapped users. They are not going to be running an FM tuner on their computer or a nearby radio if it interferes with their commands.

      Now I don't know how voice recognition works on Vista, though I've tried it on OSX. I find it is quite difficult to actually get the round recognition window to disappear and that was quite annoying and should be fixed, but you can indeed pick whether you want an app to be recognized (it shows a dictionary per app IIRC). If Vista does not such thing and in fact allows any website to hijack a running voice recognition session that is indeed a major vulnerability and if Microsoft continues to blow it off, they will be contributing to the danger with potential liability.

    3. Re:Saying it is unfixable is a copout by mariushm · · Score: 1

      My Leadtek TV/FM tuner has this CD cable that connects it to a CD in connector on a motherboard. It was my choice to connect it to CD because I could as well connect it to AUX in, Modem In or any stereo input connector that my soundcard supports.

      I don't think Windows is able to detect what the software tuner supports because itself it does not generate any sound.It just sends a command to the tuner that tells him to go to a certain frequency and start output sound on it's audio out cable. Where that cable goes, it doesn't care (you actually have to configure the player from the configuration - Volume control CD, Recording CD ).

      Just as if you would insert an Audio CD on your unit and press the Play button on it, Windows ( I don't think ) can't detect that something plays. There is no application telling Windows that it's generating a waveform on CD input. In the case of the tuner Windows can only detect that the application is running or query the tuner through IC2 to see what it uses (but not all tuners use IC2 for configuration). PLus, several times I've been able to end the Winfast FM process using the Task Manager and not resetting the tuner, thus having radio without any application running. It gets stopped when Winfast FM process is started again.

    4. Re:Saying it is unfixable is a copout by mattr · · Score: 1

      I see, thank you for the details which are quite interesting. Certainly if the OS is completely unaware that sound is being generated by a given application then it won't know to temporarily disable voice recognition at that instant. But something tells me that we will see more of Vista branching out to monitor semi-external devices for rights monitoring. Also if OS commands are used to play the sound then it would know something is being played.

      At any rate I think IE is the most dangerous vector due to ActiveX and automatically playing website sounds. It seems to me that if the PC is already owned the game is off, and it is too valuable as a zombie to visibly damage. Most people use their eyes most to deal with computers and audible response is mainly for entertainment and some GUI feedback, and I think this (and the overwhelming number of vulnerabilities in this area) is why MS failed to examine sound I/O as an attack vector. Though I would if I had 6 years and billions of dollars to do it...

  90. The value of being a packrat... by Viceroy+Potatohead · · Score: 1

    I knew I kept the Cop-killer album around for a reason.

  91. Joke fails. Accent matters. by tepples · · Score: 1

    For Matt See

    <sarcasm>Funny.</sarcasm> "Format" is pronounced FAW-maet,[1] accented on the first syllable, while "for Matt" is fuh-MAET, accented on the second. Any speech recognizer that can handle Mandarin or Cantonese can handle accent in English.

    Reice Tart

    I pronounced reice like "rice", which doesn't produce the "restart" you were going for. You get RAH-ees-t'haht, not rih-STAHT. If you're going to make an Anguish Languish joke, don't make it sofa king Wii Todd did.

    [1] Ad-hoc respelling is used instead of International Phonetic Alphabet because Slashdot is configured to delete characters not on its whitelist for English orthography.

    1. Re:Joke fails. Accent matters. by Anonymous Coward · · Score: 0

      Parent confirms: IPA pushers are aspie fucktards. Go back to Uncyclopedia.

  92. Next Mac Ad is even better by jgc7 · · Score: 5, Funny

    PC: Hi I'm a PC
    Mac: and I'm a Mac
    PC: I have a cool new feature called voice control.
    Mac: That is stupid. I have the Time-Machine which let's you recover old documents. Let's say you accidently delete the documents folder
    PC: Okay
    Mac: To get you documents back, all you have to do is slide the time machine back one minute.
    PC: Sounds cool, but cant you just get the documents out of the trash?
    Mac: Yes, but it works even if you accidentally empty the recycle bin

    --
    70% of statistics are made up.
    1. Re:Next Mac Ad is even better by ickoonite · · Score: 1

      Took me a while.

      Very good.

      iqu :D

    2. Re:Next Mac Ad is even better by jacksonj04 · · Score: 1

      Vista's got it, but admittedly not as well integrated as Time Machine. It's part of the new Volume Shadow goodies. Right click, "Restore previous versions".

      --
      How many people can read hex if only you and dead people can read hex?
    3. Re:Next Mac Ad is even better by curunir · · Score: 4, Funny

      Better yet, the next Mac ad could make light of this exploit.

      PC: Hi, I'm a PC.
      Mac: and I'm a Mac.
      PC: Now that I run vista, I can accept voice commands!
      Mac: Wow, that sounds cool. But what if someone tells you to punch yourself in the face?
                PC punches self in the face and nose begins to bleed
      PC: Ouch, that hurt!
      Mac: I'm sorry PC, I didn't realize that just telling you to do something like "poke yourself in the eye"...
              PC pokes finger into his eye
      Mac: ...or "begin sneezing incesantly"...
              PC starts to uncontrollably sneeze, the blood from his nose splattering everywhere
      Mac: ...would make you actually do it.
      PC: groan I'm sorry if I splattered on you.
      Mac: That's ok PC, I'm pretty immune to viruses, so I think I'll be alright.

      --
      "Don't blame me, I voted for Kodos!"
    4. Re:Next Mac Ad is even better by suckmysav · · Score: 1

      Ummm, I might be wrong because I have little interest in Windows, but I was of the understanding that the Volume Shadow service is a server side function that allows network clients to roll back to previous versions of data located on a network share. ie it doesn't affect local files at all.

      --
      "You can't fight in here, this is the war room!"
    5. Re:Next Mac Ad is even better by suv4x4 · · Score: 1

      I was of the understanding that the Volume Shadow service is a server side function that allows network clients to roll back to previous versions of data located on a network share. ie it doesn't affect local files at all.

      What can I tell you.. If you run it on the server, it affects the files on the server, if you run it locally, it affects the local files...

    6. Re:Next Mac Ad is even better by TheNetAvenger · · Score: 1

      I might be wrong

      And this is the only thing about your post that is correct, you are wrong...

    7. Re:Next Mac Ad is even better by r3m0t · · Score: 1

      Wow, that's amazing! Now I just need to buy Enterprise or Ultimate (almost 2x more expensive in the UK) to get that amazing functionality which every Mac will have in a few months!

      Microsoft seems to advertise all of its features together without indicating that only the most expensive version actually has this stuff.

    8. Re:Next Mac Ad is even better by Richard+Steiner · · Score: 1

      Serious question:

      How do the voice control and dictation features in Windows Vista compare to those that came standard with OS/2 Warp 4 back in 1996?

      --
      Mainframe/UNIX Bit Twiddler and long time Windows/Linux Hobbyist.
      The Theorem Theorem: If If, Then Then.
    9. Re:Next Mac Ad is even better by RzUpAnmsCwrds · · Score: 1

      Nice try, but Vista can already do that. It's called "previous versions".

      I'm sure in 6 months, someone will claim that Microsoft ripped off time machine. Just like they "ripped off" Dashboard (HINT: Konfabulator did pretty much the EXACT SAME THING before Dashboard did, and Stardock DesktopX did it before Konfabulator).

    10. Re:Next Mac Ad is even better by jgc7 · · Score: 1

      I think you missed the joke. Read it again, and think about what would happen if someone had voice control turned on and the mac ad came on a TV in the same room as the computer.

      --
      70% of statistics are made up.
  93. Compare to citizens band radio by tepples · · Score: 1

    A good way to fix this would be to make the user hold down a button or buttons (like maybe WinKey+Space or both mouse buttons). Then it doesn't work without you meaning to put in a command. Kind of defeats the purpose of voice command though doesn't it? :) Then why does CB radio use the exact same method (button mutes incoming sound and turns on mic; releasing button unmutes incoming sound and turns off mic) ? If lack of keyboard ability is a problem, there are other switch designs, such as a foot pedal or a large "plunger" button from game shows.
    1. Re:Compare to citizens band radio by alshithead · · Score: 1

      "Then why does CB radio use the exact same method (button mutes incoming sound and turns on mic; releasing button unmutes incoming sound and turns off mic) ? If lack of keyboard ability is a problem, there are other switch designs, such as a foot pedal or a large "plunger" button from game shows."

      Respectfully, I think you missed my point or I didn't do a good job of making it...

      What about someone completely paralyzed except for the ability to speak? Hitting a button or using a foot pedal will be beyond their abilities. Complete voice control is a fantastic goal for everyone, not just those who are physically incapable of using manual controls.

      --
      I reserve the right to think for myself. Others' opinions are optional. Puppy on lap = typos...not illiteracy.
  94. Re:I'm waiting for the audio exploit that responds by rb4havoc · · Score: 1

    Actually, OS X is set up so you have a prerequisite phrase that you have to say before it will initiate any voice driven commands, and you can change it to whatever, so you can actually get your computer to respond to "Simon Says"!

    I did it at one point just because I could, then turned it off because it was just something to play around with.

    --
    "There are 10 types of people in this world--Those that understand binary, and those that do not..."
  95. Re:The Real Agenda of this Article? - OS X solved by Anonymous Coward · · Score: 0

    That is why OS X has a "keyword" setting in the voice recognition system, using the "keyword" unlocks the voice commands and processes the following words as commands. By picking an unusual "keyword" you can insure the system does not run random commands based on surrounding noise, or a web page audio, like Vista deos.

    BTW, the default option in OS X speech control only listens when a selected key is pressed, default is esc key.

    The second option is "listen continously with keyword". The default word is "computer" but it is user selectable.

    Apple actually thought about the usage of this before implementing the voice commands feature.

  96. But it IS accepting commands by microphone... by Anonymous Coward · · Score: 0

    It just so happens that the Microphone is picking up commands played by the computer's own speakers.

  97. What about removal? by A_Non_Moose · · Score: 1

    IME with os9 and osx the speech could be disabled easily in 9 and fairly easily in X,
    and was usually one of the first things I'd disable.

    I don't want my computer talking to me, nor do I want to talk to it (at it, maybe, but not to).

    Same functionality is built into XP, and I was only aware of it because of Xplite.

    Just curious as to ways to kill this off w/o special tools you have to pay for.

    In a former job, killing off speech, outlook, and some of the flotsam and jetsam of a fresh install
    helped 2k/Xp run for months w/o reboots in a production environment.

    I guesss that's why the big "meh" over Vista...more "stuff" we don't need/want/might not use
    piled even higher than before and no way to remove/disable it (that I'm aware of).

    --
    Have you read the moderator guidelines? Well, have you, PUNK? (and I want a Karma: Gnarly option)
    1. Re:What about removal? by Anonymous Coward · · Score: 0

      Seeing as it's Off by default and needs to be activated and configured before it can be used, what is there to disable?

  98. Voice Recognition in Vista - Gimmick by Anonymous Coward · · Score: 0

    Voice stuff, just a gimmick. Like someone with working hands would need voice recognition. Learn to type people. Voice recognition will never replace the keyboard. I think the voice recognition in Windows XP is better. Sure, you have to train it, but then it only understands YOU which is just good security in my opinion. I can just imagine someone's kid saying 'format drive c' and then - POOF! Problem solved!

  99. Alert! Alert! Sense of humour failure! by Anonymous Coward · · Score: 0

    It was a funny joke. Your comment was completely, utterly, totally pointless.

    Yes, accent matters. However, in a british accent for example:
    1 - his joke works pretty well.
    2 - "Wii Todd did" sounds *nothing* like retarded, assuming that's what you meant.

  100. Attention ! by smoker2 · · Score: 1

    If you experience problems when using our flagship software, turn off computer and don't use it.
    Problem solved.
    This message brought to you by your ever richer Microsoft overlords.

  101. Thanks for the inspiration! by Em+Adespoton · · Score: 3, Funny

    PC: Hi I'm a PC
    Mac: and I'm a Mac
    PC: I have a cool new feature called voice control.
    Mac: That is stupid. I've had secure voice control for years
    PC: Yes, but with your primitive voice control, the statements had to be in the right format, see?
    Mac: OK, but that's why we call it secure. The user has to select a keyword that will trigger the commands.
    PC: ... Mac: I hope he has his XP install CD handy....

  102. Or... by Greyfox · · Score: 3, Funny

    PC: Hi! I'm a PC!
    Mac: And I'm a Mac!
    PC: I have a cool new feature called Voice Control!
    Mac: FORMAT C!

    --

    I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

  103. duh by oohshiny · · Score: 1

    This is like one of the most obvious exploits imaginable! How could they possibly have missed that?

  104. No, no, "men of genius" by Kozz · · Score: 1

    My understanding is that after 9/11, the new Bud commercials changed "Real American Heroes" to "Real Men of Genius" to be a little more sensitive and not dilute the meaning of "hero". But this was a rumor I heard. Can anyone verify?

    --
    I only post comments when someone on the internet is wrong.
    1. Re:No, no, "men of genius" by ari_j · · Score: 1

      I do remember hearing the old ads, and Wikipedia agrees: Real Men of Genius. I actually like both titles equally, and I don't mind them changing the ads (whereas most things that are suddenly different in this post-9/11 world just get on my nerves).

  105. Somebody do this to Bill by Master+of+Transhuman · · Score: 1

    Program HIS machine to tell him what a fucking asshole he is and how stupid and incompetent his entire company is.

    --
    Richard Steven Hack - This sig is TOO GODDAMN SHORT TO DO ANYTHING USEFUL WITH! MORONS!
  106. Article Correction by scire9 · · Score: 1

    From TFA:
    "suggested as workarounds that... they should close the Windows Media Player, turn off speech recognition, and restart their computer"

    Was meant to read:
    "...and shutdown their computer, maybe even unplug it just to be sure"


    And come on fellas, this is not an bug. It's a warming gift to our beloved exploit writers, now they don't even have to lift a finger to run exploits!

  107. How-to for Amiga Users by Anonymous Coward · · Score: 0

    May also work on AmigaOS 1.2

    ;-)

  108. Easy prevention by TGoddard · · Score: 1

    Preventing this is really easy. Make the user record a word only they will know and only listen for commands after hearing the word or after a key is pressed. This would protect both against malicious web sites and against complete strangers yelling commands.

  109. Old Technology by ThomS · · Score: 1

    This Is The Voice Of The Mysterons...

    1. Re:Old Technology by It'sYerMam · · Score: 1

      Had I mod points, I think I'd have to mod you up just for reminding me of that intro sequence.

      --
      im in ur .sig, writin ur memes.
  110. Re:We've been waiting for this (and joking about i by jernejk · · Score: 1

    Well, not much would have happend as Vista just says:

    Access Denied as you do not have sufficient privileges.
    You have to invoke this utility running in elevated mode.

    I must say UAC is one of the best features of Vista...

  111. Ooh, that gives me an idea... by Gubbe · · Score: 1

    One of the reasons speech recognition hasn't taken off is simply that it's incredibly embarrassing to be caught speaking to a machine. It also distracts others and talking with a clear voice is actually quite tiring in the long run.

    With webcams becoming more and more ubiquitous, why not start work on lip-reading algorithms? Talk to the computer, but don't make a sound. You don't disturb anyone and you don't have to stress your vocal cords. Sure, it might be more difficult and more processor intensive to make it work, but since it's doable by (some, trained) people, it should eventually be doable by computers.

  112. oh yeah! by QAChaos · · Score: 0

    vista, with all of it's bugs, is porno for the QA tester in me...

  113. Long distance hacking by bhamlin · · Score: 1

    So, what's next? Are they going to warn us that people on the other side of the room can also control your pc if they talk loud enough? (Would that even count as a remote exploit anyway?)

  114. not quite by Anonymous Coward · · Score: 0

    There was an episode where it was just Picard and the ship. Picard issued more than a few voice commands.

  115. AKA "Real Men of Genius" (Wikipedia) by Anonymous Coward · · Score: 0
    Hahahaha, those commericals are a riot. Even funnier is that a Wikipedia entry is available, listing all of the spots:

    http://en.wikipedia.org/wiki/Real_Men_of_Genius

    Here's a sample:

    Mr. Restroom Toilet Paper Re-Filler
    Mr. Wedding Band Guitar Player
    Mr. All You Can Eat Buffet Inventor
    Mr. Nudist Colony Activity Coordinator
    Mr. Camouflage Suit Maker
    Mr. Male Fur Coat Wearer
    Mr. Professional Figure Skater
    Mr. Edible Underwear Maker
    Mr. Company Computer Guy
    Mr. Horsedrawn Carriage Driver
    Mr. Golf Ball Washer Inventor
    Mr. Hockey Organ Player
    Mr. Experimental Medications Tester
    Mr. Professional Movie Extra Guy
    Mr. Renaissance Fair Performer
    Mr. Supermarket Free Sample Guy
    The Wikipedia entry also lists websites where you can hear them at the bottom of the page's entry.
    1. Re:AKA "Real Men of Genius" (Wikipedia) by skuzzlebutt · · Score: 1

      I can't find the linkie, but there is another one out there somewhere for Las Vegas visitors (real Las Vegas Heroes?)...like Mr Shiny Shirt Club Guy and Miss I Only Pole Dance to Pay for College

      --
      My debut novel AMITY now available: http://jeremydbrooks.c
  116. Alternate injection route by Jon+Luckey · · Score: 1

    So lets say guy leaves his Vista machine on 24/7 for 'Instant On' internet access.

    Evil dude calls guy when not home. Answering machine picks up, also playing sound outload in guy's house.

    Evil Dude: "FORMAT SEE COLON (pause) YES"

    two chances to initate exploit. Once during the call, and one when guy checks messages.

    --
    -- 3 events that reshaped the world in the 20th century: WW1, WW2, and WWW
    1. Re:Alternate injection route by Kalriath · · Score: 1

      For the eleventy-billionth time, you can't format an in-use volume. Vista would tell you that the volume is in use, and cannot be formatted.

      --
      For a site about things like basic rights, Slashdot users sure do like to censor "dissent".
  117. How is this... by Omeger · · Score: 1

    Defective by Design when it has nothing to do with DRM?

  118. The answer is simple by Anonymous Coward · · Score: 0

    3 words "push to talk" the end.

  119. Didn't we learn by Trogre · · Score: 1

    this lesson back in the '90s? Remember when Apple Macs shipped with Voice Recognition, and everyone figured out that it was a lame-ass fad the novelty of which wore off after ten minutes. (A bit like the Wiimote, I guess).

    Congratulations MS, you're truly the innovators.

    --
    "Nine times out of ten, starting a fire is not the best way to solve the problem." - my wife
  120. Seriously... by webheaded · · Score: 1

    Its pretty obvious this could happen...its voice recognition software. This isn't news so much as an invitation to make fun of Microsoft for the hundreth time. No, this isn't innovative, and no its not really a bug, so much as common sense. If you have a sound file loudly speaking commands over your speakers then naturally that may be picked up by your mic. If you can't figure that out then you probably shouldn't be using the software in the first place.

    I would assume Microsoft has this turned off by default if they aren't completely retarded, because people are stupid, but I honestly wonder about that. By all means though, continue making jokes...but there are probably far worse things we could pick out than this. I still don't see why its being called a bug.

    --
    "Those who would sacrifice essential liberties for a little temporary safety deserve neither liberty nor safety." - BenF
  121. Imitate nature? by i_like_ducks · · Score: 1

    When you want to talk to another person you tend to turn towards them and speak, I expect people would behave the same way with a computer. Why don't hardware developers design computers with stereo mics that's a little apart like human ears, and use a little signal processing circuitry so the computer can work out if it's being spoken to or not? I don't know how feasible that is, but it sounds like a simple thing to do compared to getting software to learn a person's voice characteristics.


    Also why do these app developers think that voice command *necessarily* need to consist of commands in plain English (or any other human language)? I really want some sort of plugin for my media player that'll allow simple commands to be entered by whistling, for example say I want it to play, instead of saying in a flat monotone voice 'computer, play' or something silly, I clap twice (to get the computer's attention), and whistle two notes a major third apart, another command can then be a fourth, a fifth etc (so the user doesn't need to have perfect pitch). That would be far more reliable as command input, the user won't sound like a wally, and easier(?) to implement in software.

    Pls pls any talented developers here take this idea up!
  122. At least I won't... by Mad-cat · · Score: 1

    I'm not bitter that my request for a color printer got denied.
    This way I won't work all day and accidently

    DELETE...a...FILE!

    Welcome to Wally-World.

  123. Obvious Solution by ProfessionalCookie · · Score: 1

    Isn't it kind of obvious to look for sound matching the output signal of the computer and subtract it from the incoming speech recognition signal?

    Like noise canceling headphones.

  124. Re:We've been waiting for this (and joking about i by chochos · · Score: 1

    OK so you have to smoke a joint before entering the room and yelling the commands. Big deal.

  125. How about this solution. by Kaenneth · · Score: 1

    Have the playback system add a 'watermark' to outgoing audio.

    Such as a 'chirp' above human hearing range several times a second, or a low frequency note, below human hearing range (Mythbusters already covered the 'brown' note.)

    MP3 and other audio codecs remove a lot of frequencies from playback; the human ear dosn't notice the differance, but the waveform changes a lot. (for example, if a loud 2000 Hz tone is played at the same time as a quiet 1980 Hz tone you wouldn't notice the quieter tone, because your brain only handles so many signals at once.)

    Combining the two principles, an audio playback system could detect the loudest frequency in a time interval, and pulse neighboring frequencys at a lower amplitude and high speed to transmit extra data, such as a DRM flag, or a 'disreguard voice input' flag.

    Fairly obvious, and should be easy to implement with any understanding on modern audio codecs. with the advantage that one computer wouldn't trigger a second computers voice system, unless this filter was specifically disabled.

  126. lol.. who didn't see this coming? by nanodec · · Score: 1

    Vista is junk. Get a mac and forget about it.

  127. Mac OS X's solution by blacklint · · Score: 1

    Apple seems to have already had a solution to this problem:

    If one were to turn on speech recognition in OS X, the default behavior is to listen only when a key (default is escape) is pressed. The other option is to listen continuously with a keyword (default is Computer, but can be changed to anything) that is either required before every command (default), optional, or 15 or 30 seconds after the last command. One would have to change two different settings to expose OS X to such an exploit as easily as Vista is.

  128. Just for the Record by serutan · · Score: 1

    The blackface/whiteface episode with Frank Gorshin in which the destruct sequence was used was on TVLand just last night, so I knew right off that "1A, 2B, 3C" wasn't quite right. The correct self-destruct sequence for the original Enterprise, according to Wikipedia, is Kirk: "1-1A," Spock: "1-1A-2B," Scott: "1B-2B-3," Kirk: "0-0-0-Destruct-0." The destruct-abort code was "1-2-3-Continuity."

  129. Re:We've been waiting for this (and joking about i by hcr62 · · Score: 1
    I'm smiling as I envision the scenario you've created - pretty funny! Regarding the English, you're making yourself quite clear. Good for you!

    One English grammar thought, regarding the phrase "Me and my friends" - when you think through the sentence, dropping the phrase "and my friends", you will note the awkward pronoun reference "Me have been waiting". To speak correctly you need to make the pronoun fit the voice - 1st person - which means you want to use "I have been waiting". To include your friends, while keeping the voice as 1st person, you would say "My friends and I have been waiting" or "I, along with my friends, have been waiting". If you wish to broaden the voice to the plural, you might say "We have been waiting for this, my friends and I, and"...

    How important is all this? It isn't - as I said, you certainly communicate well. Thanks for the humor!

  130. Like noise-cancelling headphones by GPS+Pilot · · Score: 1

    Couldn't the system simply have a filter that removes the wave signature of what it is outputting before processing input as a command?

    Yes, it could. That's real similar to what a $20 pair of noise-cancelling headphones does.

    --
    That that is is that that that that is not is not.
  131. cool, this means I can go into a computer lab by Anonymous Coward · · Score: 0

    and scream "COMPUTER! RUN FORMAT C: /Y" and shut down the whole place? :)