Slashdot Mirror


Consonants Not Required

billybob2001 writes: "A report at the BBC explains how voice-control of computers can be more successful using grunts and sighs, as "voice recognition programs often failed to accurately capture words". Dr Takeo Igarashi, of Brown University suggests the use of "ahhhh" for skipping tracks on a cd, or adjusting tv volume, but I wonder what the effect would be on pr0n sites? Another suggestion is "uh oh" for undo. Perfect for online banking. Is this going to confuse your system or what?"

139 comments

  1. Undo command by Shafalus · · Score: 5, Funny

    Surely "Ah, shit!" is the obvious choice for an undo command?

    --

    Linux advocates are in a no Win situation

    1. Re:Undo command by JJ · · Score: 2

      I think that would be more appropriate as a full "Mission abort!" or "Disconnect." command.

      --
      So long and thanks for all the fish . . . !!!
    2. Re:Undo command by Anonymous Coward · · Score: 0

      Ok, now suppose you're dictating to your pda or laptop, while sitting on a automated toilet.

    3. Re:Undo command by posmon · · Score: 1

      ug.

      (+1 - insightful)

      --

      update comments set karma=-1, reason='offtopic' where sid=26315

    4. Re:Undo command by FortKnox · · Score: 2

      No... that's the window command for "reboot".

      --
      Good quote, too many chars. Seriously, the slashdot 120 char limit sucks!
    5. Re:Undo command by VivianC · · Score: 3, Funny

      It sounds like they are trying to give control of my computer to the Teletubbies!

      Now my 16 month old will be able to run my machine!

      --
      Viv

      Gmail invites for ip
  2. So many applications.. by heyitsme · · Score: 0

    What would the computer do if you moaned? Show more porn?

    heyitsme

    1. Re:So many applications.. by gazbo · · Score: 1

      Imagine how easy it would be to write code using this?

  3. Great use in showers! by ymgve · · Score: 2, Funny

    Now, whenever you yell you YIEEEE! in a shower because the water is too hot or cold, it will immediatly switch to a more pleasant temperature!

    1. Re:Great use in showers! by HiQ · · Score: 2

      And that would be hotter or colder? That must be one hell of a clever shower to decide on one and the same yell if you mean "too hot" or "too cold". Knowing the state of most household technololgy, when you yell "YIEEEE" (too hot), your shower will undoubtedly give you hotter water, after which you can peel your skin right of you're back.

    2. Re:Great use in showers! by Magumbo · · Score: 2

      Well the best solution to this is to get rid of your giant water heater and replace it with one of the flash heaters with a digital temperature
      control. These are really common in Japan and Hong Kong (surely elsewhere too). They are more economical, give you water heated to your desired temperature almost instantly, and you never run out of perfectly heated water.

    3. Re:Great use in showers! by Anonymous Coward · · Score: 0

      Well, when you yell YIEEE, the shower would of course adjust the temperature towards some predefined temperature, perhaps around 25degC.

  4. Help Desk by well_jung · · Score: 5, Insightful

    Anyone that's worked at a Help Desk should know that Users have been trying this for years.

    --
    Carl G. Jung
    --
    "With one breath, with one flow, You will know Synchronicity" -La Policia
    1. Re:Help Desk by SilentChris · · Score: 2
      Does it handle expletives?

      Can you imagine the Microsoft ad? "Now talk to the computer *the way you've always wanted to*. IntelliSense handles all forms of four letter words..."

  5. Ooooh by Red+Avenger · · Score: 1

    More reasons to talk dirty to my sexy computer. Now if only I could get Carmen Electra's talking back to me...

    1. Re:Ooooh by Anonymous Coward · · Score: 0
      Now if only I could get Carmen Electra's talking back to me

      Carmen Electra has a talking back?

  6. I can see it now... by webcrafter · · Score: 1


    Tarzan producing a buffer overflow. How long before a script kiddie rips its yell into an mp3?

  7. Damn that little gimp by Dolly_Llama · · Score: 0, Flamebait
    Surely this is another one of Hawkings ideas. First space colonization, now he doesn't want to use his little mouth pen..

    aaahh...eerrr..

    --

    Somewhere, something incredible is waiting to be known. -- Carl Sagan

  8. Well no shit. by BiggestPOS · · Score: 2, Funny
    But this isn't what I dream about doing on the bridge of the Enterprise D. Instead of saying "Computer, Tea, Early Grey, Hot" I'd say something like "Oooh, Ahhhh, Grrrr"

    I dont think so.

    --
    What, me worry?
    1. Re:Well no shit. by zephc · · Score: 2

      more like
      ooo-errr! eee! errr aeee! ahhh!
      =]

      --
      "I would say that 99 per cent of what my father has written about his own life is false." - L. Ron Hubbard Jr.
  9. Multiple undo by iforgotmyfirstlogon · · Score: 0, Redundant

    Can I make it undo accidentally saving over my only good copy of something when I say:

    "God ... dammit ... AWWW HELLLLL!!!!"

    - Freed

    --
    "Coffee should be black as hell, strong as death, and sweet as love." -Turkish Proverb
  10. Dangerous, surely? by iainl · · Score: 2

    Surely this could really backfire. I'm just finishing up an important document, perhaps having a significant section of text highlighted as I move paragraphs around.

    "Sorry, I couldn't get that disc you were after today" says a collegue.

    "Ah, shit!". Oops, there goes a bunch of your document. Don't swear, though, or you'll lose it from the undo buffer as well!

    --
    "I Know You Are But What Am I?"
    1. Re:Dangerous, surely? by hoggoth · · Score: 3, Funny

      Be careful with this!
      I can just see it now. You are recounting a traffic accident to a college:

      You: "I rammed a sheriff!"
      Computer: "Executing: rm -(dash)rf"

      --
      - For the complete works of Shakespeare: cat /dev/random (may take some time)
    2. Re:Dangerous, surely? by ThatComputerGuy · · Score: 1

      And what happens when I try to play Run DMC's "Ahhh"?

      --
      XML is like violence. If it doesn't solve the problem, use more.
  11. Know who could be... by Anonymous Coward · · Score: 0
    their spokesperson?


    Mayor Quimby! Who's he based on again? Jimmy Stewart or someone?


    -Ummm.....hhh....errr......ahhh....

    1. Re:Know who could be... by geomcbay · · Score: 2

      The voice is meant to be a generic "Kennedy"/Hyannis accent. But the character is modelled after many politicians, real and imaginary.

  12. Now we'll really sound like monkeys by bayduv1n · · Score: 1

    Can you imagine the office environment... OU, OU, OU, EE, EE, EE!!

    1. Re:Now we'll really sound like monkeys by BLAMM! · · Score: 1

      Not to mention that when you go "OU, OU, OU" and your neighbor goes "EE, EE, EE", both computers execute both commands. And then you've got the company clown who walks down the aisle going "AAAAAAAAAAAAAAAAAAAAA".

  13. Won't work for social reasons by Alrocket · · Score: 1

    Like the article says, it'd be impossible to get people to use that in the office... I certainly wouldn't.

    As well as that, if this did take off, can you imagine the implications for language: grunts, moans and sighs would become ubiquitous in everyday conversation... :)

    Al.

    1. Re:Won't work for social reasons by OnTheWay · · Score: 1

      It's already happening, dUUUUUUde. WhassAHHHHp?

  14. Damn Computer . . . by Red+Avenger · · Score: 1

    When your computer starts taking verbal abuse it should turn itself off. That way you when you rip the bugger out of the wall you won't have to fsck the hard disk once you calm down.

  15. Voice controlling a computer sucks. by Gummbah · · Score: 1

    It just does. Who the hell wants to talk to their computer anyway? What we need is a direct neural interface. Oh yeah baby..

  16. Re:Undo command - another possibility by Ed+Avis · · Score: 5, Insightful

    D'oh!

    --
    -- Ed Avis ed@membled.com
  17. Self Destruct by stinkydog · · Score: 3, Funny

    Just don't say Mua'dib or the computer explodes.

    -He has the weirding way.

    --
    âoeWho knew something as harmless as willful ignorance could end up having real consequences?â
    1. Re:Self Destruct by jacksbn · · Score: 1


      My PC no longer needs the weirding module!

      Echhh...

  18. It's cute, but... by d5w · · Score: 5, Interesting

    The computer can't distinguish words easily, so we'll give you a potentially much smaller vocabulary and see if it does better? Of course it'll do better, whether or not that smaller vocabulary contains consonants.

    What I'd worry about is whether these unarticulated sounds sound more like background noise than articulated speech; if so, then you've made the situation worse by making it harder for the computer to know when you're talking to it.

    On "uh oh": Dragon Dictate (discrete speech recognition from a few years ago) used "oops" for telling the SR system when it made a mistake; it was reasonably easy to distinguish from words that you actually wanted to put into your text with any frequency.

    1. Re:It's cute, but... by dollargonzo · · Score: 2, Interesting

      well, yuo are actually not quite correct on the consonant thing. ever try doing an FFT on some sound, and keeping only the major frequencies? we humans hear consonants, but for example p and b are essentially the same thing. and in the case of say, an S, its sound like noice to the computer, making it harder to distinguish than when an AAA makes one distinct frequency. So, although yuo are correct in saying that a smaller vocabulary would help, not as much as removing cononants.

      --
      BSD is for people who love UNIX. Linux is for those who hate Microsoft.
    2. Re:It's cute, but... by Prion86 · · Score: 1

      not only that, but imagine the havoc when your child decided to watch telletubbies while your trying to do something on your computer..

      --
      "Alot of people don't know what they are doing...and most are pretty good at it." -George Carlin
    3. Re:It's cute, but... by billnapier · · Score: 1

      When making vowel sounds, your vocal tract is essentially an uniform tube. This is really easy for a computer to model (as another poster mentioned, a simple FFT), as opposed to ficatives and plosives, which involves contrictions in the vocal tract that are harder to model (since the effect they have on the sound signal is to not make it complety sinusoidal).

    4. Re:It's cute, but... by d5w · · Score: 1

      Sorry, I don't buy it, if you're arguing that removing consonants makes command discrimination easier. Yes, consonants, especially stops, are hard to identify on their own, out of context, but we're with speech recognition you're not doing an instantaneous reading on a single sound out of context. You're doing an order- (and usually duration-) sensitive match of an entire utterance against one or more patterns. Even if you collaps all the stops, nasals and fricatives into a single "consonant" model, which is incredible overkill, the presence or absence of consonants in the pattern still gives you information.

    5. Re:It's cute, but... by plastik55 · · Score: 3, Interesting
      FFT is exactly the wrong technique for resolving transient or plosive sounds. Wavelets work better. Take the CWT of a person speaking, and you can *see* the shape of all the consonants.

      When people speak, it is the consonants that matter. Ever try listening closely to someone with a pronounced regional accent? The vowels are all jumbled up but the speech is still intelligible. IIRC, people tried to teach gorillas to communicate using different grunts, and gave up in favor of sign language. Reason being that you *can't* string two different vowels together without a consonant in between and have it be intelligible.

      --

      I have a positive modifier on Troll. When I mod someone Troll their karma should go UP!

  19. Just so you know... by tswinzig · · Score: 2

    The letter 'h' is a consonant.

    --

    "And like that ... he's gone."
    1. Re:Just so you know... by billybob2001 · · Score: 1
      I concede that point graciously, although it is, in this case, silent.

      Like the p in bathing.

  20. Actually... by Scoria · · Score: 1

    It could be "Ahhhhhh! Uh-oh!" for some people, seeing as how they've just made their keyboard a little more, eh, dirty.

    --
    Do you like German cars?
  21. Combine this with biofeedback by N3P1u5U17r4 · · Score: 1

    Maybe if they combined this grunt recognition system with some kind of brain-wave recognition system we could have something. We could all revert to being neadrathals.

    --
    You're Just Jealous Because The Voices Are Talking To Me.
    1. Re:Combine this with biofeedback by Anonymous Coward · · Score: 0

      We could all revert to being "neadrathals" huh?
      Ohhh, you mean "neanderthals". Somehow I think you're already there partner. :)

  22. No, this is serious academic research! by Anonymous Coward · · Score: 3, Interesting


    Seriously. I have colleagues that work on this type of thing:

    "Sound Symbolism in Conversational Grunts in English"
    "The Challenge of Non-lexical Speech Sounds"
    "Issues in the Transcription of English Conversational Grunts"

    http://www.sanpo.t.u-tokyo.ac.jp/~nigel/publicatio ns.html

  23. If the speakers are aimed at you by wiredog · · Score: 2
    Then you explode.

    This assumes you are talking about the Muad Dib in the movie, and not the one in the book. All that weirding module stuff isn't in the book. The "weirding way" is basically Super Ninja fighting techniques. Paul was taught by Jessica.

    1. Re:If the speakers are aimed at you by cburley · · Score: 1
      Paul was taught by Jessica.

      Yeah, and Jessica wasn't bad...she was just drawn that way.

      ;-)

      --
      Practice random senselessness and act kind of beautiful.
  24. Tim Allen will love this by wowbagger · · Score: 3, Interesting

    Of course, many have said that the GUI is a "caveman interface" - point and grunt, err, click.

    This really strikes me as the verbal equivelent of Palm's Grafitti - if normal interactions (printing/speaking) is too hard, make a simplified interface (Grafitti/grunting) that isn't.

    I don't know, but I already learned one interface (typing) to make my computer's life easier. Why should I do all the work?

    1. Re:Tim Allen will love this by Anne_Nonymous · · Score: 1

      Perhaps in the future we'll all:

      Type in soundbites (thanks to email)
      Write like small children (thanks to Palm, emoticons)
      Speak like cavemen (thanks to voice recognition)
      Observe like small, hyper monkeys (thanks to television)

      ...and eventually regress to thinking like Neanderthals

    2. Re:Tim Allen will love this by bay43270 · · Score: 2

      I don't know, but I already learned one interface (typing) to make my computer's life easier. Why should I do all the work? Exactly! Wasn't the whole point of voice recognition to make computers interact with humans the same way we interact with each other? Lets be realistic... the reason Palm uses Graffiti is because the keyboard was too small to use... not because it recognizes handwriting so well. Graffiti does not satisfy the goals of handwriting recognition, and this technology does not satisfy the goals of voice recognition.

  25. I know what I want ... by HiQ · · Score: 2, Insightful

    I don't believe in the necessity of a voice operated computer. At the risk of reopening a very old discussion, a good command line will do better in most cases. It takes far less time (for a skilled person) to use a command than to explain the desired action in 'normal' language to a computer. I mean 'rm -r /*' is typed in a lot faster than saying: "Go to the root directory and delete every file, including all subdirectories".

    1. Re:I know what I want ... by rednuhter · · Score: 1

      -fr or -rf all or nothing !

      --
      ERR 411[Max number of witty sigs reached]
    2. Re:I know what I want ... by Anonymous Coward · · Score: 0

      How did this get modded 'insightful', when really it should be 'Funny'?

      Maybe it would be easier to have the speech recog accept, "Wipe my drive, please?"

      Seriously, command lines *are* faster to use ... but it would be nice if *everyone* could use a computer effectively ...

    3. Re:I know what I want ... by Anonymous Coward · · Score: 0

      But is it faster than saying 'Rmh Dasharr Slashstar'?

      Well, yes probably in most cases.

  26. In Related News, Code Sex Virus Released by Myriad · · Score: 2
    In related news, police have closed in on a suspect believed to be responsible for creating the Code Sex virus that crippled thousands of systems across the net last week.

    When asked about the virus the unidentified man responded "It's not my fault! I didn't to it intentionally. All I was doing was surfing my favorite pr0n sites and, well, you know, enjoying myself, when all these windows started popping up! At first I thought it was the usual spam trick - but no, this code just started appearing everywhere. It just sort of created itself... really! You've gotta believe me!"

    The investigation continues.

    --
    "They do not preach that their god will rouse them, a little before the Nuts work loose." Kipling, 'The Sons of Martha'
  27. oooll rite by Insipid+Trunculance · · Score: 1

    Hmmm..so i get a scientific basis/reason for my misprounciation,mangling of grammar,absolutely incorrect spellings and other atrocities.

    In other words i acn now sodomize Queen's english with scientific approval!!

    --
    Wanted : A Signature.
  28. Ooo...eee.. by thewiz · · Score: 2, Funny

    "Ooo eee ooo ah ah, ting tang walla walla, bing bang"

    A line from "The Witch Doctor" by David Seville or voice command to shutdown Windows? Decide for yourself by playing it for your voice recognition software.

    --
    If "disco" means "I learn" in Latin, does "discothèque" mean "I learn technology"?
    1. Re:Ooo...eee.. by wirefarm · · Score: 3, Funny

      "Ooo eee ooo ah ah, ting tang walla walla, bing bang"

      The verbal equivalent of perl?

      Cheers,
      Jim in Tokyo

      --
      -- My Weblog.
  29. mics by zephc · · Score: 1, Troll

    i dont know if its how it is done now, but why isnt there a single-button that you press that would activate the mic to accept voice commands, and maybe a second to input text... its not totally hands free, but if Stephen Hawking could talk, even HE could use it :P

    --
    "I would say that 99 per cent of what my father has written about his own life is false." - L. Ron Hubbard Jr.
    1. Re:mics by Anonymous Coward · · Score: 0

      goddamnit, some one is modding me down just to be a turd... get a life!

  30. singing ditties for commands by peter303 · · Score: 3, Funny

    Its easier to recognize tonal changes than constanants. Its easier for humns to use full words than isolated vowels.

  31. With apologies to St. Paul... by PeterClark · · Score: 0, Troll
    In the same way, the Computer helps us in our weakness. We do not know what we ought to ask for, but the Computer itself intercedes for us with groans that words cannot express.

    (See Romans 8:25-27 for original.)

    Well, I thought it was funny. :)

    :Peter

  32. Not particularly useful... by -dsr- · · Score: 2, Funny

    I spent the last ten minutes with a bad case of the hiccups. What do you think that would have done to my weekly report?

  33. Why?? by Doug+Neal · · Score: 0, Troll

    OK, speech recognition software is cool, but really, is it such a good interface for a computer? Surely a pointing device and a keyboard are the best human interfaces for the traditional GUI. I tried a speech control program once, where you have to say stuff like "up" and "down" to control the mouse pointer. Or you say "mail" to check your email. And I must say, i found it quite infuriating and slow and completely counterintuitive. A mouse is a much better interface.

    The only real practical use I can see for speech recognition is for word processing - believe it or not some people still like to dictate their letters into tapes and give them to a typist, they don't like working with a keyboard. But, controlling your PC with grunts and moans? Come off it! As well as being a really backwards idea for an interface, I know I'd feel damn stupid grunting and moaning at my PC. Especially in public. Wouldn't you? ;P

    1. Re:Why?? by demaria · · Score: 2

      Well for general purpose operating systems like windows, linux, mac this isn't as great. In the current GUI model keyboard and mouse are superior. Perhaps if someone invents a voice controlled GUI (maybe with integrated touch screen or some sort) then you could, but the current GUIs aren't built for voice control.

      There are other applications though. For example, a car radio. Why press the buttons to find radio stations if you could tell the car "tune 95.3". It has applications on a telephone menuing system.

      But don't underestimate dictation software. There are lots of advantages of dictation. It lets you 'type' faster (assuming it's good software and you train it), and people who are disabled or have injury (broken wrist, carpal tunnel) really need it.

    2. Re:Why?? by steevc · · Score: 0

      I believe Jaguar already have a voice activated radio/telephone in their latest models.

      Must be some clever programming to distinguish the driver from the radio sound. What if the presenter says 'and now for something louder'?

      I used a voice recognition system at Texas Instruments in about 1984. It cam on a full length ISA card which slotted neatly into the 40lb 'portable' computer I was using (5" colour CRT). I felt pretty foolish talking to the computer in an office.

      The uh-oh reminds me of the definition of an ohnosecond as the time between making a mistake and realising you'd done it.

      Steve

    3. Re:Why?? by HiQ · · Score: 2

      Hmmm, if you owned radiostation 95.3, all it would take is to buy some advertising time on other radiostations, and just say 'tune 95.3'. Could have some serious fun with that..

  34. Ahhhh by garoush · · Score: 3, Funny

    Now there is a whole new meaning to "Yada, yada, yada, ..."

    --

    Karma stuck at 50? Add 2-5 inches.. err.. 2-5x Karmas Count to your pen1es.. err.. Karma all naturally and private
  35. right on by unformed · · Score: 2

    and they should also have a hammer that beats the shit of the computer whenever you say "Motherfucker!"

    or a little lesser violence with lesser curses. For example "Fucking A!" will just BSOD. Hey the irony itself would be funny...

  36. Coming soon by Anonymous Coward · · Score: 0

    Women are dangerously close to being made irrelevant. First the weepy, whiney car and now this - a control system that *understands* our grunts and growls.

    Uhhh ......... don't bother me I'm tired

    Grrr ......... no I don't want to talk about our relationship

    Mhhh ......... where's my tea?

  37. FindSound didn't find by skinnymofo · · Score: 1

    For a while now, I've been looking for a certain sound from Cartoon Network, but no one seems to have it. Any suggestions?
    The sound I'm looking for is from the Toonami segue to/from commercials, right after the beat stops there is this whistle like sound (I'd imitate it, but something tells me that wouldn't work). I'm looking for that whistle like sound, so if anybody knows where I might find respond to this comment.

    --
    Happiness is like peeing yourself, only you can feel the warmth.
    1. Re:FindSound didn't find by radja · · Score: 1

      1) run wire from TV headphone out to soundcard
      2) record sequence with whistle in it to wav
      3) cut off what you don't need

      No I dont have it, but I used to do this to get certain song-samples.. some 10 years ago..

      //rdj

      --

      No one can understand the truth until he drinks of coffee's frothy goodness.
      --Sheikh Abd-Al-Kadir, 1587
  38. Technology Devolves Humans by scorp1us · · Score: 2, Funny

    After 30,000 years of having good comminucation skills, humans' finally revert to pre-historic communications skills. Their technology is responsible for thier de-cevilization. It seems a computer interface consisting of only grunts and primitive sounds was selected for windowsXP, and as a result the entire human vocabulary has reverted back to pre-historic roots.

    Bill gates said "We are proud to be responsible for the conversion to a much easier language. While XML can organize our data better, we needed a common language for human interaction. Leveraging our power on the desktop, we we able to achive this." When asked about how aliens might perceive our change of language, Gates repsonded "I'm sure that they will appreciate the simplicity more. I mean, who ever liked French and all of it's eligance anyway?"

    Grunt snort grr grr.

    --
    Slashdot's rate-of-post filter: Preventing you from posting too many great ideas at once.
  39. Typing vs. speech by d5w · · Score: 1
    I don't know, but I already learned one interface (typing) to make my computer's life easier. Why should I do all the work?
    This is probably the single biggest problem that large-vocabulary speech recognition had and has in getting adopted, even where it's a good fit: it requires you to learn to use it rather than "just talking". Some people say "I already learned one interface..." Even more have simply forgotten how long it took them to get comfortable with a keyboard and compare the pain of a new interface to the habit of years.

    Any new interface requires some accomodation from the user.

    1. Re:Typing vs. speech by Asic+Eng · · Score: 3, Interesting
      Any new interface requires some accomodation from the user.

      Ok, that sounds fair, but I guess you'd want to have some sort of benefit after you invest your time?

      I just don't see this sort of interface to catch on for standard applications. I mean - imagine you are in an office with 20 people grunting at their computers, the noise they make is just going to be unbearable. That's got to be worse than that annoying guy who's checking his voicemail via speaker phone. *shudder*

      From the article:

      By increasing the pitch of your voice, the scrolling speed increases. When you stop speaking, the scrolling ends.

      Can you imagine sitting next to a guy who uses this, and not have a headache after 10 mins?

  40. Don't Sneeze! by Mac+Nazgul · · Score: 1


    "Wachooooooooo!"

    Computer reboots...

  41. Snowcrash? by Ieshan · · Score: 0

    Has anyone read 'Snowcrash'? One of the characters has a car-plane-automated-mobile that works this way, he has to make odd sounds to make it move. Claims it doesn't interfere with his normal conversation too much, and that he can do it without thinking after a little practice.

  42. Okay for now.. by jimmyCarter · · Score: 1

    This is nothing more than a bridge or a quick fix until full "get your word-on recognition" is in place. Kind of like learning a keyboard until you can talk to your machine...

    --

    -- jimmycarter
  43. .e.., by The+God+Soldier · · Score: 1

    I ..i.. ..a. ..i. i. a .a.u.a. .a. .o .o..u.i.a.e

    .oo. e.e...o.. .i.. .e .a..i.. .i.e ..i.!

  44. Whistle of Command by SEWilco · · Score: 1
    About 20 years ago, a whistling code for robot control was suggested.

    It's pretty easy to detect several frequencies of whistles, so command can be made from sequences of whistling. It was pointed out that high-low sequences would be easiest (rather than combinations of 3 or more tones), as individual people could use what was high-low tones for them rather than trying to train humans to have better pitch control.

    1. Re:Whistle of Command by HiQ · · Score: 2

      I can picture myself working behind my computer, eating cookies (or whatever), and giving the computer a whistled command, and getting up to get of box of tissues to wipe the wet crumbs of my screen. I really don't think it will work...

  45. Oook! by JimPooley · · Score: 2

    Just whatever you do, do NOT take your computer to the monkey house. It'll probably self-destruct!

    --

    "Information wants to be paid"
  46. Time for Tellytubbies! by iapetus · · Score: 2, Funny

    Over the hills and far away, Teletubbies come to hack!

    Eh-oh!

    Uh-ehn! Uh-ehn!

    Time for tubby shutdown...

    Uh-oh...

    --
    ++ Say to Elrond "Hello.".
    Elrond says "No.". Elrond gives you some lunch.
  47. Speakable Items by Voidhobo · · Score: 1
    Apple's OS9 has had an extension called Speakable Items which is fun to play with ("Open file... I said 'Open file!' dangit!") but far from useful.

    It's just the next step in making the usage of a computer more "user friendly" and thereby utterly inefficient. Typing vi kane/rosebud.text is so much faster than double-clicking on the folder kane and then on the file rosebud.text, and by far faster than saying "Show Speakable commands. Open folder Kane. Open folder Kane. Open folder Kane. Finally! Open file Rosebud dot text. Open file Rosebud dot text. Open file Rosebud period text..." Now, if you don't even use real language but only grunts, it becomes even worse. Talking about "Disneyfication"! Or rather, alienation of the work process.

  48. nature sounds? by metalhed77 · · Score: 1

    would playing a recording of ambient jungle sounds (monkeys grunting) could your system be h4x0r3d by the l337 5K|113d /\/\0nK3yz

    --
    Photos.
  49. Bad idea from a linguistic standpoint by dasmegabyte · · Score: 5, Interesting

    Asking people to use another language when dealing with machines -- especially one that's more visceral -- is just asking for trouble. Already computers are seriously affecting the ability of humans to communicate orally, by concentrating the language into short bursts used during chats we lose the particles of sentences that help establish context in speech (yes, there is a reason for "the" and "a"). Besides, here's an oppurtunity to elleviate a lot of the bad habits that make dialectic English so tough to understand for those outside the dialect: set the machines to understand one sort of English, so that everybody has to speak at least that type along with their colloquial speech. Of course, there's always the possibility for eugenic practices with this, so my proposal is this: teach the computer the differences between the 8 vowel sounds used by people in Colorado, where pretty much every vowel approaches the schwa (the schwa being the neutral position for the human vocal system and therefore easiest to pronounce). After a while, people will realise that to be successful at using voice activated systems, they'll need to adjust their inflection, and after a while will adjust it automatically when dealing with people who don't understand them, either.

    But voice activated systems are stupid, anyway...speech is one of the slowest forms of human interaction, and is one of the few we have to actively concentrate on to perform. You know when people say, "Think before you speak?" That's because once you start speaking a large portion of your brain activity is devoted to doing so...it actually becomes harder to think about what to say next. Pressing a button or turning a dial takes practically no thought...which is another reason why a speech written in spontaneous draft still sounds better than one that is spoken aloud. If we convert machines to speach recognition, we're effectively asking people to interact with them in dumber ways. And can you imagine the logic involved with processing a fairly simple statement like "This check in my hand should be processed by you and in return i'd like fifty bucks in tens and ten one dollar bills." Since the command isn't linear, the machine not only has to recognize what each word means, but try and interpret them in queue. And if humans can't construct complicated sentences like the one above -- which any human over the age of about 4 can understand, before that kids can't identify the subject and object in complex sentences -- they'll be inconvenienced by speaking machines. Oh and for a simpler example, try this: "My pin number? 376 uhhhhhh...Forty-two thirteen...aaaaaaaaaaaand...is it six? no. Eight?...oh! oh! sixty eight!" A human can understand that...we'd be annoyed, but we'd get it.

    --
    Hey freaks: now you're ju
    1. Re:Bad idea from a linguistic standpoint by Anonymous Coward · · Score: 0

      Learn how to spell "alleviate" and then you can get all preachy about the English language.

    2. Re:Bad idea from a linguistic standpoint by dasmegabyte · · Score: 2

      Well, despite five years of studies in rhetorical science I can still spell "fuck off."

      --
      Hey freaks: now you're ju
    3. Re:Bad idea from a linguistic standpoint by XNormal · · Score: 2

      "If we convert machines to speach recognition, we're effectively asking people to interact with them in dumber ways."

      Uh huh.

      --
      Stop worrying about the risks of nuclear power and start worrying about the risks of not using nuclear power.
  50. Background noises deleted my HDD! by glebite · · Score: 4, Interesting

    How selective would the speech recognition be? If I was playing musing on that computer, would the computer pick up the tones coming in and start "doing stuff(tm)" on my computer? What about background noises? My friend's Jello Biafra spoken word CDs?

    I won't even go there with my Saturday Morning Cartoon CD - Eep Opp Ork Ah-Ah (This means mail all of my friends a copy of my resume)...

    --
    I donate all spillover Karma to the charity of my choice... Ada was still a babe despite what people may say...
  51. What I would do... by ch-chuck · · Score: 2

    just for the heck of it, is interface the voice synthesis output of one computer to the voice recognition interface of another and start a transfer of a large text file just to see how long it takes and how accurate it is. I might get about 10-20 bps thru phone line.

    If they start standardizing on a vowel command system and people overcome the embarassment of using it, how long before SharperImage starts selling little boxes that make the same sounds at the push of a button, to, you know, make life even better?

    --
    try { do() || do_not(); } catch (JediException err) { yoda(err); }
  52. We need a new language(s) by nickol · · Score: 1

    Yea! Umm! Zhkxw! Fwpfpfuuu!
    We have already mangled the natural language and
    created a bunch of programming languages.
    Now this new effort requires standardization.
    Just imagine the video tape learning new voice features of Windows 2XXX !
    Open file - Off!
    Close file - Buff!
    Save file - Grm!Grm!

    Imagine teachers telling students how to
    properly pronounce the "Set preferences" (PfGfGrrf!)

    Imagine "holy wars" between adherents of
    MicroQuack(tm) and FreeGrunt(GPL).

    That's our future.

    PS. Don't forget international sighs !

  53. Tuki Tuki Bird Must Stay Home. by Anonymous Coward · · Score: 0

    I guess this means my Tuki Tuki bird will have to stay home.
    (Ah Ah eh eh Tu Kee Tu Kee) Meaning: Remove your user ID from network and format your Hard Drive.

  54. This is what lojban is for. by lambsonic · · Score: 1
    Lojban is designed to be used by people in communication with each other, and possibly in the future with computers.


    http://lojban.org/

    Don't wait until Microsoft releases their version.
    --
    # make clean sig
    1. Re:This is what lojban is for. by ectizen · · Score: 1

      .u'e

  55. Just speak in Spanish by Anonymous Coward · · Score: 0

    Spanish has lots of vowels in its pronunciation.
    The president speaks it,too...

  56. It's all phonetics by dodongo · · Score: 1

    What they're not saying here explicitly is that they still haven't come up with a waveform recognition / microphone setup that they can implement in "normal" usage situations, and still have it recognize consonants with voice and without. The voiced plosive "b" and the voiceless plosive "p", for example, just sound too damn alike.

    I think rather than manipulate our computers using "oooh" and "ahh" and "Oh shit!".... perhaps we should just restructure the English language?

  57. Voice controlled TV volume?? by Anonymous Coward · · Score: 0
    The article suggests a TV volume controlled by "TV, sound up, aaaaah".

    Well, what if the TV show itself happens to consist of somebody yelling "aaaaah"? You'll get a feedback loop until your windows shatter.

  58. Wrong. by Haeleth · · Score: 2, Informative

    The letter 'h' is a letter, which is sometimes used to represent the sound [h], sometimes other sounds, and sometimes is silent.

    The sound [h] is usually considered a consonant.

  59. A potential timeline: by Noer · · Score: 5, Funny

    2020: Computers everywhere are controlled by grunts, moans, sighs, and snorts.

    2040: Computers are finally small enough that they're all embedded into our environments, but neural interfaces don't work, so we still grunt and snort into our computers, but it looks like we're just grunting and snorting in general. People use computers exclusively, and never talk to one another; thus, language is lost and we just grunt and snort a lot.

    2060: aliens visit hoping to find intelligent life, but instead find a bunch of snorting, grunting apes. They leave.

    --
    -- "Those who cast the votes decide nothing. Those who count the votes decide everything." -Joseph Stalin
  60. Turn off PBS. by Happy+Monkey · · Score: 2

    You don't want the Teletubbies on if you've got this setup.

    --
    __
    Do ya feel happy-go-lucky, punk?
  61. This is Bad, very bad . . . by lordfetish · · Score: 1

    What if I accidentally leave my computer on while having sex with my girlfriend - it may take my vocal signals as a que to fire up my mpeg porn collection. The rapid and violent removal of my genitals is sure to follow . . .

  62. Won't work in New England by aredubya74 · · Score: 2, Funny
    Dr Takeo Igarashi, of Brown University suggests the use of "ahhhh" for skipping tracks on a cd, or adjusting tv volume

    As a Boston-area resident, I'd like to suggest that this choice of sound wouldn't work for us:

    "Hey paahl, gahhhttah go pahhk my caah." *CD skips 4 tracks*

    You'd figure the guy works for a New England university, he might've picked up on that. How about "y'all" instead?

    --

    RW

  63. Where's the Python foot? by ellem · · Score: 2

    "He wouldn't have written 'ahhhhh,' to skip tracks on his CD player."

    "Maybe he was dictating."

    --
    This .sig is fake but accurate.
  64. Internet surfing dog? by Anonymous Coward · · Score: 0

    Great, now my dogs will want to use my PC too. As if there are not already enough things in the house to fight over.

    Come to think of it, this could be the break through in human animal communication we've all been waiting for. I see a huge new market: Internet Appliance designed for pets.

  65. From the manual - step 1: Logging on. by CProgrammer98 · · Score: 2, Funny

    Pick up the mike and say "Waaaaaazzaaaaaaaaaappp"

    --
    And the people shall be oppressed, every one by another, and every one by his neighbour Isaiah 3:5
  66. Won't work in the South either by T1girl · · Score: 2

    We're well known for stretching every vowel into several syllables. "Well" comes out "way-uhl" and a long "I" sounds like "ah." Every time one referred to oneself, the TV or CD would start skipping around.

    "Way-uhl, Ah doan know wut Ah'm gonna do. Mah CD keeps skippin'. Wut are y'all gonna do?"

    Here we are at the peak of the greatest technological revolution the world has ever known, and this guy wants us to go back to communicating with grunts and moans.

    What would Rain-in-the-Face do?

  67. Tourette's GUI by gelfling · · Score: 2, Funny

    Clicks, wheezes, pops, random onscenities. Sounds like the way I interract with my computer NOW!

  68. stolen idea by cowtamer · · Score: 1

    I think Neal Stephenson (in Snow Crash) had a disabled guy controlling his wheelchair/truck by making grunts and other noises. The idea always made sense to me.

  69. OGG by Anonymous Coward · · Score: 0

    So where's OGG when you need him?

  70. Sheep by Kpechtunx · · Score: 2, Interesting

    Sound kind of like how a farmer controls a sheepdog ... - !K

  71. Witch Doctor by sharkey · · Score: 2

    Great, let's get a roomful of people trying to control their PCs, and it'll sound something like this:

    Ooo, Eee, Ooo, Ahh ahh,
    Ting, Tang, Walla walla bing bang.

    --

    --
    "Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
  72. Where speech is the right thing by d5w · · Score: 1
    But voice activated systems are stupid, anyway...speech is one of the slowest forms of human interaction, and is one of the few we have to actively concentrate on to perform.
    You're faulting speech by comparing full-out, general spoken natural language to much more restricted modes of input, like a button or dial. The spoken equivalents of a button or dial can be as quick and easy as the tactile versions, and with very little practice they become just as automatic. Speech isn't the right input method for everything, but then neither is the keyboard, the mouse, the pen, or the steering wheel. Computer-targeted speech is good, and worth putting some effort into, when:
    • You need to quickly select from a very large, known list of options; you very quickly hit the point where speech is way faster than mouse, dial or stylus.
    • You need to do something with text and a keyboard just isn't feasible for size, convenience or portability reasons. Speech input of text is almost invariably faster than a stylus.
    • Your hands are busy, remote (from the device, though I suppose also if they're remote from you) or incapacitated. "Remote" in this case includes "at the other end of a phone call".
    • You're dealing with with text that's sufficiently uniform that you can speak at full speed, in which case you're going faster than almost everyone types. (And there are contexts where that occurs.)
  73. Fun with Windows Users by geekguy · · Score: 2, Funny

    Steps to mess with your friends (or enemies)

    1)Install this and set it up so that this starts up when windows does.

    2) Set a sound to shutdown Windows

    3) Record that sound and set it to play whenever windows starts or whenever there is an error.

    4)loop the sound output into the input.

    5) sit back and enjoy watching them turn on there computer only for it to grunt and turn off on them.

    *note* Don't know if all of this would be possible but I just had to share this thought

    --
    -- Any comments seen here are not mine, but a mixture of alchohol and lack of sleep.
  74. Undo by Fjord · · Score: 3, Interesting

    Great. I'm almost finished my ultra-long /. post and someone ICQs me.

    "Uh oh"

    On another note, I knew a guy who worked with voice rec software where the delete-word command was "oops". Whenever he would watch another person typing and they would typo, he would instinctively say "oops". I'm guessing it's kind of how my writting went bad went I was using graffiti a lot. You get used to these quirky mannerisms you use to control the machines. Then you end up looking like a dork and annoying the people around you

    --
    -no broken link
  75. No need for this by Arandir · · Score: 2

    There is no need for this. Voice recognition already works. And it works well. And it already works with REAL words. No need to grunt, squeal of burp into your microphone.

    I first used voice recognition software with OS/2 4.0 on a P100 with 16MB. I was amazed at how well it worked. Of course, 16Mb was inadequate for dictation, but even with that puny system I had it trained in half and hour.

    There's a reason that voice recognition hasn't caught on. It's not because it doesn't work. It's because people don't want to talk to their computers. It's embarassing. It's not convenient. It's awkward to say those commands that computers need, like "arrem minus arref slash star".

    --
    A Government Is a Body of People, Usually Notably Ungoverned
  76. Mod parent up by einhverfr · · Score: 1

    I haven't laughed so hard in a long while...

    --

    LedgerSMB: Open source Accounting/ERP
  77. Sex by ZaneMcAuley · · Score: 2, Funny

    And during sex the entire house becomes a party place (lamp on, lamp off, hifi on , hifi off ....) a:D

    --
    ----- Whats wrong with this picture? http://www.revoh.org:1234/whatswrong
  78. Benefits fo speech recognition by einhverfr · · Score: 2

    I think that speach recognition as a computer interface would be very powerful for the following reason:

    In general (yes, there are exceptions), GUI's excel at bringing a greater density of information from the computer to the user, while command line technologies are better at delivering a greatly enhanced level of information density from the user to the computer. I remember trying to go from a command line FTP to WS-FTP and going RIGHT BACK because it made "simple" tasks like downloading a file to a floppy disk but as a different name and making it FAR more complicated.

    The advantage of a speach interface is that theoretically, you have at nearly as much information density going to the computer as you do from the command line, and it does not conflict with the GUI.

    Of course this argument also works for X-term...

    --

    LedgerSMB: Open source Accounting/ERP
    1. Re:Benefits fo speech recognition by Gizzmonic · · Score: 1
      I think that speach recognition as a computer interface would be very powerful for the following reason:

      I don't know if an "Auditory OS" would ever take off for one simple reason...we all make mistakes. can you imagine how hard it would be to dictate a paper to your computer, even if its speech recognition was perfect?

      "Computer networks have assisted...uh no. Computer networks assist...no...fuck, delete all that."

      All of a sudden, you've got a page full of junk and mutterings that you have to go over with the keyboard anyway. What's the point?

      --
      (-1, Raw and Uncut is the only way to read)
  79. Dragon's undo command by MobileC · · Score: 0

    Is like this.

    You can make it whatever you want.

    I settled for "Oops" after trials of "Bugger", "Shit" and "Ah fuck it".

    --

    Fran
    :):):)
    1st 1st Poster of the new Millennium!

  80. Great for Slashdotters by afree87 · · Score: 2, Funny

    You could run the voice recognition system as a vital resource, so when the system crashes, you go "[Zarking] [buggering] [smegging] Windows!" and it installs Linux automatically. Good idea, right?

  81. The problem is the English language... by ChenLing · · Score: 1

    This is not (meant to be) a troll, so please bear with me.

    English is just such a hard language to pronounce consistently. It's not the consonants -- it's the fact that we have to pronounce consonants that are not followed by vowels.
    Think about that for a while. Then say the word 'eighths'. Notice the 'g-h-t-h-s', in which the g and the h are not pronounced, but the t, h, and s are -- as two syllables in fact!
    Even words like 'what', contain that ending consonant that is pronounced, but is very hard to pick up.
    Some people claim that English is a phonetic language, but only barely if it is at all.

    I use Dragon Dictate/Naturally Speaking, and I get around a 95% recognition rate. You have no idea how surprising that is!!!
    That actually is only slightly less than what people can do. Of course we don't need to actually hear every word correctly.

    So what we all need to do is to speak a decent language, maybe Esperanto :) where each word is pronounce consistently from the written word, and each word is distinct and each syllable contains a vowel.
    Of course my vote is for Mandarin Chinese -- each word is exactly one syllable long; there is no pause at the end of words, only sentences; each syllable is either a vowel or a consonant followed by a vowel; words are distiguished by intonations (which is easily picked up by any speech recognition software); and there is no conjugation! (that means there is no be/is/am/are/been/being, no infinitives (to go) to split, no silly grammar like 'Where do you live?' where the sentence goes object-subject-verb, etc...)
    Of course the written language has to go...50,000 distinct characters *grumble grumble*.

    --
    "You have the option of insanity. I do not. And that makes me crazy!" - Brian to Angela, My So-Called Life
  82. Re:Undo command - another possibility by nirvdrum · · Score: 1

    This is far from insightful.

    --
    If there was a "-1 Not Funny", that'd be my most used mod.
  83. You'll be sounding like... by Fredbo · · Score: 1

    The adults on the Peanuts cartoons talking to your computer.

  84. not the unix way by bmeikle · · Score: 1

    ... traditionally, we leave out the vowels

  85. .e.., = Well, by Anonymous Coward · · Score: 0

    "I think that this is a natural way to communicate

    Soon everybody will be talking like this!"

    Cracked without a computer.

    Ethan
    moteSPAM ^ ziplip # com