Consonants Not Required
billybob2001 writes: "A report at the
BBC explains how voice-control of computers can be more successful using grunts and sighs, as "voice recognition programs often failed to accurately capture words". Dr Takeo Igarashi, of Brown University suggests the use of "ahhhh" for skipping tracks on a cd, or adjusting tv volume, but I wonder what the effect would be on pr0n sites? Another suggestion is "uh oh" for undo. Perfect for online banking. Is this going to confuse your system or what?"
Surely "Ah, shit!" is the obvious choice for an undo command?
Linux advocates are in a no Win situation
Now, whenever you yell you YIEEEE! in a shower because the water is too hot or cold, it will immediatly switch to a more pleasant temperature!
Anyone that's worked at a Help Desk should know that Users have been trying this for years.
Carl G. Jung
--
"With one breath, with one flow, You will know Synchronicity" -La Policia
I dont think so.
What, me worry?
Surely this could really backfire. I'm just finishing up an important document, perhaps having a significant section of text highlighted as I move paragraphs around.
"Sorry, I couldn't get that disc you were after today" says a collegue.
"Ah, shit!". Oops, there goes a bunch of your document. Don't swear, though, or you'll lose it from the undo buffer as well!
"I Know You Are But What Am I?"
D'oh!
-- Ed Avis ed@membled.com
Just don't say Mua'dib or the computer explodes.
-He has the weirding way.
âoeWho knew something as harmless as willful ignorance could end up having real consequences?â
The computer can't distinguish words easily, so we'll give you a potentially much smaller vocabulary and see if it does better? Of course it'll do better, whether or not that smaller vocabulary contains consonants.
What I'd worry about is whether these unarticulated sounds sound more like background noise than articulated speech; if so, then you've made the situation worse by making it harder for the computer to know when you're talking to it.
On "uh oh": Dragon Dictate (discrete speech recognition from a few years ago) used "oops" for telling the SR system when it made a mistake; it was reasonably easy to distinguish from words that you actually wanted to put into your text with any frequency.
The letter 'h' is a consonant.
"And like that
Seriously. I have colleagues that work on this type of thing:
"Sound Symbolism in Conversational Grunts in English"
"The Challenge of Non-lexical Speech Sounds"
"Issues in the Transcription of English Conversational Grunts"
http://www.sanpo.t.u-tokyo.ac.jp/~nigel/publicati
This assumes you are talking about the Muad Dib in the movie, and not the one in the book. All that weirding module stuff isn't in the book. The "weirding way" is basically Super Ninja fighting techniques. Paul was taught by Jessica.
Best Slashdot Co
Of course, many have said that the GUI is a "caveman interface" - point and grunt, err, click.
This really strikes me as the verbal equivelent of Palm's Grafitti - if normal interactions (printing/speaking) is too hard, make a simplified interface (Grafitti/grunting) that isn't.
I don't know, but I already learned one interface (typing) to make my computer's life easier. Why should I do all the work?
www.eFax.com are spammers
I don't believe in the necessity of a voice operated computer. At the risk of reopening a very old discussion, a good command line will do better in most cases. It takes far less time (for a skilled person) to use a command than to explain the desired action in 'normal' language to a computer. I mean 'rm -r /*' is typed in a lot faster than saying: "Go to the root directory and delete every file, including all subdirectories".
When asked about the virus the unidentified man responded "It's not my fault! I didn't to it intentionally. All I was doing was surfing my favorite pr0n sites and, well, you know, enjoying myself, when all these windows started popping up! At first I thought it was the usual spam trick - but no, this code just started appearing everywhere. It just sort of created itself... really! You've gotta believe me!"
The investigation continues.
"They do not preach that their god will rouse them, a little before the Nuts work loose." Kipling, 'The Sons of Martha'
"Ooo eee ooo ah ah, ting tang walla walla, bing bang"
A line from "The Witch Doctor" by David Seville or voice command to shutdown Windows? Decide for yourself by playing it for your voice recognition software.
If "disco" means "I learn" in Latin, does "discothèque" mean "I learn technology"?
Its easier to recognize tonal changes than constanants. Its easier for humns to use full words than isolated vowels.
I spent the last ten minutes with a bad case of the hiccups. What do you think that would have done to my weekly report?
Now there is a whole new meaning to "Yada, yada, yada, ..."
Karma stuck at 50? Add 2-5 inches.. err.. 2-5x Karmas Count to your pen1es.. err.. Karma all naturally and private
and they should also have a hammer that beats the shit of the computer whenever you say "Motherfucker!"
or a little lesser violence with lesser curses. For example "Fucking A!" will just BSOD. Hey the irony itself would be funny...
After 30,000 years of having good comminucation skills, humans' finally revert to pre-historic communications skills. Their technology is responsible for thier de-cevilization. It seems a computer interface consisting of only grunts and primitive sounds was selected for windowsXP, and as a result the entire human vocabulary has reverted back to pre-historic roots.
Bill gates said "We are proud to be responsible for the conversion to a much easier language. While XML can organize our data better, we needed a common language for human interaction. Leveraging our power on the desktop, we we able to achive this." When asked about how aliens might perceive our change of language, Gates repsonded "I'm sure that they will appreciate the simplicity more. I mean, who ever liked French and all of it's eligance anyway?"
Grunt snort grr grr.
Slashdot's rate-of-post filter: Preventing you from posting too many great ideas at once.
Well for general purpose operating systems like windows, linux, mac this isn't as great. In the current GUI model keyboard and mouse are superior. Perhaps if someone invents a voice controlled GUI (maybe with integrated touch screen or some sort) then you could, but the current GUIs aren't built for voice control.
There are other applications though. For example, a car radio. Why press the buttons to find radio stations if you could tell the car "tune 95.3". It has applications on a telephone menuing system.
But don't underestimate dictation software. There are lots of advantages of dictation. It lets you 'type' faster (assuming it's good software and you train it), and people who are disabled or have injury (broken wrist, carpal tunnel) really need it.
Just whatever you do, do NOT take your computer to the monkey house. It'll probably self-destruct!
"Information wants to be paid"
Over the hills and far away, Teletubbies come to hack!
Eh-oh!
Uh-ehn! Uh-ehn!
Time for tubby shutdown...
Uh-oh...
++ Say to Elrond "Hello.".
Elrond says "No.". Elrond gives you some lunch.
The voice is meant to be a generic "Kennedy"/Hyannis accent. But the character is modelled after many politicians, real and imaginary.
Asking people to use another language when dealing with machines -- especially one that's more visceral -- is just asking for trouble. Already computers are seriously affecting the ability of humans to communicate orally, by concentrating the language into short bursts used during chats we lose the particles of sentences that help establish context in speech (yes, there is a reason for "the" and "a"). Besides, here's an oppurtunity to elleviate a lot of the bad habits that make dialectic English so tough to understand for those outside the dialect: set the machines to understand one sort of English, so that everybody has to speak at least that type along with their colloquial speech. Of course, there's always the possibility for eugenic practices with this, so my proposal is this: teach the computer the differences between the 8 vowel sounds used by people in Colorado, where pretty much every vowel approaches the schwa (the schwa being the neutral position for the human vocal system and therefore easiest to pronounce). After a while, people will realise that to be successful at using voice activated systems, they'll need to adjust their inflection, and after a while will adjust it automatically when dealing with people who don't understand them, either.
But voice activated systems are stupid, anyway...speech is one of the slowest forms of human interaction, and is one of the few we have to actively concentrate on to perform. You know when people say, "Think before you speak?" That's because once you start speaking a large portion of your brain activity is devoted to doing so...it actually becomes harder to think about what to say next. Pressing a button or turning a dial takes practically no thought...which is another reason why a speech written in spontaneous draft still sounds better than one that is spoken aloud. If we convert machines to speach recognition, we're effectively asking people to interact with them in dumber ways. And can you imagine the logic involved with processing a fairly simple statement like "This check in my hand should be processed by you and in return i'd like fifty bucks in tens and ten one dollar bills." Since the command isn't linear, the machine not only has to recognize what each word means, but try and interpret them in queue. And if humans can't construct complicated sentences like the one above -- which any human over the age of about 4 can understand, before that kids can't identify the subject and object in complex sentences -- they'll be inconvenienced by speaking machines. Oh and for a simpler example, try this: "My pin number? 376 uhhhhhh...Forty-two thirteen...aaaaaaaaaaaand...is it six? no. Eight?...oh! oh! sixty eight!" A human can understand that...we'd be annoyed, but we'd get it.
Hey freaks: now you're ju
How selective would the speech recognition be? If I was playing musing on that computer, would the computer pick up the tones coming in and start "doing stuff(tm)" on my computer? What about background noises? My friend's Jello Biafra spoken word CDs?
I won't even go there with my Saturday Morning Cartoon CD - Eep Opp Ork Ah-Ah (This means mail all of my friends a copy of my resume)...
I donate all spillover Karma to the charity of my choice... Ada was still a babe despite what people may say...
just for the heck of it, is interface the voice synthesis output of one computer to the voice recognition interface of another and start a transfer of a large text file just to see how long it takes and how accurate it is. I might get about 10-20 bps thru phone line.
If they start standardizing on a vowel command system and people overcome the embarassment of using it, how long before SharperImage starts selling little boxes that make the same sounds at the push of a button, to, you know, make life even better?
try { do() || do_not(); } catch (JediException err) { yoda(err); }
I can picture myself working behind my computer, eating cookies (or whatever), and giving the computer a whistled command, and getting up to get of box of tissues to wipe the wet crumbs of my screen. I really don't think it will work...
Ok, that sounds fair, but I guess you'd want to have some sort of benefit after you invest your time?
I just don't see this sort of interface to catch on for standard applications. I mean - imagine you are in an office with 20 people grunting at their computers, the noise they make is just going to be unbearable. That's got to be worse than that annoying guy who's checking his voicemail via speaker phone. *shudder*
From the article:
By increasing the pitch of your voice, the scrolling speed increases. When you stop speaking, the scrolling ends.
Can you imagine sitting next to a guy who uses this, and not have a headache after 10 mins?
Hmmm, if you owned radiostation 95.3, all it would take is to buy some advertising time on other radiostations, and just say 'tune 95.3'. Could have some serious fun with that..
The letter 'h' is a letter, which is sometimes used to represent the sound [h], sometimes other sounds, and sometimes is silent.
The sound [h] is usually considered a consonant.
2020: Computers everywhere are controlled by grunts, moans, sighs, and snorts.
2040: Computers are finally small enough that they're all embedded into our environments, but neural interfaces don't work, so we still grunt and snort into our computers, but it looks like we're just grunting and snorting in general. People use computers exclusively, and never talk to one another; thus, language is lost and we just grunt and snort a lot.
2060: aliens visit hoping to find intelligent life, but instead find a bunch of snorting, grunting apes. They leave.
-- "Those who cast the votes decide nothing. Those who count the votes decide everything." -Joseph Stalin
You don't want the Teletubbies on if you've got this setup.
__
Do ya feel happy-go-lucky, punk?
As a Boston-area resident, I'd like to suggest that this choice of sound wouldn't work for us:
"Hey paahl, gahhhttah go pahhk my caah." *CD skips 4 tracks*
You'd figure the guy works for a New England university, he might've picked up on that. How about "y'all" instead?
RW
"He wouldn't have written 'ahhhhh,' to skip tracks on his CD player."
"Maybe he was dictating."
This
Pick up the mike and say "Waaaaaazzaaaaaaaaaappp"
And the people shall be oppressed, every one by another, and every one by his neighbour Isaiah 3:5
We're well known for stretching every vowel into several syllables. "Well" comes out "way-uhl" and a long "I" sounds like "ah." Every time one referred to oneself, the TV or CD would start skipping around.
"Way-uhl, Ah doan know wut Ah'm gonna do. Mah CD keeps skippin'. Wut are y'all gonna do?"
Here we are at the peak of the greatest technological revolution the world has ever known, and this guy wants us to go back to communicating with grunts and moans.
What would Rain-in-the-Face do?
Clicks, wheezes, pops, random onscenities. Sounds like the way I interract with my computer NOW!
Sound kind of like how a farmer controls a sheepdog ...
- !K
Great, let's get a roomful of people trying to control their PCs, and it'll sound something like this:
Ooo, Eee, Ooo, Ahh ahh,
Ting, Tang, Walla walla bing bang.
--
"Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
Steps to mess with your friends (or enemies)
1)Install this and set it up so that this starts up when windows does.
2) Set a sound to shutdown Windows
3) Record that sound and set it to play whenever windows starts or whenever there is an error.
4)loop the sound output into the input.
5) sit back and enjoy watching them turn on there computer only for it to grunt and turn off on them.
*note* Don't know if all of this would be possible but I just had to share this thought
-- Any comments seen here are not mine, but a mixture of alchohol and lack of sleep.
Great. I'm almost finished my ultra-long /. post and someone ICQs me.
"Uh oh"
On another note, I knew a guy who worked with voice rec software where the delete-word command was "oops". Whenever he would watch another person typing and they would typo, he would instinctively say "oops". I'm guessing it's kind of how my writting went bad went I was using graffiti a lot. You get used to these quirky mannerisms you use to control the machines. Then you end up looking like a dork and annoying the people around you
-no broken link
There is no need for this. Voice recognition already works. And it works well. And it already works with REAL words. No need to grunt, squeal of burp into your microphone.
I first used voice recognition software with OS/2 4.0 on a P100 with 16MB. I was amazed at how well it worked. Of course, 16Mb was inadequate for dictation, but even with that puny system I had it trained in half and hour.
There's a reason that voice recognition hasn't caught on. It's not because it doesn't work. It's because people don't want to talk to their computers. It's embarassing. It's not convenient. It's awkward to say those commands that computers need, like "arrem minus arref slash star".
A Government Is a Body of People, Usually Notably Ungoverned
And during sex the entire house becomes a party place (lamp on, lamp off, hifi on , hifi off ....) a:D
----- Whats wrong with this picture? http://www.revoh.org:1234/whatswrong
I think that speach recognition as a computer interface would be very powerful for the following reason:
In general (yes, there are exceptions), GUI's excel at bringing a greater density of information from the computer to the user, while command line technologies are better at delivering a greatly enhanced level of information density from the user to the computer. I remember trying to go from a command line FTP to WS-FTP and going RIGHT BACK because it made "simple" tasks like downloading a file to a floppy disk but as a different name and making it FAR more complicated.
The advantage of a speach interface is that theoretically, you have at nearly as much information density going to the computer as you do from the command line, and it does not conflict with the GUI.
Of course this argument also works for X-term...
LedgerSMB: Open source Accounting/ERP
You could run the voice recognition system as a vital resource, so when the system crashes, you go "[Zarking] [buggering] [smegging] Windows!" and it installs Linux automatically. Good idea, right?