The Coming Wave of Gadgets That Listen and Obey
dgan brings us a NYTimes piece about the development of speech recognition for common gadgets. Companies such as Vlingo and Yap are marketing their software to cellular carriers to give consumers a hands-free option for tasks like finding directions and text messaging. Quoting:
"Vlingo's service lets people talk naturally, rather than making them use a limited number of set phrases. Dave Grannan, the company's chief executive, demonstrated the Vlingo Find application by asking his phone for a song by Mississippi John Hurt (try typing that with your thumbs), for the location of a local bakery and for a Web search for a consumer product. It was all fast and efficient. Vlingo is designed to adapt to the voice of its primary user, but I was also able to use Mr. Grannan's phone to find an address. The Find application is in the beta test phase at AT&T and Sprint. Consumers who use certain cellphones from those companies can download the application from vlingo.com."
Is it possible that all of mankinds dreams are coming true now?!
So basically, -1 troll/offtopic is really slashdots way of saying "I hate that you thought of something before me."
"I'm afraid I can't do that, Dave."
The CB App. What's your 20?
User: Please connect me with Hugh Jass
Gadget: Sorry, I could not find a Hugh Jass
User: *snicker*
I can imagine the day we speak the name of some legislation in the phone and say "vote yes" or "vote no". The results show up on our congressman's web site and some other third party sites that archive. This way we take control of a few and transfer it to the less corruptible and wiser "many".
"an infinite player that has lost his finite mind" ~Infinite Play the Movie (it blends with reality)
I maintain great skepticism about speech recognition as an interface. It just isn't much faster than typing, even on a cell phone- and its not that it takes so much longer to get an ideal rendering, its that even a minor error in translation results in about five seconds of prompting followed by reentry. Until they can get that figured out, or get accuracy up to a point where someone unused to giving dictation can use it, its just not that great a technology.
Limited phrasebook technology is a lot better than voice recognition technology in a lot of devices. Given that most (well, all) devices have limited functionality (not even Steve Jobs' iPod can do his taxes for him), there's very little point in giving the device the ability to understand possibly-misdirected phrases such as "Honey, have you seen the remote?". A good approach for this technology would be to limit it to understanding alternate ways of phrasing a particular command; "Device, Get Me A Beer"/"Device, Can I Have A Beer"/"I'm Really Thirsty". This way, we'd avoid misdirected speaking (the device thinking you're speaking to it instead of to another), and could also exploit the reduced set of understandable phrases to correct for people with colds/accents/quiet voices/etc, in much the same way as limited-phrasebook devices work (only with more flexibility).
Commodore64_love: I don't comprehend people who're so frightened of death that they'll bankrupt themselves to stay alive
Everything. I personally don't give a rat's ass about cell phones - it's not really a big deal or very innovative until you just have a communicator built in. Everything else though, from doors, lights, running tasks on a computer, etc. is what's really cool. Little inane things that just piss you off in life - like having to get up from the bed with the girl/boy on it to turn the light off, or setting a TV up for a movie, or having the computer do everything you want. I'd much rather say "wait until this song ends, then play "Helter Skelter," and then put up an away message and turn the display off." (Not that I can't really do that already, it's just more aggravating)
I live in constant fear of the Coming of the Red Spiders.
Comment removed based on user account deletion
I can't get over this "hands free text messaging" option! What engineer had the insight "we need to give customers a way to communicate over the phone just by talking"? It's a strange world.
Socialism: a lie told by totalitarians and believed by fools.
There is no reason you couldn't set your car's speed with your cell phone using Blue Tooth. Just say 80 MPH please. Or reduce rapid, no break lights, 60. Or speak "reduce 3 spot 60 BL not." That means reduce speed to 60 in 3 seconds no break lights. Over the course of 3 microseconds the car determines based on recent stored values if there is another vehicle approaching from behind and how close speed and if a collision would result from your command. If not it executes the command. Everyone could be tweaking the safety parameters to have the fastest vehicle. With enough sensors and computerized control we could travel much faster safely. Of course there is an issue with voice command in an environment with lots of noise pollution. Of course why not just tell it your destination and have it automatically race like a bat out of hell, coordinating with all the other cars on the road, to get you there while change stations on the stereo and recline the seat. Obviously we would engineer any big brother features out of the traffic intelligence system creating autonomous anonymous intelligent navigation.
"an infinite player that has lost his finite mind" ~Infinite Play the Movie (it blends with reality)
Most of the cell phone systems described in the article are likely uploading the audio to a server farm, running recognition there, and then sending back the response.
Forget voice recognition/synthesis and all that crude claptrap ... I want a brain implant capable of accessing symbolic thought patterns directly. Just think about something and the machine will figure out what it is that you want to know, and feed the information back into your head as if you'd just remembered it naturally. You wouldn't even have to know the difference between a "real" recollection and one that was put there on-the-fly. You would just know stuff. Need to perform some integral calculus? No need to boot up your desktop PC ... let the implant do the work! So long as it's connected to the global network it could find out anything that's been published, access any service it would need to do what you want. How cool would that be? Imagine you're a scientific researcher and you're thinking about what to do for your next study: you'd instantly know what's already been done in the field. Want to control your home? Just think it! {"She's getting into the mood, better start dimming the lights and fading up the music."}
I mean, LSD was supposed to "expand the mind", but this kind of technology could actually do it.
The higher the technology, the sharper that two-edged sword.
One of the features of my new phone is "Voice SMS."
Think about that for a moment. It's like a text message, but it's voice. On a phone.
According to Sprint, the reason this is better than a normal voice mail message is that you're guaranteed to leave a message and not actually reach the person you're calling (which comes up how often?) and that the text message UI is easier to deal with than the voice mail system. (Then why not offer a voice mail UI?)
And, of course, it wastes both a text message and data transfer. So instead of leaving a voice mail message, which uses normal minutes and during off-peak hours is free, you get to pay extra for this feature. (Oh, I get it, improving the voice mail experience would be too hard to monetize, so it's just not worth it. The iPhone must be an illusion.)
You are in a maze of twisty little relative jumps, all alike.