IBM Develops Technology To Talk To Web
ProgramErgoSum writes to tell us that IBM's Indian-based research arm is trying to bring a new dimension to web interaction through voice interaction on your mobile phone. Developing a new protocol, Hyperspeech Transfer Protocol (HSTP), the hope is to allow users to talk to the web and get a response. Without more explanation I'm hoping this goes about as far as the gopher web. "The spoken web is a network of voice sites or interconnected voice and the response the company got in some pilot projects in Andhra Pradesh and Gujarat and the kind of innovations that people came up with were just mind-boggling, Gupta said. "
but unnecessary. Instead of trying to create a new standard, what's wrong with sending an http request, and receiving an rtp response. Let the device do the text-to-speech conversion, like they do already.
I just can't imagine an entirely new protocol being adopted when it is already very possible using existing technologies...
Sewage Treatment Facilities - "Our duty is clear."
Talk to the animals? Talk to the Web? Same difference.
Schroedinger's Brexit: The UK is both in and out of the EU at the same time!
Voice tech has an achilles heel: It's called accents. Most voice software works great for english-speaking people in the midwestern United States. But if you have an accent and have ever tried to "interact" with one of those voice mail systems that are speech-activated rather than touch-tone, the words unholy rage doesn't begin to describe the frustration of listening to a soothing voice repeatedly saying "I'm sorry, I do not understand your request" and then endlessly repeats the menus. Pressing '0', if you're wondering, will only make the system remind you that it (a) only speaks english and (b) while it can process touch tones, it won't -- because it hates you.
And IBM wants to bring this unique hell to the web? What kind of sadists are these people? As if websites that require Flash and the horrors that server-side Java unleashed wasn't enough...
#fuckbeta #iamslashdot #dicemustdie
User: fap fap fap fap fap
Web: Oh Yea baby!
User: fap fap fap fap fap
Web: Wow that's it yea!
We are the people our parents warned us about.
Breaker breaker, good buddy! Thanks for visiting my online speakin' site! My handle is: The Delta Lady! If ya'll wanna visit my cousin Watts' site, just say "bacon." If'n'ya wanna hear a special Christmas story about varmints pullin' Santa's sleigh, say "Merry Chris'mas, ya'll!"
(-1, Raw and Uncut is the only way to read)
NOTE TO SELF: code future IVR system to respond to "*6", instead, for operator requests.
When you're talking about millions of terminals vs. relatively few servers, the "dumb" terminals are cheap. Also, doing good voice recognition requires beefy hardware -- probably, ideally, DSP/GPU accelerator boards or a google-style huge cluster of commodity PCs. Finally, for blind users, but also for others, listening to even the best synthesized voice gets tiring/grating after a while. It's much nicer to listen to good speech from a professional narrator, over even a normal human speaker, much less a "good" voice synth.
I still think it'd be better for everyone if they worked on supporting a globally usable standard that could be applied on any machine, like CSS aural media, though. TTS and voice recog is probably the future anyway, might as well start taking it seriously now.