The Future of Speech Technologies
prostoalex writes "PC Magazine is running an interview with two of the research leaders in IBM's speech recognition group, Dr. David Nahamoo, manager of Human Language Technologies, and Dr. Roberto Sicconi, manager of Multimodal Conversational Solutions. They mainly discuss the status quo of speech technologies, which prototypes exist in IBM Labs today, and where the industry is headed." From the article: "There has to be a good reason to use speech, maybe you're hands are full [like in the case of driving a car]. ... Speech has to be important enough to justify the adoption. I'd like to go back to one of your original questions. You were saying, 'What's wrong with speech recognition today?' One of the things I see missing is feedback. In most cases, conversations are one-way. When you talk to a device, it's like talking to a 1 or 2 year old child. He can't tell you what's wrong, and you just wait for the time when he can tell you what he wants or what he needs."
The present of speech technology already does, and did so for years. One problem is that you don't have a huge enough word corpus for training that technology (the knowledge of context is always limited to the domain that you have been training it against).
Doctors in Finland are starting to use speech recognition to update patient records. I think it is in testing at the moment, check the following link for details.
1 6080;163;9862
http://www.tietoenator.com/default.asp?path=1;93;
You're mistaken about Tellme laying people off; they are doing quite well and are growing. You're right that the voice portal idea is no longer emphasized, but Tellme's making great money selling voice services to enterprise customers.
speech recognition
O /software.htmlc h/software/
http://www.speech.cs.cmu.edu/sphinx/
image+speech recognition
http://sourceforge.net/projects/opencvlibrary/
Desktop voice commands
http://perlbox.sourceforge.net/
Others
http://www.tldp.org/HOWTO/Speech-Recognition-HOWT
http://www.cavs.msstate.edu/hse/ies/projects/spee
Do you know about other usable open source speech solutions?