Slashdot Mirror


The Future of Speech Technologies

prostoalex writes "PC Magazine is running an interview with two of the research leaders in IBM's speech recognition group, Dr. David Nahamoo, manager of Human Language Technologies, and Dr. Roberto Sicconi, manager of Multimodal Conversational Solutions. They mainly discuss the status quo of speech technologies, which prototypes exist in IBM Labs today, and where the industry is headed." From the article: "There has to be a good reason to use speech, maybe you're hands are full [like in the case of driving a car]. ... Speech has to be important enough to justify the adoption. I'd like to go back to one of your original questions. You were saying, 'What's wrong with speech recognition today?' One of the things I see missing is feedback. In most cases, conversations are one-way. When you talk to a device, it's like talking to a 1 or 2 year old child. He can't tell you what's wrong, and you just wait for the time when he can tell you what he wants or what he needs."

4 of 101 comments (clear)

  1. Re:the footer offs peach take no allergy by knipknap · · Score: 3, Informative

    The present of speech technology already does, and did so for years. One problem is that you don't have a huge enough word corpus for training that technology (the knowledge of context is always limited to the domain that you have been training it against).

  2. Doctors are going to use speech recognition by Aggrajag · · Score: 3, Informative

    Doctors in Finland are starting to use speech recognition to update patient records. I think it is in testing at the moment, check the following link for details.

    http://www.tietoenator.com/default.asp?path=1;93;1 6080;163;9862

  3. Re:So why is voice input in decline? by mikeylebeau · · Score: 3, Informative

    You're mistaken about Tellme laying people off; they are doing quite well and are growing. You're right that the voice portal idea is no longer emphasized, but Tellme's making great money selling voice services to enterprise customers.

  4. Open source speech recognition engines by mandreiana · · Score: 3, Informative