Open Source Speech Recognition - With Source

Posted by timothy on Tuesday September 28, 2004 @11:18AM from the what-I-hear-you-saying-is dept.

Paul Lamere writes "This story on ZD-Net and this recent story on Slashdot describe the recent open sourcing of IBM's voice recognition software. This release, unfortunately, doesn't include any source for the actual speech recognition engine. Olaf Schmidt, a developer on the KDE Accessibility Project, is quoted as saying 'There is no speech-recognition system available for Linux, which is a big gap.' In an attempt to close this gap, we have just released Sphinx-4, a state-of-the-art, speaker-independent, continuous speech recognition system written entirely in the Java programming language. It was created by researchers and engineers from Sun, CMU, MERL, HP, MIT and UCSC. Despite (or because of) being written in the Java programming language, Sphinx-4 performs as well as similar systems written in C. Here are the release notes and some performance data."

4 of 404 comments

But what about text to speech? by Anonymous Coward · 2004-09-28 11:26 · Score: 5, Interesting

When are we going to get GOOD text to speech that uses modeled parameters of human vocal tracts rather than stitching together a bunch of pre-recorded phonemes?

Free C++ alternative from Mississippi State Univ. by j.leidner · 2004-09-28 11:34 · Score: 4, Interesting

Another open source system, but implemented in C++ (like all industrial systems I know of), can be found here (a vision statement is here).
--
Try Nuggets, the mobile search engine. We answer your questions via SMS, across the UK.

Speech recognition by CastrTroy · 2004-09-28 11:44 · Score: 4, Interesting

Speech recognition is one of the worst means of input there is for a computer. Keyboards work so much better. Even for those who don't have full use of their hands, there are many other options for user input, all of which are better than speech recognition. Worst thing ever is someone trying to use speech input in a cubicle environment.

--

Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.

nifty desktop control with sphinx and festival by Danny+Rathjens · 2004-09-28 13:07 · Score: 4, Interesting

http://perlbox.sourceforge.net/
The very small vocabulary needed for desktop control makes the speech recognition much more accurate and usable.