Slashdot Mirror


IBM to Open Voice Recognition Software

phug writes "According to the NY Times, IBM is donating code that it estimates cost the company $10 million to develop. One collection of speech software for handling basic words for dates, time and locations, like cities and states, will go to the Apache Software Foundation. The company is also contributing speech-editing tools to a second open-source group, the Eclipse Foundation." There's not much information out there yet - e.g. no word on licenses etc. It is worth pointing out that the Eclipse Foundation was started by IBM.

3 of 189 comments (clear)

  1. HTK is already availabale as open source by virtigex · · Score: 4, Informative
    From the article, it looks like they are making their network grammar version available, not their dictation grammar version. There are types of continuous speech recognition engines, the simple version that uses a hand-crafted network grammar (which seems to be the version that they are talking about), which can be used to recognize simple utterances such as dates, and one that uses a statictical language model and which can recognize an entire language.

    This is not earth-shattering news, since HTK has been available for some years. HTK was owned by a company called Entropic and was released as open source when it was bought by Microsoft. HTK can be found at http://htk.eng.cam.ac.uk/. and can handle network grammars. This lessens the impact of IBM's news.

  2. Re:HTK is NOT availabale as open source by bonniot · · Score: 5, Informative
    I was suspicious about MS releasing anything under an Open Source license, so I checked. From HTK's license:

    2.1 The Licensor hereby grants the Licensee a non-exclusive license to a) make copies of the Licensed Software in source and object code form for use within the Licensee's organisation; b) modify copies of the Licensed Software to create derivative works thereof for use within the Licensee's organisation.

    2.2 The Licensed Software either in whole or in part can not be distributed or sub-licensed to any third party in any form.

    This license is in no way Open Source. Yes, you can play with the source, but you cannot build something useful with it and redistribute under the same license.

  3. Sphinx by agentk · · Score: 5, Informative

    Hmm, this is nice, but I was never impressed by ViaVoice. Sphinx is much better to work with.

    Reed

    --

    VOS/Interreality project: www.interreality.org