IBM to Open Voice Recognition Software
phug writes "According to the NY Times, IBM is donating code that it estimates cost the company $10 million to develop. One collection of speech software for handling basic words for dates, time and locations, like cities and states, will go to the Apache Software Foundation. The company is also contributing speech-editing tools to a second open-source group, the Eclipse Foundation." There's not much information out there yet - e.g. no word on licenses etc. It is worth pointing out that the Eclipse Foundation was started by IBM.
This is great, ViaVoice has disappeared for quite a while now on linux, I hope that this will open a great variety of cool open source applications. If this will be made modular like e.g. festival, I can think of endless applications worth using it.
Life is just nature's way of keeping meat fresh.
Is this ViaVoice? The linux packages have been pulled off the IBM site a year or so ago but they're still floating around.
8 of 13 people found this answer helpful. Did you?
Are you sure you meant to say "All your base are belong to us?" Did you mean "All you lasers are better than us?"
Shameless self promotion
Eclipse is actually a kind-of Swiss Army Chainsaw -IDE. You can make plugins for pretty much everything, so one could speculate that a voice recognition plugin would be feasible.
I don't know about everyone else, but the concept of coding by voice does fascinate me. There are obvious issues (like eliminating having to say every single control character (if at all possible)), but with a background of RSI I think it's at least worth a shot.
Thoughts?
.: Max Romantschuk
I love you, IBM. I want you inside me.
Welcome to the land of the free...pay toll ahead...no photography...please open your bag...
Why is it doing this, is it because they think they can make more money with increased software sales? It also might be an advertising campaign, $10 million donation is buying a lot of free coverage.
Corporations dont usually give a way stuff for nothing, in fact their mission by law is to maximize profit.
When you look at GNU/Linux as a complex system and think of the things that users complain about when Linux usability is concerned, GPL'd speech recognition software is definitely one of them.
Hooray for IBM and as Ali said in the Linux ad "don't back down"!!
Never underestimate the power of idiots in large groups
Is voice recognition software really viable? When you take into account the different accents, dialect and slang, is it just a pipe dream? Is it a software or hardware related issue?
...if only computers (namely Macs) had this technology back in the 80's our favourite 23rd century engineering hero wouldn't have had so much trouble using one at the plexiglass plant. "Hellooooo computer". Still cracks me up.
"Backups are for wimps. Real men upload their data to an FTP site and have everyone else mirror it." -- Linus Torvalds
My brother (who works for IBM) recently sent me an article on USA Today about the system IBM and Honda have developed for speech-interface with a GPS-enabled navigation computer. Really cool stuff.
For those of you who haven't read it, check out The Unfinished Revolution by Michael Dertouzos. I don't agree with all of his analysis (he was a little lacking in pragmatism on some points), but overall this book was very insightful. This book, along with Weaving the Web by Tim Berners-Lee, caused a big paradigm shift in my thinking about computer technology.
In the late 90s I talked with an IBM representative about releasing the ViaVoice source under a Free Software license and the person I talked to (I don't recall his name) said that they might be willing to release the source code- the code wasn't valuable to them. The value in the ViaVoice is the "thousands of hours of training" that the code uses to determine words and voices.
So my question is- will the code released include training to make it work and or will someone be able to put together the necessary resources to train the system.
This is not earth-shattering news, since HTK has been available for some years. HTK was owned by a company called Entropic and was released as open source when it was bought by Microsoft. HTK can be found at http://htk.eng.cam.ac.uk/. and can handle network grammars. This lessens the impact of IBM's news.
Nice title;
.Net technology. What exactly is the difference in quality and approach between the package from M$ and the one here mentioned from IBM ?
Speech code from IBM to become open source
And even better.. the comment from Microsoft, quoted at the end of the article
"IBM has not executed in bringing this technology to a broad market as Microsoft has."
Beside the jokes; The article states as well that Microsoft introduced their Speech Server 2004 last March, and that 100,000 software programmers have downloaded Microsoft's free software developers' kit for building speech applications on its Windows
IBM Hursley labs had a name dialler 5 years ago that let you phone the computer, say the name fo the person you wanted to speak with, and get put through. They also had a system that provided weather forecasts based on the name of the city or country you said. I was pleased to name the latter "Global Weather Information System" or GWIS, pronounced Gee-whizz. Both ran on the machine under my desk. Both worked reasonably well, especially given that a lot of the acoustic models for names and places were automagically generated.
"The new wave is not value-added; it's garbage-subtracted" - Esther Dyson, Dec 1994
2.1 The Licensor hereby grants the Licensee a non-exclusive license to a) make copies of the Licensed Software in source and object code form for use within the Licensee's organisation; b) modify copies of the Licensed Software to create derivative works thereof for use within the Licensee's organisation.
2.2 The Licensed Software either in whole or in part can not be distributed or sub-licensed to any third party in any form.
This license is in no way Open Source. Yes, you can play with the source, but you cannot build something useful with it and redistribute under the same license.
Watch great movie opening scenes!
Good will in the geek community, free publicity for something that would have just laid around collecting dust otherwise, and maybe a $10 million tax deduction for donating to a non-profit. Not sure about the tax deduction, but this is a donation to a charitable organization, and you can deduct the value of what you donate to these organizations, such as the value of a used car.
Hmm, this is nice, but I was never impressed by ViaVoice. Sphinx is much better to work with.
Reed
VOS/Interreality project: www.interreality.org
For an open-source speech recognition system with a real open source licence check out the CMU Sphinx Project, a family of speech recognition engines, training tools and associated acoustic and language models. The latest version Sphinx-4 is written in Java and is released under a BSD-style license.
Modern voice dictation software is pretty good I'm using viavoice now to write this and I find bark bark shaddup I find that it bark bark shut up damnit bark bark don't make me come down there I find that bark bark okay that's it I'm coming down there argh crash thud bark bark bark bark bark bark
Curiosity was framed. Ignorance killed the cat.
Does this mean that speech is now free as in beer?
Lost: Sig, white with black letters. No collar. Reward if found!
They're not open-sourcing anything resembling ViaVoice to the Eclipse folks. Check out the eclipse voice tools proposal. It's directed at making it easier at creating call-center type voice reco apps - not at making Eclipse a voice-directed IDE.
:-) As a stop-gap they're hoping to get WINE support for Dragon/Scansoft NaturallySpeaking.
If you're interested in open-source voice recognition check out OSSRI - an effort to bring together some sort of practical large vocab speech recog to linux. They're just starting up, but the mailing list archives hold a fair amount of discussion about the current state of the open-source SR world. (Which, to sum up, isn't that great
Bill Gates announced today that the source code for Microsoft Bob® and Microsoft Clippy®, valued on Microsoft's books at $175 million, has been donated to the Free Software Foundation, a tax-exempt entity.