Google Opens Access To Its Speech Recognition API, Going Head To Head With Nuance (techcrunch.com)
An anonymous reader quotes a report from TechCrunch: Google is planning to compete with Nuance and other voice recognition companies head on by opening up its speech recognition API to third-party developers. To attract developers, the app will be free at launch with pricing to be introduced at a later date. The company formally announced the service today during its NEXT cloud user conference, where it also unveiled a raft of other machine learning developments and updates, most significantly a new machine learning platform. The Google Cloud Speech API, which will cover over 80 languages and will work with any application in real-time streaming or batch mode, will offer full set of APIs for applications to "see, hear and translate," Google says. It is based on the same neural network tech that powers Google's voice search in the Google app and voice typing in Google's Keyboard. Google's move will have a large impact on the industry as a whole -- and particularly on Nuance, the company long thought of as offering the best voice recognition capabilities in the business, and most certainly the biggest offering such services.
Would anyone here be willing to admit that they would use this service just so they could repeatedly say "penis" to it for no reason other than to have it recognize that they're saying "penis" to it?
It's not so much that Nuance is known for being the best for a long time, it's more that they've bought out all their competitors and have pretty much controlled the market.
>" Google says. It is based on the same neural network tech that powers Google's voice search in the Google app and voice typing in Google's Keyboard."
Indeed. So does this mean Google will store and mine and analyze and profitize the spoken text data too?
To attract developers, the app will be free at launch with pricing to be introduced at a later date.
The first one's always free...
#DeleteChrome
Google Closes Access To Its Speech Recognition API, 3rd party developers left scratching heads
...the app will be free at launch with pricing to be introduced at a later date.
/insert metaphor about drug dealers here
Yes, thanks you for the free QA.
I'm using Google Voice to Text thing to write this, just to see how good it is or if it's today or not. Generally I think it works pretty good but every once in awhile it really fucked up my words. For instance in this message so far I see at least two words that is messed up.
I'm waiting to see if/how this affects Pebble Time. We've been wanting access to the Google Voice API for ages now. Personally I want it mostly for Google Now integration, which may or may not be separate.
Nothing to see here, folks. Google will end support for this in short order anyways. I recommend picking something else that won't be abandoned...which means things other than Google offers.
This is interesting but not really where I'd prefer to see speech recognition going. The current approach seems to be to do speech recognition in the cloud, which results in a huge level of concern about where your speech is being sent (smart TVs always listening to your living room etc.)
It would be great to see someone addressing the privacy concern by developing a decent offline speech recognition system. There are a few free software efforts: CMU Sphinx, Julius and Kaldi; but none seem to be really fully mature and there's little benchmarking to indicate how well they perform.
Now NSA can spy on non-english language speech..... for free.... good /rofl...
The world needs high quality STT that works when the net is down and isn't vulnerable to arbitrary changes in API, availability, and legal impediments.
It's clearly one of the harder software problems, but I expect it to be solved in fairly short order; years, not decades.
I've fallen off your lawn, and I can't get up.
Any idea you might have that the market will do what you think is optimum is based upon a complete misunderstanding of markets.
Markets often choose inferior performance options. High quality solutions often fail to gain, or keep, traction. No undertaking that doesn't have significant lobbying impact (which of course means high $) with the relevant legislature can reasonably expect its business model to be protected in the face of any particular eroding force. Once a particular solution to a problem has been chosen, it is very likely that any change has social hurdles to overcome: those having made the decision are invested; training costs and familiarization erect similar barriers; disruption of stockholder confidence can be a factor.
I've fallen off your lawn, and I can't get up.
Like many of google's products, if it doesn't take off or they can't profit, it'll be shut down and you and your users will be left out in the cold.
https://en.wikipedia.org/wiki/Category:Discontinued_Google_services
Homsar, is that you?