Google Opens Access To Its Speech Recognition API, Going Head To Head With Nuance (techcrunch.com)
An anonymous reader quotes a report from TechCrunch: Google is planning to compete with Nuance and other voice recognition companies head on by opening up its speech recognition API to third-party developers. To attract developers, the app will be free at launch with pricing to be introduced at a later date. The company formally announced the service today during its NEXT cloud user conference, where it also unveiled a raft of other machine learning developments and updates, most significantly a new machine learning platform. The Google Cloud Speech API, which will cover over 80 languages and will work with any application in real-time streaming or batch mode, will offer full set of APIs for applications to "see, hear and translate," Google says. It is based on the same neural network tech that powers Google's voice search in the Google app and voice typing in Google's Keyboard. Google's move will have a large impact on the industry as a whole -- and particularly on Nuance, the company long thought of as offering the best voice recognition capabilities in the business, and most certainly the biggest offering such services.
It's not so much that Nuance is known for being the best for a long time, it's more that they've bought out all their competitors and have pretty much controlled the market.
>" Google says. It is based on the same neural network tech that powers Google's voice search in the Google app and voice typing in Google's Keyboard."
Indeed. So does this mean Google will store and mine and analyze and profitize the spoken text data too?
To attract developers, the app will be free at launch with pricing to be introduced at a later date.
The first one's always free...
#DeleteChrome
Google Closes Access To Its Speech Recognition API, 3rd party developers left scratching heads
I'm waiting to see if/how this affects Pebble Time. We've been wanting access to the Google Voice API for ages now. Personally I want it mostly for Google Now integration, which may or may not be separate.
Nuance's Dragon Naturallyspeaking is about the most frustrating, ill-conceived, effectively-unsupported, crash-prone, erratic and generally flaky application of its kind on the market. It's unstable, unpredictable, and regularly drives every user I know into apoplexy. The problem is, they just don't care. Really, they don't: bugs are left unaddressed for years, often through several major "revisions", because they know that there's nowhere else for users to go. That's especially true if one needs their specialized vocabularies.
... this is it.
If anyone wants to know why monopolies are bad
The higher the technology, the sharper that two-edged sword.
Any idea you might have that the market will do what you think is optimum is based upon a complete misunderstanding of markets.
Markets often choose inferior performance options. High quality solutions often fail to gain, or keep, traction. No undertaking that doesn't have significant lobbying impact (which of course means high $) with the relevant legislature can reasonably expect its business model to be protected in the face of any particular eroding force. Once a particular solution to a problem has been chosen, it is very likely that any change has social hurdles to overcome: those having made the decision are invested; training costs and familiarization erect similar barriers; disruption of stockholder confidence can be a factor.
I've fallen off your lawn, and I can't get up.