Slashdot Mirror


Google's New Voice Recognition System Works Instantly and Offline (If You Have a Pixel) (techcrunch.com)

Google's latest speech recognition works entirely offline, eliminating the delay that many other voice assistants have to return your query. "The delay occurs because your voice, or some data derived from it anyway, has to travel from your phone to the servers of whoever operates the service, where it is analyzed and sent back a short time later," reports TechCrunch. "This can take anywhere from a handful of milliseconds to multiple entire seconds (what a nightmare!), or longer if your packets get lost in the ether." The only major downside with Google's new system is its limited availability. As of right now, it's only available to people with a Pixel smartphone. From the report: Why not just do the voice recognition on the device? There's nothing these companies would like more, but turning voice into text on the order of milliseconds takes quite a bit of computing power. It's not just about hearing a sound and writing a word -- understanding what someone is saying word by word involves a whole lot of context about language and intention. Your phone could do it, for sure, but it wouldn't be much faster than sending it off to the cloud, and it would eat up your battery. But steady advancements in the field have made it plausible to do so, and Google's latest product makes it available to anyone with a Pixel.

Google's work on the topic, documented in a paper here, built on previous advances to create a model small and efficient enough to fit on a phone (it's 80 megabytes, if you're curious), but capable of hearing and transcribing speech as you say it. No need to wait until you've finished a sentence to think whether you meant "their" or "there" -- it figures it out on the fly. So what's the catch? Well, it only works in Gboard, Google's keyboard app, and it only works on Pixels, and it only works in American English. So in a way this is just kind of a stress test for the real thing.
"Given the trends in the industry, with the convergence of specialized hardware and algorithmic improvements, we are hopeful that the techniques presented here can soon be adopted in more languages and across broader domains of application," writes Google in their blog post.

3 of 41 comments (clear)

  1. Re:the reason offline function is available.. by Solandri · · Score: 3, Informative

    Most of the software functionality of the Pixel 3 has been hacked and extracted. You can install it on your Android device running Nougat or later if you're rooted with Magisk. If this offline voice recognition is done in software instead of dedicated hardware (like the original Moto X), expect it to be made available for other rooted devices as well.

  2. Re: Battery by Anonymous Coward · · Score: 2, Informative

    And it's quality was so poor you never hear of anyone actually using it. So what's your point?

  3. There is dedicated hardware for neural networks by SuperKendall · · Score: 3, Informative

    I wouldn't be surprised to learn if there's some dedicated hardware that's been added to the SoCs in the latest phones that enable doing this on the device itself.

    Yes, just like Apple has the Neural Engine, Google has the Pixel Visual Core

    The name is misleading because from what I can tell (and what the article says) it is like the Apple chip, and can help with arbitrary neural network processing.

    What I'm not sure of is the speed of the iPhone chip compared to the Pixel one, the iPhone chip took quite a leap in speed this year...

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley