Slashdot Mirror


Apple's Siri May Soon Process Voice Locally On a Device, No Cloud Required (appleinsider.com)

Proudrooster writes: "Apple wants Siri to become more useful to users when not connected to the internet, including the possibility of an offline mode that does not rely on a backend server to assist with voice recognition or performing the required task, one that would be entirely performed on the user's device," reports Apple Insider. Just give it 10 years and everything old is new again. Siri will join the ranks of Ford/Microsoft Sync and Intel Edison. Do any other phones/cars/speakers have this option right now? The new capabilities are outlined in a recently-published patent application that describes an "Offline personal assistant."

"Rather than connected to Apple's servers, the filing suggests the speech-to-text processing and validation could happen on the device itself," reports Apple Insider. "On hearing the user make a request, the device in question will be capable of determining the task via onboard natural language processing, working out if the requested task as it hears it is useful, then performing it. "

45 of 83 comments (clear)

  1. AC May Soon FP On /. Locally, No Network Reqd by Anonymous Coward · · Score: 1

    French toast, m'ladies

    (tips toque seductively)

    1. Re:AC May Soon FP On /. Locally, No Network Reqd by ArchieBunker · · Score: 1

      Oh come on this is funny!

      --
      Only the State obtains its revenue by coercion. - Murray Rothbard
  2. Wow by Anonymous Coward · · Score: 1

    Welcome to 1998: https://en.wikipedia.org/wiki/Microsoft_Speech_API#SAPI_4

    1. Re: Wow by Millennium · · Score: 1

      1998? Silly Microserf (as some called them at the time), Welcome to 1993.

  3. As desktop computers were doing 20 years ago by SlaveToTheGrind · · Score: 1

    Golf clap.

    1. Re:As desktop computers were doing 20 years ago by Anonymous Coward · · Score: 1

      ROFL.

      True story: 20 years ago I bought via voice for my family. This new amazing product looked very impressive and promised to take the pain out of typing for family members who had "hunt and peck" typing methods.

      The result: utter crap. It couldn't get anything right, so you spend the same amount of time copyediting the gibberish it had transcribed as you would have hunt-and-peck typing it.

      The technology 20 years ago doesn't compare to modern results (which is really just the associated processing power behind them, I presume much of the underlying methods are the same). To compare the two is laughable. What I would be more worried about is how much such local processing will hammer the phone battery.

      Golf clap indeed!

    2. Re: As desktop computers were doing 20 years ago by Millennium · · Score: 1

      25-year-old dictation software did indeed suck. But the voice-control software worked well enough (PlainTalk, nee MacInTalk, did, anyway). Frankly, the voice-recognition software in my Echo seems roughly comparable to what Apple was doing 25 years ago, which is more depressing than anything else.

  4. Siri, Open the pod bay doors. by jfdavis668 · · Score: 2

    Sorry, I didn't get that...

    1. Re:Siri, Open the pod bay doors. by 93+Escort+Wagon · · Score: 3, Funny

      Here’s what I found on the web regarding “Bombay florists”...

      --
      #DeleteChrome
  5. Why the sarcasm? by SuperKendall · · Score: 4, Insightful

    20 years from desktop to pocket sounds pretty impressive to me!!

    And frankly it will probably work better than the old desktop stuff did which moistly did not take off (though Dragon seems to have done well with desktop software).

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
    1. Re:Why the sarcasm? by TWX · · Score: 2

      Not really. The desktop computers doing it were 16MHz Macintosh LCII models with 4MB RAM. I remember playing with Dragon in 1992 when in school.

      --
      Do not look into laser with remaining eye.
    2. Re:Why the sarcasm? by 93+Escort+Wagon · · Score: 1

      You looked pretty silly carrying that Macintosh with you as you walked down the street, though - doubly so with that ginormous extension cord.

      --
      #DeleteChrome
    3. Re:Why the sarcasm? by SlaveToTheGrind · · Score: 1

      20 years from desktop to pocket sounds pretty impressive to me!!

      Epic troll, dude. The first friggin' iPhone 10+ years ago had more processing power than a typical 1990s desktop. +5 insightful indeed.

  6. This shouldn't be patentable by TWX · · Score: 2

    This really shouldn't be patentable. We had the ability to control computers with voice a quarter-century ago. Not only would there have been patents back then, but those patents would have expired long ago.

    --
    Do not look into laser with remaining eye.
    1. Re:This shouldn't be patentable by markdavis · · Score: 2

      >"This really shouldn't be patentable."

      Exactly. I was wondering the same thing when I read the summary. Voice recognition on a "X" isn't really any different than on a "Y" or "Z". Now, if the *methodology* they are using is considerably different/improved, I would think THAT could be patentable, but not simply that a phone can perform voice recognition. Otherwise, it is a matter for copyright.

      Software patents are horribly abused. Ultimately, consumers are always who suffer due to higher prices, fewer new products, and the chilling of all innovation.

    2. Re:This shouldn't be patentable by swell · · Score: 1

      More specifically: "Offline personal assistant." ...

      is 90% identical to "Online personal assistant", and obvious to anyone familiar with the industry.
      There is nothing novel in this technology.

      And yes, I agree, most modern patents are obvious to anyone in the industry.
      And that fact is obvious to the patent examiners, yet they are approved.
      Very frustrating!

      --
      ...omphaloskepsis often...
    3. Re:This shouldn't be patentable by rtb61 · · Score: 1

      There is a patent in there, you just don't realise it. As a person device, it is no longer blanker voice recognition, but you recognition, it recognises your acoustic contact and learns to adapt functions to it. Technically you should be able to include whistles or any other noise generation technique to teach the device to adapt to you, with your voice as being the basic acoustic control engine. Of course taking your voice off the internet, making it private between the device you own and yourself means, Apple is striving to adhere to the digital principle of selling you privacy and not selling your privacy to the highest bidder.

      Sure the control freaks and anal retentive from Google, from M$, from Facebook, from the three letter agencies are all going to come out of the dung heap and troll and down ride this idea, it is their control freak fucked in the head nature but for the rest of us Apple should be really congratulated for this effort in the right direction. I look forward to my 85" all in one Apple desktop in my lounge room, privately taking my instruction and sharing the data privately amongst my devices and I'll bet Apple are on the way there.

      --
      Chaos - everything, everywhere, everywhen
    4. Re:This shouldn't be patentable by thegarbz · · Score: 1

      You don't understand this is new. It's like the original idea, but on a computer. ... errr in a pocket.

  7. Android Assistant... by Tomahawk · · Score: 3, Informative

    ... can perform some tasks offline. I can send an email, navigate, and change phone settings (adjust volume, etc), probably more. I've tested these in aeroplane mode and they worked fine.

    1. Re:Android Assistant... by AmiMoJo · · Score: 2

      Indeed, Google Assistant does local processing and remote processing at the same time. If the local processing is having trouble it can hand off to the cloud for more powerful voice recognition, and of course many questions require the cloud for an answer.

      By doing both it gives you the fastest possible response and is also highly reliable and works offline.

      Be interesting to see what the Apple patent contains considering other people are already doing it.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
  8. "Call Mom Mobile"... by magzteel · · Score: 3, Interesting

    ... "I'm having trouble with the connection"

    That drives me insane. Siri should at least handle voice calls when not connected.

    1. Re:"Call Mom Mobile"... by pz · · Score: 1

      My feature phone could handle voice-directed dialing nearly 20 years ago, once you trained it.

      My current Android phone can handle simple tasks like setting alarms without any connectivity whatsoever, and with no training whatsoever. It amazes me each time.

      What is the problem with Apple's Siri?

      --

      Put my fist through my alarm clock with its ding-dong death inside my ear. - The Blackjacks.
    2. Re: "Call Mom Mobile"... by Dantoo · · Score: 1

      My hands free bluetooth car device does everything that the summary indicates. I talk to it. I can ask it questions and it gives me options. It gives me the option to answer a call or it can read out an sms to me. It has never been connected to the Internet. It's not all that complex if a $50 device can do it for the last 5 years.

      What pisses me off most and completely prevents me having anything to do with these voice assistants and smartarse homes is the inability to "skin" them.

      Why the F do I have to use childish phrases like "Hey Siri (Google Alexa) and pathetic shit like that? It is totally demeaning and places you in a position of servitude to a commercial empire. A pox on their eyes and bonuses.

      There is an absolute fortune awaiting for somebody who develops one of these voice assist monsters that you can tailor personally. Give it a name of your choosing, along with voice and personality skins. Make it so that it only has to go to the net for updates and complex queries and you have the absolute future of computing. A whole new industry of personality skinning would take off overnight.

      Orac, Slave, K9, Skynet, Computer and HAL would be popular skins to produce for nerds.
      Disney could throw in Mickey, Goofy, and Unca Donald for kids and older kids.
      Jeez even the Kardashians could do something for teenagers, along with any amount of Hollywood, Bollywood and UK characterisations.

      James Bond skin: Hey Q (or Moneypenny) is my flight on time and is the car fully charged? If not have an Uber standing by in 15 minutes!

      It is all about the interface. The backend is mature enough for this already.

      Meantime we have Microsoft still trying make computers into phones and copying Apple's store strategy which they copied from Lindows which copied from Microsoft. Maybe there is a company in China or India that will just step and steal all their marbles by simply opening their eyes and ears.

    3. Re:"Call Mom Mobile"... by antdude · · Score: 1

      Hence why online requirement is stupid for many requests like this basic command.

      --
      Ant(Dude) @ Quality Foraged Links (AQFL.net) & The Ant Farm (antfarm.ma.cx / antfarm.home.dhs.org).
    4. Re:"Call Mom Mobile"... by MobyDisk · · Score: 1

      ^^ THIS ^^
      The worst part is when it correctly transcribes it and shows the text to you on the screen THEN opens a Google search. Why would it send the string "Call Mom on mobile" to the search engine!? Sometimes it even spells the contact name the exact same funny way it is in my address book!

  9. Re:"Everything Old Is New Again" by ShanghaiBill · · Score: 1

    If so, how can they patent it?

    Did you read the patent?
    Did you read the claims section?
    Do you have experience reading dense legalese?
    Do you have subject matter expertise?

    If the answer to any of these questions is "no", then the patent likely isn't what you think it is.

    Disclaimer: I didn't read the patent.

  10. Sorry, still cool by SuperKendall · · Score: 2

    The desktop computers doing it were 16MHz Macintosh LCII models with 4MB RAM

    Let me check... nope still cool to have desktop software moved into mobile, even I the mobile processor is more advanced.

    I have to think the stuff moving into mobile is a lot more accurate and can handle way more accents/languages than that ancient software though, so it's not like there has been no advancement that makes use of improved hardware.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
  11. Patent !=Soon by p.g.king · · Score: 1

    Because of course we all know that filing of a patent means the "invention" will hit the market real soon.

  12. This already exists for Raspberry Pi 3 by BitPit1 · · Score: 2

    I discovered an open source project at https://snips.gitbook.io/docum... that already does this. This is not a turnkey device like Siri, but it seems to suggest the the cloud is not required to implement speech recognition.

    1. Re:This already exists for Raspberry Pi 3 by AncalagonTotof · · Score: 1

      Yep, no cloud required, no connection at all. First, you build an assistant using their web console, then, you download it, and finally, you install it on your target platform, and you're good to go offline. We are using it in some projects at work. If I may : fresh French tech ! Oh, wait, I read something about and older ...

      --
      Totof
  13. Re:Siri is pretty much useless by phantomfive · · Score: 1

    The best way to think of Siri is as hot-key shortcuts for mobile devices. "Siri, call Mom" is a shortcut to scrolling through the contact list. Siri isn't particularly smart, but it does have solid use cases.

    --
    "First they came for the slanderers and i said nothing."
  14. I told you so by ReneR · · Score: 1

    half a year ago; reviewing how well ViaVoicehttps://www.youtube.com/watch?v=7CAkYs8PJT0 and Dragon Natural Speaking worked on vintage hardware back in the day: https://www.youtube.com/watch?...

  15. I told you so by ReneR · · Score: 1

    over half a year ago; reviewing how well ViaVoice https://www.youtube.com/watch?... and Dragon Natural Speaking worked on vintage hardware back in the day: https://www.youtube.com/watch?...

  16. My GPS has done this for many years by fyngyrz · · Score: 2

    Do any other phones/cars/speakers have this option right now?

    My Garmin nüvi 3597LMTHD GPS has done this since I bought it in 2013. It's not connected to anything, not wifi, not cellular, and not bluetooth. Where I live (the boonies) the traffic features and even the map updates are pretty pointless, so there's never been a need to connect it to anything. Yet it understands me just fine. And unlike Alexa and others, it allowed me to rename it — it only responds to "yo, bitch", which is just how I like it. :)

    Is the unit's understanding of language in general up to par with todays systems? No. But does it work for what it needs to understand? Yes. Very well.

    For the home, when and if MyCroft gains a local speech understanding capability, that's the way I'm going. Everything I want to do is local, and the unit can be customized to run just about anything you put together (of course, commercial products aren't that easy to figure out, but that can be done in many cases as well.) Everything that depends on the "cloud" has failures, comm losses, and security concerns. Local is definitely the way to go.

    Otherwise, everything you say ends up sent to Google, Amazon, Apple or whoever. And whoever they partner with / roll over for / get hacked by.

    I trust Apple a little bit more as they've been pretty clear about being privacy focused, but that door is open for them to do "whatever" with your data, and it is best to keep that in mind. If they go local, that'd be nice. But inasmuch as it's a closed system, whereas MyCroft is an open system... yup, still going MyCroft if they can pull this off.

    --
    I've fallen off your lawn, and I can't get up.
    1. Re:My GPS has done this for many years by Dantoo · · Score: 1

      Thanks for that. MyCroft looks good. Shipping is a disaster though so I'll have to wait until that gets sorted. It shouldn't cost more to post than to buy.

  17. Re:"Everything Old Is New Again" by gnasher719 · · Score: 1

    Disclaimer: I didn't read the patent.

    I didn't read it either, but it took quite a while to scroll through it, so I think it's just a little bit more than "process voice locally without using the cloud".

  18. Loss of data? by aberglas · · Score: 1

    I've always assumed that the real reason to send the sound for external processing is so that it can be stored and analyzed.

    Smart phones have been powerful for a long time.

    Maybe now they can just process the voice locally and then send the data to the collection center.

    1. Re:Loss of data? by thoughtlover · · Score: 1

      I've always assumed that the real reason to send the sound for external processing is so that it can be stored and analyzed.

      Yup, as most people don't remember, Siri was created by an app developer and Apple bought it before the launch of the 4s... My friend had it, loved it, and it didn't need a data connection to do a lot of stuff. Once the 4s came out, it was deleted from his 4. I don't recall if he got a refund.

      Apple just wanted the voice data for pure analysis.

      You can still control your phone with voice options in accessibility settings to avoid sending your data to Apple via Siri.

      On an unrelated note, I think the developers said Siri is an anagram for iSir.

      --
      No sig for you! Come back one year!
  19. iOS already processes voice locally on the device by Ronin441 · · Score: 2

    iOS already processes voice locally on the device. Cloud is only required for the Siri stuff. As proof, set an iOS device into flight mode, and open anything with an on-screen keyboard: edit a note, draft an email, etc. Tap the microphone icon, and talk. You'll see your speech transcribed with no resort to the cloud. (Misleading Headline is Misleading -- Film at 11.)

  20. Re:iOS already processes voice locally on the devi by Proudrooster · · Score: 1

    To your point, in notepad, the microphone works for text to speech, but voice commands won't play a local song on the phone or make a phone call.

    Try saying:
    Open iTunes
    Play Playlist Chillout

    After waiting 60 seconds:
    "I'm having some trouble with the connection. Please try again in a moment."

    So yes, it has speech to text, similar to the Commodore 64, but no commands work, even if the command doesn't need the Internet for anything such as playing a song loaded on the phone or opening the camera.

  21. Re:iOS already processes voice locally on the devi by Ronin441 · · Score: 1

    Sure -- cloud is required for the Siri stuff. And it makes sense that Apple would want to move Siri smarts for stuff that doesn't essentially require an internet connection (playing local music, etc) onto the device. My point is just that the Slashdot headline is misleading, conflating voice-to-text with Siri.

    (And the C64 had (as you say) text to speech, but most definitely did not have speech to text, which is of course orders of magnitude harder.)

  22. Re:"Everything Old Is New Again" by Rob+Y. · · Score: 1

    Doesn't Google's voice-typing essentially do this? It makes a quick guess at what you're saying and then refines it based on a more powerful cloud based scan - that's still trained using samples of your voice. It sure seems that way, since you get an immediate result on the screen that changes seconds later to a more accurate result. So all they're doing is using the local guess directly when the network is not available. Patentable? Really?

    --
    Posted from my Android phone. Oh, I can change this? There, that's better...
  23. Re:A regular hot key would do it better. by Pieroxy · · Score: 1

    But you can say "Siri call %USERNAME%" no matter what the username as long as it's in your address book. So by your explanation you should add all your contacts to your home screen ?

  24. Re:local-only siri won't happen by Pieroxy · · Score: 1

    You're confusing Apple and the rest of the Gafas. Apple doesn't care about your data, they don't display ads and don't sell them. Google, Facebook, Amazon and even Microsoft does that, but not Apple. And guess what? That's why their stuff is more expensive.

  25. Re:What's the point? by ennis99 · · Score: 1

    Apple users have been waiting for offline mode for ages. it will be really useful since we do not have all the time access to the Internet and cellular data. I can not wait for it to be accessible. https://showbox.software/ https://tutuapp.win/ https://mobdro.onl/