Slashdot Mirror


Ask Slashdot: Who's Building The Open Source Version of Siri? (upon2020.com)

We're moving to a world of voice interactions processed by AI. Now Long-time Slashdot reader jernst asks, "Will we ever be able to do that without going through somebody's proprietary silo like Amazon's or Apple's?" A decade ago, we in the free and open-source community could build our own versions of pretty much any proprietary software system out there, and we did... But is this still true...? Where are the free and/or open-source versions of Siri, Alexa and so forth?

The trouble, of course, is not so much the code, but in the training. The best speech recognition code isn't going to be competitive unless it has been trained with about as many millions of hours of example speech as the closed engines from Apple, Google and so forth have been. How can we do that? The same problem exists with AI. There's plenty of open-source AI code, but how good is it unless it gets training and retraining with gigantic data sets?

And even with that data, Siri gets trained with a massive farm of GPUs running 24/7 -- but how can the open source community replicate that? "Who has a plan, and where can I sign up to it?" asks jernst. So leave your best answers in the comments. Who's building the open source version of Siri?

186 comments

  1. Ask Slashdot by Anonymous Coward · · Score: 0, Flamebait

    Who gives a fuck?

    1. Re: Ask Slashdot by Anonymous Coward · · Score: 0

      Google, Apple to name a few. Especially If voice becomes as popular as keyboards or mice in the future.

    2. Re: Ask Slashdot by layabout · · Score: 1

      ARRRRGGGGGHHHHH!! = voice

      why can't you tell the difference between speech and voice = speech

    3. Re:Ask Slashdot by nazsco · · Score: 0

      Despite the lowly vocabulary, the parent is 100% correct.

      The best indicator ever that something is just useless and completely ignored were it not as a toy, is that there is no open source alternative being worked on.

      I mean, some toy projects even get the open source alternative going. When it doesn't have, it is a very clear indicator that nobody cares outside of the little world that is san-francisco bay area.

    4. Re: Ask Slashdot by ArmoredDragon · · Score: 4, Insightful

      I personally don't like to use voice assistants, mainly for privacy reasons. And no, I don't particularly care if Google or Apple knows what I'm searching for. Rather, I don't particularly like when people I'm sitting near know what my immediate intentions are. I'm not doing anything nefarious, I just like keeping my personal business personal, which is much easier with Swype.

    5. Re:Ask Slashdot by DuckDodgers · · Score: 3, Insightful

      Nonsense. There would be enormous use of fully open source alternatives to Google search, Gmail, Call of Duty, Starcraft 2, Destiny, and dozens of other similar projects. The best, to my knowledge, fully open source search engine is Yacy and it totally sucks. Running your own email server isn't too hard, but getting your mail to recipients on Gmail, Hotmail, or Yahoo mail without relaying through one of the big services is all but impossible. There are plenty of nice graphical fully open source video games out there, but nothing with the artwork or the voice acting or the visuals on par with a top of the line AAA game.

      Nobody is making them because it's too damn difficult.

    6. Re: Ask Slashdot by Anonymous Coward · · Score: 0

      Google Search is not done due to it requires millions of servers in order to handle the same load so the software part is of no interest since no one else have access to that amount of hardware.

      Mailservers exists in the dosens so that is quite covered. And for games the main resource is graphics and level design, not source code.

    7. Re: Ask Slashdot by nospam007 · · Score: 1

      "ARRRRGGGGGHHHHH!! = voice

      why can't you tell the difference between speech and voice = speech"

      Is this a speach?

    8. Re: Ask Slashdot by nospam007 · · Score: 1

      "I personally don't like to use voice assistants, mainly for privacy reasons. "

      Well I can tell you that just saying in the dark: "Hey Siri, wake me up at 7" beats getting up. going to the bathroom to get my glasses, putting some light on, ruining my Melatonin production and setting the clock manually, any time, any day.

    9. Re: Ask Slashdot by LinuxIsGarbage · · Score: 3, Funny

      Moving the switch on the clock radio two clicks to the right works as well.

      Or setting the alarm on your phone earlier in the day when you had your glasses on and weren't concerned about Melatonin.

    10. Re:Ask Slashdot by jellomizer · · Score: 1

      There is enough buzz with Apple, Google, Amazon and Microsoft to make a point that it is popular technology and shouldn't be dismissed because normal techies (myself included) find voice command technology lacking. However I see a lot of melenials using it as well in cars or other hands free devices. In some way voice commands are bringing back the command line interface.

      The problem with open source versions is having a cloud system or massive computing willing to put up with the demand without a solid business model to pay for it. The reason for Siri success isn't the speech to text interface nearly any system can do that. But being able to interpret what you say and then come up with results is what we need the addition Horse power that we can get from a larger remote system.

      --
      If something is so important that you feel the need to post it on the internet... It probably isn't that important.
    11. Re: Ask Slashdot by ramorim · · Score: 2

      Because he was not trained enough with a massive farm of GPUs running 24/7? :D

    12. Re: Ask Slashdot by fluffernutter · · Score: 1

      I have an app with this special feature where you can repeat the alarm on weekdays WITHOUT EVER GOING INTO THE APP! fluffernutter 1 internet 0

      --
      Laws are rules for the court, but merely a bottom bar to hit for life. Think beyond laws in your actions always.
    13. Re:Ask Slashdot by Anonymous Coward · · Score: 0

      how is getting your mail to recipients on gmail hotmail or yahoo without relaying all but impossible o_O

      I literally use the first howtoforge tutorial i find every time i have to (like 5x so far) and it always... works. literally the first test email i send out was to my gmail. i dont relay (just checked just to be sure)
      straight from my server, to gmail.

    14. Re:Ask Slashdot by dj245 · · Score: 1

      Nonsense. There would be enormous use of fully open source alternatives to Google search, Gmail, Call of Duty, Starcraft 2, Destiny, and dozens of other similar projects. The best, to my knowledge, fully open source search engine is Yacy and it totally sucks. Running your own email server isn't too hard, but getting your mail to recipients on Gmail, Hotmail, or Yahoo mail without relaying through one of the big services is all but impossible. There are plenty of nice graphical fully open source video games out there, but nothing with the artwork or the voice acting or the visuals on par with a top of the line AAA game. Nobody is making them because it's too damn difficult.

      To be good, these things need experts in the field following good business and engineering practices. It's difficult, costly, and takes a lot of time, so only large international companies have much of a chance at being competitive. The forces of capitalism or government development seems to be the best ways currently to solve these kinds of massive undertakings. The internet has shown a lot of promise in allowing loosely-connected entities to collaborate, but once a project starts to look like it has value, somebody always takes the bone (or their part of the bone) and tries to personally benefit. Everybody needs to put food on the table at the end of the day.

      --
      Even those who arrange and design shrubberies are under considerable economic stress at this period in history.
    15. Re: Ask Slashdot by Anonymous Coward · · Score: 0

      Hmmm....
      Not used to commenting here...
      So I'll just post here.
      There is an open alternative...
      Which has been worked on and in constant use for years now...
      Don't have machines testing and teaching it 24/7...
      Instead its been people working on it..

      Its Saera.
      For the Nokia n900 ...n9 too I think ...(not that I ever bothered with the n9 to know...)
      Works good for our community...
      Works good for me.
      I would imagine it can...or has been adapted for other devices and Linux systems ...

    16. Re:Ask Slashdot by Anonymous Coward · · Score: 0

      It really isn't that difficult to get the big providers to accept your email. You just need to be on a business ISP and static IP, the rest is just config.
      If you're not prepared to pay for a business class internet connection, you really shouldn't be running your own email.

    17. Re: Ask Slashdot by baker_tony · · Score: 1

      Night I got my iPhone 7 while in bed with light off, the wife pondered to me if it was gonna rain tomorrow. "Hey Siri, is it going to rain tomorrow?". Worked a treat.
      Was pretty geek-cool. Had to laugh.

    18. Re:Ask Slashdot by DuckDodgers · · Score: 1

      Yup can't be done. That would be like having a bunch of volunteers writing a really good C compiler, or a kernel. It'll never happen, right?

    19. Re: Ask Slashdot by DuckDodgers · · Score: 1

      "Google Search is not done due to it requires millions of servers in order to handle the same load so the software part is of no interest since no one else have access to that amount of hardware."

      No, there is interest and people are trying - Yacy is the example. The data and the service are distributed and decentralized, so it's technologically difficult to pinpoint the searches of individual users. And since there are hundreds of nodes, or at least there were, there are plenty of resources available. But even though, if I recall correctly, Yacy uses parts of the Lucene and Solr projects its search features are awful. I did a search for the cartoon Spongebob with the simple keyword "Spongebob" and got neither the official website nor the Wikipedia page in the first 50 responses.

      So people care, and people are willing to contribute. But this is an incredibly difficult thing to tackle.

    20. Re:Ask Slashdot by DuckDodgers · · Score: 1

      This is what I get for reading about hosting email from public sites. I kept reading that it was all but impossible, and that messages are automatically dropped. I'll give it a try. Thanks for correcting me.

    21. Re:Ask Slashdot by DuckDodgers · · Score: 1

      Sorry, I meant to joke with the previous statement but upon reviewing it I came across as too snarky. I apologize, I meant to tease you and not insult you.

      I think you're right. While there are a number of incredibly sophisticated free software projects - Gnu C Compiler, Linux kernel, GNOME, KDE, VLC, Postgres, etc... a lot of cutting edge stuff is proprietary and the free software equivalent comes out later and is often - though not always - inferior. Speech-to-text is one of those areas, as is Virtual Reality, and 3D Gaming.

    22. Re:Ask Slashdot by DuckDodgers · · Score: 1

      Well, if it were technically feasible to let anyone run email from any IP address that would be more open and free. I should not need to spend an extra fee at a service provider to do it. I can afford it, but it shouldn't be this expensive for the average person to escape Gmail/Hotmail/Yahoo Mail. But as a practical matter I respect your point, most email coming from residential IPs is spam.

  2. iKnow by Anonymous Coward · · Score: 0

    Have the government operate a open cloud voice platform in a public datacenter db that anyone can utilize. We'll call it the voice of God and tell people to route their prayers to the NSA because they watch over all we do.

  3. I thought you were by Anonymous Coward · · Score: 0

    No? Not to worry. Someone will. No?

  4. Who wants one? by Anonymous Coward · · Score: 0, Insightful

    Does anyone actually know a programmer who want such a thing? As a developer who has never used this feature on his phone, I'm not very inspired to contribute to such a project. I'd be much more likely to work on projects that help improve security and isolation and specially break such services.

    1. Re: Who wants one? by Anonymous Coward · · Score: 0, Insightful

      Do you drive? Because I do, and it's handy as hell. Read/write messages without removing your hands from the steering wheel, or make calls or prettt much anything you want.

      there are many situations in which is very handy to use a phone/computer with your voice, and some have saved lives.

    2. Re:Who wants one? by vux984 · · Score: 1

      Does anyone actually know a programmer who want such a thing?

      Me. I want FOSS voice commands for my phone.

      I'd be much more likely to work on projects that help improve security and isolation.

      That's precisely why I want such a thing. I don't use Siri etc precisely because the idea of an open mic to Google/Apple etc creeps me right out and represents a huge security hole The only software I'd begin to trust is an FOSS that I could really know what it was doing, and what it was sending.

      I mostly just want it for car/bike navigation/ in can sending SMS (e.g. to an incoming phone call or text -- "reply with canned 'im driving' message"; or "tell X 'ill be there in Y minutes" and a few other commands. I'd want it to do all voice processing on the phone, and only send out specific types of requests to the internet. (e.g. if I request it to map a street address... it can send the address out.

    3. Re:Who wants one? by bytesex · · Score: 1

      There are in the tens of CPU like microprocessors on your phone, only one of which runs Android or iOS. You may be feeding into a mike that goes directly through to Cupertino, but you may be able to turn that off. You're also feeding into a mike that may go directly to Samsung, or their Chinese supplier. And you won't be able to turn that off.

      --
      Religion is what happens when nature strikes and groupthink goes wrong.
    4. Re:Who wants one? by ShanghaiBill · · Score: 1

      Does anyone actually know a programmer who want such a thing?

      I do. I have an Amazon Echo in my kitchen, and I use the voice recognition all the time. If I am cooking dinner, and I notice we are low on milk, I can say "Alexa, add milk to the shopping list" without washing my hands or touching my phone or laptop. I can use voice to turn lights on or off, request specific music, ask for news or information on specific topics, etc. The voice recognition works well, and it is a useful service.

    5. Re:Who wants one? by Anonymous Coward · · Score: 0

      The difference is Google monetizes the user every which way. Apple doesn't.

      In the process of ditching Google for email for that reason and migrating to iCloud.

    6. Re: Who wants one? by Anonymous Coward · · Score: 0

      So collecting data is fine. Making money on it is not. Impressive logic.

    7. Re: Who wants one? by Anonymous Coward · · Score: 0

      I'm not sure that's true, they provide the platform to monetise the user in every which way and others provide the content and then Apple takes a cut.

      I'm a bit of an apple fanboy but even I can see how they create the system to monetise everything that their users do.

    8. Re:Who wants one? by DuckDodgers · · Score: 1

      If I trusted the service completely, I would use it all of the time. "What's the weather tomorrow?" "Play Houses of the Holy." "Send a text to my wife asking her to get milk on the way home from work." "Find the cheapest price plus shipping of a DVD of Idiocracy on Ebay." etc... etc...

      I never use Google Now/Siri/Cortana because I don't trust their respective owners with even more data about me than they already have.

    9. Re:Who wants one? by nospam007 · · Score: 1

      "I mostly just want it for car/bike navigation/ in can sending SMS (e.g. to an incoming phone call or text -- "reply with canned 'im driving' message"; or "tell X 'ill be there in Y minutes" and a few other commands. I'd want it to do all voice processing on the phone, and only send out specific types of requests to the internet. (e.g. if I request it to map a street address... it can send the address out."

      That's exactly what Siri does. RTFM

    10. Re: Who wants one? by Anonymous Coward · · Score: 0

      I phrased that wrong. Apple makes money no doubt by analyzing what people do on its services (and via shitty deals with other companies - name a corporate that doesnt?) , but unlike "Do no evil" Google they don't sell it to 3rd parties.

    11. Re:Who wants one? by Anonymous Coward · · Score: 0

      That's precisely why I want such a thing. I don't use Siri etc precisely because the idea of an open mic to Google/Apple etc creeps me right out and represents a huge security hole

      And how are you getting around that? Ultimately you're using closed hardware, probably closed software, connected to the public net via your phone company that is also tracking your location and can listen to your calls and intercept your data.

      The problem with all of these initiatives is that they are just security theater, they always end up relying on closed hardware, closed software and/or 3rd party networks that are monitored. The same thing goes for Windows 10, people often say you shouldn't use it because of privacy but what about the hardware you're running on, the networks you're connecting to ... the various NSA programs had no problems collecting data before Windows 10 so even if you suppose that the information is not anonymized and that it is being sent so some government agencies (why would any government outside the US be using it if they believed this to be the case?) and don't use it on that basis you're still just as subject to all the other government spying as before. I'm not saying the idea should be dismissed on that basis but with all the bashing of the TSA's ridiculous security theater you'd think people here would be more aware of the implications of just saying use Linux/Windows 7/macOS on your closed hardware with closed firmware connected to a 3rd party network ... oh and of course your closed hardware smartphone that is tracking you all the time.

    12. Re: Who wants one? by exomondo · · Score: 1

      Does Google actually sell that data to 3rd parties? I can see how they monetize it by selling the opportunity for advertisers to target a specific demographic but that is a very different thing to actually selling the data that "Joe Bloggs searched XYZ at this time and sent this email to Jen Bloggs at this time".

    13. Re: Who wants one? by Anonymous Coward · · Score: 0

      That complete argument is a non starter. Your systems are already insecure so you should just go ahead and stack a couple more insecure systems on for the hell of it.

    14. Re: Who wants one? by old_kennyp · · Score: 2

      Do you drive? Because I do, and it's handy as hell. Read/write messages without removing your hands from the steering wheel, or make calls or prettt much anything you want.

      there are many situations in which is very handy to use a phone/computer with your voice, and some have saved lives.

      Well then you are an idiot!

      When I am driving, I concentrate on Driving, not " Read/write messages" or "make calls or prettt much anything you want"
      Driving a vehicle requires 100% attention to the task, and anyone who thinks differently is a fool!

      Turn the phone off when driving please before you kill someone! ( I do not care if you kill yourself though)

    15. Re: Who wants one? by uvajed_ekil · · Score: 1

      Do you drive? Because I do, and it's handy as hell. Read/write messages without removing your hands from the steering wheel, or make calls or prettt much anything you want.

      there are many situations in which is very handy to use a phone/computer with your voice, and some have saved lives.

      I drive, a lot. And I think I'm pretty decent at it. Part of my success as a driver has been recognizing how much distractions can negatively impact my driving, and by distractions I mean even just conversing with passengers or especially talking on the phone hands-free. I honestly don't want more excuses to use technology on the road, whether hands-free or voice-activated. Surely using Siri is safer than texting, but I feel that technology that requires less attention can lull people into a false sense of security, resulting in them fiddling with BT phones, changing music, using Siri, etc. way more than they should.

      --
      This is a hacked account, for which the owner can not be held responsible.
    16. Re: Who wants one? by uvajed_ekil · · Score: 1

      Well done, old_kennyp. Your attention to the multi-ton vehicle you operate means that of the eight times that people hit my cars in the past twenty years, it was you precisely zero times! On the other hand, one accident was caused by a young lady on a cell phone (who hit me head on - after I stopped at a stop sign!), one was an old lady fussing with her purse, one was a young man distracted by a passenger's cell phone, and one was a young lady who was eating and dropped something on her pants. I know from experience how serious even slight distractions can be while driving.

      --
      This is a hacked account, for which the owner can not be held responsible.
    17. Re:Who wants one? by vux984 · · Score: 1

      That's exactly what Siri does. RTFM

      Link? Because everywhere i search i find references to the iphone basically recording your command, and sending the audio clip to be processed by the cloud.

      It sounds like maybe in some cases it'll figure out an answer even without the cloud. But odds are it sent it to the cloud while it was processing it locally, just in case.

      And while I'm find with the agent on my phone sending specific requests to the cloud, the raw audio of every command I give it... nope.

      So if you have a link that counters this then enlighten us with a reference, rather than just an 'RTFM'.

    18. Re: Who wants one? by Anonymous Coward · · Score: 0

      Do you also want them to get off your lawn?

    19. Re: Who wants one? by Anonymous Coward · · Score: 0

      Not sure if you were agreeing or being facetious, but you highlight exactly my point. Drive a vehicle requires 100% attention and smartphones are only 1 item in the long list of things that can take that attentiveness away.
      It only takes a few seconds and someone can seriously injured or killed

    20. Re: Who wants one? by Anonymous Coward · · Score: 0

      I hope you don't talk to anybody when you drive (passengers), so you are truly consistent with your argument. Because, in case you didn't know, that is what reading/writing messages involve with Siri (et al.). `Hey Siri, send xxx a message saying yyy'. Done. 'Hey Siri, drive me to zzzz'. Done.

      Having a chat with someone else? Then you're a killer... isn't it?

    21. Re: Who wants one? by Anonymous Coward · · Score: 0

      What an absolutely TERRIBLE thought. And also why I have absolutely no use for such sets of programming.

    22. Re: Who wants one? by Anonymous Coward · · Score: 0

      That complete argument is a non starter. Your systems are already insecure so you should just go ahead and stack a couple more insecure systems on for the hell of it.

      Did you not read? I explicitly addressed that:

      I'm not saying the idea should be dismissed on that basis

      I am pointing out the obvious security theater that people here seem intent on remaining ignorant of all while bashing any other institution that does the same thing.

    23. Re: Who wants one? by Anonymous Coward · · Score: 0

      That complete argument is a non starter.

      What argument?

      Your systems are already insecure so you should just go ahead and stack a couple more insecure systems on for the hell of it.

      Yeah I don't see where that argument was made at all. Maybe you need to read it again.

  5. Hopefully nobody by Anonymous Coward · · Score: 0

    Siri sucks ass. The correct question is who is building the open source copy of Google'search voice recognition?

  6. SaaS is the end of Open Source by Anonymous Coward · · Score: 1

    There is no code to copy and build upon. Without copies, there is no copyright law to enforce openness. Consider a world without the GPL. Now consider it without piracy as well. Welcome to SaaS.

    1. Re:SaaS is the end of Open Source by NotInHere · · Score: 1

      There is AGPL. It was made with SaaS in mind.

    2. Re:SaaS is the end of Open Source by exomondo · · Score: 1

      There is AGPL. It was made with SaaS in mind.

      But when you have dependencies on non-free services, a reliance on machine learning that requires big data or any kind of infrastructure that you can't easily replicate it means you aren't going to be doing the computing on your own computer. AGPL is good but often in the context of SaaS it isn't practical.

  7. Sirius by Anonymous Coward · · Score: 5, Informative

    Sirius (Ubuntu only I believe):
    http://sirius.clarity-lab.org/sirius/

    1. Re:Sirius by markdavis · · Score: 3, Interesting

      >"Sirius (Ubuntu only I believe): http://sirius.clarity-lab.org/..."

      Thankfully it doesn't appear to be related to or require Ubuntu at all.

    2. Re:Sirius by Anonymous Coward · · Score: 0

      .....Now quit calling me Shirley!

    3. Re:Sirius by Anonymous Coward · · Score: 0

      Are they working on cybernetics? I've heard some interesting things about Sirius Cybernetics.

  8. Please speak the question louder by Anonymous Coward · · Score: 0

    I can barely hear it over the moaning from the other room in the background. Oh who am I kidding... I'm only listening to the moaning in the other room.

  9. The size of the farm shouldn't matter.... by Anonymous Coward · · Score: 4, Insightful

    When you talk about the 'massive farm of GPUs' running 24/7 you ignore the fact that, because it is proprietary they are missing out on the potential compute resources out there.
    How many people have run SETI@home, or gene folding efforts. We just need someone insightful and ingenious to find a way to deal with machine learning in an 'offline' way, and be able to present the user interface in a quick fashion.

    It would have to start out very dumb, but with some great key algorithms I expect an open source option could move a lot faster than anything out there in this regard.

    1. Re:The size of the farm shouldn't matter.... by DuckDodgers · · Score: 2

      One of the fascinating things being tried right now is building decentralized applications on top of the cryptocurrency platforms. Ethereum, MaidSafe/Safecoin, Lisk, and others. The data is distributed in a distributed hash table (DHT). Users contribute CPU cycles, RAM usage, and disk space to the network in return for tiny portions of the respective digital currency. They're trying to build distributed autonomous organizations (DAOs), distributed alternatives to Twitter and Facebook, distributed alternatives to Dropbox and Google Drive, etc... etc... something like Siri could run on the same network.

      If it ever works right. I'm hopeful but not willing to predict anything.

    2. Re:The size of the farm shouldn't matter.... by Etcetera · · Score: 2

      We just need someone insightful and ingenious to find a way to deal with machine learning in an 'offline' way, and be able to present the user interface in a quick fashion.

      It would have to start out very dumb, but with some great key algorithms I expect an open source option could move a lot faster than anything out there in this regard.

      Precisely. I don't get what the misunderstanding is here among the Slashdot crowd.

      Natural Language Processing is neat tech. Mechanics of speech recognition is neat tech. Integration of the two via a dispatch engine and scriptlets to go off an search Google, run a command, or whatever else one can script, is neat tech.

      I'd use this ALL THE TIME if the data didn't leave my network, and I'm sure I'm not alone.

      We can't duplicate a zillion far off machines running a Google-scale cluster, but it's hard to see why we need to in OSS land. I have a spare 32 core box and God-knows-how-many GPUs sitting here. Where's the project that can let me get up and running on my own, and that we can all use to iterate over as public algorithms (inevitably) improve and storage/memory/processing costs (inevitably) decrease.

      Frankly, it's difficult to see why that type of infrastructure is really needed in the first place. NLP is hard, but it's not like these building blocks aren't already there. Apple's dictation software (PlainTalk) was running on System 7.1 Pro 20 years ago, using local hardware 100's of times slower than what I have in my pocket. Basic NLP code was running on the Newton, which was 1000x slower and still managed to handle the basics on top of the handwriting recognition. "Speakable Items" let me run user-writable AppleScripts to automate tasks and was just missing dictatable variable names.

      None of this required cloud-level processing, especially not voice recognition, which even Apple lets you do locally w/o using Siri.

    3. Re:The size of the farm shouldn't matter.... by Anonymous Coward · · Score: 1

      Cortana on my old Windows Phone was very dumb. Apparently by watching everything I do with the phone, it's become smarter - searches that would easily have been better done with DuckDuckGo in Firefox now are credibly responded to by Cortana. For some reason, it hasn't helped Bing, only Cortana. So, as the article sez, where is the open source equivalent for Linux, and the back end processing to make it learn.

      That doesn't mean that I willingly let it be part of me in Win10. I have manipulated many settings and blocked many IPs to minimize Cortana's communication with MS - I don't need that amount of chattiness in my real computer (which still isn't my phone; I'm a curmudgeon). Still, if it were available in Linux, and the back end were trustworthy (the real failing of MS, Google, and the like), it would be very useful.

    4. Re:The size of the farm shouldn't matter.... by AmiMoJo · · Score: 4, Interesting

      It's harder than you think. Those older systems sucked, and couldn't handle natural language queries. The issue is not processing power, it's having a large enough volume of training material and mimicking how the brain fills in gaps.

      Training material isn't just a case of gathering samples. When the machine makes a mistake, it needs to understand why. The collection needs careful curation and sorting to be useful. Such databases are extremely valuable, and historically with OS projects they often started with a donation from a commercial body rather than from scratch.

      Mimicking the brain is also extremely hard. Often people don't hear things very clearly or in full, due to environmental noise, poor pronunciation and the like. To compensate the brain fills in the gaps or makes assumptions. People have been trying to program those assumptions into computers since the 1980s. Again, a database of that knowledge will be vast and valuable. Either you throw massive human resources at building it, or you crawl the web and look at trillions of search queries like Google does.

      That's also why they need a cloud service to do this. The database is vast and proprietary, and querying it far from a trivial SQL command.

      It's not just a programming or AI training problem, which is why no-one is doing it. The closest thing the OS world has is probably Open Street Map, but creating that data set was far less laborious and uninteresting than training a computer to have some common sense will be.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    5. Re:The size of the farm shouldn't matter.... by SuiteSisterMary · · Score: 1

      Apple's dictation software (PlainTalk) was running on System 7.1 Pro 20 years ago, using local hardware 100's of times slower than what I have in my pocket. Basic NLP code was running on the Newton, which was 1000x slower and still managed to handle the basics on top of the handwriting recognition. "Speakable Items" let me run user-writable AppleScripts to automate tasks and was just missing dictatable variable names.

      I helped Apple wreck a nice beach.

      --
      Vintage computer games and RPG books available. Email me if you're interested.
    6. Re:The size of the farm shouldn't matter.... by Anonymous Coward · · Score: 0

      Make it a videogame. Though now we are vs DRM. Pity, you do not have the millions to get the training set, but you are still competing for free, eh?

    7. Re:The size of the farm shouldn't matter.... by StikyPad · · Score: 1

      It also ignores the fact that people mishear and misunderstand each other. All. The. Time. Those gaps we fill in? Often erroneous. People actually expect more from computers than from other people: nearly perfect listening AND comprehension.

  10. Wasn't Android supposed to be Open Source? by destinyland · · Score: 1

    Honestly, the only way that I see this happening is if Google decides to make their AI interface open source. Which they might do as a public service -- but we're still playing in Google's sandbox.

    Unless there's some way to get geeks to contribute their unused CPU cycles, like what SETI was doing...

    1. Re:Wasn't Android supposed to be Open Source? by bheerssen · · Score: 2

      Search for aliens -- OoooOOOooooh!
      Sex robot -- Giggity!
      Create a digital assistant -- Meh.

      Siri and Google Now aren't sexy. Maybe what's needed is a chatty digital alien sexbot that happens to double as an assistant. Slide in the useful features on the sly, like hiding dog medicine in a piece of cheese.

      --
      (Score: -1, Stupid)
    2. Re:Wasn't Android supposed to be Open Source? by Fnord666 · · Score: 1

      Honestly, the only way that I see this happening is if Google decides to make their AI interface open source. Which they might do as a public service -- but we're still playing in Google's sandbox.

      You mean like this?

      --
      'The tyrant will always find pretext for his tyranny.' - Aesop's Fables
  11. Possible by Bruce+Perens · · Score: 3, Interesting

    First, I'm sure there's lots of Open Source being used in Google's implementation - just not where we can see.

    There is a speech recognizer from CMU that might be a good starting point. I haven't heard about plain-language software, though. There is additional rocket science to be done. Not insurmountable given things we've already done.

    Training with millions of people? Actually, that's the part that community development is good at.

    1. Re:Possible by Anonymous Coward · · Score: 1

      I think development would accelerate tremendously if the FCC would do away with ISPs forbidding home server operation/hosting. I understand there is no direct critical relation, but I see a logical accelerated path with e.g. home server appliances, and natural ability of the FOSS community to add obvious voice control interfaces to them as advancements. Sure, maybe the same arguments could apply to non-internet-connected home PC(probably going to have some services) appliances. But I really think there is something magically fun about being in complete control of a server you own sitting in front of you at home. I really think that magic is what is missing from the equation. $0.02... -dmc

    2. Re:Possible by theskipper · · Score: 1

      Agreed. But please keep port 25 closed to those home servers. My spam folder is bursting at the seams right now from all the broken Windows boxes in the world now. We don't need another vector.

    3. Re:Possible by KingMotley · · Score: 1

      Actually, most spam doesn't come from broken Windows boxes any more and it hasn't been that way for a long time. Most spam comes from rooted linux servers these days. No spammer wants the old XP box running behind 128kbps DSL, they would much rather target the linux server sitting in a data center connected by multi-gigabit connections.

    4. Re:Possible by Anonymous Coward · · Score: 0

      It's all about botnets of compromised IoT devices. No self-respecting spammer wants a linux server at a co-location with easily identifiable IP ranges.

  12. Mycroft, obviously. by JudeanPeople'sFront · · Score: 4, Informative

    OK, you might not listen to the Linux Action Show or similar podcasts, but come on... google "open source AI" before asking.

    1. Re:Mycroft, obviously. by jwymanm · · Score: 1

      That is funny this was posted - I was working on my mycroft enclosure all night. It has a long way to go still but I like the development community. The STT (speech to text) still uses some cloud providers that are not opensource but they plan on going to their own OSS platform once it is ready.

    2. Re:Mycroft, obviously. by Anonymous Coward · · Score: 0

      OK, you might not listen to the Linux Action Show or similar podcasts, but come on... google "open source AI" before asking.

      OP means an open source equivalent of "Siri" etc. This is not just an open source AI engine, but a well trained AI instance.

    3. Re:Mycroft, obviously. by stx23 · · Score: 1

      Yeah, that's what came to mind. It's maybe more of a Echo/Alexa competitor but in the same space.

  13. World isn't ready for a open source Siri by macraig · · Score: 0

    Do you not realize that Siri must utilize a significant backend resource at the other end of a data connection to be effective, and that the backend requires substantial resources to operate and maintain? Siri is not some standalone app you download and forget. An open source equivalent would not be free, and would require a Kickstart and guaranteed subscriptions to be feasible. I don't think the world is quite ready for such a thing yet, not in a country populated by people willing to vote for either Trump or Clinton.

    1. Re:World isn't ready for a open source Siri by Anonymous Coward · · Score: 0

      The marriage of bitcoin mining, human sexuality and the training of the open source "Siri" will solve the issue. Call it Jenna.

    2. Re:World isn't ready for a open source Siri by Anonymous Coward · · Score: 0

      I'm not sure the lines as clear cut as that. Maybe the limitation is purely resource availability, but most open-source enthusiasts have a machine running somewhere. If you're reliant on a third party running a service for you it undermines exactly the sort of control a big part of the open-source market want.

      For a lot of people a mechanism by which they can interact with key programs with voice is what they want. Does it really take more than a decent home computer to understand, interpret, and respond to queries like "what will the weather be like", "play my songs on spotify" etc.

    3. Re:World isn't ready for a open source Siri by Anonymous Coward · · Score: 0

      That doesn't seem impossible. Programs like SETI@home get large quanities of donated computing power. Given that everyone gets a free vm on a few cloud services I imagine that a lot of people would be willing to click a link that spawns a vm that is ready to go.

      Of course you'd probably want to build more of generic distributed neural network backend rather than having everyone contribute their resources to a single application.

    4. Re:World isn't ready for a open source Siri by macraig · · Score: 1

      It is that hard. Just ask Google how hard it is to get accurate transcription for its Voice service. Most of those transcriptions are so bad that their real value is as humor.

    5. Re: World isn't ready for a open source Siri by sonamchauhan · · Score: 1

      Significant? Lookup cmu sphinx.

    6. Re:World isn't ready for a open source Siri by Anonymous Coward · · Score: 0

      One potential source of speech+text concordance could be from the subtitle feed from the television?

  14. A Question No One Asked by Anonymous Coward · · Score: 1

    The answer is most likely "no one." Those of us in the open source community are looking in a completely different direction from the mainstream; that's the whole point. Features like a voice-activated virtual assistants really only appeal to the lowest common denominator of computer and smart tech users--people who don't understand tech (or want to) and thus just want something incredibly simple that works without much effort. The same people who covet something like Siri are the same who sped 99% of their time interacting with Facebeast or the Tweeters.

    I'd much rather the open source community focus on important things like security, compatibility, efficiency, and workarounds for dick companies like Lenovo locking down their hardware.

    1. Re:A Question No One Asked by DuckDodgers · · Score: 1

      I disagree. I don't use Siri, or Google Now, or Cortana specifically because of the privacy implications. If I had available a completely open source, end-user-owned, private option I would use it all of the time for things like checking the weather, setting reminders, playing music, adjusting the temperature, assembling grocery lists, making purchases, etc... etc...

      As an aside, your contempt for the average person won't help our cause. Elitism helps none of us - the best way to make open source stronger is to get more people using it, because some of those new users will in turn become contributors.

  15. open source software is born of necessity by Anonymous Coward · · Score: 0

    so this shit technology will not be created for free anytime soon. its current existence is only to serve data mining corporations. what is the use case for an individual there?

    1. Re:open source software is born of necessity by Anonymous Coward · · Score: 0

      That's not true, Linux was created to replicate the functionality of UNIX which already existed. OpenOffice was created to replicate the functionality of microsoft office and word perfect whatever its called. Wine was created to play windows games on linux, even though windows already existed.

      There is more opensource projects to create free / open alternatives of existing tech than there is to pioneer new tech.

    2. Re: open source software is born of necessity by Anonymous Coward · · Score: 0

      You mean open source people are not inventors or innovators? Whaaaaaaaat? Like only for profit companies invent things? Huuuuuuuh? Interesting. Maybe this can be applied to other industries. Pharmacy? Yes. Auto? Yes. Energy? Yes. Yes for everything.

      You just pointed out the most avoided fact in open source. Open source doesn't make anything new. Ever. It's easier to copy something to invent it. It's less risk to copy an existing successful widget than create one.

      As soon as open source makes the leap from followers to leaders, we will see something.

      Until then, reverse engineering is always fun.

  16. Jasper Project by Anonymous Coward · · Score: 2, Informative

    It is in development..

    http://jasperproject.github.io/documentation/

    Not affiliated with the project.. saw it sometime ago.. decided to wait till it further matures...

  17. Why redesign the wheel by BringsApples · · Score: 1

    AFAIK Google's isn't open source, but I don't think anyone is paying directly for it. I recently bought a $28US phone that came with the android OS. This came installed on the device (it's built into maps), and it works very very well. So I think the question is worthless to answer.

    Sure, if there's an open source option, then the world can rest assured to be able to tinker with it themselves and that. And yes, Google could pull the plug on it. But for some reason, I feel that Google would just release it to the public before they'd simply toss out all that development. The task of building such a database of info, mixed with the ever-changing roadway of each country... no way anyone else, besides some huge corporate entity, could ever start from scratch. And even if they did, what would be the reasoning behind anyone using it, rather than Google's?

    --
    Politics; n. : A religion whereby man is god.
    1. Re:Why redesign the wheel by DavidRavenMoon · · Score: 4, Insightful

      You pay for Android with allowing Google to data mine your info. This is why they wanted to be on mobile phones in the first place. This is why they offer "free" services like Gmail and photos. Their software reads all your emails. Then they target ads to you. Google Now is another way they can get advertising info from you. That's how Google makes their income, and why they can give Android away for "free" to phone makers. It's not about being open source. It's about advertising revenue.

      --
      -- if it was so, it might be; and if it were so, it would be; but as it isn't, it ain't. That's logic - Lewis Carrol
    2. Re:Why redesign the wheel by Anonymous Coward · · Score: 0

      You seem not to be aware the consequences of having your voice (and background noise) recorded by some third-party.

      Are you talking about maps? Don't you know about OpenStreetMaps?

    3. Re: Why redesign the wheel by Anonymous Coward · · Score: 0

      Whaaaaat? You mean they are doing it for money? Huuuuh? Open source is really about profit? Wow. It's time to look at open source like it is. A marketing trick. It's call the loss leader. Free on the surface but there is a cost. It ain't nothing special or pure or ideal about it. It's the same animal in a different skin. Now you can make better decisions.

    4. Re:Why redesign the wheel by Anonymous Coward · · Score: 0

      where you been? android has generated well over 20 BILLION in *net profits* for google. you may not be *directly* paying for it.. but you are, nonetheless, through ads, ads, and more ads, being driven to google services with more ads, ads, and ads, and having everything you do, everyone you talk to and everywhere you go tracked, mined and archived by google so they can give you even more ads, ads and ads.

    5. Re:Why redesign the wheel by BringsApples · · Score: 1

      I have yet to receive an ad from any app, or email in the last 5 years. And if anyone wants to use my data for profit, good luck, I'm poor on purpose.

      --
      Politics; n. : A religion whereby man is god.
    6. Re:Why redesign the wheel by Anonymous Coward · · Score: 0

      You pay for Android with allowing Google to data mine your info.

      Not if you use Replicant. -dmc

    7. Re:Why redesign the wheel by Anonymous Coward · · Score: 0

      If you think they make most of their money through advertising, then you need to research more about exactly what kind of information they accumulate and how it is used.

    8. Re:Why redesign the wheel by Anonymous Coward · · Score: 0

      They already are using your data for profit. You help them train their AI with your voice, and they extract a fingerprint out of it so they can later identify you. They also ID your face, and your body. They know where you've been and whom you talk to. They know your political views. They know what are the things that makes you happy or angry. They know your favorite porn. They know your medical history. Those things are valuable, not your money.

  18. More importantly, by Hylandr · · Score: 1

    Who gets to teach the AI, and who gets to determine what it's taught?

    Not having these things determined by an entity that can be regulated or at least spanked could be a really bad thing. Then again, maybe Google wants to take over Earth.

    Could that be so bad?

    --
    ~ People that think they are better than anyone else for any reason are the cause of all the strife in the world.
    1. Re:More importantly, by Elfich47 · · Score: 1

      The problem is most speech scientists that develop the speech models are under contract between a relatively small number of companies and universities. These are the people that actually understand how the sounds that come out of our mouths get turned into text. They all have PhDs in the field and are paid competitively. The available pool of these people: 1. wanting to work on a voice model in their free time and 2. not being under a noncompete is very very small.

      I expect that an open source voice model will eventually be developed, years from now. By then the commercial versions will have gotten that much further ahead of the open source version.

      --
      Architectural plans are like computer source code with a couple of differences: You only compile once.
  19. gee by Anonymous Coward · · Score: 0

    We barely have time to work on *useful* things. Screw that cyberpunk bullshit.

  20. Re:Jasper Project / Correct link by Anonymous Coward · · Score: 1

    Correct link

    http://jasperproject.github.io/

  21. I don't know the answer to this... by Anonymous Coward · · Score: 0

    but have you tried asking siri?

  22. Links, for the interested by autonomouse · · Score: 3, Informative

    This was the kickstarter: https://www.kickstarter.com/pr... Their main community website is: https://community.mycroft.ai/ They also have a slack here: https://mycroftai.slack.com/me...

  23. Really? by Anonymous Coward · · Score: 0

    No one has mentioned Mycroft yet?

  24. Might not be doable open source by Theovon · · Score: 3, Insightful

    There are a few application areas that are specialized and difficult enough that it they may not be doable within the Free Software paradigm. Richard Stallman himself, for instance, was not able to explain to me how you could get the right specialized engineers together to develop a free equivalent to Synopsys design compiler. Enthusiasts in this area don’t tend to be interested in writing software as a hobby, so you’d have to hire engineers, which means you have to pay for all the development.

    With automatic speech recognition, it’s not just an AI problem. You need massive labeled datasets that cost money to acquire, and the experts who really know this stuff are moving to on to their next research project. So how are you going to get engineers to learn and implement the esoteric techniques used here? You’d have to pay them. Most people who would be interested in writing free software to do this just don’t know the subject area well enough.

    1. Re:Might not be doable open source by Theovon · · Score: 1

      This isn’t about the technical distinction between Libre and Gratis. This is about the perceptions of companies and software developers with the skills necessary to develop these things and the willingness of such people to develop open source software. Your nit-pick doesn’t change the fact that (a) some developers don’t have interest or skill in certain topics, and (b) if a company invests millions into developing software, they’re not going to share source code, regardless of sticker price.

    2. Re:Might not be doable open source by AmiMoJo · · Score: 1

      I imagine Stallman would point out that you could in fact pay engineers to work on it, but still release it under the GPL. Like Google does with a lot of its software, for example. Such specialist software is likely to have significant support requirements, which can be charged for, or users of the software could simply pay people to add the new features they want.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    3. Re:Might not be doable open source by xanclic · · Score: 2

      if a company invests millions into developing software, they’re not going to share source code, regardless of sticker price.

      Except this is exactly what Red Hat does. Apart from paying engineers to work on existing free software, Red Hat also open sources every product of every single company they acquire, because it's the company's core principle only to develop open source software.

  25. Open source has compute farms too by z3alot · · Score: 2, Informative

    Ones that even beat the proprietary competitors too, see http://tests.stockfishchess.or.... This is not to mention efforts like folding@home and similar. Of course there is still the problem of having large training data sets.

  26. It will be worse than the wiki methods by Anonymous Coward · · Score: 0

    Look at the in ability of wikis to gain reliable information without edit warriors and revert bots. Do you really want people like that "programming" your artificial intelligence. Like it or not proprietary is more reliable, generates billions in revenue creating jobs unlike the mostly unemployed wiki volunteers. Open source siri is the biggest joke since "open source" money like bitcoin.

  27. Please by twistedcubic · · Score: 1

    SETI@home is old as hell, so the idea of "open source" render farms is at least as old. Those "massive farms of GPUs running 24/7" don't scare me at all. In fact, both Siri and Google's voice recognition kinda suck. When they try to control us with this, or it is revealed that they send all their data directly to the government, I suppose we will have an incentive to act. Otherwise, wake me up when they do something interesting and new.

  28. When a loser open sources its solution by Anonymous Coward · · Score: 0

    While it's difficult to built up a single all-inclusive solution, some individual parts are not that hard:
    - speech recognition (incl. grammar and content) is something quite a few smaller companies can do
    - specialiced functions are relatively easy to realize (home automation, car navigation, ...)

    In my opinion, the training is not the hardest part - the biggest issue is the business model: Running such a server farm is expensive, and consumer typically don't want to pay for it directly - so the only path is indirect revenue (marketing ...). I doubt that would be much different with an open source implementation - someone still needs to keep the servers running.

    IF (big if) the smaller players in the market that do not like to share their know-how with google agree upon a standardized format for queries and data handling, a decentralized solution might be an option. It would require more effort by the consumer (configure multiple providers, maybe require a keyword to switch) - but it would also offer more freedom.

    Regarding the complete solution, it's also possible that one of the big players loses on the market and decides to open-source the software. But as written above that still doesn't solve the issue of providing the servers and the business model behind it.

  29. Mozilla by stakman · · Score: 4, Interesting

    The Mozilla project Vaani is intended to fill exactly this niche. https://wiki.mozilla.org/Vaani

    1. Re:Mozilla by jernst · · Score: 2

      The Vaani wiki states: "No longer an approved project" (right side of https://wiki.mozilla.org/Vaani), unfortunately.

    2. Re:Mozilla by Threni · · Score: 1

      Because nobody wants it? "Set my alarm for 7" is solved, Mozilla has no need for data mining people's search terms, and nobody wants to look like a dick talk to their phone/pc, so that's that.

    3. Re:Mozilla by jernst · · Score: 1

      I'd want it. I'd like to be able to sit at the breakfast table, and say "Hey Open-Source Siri, is it going to rain today?", "What's interesting on Slashdot today?" and things like that, and it tells me, without me getting the jam on the cell phone. Amazon of course does things like that, but I don't want a black box with a 24x7 microphones in my house. If it were open-source, that'd be entirely different.

  30. It's not just speech recognition. by hey! · · Score: 2

    It's semantic recognition. Like what "it" in the prior sentence means -- in this case it's mainly a grammatical placeholder, but note how the various uses of "it" in *this* sentence are different.

    The really impressive thing about Siri is how well (although still not human-well) it divines intent, not just phonemes. Add to that a massive scale attempt to get the phonetic recognition part right, and it's a bit like trying to launch a competitor to Google Maps.

    --
    Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
    1. Re:It's not just speech recognition. by R3d+M3rcury · · Score: 1

      Not to mention the issues with It's.

  31. Mycroft by ArhcAngel · · Score: 1

    Thanks for asking the question. I didn't know about Mycroft until I looked for an Intelligent Personal Assistant.

    --
    "A person is smart. People are dumb, panicky dangerous animals and you know it." - K
  32. There are several projects out there by pthisis · · Score: 5, Informative

    Not yet mentioned yet is http://lucida.ai/ -- it's the successor to Sirius, and where all the ongoing development is focused.

    Major options that are mentioned elsewhere in the thread:
    https://mycroft.ai/ (One of the most advanced,can actually be used in a pretty useful manner now, but sends snippets to Google for voice recognition--they intend to change that eventually, and they don't have a full-time open mic. Plus they aggregate audio across users so it's less identifiable as from a single source).
    https://wiki.mozilla.org/Vaani (from the Mozilla project; supposed to enter beta this month according to that page)

    --
    rage, rage against the dying of the light
  33. Re:Slashdot doesn't read tech news anymore. by mewsenews · · Score: 1

    Google's cloud speech API is a paid service - what does it have to do with this topic?

  34. Not that hard by Sarusa · · Score: 4, Funny

    Open Source Siri always responds with 'RTFM, noob'. Should be pretty easy.

    Yes, this joke has been brought to you by the year 2005.

    1. Re:Not that hard by Anonymous Coward · · Score: 0

      Wish I had mod points, parent is funny.

    2. Re:Not that hard by Anonymous Coward · · Score: 0

      You mean if people said this before 2005 they were serious? Man I'm in deep shit

  35. Not really an open source issue by Anonymous Coward · · Score: 5, Insightful

    The thing is, this really is not an open source software issue, it is more of an infrastrcuture issue. People can make the code that will handle spoken queries and return answers and do it as a community. That's not really the tricky part. What the OP is looking for though is a massive project of which code is a small part. There is voice processing, servers to maintain, lots of fine-tuning and learning to do, if we want the assistent to speak then we need voice actors, etc. Plus hours and hours of testing and trials and putting it all in an interface people will like.

    This reminds me of the "Where is the open source Facebook?" question. There are plenty of open source social network frameworks, but the code is a small part of the job. There's a massive amount of servers, advertising and social engagement that would need to happen for someone to make a new Facebook alternative. The open source code is there, it's the other parts which are missing.

    The author also seems to think most commercial software up to this point has an open equvalent, but it doesn't. Geological, accounting, mapping and tax software tends to be commercial only. There are usually no open source alternatives because it's not something you can throw together and just publish on-line. You need auditors and geologists, accountants and so on to make these things work. It's not a coding problem so much as a business/product problem.

  36. Courage by ChunderDownunder · · Score: 1

    You don't think Apple are working on it?
    iPhone 7 removed the headphone jack. iPhone 11 will remove the internal microphone.

  37. How does TensorFlow solve this? by Anonymous Coward · · Score: 0

    From your link:

    TensorFlow is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) communicated between them...

    That doesn't sound like an open source version of Siri to me...

    I see that Areyoukiddingme is too busy bitching about SJWs and global warming to read their own links anymore.

    1. Re:How does TensorFlow solve this? by Areyoukiddingme · · Score: 0

      TensorFlow is an open source software library for numerical computation using data flow graphs.

      Which is the basis of machine learning. Do you expect somebody to hold your dick for you while you pee, too?

      Is this Slashdot or is this Gawker?

      In the immortal words of Arnold Schwarzenegger, STOP WHINING!

  38. Saera by Anonymous Coward · · Score: 0
  39. Look at the economics by timholman · · Score: 4, Insightful

    It's pointless to talk about creating an open-source version of Siri or Alexa unless you can explain how you're going to also create and maintain the server-side infrastructure needed to make it work. The Siri and Alexa interfaces may run on a client, but they're brain-dead without the server farms of Apple and Amazon behind them.

    A similar example from the not-too-distant past: Aaron Swartz's download of a significant chunk of the JSTOR database. Those JSTOR articles wanted to be free, right? And they were set free - copies of Swartz's JSTOR download were available in a multi-GB torrent on several sites. Swartz's entire rationale was that those articles should be freely available to everyone.

    So where is the free, open-source version of JSTOR today? It doesn't exist, because building and maintaining a server-side infrastructure that makes that database useable costs money ... which, of course, is why JSTOR required a subscription fee.

    Solve out the server-side economics, and you have a shot at building an open-source Siri. Until then, you're better off putting your open-source efforts into client-side applications.

    1. Re:Look at the economics by r0kk3rz · · Score: 3, Insightful

      Solve out the server-side economics, and you have a shot at building an open-source Siri. Until then, you're better off putting your open-source efforts into client-side applications.

      There is a new wave of decentralised open source applications occuring at the moment which changes the server-side economics considerably. Perhaps not so much in terms of something compute heavy like Siri, but certainly other bandwidth heavy things like youtube. Things like Ethereum, IPFS, ZeroNet.

  40. Siri is many things - many open source projects! by paskie · · Score: 2

    Siri is a complete stack of text reco engines, intent recognition tools, and backends. There are many initiatives like Sirius, Mycroft and YodaQA, and each does something slightly different - either focusing on the speech reco infrastructure, or just answering factoid questions...

    --
    It's not the fall that kills you. It's the sudden stop at the end. -Douglas Adams
  41. Sounds fascinating, but... by Anonymous Coward · · Score: 0

    Not to be a negative nellie, but the way things have been going with patent/IP trollsuits going on over the last few years... Anyone betting against this sort of thing happening on a large/usable/popular scale?

  42. Re:Slashdot doesn't read tech news anymore. by Areyoukiddingme · · Score: 1

    Google's cloud speech API is a paid service

    It was completely free when it was announced. It's still completely free for the first 60 minutes of recognition time.

    And I only mentioned it because somebody asked if there's an API. There is.

  43. Captcha by Anonymous Coward · · Score: 0

    Why not use Captcha as a way to get millions of VR samples? It can be an option along side typing in an answer in case a mic isn't handy/convenient.

  44. Re:Slashdot doesn't read tech news anymore. by NotInHere · · Score: 1

    I'd like to interject for a moment. What you’re referring to as TensorFlow, is in fact, "corporate data/TensorFlow", or as I’ve recently taken to calling it, corporate data plus TensorFlow. TensorFlow is not a finished voice assistant itself, but rather a free component of an otherwise proprietary, fully functioning google system made useful by the google proprietary APIs, apps and web services comprising a full product experience as defined by the google leadership.

    Many computer users query the google system every day. There really is a TensorFlow, and these people are using it, but it is just a part of the system they use.

    TensorFlow is the basis: the program in the system that executes the AI programs. TensorFlow is an essential part of an AI service, but useless by itself; it can only function in the context of a complete AI service. TensorFlow is normally used with the google proprietary service: the whole system is basically google's proprietary service with TensorFlow added, or "corporate data/TensorFlow".

  45. It's a matter of leadership by Anonymous Coward · · Score: 0

    The resource based technical issues can be overcome by a distributed approach. With thousands of participants, you can wield thousands of CPUs and thousands of people to create data sets for training.

    The runtime issues can be mostly overcome in the same way though your performance (speed, accuracy or both depending on how you adjust tradeoffs) will always lag an approach using colocated CPUs.

    The magic ingredient is, therefore, some charismatic leader or group to drive the project. Find a geek that actually knows how to market a project and you're off to the races.

  46. Computer... by future+assassin · · Score: 1
    --
    by TheSpoom (715771) Uncaring Linux user here. I have nothing to add to this but please continue. *munches popcorn*
  47. Broadcast TV with CC by Anonymous Coward · · Score: 0

    Train it using broadcast TV and Closed Caption. Spoken word and text. Free and abundant and contains the type of data needed.

    1. Re:Broadcast TV with CC by Elfich47 · · Score: 1

      Careful with that: Closed captioning often abbreviates the spoken word.

      --
      Architectural plans are like computer source code with a couple of differences: You only compile once.
  48. I could build its equal in 10 minutes. by EmperorOfCanada · · Score: 4, Funny

    It only needs two features. First is to keep cutting people off mid sentence. If you are trying to say, "Send message to John Smith." I can have it cut people off before the name John Smith.

    Then I can randomly have it just wait until the end and then say, "I can't find that person in your contacts, would you like me to search the local area for businesses of that name?" This is regardless of what their actual command was.

    What I find interesting about Siri is that it so rarely gets what I am saying correct but when I insult it, it has got that right 100% of the time. "Fuck you Siri, you useless pile of shit." or any one of the zillion creative insults that I have thrown at it have resulted in some "If I had feelings, they would be hurt." So I know that it is not my microphone. It is the pile of crap just not getting what I am saying.

    I am saying, "Call John Smith." or "Message John Smith" or "Read last message" or "Play audiobook, the John Smith Story."

    I have a twenty minute ride home from work. I once spent the entire twenty minute ride home trying to send a message to someone that said, "I will be home in 20 minutes" (except that as I tried that number was ever growing smaller.)

    Nearly the entire time it would just cut me off mid sentence. It would often be in the middle of my message. So it would end up saying "Would you like to send the message "I will"?" I was even trying to give it a run-on sentence such as IWillBeHomeIn20Minutes, so that it wouldn't pick up on a pause as the end. Then there is all the other bullshit that it sucks at. In the previous example it wouldn't confirm to whom I was sending the message. It would not allow me to change the message. So I started over and over just to see if I could get it to work. Yet as a confirmation that it was hearing me I would ask things like, "What is the second derivative of x^3+x^2+3x+9" and it would give me the correct answer.

    Then after the map program nearly continuously putting me blocks from where I really am and thus giving me terrible directions in critical situations and then trying android's siri awesome equivelant, I switched to android.

    On this note, I don't think that apple realizes how bad these missteps are getting. The fact that it took me 20 minutes to send no messages, the fact that it took me 20 minutes to remove that U2 bullshit from my phone, the fact that I can't remove BS apps from my phone, the fact that iTunes nearly always is jumping to music and movies (both on the phone and the desktop) when I am clearly not looking for either (such as when I am looking for a podcast). The fact that my mac pro(not macbook but my $6,000 dollar mac pro) is shoving iCloud down my throat. The fact that I can't repair half of this shit without using magic tools. The fact that little things like some extra memory costs about as much as a cheap version of the same device. All totals up to my typing this on a completely kick ass windows desktop that is presently charging my completely kick ass huge screened Android phone that I rooted and easily removed all the BS from.

    While I am seemingly a single customer, I am also in charge of the purchasing for a large company. A company where I switched many of the execs and programmers to Apple. A switch that I am now reversing. Do I hate apple? Nope. The key is that Apple is no longer working for me, the devices that I bought weren't my servants, but little apple salesmen. Then there are things like XCode that was no longer really encouraging me to do things as a professional programmer, but trying to lock me into the apple ecosystem. Oddly enough this is why I originally left windows and microsoft. It was all about .net and getting me to become a sharepoint/MS salesmen. But now things like Visual Studio allow me to program for my Android and iOS just slick as can be. They are tools that work for me.

    Can you imagine a carpenter who got a hammer that would only hammer mastercraft nails? Or a hammer that regularly missed the nail regardless of your skill with a hammer?

    1. Re:I could build its equal in 10 minutes. by LinuxIsGarbage · · Score: 1

      Can you imagine a carpenter who got a hammer that would only hammer mastercraft nails? Or a hammer that regularly missed the nail regardless of your skill with a hammer?

      What would you expect from a Mastercrap hammer? You know you bought it from Crappy Tire right?

    2. Re:I could build its equal in 10 minutes. by EmperorOfCanada · · Score: 1

      I heard someone call it Cambodian Tire in a recent video. I should have used a higher quality example. I would be surprised if a mastercraft hammer could do any nails.

  49. You try and try by Anonymous Coward · · Score: 0

    You even build a working model to sequence bits, it is just a waveform representation.

    Then you try to talk to normal people and they think you are crazy for trying. Then you try to talk to programmer type people and they are elitist pricks who act like it is some amazing technology that is more than pattern replication because they want to sell it with hype to the idiots.

    The strange bit is why the idiots have the money. Somebody must have fucked up somewhere.

  50. Re:Communism@Home by Anonymous Coward · · Score: 3, Interesting

    timholman's post is incredibly insightful. To get around the problem he point out, I think we need to distribute these services to the community, as the OP suggests. The TelCo's make this difficult, with restrictive terms of service. A cloud powered by millions of home users is probably the technical solution to the economic problem, but to implement it we'll need to free the fibre.

  51. Think around the problem by John+Allsup · · Score: 1

    Think around, not through. What we want is efficient, intuitive and reliable human computer communication. If voice recognition is that hard, with many facepalm inducing errors, it is a stupid way to go. It is easier for humans to adapt to the machine. This means artificial dialects and simple AI and a bit of human training. Human consumers are lazy and want magic. Apple and MS try to grab them with the illusion of magic. It would be better for the free software to research what changes to speaking habits make the software component easier, then write howtos and youtube guides as to how to speak to it.

    --
    John_Chalisque
    1. Re:Think around the problem by Fnord666 · · Score: 1

      Think around, not through. What we want is efficient, intuitive and reliable human computer communication. If voice recognition is that hard, with many facepalm inducing errors, it is a stupid way to go. It is easier for humans to adapt to the machine. This means artificial dialects and simple AI and a bit of human training. Human consumers are lazy and want magic. Apple and MS try to grab them with the illusion of magic. It would be better for the free software to research what changes to speaking habits make the software component easier, then write howtos and youtube guides as to how to speak to it.

      Reminds me of the Palm Pilot and Graffiti. Rather than try to recognize normal handwriting like Microsoft was doing, Jeff Hawkins designed a simplified single stroke character representation that was very to recognize in software.

      --
      'The tyrant will always find pretext for his tyranny.' - Aesop's Fables
  52. Re:Siri is many things - many open source projects by nospam007 · · Score: 1

    " either focusing on the speech reco infrastructure, or just answering factoid questions..."

    So it's like me, when I speak, I can't listen.

  53. Siri is a cost center. How about Wikipedia? by nbritton · · Score: 1

    The trouble is, unlike software development which is free (if you don't value your time), implementing an open source siri would require a data center fill with servers and this costs money. The fundamental problem is software development creates value while an open source siri is a cost center. Wikipedia would probably be a good candidate to pick up this task because they are already familiar with the open source cost center model, they are a knowledge database, and they already have the server infrastructure.

  54. Open source version of siri / echo by fyngyrz · · Score: 5, Informative

    The answer is MyCroft

    I plan on buying one of these the very soonest I can once they are actually shipping the hardware. Echo is crippled by the many limitations Amazon coded in on purpose -- it's basically something that looks up text matches and does something if it finds one. No language parsing worth a damn. Even so, it's very useful, and within those limits, you can make stuff for it, Amazon's pretty open about it as long as you can set up a secure server (ugh) or use their cloud (double-ugh.) Siri, as per usual for Apple, is a much more closed system, and frankly, it's of no interest at all to me because of that.

    Mycroft is completely open source. I have very high hopes for it because of that. I have reams of my own natural language processing code I should be able to plug right in the moment there is a speech-to-text engine I can use directly. Others do as well. Custom apps in the home space, that are actually somewhat smarter than...

    [if string == "turn on light" then TurnOnLight]

    I suggest everyone check MyCroft out. Perhaps you'll be as enthused as I. I can hope. ;)

    --
    I've fallen off your lawn, and I can't get up.
    1. Re: Open source version of siri / echo by Anonymous Coward · · Score: 0

      That's not even out yet. It says it's open source, but you have to pay for the source, so...

    2. Re: Open source version of siri / echo by Anonymous Coward · · Score: 0

      so? It's free and open source software.

    3. Re: Open source version of siri / echo by Anonymous Coward · · Score: 0

      Wait what? Where did you pull that from?
      Here is the link to source code: https://github.com/MycroftAI/mycroft-core

  55. Just don't put it on a server by Trogre · · Score: 1

    Whoever is designing such a system needs to remember to keep it client-side.

    Given the ridiculous amount of processing power available on even low-end phones and tablets now there's really no excuse to rely on the horrible latency and dependence that comes with server based voice recognition.

    Any voice processing that relies on server-side processing has already failed.

    --
    "Nine times out of ten, starting a fire is not the best way to solve the problem." - my wife
    1. Re:Just don't put it on a server by Anonymous Coward · · Score: 0

      Written like a complete ignoramus. Apple, Google, MS and Amazon have not built data centers to process this stuff because it was easier than doing so client-side. It's because to do a good job requires a hell of a lot of data, accessible in less than a second, to maintain a good experience. OSS devs are great at programming. They suck at everything else.

    2. Re:Just don't put it on a server by Anonymous Coward · · Score: 0

      You do realize, I hope, that what you suggest is going to kill the battery!?

  56. What is There to Work With? by LifesABeach · · Score: 1

    Well, we'll need a voice to text generator. Then we'll need some kind of AIML handler. Finally, a text to voice generator. IBM use to sell a Voice-To-Text interface card in the late 1990's. Text to Voice is a small software routine these days.

    The Machine Learning part is the intriguing part. Books have been, and will continue to be written on this. What the hard part is, "Is how can a computer program find a valid fact, and be able to defend that the fact is valid?"

  57. Everyone! by Anonymous Coward · · Score: 0

    There are currently 50 different implementations and none are compatible, as is the OS way.

  58. Killed By Comercial Free Versions by Anonymous Coward · · Score: 1

    The interested in open source speech recognition used to be much higher. When Google and others released their free, cloud-based speech recognition APIs everyone jumped over to those. It became difficult to find projects based on the older FOSS software. There are a few products out there but they are an absolute pain to installed because they're designed by researchers. You have to know all the field-specific terms and concepts as there is no one-click installation package. You need to install and configure each component to work together: audio input, sound analysis , sound metadata -> words, words -> phrases, and finally phrases -> DIY action software and there aren't any good tutorials on any of those steps. When you do find some tutorials their packages are out of date so you end up trying to manually compile everything. It's simply not worth it when Google has an easy to use API. Common application developers don't care much about FOSS or privacy, they just want to make their cool little program. If it takes 160 to get the FOSS version working or 7 for Google, they all pick Google. And then Google gets all this free voice data as well.

  59. I'm late but by ArylAkamov · · Score: 1

    Does anyone know of something along these lines that can run without an internet connection?

    Something that you could ask the status of a gpio pin, state values, or even ask it to tell you a joke (from a predefined list)?

  60. Re:Slashdot doesn't read tech news anymore. by EzInKy · · Score: 1

    Free for 60 minutes is not free. Only free is "Free".

    --
    Time is what keeps everything from happening all at once.
  61. What you are looking for does not exist by Elfich47 · · Score: 1

    Short answer no: The voice model and is to large to conveniently run locally. Any data needed to formulate an answer (prices of products, driving instructions, jokes, baseball scores, count down timers, etc) has to be accessible to the voice computer.

    --
    Architectural plans are like computer source code with a couple of differences: You only compile once.
    1. Re:What you are looking for does not exist by ArylAkamov · · Score: 1

      Damn. I've been hoping for a somewhat primitive voice recognition/synth that could run on microcontrollers or similar. Limited responses and very primitive, sure, but it would be fun to integrate into some of my projects.

  62. Amazon by Elfich47 · · Score: 1

    I assume you are referring to the nebulous "Amazon is listening" issue. For amazon to be listening all the time would consume huge amounts of bandwidth and processing power for it to be of any use to them. And you the end user would see the bandwidth pull all the time.

    --
    Architectural plans are like computer source code with a couple of differences: You only compile once.
    1. Re:Amazon by jernst · · Score: 1

      https://www.amazon.com/gp/prod... is listening 24x7, otherwise how could it hear you say "Alexa"?

    2. Re:Amazon by Elfich47 · · Score: 1

      Let me clarify: The local Echo unit is listening all the time. It only wakes up, records and uploads for processing is if you say the magic wake word. Other than that you could do what ever you want and the Echo is just going to sit there dumb as a stump. The cloud side of the operation only knows when it is tasked with something from a local Echo unit.

      Don't believe me? Put a bandwidth monitor on your Echo and then test it. Let it sit there with a conversation in the room and then wake it up and ask it to do things: count down timer, sports, scores, the weather, prices for products, wikipedia pages. You'll see the bandwidth usage spike when it is talking to the mother ship.

      --
      Architectural plans are like computer source code with a couple of differences: You only compile once.
    3. Re:Amazon by Anonymous Coward · · Score: 0

      While that's all true, two questions for you:

      1) Do you know for a fact it isn't sending more than what you just said? That it doesn't send, at some point, bulk data as part of a voice request that includes many hours of recording or at least metadata?

      2) Do you have means to control it from being updated from the internet? If not, anything it does or does not do today is a mere software update to change. If that can be done, in any way, without your knowledge -- then that device becomes a listening device without you noticing, and indeed, could also be flipped back to normal, also without your ever being aware it occurred.

  63. How about open source version of something better? by v1x · · Score: 1

    I am an Android user, and have used Google Now, but had not tried Siri until very recently, when it was bundled with the latest macOS Sierra release. So far, I have been less than impressed with both Google Now and Siri, and after trying Siri for three days on my computer, I turned off that functionality altogether, because it was not as helpful as I had expected a voice interface to be. So, I would like to know who's building a better, open source voice interface (as opposed to merely recreating Siri or Google Now, which are both mediocre at best)?

  64. MyCroft by phmadore · · Score: 1

    Mycroft.ai

  65. [citation needed] by Anonymous Coward · · Score: 0

    We're moving to a world of voice interactions processed by AI.

    No, we're not; I don't believe voice interactions will ever become dominant mode of interaction with electronics, except maybe to control your living room TV, or for people with disabilities. Even if you can produce perfect recognition of every human sound in a given language, you still have the problems that: (1) people use electronics in public, and don't want to speak out loud for privacy reasons; (2) using a keyboard or touch interface is much faster and efficient than talking; (3) most speakers of e.g. English in the world are not native speakers, meaning that they'll have an accent, and it's very hard for electronics to compensate for that; (4) all human languages are quite ambiguous, and depend on a lot of context and culture that is hard for a non-human to "get"; (5) talking to a computer all day is much more tiring than using other interfaces — there's a reason many people prefer email to phone calls in business settings. And the list goes on.

  66. Re:How about open source version of something bett by Dog-Cow · · Score: 1

    I imagine Siri is much more useful on a phone than on a standard laptop or desktop system. I use it all the time on my phone, but I don't expect it to be at all useful on my mini.

    Being able to ask "how many tablespoons in half a cup?" and get a spoken answer, is really useful, especially if I'm in the middle of cooking at the time.

  67. Answer is Japan, what was the question? by Anonymous Coward · · Score: 0

    Isn't open source Siri / Coratana = UTAU?

  68. We're moving to a world of voice interactions? by Ash-Fox · · Score: 1

    We're moving to a world of voice interactions processed by AI.

    We are? I honestly haven't noticed.

    --
    Change is certain; progress is not obligatory.
  69. Imagine a Beowulf cluster... by doggo · · Score: 1

    "And even with that data, Siri gets trained with a massive farm of GPUs running 24/7 -- but how can the open source community replicate that?"

  70. People Have Been Talking AT Computers For Ages ... by Anonymous Coward · · Score: 0

    But only recently have some computers (mostly phones) started listening. Unfortunately, the current commercial interfaces have a bit of trouble when more than half of the words have 4 letters that refer to things that a machine cannot experience and natural functions they cannot perform.