Princeton Students Develop Open Source Voice Control Platform For Any Device

← Back to Stories (view on slashdot.org)

Princeton Students Develop Open Source Voice Control Platform For Any Device

Posted by Unknown on Wednesday April 9, 2014 @02:50AM from the yell-at-your-computer-more-effectively dept.

rjmarvin (3001897) writes "Two Princeton computer science students have created an open source platform for developing voice-controlled applications that are always on. Created by Shubhro Saha and Charlie Marsh, Jasper runs on the Raspberry Pi under Raspbian, using a collection of open source libraries to make up a development platform for building voice-controlled applications. Marsh and Saha demonstrate Jasper's capability to perform Internet searches, update social media, and control music players such as Spotify. You need a few easily obtainable bits of hardware (a USB microphone, wifi dongle or ethernet, and speakers). The whole thing is powered by CMU Sphinx (which /. covered the open sourcing of back in 2000). Jasper provides Python modules (under the MIT license) for recognizing phrases and taking action, or speaking when events occur. There doesn't seem to be anything tying it to the Raspberry Pi either, so you could likely run it on an HTPC for always-on voice control of your media center.

34 comments

Min score:

Reason:

Sort:

Open source platform for Voice control by Anonymous Coward · 2014-04-09 02:56 · Score: 0

They need to develope a Raspberry PI with at least 4 usb hubs, I use them for all sorts of data collection applications like RF tag readers and security cameras.
1. Re:Open source platform for Voice control by wiredlogic · 2014-04-09 03:13 · Score: 2
  
  The solution is just to attach a hub. It isn't meant to be a production SBC. They can't meet their price point by adding more connectors to a larger board.
  
  --
  I am becoming gerund, destroyer of verbs.
2. Re:Open source platform for Voice control by Anrego · 2014-04-09 03:45 · Score: 5, Interesting
  
  My experience is the rasp pi just isn't stable enough in that kind of configuration for serious use (other experiences may vary). When you get higher USB traffic or eth traffic, it fails, and when it fails spectacularly and usually takes the board down with it. There are better boards out there are a slightly higher price range that can handle this no problem.
  Don't get me wrong, I love the rasp pi and I think it's awesome what they've done and more importantly what they've started (this kinda ultra cheap computer was a dream just a little while ago, now you've got a wide variety, and I believe the rasp pi was directly responsible for this). The reality is however that a good number of alternatives have popped up at a variety of price points, many better suited for a lot of the purposes we originally were salivating over for the pi. Definitely worth looking around before trying to force a pi to do it.
3. Re:Open source platform for Voice control by Anrego · 2014-04-09 04:20 · Score: 1
  
  Yikes!
  Terrible grammar, even for me. Sorry folks :(
4. Re:Open source platform for Voice control by pr0fessor · 2014-04-09 04:38 · Score: 1
  
  isn't mono already open source cross platform and run on the same platforms as espeak including rasp pi... system.speech namespace http://go-mono.com/status/stat...
5. Re:Open source platform for Voice control by Anonymous Coward · 2014-04-09 06:08 · Score: 4, Informative
  
  Sounds like a crap powersupply.
Oh good another "agent" + ViaVoice synergy by Anonymous Coward · 2014-04-09 03:07 · Score: 0

When will there be new ideas? I've been in systems engineering since the early '80s, and no new ideas have appeared since the mid-'90s, except miniaturisation techniques. Computing has become dull.
Re:Another project using sphinx isn't impressive by serviscope_minor · 2014-04-09 03:24 · Score: 3, Insightful

There are tens of these sort of projects and this one won't run on 'any' hardware as it uses some heavy libraries that certainly aren't going to work on ANY of the embedded systems I use.
Well, I'm shocked at ther deceptiveness of the article! Shocked, I tell you, shocked AND appaled.
I mean there are *so* many devices these things won't run on. It won't run on my little PIC12F678 with it's 64B of RAM and 1K flash. It won't run on my typewriter OR daisywheel and it most certainly won't work on my wax cylinder player.
A clue: "any" in this context means there are no device specific restrictions limiting it artificially. But you already knew that.

--
SJW n. One who posts facts.
... with a RaspberryPi by wiredlogic · 2014-04-09 03:36 · Score: 1

With a RaspberryPi you don't say? Quick, get a patent on this innovative technology that would be so mundane if it were implemented on a desktop machine running Debian or something.

--
I am becoming gerund, destroyer of verbs.
1. Re:... with a RaspberryPi by Anonymous Coward · 2014-04-09 04:34 · Score: 0
  
  With a RaspberryPi you don't say? Quick, get a patent on this innovative technology that would be so mundane if it were implemented on a desktop machine running Debian or something.
  it was probably more geared towards informing the reader of how little horsepower the application actually takes, that it can be run on a $35 computer the size of a deck of cards
2. Re:... with a RaspberryPi by BitZtream · 2014-04-09 04:46 · Score: 1
  
  Just don't use a USB mic ... unless you want the RaspberryPi's awesome USB hardware to randomly drop words on you.
  For the life of my I can't understand why people think using the RaspberryPi is a good idea. Its shit hardware, its not the cheapest, at best its one of many in its price range and its a steaming pile of shit hardware wise. For fucks sake, they can make a god damn camera add on but can't make freaking revision of the board that has FUNCTIONAL USB.
  
  --
  Persistent Volume manager for Kubernetes - https://github.com/dwimsey/openshift-pvmanager
3. Re:... with a RaspberryPi by khellendros1984 · 2014-04-09 05:03 · Score: 1
  
  It could also be marketed as "run a voice-controlled program on your 15 year old PC" or "Like speech recognition? Don't want to send your data to Google? You're in luck! Now with Python-y goodness!" Neither of those are as heavy on buzzwords, though. At the very least, you get an idea of the minimum requirements to use the project.
  
  --
  It is pitch black. You are likely to be eaten by a grue.
4. Re:... with a RaspberryPi by wiredlogic · 2014-04-09 05:04 · Score: 1
  
  FWIW the USB and Ethernet problems are all Broadcom's fault for making shitty SoC.
  
  --
  I am becoming gerund, destroyer of verbs.
The new ideas... by Anonymous Coward · 2014-04-09 03:49 · Score: 0

Are happening on a higher level now.
Re:Another project using sphinx isn't impressive by Anonymous Coward · 2014-04-09 03:50 · Score: 0

It won't run on my little PIC12F678 with it's 64B of RAM and 1K flash.
Nevertheless it would be nice to know approximately where the lower boundary is.
Saying that it runs on Raspberry Pi isn't really telling considering that the Pi pretty much has the specification of a full blown computer.
Even if it won't work on any 8-bit controller it would be nice to know if there is any possibility to make it run on a 32-bit controller without having to add external memory and storage.
The segment of 32-bit controllers in the 100MHz-range with around 64k internal ram and 200k flash is pretty big.
If one of those can be used it can be added to all sorts of neat stuff for a low cost.
If you have to add external memory you are moving to a segment of controllers with a different target audience and the neat motor control signals you wanted to use might not be available then.
Saying that it works on Pi is pretty much like saying that it works on a standard desktop without a fat graphics card. The Pi isn't very limited.
I Already Have This by Anonymous Coward · 2014-04-09 04:16 · Score: 0

I already have this. I say "[son's name] - go do this for me" and he goes and does it.
WHY ? Why do you want a live mic permanently on ? by span100 · 2014-04-09 04:28 · Score: 0

There is a point worth remembering, If its on a network it's hackable. Why would you want a hackable always on microphone in your house. Sorry, there is no way in the universe i will have ANY live mic or camera permanently on.
Re:Another project using sphinx isn't impressive by Ksevio · 2014-04-09 04:33 · Score: 1

A project that needs a full computer OS along with a high speed internet connection isn't really impressive. Now if he did it on an Arduino, that would show some progress.
Uh oh. by q4Fry · 2014-04-09 04:36 · Score: 1

"Format see colon. Why. Enter."
1. Re:Uh oh. by Anonymous Coward · 2014-04-09 05:39 · Score: 0
  
  bash: format: command not found
Pretty cool.. by Anonymous Coward · 2014-04-09 04:41 · Score: 3, Insightful

The fun things students come up with!
My son currently is in an engineering Graduate program in MA. He used to think they could try making cool things and maybe actually build and sell them.
Unfortunately most, if not all, can't take it beyond the classroom or home made use only. There is a huge list of patent trolls waiting on you if do.
Start here: https://www.google.com/?tbm=pts#q=voice+control&tbm=pts
He found this out the hard way :(
Re:Another project using sphinx isn't impressive by BitZtream · 2014-04-09 04:43 · Score: 2

No, I don't magically know what they mean given no context. Thats the point. It won't run on any device, hell, it won't even run on any RaspberryPi since ... some of the licenses for those libraries themselves are potentially conflicting.
I'm not sure what magical fairy world 'any' device belongs to, but not a single one I can think of applies here.
But hey, why let reality cloud your inner fanboy, eh?

--
Persistent Volume manager for Kubernetes - https://github.com/dwimsey/openshift-pvmanager
Re:WHY ? Why do you want a live mic permanently on by K.+S.+Kyosuke · 2014-04-09 04:48 · Score: 1

Probably depends on what you mean by "network". As far as mere voice commands are concerned, some bit-banged unidirectional interface could easily send low bitrate signals into whatever network you have at home. How would you "hack" something like that?

--
Ezekiel 23:20
Re:Another project using sphinx isn't impressive by khellendros1984 · 2014-04-09 04:55 · Score: 2

If the device can't run full-blown Linux with either ALSA or OSS support, then it sounds like you're out of luck. Full language models seem to be around 50MB of data, although I assume that simpler models could be used if recognition is constrained to a certain word set.

Compiled on my system, libsphinxbase.a is 298KB after being stripped, and the shared library is 302KB. That sounds like it's pretty far out of the size range that you're looking for.

--
It is pitch black. You are likely to be eaten by a grue.
I don't even have to look by ThatsNotPudding · 2014-04-09 05:10 · Score: 1

_Of course_ 'sudo make me a sandwich' is in the libraries!
interesting by Mirar · 2014-04-09 06:27 · Score: 1

I find this very interesting. I was looking for an easy way of setting up always-on microphones with speech synthesis for intelligent home use.
I didn't plan on using a Pi though, but a few of the always-on full blown linux pc I have around.
Aziz, light!
Exciting by elwin_windleaf · 2014-04-09 06:27 · Score: 1

I really like the way that these types of programs are taking us. It's about time that my computer starts listening to me while I'm yelling at it!
I've been using Blather myself, and really enjoy the results.
So they packaged some olds apps nicely. by Anonymous Coward · 2014-04-09 10:19 · Score: 0

I do not want to detract from the great work they did to integrate and document these existing packages but I do need to point out that there is very little that is original about what they have done.
I have been using a very similar set-up to entertain my kids for several years and I just followed the instructions written by other people. The difference was it was not in one place and I had to research the options then stitch it together with a bit of scripting.
Seems annoying by backslashdot · 2014-04-09 12:24 · Score: 1

After saying the trigger word, you have to pause .. that's a bit ridiculous and annoying .. I doubt this would catch on .. for it to catch on, it needs to allow you to say a continuous sentence without pausing. The latest chip from audience.com has this feature (called VoiceQ). Their chip is for phones, so it should be possible to implement the same technology in software on a desktop CPU.
Wow Really very useful information. Thanks a lot f by Anonymous Coward · 2014-04-09 19:40 · Score: 0

Wow Really very useful information.
Thanks a lot for sharing it with us. I will look forward to read more from you.
Could I share this information on my site.
http://webdesigningcompanyinchennai.in Web Designing in Chennai
Not usable outside research by JoSch1337 · 2014-04-10 01:18 · Score: 1

jasper depends on the "CMU-Cambridge Statistical Language Modeling Toolkit V2" which is released under the condition that it will only be used for research purposes. Therefore, their setup can't be used for non-research purposes. I doubt that setting up my own home-automation system counts as research...
1. Re:Not usable outside research by Jmc23 · 2014-04-13 09:04 · Score: 1
  
  maybe you should research that...
  
  --
  Don't complain about syntax, grammar, or spelling. There is no.hell like input on android.