Mozilla's New Open Source Voice-Recognition Project Wants Your Voice (mashable.com)
An anonymous reader quotes Mashable:
Mozilla is building a massive repository of voice recordings for the voice apps of the future -- and it wants you to add yours to the collection. The organization behind the Firefox browser is launching Common Voice, a project to crowdsource audio samples from the public. The goal is to collect about 10,000 hours of audio in various accents and make it publicly available for everyone... Mozilla hopes to hand over the public dataset to independent developers so they can harness the crowdsourced audio to build the next generation of voice-powered apps and speech-to-text programs... You can also help train the speech-to-text capabilities by validating the recordings already submitted to the project. Just listen to a short clip, and report back if text on the screen matches what you heard... Mozilla says it aims is to expand the tech beyond just a standard voice recognition experience, including multiple accents, demographics and eventually languages for more accessible programs.
Past open source voice-recognition projects have included Sphinx 4 and VoxForge, but unfortunately most of today's systems are still "locked up behind proprietary code at various companies, such as Amazon, Apple, and Microsoft."
Fuck Mozilla until they apologize for the SJW-fueled witchhunt against Eich.
I'm afraid an Alexa with the voice of that Australian woman who does the Trivago hotel-booking commercials could get me to buy anything.
The results of A/B testing over thousands of voices seems ominous for my paycheck.
Sounds good if they make the corpus freely available. Having lots of free high quality audio recorded from modern digital microphones would be useful. Voxforge recordings tend to be poor quality, TIMIT is still proprietary despite being over 30 years old now, and the TEDLIUM corpus recordings seem to have a horrible amount of reverb/echo in them.
"I bless every day that I continue to live, for every day is pure profit."
Sphinx is a 1000 rolls of ducktape holding together coughed up hairballs. Can we make the next one less convoluted? Avoid java and any other hipster languages. You know what? Just stick to C. Not even C++.
Please say, "My voice is my passport."
Seriously, how many people will do this and then happily use voice recognition to decrypt hard drives, get into email accounts useful for resetting their bank password, etc? Just dumb
Mozilla is not the top of their field in anything that they do because, without much money, they try to do everything.
Mozilla: maybe try being the best at something rather than a "me too" at everything.
Thanks to Nuance voice recognition industry is effectively dead. If Mozilla can make this work in offline mode it would be awesome. Not requiring your every word to be recorded shipped off to third parties would be very useful.
If they hired creimer he could clean out their storage closet while they go bankrupt. He is a miracle worker after all.
It's bad enough that Amazon, Google, the govt. etc are capturing all our spoken utterances with voice recognition tech. Do we really want to put it in the hands of every dipwad spammer and malware botnet?
What does this have to do with building a web browser?
At one extreme, TiESR https://gforge.ti.com/gf/proje... is a fairly simple to use. Not state of the art, but it does use Hidden Markov Models (HMM's) and has some noise compensation built in. It comes with word and language models, so it's fairly easy to use - for US English at least. I haven't been ambitious enough to figure out how to build new models.
At the other extreme, Kaldi http://kaldi-asr.org/ is the most advanced open source recognizer that I'm aware of. Neural Nets and all the other goodies researchers have been working on the last few years. Definitely not easy to compile or use, though. And don't even think about trying to design a neural net without a graphics card to use as a math accelerator: one of the examples ran for days and wasn't even close to finishing when I gave up.
Anybody else have suggestions for another toolkit?
and never computer date! (honeypots!)
My voice is my passport.
Verify Me.
In soviet Russia the domain extension autopilots you.
This perpetual motion machine Lisa made is a joke, it just keeps getting faster and faster. - Homer
We are building a new voice print database and would like to have a chat.
https://www.youtube.com/watch?v=-zVgWpVXb64 :D