US Intelligence Unit Launches $50k Speech Recognition Competition
coondoggie writes The $50,000 challenge comes from researchers at the Intelligence Advanced Research Projects Activity (IARPA), within the Office of the Director of National Intelligence. The competition, known as Automatic Speech recognition in Reverberant Environments (ASpIRE), hopes to get the industry, universities or other researchers to build automatic speech recognition technology that can handle a variety of acoustic environments and recording scenarios on natural conversational speech.
"Go fuck yourself."
Haven't Microsoft, Apple and Google already spend billions of dollars on this?
Seems they are appealing to any random developer who might have an idea.
Thank you, Bradley Manning, Edward Snowden and so many others, for courageously defending humanity, my freedom and more!
Only 50k to sell my soul for having them spy on more people... including myself?
Nope.
Call Nuance and tell them you are going to make a money injection in their R&D dept.
I'm sure your 50k will make a real impact, when added to their 1.9 billion dollar revenue.
So they want a complex problem solved in 2 months (first test on Feb 4 and there are holidays inbetween), for which they will pay a relatively low amount and only to the winners. Even if the result wouldn't be used for spying, I don't think there would be many takers.
Given my own personal experience with voice recognition, it's not a problem we can throw money at. We can throw money AWAY trying, but we haven't improved much in many, many years of trying.
I don't have a particularly poor speech, or unusual accent, and English-speakers all understand me - even foreign English speakers like the one I live with. But speech recognition has always been an absolute flop unless I want to learn how to talk to the computer, which is the exact opposite of what I want to happen.
Since the first days of Dragon NaturallySpeaking, it's never been worth the training time even if all I'm doing it dictating serial numbers, or product codes, using simple single letters spaced out in a silent environment. Telling the difference between "eight" and "A" is much more involved than just context matching on a rough FFT of my voice.
And, as has been pointed out, someone who can do this will get a damn sight more than $50k reward as the patents would be worth billions.
To do it properly, we're really looking into problems that are the equivalent of the higher functions of AI.
When US Intelligence says something so clearly stupid, you always have to look for the subtext. The hidden message. The truth crouching behind the apparent idiocy.
In this case, the hidden message seems to be "we are incompetent in even the simplest basics of our main task".
First person arrested will be Stephen Hawking.
The shepherds did so well protecting the flock that the sheep no longer believed that wolves existed.
As usual with competitions like this, you shouldn't settle for the prize money if you develop such a thing because its worth quite a bit more.
1) Give 10$ to someone in India.
2) Use him for voice recognition (good enough). Software forwards audio input to him and outputs the typed text.
3) ???
4) 49.99k$ PROFIT!
Anybody who is participating in a contest to provide technology to the Director of National Intelligence is a moron who should be tried for treason.
Because I bet the Director of National Intelligence probably should.
Fifty THOUSAND dollars!
Coincidentally, this competition, by its very introduction also reveals a method for making massive automated eavesdropping difficult. Unless it produces a success, that is.
Ignoring the obvious "it'll be used for spying" thing, such technology would be worth far more than $50k. It's the sort of technology that gets your start-up bought by Google for a bazillion dollars because it's a difficult long-standing problem in the industry.
I remember a demo out of IBM, I believe, for recognizing controlled vocabulary in high-noise environments. It handily OUT-performed humans -- listening to the test audio, you couldn't really be sure there was a human voice at all, but the software detected and interpreted the speech with high accuracy.
This demo would have been circa 2000. I can't help imagining that there's been more progress since then.
The proposed task, where the interference is correlated with the original sound, seems like fertile ground for superhuman performance again. The original signal gets replicated and redundantly presented. Our brains are hard-wired to be confused by that, but it seems like a well-designed speech-recognition system could take advantage of it.
What? they didn't want to hire the same people who built the Obamacare site? I'm shocked!
https://www.youtube.com/watch?...
Is it just my observation, or are there way too many stupid people in the world?
Get the public to make your spy shit.
So, who wants to be the one who improves the automatic speech2text capabilities of automatic wiretapping systems in the US for a few bucks? :))
I am putting myself to the fullest possible use, which is all I can think that any conscious entity can ever hope to do.
These guys need to up their game.
Let's see...for $50K...I could probably write up a quick mobile app ($1K) that feeds microphone input into a streaming acceptance service on a server ($3K), that chops it up into wav files for Mechanical Turk processing. Fund that long enough to pass the POC stage ($2K), ride some odds (25%) and cash the check before the tech collapses = $6K for possible $12.5K win = $6.5K possible profit? Er...still no.
Raytheon already do a 300bps voice codec for highly noisy environments (Helicopters!), given the crossover between speech compression and speech recognition (speech recognition is essentially just a special case of speech compression) maybe together they could work something out.
That acronym is an utter failure. It doesn't even work.
http://www.acetonestudio.com
Course we'll have a list of entrants! And that is probably a good thing!
Where do people who would do this come from? Is it child abuse?
Pfwa! The Ruskies did this in the forties/fifties.
Check out the book "In the First Circle" by this guy named Aleksandr Solzhenitsyn.
Really, the only valid reason to enter this competition is sabotaging their apparatus by making something that looks good at first but is a dead end (see Nazi Germany Nuclear scientests and their inflated calculations).
No, it means they already have the technology, and to trick the public into thinking they don't only costs $50K.
I propose that before this decade is out, America will put a microphone with speech recognition in every home and office on the planet.