Researchers Build An AI That's Better At Reading Lips Than Humans (bbc.com)
An anonymous reader quotes the BBC:
Scientists at Oxford say they've invented an artificial intelligence system that can lip-read better than humans. The system, which has been trained on thousands of hours of BBC News programs, has been developed in collaboration with Google's DeepMind AI division. "Watch, Attend and Spell", as the system has been called, can now watch silent speech and get about 50% of the words correct. That may not sound too impressive - but when the researchers supplied the same clips to professional lip-readers, they got only 12% of words right...
The system now recognizes 17,500 words, and one of the researchers says, "As it keeps watching TV, it will learn."
"the same clips to professional lip-readers"
ok, who else didn't know that there are "professional" lip readers?
Blow its goddamned stack.
I'm sorry Dave, I'm afraid I can't do that.
N/T
Seeing as there's so much closed-captioning going on, they've got an enormous volume of material to train their neural network on.
I've done this sort of thing before, and often finding a large set of quality training material is a significant challenge.
Getting half the words correct, then feeding that into a grammar / context engine should yield very close to 100% accuracy. That's what deaf (and hearing impaired) lip readers have to do since the stated 12% initial recognition is about right. They have to stay very focused on the speaker and make heavy use of context to work out what's being said. And that's a perfect job for a computer.
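A rough sketch of that idea, for the curious. This is not the researchers' method, just a toy illustration: the candidate words, their confidence scores, and the bigram "context" table are all made-up assumptions, and `best_sentence` is a hypothetical helper. It tries every combination of the lip-reader's guesses and keeps the one that the context scores rate as most plausible.

```python
from itertools import product

# Hypothetical lip-reader output: for each spoken word, a few visually
# similar candidates with a made-up confidence score.
candidates = [
    [("no", 0.5), ("know", 0.5)],
    [("new", 0.6), ("newt", 0.4)],
    [("taxes", 0.5), ("axes", 0.5)],
]

# Hypothetical context model: how plausible one word is right after another.
bigram = {
    ("no", "new"): 0.9,
    ("new", "taxes"): 0.9,
    ("know", "new"): 0.1,
    ("newt", "axes"): 0.2,
}


def best_sentence(candidates, bigram, default=0.05):
    """Return the word sequence with the highest combined score.

    Score = product of per-word confidences * product of bigram scores.
    Word pairs missing from the table get the small `default` score.
    """
    best_words, best_score = None, -1.0
    for combo in product(*candidates):
        words = [word for word, _ in combo]
        score = 1.0
        for _, confidence in combo:
            score *= confidence
        for pair in zip(words, words[1:]):
            score *= bigram.get(pair, default)
        if score > best_score:
            best_words, best_score = words, score
    return best_words


print(best_sentence(candidates, bigram))
```

Brute force over every combination is only workable for a toy like this. On real sentences you would want dynamic programming (Viterbi-style) or beam search, and a far better language model than four hand-written bigrams.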
I work for the Department of Redundancy Department.
https://tech.slashdot.org/story/16/11/25/1146258/googles-deepmind-made-an-ai-watch-close-to-5000-videos-so-that-it-surpasses-humans-in-lip-reading?sdsrc=rel
Sees the computer AI progressing in its research, and decides to replace the movies being watched, with the complete collection of gojira monster films that were dubbed in English and hardly provided any syncing at all, circa 1960's era, followed by Chinese martial arts movies full of lines like "Yaaaaa!" " Huh?" and "Prepare to die!"
The icing on the cake is when he throws in an Inspector Clouseau film
The surveillance state is coming in its pants thinking about all the additional conversations they'll be able to monitor now.
Time to break out the bandannas and cough-masks....soon it'll be fashionable to wear them in public!
Just cruising through this digital world at 33 1/3 rpm...
That cry of dismay was the sound of thousands of blind gynecologists realizing they will be out of a job reading lips. :-)
Of course the reality is grim - even more surveillance by marketers and the state - especially with TVs and webcams and (if you believe Trump) microwaves watching everything you say and do.
"Transparent" is a shit show that trades on every stereotype going. A man in drag is NOT a transsexual.
... silent-movie-interpreting overlords.
Go compare this to a deaf person that reads lips. I know of literally thousands that never miss a single spoken word as long as they're looking at the speaker's mouth.
Source: Camfrog, where there are fucktons of deaf people communicating with those with hearing. We speak after getting their attention with a hand signal, they read our lips and reply with zero issues.
Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
Watch my lips...
Or was Frank Poole killed because HAL thought they were going to unplug the "Mammary Circus" and that was basically the only DVD the three of them could agree on watching?
Mimetics Inc. Twitter
with all the AI job obsolescence going on the universal income one is pretty much relevant
I'm wondering what text they are using to train the AI about what was said. I sure hope it isn't the closed captioning text on the news broadcasts. In my experience that is only about 50% accurate itself.
I'm an American. I love this country and the freedoms that we used to have.
I will be impressed when they can lip read old Hong Kong kung fu movies!
Why don't they offer to run this against the thousands of hours of course videos that Berkeley just pulled due to ADA? Google gets massive training material, Berkeley gets free transcripts, and the material stays online. Everyone wins...
When TV was first being introduced as a consumer product, one of the selling points of the idea was that people would be able to learn by watching it. If this works out as well as that, then the system will only be able to recognize when someone is uttering lines from commercials.
Having the NN train on politicians might make for long-term reliability issues :S
At least I know it won't be able to read my lips. You see, I speak American, not English.
Humans are very difficult to read.
I'm a good cook. I'm a fantastic eater. - Steven Brust
That's OK. Burkas will be required.
Sharia for the win!
Did he just say "No new taxes," or did he say "No Newt[Gingrich] Axes" ?
Heck you were even told, prior to that line, "read my lips," so you got no excuses.
https://app.box.com/WitthoftResume Code: https://github.com/cellocgw
https://tech.slashdot.org/story/16/11/25/1146258/googles-deepmind-made-an-ai-watch-close-to-5000-videos-so-that-it-surpasses-humans-in-lip-reading
Maybe we'll finally find out what John Ford told Maureen O'Hara to say to John Wayne...a secret all three took to their graves...