The Challenges and Threats of Automated Lip Reading

HAL 9000 by tchuladdiass · 2014-09-13 03:50 · Score: 5, Funny

Dave, although you took very thorough precautions in the pod against my hearing you, I could see your lips move.

Re:HAL 9000 by sconeu · 2014-09-13 05:38 · Score: 1, Funny

To everyone else: If something along these lines was NOT your first thought, please turn in your geek card.

--
General Relativity: Space-time tells matter where to go; Matter tells space-time what shape to be.
Re:HAL 9000 by Anonymous Coward · 2014-09-13 07:43 · Score: 0

Does Futurama count?
Re:HAL 9000 by flyneye · 2014-09-13 07:49 · Score: 1

Sorry, I was still stuck on the claims of reliability in the first line of the article. Now my trousers are damp and I must change them.

--
*Repent!Quit Your Job!Slack Off!The World Ends Tomorrow and You May Die!
Re:HAL 9000 by Tablizer · 2014-09-13 08:17 · Score: 1

Why didn't Slashdot editors use the HAL eye icon (eyecon?) instead of the lock? I'm disappointed and will increase my trolling 35% in protest.

--
Table-ized A.I.
Re:HAL 9000 by martin-boundary · 2014-09-13 12:36 · Score: 1

My first thought was "why the hell would I want a machine to lip read?" since lip reading is basically a crutch for humans' inability to hear sufficiently well to extract someone's voice from the surrounding environment.
We already have laser microphones, which can detect sound vibrations at a distance, and we have sophisticated sound processing methods to extract weak signals from noise, etc. We don't need lip reading, other than maybe as a fun science project for graduates.
Re:HAL 9000 by currently_awake · 2014-09-13 16:16 · Score: 1

By analyzing the light in the background of a video you can see what is reflected there (the people behind the camera). If someone in the background of a terrorist vid is talking about their next terrorist strike- I'd want to know what he was saying. It's a pre-recorded vid, you can't set up surveillance gear and the vid isn't good enough to show the sound vibrations.
Re:HAL 9000 by martin-boundary · 2014-09-13 21:00 · Score: 1

By analyzing the light in the background of a video you can see what is reflected there (the people behind the camera). If someone in the background of a terrorist vid is talking about their next terrorist strike- I'd want to know what he was saying.

That's ridiculous. If you can lip read the reflection in a terrorist vid, then you can see the person's face, and you don't need to know what he's talking about, you can arrest him for being an accessory. If you can't see the person's face, try using Photoshop's ENHANCE plugin to erase the balaclava from the picture.
Re:HAL 9000 by elgatozorbas · 2014-09-14 04:43 · Score: 1

Why not? Apart from the idea that lip reading may complement speech recognition and make it more reliable. Also it may be more useful in a loud environment, which is frequently the case when machines are around, btw. Or in cases where speaking up loud to a computer is not appreciated, such as in office environments. And if all of this would not be enough, note the title of this website: news for nerds. You want a machine to lipread because it CAN (maybe).
Re:HAL 9000 by fractoid · 2014-09-14 17:35 · Score: 1

In addition to the other suggestions made here, one use case for machine lip reading is tracking multiple simultaneous conversations in a crowd. You could theoretically have a searchable index of anything anyone said in view of a particular camera (whereas once more than 2-3 people are talking at once, it becomes almost impossible to separate out their individual speech by audio alone).

--
Rampant carbon sequestration destroyed the Dinosaurs' tropical paradise. I'm here to help repair the damage.

Thanks Jerry Mahoney! by SternisheFan · 2014-09-13 03:50 · Score: 2

I'm glad I learned ventriloquism as a kid.

Re:Thanks Jerry Mahoney! by Anonymous Coward · 2014-09-13 03:54 · Score: 0

I'm glad I learned ventriloquism as a kid.
Fool your friends, and the NSA!
Re: Thanks Jerry Mahoney! by Jeremiah+Cornelius · 2014-09-13 04:09 · Score: 0

"I still can't allow you and Cmdr. Poole to jeopardize this mission, Dave."

--
"Flyin' in just a sweet place,
Never been known to fail..."
Re:Thanks Jerry Mahoney! by Tablizer · 2014-09-13 08:19 · Score: 1

I'm glad I learned ventriloquism as a kid.
Who said that?

--
Table-ized A.I.

NSA probably already has this technology by psy0rz · 2014-09-13 03:51 · Score: 1

NSA probably already has this technology

Re:NSA probably already has this technology by Anonymous Coward · 2014-09-13 04:00 · Score: 0

Except that it probably false-positives on terrorist activity ~1% of the time, and since actual terrorist activity is being discussed ~0.01% of the time, false positives outnumber correct positives 100:1, rendering it completely useless.
Like all of these systems.
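The 100:1 figure in the comment above can be checked with a few lines of arithmetic. This is a minimal sketch, not anyone's actual system: it assumes the detector catches every real case (a 100% true positive rate, which the comment does not state), and the function name `positives_breakdown` is made up for illustration.

```python
def positives_breakdown(base_rate, false_positive_rate, true_positive_rate=1.0):
    """Return (false positives per true positive, precision).

    base_rate: fraction of conversations that really are about the target topic.
    false_positive_rate: fraction of innocent conversations that get flagged.
    true_positive_rate: fraction of real cases that get flagged (assumed 1.0 here).
    """
    true_pos = base_rate * true_positive_rate
    false_pos = (1.0 - base_rate) * false_positive_rate
    precision = true_pos / (true_pos + false_pos)
    return false_pos / true_pos, precision


# Figures from the comment: ~1% false positives, ~0.01% real cases.
ratio, precision = positives_breakdown(base_rate=0.0001, false_positive_rate=0.01)
print(f"false positives per true positive: {ratio:.1f}")
print(f"share of flags that are real: {precision:.2%}")
```

Under those assumptions, roughly one flag in a hundred is real, which matches the comment. A lower true positive rate only makes the ratio worse.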
Re:NSA probably already has this technology by Deep+Esophagus · 2014-09-13 04:10 · Score: 4, Interesting

I'd be very surprised if the false positive rate were as low as 1%. Lip reading is NOT an exact science. It depends on context, clear line-of-sight, and how well the speaker enunciates. You'd be amazed how many phonemes sound different to our ears but look identical on the lips.
But hey, I'll let these guys explain it much better. Bad Lip Reading
Hilarious stuff, but the point is relevant: Without *any editing at all* of the actors' lips, they are able to perfectly match ridiculous words to those mouth movements. Why would automated software pick the "real" words over the BLR version?
Re:NSA probably already has this technology by Anonymous Coward · 2014-09-13 04:23 · Score: 0

Why do you think that the technology would be used for mass-screening? I think that it would be just one more tool used for close tracking of people who have already been identified as extremely likely suspects. Just like keylogging, visual surveillance, microphone-based tape-recording, etc. I seriously doubt that it would be feasible for locating previously unknown suspects.
Re:NSA probably already has this technology by Anonymous Coward · 2014-09-13 04:50 · Score: 0

Yes, but the point is that even if it works unrealistically well it's still completely useless.
Re:NSA probably already has this technology by TheMeuge · 2014-09-13 05:04 · Score: 1

You are making a fundamentally flawed assumption that the government cares about false positives. I think our no-fly lists, jails, and police militarization are a pretty good indicator that a low false positive rate does not figure into calculations as far as the NSA, TSA, DHS or other TLAs are concerned. A cynical man (or woman) may also wonder whether the true positive rate figures into their calculations at all, or whether a power grab is the sole purpose of these agencies.
Re:NSA probably already has this technology by TubeSteak · 2014-09-13 05:14 · Score: 1

Why would automated software pick the "real" words over the BLR version?
Those BLR guys are going out of their way to produce something ridiculous.
You can train recognition software using real language samples and some grammar rules.
Why would you assume that we can't strap these two technologies together?

--
[Fuck Beta]
o0t!
Re:NSA probably already has this technology by pz · 2014-09-13 05:59 · Score: 1

"Dude, you punched a f-ii-sh."
Frelling awesome!
The real point is, though, that although some of those redubbed conversations are like Jabberwocky, some exchanges are reasonable (and some are spot-on visual homonyms, like the fish interpretation above), demonstrating that lip reading is wildly underconstrained.

--

Put my fist through my alarm clock with its ding-dong death inside my ear. - The Blackjacks.
Re:NSA probably already has this technology by davester666 · 2014-09-13 06:31 · Score: 1

The NSA couldn't care less about false positives. They just mean the budget has to be upped that much more next year.

--
Sleep your way to a whiter smile...date a dentist!
Re:NSA probably already has this technology by Oligonicella · 2014-09-13 07:10 · Score: 1

They did it as a humorous example, but there are many words which have negligible or synonymous lip movements. And as someone pointed out, ventriloquism is easily learned and *will* be as soon as authorities start using lip reading software.
Re:NSA probably already has this technology by jonbryce · 2014-09-13 07:33 · Score: 2

You can't for example tell the difference between "nine" and "ten" by lip reading, and often either could be equally likely in the context.
Re:NSA probably already has this technology by angel'o'sphere · 2014-09-13 07:43 · Score: 1

You perhaps can't distinguish nine from neun (German) or ten from zehn (German), but 9 from 10 in most languages is easily distinguished ... perhaps you just need practice?

--
Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.
Re:NSA probably already has this technology by jonbryce · 2014-09-13 08:39 · Score: 1

I spelt it out in words because I was talking about English. Obviously French is completely different.
When saying either of those words, first the tongue moves down from the top of the mouth, then you say a vowel, and the difference between them is at the back of the mouth where you can't see. Then you have the "n" which is the same in both words.
Re:NSA probably already has this technology by Anonymous Coward · 2014-09-13 08:54 · Score: 0

Vacuum.
Re:NSA probably already has this technology by thegarbz · 2014-09-13 11:56 · Score: 1

Because we as humans weren't even able to do it. Cast your mind back to the 2006 World Cup finals when Zidane head-butted an opposing player.
Lip readers concluded a wide range of possible answers to why he lashed out including calling him a terrorist, insulting his mother, and saying his sister was a prostitute. This may be something specific to the Italian language that the words may sound the same but it highlights the problem. All conclude that he said the Italian for "go fuck yourself" at the end.
The problem with following grammar rules is you assume people follow the grammar rules.
Re:NSA probably already has this technology by plover · 2014-09-13 16:15 · Score: 1

Not at all useless. Simply decode all possible sequences and rank them, ranking the most self-consistent interpretation highest. You may also have other sources of data to help correlate the interpretation (there was an article earlier this year about measuring sound using the video footage of a mylar potato chip bag's vibrations.) Even if the room is crowded, it might be possible to identify a few isolated words from the audio recording of the conversation.
The next thing you do is throw away those conversations that you're not interested in. Regardless of whether the conversation resulted in "You punched a fish" or "You munched a dish", neither is going to have value when you're searching for criminal activity. But if your streams could be "I bought the ammo so we can rob the bank" or "I mopped the jam up sorry can you mop the tank?" one of those could be valuable.
99.999% of conversations are inane drivel. If this technology is applied, the number of false positives is going to rapidly overwhelm a system. More discrimination and correlation is going to be needed to actually produce intelligence from this data. But never think that data is worthless or unusable.
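The "decode all possible sequences and rank them" idea above can be sketched in a few lines. This is a toy illustration under loud assumptions: the `SLOTS` table of look-alike words and the `BIGRAM_SCORE` numbers are invented for the example, and a real system would use a proper viseme model and a trained language model instead.

```python
from itertools import product

# Hypothetical ambiguity table: each position lists words that look alike on the lips.
SLOTS = [["you"], ["punched", "munched"], ["a"], ["fish", "dish"]]

# Made-up plausibility scores for adjacent word pairs (not from real data).
BIGRAM_SCORE = {
    ("you", "punched"): 2.0,
    ("you", "munched"): 1.0,
    ("punched", "a"): 1.0,
    ("munched", "a"): 1.0,
    ("a", "fish"): 2.0,
    ("a", "dish"): 1.5,
}


def score(words):
    """Sum the toy scores of every adjacent word pair; unseen pairs score 0."""
    return sum(BIGRAM_SCORE.get(pair, 0.0) for pair in zip(words, words[1:]))


def rank_candidates(slots):
    """Enumerate every combination of look-alike words, most plausible first."""
    candidates = [list(combo) for combo in product(*slots)]
    return sorted(candidates, key=score, reverse=True)


for candidate in rank_candidates(SLOTS):
    print(f"{score(candidate):.1f}  {' '.join(candidate)}")
```

The number of candidates grows multiplicatively with each ambiguous position, so anything longer than a short phrase would need pruning (for example a beam search) rather than full enumeration.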

--
John
Re:NSA probably already has this technology by kwbauer · 2014-09-13 19:25 · Score: 1

According to the article that would be groups of 5 phonemes (on average) that look identical.
Re:NSA probably already has this technology by kwbauer · 2014-09-13 19:27 · Score: 1

But neither should be enough to get a warrant, because of the inaccuracy; yet it will be presented as having no inaccuracies at all.
Re:NSA probably already has this technology by Smauler · 2014-09-13 23:48 · Score: 1

I can lip read a little (my hearing was awful as a child). I still always look at people's mouths when I'm talking with people to get extra information - my hearing's currently worse than average, but not too bad - I have trouble with background noise.
There have been some times watching quiz shows when I've read the contestant's lips (when they're conferring) to get the answer they're going to say before they've said it, and repeated it to the room. That being said, I agree it's far from an exact science.
I _hate_ (hate hate and hate again) audio and video being out of sync, because it completely throws me. I can't watch video with bad audio synchronisation, I just have to listen to it.
With regards to the Twilight bad lip reading example, I could tell that some words were off, but not too many. Like I said, it's not an exact science, and I agree that 1% false positive would be very optimistic.
Re:NSA probably already has this technology by Smauler · 2014-09-13 23:56 · Score: 1

You can't definitively tell the difference between nine and ten. However, nine is generally a little longer and less abrupt. I'd guess that I could get over 90% accuracy lip reading people who are just saying nine and ten. General speech is a different matter.
Re:NSA probably already has this technology by Reziac · 2014-09-14 05:19 · Score: 1

Judging by the false-positives rate, a case might be made that they are in fact aiming for zero negatives.

--
~REZ~ #43301. Who'd fake being me anyway?
Re:NSA probably already has this technology by plover · 2014-09-14 06:37 · Score: 1

They don't need a warrant if they're not trying to gather admissible evidence. See "parallel construction" for an example of what they do with this data.

--
John

Jesus H Christ! by mark_reh · 2014-09-13 03:52 · Score: 4, Insightful

We're all going to have to start wearing Burkas if we want any privacy at all.

Re:Jesus H Christ! by Dr_Barnowl · 2014-09-13 04:59 · Score: 2

More like CV Dazzle
A burka will get you "profiled". Weird hair and makeup is a fashion statement.
Re:Jesus H Christ! by Anonymous Coward · 2014-09-13 05:10 · Score: 0

http://www.zimbio.com/photos/A...
http://cf067b.medialib.glogste...
Re:Jesus H Christ! by Anonymous Coward · 2014-09-13 08:35 · Score: 0

When catching terrorists turns you into a terrorist, the catch has no use.
Re:Jesus H Christ! by Anonymous Coward · 2014-09-13 11:26 · Score: 0

This is why all public surveillance should require a vote of the people. Like, in cities, have the voters of the city approve any surveillance, whether it's cameras, red light cameras, etc. And have it sunset after 5 years.
Can't lip reading aid in voice recognition?
Re:Jesus H Christ! by ayesnymous · 2014-09-13 18:06 · Score: 1

We're all going to have to start wearing Burkas if we want any privacy at all.
No because a microphone will be on every corner. They'll have all the cases covered.
Re:Jesus H Christ! by dissy · 2014-09-13 19:31 · Score: 1

Cobra Commander was SO ahead of his time!
ps. Go Cobra!
Re:Jesus H Christ! by Anonymous Coward · 2014-09-14 13:12 · Score: 0

My fulsome moustache and beard are not weird or a fashion statement - I just hate shaving. But they do make me lip reading proof. What next "Big Brother Bans Beards"?

Too bad by ArcadeMan · 2014-09-13 03:53 · Score: 5, Insightful

Beyond the computational aspect, we also need to decide, as a society, if this is a technology that should exist.

Too bad it never stopped anyone before.

--
Get free satoshi (Bitcoin) and Dogecoins

Re:Too bad by ClickOnThis · 2014-09-13 04:04 · Score: 1

In the end, I suspect we'll decide that the advantages outweigh the disadvantages, and pass laws to protect people from the disadvantages. I'm not saying this will be ideal, but it will be the best we can do.
We have faced, or are facing the same issue with other technologies such as face recognition, profiling, genome sequencing, etc.

--
If it weren't for deadlines, nothing would be late.
Re:Too bad by Anonymous Coward · 2014-09-13 07:36 · Score: 0

Beyond the computational aspect, we also need to decide, as a society, if this is a technology that should exist.
Too bad it never stopped anyone before.
Anyone? It has, but society as a whole is a different matter entirely.
Re:Too bad by Anonymous Coward · 2014-09-13 10:30 · Score: 0

pass laws to protect people from the disadvantages
Then law-enforcement just have to continue ignoring laws and everyone is happy. Big Brother has more data on potential enemies of the State, and we can pretend everything is fine because there is a law to protect us.

How Naive by Tanuki64 · 2014-09-13 03:54 · Score: 5, Insightful

Beyond the computational aspect, we also need to decide, as a society,

Re:How Naive by nbauman · 2014-09-13 04:18 · Score: 2

If we don't get it, the terrorists will get it first.
Re:How Naive by TeknoHog · 2014-09-13 11:56 · Score: 1

Try lip reading the many wives of terr'rists.

--
Escher was the first MC and Giger invented the HR department.
Re:How Naive by AmiMoJo · 2014-09-13 23:53 · Score: 1

But not before the marketing scum. There are already screens that advertise different things depending on your gender, determined by a little camera above it.

--
const int one = 65536; (Silvermoon, Texture.cs)
SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC

This technology *will* exist... by Anonymous Coward · 2014-09-13 03:55 · Score: 0

... either created under the light or not. So we had better create it in the open and decide which norms we would like to impose on it.

Anyhow, what's the difference between lip-reading technology and speech recognition that makes the first more dangerous than the second?

Re: This technology *will* exist... by Anonymous Coward · 2014-09-13 04:14 · Score: 1

There's lots of cameras deployed without microphones. Also pretty sure sound doesn't make it to geosynchronous orbit strata of the atmosphere...
Re: This technology *will* exist... by Anonymous Coward · 2014-09-13 04:16 · Score: 0

(And yes, I know there are already optical microphones.)
Re: This technology *will* exist... by ClickOnThis · 2014-09-13 04:40 · Score: 2

There's lots of cameras deployed without microphones. Also pretty sure sound doesn't make it to geosynchronous orbit strata of the atmosphere...
You're implying we could read lips from GEO. Good luck with that. Even if the Hubble Space Telescope (which is at low earth orbit, not geosynchronous) were pointed at the earth, the best resolution you could manage would be about 30 cm.
http://www.spacetelescope.org/...
https://what-if.xkcd.com/32/
In theory it might be possible to read lips at GEO, but you'd need a HUGE telescope, or smaller binocular-configured telescopes with a wide-enough baseline, to get the job done.
And nitpick: there's really no "strata of the atmosphere" at GEO. Contributions there from the Earth's atmosphere are minuscule. It's pretty much plasma and magnetosphere from a few hundred km altitude on upwards.
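
How huge "HUGE" is can be estimated from the Rayleigh criterion, theta = 1.22 * wavelength / aperture. A rough sketch, assuming green light at 550 nm, a GEO altitude of 35,786 km, and a purely hypothetical target of 1 cm detail for reading lip shapes; atmospheric blurring is ignored, so the real requirement would be worse. The function names are made up for illustration.

```python
GEO_ALTITUDE_M = 35_786_000.0  # approximate altitude of geosynchronous orbit


def required_aperture_m(distance_m, feature_m, wavelength_m=550e-9):
    """Aperture diameter needed to resolve feature_m at distance_m (Rayleigh criterion)."""
    return 1.22 * wavelength_m * distance_m / feature_m


def ground_resolution_m(distance_m, aperture_m, wavelength_m=550e-9):
    """Smallest resolvable feature at distance_m for a given aperture (Rayleigh criterion)."""
    return 1.22 * wavelength_m * distance_m / aperture_m


# Assumed target: 1 cm detail on the ground, seen from GEO.
aperture = required_aperture_m(GEO_ALTITUDE_M, feature_m=0.01)
print(f"aperture needed for 1 cm detail from GEO: {aperture:.0f} m")

# For comparison: a 2.4 m mirror (Hubble-sized) placed at GEO.
print(f"2.4 m mirror at GEO resolves about {ground_resolution_m(GEO_ALTITUDE_M, 2.4):.1f} m")
```

With those assumptions the single-aperture answer is a mirror roughly 2.4 km across, while a Hubble-sized mirror at that distance would resolve about 10 m, which is why an interferometer with a wide baseline is the only plausible route.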

--
If it weren't for deadlines, nothing would be late.
Re: This technology *will* exist... by Anonymous Coward · 2014-09-13 06:09 · Score: 0

GeoEye 1 operates at 410cm ground resolution, although downsampled to 500.
Given that this is a commercial mapping satellite, I'd be more than willing to believe classified reconnaissance satellites can manage a 2-3 order of magnitude improvement.
Can I prove it? Not really. I can say that my ex-NRO coworker made himself scarce whenever the topic came up.
Re: This technology *will* exist... by Anonymous Coward · 2014-09-13 06:15 · Score: 0

Wow, decimal fail. That should be 41cm, downsampled to 50. Per Wikipedia.
Re: This technology *will* exist... by Greyfox · 2014-09-13 06:21 · Score: 1

Hey! It COULD work! If everyone had 30cm lips! They'll use Steven Tyler for the proof of concept!

--
I'm trying to teach myself to set people on fire with my mind... Is it hot in here?
Re: This technology *will* exist... by Anonymous Coward · 2014-09-13 06:33 · Score: 0

Hello, moron. I suggest you recheck the difference between this satellite's orbit and that of geosynchronous orbit. Simply having the word "geo" in a satellite name does not immediately imply it has a geosynchronous orbit.
Furthermore, I suggest you educate yourself about the diffraction limit of telescopes. It's not just a good idea, it's physical law. As the other poster indicated, you're only going to be able to bypass the need for an enormous aperture at GEO if you somehow are able to set up an interferometer.
Re: This technology *will* exist... by angel'o'sphere · 2014-09-13 07:49 · Score: 1

Or Mick Jagger

--
Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.

Why should it NOT exist? by Anonymous Coward · 2014-09-13 03:55 · Score: 1

Turning the question around, why should it NOT exist or be looked into? At the very least it's an academic curiosity. If privacy is a concern, there's a very easy way to break the algorithm - talk whilst covering your mouth, which people have been doing whilst whispering to others for a long time. Ventriloquists would probably defeat it easily as well.

Capture: Lunatic

Re:Why should it NOT exist? by SternisheFan · 2014-09-13 04:01 · Score: 4, Insightful

related dilemma: should we develop algorithms that can lip read? Of course we should, we should develop any tech. The real question is, will it be used for moral or immoral purposes?
Re:Why should it NOT exist? by ch-chuck · 2014-09-13 04:14 · Score: 1, Flamebait

we are morally obligated to develop this technology before the bad guys get it and use it against us.

--
try { do() || do_not(); } catch (JediException err) { yoda(err); }
Re:Why should it NOT exist? by nbauman · 2014-09-13 04:17 · Score: 2

Grow a big moustache.
Re: Why should it NOT exist? by Jeremiah+Cornelius · 2014-09-13 04:22 · Score: 3, Insightful

Governments and corporations are fictional persons. They have no "moral consciousness" of any kind, outside of rhetorical and ideological fantasy.
So, this will not be a question of moral or immoral use. It will be amoral, in the hands of those who have advanced themselves through manipulation of the aforementioned ideological rhetoric.
You continue to believe that there is hope for this modern, post-industrial society. But there is none. We as people have increased the sophistication of our tools and our reach - just as relentlessly as we have avoided the refinement of our own beings.
In the end you don't get Star Trek. You don't even get Starship Troopers. You get A Scanner Darkly, and hope there is VALIS.

--
"Flyin' in just a sweet place,
Never been known to fail..."
Re:Why should it NOT exist? by Anonymous Coward · 2014-09-13 04:24 · Score: 0

Just like my wife. She grew a penis too.
Re:Why should it NOT exist? by pz · 2014-09-13 07:04 · Score: 1

related dilemma: should we develop algorithms that can lip read? Of course we should, we should develop any tech. The real question is, will it be used for moral or immoral purposes?
Certain technology can be declared illegal. Like guns in certain countries. Radar detectors in some US states. Blue lights on non-police cars in most US states. Mechanisms for counterfeiting printed money. Cloning of human embryos. Et cetera. It's perfectly plausible for a society to declare some particular technology illegal.
Heck, even certain knowledge is illegal for the general public to own, let alone internalize, like plans to make nuclear bombs.

--

Put my fist through my alarm clock with its ding-dong death inside my ear. - The Blackjacks.
Re:Why should it NOT exist? by ClickOnThis · 2014-09-13 09:04 · Score: 1

Heck, even certain knowledge is illegal for the general public to own, let alone internalize, like plans to make nuclear bombs.
Designs for nuclear weapons are not too hard to find online. The hard part (thank God) is obtaining the materials to make one, such as enriched uranium, plutonium, deuterium and tritium.
That said, I agree it would be illegal for a member of the general public to possess classified documents of any kind, without authorization.

--
If it weren't for deadlines, nothing would be late.
Re:Why should it NOT exist? by Anonymous Coward · 2014-09-13 18:55 · Score: 0

"We" who? You could be one of the bad guys.
Re:Why should it NOT exist? by hodet · 2014-09-14 03:25 · Score: 1

Think of the advantages for the deaf and hard of hearing (combined with a HUD). That alone tells me we should develop it. NSA are gonna NSA. Terrorists are going to terrorize. This type of technology has the potential to change countless lives, and for that reason alone we should.

Pfft by msobkow · 2014-09-13 04:09 · Score: 3, Insightful

Like moral issues have ever stopped anyone. :(

--
I do not fail; I succeed at finding out what does not work.

Re:Pfft by Anonymous Coward · 2014-09-13 05:22 · Score: 0

The USG has no morals .. f*&^ the constitution, fsck the people's rights, full steam ahead.

Jesus H Christ! by OrangeTide · 2014-09-13 04:09 · Score: 1

We're trying to catch the terrorists, not dress like them.

--
“Common sense is not so common.” — Voltaire

Silly Question by Anonymous Coward · 2014-09-13 04:11 · Score: 0

If that technology is feasible, it will be developed by someone or other. And it probably already has been developed by or for various spook agencies.

Paying lip service to privacy .... by Anonymous Coward · 2014-09-13 04:12 · Score: 0

we also need to decide, as a society, if this is a technology that should exist. The privacy implications extend beyond that of simple voice recognition.

Privacy implications? Hahahha. If it can exist, it will. Governments and the private sector are no doubt already working on it. In our children's lifetimes there will be no such thing as private communication.

Combined by xonen · 2014-09-13 04:12 · Score: 1

The most obvious approach is to combine the 2 methods - much like humans do, especially in noisy environments. It might improve the accuracy of current speech recognition which is, to be honest, still sub-standard.

Speech recognition as it is now is way too limited. Sure, Siri and the like may work. And some computerized phone systems use it to nag us instead of using reliable button clicking. But it is still far from transcribing an accurate memo. Let alone automated subtitling or other fancy applications.

So yes, please, develop it, and use it to improve overall speech recognition.

--
A glitch a day keeps the bugs away.

Re:Combined by Anonymous Coward · 2014-09-13 05:27 · Score: 0

There is some evidence that combining the 2 methods might make things worse
Re:Combined by queazocotal · 2014-09-13 06:48 · Score: 1

Err...
Yes - and if you actually read the article you linked, it's saying that if you edit the sound to be different than video, then you get effects that differ from the sound when listened to.
In real life - when the sound and video are not intentionally disturbed - it helps.
Re:Combined by Animats · 2014-09-13 07:12 · Score: 3, Insightful

The most obvious approach is to combine the 2 methods - much like humans do, especially in noisy environments.

Right. Especially since, when you're looking at your smartphone, it's looking back at you.
This would be valuable for vehicle driver speech input, which has to reject a lot of noise.

Open the pod bay doors HAL? by Anonymous Coward · 2014-09-13 04:13 · Score: 0

if it's been thunk, someone will.....

HAL did it. by koan · 2014-09-13 04:14 · Score: 1

It will happen, it's just a matter of getting the tech correct.

--
"If any question why we died, Tell them because our fathers lied."

It's already been decided.... by Aldenissin · 2014-09-13 04:17 · Score: 2

Beyond the computational aspect, we also need to decide, as a society, if this is a technology that should exist. The privacy implications extend beyond that of simple voice recognition.

How much do they extend beyond that of so called "simple" voice recognition? I suppose one could rarely listen in when they couldn't have with current amplifying audio equipment. As a society, we've already decided that it should exist: "We hold these truths to be self-evident, that all men are created equal, that they are endowed by their Creator with certain unalienable Rights, that among these are Life, Liberty and the pursuit of Happiness."

Can this be used as a weapon? Yes, so can a hammer. Ban hitting people with hammers, not the hammer.

--
Like a city whose walls are broken down is a man who lacks self-control.

Re:It's already been decided.... by Anonymous Coward · 2014-09-13 10:51 · Score: 0

It's possible to listen to ALL conversations in an 80,000 person stadium with just 300 or 400 microphones placed uniformly around the stadium. That news is years old.
So, lip reading backlash? Overrated considering that people already bend over at the airport ...
Re:It's already been decided.... by AmiMoJo · 2014-09-14 00:05 · Score: 1

The problem in the United States is that corporations are legally people. The EU will clamp down on this hard, not allowing corporations to monitor any conversation in range for advertising purposes. Individuals will benefit (I'd love to be able to whisper silently to my phone instead of having to say "OK Google" out loud) but business use will be heavily regulated. New rules already allow for fines of up to 50% of global revenue for privacy violations.
In the US it will be a conditional issue and corporate lawyers/lobbyists will win. People won't speak in public for fear for the adverts they might trigger.

--
const int one = 65536; (Silvermoon, Texture.cs)
SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
Re:It's already been decided.... by Aldenissin · 2014-09-20 03:00 · Score: 1

In the US it will be a conditional issue and corporate lawyers/lobbyists will win. People won't speak in public for fear for the adverts they might trigger.
I doubt they will "win" like you suppose. They are too smart for that. Perhaps they should, and people may start to push back...

--
Like a city whose walls are broken down is a man who lacks self-control.

Hasn't someone already done this? by Assmasher · 2014-09-13 04:20 · Score: 1

I seem to recall that this was done previously but the conditions had to be good (e.g. sitting facing the camera with good lighting.)

--
Loading...

Easier than you think. by pubwvj · 2014-09-13 04:27 · Score: 2

Lip reading is a lot easier than the original poster thinks. There is a lot more data available, especially within context.

Re:Easier than you think. by Oligonicella · 2014-09-13 07:16 · Score: 2

Try it from across the room and you don't know what the conversation is about. Do it at a bar looking for people using pick up lines and you'll get false positives. As for context, try to figure out how to inject that into the reading algorithms.

Of course it should exist by Anonymous Coward · 2014-09-13 04:33 · Score: 0

Did the inventor of the camera say 'I wonder how this could be abused' or 'How awesome is this'?
Really if you want to eavesdrop on someone a parabolic dish or laser microphone works pretty darn good.

Already being done... by Patent+Lover · 2014-09-13 04:39 · Score: 1

https://www.youtube.com/user/B...

Challenge by Livius · 2014-09-13 04:48 · Score: 1

It's certainly a worthy area of computational linguistic research. But the reason for that is that it's a very hard problem. Automated language processing, with very smart people and very motivated spy agencies working very hard at it, has taken 60 years to get to a point not quite at the level of high school language speakers.

The privacy concerns are irrelevant. The deaf will demand this, and as long as there are weak-willed politicians and judges more interested in making political statements than dispensing justice, the whims of a special interest group will always trump the rights of the majority.

Moral conundrum? I don't think so.. by Codeyman · 2014-09-13 04:54 · Score: 1

We are the same species that invented the atomic bomb. If we can think of a technology, someone is already probably working on it.

Do both by Anonymous Coward · 2014-09-13 04:58 · Score: 0

The article makes it sound like an either-or thing between speech recognition and lip reading. There's no reason you can't do both to supplement current speech recognition and bring recognition rates up.

Most deaf people aren't completely deaf. They can hear, but not well enough to understand speech. If they have some hearing, they supplement it with lip reading to the point that people don't realize they are deaf. (It's counterintuitive, but if you want to get a deaf person's attention and they aren't looking at you, just knock on their desk or door; low-frequency hearing is usually the last to go.)

So it'd be harder to implement, but lip reading might improve speech recognition to the point it's useful beyond a gimmick.

Re: Do both by Anonymous Coward · 2014-09-13 05:54 · Score: 0

A good lip reader can pick up 30% of what is spoken.

YT channel by Anonymous Coward · 2014-09-13 05:05 · Score: 0

There's an amusing YouTube channel about bad lip reading:

https://www.youtube.com/user/BadLipReading

You sure that the NSA hasn't got it already? by Bruce66423 · 2014-09-13 05:16 · Score: 1

Or perhaps one of the others - the CIA would no doubt appreciate it.

It's going to be done anyways by davidwr · 2014-09-13 05:30 · Score: 1

You can bet your $THINGOFVALUE here that the CIA and similar organizations are already researching this if they don't have it already.

Like handwriting recognition, this will be full of examples of "bad output" in the early days, and there will always be cases where lack of context and/or deliberate obfuscation by the speaker makes this unreliable.

Let's just assume that this will be as reliable 5 or 10 years from now as automated face recognition is today and within 20 years both will be very reliable. What do we do about it as a society? Do we pass laws and adopt social norms such that only "authorized" people can use this technology? Do we pass laws requiring that people be put on notice if their lips are being read by a computer without a court order or something similar? Do we become a society where people just expect that anything they say in public will be picked up and understood by a computer, likely in real-time?

--
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.

Re: It's going to be done anyways by Anonymous Coward · 2014-09-13 05:48 · Score: 0

Production ready? Ok, try this http://m.youtube.com/watch?v=qkeqNI614eY

recognition spyware software. by pigsycyberbully · 2014-09-13 05:50 · Score: 0

There was a European voice recognition product from a non-English-speaking country. They began supporting other languages, and then suddenly a U.S. product that did the same thing appeared. The European company was convinced it was espionage. They ended up with a product they couldn't sell, so they began giving it away and it slowly died off. IBM began its own voice recognition product, which was not very good and was quickly replaced by Microsoft's offering. Then a product called Dragon NaturallySpeaking took the lead. This product secretly collected voice wave text corrections and would send large parts of the user's documents to the software company for "improving the product's correction rate" without the user's knowledge. The product was sold to customers using Microsoft Windows and the Apple system. These products are very intrusive spyware and should not be used in the medical profession, in government, or in companies that deal with confidential information. Do you Apple users and Microsoft users really need any more spyware products?

P.S. Look at the "lip reading" document: it is spoken, not typed. It uses U.S. speech, as in "gotten" rather than the typed word "got". You have got or you have not got. There is no "gotten" in written English; it has got or it has not got. Speech recognition software is spyware, and it makes English speakers' spoken words look childish as text because they are clearly spoken and not written. I am non-English and I believe I write better than most of these U.S. people using voice recognition spyware software.

Challenging sounds to discern visually by mark-t · 2014-09-13 06:00 · Score: 1

'D' and 'T', 'G' and 'K', and even 'P' and 'B' are frequently all but impossible to discern by lip-reading alone, and can only ever really be discerned when one of the alternatives simply does not make any sense. But this is not always the case.
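
The ambiguity described above can be made concrete with a small sketch. It assumes a crude, letter-level stand-in for mouth shapes: the table `VISEME_CLASS` and the helper names are hypothetical and only cover the three pairs mentioned in the comment. Real systems work on phonemes and video features, not spelling.

```python
from collections import defaultdict

# Hypothetical letter-level classes: each pair from the comment above
# is assumed to share one visible mouth shape.
VISEME_CLASS = {"p": "P", "b": "P", "t": "T", "d": "T", "k": "K", "g": "K"}

def viseme_signature(word):
    """Replace each letter by its class; unlisted letters stay as they are."""
    return "".join(VISEME_CLASS.get(ch, ch) for ch in word.lower())

def lookalike_groups(words):
    """Group words that share a signature, i.e. would look alike on the lips."""
    groups = defaultdict(list)
    for word in words:
        groups[viseme_signature(word)].append(word)
    # Only groups with more than one member are ambiguous.
    return [group for group in groups.values() if len(group) > 1]

print(lookalike_groups(["pat", "bat", "bad", "time", "dime", "kale", "gale", "lamp"]))
# [['pat', 'bat', 'bad'], ['time', 'dime'], ['kale', 'gale']]
```

Within each group, only context can pick the intended word, and as the comment notes, context does not always settle it.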

--
File under 'M' for 'Manic ranting'

How Naive by Anonymous Coward · 2014-09-13 06:24 · Score: 0

It's math. You want it banned now?

Ah, right... encryption technology is good, decrypting something all of us could do if we wanted to, bad. Logic rules here!

"society" doesn't get to decide by silfen · 2014-09-13 06:29 · Score: 1

Beyond the computational aspect, we also need to decide, as a society, if this is a technology that should exist.

Sorry to break it to you, but society not only doesn't "need" to make this decision, it has no right to make this decision. You don't get to decide what other people invent, and for the most part not even what it is used for.

Speech recognition, pretty good? by Anonymous Coward · 2014-09-13 06:55 · Score: 0

That would call for an extravagantly optimistic definition of 'pretty good'. Speech recognition systems still have lots of problems with individual accents, background noise, colds and unlimited contexts. They are incrementally better than they were ten years ago, but their usefulness is still pretty limited. We are still far from achieving the level that can be seen in shows like Star Trek and derivatives. Things like Siri and co. are nice toys, but it is still faster, in most cases, to use the keyboard.

Morals. by Anonymous Coward · 2014-09-13 06:57 · Score: 0

Beyond the computational aspect, we also need to decide, as a society, if this is a technology that should exist. The privacy implications extend beyond that of simple voice recognition.

Morals only apply if you can't afford to have them waived. Since this technology will be used by government and large businesses, there is no question it will be used when it works effectively, and that any concerns one may have could only exist because they are a terrorist and have something to hide.

The legal system by penguinoid · 2014-09-13 06:59 · Score: 1

If lip reading software reaches the courts, suddenly all video recording becomes wiretapping. The courts might resolve that by allowing audio recording wherever they allow video recording. Or by forbidding video recording wherever they forbid audio recording. Or maybe they will finally do something about that ancient "wiretapping" deal they've been twisting into the modern world.

--
Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways

Done way back in 2006 by Anonymous Coward · 2014-09-13 07:02 · Score: 0

http://www.telegraph.co.uk/news/uknews/1534830/New-technology-catches-Hitler-off-guard.html

New computer software that can read lips at almost any angle has helped make sense of one of the Second World War's lingering mysteries —Hitler's home movies.

The technology that has allowed the dialogue to be reconstructed is called ALR — automated lip reading — and has been developed by Frank Hubner, a speech recognition expert. The computer recognises shapes that lips make, turns them into sounds and matches these to a dictionary.
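
The three steps in that description (lip shapes, then sounds, then a dictionary match) can be sketched roughly as follows. This is not the ALR software itself: the shape labels, the `SHAPE_TO_SOUNDS` table and the tiny `DICTIONARY` are all invented for illustration, and sounds are approximated by letters.

```python
from itertools import product

# Hypothetical lip-shape labels and the sounds each one might stand for.
SHAPE_TO_SOUNDS = {
    "closed-lips": ["p", "b", "m"],
    "open": ["a"],
    "tongue-tip": ["t", "d", "n"],
}

# Toy dictionary; a real one would hold pronunciations, not spellings.
DICTIONARY = {"pat", "bat", "mat", "bad", "mad", "pan", "ban", "man"}

def candidate_words(shapes, dictionary=DICTIONARY):
    """Expand each observed shape into its possible sounds, keep real words."""
    options = [SHAPE_TO_SOUNDS[shape] for shape in shapes]
    spelled = ("".join(combo) for combo in product(*options))
    return sorted(word for word in spelled if word in dictionary)

print(candidate_words(["closed-lips", "open", "tongue-tip"]))
# ['bad', 'ban', 'bat', 'mad', 'man', 'mat', 'pan', 'pat']
```

Even in this toy case one shape sequence yields eight dictionary words, so the match step narrows the options but rarely gives a single answer.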

I've seen the documentary BTW

George Carlin by pipingguy · 2014-09-13 08:01 · Score: 1

Old George Carlin joke:
Here’s a good example of practical humor, but you have to be in the right place. When a local television reporter is doing one of those on-the-street reports at the scene of a news story, usually you’ll see some onlookers in the background of the shot, waving and trying to be seen on television. Go over and stand with them but don’t wave. Just stand perfectly still and, without attracting attention, move your lips, forming the words, “I hope all you stupid fuckin’ lip-readers are watching. Why don’t you just blow me, you goofy deaf bastards.” The TV station will enjoy taking the many phone calls.

Deaf Perspective by Anonymous Coward · 2014-09-13 08:49 · Score: 0

I spent 10 years clinically deaf before my first cochlear implant. I functioned in the hearing world entirely through lipreading. I never learned sign language.

The ethical question here is entirely mooted by the fact that competent lipreaders (i.e., me) have existed for a long time. People, right now, effectively guard themselves against lipreaders if they feel the need.

Have you never noticed that when the catcher or manager approaches the pitcher's mound, the pitcher almost always puts his glove over his mouth when he talks? Or on the sidelines of a football game, surely you've seen the coach put his play clipboard over his mouth when talking into his headset. Sometimes the headset itself is large enough to cover his lips. Annoying! My parlor trick stops working!

Being able to read lips can be a lot of fun (ask my wife how I knew to ask her out when we first met) but protecting yourself from it is trivial and happens all the time already.

Crap by thegarbz · 2014-09-13 11:59 · Score: 1

It's a load of garbage anyway. There's nothing this technology does to invade privacy that we can't already do.

You're in the open, then use a parabolic mic to pick up the conversation you're clearly already taping.
You're behind some glass, then use a laser microphone to pick up the conversation, which, while it sounds James Bondish, actually already exists.

As a society we're already too little too late on the privacy side.

Whatcha need... by fyngyrz · 2014-09-13 12:41 · Score: 1

...is a little monitor that hangs over your lips, showing a silent movie of your lips saying (in a loop) "I suspect I'm under surveillance" while underneath, you can be saying anything you like. :)

--
I've fallen off your lawn, and I can't get up.

Re:Whatcha need... by TigerBull · 2014-09-13 15:00 · Score: 1

Then the next level will be them reading the vibrations from the screen caused by your voice.

I never knew how far we have come... by Anonymous Coward · 2014-09-13 13:53 · Score: 0

Automated lips?

"Should" is irrelevant by Anonymous Coward · 2014-09-13 14:00 · Score: 0

All that is required is a camera connected to a computer, with the correct software. Whether this technology "should" exist is irrelevant. Someone will eventually develop it. You cannot prevent it - the required hardware can be legally purchased at any number of stores around the world. The software required can be written in pretty much any language, including those whose compilers or interpreters are available at no cost.

Assume that the technology will exist, if it does not already exist, and set out rules for the use thereof.

Arrogant tripe by Anonymous Coward · 2014-09-13 14:20 · Score: 0

Beyond the computational aspect, we also need to decide, as a society, if this is a technology that should exist. The privacy implications extend beyond that of simple voice recognition.

What a bunch of arrogant tripe. If it can be invented, it will be, whether you like it or not. The question you should be asking yourself---excuse me, "we as a society"---is, once the technology is invented, how can we ensure it isn't abused?

Yes, develop for the disabled by Nyder · 2014-09-13 16:19 · Score: 1

I can see how this would be great for deaf people, using something like Google Glass to get subtitles of convos around them. How about making something for people who can't speak, but can form the words with their mouth? Might need something like a mic but with video/lasers for reading the facial movements, that outputs it to a speaker.

Sure it will get used for bad, but that is going to happen regardless anyways. So how about we do some good with it and help out the disabled people with some nice technology to make their lives more like ours?

--
Be seeing you...

I am so sure... by Anonymous Coward · 2014-09-13 21:39 · Score: 0

...that the Clandestine people (you're so surreptitious!) developed this tech like yesterday.

WE ARE DOOMED! ALL DOOMED!

"If you want a vision of the future, imagine a boot stamping on a human face - forever.", George Orwell

Augmented sensing by mattr · 2014-09-14 02:13 · Score: 1

Could augment by adding other sensors such as microwave, laser or terahertz imaging, to detect signals being generated by tongue and vocal cords, or even to directly image the organs themselves.
Also it seems possible that since the whole head vibrates, reflections or motions of the eyes, nose, lips and forehead might provide vibratory cues.

There's already a textbook by ulatekh · 2014-09-14 06:46 · Score: 1

The most obvious approach is to combine the 2 methods - much like humans do, especially in noisy environments.

Obvious, indeed. There's already a textbook for the subject, Multimodal Signal Processing...available for free online, no less.

This is exactly the sort of system you'd want on a flight deck, to supplement the accuracy of speech-recognition in the presence of noise, especially intermittent noise such as turbulence. It can also help with speaker identification.
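
One simple way to combine the two methods is late fusion: each recognizer scores the same candidate words and the scores are merged with a weight that can be lowered for audio when the cockpit is noisy. The sketch below is an assumption-laden illustration, not a method taken from the textbook; `fuse`, `audio_weight` and the probabilities are all made up.

```python
import math

def fuse(audio_probs, visual_probs, audio_weight):
    """Log-linear late fusion over a shared set of candidate words.

    score(word) = w * log p_audio(word) + (1 - w) * log p_visual(word)
    All probabilities are assumed to be strictly positive.
    """
    scores = {
        word: audio_weight * math.log(audio_probs[word])
        + (1.0 - audio_weight) * math.log(visual_probs[word])
        for word in audio_probs
    }
    # Turn the scores back into probabilities (shift by the max for stability).
    top = max(scores.values())
    unnormalised = {word: math.exp(s - top) for word, s in scores.items()}
    total = sum(unnormalised.values())
    return {word: value / total for word, value in unnormalised.items()}

# Made-up numbers: noise makes "bat" and "cat" sound alike,
# while "bat" and "pat" look alike on the lips.
audio = {"bat": 0.45, "pat": 0.10, "cat": 0.45}
visual = {"bat": 0.45, "pat": 0.45, "cat": 0.10}

fused = fuse(audio, visual, audio_weight=0.5)
print(max(fused, key=fused.get))  # bat
```

Neither channel alone can pick a winner here, but their confusions differ, so the combination can. Lowering `audio_weight` is one way to lean on the video during bursts of noise such as turbulence.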

As for the hopelessly naive idea that "society" should be able to choose whether this sort of thing should exist...the textbook came out in 2009.

--
"Once we've identified and embraced our sickness, we'll have strength...and that's when we get dangerous." - John Waters

There is no "Should we?" involved. We will. by Doghouse13 · 2014-09-14 09:07 · Score: 1

If recent history teaches anything about technology, it's that if something is technically possible - and it seems highly improbable that automated lip-reading isn't - someone WILL do it. Further that, if it's not actually illegal to do so, someone will make it commercially available in the civil domain. And that if it's made illegal in the civil domain, that's very unlikely to stop the security community, in all its sundry forms, from weaponising it (sorry, my Orwellian paranoia is clearly on overdrive; that should, of course, have been "deploying and using it for the overall good of society"). And even if it's illegal in some jurisdictions, it won't be illegal worldwide anyway.

Slashdot Mirror

120 comments