Slashdot Mirror


The Challenges and Threats of Automated Lip Reading

An anonymous reader writes: Speech recognition has gotten pretty good over the past several years. it's reliable enough to be ubiquitous in our mobile devices. But now we have an interesting, related dilemma: should we develop algorithms that can lip read? It's a more challenging problem, to be sure. Sounds can be translated directly into words, but deriving meaning out of the movement of a person's face is much more complex. "During speech, the mouth forms between 10 and 14 different shapes, known as visemes. By contrast, speech contains around 50 individual sounds known as phonemes. So a single viseme can represent several different phonemes. And therein lies the problem. A sequence of visemes cannot usually be associated with a unique word or sequence of words. Instead, a sequence of visemes can have several different solutions." Beyond the computational aspect, we also need to decide, as a society, if this is a technology that should exist. The privacy implications extend beyond that of simple voice recognition.

3 of 120 comments (clear)

  1. HAL 9000 by tchuladdiass · · Score: 5, Funny

    Dave, although you took very thorough precautions in the pod against my hearing you, I could see your lips move.

  2. Too bad by ArcadeMan · · Score: 5, Insightful

    Beyond the computational aspect, we also need to decide, as a society, if this is a technology that should exist.

    Too bad it never stopped anyone before.

  3. How Naive by Tanuki64 · · Score: 5, Insightful

    Beyond the computational aspect, we also need to decide, as a society,