Slashdot Mirror


The Challenges and Threats of Automated Lip Reading

An anonymous reader writes: Speech recognition has gotten pretty good over the past several years. it's reliable enough to be ubiquitous in our mobile devices. But now we have an interesting, related dilemma: should we develop algorithms that can lip read? It's a more challenging problem, to be sure. Sounds can be translated directly into words, but deriving meaning out of the movement of a person's face is much more complex. "During speech, the mouth forms between 10 and 14 different shapes, known as visemes. By contrast, speech contains around 50 individual sounds known as phonemes. So a single viseme can represent several different phonemes. And therein lies the problem. A sequence of visemes cannot usually be associated with a unique word or sequence of words. Instead, a sequence of visemes can have several different solutions." Beyond the computational aspect, we also need to decide, as a society, if this is a technology that should exist. The privacy implications extend beyond that of simple voice recognition.

2 of 120 comments (clear)

  1. HAL 9000 by tchuladdiass · · Score: 5, Funny

    Dave, although you took very thorough precautions in the pod against my hearing you, I could see your lips move.

    1. Re:HAL 9000 by sconeu · · Score: 1, Funny

      To everyone else: If something along these lines was NOT your first thought, please turn in your geek card.

      --
      General Relativity: Space-time tells matter where to go; Matter tells space-time what shape to be.