Slashdot Mirror


Cell Phones for the Deaf

nitzan writes "Quoting from the article: 'the software translates the voice on the other side of the line into a three dimensional animated face on the computer, whose lips move in real time synch with the voice allowing the receiver to lip read.' Unfortunately this only works with laptops, but a pda version is in the works." The company website has a demonstration.

267 comments

  1. Can you hear me now? by Cap'n+Canuck · · Score: 5, Funny

    Still no?

    Ok, can you hear me now? Still no?

    Ok....

    1. Re:Can you hear me now? by Anonymous Coward · · Score: 0

      That's very funny because deaf people can't hear.

    2. Re:Can you hear me now? by Anonymous Coward · · Score: 0

      thanks for clearing that up for me, eisenstein

    3. Re:Can you hear me now? by The+Dobber · · Score: 2

      That would be Einstein, Sherlock..........

    4. Re:Can you hear me now? by Cruciform · · Score: 2

      Maybe he knew a dumb kid named Eisenstein...
      You only assume that's what he meant because of your own frame of reference.

      Whee! Nitpicking with high karma is fun isn't it :)

    5. Re:Can you hear me now? by Anonymous Coward · · Score: 0

      As a legitimately deaf person who relies on speech reading for the majority of my human communication, I can honestly say this phone is completely useless. Speech reading is a HORRIBLE place to start in producing a cell phone for the deaf. Too many words appear exactly the same on lips. How Now Brown Cow. Say that while looking in the mirror. The only word you can tell apart is "brown". The tongue makes the other parts astand out. That freaky Max headroom wanna be has no tongue, and if it did, it'd frighten me.

      Get a clue: SPEECH TO TEXT PHONE. Please. Everything else is wasting our time.

  2. Oh, great... by Anonymous Coward · · Score: 5, Funny

    ...so now we'll all have to learn how to sign "Turn off your fucking phone, asshole!"

    1. Re:Oh, great... by Anonymous Coward · · Score: 0

      Um, why would the phone have a ringer on? They are deaf.

      Hello McFly!!

      Also, this is for people who lip read.

    2. Re:Oh, great... by Anonymous Coward · · Score: 0

      Also, the post was a joke.

    3. Re:Oh, great... by sczimme · · Score: 5, Insightful


      Yes, because the deaf person is bound to have the ringer turned way up...

      Oy.

      --
      I want to drag this out as long as possible. Bring me my protractor.
    4. Re:Oh, great... by Anonymous Coward · · Score: 0

      Yes, but when it is pure ignorance is becomes less amusing.

    5. Re:Oh, great... by Waab · · Score: 5, Funny

      I believe this particular sign has already been standardized and is currently in use by 99% of the American driving population.

      .!..

      !!.. if you're from the other side of the pond.

    6. Re:Oh, great... by Anonymous Coward · · Score: 0

      Thanks, but I was well aware that there would be no audible ringer. The post WAS a joke, fool.

    7. Re:Oh, great... by Anonymous Coward · · Score: 0

      This is just another technology solution to a problem that does not need to be solved.

    8. Re:Oh, great... by Anonymous Coward · · Score: 0

      Um, why would the phone have a ringer on? They are deaf.


      How would they know the ringer was turned off in the first place???

    9. Re:Oh, great... by Anonymous Coward · · Score: 0

      !!!. Profit?

    10. Re:Oh, great... by gosand · · Score: 2
      Yes, because the deaf person is bound to have the ringer turned way up...

      You can change the volume on those things? Hey, that's great - now will someone inform the rest of the US population, please!

      --

      My beliefs do not require that you agree with them.

  3. Technology overkill by tyler_larson · · Score: 5, Insightful

    What was wrong with speech to text?

    --
    "With sufficient thrust, pigs fly just fine. However, this is not necessarily a good idea...."
    RFC 1925
    1. Re:Technology overkill by p4ul13 · · Score: 4, Informative

      Rather than have a computer interpret a person's speech, the software basically gives a representation of what the speaker's mouth is doing. This will allow the deaf person watching the device to do their own interpretation of what they see, which I'd imagine is much more reliable than speech-to-text could hope to be.

      --
      Paul Lenhart writes words!
    2. Re:Technology overkill by karlowfwb · · Score: 1

      I agree. However they could use this to animate the mouths of CG characters... Might look halfway decent then.

    3. Re:Technology overkill by Anonymous Coward · · Score: 1

      As far as I can see, speech-to-text would be a lot bigger (space-wise), as well as *way* more processor intensive. And slower and less accurate.

    4. Re:Technology overkill by Ted_Green · · Score: 5, Interesting

      Actually speech to text is much more reliable.

      Text to speech:

      1. person speaks
      2. software interprets phonetics converts it into words
      3. deaf person reads the words

      versus

      1. person speaks
      2. software interprets phonetics into picture based lip movements
      3. deaf person interprets picture based lip movements

      Point of fact this is unbelievably dumb and is right up there with converting Russian to German for an English speaker to read.

    5. Re:Technology overkill by zapfie · · Score: 1

      It's buggy and slow and wrong half the time?

      --
      slashdot!=valid HTML
    6. Re:Technology overkill by Anonymous Coward · · Score: 0

      uhh.. sure.

    7. Re:Technology overkill by quintessent · · Score: 2

      Bingo. Much better to communicate the sounds and let the person draw conclusions. Otherwise it's a game of "Did they really say that? If not, what words sound like the ones I'm seeing spelled out." Speech to text really sucks, especially over the phone.

      It's interesting that not only do the model's lips move, but there are visual cues in the cheeks, nose, and throat.

    8. Re:Technology overkill by Kintanon · · Score: 3, Insightful

      How the hell do you draw that conclusion? How could speech to text be MORE processor intensive than converting speech to MOVEMENT on a face?! It's orders of magnitude harder to translate a sound into a muscle group movement on a computer generated face than it is to turn it into a group of characters representing that sound.

      Kintanon

      --
      Check out JoshJitsu.info for Brazilian Ji
    9. Re:Technology overkill by Anonymous Coward · · Score: 0


      Maybe it's for people who are deaf and "dumb" (they can't read).

    10. Re:Technology overkill by N3WBI3 · · Score: 2, Interesting
      We dont want the deaf people who cant read to be left out ;).

      On a more serious note not everyone reads lips in english, if you develop this right its common for any language

      --
    11. Re:Technology overkill by jandrese · · Score: 2

      Why not do both? That way the deaf person has the maximum amount of visual information to work with, especially if both methods (as is implied by my peer replies) are inaccurate.

      --

      I read the internet for the articles.
    12. Re:Technology overkill by Dephex+Twin · · Score: 2, Insightful

      Who says the latter is easier?

      By doing the pictures, you're essentially leaving the last part (converting the phonemes into words and sentences) to the deaf person reading the lips, instead of a computer.

      In order for the computer to do a more reliable job on this last part, it either has to take a long time or the processor has to be really fast. And even with that, the computer is still going to make a lot of mistakes.

      This is certainly nowhere near as brain-dead as you make it out to be.

      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
    13. Re:Technology overkill by sirius_bbr · · Score: 1

      A better approuch would be to build in a camera that records the lip movements, so the (unreliable) phonetics interpretation can be skipped.

      But then it would probably be more efficient to send streaming video of the speaking person...

      --
      this sig has intentionally been left blank
    14. Re:Technology overkill by Santos+L.+Halper · · Score: 1

      How about text-only, loaded with emoticon tags? I'm smiling as I say this!

      --

      "Ask not for whom the bone bones. It bones for thee." --Bender
    15. Re:Technology overkill by Santos+L.+Halper · · Score: 1

      The slashdot filter filtered out my tags. There were smilely face emoticons surrounding my last line there. Kind of like :) /:) but with the brackets around them. Dang it. It looks stupid now.

      --

      "Ask not for whom the bone bones. It bones for thee." --Bender
    16. Re:Technology overkill by billbaggins · · Score: 2
      Ok, let's compare... with either speech-to-text or speech-to-graphics, you first start off with speech-to-phonemes. This is highly nontrivial and likely to be a large part of whatever you end up with.

      To do speech-to-text well, you have to know the language being spoken, so that you can pick out the words and try to spell them right (since phoneme-to-letter mappings are not well-defined or predictable for most languages). On top of this, you have to somehow deal with slurrings (people on the phone are not necessarily the best enunciators in the world), slang, names, etc. etc. Then you have to do this for every language that you want to support.

      Text-to-graphics, on the other hand, is comparitively simple. Humans the world over have a relatively small number of sounds that they use (probably on the order of 300, if you don't count tonal variations, and you're trying to count every distinction that's made in some language) and the mapping from these onto facial shapes is fairly well-understood. There is (in theory) no tweaking needed to make it understand other languages, so when your deaf Chinese friend borrows it to call home it'll work without trouble. Tonal languages could be an issue... but really, deaf people are going to have trouble with that anyway, and this could even help there, since extra cues (a raised or lowered chin, say) could be used to indicate tone.

      It's not so much that there's anything wrong with speech-to-text as that this has the potential to be more right, esp. if used in combination w/ s2t. The fact that no word DB is needed makes it much more likely that s2g'll appear on a PDA that s2t, at least in the near future.

      --
      "The best argument against democracy is a five minute chat with the average voter."
      --Winston Churchill
    17. Re:Technology overkill by Dephex+Twin · · Score: 3, Interesting

      Think about it like this.

      If you have to say the sounds for the word "ow", what does that look like? There is a way for a computer to display this that would be pretty clear, and figuring this out would more or less require grabbing the "ow" picture group.

      Now, what if you have to write something with a "ow" sound in it, but this "ow" sound might be in the middle, beginning or end of any word? The sounds all around it have an effect on it. It might be spelled "au", "ow", "ao", "ough", god-knows-what-else. There are dozens and dozens of situations where this sound might arise. Including ways you might not even consciously think about. Figuring all this out is really hard for a computer to do, because it has no AI. It doesn't know what is being talked about.

      Probably mapping mouth movements isn't dead-easy, but I'd wager it is much easier than speech-to-text.

      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
    18. Re:Technology overkill by fishbowl · · Score: 2



      > What was wrong with speech to text?

      Speech to text is a much harder problem!

      You can map the articulation to an animated face without needing
      to know the language at all. Now, whether you can do it well enough
      to help a deaf person understand, is an open question.

      All you'd need to do to represent someone saying "Oh" versus "Ooooh" is map the phoneme to a shape.
      I'd imagine Fourier analysis would be oen of the more useful tools here.

      --
      -fb Everything not expressly forbidden is now mandatory.
    19. Re:Technology overkill by fishbowl · · Score: 5, Insightful


      "2. software interprets phonetics converts it into words"

      Is a very different, much more complex problem than:

      "2. software interprets phonetics into picture based lip movements"

      Consider that for the first example, we need the computer to understand the language,
      whereas in the second example, all the computer needs is a fourier transform and
      Max Headroom anatomy.

      Personally, I think it would be simpler and more effective to put a
      camera on the phone and transmit an image of the speakers face.

      --
      -fb Everything not expressly forbidden is now mandatory.
    20. Re:Technology overkill by terrab0t · · Score: 1

      "Personally, I think it would be simpler and more effective to put a camera on the phone and transmit an image of the speakers face."

      I don't think it would be either simpler nor more effective to do that. The advantage to this CG mouth is that it stays still, perfectly centered, and perfectly lit at all times. It also forms the all of it's facial positions clearly and more precicely than people bother to do when they talk.

      A person with a camera pointed at their mouth would be constantly moving out of view, poorly lit, and probably not concentrating on clearly mouthing their words.

      The other problem with effectiveness is the issue of bandwidth. A string of text or phonemes is phenominally smaller than a video stream.

      What I wonder about is whether or not it's easier for a deaf person to lip read than it is for a machine to convert the phonemes it hears into meaningful words and sentences.

      As mentioned above, the only difference between this and speech-to-text is how the phonemes are being interpreted and displayed by the machine. So is it better for a deaf person to read lips than machine interpreted sentences?
      How accurate are machine interpreters?
      How hard is it to read lips?

      If lip reading is much harder than text reading, would it be better to just display the interpreted sounds phonetically? It may look silly, but people who use abbreviated chat lingo can read it quite fluently (wut r u doin? l8tr, etc.).

    21. Re:Technology overkill by Anonymous Coward · · Score: 0

      only the dumb and dumber would say that... how sad people still talk like that sometimes

    22. Re:Technology overkill by nautical9 · · Score: 2
      How hard is it to read lips?
      Capt. Braddock: Okay no more bullshit [to Dave, talking fast] Capt. Braddock: was there or wasn't there a woman? Dave: Are you serious? Capt. Braddock: Yes I'm goddamn serious. Dave: Fuzzy Wuzzy was a woman? - Hear No Evil, See No Evil
    23. Re:Technology overkill by Cyno · · Score: 1

      It would be more effective, but who wants to give you that much bandwidth? Nobody is going to let you upload video from your phone or your home as long as there's money to be made selling content and advertising to you in the form of entertainment.

    24. Re:Technology overkill by sakeneko · · Score: 5, Interesting
      Point of fact this is unbelievably dumb and is right up there with converting Russian to German for an English speaker to read.

      Very well put!

      I've had deaf friends, one of whom attended Gallaudet University. (Famous liberal arts college for the deaf.) In addition, I lost most of my hearing for some years as a child -- fortunately, I got it back after surgery. I've thought about deafness, and dealt with it.

      Lip-reading works best for people who were hearing at one point and lost some or all of their hearing. I went deaf after I learned to talk, and went deaf slowly, which means I relied heavily upon it. People who have always been deaf often find lip-reading very difficult, or even impossible. When you have no concept of hearing or sound, trying to figure out what meaning is associated with specific lip movements is tough.

      This is true of learning to read, as well. A person who was already speaking, or could read, before going deaf has no real problem with reading. If you can't hear and never have heard, though, the concept of an alphabet and "sounding it out" makes no sense. A congenitally deaf person who wants to learn to read must learn each word as a whole, much as a Chinese or Japanese person who learns to read his/her language must learn each character separately.

      Since a congenitally deaf person faces a humongous task regardless of whether he/she is learning to read lips, or read and write, just which one do you think he/she would rather have to learn? In most cases, learning to read and write is going to be a lot more useful.

      From where I sit, speech to text would work better for most deaf people, congenitally deaf or not.

    25. Re:Technology overkill by GunFodder · · Score: 2

      Good question. I remember reading that a good lip-reader can only interpret 50% of what someone else is saying. So even a crappy speech to text system shouldn't have too much trouble beating that.

    26. Re:Technology overkill by Kintanon · · Score: 2

      You may be right, but I've SEEN speech to text. The stuff we use at work for converting news broadcasts to text is around 95% accurate. And it wasn't THAT expensive (I think 90$ for our copy) and it doesn't seem to take up much in the way of processing power.
      So I still have to put my vote in on the side of speech to text. There are too many subtleties to lipreading for me to believe a CELLPHONE will be able to display an image well enough for it using less resources than than speech to text takes. Plus, speech to text could theoretically be taken care of at some point between the two phones as far as the processing goes and then sent in final form to the recipient. The face HAS to be done on the cellphone itself...

      Kintanon

      --
      Check out JoshJitsu.info for Brazilian Ji
    27. Re:Technology overkill by Valluvan · · Score: 1

      Why not both, like the subtitles in foreign language movies ? With facial expressions and text underneath, the deaf person stands a better chance of getting what is said.

      --

      Science as a way of life.
    28. Re:Technology overkill by Dephex+Twin · · Score: 2
      You may be right, but I've SEEN speech to text. The stuff we use at work for converting news broadcasts to text is around 95% accurate.
      I have worked with speech recognition software as well, of course.

      There are a lot of applications for specialized speech recognition software that work much better because they have a narrower range of vocabulary to deal with. This makes an enormous amount of difference. Newscasters tend to have a certain "news" way of speaking, and they are reading from a prompter, so there is going to be a lot of consistency there. Also, sentence structure will be much more regular and subject matter is going to have a lot of common ground. I don't know your exact setup at work, but I would imagine that the speech recognition software is trained to the newscasters' voices. My guess is that there is more stuff going on than you think with this software.

      Additionally, think about the factors in a cellphone conversation that would make things more difficult. In a phone conversation, there will be ums and uh wells, etc. There will be a lot of stuttering and informal speech. If the signal isn't perfect and all or part of a word is left out, that can change every other word in the sentence (due to the nature of speech recognition-- things aren't figured out one word at a time, but rather one utterance at a time). If someone's name is not in the dictionary, it can easily be replaced with one or several words.

      When I was studying speech recognition, I tried varying the load on the processor and seeing how much that affected the speed. The speed started dropping dramatically right away. Your average desktop computer today is going to be able to handle speech recognition, as long as the processor has little to nothing else going on. But your average cellphone is going to have a LOT less processing power, and that will lead to an exponential slowdown in speed.
      There are too many subtleties to lipreading for me to believe a CELLPHONE will be able to display an image well enough for it using less resources than than speech to text takes.
      There are certainly subtleties in displaying lips, but they aren't as complicated. Displaying lips to read is only a portion of the steps you have to go through to do speech to text-- the easiest part. Both methods take the sounds and figure out which phonemes they represent. Speech recognition would be small and simple if it only had to spit out a string of phonetic symbols, not having to worry about separation between each word, the English language in general, etc. This is all the lip reading software has to do, but instead of spitting out symbols, each phoneme has a picture. Obviously, there are complications I'm probably not aware of, but I don't see how it could possibly be as complicated as the process of converting these sounds into proper English language sentences. The only complicated part I can think of is smoothing together each picture. On the other hand, converting into English is about the toughest thing you can possibly have to do with those phonemes.
      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
    29. Re:Technology overkill by Enzondio · · Score: 1

      Probably because of the amount of computing power required to do speech-to-text.

      It's not worth the extra money and battery power.

    30. Re:Technology overkill by Kintanon · · Score: 2

      I maintain that creating complex graphics out of sound is more difficult that creating text out of sound. However if they have some kind of simplified symbol library with a few hundred stick figure style representations of each sound then I think it would work. But at that point it's no longer "lipreading" it's phonetic representation, essentially creating a new alphabet of symbols that are written phoneticly.
      Come to think of it, that's not a bad idea.... It would require that the person using it spend some time learning it, perhaps having someone near them speak through the phone so they could get used to the correlations between a live person saying something and the cellphone representation...
      But true lipreading relies on cues in the throat, cheeks, eyes, eyebrows, nostrils, the entire face. It's not just a matter of watching two lips move around. I find it very hard to believe that a cellphone will be able to display enough detail for this to be feasible at a lower resource level than speech to text.

      Kintanon

      --
      Check out JoshJitsu.info for Brazilian Ji
    31. Re:Technology overkill by Dephex+Twin · · Score: 3, Interesting

      Those are some good points.

      For my interpretation of how the lipreading worked, I was looking at the sample on the website. It appeared to be that certain sounds had certain pictures. Anyway, it also had something that touched on what you were talking about. It had a little bit of extra information you can't normally see, like red dots on the cheeks for g and k type sounds, and a red nose for nasal sounds, etc. Extras like these might make up for some of the lacking facial cues.

      The other thing I was thinking about was that the lipreading could be part of the understanding process. A good number of people (most?) who are legally deaf are not truly 100% deaf. If a person is able to get a bit of auditory information, plus this lipsynching information, it might be enough to make things a lot easier for these people, even if the lip-reading by itself were too simplistic.

      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
    32. Re:Technology overkill by Kintanon · · Score: 2

      The other thing I was thinking about was that the lipreading could be part of the understanding process. A good number of people (most?) who are legally deaf are not truly 100% deaf. If a person is able to get a bit of auditory information, plus this lipsynching information, it might be enough to make things a lot easier for these people, even if the lip-reading by itself were too simplistic.

      This is true too, but frequently it won't apply to this particular application simply because cellphone sound quality tends to suck, and there tends to be a lot of louder interference around that can muffle out the peaks that the partially deaf use to differentiate sound cues. So I don't know how helpful the sound from this cellphone will be in helping to interprate their images... Unfortunately I haven't been able to run their little demo yet so I haven't seen the example. I'll try again when I get home later. Maybe I can get it to work in Phoenix....

      Kintanon

      --
      Check out JoshJitsu.info for Brazilian Ji
  4. What about TTY? by genka · · Score: 5, Interesting

    I worked with deaf people for a while and they were (and I am sure still are) disappointed that cell phones are not compatible with TTY devices. How difficult is this to do?

    1. Re:What about TTY? by mystik · · Score: 4, Funny

      My new Motorola phone I purchased this weekend mentions in it's menus Something about a TTY. I imagine I'd need data service from Verizon though.

      --
      Why aren't you encrypting your e-mail?
    2. Re:What about TTY? by quintessent · · Score: 2

      Hmmm. There's always the Blackberry or relay services.

      But yeah, with just a minute effort... It's slightly surprising that there hasn't been anything mandated by law on this front.

    3. Re:What about TTY? by mgrochmal · · Score: 3, Interesting
      Speaking as someone who works with trying to get lots of accessibility devices to communicate (for the blind and visually impaired, but similar principles apply), one of the main problems is deciding on a standard, followed by making sure it works with those that won't adhere to said standard.

      Case and point: I recently got a cellphone, so that someone else could have their phone back. I shopped around for a while and settled for one that would be free after rebate. I had it for a few days, and returned it for a more expensive one. First, the phone had an odd number layout, so I had to relearn the key mapping (the keys were part of a curve, instead of straight across). Second, I use a laptop to connect to the Internet, and I occasionally use a cell phone adapter to do it. The phone I bought was incompatible with the connector, and the phone's manufacturer had no immediate plans to make one. Those two reasons, as well as several other factors, prompted a return.

      If the cell phone companies would agree on a single interface, it would make the compatibility much easier to implement. Not only that, but the TTY devices need the information to implement all the various brands and models of cell phones. The possibility's there, but there's not much of a chance it'll happen anytime soon.

      --
      This .sig Intentionally Left Blank.
    4. Re:What about TTY? by Anonymous Coward · · Score: 0

      I imagine I'd need data service from Verizon though.

      No. You would only require a modem that understands the TDD protocol. It sounds like your phone might already have it.

    5. Re:What about TTY? by nuggz · · Score: 2

      I shopped around for a while and settled for one that would be free after rebate. I had it for a few days, and returned it for a more expensive one.

      Why didn't you just shop around a bit longer and make sure it would suit your needs?

    6. Re:What about TTY? by mgrochmal · · Score: 1

      Local selection is fairly slim, and I was strapped for cash. I was hoping I would be able to get by with the cheap one. Mistakes happen. I'm human. As for the peripheral problem, the salesman told me that such adapters would be out soon. That's what I get for listening for salesman.

      --
      This .sig Intentionally Left Blank.
    7. Re:What about TTY? by PeteEMT · · Score: 2, Informative

      Ok I'm deaf so I've actually used this

      The new phones are TTY compatible, they do not have a TTY in them, but if you hook a TTY to them it actually works, whereas with the other digital phones that aren't TTY compatible (right now the majority) you get alot of garbage characters.

      Analog cell phones are unaffected and with a TTY just fine without modification

      --
      Pete
    8. Re:What about TTY? by Anonymous Coward · · Score: 0

      It's good to hear what deaf have to say.
      please moderate up those post.

      My question is about GSM (cellular phone) live chat feature. I think my nokia has got that feature. I don't know if it is anyway standard and I found no use of this since typing on a small phone is painfull and costly when online.

      Are deaf peaple using that feature?

    9. Re:What about TTY? by Anonymous Coward · · Score: 0

      There is actually a company called Verbalink that has a solution (which can be used by operators) for this. With their system, one can make textphone calls mobile-to-mobile, landline TTY-to-mobile and the other way around. There are operators running the service today.

    10. Re:What about TTY? by Tschepsit · · Score: 1

      The vast majority of cell phone networks are TTY-compatible and have been for years. I can't speak toward the widespread availability of this feature on the mobile side, but I know that commercial phones with TTY support DO exist.

    11. Re:What about TTY? by PeteEMT · · Score: 1

      I don't have a GSM phone, but my Cell Phone has SMS. I use that pretty extensively. I was using AIM via the WAP browser but AOL blocked Verizon Access and the AIM SMS thing that replaced it stinks.

      I do know that the Danger Sidekick (GSM) has taken off in a big way in the deaf community. Sadly the GSM coverage where I'm at is very poor.

      --
      Pete
    12. Re:What about TTY? by Anonymous Coward · · Score: 0

      FCC is mandating compliance with TTY. So most phones have TTY Compatbility iether via an accessory of internal.

  5. nice... by Skal+Tura · · Score: 1

    Heh, nice for the impaired, good thing there is allways people developing things like this. Tho this one must be expensive but does the impaired ones care as in many countries the gov. pays for the aiding stuff like this, wheel chairs etc... My friend is impaired and gov pays nearly everything, in some small things he must pay his own part.

  6. Voltron? by heka-rup · · Score: 0, Offtopic

    Voltron was doing stuff like this in 1984. He's sweet and awesome.

    1. Re:Voltron? by Anonymous Coward · · Score: 0

      wtf?

  7. One flaw ... by Greedo · · Score: 3, Funny

    No downloadable ring-tones.

    --
    Tuus crepidae innexilis sunt.
    1. Re:One flaw ... by Anonymous Coward · · Score: 0

      :-o :-O :-0 :-P :-D :-/ :-| :-\

      8-)

      obligatory /. crud filter workaround

    2. Re:One flaw ... by Anonymous Coward · · Score: 0

      In space, no one can hear you scream "Bingo"

      they can if they can read lips!

  8. And why isnt it just realtime text??? by Havoc'ing · · Score: 1

    Why in the world do they bother with the lip-sync, and just use real time text-banner style. Gotta wonder about over-engineering sometimes.

    1. Re:And why isnt it just realtime text??? by Cap'n+Canuck · · Score: 2

      Because there's probably a way lower error rate on lip reading compared to Voice2Text.

    2. Re:And why isnt it just realtime text??? by Havoc'ing · · Score: 1

      Voice to Lip-Sync? it has to be decompiled on one end someplace in-order to interpret it on the other.

    3. Re:And why isnt it just realtime text??? by Dephex+Twin · · Score: 3, Informative

      Seems like it's not over-engineering. This is less steps than speech-to-text as far as I can see.

      You have to record the speech and convert those sounds into phonemes. Now all you do is use the picture(s) that go with that phoneme, which is going to be more or less consistent.

      With speech-to-text you have to use probability and word banks to figure out what the heck words those phonemes are supposed to go with, which is the hardest part by far, because spelling and grammar is so inconsistent. That requires a lot more time and computing power, and you are prone to a bunch more mistakes of course.

      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
    4. Re:And why isnt it just realtime text??? by Anonymous Coward · · Score: 0

      bullshit. animating a face on a computer screen to lip-sync a conversation in near-real-time accurately enough so a lip-reader can understand HAS to be a more demanding task.

    5. Re:And why isnt it just realtime text??? by Dephex+Twin · · Score: 1

      Well, that just shows your lack of understanding of the speech-to-text field.

      Do a little reading on the topic.

      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
  9. Yikes! by RumGunner · · Score: 2

    That's probably the most frightening anthropomorphic mouth I've ever seen animated!

    .

  10. Crikey, they're already bad enough drivers by typical+geek · · Score: 2

    Amaxzingly enough, the deaf can drive. I live near a technical deaf school, and quite a few drive. I just hope they don't try to use this cell ohone and drive at teh same time.

    1. Re:Crikey, they're already bad enough drivers by Anonymous Coward · · Score: 0

      Hi Mr. Idiot, how are you? You might not realize it but deaf drivers are typically much better drivers than you might think.

      Deaf people rely more on their vision (to compensate for lack of a sense).

      Deaf people are typically more receptive of things that happen on the road. In all the years I've been driving, most of the accidents the deaf drivers get into, it is because a hearing retard was behind the wheel either drinking their ass off or chatting their ass off on a cell phone.

      It amazes me how many stereotypical idiotic retards are living out there in their own little worlds, thinking deaf people can't do jack shit except make weird noises.

      In case you were wondering, yup, I'm deaf.

    2. Re:Crikey, they're already bad enough drivers by Anonymous Coward · · Score: 0

      I agree. It's one thing to make a joke that deaf people shouldn't be using this while driving, it's another to preface that with "amazingly, the deaf CAN drive".

      Maybe I should preface all my posts with "Amazingly, Slashdotters HAVE talked to a woman in real life at some point, but anyway..."

    3. Re:Crikey, they're already bad enough drivers by NilObject · · Score: 1

      What makes you think the deaf cant drive? That's like saying the blind can't listen to music, the mute can't read, etc etc. The nice thing about being a deaf driver would be that you wouldn't have to listen to those stupid double-digit Civics with "phat" mufflers on them (or the pathetic music these morons blast on their cheap-a** stereos, thinking it's the coolest thing). Oh, and on the topic of these cars, why do they put rear air spoilers on a front wheel drive car? *Sigh* My P.O.S. 1987 Nissan Maxima has more power than some of these. I shall now burn in "-1 Flamebait" and "-1 Offtopic" hell. AGHHH!!!

    4. Re:Crikey, they're already bad enough drivers by Kintanon · · Score: 2

      Eh? How is this different from the jackass with his stereo on full volume so loud that I can't hear the stereo in MY car driving? He sure as hell can't hear it if someone beeps a horn at him or tires start squeling....

      Blind people driving though... That would be scary.

      Kintanon

      --
      Check out JoshJitsu.info for Brazilian Ji
    5. Re:Crikey, they're already bad enough drivers by Misch · · Score: 2

      What makes you think the deaf cant drive?

      There actually used to be laws against it. IIRC, correctly, the only thing needed in NYS is to have a larger rear-view mirror now, and I'm not even certain if that is a requirement.

      --

      --You will rephrase your request for me to go to hell. Goto statements are not acceptable programming constructs
    6. Re:Crikey, they're already bad enough drivers by PeteEMT · · Score: 1

      Yep, they put a restriction on your license:
      Wide View Mirror or Hearing Aid

      DMV couldn't find anything that says what exactly constitutes a wide view mirror though.

      Also in NYS, Deaf Drivers can't have a CDL (Commercial Driver's License) but most other states this ok

      --
      Pete
    7. Re:Crikey, they're already bad enough drivers by Anonymous Coward · · Score: 0

      Rear spoilers on a fwd make sense, since the rear ends are fairly light and it keeps the rear wheels on the ground. If you have bad tires on a fwd vehicle, it is pretty easy to get the rear end loose and have it spin on you.

    8. Re:Crikey, they're already bad enough drivers by soundofthemoon · · Score: 1

      How's this for a kick in the pants? Statistically, Deaf people are safer drivers than hearing people. Insurance companies used to not give car insurance to Deaf folks, but now they give them lower rates!

      I have a bunch of Deaf friends, since I started learning ASL a few years ago. Driving with them is crazy. Scares the hell out of me when they try to converse with me while driving, but never seems to be a problem.

      I read somewhere that one of the major causes of car accidents is drivers being distracted by tuning the radio. I guess that's not a problem for Deaf people =)

  11. What about deaf mutes? by SonicBurst · · Score: 1

    Sure, they'll be able to "hear" you, but how will you hear them? Seems like this company has only half the equation here. Also, not that I'm a lip reader, but those demos were very erratic for me to lip read.

    --

    Geek used to be a four letter word. Now it's a six-figure one.
    1. Re:What about deaf mutes? by stratjakt · · Score: 1

      TTY devices (basically text-to-speech).

      All the lipreading skills in the world don't help you watch old Godzilla flicks. When will they fix that?

      --
      I don't need no instructions to know how to rock!!!!
    2. Re:What about deaf mutes? by SonicBurst · · Score: 1

      TTY is great and all, but since this was promoted as a cell-phone app, I find it very hard to believe that anyone would even attempt to teletype on a cell phone or pda for that matter.

      --

      Geek used to be a four letter word. Now it's a six-figure one.
    3. Re:What about deaf mutes? by Anonymous Coward · · Score: 0

      what, you can't understand angry retarded grunting noises?

  12. Re:Interesting by Anonymous Coward · · Score: 0

    This has to be the most assinine attempt at a first post I've ever seen. It would most likely be helpful to the disabled community in an article entitled "Cell Phones for the Deaf"?? Come up with something more original, PLEASE.

  13. Ugly by Superfreaker · · Score: 2

    If it is anything like the demo they have on their site, this technology is doomed.

    I hope to God they are not using Flash to deliver this product.
    Uhhgg!

  14. why bother with animation by clattymine · · Score: 1

    why dont you just translate it into words ... reading words is probly easier than lip reading

  15. Complicated by batboy78 · · Score: 4, Insightful

    This just seems complicated, why can't they just improve the speech to text capability. It seems like drawing a face with life-like facial movements to enable lip reading is a little beyond the scope of power for a PDA.

    1. Re:Complicated by Dephex+Twin · · Score: 2

      It's surprising how many people here don't realize that converting a sound to words is harder than converting a sound to pictures of lip movements.

      Converting phonemes into sentences requires context. Right now, speech recognition software "simulates" context by having large word banks and using probability. It tries to guess from sample text what the most likely string of words was. This is really not easy. This is why you have to train with speech recognition software. It tries to build up a database of likely things you'll say. It's like Tivo, sort of. And it can get a sentence totally wrong sometimes (often?). If you have a slow processor, going through all this data can take a REALLY long time.

      Speech-to-text might be out of the scope of today's PDA to do, whereas the lip-synching stuff wouldn't be.

      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
  16. Good idea and good start but.... by BWJones · · Score: 5, Informative

    This is a fantastic idea which will enable communication for the vast numbers of hearing impaired, however if the web-site is any indication, the technology needs improvement. I'm pretty good at reading lips and I was working pretty hard to figure out what was being said with the sound off.

    --
    Visit Jonesblog and say hello.
    1. Re:Good idea and good start but.... by BWJones · · Score: 2

      As a quick followup to my previous post...The other thing that I wonder about however is if they have a good speech to text engine onboard and the lip reading is so bad, why not simply perform speech to text? Cell phone screens are high enough resolution that considerable text could be displayed at once.

      --
      Visit Jonesblog and say hello.
    2. Re:Good idea and good start but.... by Gudlyf · · Score: 2
      Yeah, from the demo on their site I translated it as something like:

      "Ee Peach View. EIEIO."

      --
      Trolls lurk everywhere. Mod them down.
    3. Re:Good idea and good start but.... by MadBurner · · Score: 1

      The lip reading seems to be the consumer driven "cool factor". Of course text to speech would be better. But, people are so stupid they'd be trying to read their phones while driving. and I swear the next idiot that almost runs me off the freeway because they are on the phone instead of concentrating on merging from the onramp... Ow never mind. Intellegent and consumer don't belong in the same sentence.

  17. deaf terrorists now have the edge.. by Anonymous Coward · · Score: 0

    Now the fascist US government cant monitor all the cell phone activity with the receiving end... deaf people will be recruted to attack key positions...

  18. How long... by Anonymous Coward · · Score: 0

    ...before some deaf guy sues a cell phone maker for discrimination because those of us who can hear have a huge selection of ringtones to choose from, while the deaf can only use vibrate mode?

    Will cell phone makers have to make phones that 'hum' the ringtones via vibration?

    1. Re:How long... by Skal+Tura · · Score: 1

      to make them to 'hum' could be possible, but i think there will be a market share (although small) for 'vibration tones' ;) Lets see whos first to try that! ;)

  19. Uhhh... by NilObject · · Score: 5, Funny

    Being a severely hearing impaired person, I do find the virtual person's "O"'s to be highly disturbing if not graphic. Yikes.

  20. Re:To quote Amadeus... by Anonymous Coward · · Score: 0

    Damn, you beat me to it.

  21. If only Helen Keller were still around... by SoCalChris · · Score: 2

    She could call her bank to find the nearest drive through ATM with Braille!

    Oh wait, she'd still need to see the cell phone. Never mind. I guess its a good thing she isn't still here.

    1. Re:If only Helen Keller were still around... by Anonymous Coward · · Score: 0

      Did you know what her favorite color was?

      Cordoroy!

    2. Re:If only Helen Keller were still around... by Anonymous Coward · · Score: 0

      Hellen Keller walks into a Sears one day, and grabs her seeing eye dog by the tail and starts swinging him over her head. A sales clerk comes over, and asks her if she needs any help, to which she replies "No thanks, I'm just looking around!"

  22. Olive Juice by Anonymous Coward · · Score: 0

    Can you pick up some Olive Juice while you are at the store?

    What, I love you too man!

  23. Max Headroom by Anonymous Coward · · Score: 0

    I'm just wondering what the face will look like... I'm just hoping that it looks like Max Headroom... I'd get on if that was the case!

    -Magiluke

  24. Accessibility for the deaf by _Sambo · · Score: 1

    Glad to hear that the deaf will finally be able to use mobile communications. If they have to look at a screen while they're using it, people had better not use these while driving...

    Somewhat reminiscent of the blind driving project:

    http://www.rallyracingnews.com/blinddrv.html

    I

  25. Wouldn't it be simpler to translate to text? by bartash · · Score: 2

    Presumably this technology does
    speech->text->animated model
    Wouldn't it be simpler to present the text to the user? I would have thought text->human is much higher bandwidth than animated-model->human.

    --
    Read Epic the first RPG novel.
    1. Re:Wouldn't it be simpler to translate to text? by Cap'n+Canuck · · Score: 2

      No, I'm guessing the process is more like:
      Speech->soundex (or equivalent)->animation.

    2. Re:Wouldn't it be simpler to translate to text? by Anonymous Coward · · Score: 0
      No, I'm guessing the process is more like:
      Speech->soundex (or equivalent)->animation.
      Which brings us to the question - why not print the soundex in a textual format and have the deaf person read and interpret that instead? Sure, it's something that would have to be learned, but so is braile, or lipreading, or morse code.
    3. Re:Wouldn't it be simpler to translate to text? by Cap'n+Canuck · · Score: 2

      Because soundex->text is the heart of speech-to-text, which is why speech->text is so crappy! Take a look at soundex codes; widely different text produces the same soundex code. Context doesn't help, either, and neither does accents. Soundex is more closely approximated by mouth movements!

    4. Re:Wouldn't it be simpler to translate to text? by Anonymous Coward · · Score: 0

      I think you misunderstood what I meant. I merely wanted the soundex codes to be represented on screen in a readable format - they could be glyphs for all I care. Instead of trying to convert soundex->facial animation, just give the deaf person the soundex data directly. By text I didn't mean words.

  26. huh? by Anonymous Coward · · Score: 0

    what?

    1. Re:Huh? by Anonymous Coward · · Score: 0

      Vibrate, dumbass

    2. Re:Huh? by grape_soda · · Score: 1

      vibrating alert?

  27. Speaking from experience by FunkyELF · · Score: 5, Insightful

    I lived with a deaf room-mate last year. It took me about 2 months for me to understand what he was saying, and took him about the same to get used to my lips. Anytime he meets someone new, its very hard for him to read their lips (i.e. every time a new telemarketer tries to prey on the deaf user). Also, its not just the lips, its the tounge also. It'd probably be easier to use speach-> text software than this stuff....and what about background noise? I doubt this thing works well if not at all.

    1. Re:Speaking from experience by Anonymous Coward · · Score: 0

      Next time make out with him... That way he can experience your lips and tongue...

      Damn dude, you make it so easy to flame!

    2. Re:Speaking from experience by Anonymous Coward · · Score: 0

      flame? what kind of lame-o wanna be troll are you. you suck badly.

    3. Re:Speaking from experience by Anonymous Coward · · Score: 0
      Anytime he meets someone new, its very hard for him to read their lips (i.e. every time a new telemarketer tries to prey on the deaf user).


      Of course its hard for him to read telemarketers lips, they're on the other end of the phone.
    4. Re:Speaking from experience by Anonymous Coward · · Score: 0
      its very hard for him to read their lips (i.e. every time a new telemarketer tries to prey on the deaf user

      Say whuh?? Hard for him to read the telemarketer's lips? That doesn't surprise me for some reason...

  28. bt wht i rlly wnt by Anonymous Coward · · Score: 0

    is a prgm tht trnslts txt msgs in 2 rdbl txt

  29. scary... by paranoos · · Score: 1
    OK, who else thinks the demonstration looks incredibly scary? my first thoughts were "No! No, don't eat me!"

    More seriously though, I couldn't figure out what she was saying... I don't claim to be a lip-reading expert, but I can make out maybe 60% of what people are saying on TV while muted.

    Anybody care to smack me with a cluestick?

  30. Yeah, sure by Wind_Walker · · Score: 3, Interesting
    I was one of the fortunate ones who got to the company's website before it got Slashdotted, and was able to view the "demonstration" of their software. The demo consists of a mouth saying "Thank You" in various languages. I looked at English and Spanish, the two I know best.

    I sure as hell couldn't tell you what they were saying, even when I knew what words were coming out of their mouth. And this is not to mention cell phone static, distractions, contractions, mumbling, and lots of "ummm" and "uhhhh" that occurs during normal speech. I really don't see how this is a viable communication method.

    Maybe it's because I'm not experienced with lip reading. Maybe people who are deaf are better at it than I am, but I can usually tell what Football coaches are saying on the sidelines of games (of course, that's limited to "Bull****" and "You've gotta be ****ing kidding me!", but still...)

    1. Re:Yeah, sure by bananahammock · · Score: 0

      Sorry, but you're knocking this technology and you have what, no experience in lip reading? And your point is?

  31. Is it just me? by suman28 · · Score: 2

    First of all, I have to say this is a great idea. Just because you are deaf doesn't mean you can't use a cellphone. I have a cousin who is deaf. The last I talked to her, she was using sign lang. She was not reading lips (atleast I don't think). I personally don't know how to read lips. So, is this really going to take off?

    1. Re:Is it just me? by quintessent · · Score: 2

      I think this will be especially useful to people who are partially deaf. For instance, if someone can hear limited frequencies, then sounds like "s" might not be audible. Looking at someone's mouth provides a way to compensate.

  32. SMS by Anonymous Coward · · Score: 0

    I thought thats what SMS was for?

  33. Logo? by Anonymous Coward · · Score: 0

    Is it my imagination, or does their logo bear a striking resemblance to NVIDIA's?

  34. oh goody by Anonymous Coward · · Score: 1, Funny

    now deaf people can get brain cancer too...

  35. Er - have you heard of text messages? by Anonymous Coward · · Score: 0

    I'm confused.

  36. WHy should they. by Unknown+Poltroon · · Score: 4, Insightful

    I still cnat get coverage, or hear the other person clearly, why should the deaf be different? But i can ply 3 different games and send a fucking picture of a duck. Stupid phone companies. Its a fucking phone!! First, fix it so i can hear someone, THEN gimme the damn bowling games.

    OK, this might be a troll. Im not sure myself. Its definately a vent. Fucking sprint. Oh well.

    --
    All Troll + "offtopic" mods are meta moderated as "Unfair", because you abused the system.
    1. Re:WHy should they. by 5KVGhost · · Score: 2

      Stupid phone companies. Its a fucking phone!! First, fix it so i can hear someone, THEN gimme the damn bowling games.

      Yes, mobile phone service sucks because they've pulled all their best engineers out of the field and made them write bowling games instead.

      Negatory, good buddy. Building fancier handsets is trivial compared to actually improving coverage, and it's not like the two are mutually exclusive. Let's say the phone companies overcame all the various technical problems like interference, desensing, varying topography, etc. That still leaves political problems like unrealistic people in upscale neighborhoods who magically want 100% coverage without any visible antennas and dumb people who apparently think that cell phone towers will steal their souls.

      It would also help if the phone companies would stop taking on massive numbers of new customers when they can't support the ones they have, but I guess the money has to come from somewhere.

  37. Ugh by NilObject · · Score: 3, Insightful

    I just can not picture myself on a bus looking at this wildly articulate mout while yelling back: "Can yoo reepeeet dat agaannn???" Yes, I am hearing impaired. I would NEVER touch this thing. I'll stick with 2 way messaging.

  38. lip reading.. by pretzel_logic · · Score: 3, Funny

    look at a lip reader and say:

    'I want a fig newton'

    IMHO:
    too many flaws, the investors will back out

    --

    pretzel_logic
    1. Re:lip reading.. by _ph1ux_ · · Score: 2

      I want to vacuum is more acurate.

      Dont even need to say that to a lip reader. Try mouthing it to the hot blonde across the room sometime. Everyone can lip-read that phrase.

    2. Re:lip reading.. by Anonymous Coward · · Score: 0

      you might get laid

      or some fig newtons

      depends on how well you annunciate

    3. Re:lip reading.. by Dephex+Twin · · Score: 3, Insightful

      Well, for comparison, see how well a speech recognition program does with the same sentence.

      And unless you just randomly blurted out the sentence, you probably have context in the surrounding sentences (e.g. you are talking about fig newtons, food in general, newton's law, whatever).

      I'd definitely put my money on the lip-reader, frankly.

      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
  39. frightenly useful by tyler_larson · · Score: 2

    Somehow I don't think that a 5 fps animated mouth is going to catch on as a major tool for the hearing impared.

    "Sure looked good on paper...."

    --
    "With sufficient thrust, pigs fly just fine. However, this is not necessarily a good idea...."
    RFC 1925
    1. Re:frightenly useful by whovian · · Score: 1

      Beat me to the punch there, kiddo. I was going to say something like the bitrate seemed a bit low.

      Funny, I think the italian mouth was saying 'gelato' but I don't speak italian. Even with english as my mother tongue I couldn't figure out what the english one was mouthing.

      --
      To-do List: Receive telemarketing call during a tornado warning. Check.
  40. Faces by mmol_6453 · · Score: 2

    You have a choice between several celebrities' faces.

    Or you could mix n match your own. :)

    --
    What's this Submit thingy do?
    1. Re:Faces by N3WBI3 · · Score: 1

      Yea but unless you can make it a different face for every call. how disturbing would that be if you picked some hot chicks face while talking to a dog, I mean youre only supposed to have beer goggles when you are drunk..

      --
  41. its not cell phones by Anonymous Coward · · Score: 0

    its mobile phones. jeez, why doe the usa create their own word for EVERYTHING.

    1. Re:its not cell phones by magnum3065 · · Score: 1

      In order to clarify the type of mobile phone. Satellite phones could also be considered mobile phones, but they definately aren't cell(ular) phones.

  42. Office space.. by stephenisu · · Score: 1

    And now I can show her my 'O' face.. you know.. my 'OOoo' face.

    --
    Sigs? We don't need no stinking sigs!
  43. Why Lip Reading instead of Signing? by serutan · · Score: 2

    Accurate lip reading is a lot more difficult than sign language. SpeechView would have a much more usable product if they animated signing hands instead of a speaking face. I guess the software would be more complicated since it would involve speech recognition instead of just sound mimicry.

    1. Re:Why Lip Reading instead of Signing? by cindy · · Score: 1

      There's a company called Vcom3D that has a text to sign technology that's pretty amazing. It would seem that if TTY wouldn't work for some reason, this would be a much better solution than lip reading.

    2. Re:Why Lip Reading instead of Signing? by Anonymous Coward · · Score: 0

      The problem is a bit more complicated than that. American Sign Language is a completely different language than English. If it is to be truly useful to any one it will need to be a full universal translator. Or you could talk in ASL grammar, I know an interpreter who dose that every once in a while, it sounds very weird

  44. Re:Interesting by Anonymous Coward · · Score: 0

    Write Santa and ask him for a CLUE FOR CHRISTMAS!

    Asshole...

  45. Okaaay.... by Anonymous Coward · · Score: 0

    So this software translates speech to text, parses the text, and then outputs an animated face which renders the text into mouth positions? Hm... how about skipping the stupid naked chick phase and just output the text directly?

    I think this would be much better suited to creating animated films instead of enabling the deaf to see a crappy rendition of mouth movements. I'd bet this technology was taken directly from digital animation software companies and hacked to fit this application.

    Of course, all this just begs the question: does it show a puckered asshole flapping if you make a farting noise into the phone?

  46. Can you.. by uberstool · · Score: 1

    Can you see my lips now? Good!

  47. In a related story... by woogieoogieboogie · · Score: 2

    Developers are nearing a major breakthrough in 5.1 Surround Spacial Narrative Vision(TM). This amazing new technology targeted towards blind people immerses the blind viewer in an immersive field which narrates the scenery as an overlay of the movie soundtrack.

    --
    ... Governments are instituted among Men, deriving their just Powers from the Consent of the Governed...
  48. This is Slashdot by Anonymous Coward · · Score: 0

    Don't you mean:
    ....halfway decent than.

    1. Re:This is Slashdot by Anonymous Coward · · Score: 0

      No, he doesn't. Go back to grammar school.

    2. Re:This is Slashdot by Anonymous Coward · · Score: 0

      Ok, but you need to go to lame-joke-recognition school first.

  49. Re:Interesting by Anonymous Coward · · Score: 0

    I agree with this post.

  50. This doesn't really do much... by Cyclopedian · · Score: 5, Informative
    Lip reading is only half the whole "info-stream" that comes out of peoples mouths. I know this. I'm deaf (severe to profound sensori-neural hearing loss, since birth) and I'll tell you one thing: lip-reading can give ambiguous results.

    Someone can say "Pot" and yet with the same lip movement, can also say "My". Men with bushy mustaches are a lip-reading disaster.

    For me, I've adapted in my own way: I rely heavily on my hearing aids. That combination of both lip-reading and hearing the audio stream from your mouth enables me to achieve at least a 70% success rate (under ideal conditions, if it's a party atomosphere, fudgeddaboutit). I've had hearing aids since I was 1 1/2, and only with extensive speech therapy can I speak well. I'm one of the few deaf-from-birth people that can do it this well. So, from that perspective, I can speak on a phone (as long as I can understand that mangled audio coming out the receiver, which is 0%).

    Why don't they just focus on speech recognition? A great speech recognition phone would enable deaf people that speak to use phones for near real-time conversations. In addition, such technology can also be (easily?) adapted to foreign language translators for tourists.

    However, until such technology is available at the consumer level, I'm stuck with two-way text messaging devices like the T-Mobile SideKick.

    -Cyc

    1. Re:This doesn't really do much... by Dephex+Twin · · Score: 2
      A great speech recognition phone would enable deaf people that speak to use phones for near real-time conversations. In addition, such technology can also be (easily?) adapted to foreign language translators for tourists.

      Actually, this is not so easily done.

      You come across a number of ways to introduce errors. First, recognizing the speech and figuring out the phonemes, with some margin of error. Then, you have to convert these phonemes to text, which requires a lot of computing power to do in real time, and you introduce a lot more errors. Then you have to translate this text. We all know how babelfish is. So you'd end up with a very garbled (probably unintelligible) message.

      I'm not saying this is impossible, just that this is no trivial task.

      In the meantime, I think this technology we see in the article is a step in the right direction.
      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
    2. Re:This doesn't really do much... by Anonymous Coward · · Score: 0

      I go along with this as I have a profound sensorineural loss since birth as well. I've been wearing hearing aids since age 3 1/2, and while I speak better than 90% of the hearing-impaired people I know, I can tell ya bars and parties are just really bad places for me too.

      As a result of my late start, I didn't really catch up with my age group in vocabulary and conversational skills until I got to college. Whenever I sit in a classroom, I always have to sit at or near the front and focus on the teacher's face so I use the combination of seeing the lips move with the audio stream.

      Lip-reading by itself is just damn difficult. Try telling the difference between the words shoe, chew and sew. There are many such similarities in English, and the only way to figure it out is from the context of the conversation. We need the audio to go with the lips. And even then, it often takes too much time to figure out the context when the conversation is moving along rapidly.And I'm a pretty smart guy. How would that work for others not as quick mentally?

      I can use cell phones with my hearing aids (foreign accents are murder in this medium), but for assistive listening I would prefer text to lips, but I understand from the experts here that that is much more difficult. So give me a video camera in pda's so the picture is big enough for me to see the other person's lips move as they talk... now you'd be on to something really usable.

  51. Read My Lips by bytesmythe · · Score: 5, Interesting

    I thought it seemed a little weird at first, but then I checked out the other demos. When I knew what the words were ("Thank you" in English, German, French, Spanish, and Japanese), I could easily tell what was being said.

    I notice a lot of people complaining about improving text-to-speech, which is far more advanced than this technology. Speech sounds come out in a continuous flow. Getting a computer to recognize the breaks between words, properly spell them reliably, etc. is hard enough on a desktop system, much less a PDA. Especially considering in languages like English, where most vowels in unstressed syllables are rendered vocally as "uh".

    This system simply has to hear a sound, and immediately display an associated... well, not "grapheme", since this isn't writing... maybe "pixeme". It is the graphical equivalent of attempting to spell perfectly phonetically.

    Also, if you didn't notice it, "invisible" sounds that occur on the back of the tongue are indicated by circles on the cheeks (like hard 'g' and 'k'), and nasal sounds are indicated by a darkening of the nose.

    All in all, I think this is an interesting idea. It will be even cooler when they can render different faces so the "avatar" resembles the person to whom you're speaking.

    --
    bytesmythe
    Hypocrisy is the resin that holds the plywood of society together.
    -- Scott Meyer
    1. Re:Read My Lips by Anonymous Coward · · Score: 0

      The term you're looking for is "viseme".

    2. Re:Read My Lips by ShinmaWa · · Score: 1

      When I knew what the words were, I could easily tell what was being said.

      Go figure. When you knew what was being said, you could easily tell what was being said. *boggle*

      The main problem is that this thing that the vast majority of the deaf community can not lip read. Those that can would probably have a very hard time with this since this would only be (at best) an approximation of the mouth movements and be completely without the subtle "non-verbal" cues that lip reading relies so heavily on (such as facial expressions, head motions, eye movements and other such things).

      --
      The /. Effect: Thousands of users simultaneously accessing a site to not read its content.
    3. Re:Read My Lips by bytesmythe · · Score: 2
      Go figure. When you knew what was being said, you could easily tell what was being said. *boggle*

      Hmm... I suppose in retrospect, what I was trying to get across and what I said are two different things.

      What I was trying to get at is that lip reading is easier based on context. I know what the words are likely to be, so I can tell what's being said more easily. If I tell someone "Thank you", I expect a response from a fairly standard repertoire. Perhaps "you're welcome", "no problem", or something of the sort. If someone said "licking glass bonkers makes pencils freeze", I might be somewhat perplexed, but we usually don't have to deal with non sequiters of that nature. Lip reading the animated face with appropriate contextual cues would seem to be fairly easy to learn for someone who could already lip read.

      The main problem is that this thing that the vast majority of the deaf community can not lip read.

      I won't doubt it, but I tried to avoid singling out this technology as something solely for the deaf. A similar package might be used to add more life-like interfaces to computer systems someday. Here's an example:
      AI systems can rival (if not surpass) human ability to do things like recognize faces, determine a person's gender, etc. A system specially designed to listen to speech phonemes and then represent them graphically might make speech-based interfaces easier to understand in noisy environments, because we'd be able to see what was being said as well as hear it. Being able to watch a mouth form the phonemes makes it immesurably easier to understand words that might otherwise be misheard.

      --
      bytesmythe
      Hypocrisy is the resin that holds the plywood of society together.
      -- Scott Meyer
  52. The reasons this is better than speech-to-text by zipwow · · Score: 5, Informative

    Partly, because speech to text isn't very good.

    Speech to text isn't very good because its very hard to turn phonetics into words. Our ability to understand people is very reliant on context. Knowing what's been said helps you understand what's being said.

    Some will say that speech to text is getting fairly good in English, which is somewhat true. Obviously, though, there are bigger markets in other languages.

    So how does this thing work, if it doesn't do speech to text? It does speech to phonetics, and phonetics to lips.

    For example, its relatively easy to understand when someone has said "h -ee- r", but knowing if that's supposed to be "here" or "hear" is quite difficult.

    This is why the same software works across languages. "Th" is "Th" in any language, and your single algorithm doesn't have to care.

    -Zipwow

    --
    I don't know which is more depressing, that 2/3 didn't care enough to vote, or that 1/2 of those that did are crazy.
    1. Re:The reasons this is better than speech-to-text by Anonymous Coward · · Score: 0

      You don't need to know the difference between "here" and "hear." Just as with lip reading, the difference is inferred. I would think it would be easier for a person to adapt to a phonetic alphabet than trying to interpret a computer model of animated lips:

      kan yu heer mee frum ovr heer?

      The new 'leet speek?

    2. Re:The reasons this is better than speech-to-text by fishbowl · · Score: 2

      "Speech to text isn't very good because its very hard to turn phonetics into words."

      I wonder why it wouldn't be appropriate to deliver the phonetics themselves?
      In my university language classes, I'm expected to read French from a standard
      phonetic alphabet. If the device in the article is truly mapping phonemes from
      speech, then a representation of those phonemes would be very useful;
      never mind representing them into a given language.

      --
      -fb Everything not expressly forbidden is now mandatory.
    3. Re:The reasons this is better than speech-to-text by zipwow · · Score: 2

      Paraphrasing: "why not deliver the phonetics themselves"

      I'd guess that a text-based reprsentation of phonemes isn't as intuitive as one would like. At the very least, its another skill one would have to learn to use the phone, whereas the user presumably already can do some amount of lip reading.

      In general, though, I've often wondered if sending the phonemes over 'the wire' and reconstructing them as sound on the other side wouldn't work well.

      Granted, the phonemes would probably have to have a lot more detail than they do now to sound right, but it seems one could send a lot of that kind of detail and still take up less space than even a compressed sound wave.

      One application of this approach would be chat programs, and game communication. Because the phonemes are being reconstructed, you could reconstruct them with different characteristics than they were recorded, making them more feminine, or with a scottish brogue, etc.

      -Zipwow

      --
      I don't know which is more depressing, that 2/3 didn't care enough to vote, or that 1/2 of those that did are crazy.
    4. Re:The reasons this is better than speech-to-text by Bluesman · · Score: 1

      Heck, most posters hear can't get it write.

      And there supposed to be using they're brains.

      --
      If moderation could change anything, it would be illegal.
  53. The Bush Cabal : +1, Patriotic by Anonymous Coward · · Score: 0


    Slashdot reports trivial news about wireless while the
    Cheney-Rumsfeld Dictatorship plots to enslave U.S. residents as it broadens
    its Meta-Wars Against Countries
    That Opposes U.S. Business Interests

    Put that in your bong and inhale!

    Be Patriotic: Smoke Amerikan Grown Marijuana.

    Cheers,
    Woot

  54. This doesnt surprise me by Anonymous Coward · · Score: 0

    They already have songs for the deaf, and it's apparently a big seller .....

  55. hahaha roflmao by Anonymous Coward · · Score: 0

    Finally, someone takes those elitist deaf people down a notch.

  56. Okay This is just silly by tempestdata · · Score: 1

    The only people who would benefit from this are people who are deaf AND illiterate.
    If I were deaf I'd prefer it to simply display text on a screen the size of a pda.

    What benefit does watching lips move have over reading plain text?

    --
    - Tempestdata
  57. Finally a solution for illiterate deaf people! by techstar25 · · Score: 5, Funny

    This is clearly a solution for the large population of completely illiterate deaf people, for whom speech-to-text is not an option.

    1. Re:Finally a solution for illiterate deaf people! by Anonymous Coward · · Score: 0

      large population of completely illiterate deaf people Where?

    2. Re:Finally a solution for illiterate deaf people! by mazur · · Score: 1
      ...for whom speech-to-text is not an option.

      Even so, would not a simple transmission of image of the mouth area be much simpler and reliable? After all, it needn't be colour.

      Stefan.

      --
      The truth shall make you fret. (Ankh-Morpork tImes motto)
    3. Re:Finally a solution for illiterate deaf people! by karlmiller · · Score: 1

      Well considering only a little over half of the world's population is literate, and about 5% of the world's popualtion is deaf, that's a good 20 million people. You can read a rather interesting statement on deaf illiteracy. One shouldn't think for a moment that just because one can read that others can.

    4. Re:Finally a solution for illiterate deaf people! by karlmiller · · Score: 1

      Why is the above post moderated as funny? How is a large poplulation of illiterate deaf people funny?

  58. IN SOVIET RUSSIA by Anonymous Coward · · Score: 0
  59. Nuff said (and signed)! by LittleGuy · · Score: 2
    --
    Mod Karma -1: I sed bad wurds. If I cep my mouf shut, I wud be at riyses.
  60. I thought we had this feature a long time ago! by twoslice · · Score: 2

    Just turn on the vibrating feature. Then you can call your deaf girlfriend and say "I just called to say I love you" in morse code...

    --

    From excellent karma to terible karma with a single +5 funny post...
  61. How's that? by EyeSavedLatin · · Score: 1

    Cell phones for the deaf? What?

  62. gantz_graf by Anonymous Coward · · Score: 0

    i'm curious to see how this thing handles glitches, and moments where the compression gets so heavy and watery you might as well be listening to recent autechre albums. will the mouth appear to be hissing? will the face collapse into pablo picasso-esque abstract forms?

    (To be honest, though, i'm mostly curious about this because of an overwhelming desire i have at this exact moment to find a deaf person with one of these cellphones, call him up, and play an autechre album into the phone.)

  63. why not just 2 way messages? by Anonymous Coward · · Score: 0

    I have my ASL(American Sign Language) class in an hour. I'm going to ask my teacher what he thinks about this. He already carries around a 2 way message device and I wonder what he thinks about switching to something like this. I'll post back his response.

  64. I can see one advantage to this... by jhines0042 · · Score: 5, Interesting

    ... if you have this software running on a phone then if you are hearing impared you could get real time conversation with the other party without having to go through a human being.

    I've spoken with a hearing impared person on a phone before through a TTY system and it is painfully slow. First you have to say your sentence and then they send it. Then the other end needs to read it, type in a response, and then send it at which point it is read back to you. Imagine having a conversation over an Instant Messenger except you're secretary was reading the screen and typing for you. (IM for the blind for example)

    I agree that we need better voice to text and text to voice translation. That technology would give use better access for everyone. You could have "hearing" for the hearing impared (speech to text), "reading" for the vision impaired (text to speech), and you could even have "writing" for those with fine muscle control imparement or who are lacking the necessary limbs for various reasons.

    But this is an interesting approach to solve one of the three problems.

    --
    42 - So long and thanks for all the fish.
    1. Re:I can see one advantage to this... by mt2mb4me · · Score: 1

      actually, and this is mildly off topic, I dialed 0 today on my land line, and got the AT&T "operator", it wasn't a live voice, but it was only enough off for me to pick it up. But i just asked a question as if it was a live operator, and it responded just fine, using that as in intermediary it seems at it may cut down the response time in relay calls.

    2. Re:I can see one advantage to this... by Crewd · · Score: 1

      I used to work tech support for a ISP and would get TTY calls once a month or so. I can attest that it is painfully slow. Imagine trying to talk someone through troubleshooting their connection through TTY. When one came through it was time to kick back, put my feet on the desk, and get ready to have my call times go WAY up!

  65. MOD PARENT WAY UP by Anonymous Coward · · Score: 0

    I'm not deaf, but my parents are. Everything you said about the deaf driving is 100% correct. My parents are in their 70s, and my only mom was in two accidents in her life, none her fault. One guy rear-ended her at a stop sign, and another ran a red light & hit her broadside. Hearing was a non-issue in both cases.

  66. combine lip-reading with speech2text by peter303 · · Score: 3, Interesting

    David Stork has a chapter computer lip reading on in the book "Hal's Legacy" on A.I. methods. The combination is much more reliable that either audio or visual.

  67. YepThis Will Solve the Worlds Problems by mt2mb4me · · Score: 1

    Thats all we need deaf people usinga cell phone while their driving.

  68. Only reason I can see for this.. by Ted_Green · · Score: 2

    "As far as I can see, speech-to-text would be a lot bigger (space-wise), as well as *way* more processor intensive. And slower and less accurate."

    How well do you actualy see? The software would require the speach to be converted into some form of phonetics before it could determine mouth position. And if it can be converted to phoenetics it's just as easy to convert to text.

    The only possible reason I can see for this is if the software isn't really that accurate and so ti can get away with fudging the phonetics just a bit and/or the software was designed not so much for the pure deaf, but for those with some audio preception and the visuals of the mouth help them to better understand the audio.

  69. Re:In honor of In Soviet Russia... by Anonymous Coward · · Score: 0

    in england, we call them mobile phones, and we actually use a system compatible with 160 countries, and we actually have COMPETITION with providers.

  70. Huh? by dachang · · Score: 1

    How does this work? How do you alert the deaf person of an incoming call?

  71. no no no no no! by Elwood+P+Dowd · · Score: 2

    So close, but yet so far. Give us this plus this. That is, a portable chordal handset with braille output. Then connect it to either a blackberry like device, or one of those AIM cellphones.

    Can you imagine? I walk to work every day with my phone logged into AIM. I chat with people while I walk. I try not to step in potholes. The convenience of chatting and holding the cellphone at my side while waiting for the vibrating alert set me to thinking...

    Iduno. Y'all want a portable SSH client that you don't have to look at in order to use? Without the requirement for a screen, I don't care how big the device is. It goes in my backpack. The input/output is all tactile.

    I wonder how hard it is for sighted folks to learn braille. I wonder how hard it would be to mount braille-like output on a small handheld device. Dunno if that's possible, really.

    --

    There are no trails. There are no trees out here.
  72. Infamous Lip Synch by Anonymous Coward · · Score: 1, Funny

    That's not a computer generated lip synch demo.

    It merely an old millie vanillie video.

    Better hope the RIAA doesn't find out.

  73. Re:Office space... by thelinuxking · · Score: 4, Funny

    I'm thinking about taking that new chick from Logistics. If things go right I might be showing her my O-face. You know: Oh! Oh!

  74. Re:To quote Amadeus... by Anonymous Coward · · Score: 0

    Maybe so, but it's marked off-topic. Apparently, nobody's watched the movie.

  75. Misconseption... by mattyohe · · Score: 1

    It is a huge misconception that the deaf can read lips, when only 30% of what is said is visible on the lips, the rest has to be guessed. Now imagine them trying to decifer the text on "Virtual" lips.

    --
    - what is the definition of simultanagnosia?! I've been meaning to look it up!
  76. market by felix23 · · Score: 1

    Simple. There's not a million dollor market in making cell phones for the deaf. It's the sad truth. They don't really care about minority consumers.

  77. Why Not Recognize Words And Convert To Sign Lang. by DoctorMabuse · · Score: 1

    That would be less error prone.

  78. I thought they had these already by farnsworth · · Score: 2, Funny
    I thought they had these already.

    At least I assumed that the folks speaking at 95 db into a highly compressed mic did so because they were deaf and unable to hear themselves.

    --

    There aint no pancake so thin it doesn't have two sides.

  79. Simpler Solution by Durin00 · · Score: 1

    Wouldn't it be much easier just to have a voice to text conversion on the cell phone, instead of developing some fancy animated emoticons?

  80. Talking back? by Anonymous Coward · · Score: 1, Insightful

    That's fine for the deaf person, but how do they communicate back? Does the phone convert from mouth movements to audio?

  81. You miss the point by Anonymous Coward · · Score: 0

    I don't think Trolling Stones was belittling your cell phone infrastructure. He was merely pointing out that you drive on the other (wrong) side of the street. Plus you boil your food and don't brush your teeth, wanker.

    In Soviet Russia, a post replies to you.

  82. I'll tell you. by FreeLinux · · Score: 2

    Nothing, that's what! Contrary to many of these other posts, speech to text is a much better solution.

    People seem to be forgetting that speech to text is the back-end for this lip service anyway. In order for it to work, speach is interpreted by a computer which then maps the interpreted speech to canned lip movements. The canned lip movements require cpu horsepower to drive the graphics and they need a large screen for it to be readable. These two reasons are why it is only available on a laptop.

    With the speech to text scenario; speech is interpreted by a computer and is matched to canned pieces of text. So far, pretty much the same. But, now the text is output to just about any screen, including the text screens of today's cell phones.

    Basically the speech to text would be an automated TTY/TDD system. TTY/TDD has been in use and has proven highly effective for decades.

    To answer your question, there is NOTHING wrong with speech to text. However, you won't draw too many VCs with it. Now, put a computerized talking head on it and extoll the greatness of its virtues and you may well be able to sucker in a few VCs. And afterall, isn't that what it's all about?

    1. Re:I'll tell you. by fishbowl · · Score: 2

      "speech to text is a much better solution."

      Of course it is. It's also not current technology.
      The phone is more of an oscilloscope than a speech translator.

      If you want an apples-to-apples comparison, consider the alternatives
      between an animated face or a spectrum analyser, not between an
      animated face or a text display.

      --
      -fb Everything not expressly forbidden is now mandatory.
  83. Seinfeld... by taernim · · Score: 1

    Hmmm, so if you wanted to talk to someone on the phone with a deaf person, you could do what Jerry and George did here, and cover your mouths with tissues, etc while you talk.

    All of life's puzzles can be solved by Seinfeld...

    --
    "PC Load Letter? What the $@#% does that mean?!"
  84. I did this for karaoke by t0qer · · Score: 2

    I go to karaoke every week, and lately i've been making my own karaoke VCD's of more modern songs.

    I decided to do La La land by green velvet one week and just for kicks I thought I would make a talking head ala max headroom.
    http://www.zeromag.com/images/downloads/videos/t ry 1.avi

    (Divx compressed BTW 6 megs)

    Basically I just recorded a second track of my singing without the music, then pumped the wav through the facial animator in truespace 6.

    What I found was it actually made it a bit easier for me to keep up with the words because I would watch how the lips on my on screen persona and mimic them myself.

    Anyways, enjoy folks.

  85. However ... by Greedo · · Score: 2

    Speech-to-text works fine for the deaf person "listening" to the phone. But what does the deaf person do when he/she needs to "talk"?

    I know I'm generalizing, and not to be politically uncorrect, but don't most deaf people have difficulty speaking "clearly"? So how does the phone deal with that? Or does the deaf user need to type in their response?

    It seems to me that speech-to-text for receiving and text-to-speech for sending is the way to go ... and then the speech part is pretty redundant.

    In which case you've just re-invented the Blackberry.

    --
    Tuus crepidae innexilis sunt.
    1. Re:However ... by raistlinjones · · Score: 0

      The deaf person would just talk. Like a normal phone. There would be no reason to complicate things by making deaf guy try to type his response and having a computer monotone it out to the person on the other end.

  86. Lightbulbs for the Blind by egg+troll · · Score: 5, Funny

    Reading this makes me realize that my Lightbulbs for the Blind scheme was not crazy! Bundles of cash, here I come!

    --

    C - A language that combines the speed of assembly with the ease of use of assembly.
    1. Re:Lightbulbs for the Blind by neafevoc · · Score: 1

      Reading this makes me realize that my Lightbulbs for the Blind scheme was not crazy! Bundles of cash, here I come!

      Good! I'm with you all the way. I'm planning on delivering my solar powered flashlight just in time for Christmas!

  87. SMS by mholt108 · · Score: 1

    Members of the Deaf community are huge on SMS. It was a revolution for the whole scene. I reakon a better solution would be to give them easier ways to enter SMS messages quickly without needing a PDA. dont know how but it wouls still need to be small. HI people ?

  88. another waste of money by Anonymous Coward · · Score: 0


    You know, every other living thing on this planet kill or leave their wounded or diabled members behind...

    No really now. Why not invest the money they spent here developing research to actual electronic ears, ect. Research a cure not a way qround the problem.

    You don't cut off the penis to slow the spread of aids do you?

    I bet this project was thought up by a bunch of project managers that got together in a think-tank and decided that this was the most shiny thing that would exist. Forget functionality, does it sound/look cool? Lets do that!

  89. Sigh... by ALG · · Score: 1

    "No, no... I said V-A-C-U-U-M!"

  90. Re:Interesting by zapfie · · Score: 1

    You have the I.Q. of a lemming on crack.

    --
    slashdot!=valid HTML
  91. A better solution by MobyDisk · · Score: 1

    I am developing a system that converts the audio into lips, reads the lips into text, then converts the text into concrete ideas, then does a lookup to convert the ideas into pictures.

    The result is that the deaf person sees pictures of what the other person is talking about. For example, if they say "I'll meet you at the bus stop" they will see a picture of some meat, then the letter U, then a PCI bus, then a stop sign.

    The next step is a version for the blind, who cannot use cell phones since they have trouble dialing. It will interpret sentences into a techno song that conveys the meaning.

  92. Well the vibrate feature is always useful... by Salden · · Score: 1

    Can the phone convert voice to morse code and vibrate the dashes and dots?

  93. What the hell ... by LoudMusic · · Score: 2

    How about we skip the whole idea of text output? This is stupid! It's a complete waste of technology and time. Translating one person's audio into a 3d modeled face. Brilliant ...

    How about they just use a video phone? Or have the audio be displayed in a text output? It has to go through that step anyway.

    3D modeled face ... what the crap.

    --
    No sig for you. YOU GET NO SIG!
  94. One other thing by Dephex+Twin · · Score: 2
    I'm deaf (severe to profound sensori-neural hearing loss, since birth) and I'll tell you one thing: lip-reading can give ambiguous results.
    Someone can say "Pot" and yet with the same lip movement, can also say "My". Men with bushy mustaches are a lip-reading disaster.
    Imagine if every person enunciated consistently and clearly. I know there's still ambiguity, but it wouldn't be nearly as hard. This computerized face doesn't have problems like bushy moustache, and the pronunciations are precise (of course, it could be better). So you'll be in the best conditions for recognizing what is said (theoretically).

    With the accuracy of speech-to-text these days, the margin of error you get reading those lips might very well be smaller than if a computer tries to make those sounds into words and sentences.
    --

    If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
  95. Actually... by Anonymous Coward · · Score: 0

    > Presumably this technology does
    > speech->text->animated model

    No, actually it does
    speech->sounds->animated model

    It's actually easier to convert speech sounds into mouth movements than to text because phonemes in English aren't always typed the same way.

    For example, the sound "neighbor" could be interpreted by the software as the text "nayber"..figuring out the correct spelling of the phonemes the software hears is complicated and depends on context.

    However, every non-vowel sound corresponds to one mouth movement (and every vowel sound could be easily converted into a color or something like that). This is why it's so much easier to convert speech into lip movements than to text (and also why the software isn't language dependent--say "bluggamachooga", and it'll analyze each phoneme and convert it into a lip movement).

  96. not really by Anonymous Coward · · Score: 0

    just slap the fuck so hard they begin to hear again - repeat as necessary...

  97. Yes... yes... yes.. by jsonmez · · Score: 1

    Yes... yes... yes... but how do you know when it's ringing?

  98. Ummm... by SCHecklerX · · Score: 2

    wouldn't it be a lot more practical to just dump it to text???

  99. SMS by Anonymous Coward · · Score: 0

    SMS

  100. Phonemes, visemes, TTS, and lip synching by TekkonKinkreet · · Score: 3, Informative

    Posting late, but wtf.

    By way of introduction: I developed the core coarticulation and other algorithms for lip synching when I worked at a now-defunct company called...wait for it...LIPSinc. We thought the resulting lip synching was pretty damn convincing, so on my own I tested out our stuff with a hearing-impaired friend, with mixed results. Anyway, I don't know a little about this stuff, I know a *lot* about it.

    What these guys have done is map phonemes onto exaggerated visemes (the pictures of the mouth). Not a bad idea at all! Bunch of problems, though. First, there's a data data reduction of about 3x in going from sound to video--there are 40-50 distinguishable phonemes, and 9-16 distinguishable visemes, depending on how you count each. This is because the visible part of the face only makes up the end of the vocal tract, a lot of distinctions between letters occurs without the involvement of the lips, like the difference between F and V, while others, like K, can be pronounced with the face in virtually any position. This is part of what makes lip reading so hard with a real person, and why they need a lot of context to pull it off. They also seem to be slowing down the timing, as if they recognized the phonemes and then synthesized each at the same length. This gives longer to recognize each one, but wrecks the visual prosody (rhthym) of the speech, which is a good cue for where the parts of speech are. Then there's the rest of the face. The eyebrows and head positions help you figure out key words, ends of clauses, tell if something is a question, etc.

    Those who say that TTS is superior to lip reading have a point. Good TTS contains *more* accurate information than an uninterpreted stream of phonemes (itself 3x richer than a stream of visemes, as I said above), because the machine can do a Viterbi search to find the most likely sequence of words from a continuous stream of phonemes. Words also open up higher NLP functions, so you can do constraint relaxation to test whether "wreck a nice beach" or "recognize speech" fits better in the context.

    Still, I'd like to see an experiment where the raw phonemes are fed, as text, to the recipient. I think with practice, your brain would start to decode the string (it manages with the sound, right?), despite the lack of word boundaries and the errors in phoneme detection (which is not all that high without text-I think seventy-something percent). Seems like an easier pattern recognition problem than lip reading. Who wants to go get funding?

    1. Re:Phonemes, visemes, TTS, and lip synching by Aexia · · Score: 2

      This is because the visible part of the face only makes up the end of the vocal tract, a lot of distinctions between letters occurs without the involvement of the lips, like the difference between F and V, while others, like K, can be pronounced with the face in virtually any position.

      There are visual cues for "invisible" sounds, like the nose darkens, the throat turns blue and the dots appear on the cheeks for various sounds. Watch the demo.

      It'll take practice to get used to, I'm sure, but no more than it takes to pick up Graffiti on PDAs, I imagine.

  101. Wonderful! by Asprin · · Score: 2

    Now there are going to be BLIND drivers swerving all over the road because the're talking on the phone!

    --
    "Lawyers are for sucks."
    - Doug McKenzie
    1. Re:Wonderful! by Asprin · · Score: 2


      Oops, you're absolutely right.

      I was thinking of the *BRAILLE* cell phone.







      ...well, it *would* have been funny...

      --
      "Lawyers are for sucks."
      - Doug McKenzie
  102. Not a good idea by Anonymous Coward · · Score: 0

    I go to the Rochester Institute of Technology in Rochester, NY. We host the National Technical Institute for the Deaf (NTID). Guess what? Most deaf people can't read lips any better than hearing people. It takes a lot of extra bandwith and resources to record and play video, then say, (1) translate speech to text, or (2) translate text to speech. Ultimately this is only an advantage for two deaf people, both able to read lips, talking over a phone.

  103. Why do deaf people need cell phones? by Anonymous Coward · · Score: 0

    Deaf people are USELESS. They don't need phones, they need to be shot and their body parts should be harvested for more viable members of society.

    1. Re:Why do deaf people need cell phones? by Anonymous Coward · · Score: 0

      useless ? You are undereducated! Deaf people can do everything you fuckers can do!

  104. Neat by dissy · · Score: 2

    This is actually a pretty neat technology.

    Ive seen lots of suggestions for speech to text, but if you have had any experence with regular powerful PCs and speech->text you will see why that wont work on even a 2ghz intel system, let alone a pda/cel phone.
    (Didnt we just have an ask slashdot about this?)

    A wire frame of a face only requires slightly more CPU power than processing a WinAmp visual (No 32 bit color eyecandy here) and i have actually seen a visualization plugin (For ge-force, avail for winamp and itunes as well as some opensource packages) that has a module that draws a face and it in a way moves with the sound.
    Granted that was not its design, it was made to look good, but obviously with a few changes it could be made to acuratly simulate a face and mouth for this very purpose.

    Im all for any technology that makes interacting with a computer easier. While I personally would prefer direct sound into my ears over this, I also concider myself lucky to have that ability compared to those that dont.

    Personally Im all for the direct brain connection, but i have a feeling thats a ways off yet :)

  105. Experience talks by Malicious · · Score: 2, Interesting

    Working in a call center, i get the occasional deaf call.
    It takes tremendous amounts of time, because not only does the translator have to interpret what the customer is saying, so that i can hear it, he then has to translate what i say back to the customer. It takes ages, and i'd imagine that with a cell phone, having a comptuer immediately translate, if slightly less accurate, would be preferable to having a human slowly (compared to the comptuer) enter it. Speed Vs Ease of Comprehension. Pretty common comparison. To each their own

    --
    01101001001000000110000101101101001000000110001001 10000101110100011011010110000101101110
  106. Comment removed by account_deleted · · Score: 2

    Comment removed based on user account deletion

  107. 30-40% by Anonymous Coward · · Score: 0

    Consensus is that only about 30-40% of spoken English is comprehendible only through lipreading.

  108. Try Something Patriotic +1, Patriotic by Anonymous Coward · · Score: 0

    such as a Cheney-Rumsfeld approved "smallpox"
    vaccine.

    Then see who needs lithium.

    Thanks in advance,
    Woot

  109. T900 is the product to beat by Anonymous Coward · · Score: 1, Interesting

    My wife is hearing impaired, so we've got a lot of HI and deaf friends. Nearly all of them use Motorola T900 two-way pagers. Even the older, less tech-savy deaf people are comfortable with them, since it's not that different from a TTY (albeit non-interactive). They're small, handy, inexpensive, and run for quite a while on an AA battery. They're email-able so they have no problems communicating with hearing people (especially if you've got a email-capable cell phone).

    The only problem is the non-interactive nature, and the fact that the email messages have to be rather small. If someone would come up with a version that could do real unit-to-unit TTY (essentially put a phone/TTY in it) in addition to email, they would sweep the market.

    All this flashy lip-reading-speech-recognition crap is trying to kill a cockroach with a hand grenade.

  110. DUH!!! by SuperDuG · · Score: 2

    It's called SMS!! Works for even the non-deaf crowd and doesn't piss as many people off.

    --
    Ignore the "p2p is theft" trolls, they're just uninformed
  111. GSM phones. by Bake · · Score: 2

    Every GSM phone made has for years come equipped with SMS capabilities.

    SMS is very popular with the deaf (at least where I come from). It allows them to communicate just as easily with those with good hearing as other deaf people.

    SMS also solves the problem of being globally accepted (just as long as you're not on the North-Western side of the Atlantic pond), and you don't need a special kind of GSM phone to be able to communicate with SMS. Another nice feature is that it works no matter how noisy it is surrounding the sender.

  112. Better yet: by Misch · · Score: 3, Insightful

    We have tools like Sprint Relay On-Line that will do text-to-speech... and every state provides confidential relay services to begin with. Many states are moving towards making 711 a standard relay number.

    If a deaf person wanted a "cell phone", they'll probably have one from Wynd Communications, a two-way pager with text/e-mail and other services built right into the damn thing. They're all the rage here. Screw lip reading over the phone. This technology is pure eye-candy. Nice, but how useful will it really be?

    --

    --You will rephrase your request for me to go to hell. Goto statements are not acceptable programming constructs
    1. Re:Better yet: by Anonymous Coward · · Score: 0

      Government-funded services for the deaf are perhaps the perfect example of outdated government programs that refuse to die. Services like TTY and phone relays have long since been made redundant by the advent of universally available and reasonably inexpensive e-mail. But taxpayers still spend hundreds of millions a year subsidizing these obsolete services.

    2. Re:Better yet: by Misch · · Score: 3, Insightful

      Services like TTY and phone relays have long since been made redundant by the advent of universally available and reasonably inexpensive e-mail. But taxpayers still spend hundreds of millions a year subsidizing these obsolete services.

      May you never go deaf then. May you never buy a product that breaks and have to call a phone number for customer services. May you never have to call an emergency services number. May you never have to call the pizza place to order a pizza.

      The differnce between e-mail and TTY is the difference between push and pull technology. With e-mail, there's no guarantee that your e-mail is ever received, much less opened, read and processed.

      Because of this, e-mail cannot (and does not!) qualify under the ADA soley as a reasonable accomodation.

      I've worked as a secretary (in a school for the deaf, no less.) I know e-mails can take a long time to get delivered. There's still time between when it gets delivered and when it actually got read and processed by me. (Usually not long, but on some crazy hectic days, it could take some time.)

      I hate feeding the trolls, but this one needed to be thwacked over the head.

      --

      --You will rephrase your request for me to go to hell. Goto statements are not acceptable programming constructs
  113. and we thought drivers with cell phones were bad.. by MntlChaos · · Score: 1

    now we have deaf drivers (can't hear you honk horn) looking at an avatar right in front of them and you can't tell if they're looking at the road or not!

  114. Re:and we thought drivers with cell phones were ba by itsnotme · · Score: 2

    I'm deaf, and I'd know when you were honking the horn, we can feel it.. its stupid people like you who think that the only way you can know there's a horn going off is to hear it.

    AND I'm almost willing to bet you're one of those dumbasses who like to dial your cellphone with your phone out in front of you and not paying any attention to the road either..

    Now whats the difference between this and that? None! dont use both on the road while driving!

  115. Re:and we thought drivers with cell phones were ba by MntlChaos · · Score: 1

    sorry for any offense caused but I wasn't trying to be serious there. Please locate your sense of humor before posting on slashdot (check outside your doorstep, perhaps its there :-) )

  116. What's next? by Rassleholic · · Score: 0

    TV for the blind? Bicycles for quadrapaligics? Ham radios for the mute? Source code for Windows?

    --
    Not noteable, IMO a rubbish article.
  117. How about this by Anonymous Coward · · Score: 0

    Why not display it in text BUT NOT proper nor really English? Use phonetics... the big issue with speech to text is trying to make it into proper English, right? Then screw it, go with phonetical text, no crazy mouth reading.

    Now this would work fine, if a speaking person would read the phonetic sounds, it would make sense, however, im not sure if the deaf can...

  118. laptops and soon pda's? by Anonymous Coward · · Score: 0

    ummmmmmm what about desktops?

  119. Re:and we thought drivers with cell phones were ba by jtwine · · Score: 1

    Post-lingually Deaf? I only ask because I have experience with The Deaf, and I must say that you write English like a native Hearie! :)

    >I'd know when you were honking the horn, we can feel it

    Now, driving a manual transmission by feeling the car's vibrations (no tach.) is one thing; being able to feel even those little tweeter-sounding horns on some smaller vehicles is another... :)

    --
    -=- James.
  120. Not only can they drive.. they can fly too! by Anonymous Coward · · Score: 0
    I'm fully deaf and am going to start flight school next year.

    http://www.deafpilots.com/

    and

    http://www1.faa.gov/AVR/afs/deaffaq.htm

    are two good reference points for seeing how deaf people can fly up there without using a radio, etc. No, they can't fly at LAX, etc. On a related note, I checked out the T-Mobile Sidekick after seeing an ad just now on slashdot saying that CmdrTaco recommends it. I checked it out and it looks exactly like what I've been looking for: email, unlimited web and AIM.. but one caveat... after one year, they start charging $3.50 per megabyte above 15 megabytes/month.. ouch.. i wonder how bad this would be..

  121. bunches of crapshit! by Anonymous Coward · · Score: 0

    bottomline for the idea of cellphone for deaf is full of crapshits! i don't belive in it's usefullness ,only reliable way is email,fax and videochat in asl,yes there is a national video relay service for the deaf!!!!!permiting us to sign in asl
    btw asl is not the same as signed english and asl is much simpler

  122. I work for CSD (communication services for deaf) by Anonymous Coward · · Score: 0

    I work for Sprint relay who provides services for people who are deaf, hard of hearing, or speach disabled. After a few days of doing my job (saying what deaf people type and typing what the hearing person says back) I have felt I am simply being paid to emulate a piece of technology that hasn't been invented yet.

    I am paid to turn speach into text, and text into speach. I guess real speach to text soft/hard-ware, suitable for real life applications, is still not practical.

  123. a quote about the difficulty of lipreading by Gryftir · · Score: 2, Informative
    From
    • For Hearing People Only

    "Lipreading involves a high proportion of guesswork and "instant mental replay." Only some 30% of all spoke sounds are visible on the lips. Many sounds like "b," "p," and "m," are virtually impossible to distinguish by watching the mouth. [...] Anyway, 'lip-reading' is a misnomer. A more accurate term is speechreading. Speechereaders don't just look at the mouth;they read the entire face. [...] They note changes in expression, shoulder shrugs, posture, gestures. [...] Picking up these associational cues is an art in itself." (127-128)
    I'd also like to add that For Hearing People Only, ISBN 0-934016-1-0 is a great source of information about the complex and interesting world of Deaf people, and the language of ASL.
    --
    http://www.santacruzbynight.com/index.shtml Santa Cruz By Night Vampire Larp
  124. This solves one problem .. but there's one more by ryanw · · Score: 2

    Ok, so lets pretend this helps the deaf understand what the person on the other side of the phone is saying ... now how does the deaf person communicate back to the person on the other side? I mean, some deaf people can make noises which represent words to some extent.. but this does not seem to help the majority ...

  125. yes, by me.at.work · · Score: 1

    I think I've heard these. Usually followed by some "turn off that %Q phone!" yells.

  126. For the deaf by Andrewkov · · Score: 2

    Text messaging, anyone? ;-)

  127. Re-rendering speech for the deaf by Anonymous Coward · · Score: 0

    Commenting on: http://story.news.yahoo.com/news?tmpl=story&ncid=5 81&e=3&cid=581&u=/nm/20021126/tc_nm/telecoms_israe l_cellcom_dc
    "Israeli Software Enables Deaf to Use Cell Phones"

    The perception a seeing, hearing person has of the speech of another person whom he can see speak is quite complicated. Several years ago researchers exhibited an interesting demo. They showed an example in which they combined an audio track of a person saying one sound, with a mute synchronized video track of him saying a second very different sound, which all observers of the two tracks together would identify as yet a third very different sound!

    Researchers often attempt to characterize language-speech as being made up of a limited alphabet of sounds, called phonemes, which are partially modified and then linked together in a sequence to constitute all speech. This approximation is often very useful. More useful yet, if harder to process, is a scheme in which the "alphabet" chosen is the much larger (NxN) number of adjacent phoeneme transitions.

    Analogous to phonemes are visual representations of the human face making these sounds, called visemes. It is well known that while English has dozens of phonemes, it has well under a dozen visemes, with multiple phonemes mapping to a common viseme. This is what makes so-called "lip-reading" hard! No doubt disambiguation by experienced practitioners comes about because CONTEXT allows one to make a unique mapping to extant words.

    The "Microsoft Agent" technology avatars used on Windows PCs to render speech, simultaneously display appropriate visemes as they produce the audio of the speech itself. Not only do they do this when fed text which is made into speech via artificial text-to-speech voices, but they ALSO try to exhibit appropriate visemes when fed an audio file which records spoken language. The SECOND version of this technology rolled out in late 1997, so it is really unfair to say that "nothing like" the Cellcom technology already exists!

    By embellishing natural visemes with additional visual sound indicators, the Cellcom people arguably make a contribution, but I think that overly expansive claims serve only to encourage scorn among what one would call colleagues.

    Of course the notion of recognizing phonemes and using them to re-render the speech in another form is one I myself have suggested for another purpose - compression e.g. on 25 April 2002 at: http://groups.yahoo.com/group/WGTA-Discussion/mess age/587 where I said:

    "If communications costs are an issue, text-based chat (potentially interfaced to speech with speech recognition and text-to-speech engines!) can be effected at the incremental cost of Internet use - nothing."

    I embellished the notion in a posting of 3 August 2002 at:
    http://groups.yahoo.com/group/WGTA-DDL/messag e/1 ,
    stating:

    "There is also a way to create a 'poor man's VoIP' network when one is limited to a narrowband link suitable only for text-based chat: one can use the text as a means of compressing voice. It would work so:

    "One would have the parties to the conversation all use Internet text-chat clients. Received text would be rendered using text-to-speech engines. Sent text could be captured with speech recognition software on a sufficiently powerful and trained PC. Because the final text would be rendered phonically, one traditional problem with speech recognition would be greatly mitigated: substitution of homonyms and near-homonyms for actual words. And indeed, because the conversation would be interactive, not time-shifted like dictation of an e-mail, in the event of ambiguities or uncertainties a confused speaker could simply request the sender of the questionable speech to restate or at least repeat it. After all, that's what we *already* do when talking on the telephone, especially if the connection is poor or a speaker has heavily accented or impeded speech."

    Of course this particular suggestion also BEGS an alternative way to help a deaf person accept a telephone call - use of a speech recognition product to produce text! In fact, many deaf folks have for decades use so-called "TTY" devices (more on this below) to receive text over the telephone. So, if one wanted to automate speech recognition within the context of using a telephone, why not do it at a *central* server whose output can be forwarded to the deaf person? That way he need not *carry* all that hardware for the few minutes per day he uses a telephone. [Note that using a computer in place of a human transcriber may also help spare embarassment in intimate chat!]

    By the way, several wireless phone companies have worked on a TTY solution since the Federal Communications Commission mandated in 1996 that TTY users have wireless access to 911 emergency services. By early 2000, Bell Atlantic Mobile said it would sell wireless TTY phones by the second half of 2001.

    From "The New York Times", February 3, 2000:

    "[For telephony, the deaf have exploited] ...TTY devices, which cost as little as $150 or as much as $700 for a portable model... [which have been] used since the 1960's to transmit and receive text messages over phone lines...

    "Two people equipped with TTY devices can communicate with each other over phone lines in a straight text conversation...

    "Someone not equipped with a TTY device, on the other hand, can 'talk' to a deaf person via the Telecommunications Relay Service, a toll-free service that began in 1993 and is available in every [US] state. The service's operators, equipped with TTY devices, translate messages from text to voice and vice versa for both parties."

    I'll add that a 6 March 2001 press release by the US National Science Foundation talks about "Andy the avatar... a 3D animation... [which can] interpret words, sentences and complicated concepts into sign language, combining signing, gestures and body language to simulate natural communication." That is, Andy is an ASL analog to the Microsoft Agent characters.

    Dr. Ron Feigenblatt
    http://www.geocities.com/neohephaestu s/

  128. i think, actually, it makes some sense by raistlinjones · · Score: 0

    As we all know, computers are completely idiotic when trying to turn speech into text. This is, of course, due to the fact that a lot of words/phrases sound pretty much the same, and people mumble.

    The advantage of this system is that translating sounds to facial movements is relatively easy. Anytime you hear "ow", it's going to be the same facial movement producing it. My guess would be that your interpretation of the facial movement thing will be more accurate than the computer's inane text interpretation.

  129. Re:Why Not Recognize Words And Convert To Sign Lan by Anonymous Coward · · Score: 0

    That would have to be based soley on speech-to-text translation. Less error prone than speech-to-animated lips? As many above posts mentioned, I don't think so.

  130. Last Post! by alpg · · Score: 1

    Two men are in a hot-air balloon. Soon, they find themselves lost in a
    canyon somewhere. One of the three men says, "I've got an idea. We can
    call for help in this canyon and the echo will carry our voices to the
    end of the canyon. Someone's bound to hear us by then!"
    So he leans over the basket and screams out, "Helllloooooo! Where
    are we?" (They hear the echo several times).
    Fifteen minutes later, they hear this echoing voice: "Helllloooooo!
    You're lost!"
    The shouter comments, "That must have been a mathematician."
    Puzzled, his friend asks, "Why do you say that?"
    "For three reasons. First, he took a long time to answer, second,
    he was absolutely correct, and, third, his answer was absolutely useless."

    - this post brought to you by the Automated Last Post Generator...