Slashdot Mirror


PDA Speech Translator

jlowery writes "Not quite as good as a babelfish, but a PDA that does translation is probably better than resorting to hand gestures alone. I could see this as a boon to the tourist who travels to places where English speakers are uncommon."

14 of 161 comments (clear)

  1. The problem with these things by the+man+with+the+pla · · Score: 3, Insightful

    The problem with every software that I have used that tries to decipher human language (like Zork or the game included with emacs for X) is that you have to know what words the software understands and in what context.

    I have seen the same problems with automated phone systems that are supposed to recognize a generic voice and I can see the same thing happening here.

    The main difference here though, is that when entering text, you know exactly what you input before pressing enter. With voice recognition software, how do you know that the software "hears" exactly what you say? If you say somethign like "What are my appointments for the thirteenth?" and it hears, "What are my appointments for the thirtieth?" you would be receiving the wrong information.

    I hope this is a success but I don't have my hopes up.

    --
    7329756

    --
    The linux hacker
    1. Re:The problem with these things by Angus+Prune · · Score: 3, Insightful

      It all boils down to confidence. I have to be confident that what I'm doing will work.
      I use a wireless keyboard but Im having to switch back because I find I have to check what I am typing because it doesn't always pick up every keypress
      Voice to text are only of limited use while you have to re-read and correct any mistakes.
      While this is only 80% accurate it can never be trusted. When this works at 95% it won't be trusted. I won't trust that this won't mistake Renal for Venal.

      While this is a great step foward I can't see it being trustworthy for 2006 and I still think the same problems still apply to this as have always applied.

    2. Re:The problem with these things by fastidious+edward · · Score: 3, Insightful

      It was seriously almost as distracting as hand entering the number.

      Are you being sarcastic? I can type a number on a numeric keypad much faster than I can say it. The 5-6 times much more than compensates for the time of getting the phone out of my pocket.

      Voice recognition is great, but tactile recognition is also great, as is body movement.

      Until the machines can be 100% accurate without frustration they are next to useless.

      I know I have trouble understanding someone with a heavy Southern-USA accent, like someone else may have trouble with a heavy Scottish accent (as firends have) or heavy London accent (as I can revert to), people are not perfect at understanding people, let alone machines understanding people.

      Voice regognition is not a great saviour and IMHO is years away, in the meantime I'm happy with a numeric keypad.

      --

      karma karma karma karma karma chameleon, you come and go, you come and go.
  2. Had to be said by Anonymous Coward · · Score: 5, Funny

    "All your base are belong to us!"

  3. Good Grief... by ackthpt · · Score: 3, Insightful
    I could see this as a boon to the tourist who travels to places where English speakers are uncommon."

    Spoken like someone who has never taken a foreign language class. Suppose that thing is going to get the accent right? Emphasis on the right syllable? Not likely, mostly good for translating some text message into the PDA holder's tongue (and doing an Engrish job of it anyway.)

    --

    A feeling of having made the same mistake before: Deja Foobar
  4. And here by Anonymous Coward · · Score: 3, Funny

    I thought that you only had to speak English slowly and loudly enough for anyone to understand. Silly me!

  5. Good Idea... by avgjoe62 · · Score: 5, Funny
    > I could see this as a boon to the tourist who travels to places where English speakers are uncommon.

    Yeah, I could really use one of these when I go from Fort Lauderdale to Miami...

    --

    How come Slashdot never gets Slashdotted?

  6. Yeah, thanks, but I'll wait for a bit... by dejinshathe · · Score: 3, Funny

    "It also works only when the speakers are talking about medical information, and it's only about 80 percent accurate in the lab."

    Forgive my immediate misgivings, and you can call me chicken if you want, but I'm really not that keen on walking into a hospital and asking to have a medical procedure done with a 1 in 5 chance that instead of removing my appendix, they might remove my "appendage"...

    --


    "It is the prerogative of fools (or noobs) to utter truths that no one else will speak."
  7. a complete translator could be possible by shuz · · Score: 4, Interesting

    Technology is at a point where all the software has been written to create a translator where a person speaks into a microphone which then is translated into text which is then translated into a different language which is then played back verbally in the same persons voice in a different language. The problem is that this cannot be done in realtime. 4 years ago I worked on a project for At&t to create an application that would train a users voice, break down thier voice patterns and be able to rearange those patterns to create other sounds which sound like they are coming from that real person. The problem is that with current processors the time to train and process is about 10 hours. So we can do voice recognition in realtime, we can translate text words in realtime, and in 10 hours we can reproduce a persons voice nearly flawlessly. Think of the possiblities!

    --
    There is or can be built a machine that can simulate any physical object. -Church-Turing principle
  8. I can see it now... by stienman · · Score: 4, Funny

    "Are you speaking the english?"

    "I speak to the English, it's the Americans I won't talk to..."

    -Adam

  9. Yelling Helps by aredubya74 · · Score: 3, Funny

    Outstanding. This thing will finally make the common Ugly American practice of yelling actually useful:

    *hold PDA to face* Ahem! "WHERE IS THE BATHROOM?!" *hold PDA to foreigner's ear*

    --

    RW

  10. "My hovercraft is full of eels" by NZheretic · · Score: 5, Funny
    With apologies to the python crew...

    Text on screen: In 2004, the World Trade Center lay in ruins, and foreign nationalists frequented the streets - many of them Arabs (not the streets - the foreign nationals). Anyway, many of these Arabs went into tobacconist's shops to buy cigarettes....

    A Arab tourist approaches the shopclerk. The tourist is talking haltingly into a PDA.

    Arab: I will not buy this record, it is scratched.
    Clerk: Sorry?
    Arab: I will not buy this record, it is scratched.
    Clerk: Uh, no, no, no. This is a tobacconist's.
    Arab: Ah! I will not buy this *tobacconist's*, it is scratched.
    Clerk: No, no, no, no. Tobacco...um...cigarettes (holds up a pack).
    Arab: Ya! See-gar-ets! Ya! Uh...My hovercraft is full of eels.
    Clerk: Sorry?
    Arab: My hovercraft (pantomimes puffing a cigarette)...is full of eels (pretends to strike a match).
    Clerk: Ahh, matches!
    Arab: Ya! Ya! Ya! Ya! Do you waaaaant...do you waaaaaant...to come back to my place, bouncy bouncy?
    Clerk: Here, I don't think you're using that thing right.
    Arab: You great poof.
    Clerk: That'll be six and six, please.
    Arab: If I said you had a beautiful body, would you hold it against me? I...I am no longer infected.
    Clerk: Uh, may I, uh...(takes PDA, talks to it)...Costs six and six...ah, here we are. (speaks weird Arabic-sounding words)
    Arab punches the clerk.

    Meanwhile, a cop on a quiet street cups his ear as if hearing a cry of distress. He sprints for many blocks and finally enters the tobacconist's.

    Cop: What's up
    Arab: Ah. You have beautiful thighs.
    Cop: (looks down at himself) WHAT?!?
    Clerk: He hit me!
    Arab: Drop your panties, Sir William; I cannot wait 'til lunchtime. (points at clerk)
    Cop: RIGHT!!! (drags Arab away by the arm)
    Arab: (indignantly) My nipples explode with delight!

  11. Just wait ten years by mschuyler · · Score: 5, Interesting

    I believe PDAs are going to be tremendously transformed over the next few years.

    1. Convergence is going to happen with a vengance. The Treo 600 is just the start. More and more apps will make it to the PDA. Speech recognition is one, and that sets up for another dybamic...

    PDAs don't really need screens and keyboards if you can talk to them and they can talk to you. If they don't need those components, they can get a whole lot smaller. The next generation PDAs will be like a hearing aid, and the ones after that will be built into your glasses or an implant. That means less power, so less battery. Besides, it will be able to run on your body heat if not tap into your own body's electrical system, so it won't need a battery. Every improvemnt along these lines dwindles the size even more. A heads-up display, made transparent or opaque, ought to handle those times when you need to really observe rather than consult.

    A combination of AI and connectivity will mean your PDA is your first line of defense in many of life's situations. Get pulled over by a cop and it will tell you what to do, what NOT to do, and contact your lawyer. Need a cop and it will call them and know just how long it's going to take to get there.

    Medicine: It will have a complete medical history of you, remind you to take your meds, and monitor your blood pressure and other vita signs. If you have a heart attack it will call 911 with your location and be the first thing the medics consult when they get to you.

    Personality: You'll be able to choose its level of humor and sarcasm. Although clearly a machine, people will develop meaningful relationships with them, at least they'll think so.

    Connectivity: Everything you can think of, including your own house, which you'll call up to turn the heat up since you're coming home early. All teh Wi-Fi/cell connectivity you want will be built in.

    Finances: It will know everything you do and provide access to your dough. If you get overdrawn it will be intentional because it will have real time access. It will have all the ATM/debit/credit stuff all on-hand. It will also be able to shop for you and tell you where the best deal is.

    It will know all your friends and business associates and help remind you, "This is Joe. He's a Cougar. He knows you're a Husky, but don't rub it in. His kid just joined the Navy. He thinks LOTR sucks, and Rush is Right, so be careful. He drinks Guiness. His budget is 250K and he's looking to upgrade the Ciscos."

    You'd never think of leaving home without this. Indeed, since it very well may be built-in, you won't have to worry about it. Just keep up the subscription.
    '

    --
    How about a moderation of -1 pedantic.
  12. Re:There are telephone translation services. by belmolis · · Score: 3, Insightful

    Aptitude testing is useful, but two other major factors in the success of the US government language schools (there are actually four: The Defense Language Institute in Monterey, the Foreign Service Institute, the CIA Language School, and the NSA Language School) are time and focus. In most other situations, such as high-school or college, people studying a language study it a small fraction of the time. It's one of four or more courses. Class time is 3-5 hours per week. On a typical university schedule, that's a maximum of 130 hours a year in class. In contrast, in the government language schools, language study is the whole show. Students spend 8 hours a day or more on the language (not all in class). That comes to much more time devoted to the language, and there are fewer distractions.