Slashdot Mirror


Can Your Mouth Become Multilingual?

Roland Piquepaille writes "During a videoconference last week between Karlsruhe, Germany, and Carnegie Mellon University (CMU), Pittsburgh, USA, the talk of Alex Waibel, from CMU, was automatically translated in German and Spanish. Both the Pittsburgh Post-Gazette (PPG) and the Pittsburgh Tribune-Review (PTR) attended the conference, took pictures and were impressed by this new 'open domain' speech-to-speech translation. This new computer technology is based on artificial intelligence (AI) and statistical methods. During the demonstration, the speaker had electrodes attached to his face and his neck, but the researchers think that these electrodes could be implanted into your mouth and your throat in a decade from now -- if you agree of course."

11 of 212 comments (clear)

  1. Re:eh? by Andrew+Tanenbaum · · Score: 2, Interesting

    That happens with real people, too =)
    I know a Malay who learned English in Australia, and he talks like someone straight out of the 1800s.

  2. You Can't Hear Me Now by Quirk · · Score: 2, Interesting

    In terms of the hardware... " NASA scientists have begun to computerize human, silent reading using nerve signals in the throat that control speech." Subvocal speech reading systems offer the added bonus of now having to listen to the mundane trivia being broadcast from the cubicle next to yours.

    --
    "Academicians are more likely to share each other's toothbrush than each other's nomenclature."
    Cohen
  3. How could it translate? by spongebue · · Score: 4, Interesting

    Computers have a hard time translating written things as it is... any bilingual will tell you that online translators for complete sentences will do nobody any good, for the most part. My Spanish teachers are all able to see papers with computer translations very easily, due to similarities in words and meanings (such as the word "pants" which can be colthing or breathing heavily) Not to mention, grammar and things like that are not done well at all. For the fun of it, try going to an Online translator and write something in English, translate it to Spanish, then back to English. Some results are pretty crazy. I guess the point I'm trying to make is this: what makes the translators so special compared to the ones we have now? How can they work better? Sure, there is probably a bit more effort put into these, but I don't think that a good translator will be available for another 5 years, not to mention the whole "take the speech you aren't saying" thing is hard to believe.

    1. Re:How could it translate? by MBCook · · Score: 2, Interesting
      That's true, but their system was much more powerful. The first article made a quick mention of how "programmers try to make computers think like humans, while they were trying to make the computer work like a computer." I believe it said they were using statistics which (given enough source material) would eliminate those kind of problems. Seems like there was as article on Slashdot about that a while back.

      But by combining it with grammatical analysis you could also fix those kind of errors. In the example you gave, pants (the clothes) is a noun, while pants (breathing) is a verb form. By figuring out which context the word is (noun or verb) then the computer can make a much better guess at the correct meaning. A system could adapt to the user's speech patterns which would probably help it decide too.

      That said, you're looking at a free translation software offered on the internet. They have to do quick translations and they are ad supported at best. I'd image real translation software would be much better. SYSTRAN's little "try me" box on their site successfully translated "The dog was wearing pants. The cat pants loudly." to French using different words for the two pants (but then again so did Google and Babel Fish). Still their top of the line product is $900 so I would guess it would be rather good.

      Don't forget, by the time users get their hands on a system like this, it will be a few years from now and you'll have increased memory capacity and processing power. Plus if you don't need to be silent (which is why they were using electrodes) then I'd imagine a video camera or two would work just fine for reading the muscles (you could use this easily in the UN).

      Of course, maybe we should all just switch to Latin. You can't say we're playing favorites with a language if you choose a dead one.

      --
      Comment forecast: Bits of genius surrounded by a sea of mediocrity.
    2. Re:How could it translate? by Matthias+Wiesmann · · Score: 2, Interesting
      Statistics can only help if you really have large corpuses of reference material, and feeding the text in two langage into the system will probably not be sufficient, you will need to map what expression goes to which. Gramatical analysis can only help to you a point. If you take this french sentence "Il va voler la vedette." It can mean either ''he will steal the show'' or ''he will steal the speedboat''. Statistics won't help you much: because one translation is more probable that the other doesn't make it right. The only way to select the correct translation is to detect in the context that the text is about show-buisness or ships.

      As for commercial software, I'm not so convinced. I work in Japan, and we bought som translation software to translate to/from japanese. The quality is somehow better that what you can get on the web, in the sense that it seems to detect some key expressions and translate them, but we are far from getting even readable english from a Japanese corporate web-page. This is not very surprising: Japanese and English have very different structures, so translating is really not obvious. I suspect statistics will help in translating common things like manuals and commercial letters as they are basically always saying the same thing.

      Finally, I must disagree about your comment about latin. The difficulty for X in speaking langage Y has a lot to do with how different they are in the logical structure, the vocabulary (and the sounds they use), so for people who speak romance languages, learning Latin will certainly be easier that say for a Chinese, people who speak langages with cases might also have an advantage.

    3. Re:How could it translate? by bodrell · · Score: 4, Interesting
      Of course, maybe we should all just switch to Latin. You can't say we're playing favorites with a language if you choose a dead one.

      Actually, that's exactly what Israel did when the Hebrew language was brought back from the dead. For awhile, German was considered for the official language of Israel, since there were so many German Jews relocated to Israel. A guy named Ben-Yehuda was almost single-handedly responsible for reviving spoken Hebrew, making up Semitic-sounding words to fill in gaps, etc. Before that point, Hebrew was as dead as Latin (religious use only), although Yiddish has a fair number of Hebrew words (and German, and Slavic).

      Besides German, I believe Russian and Yiddish were other popular choices for a national language, but each had its own political issues.

      No, I'm not Jewish. I just like languages.

      --
      Si la vida me da palo, yo la voy a soportar Si la vida me da palo, yo la voy a espabilar
  4. synthetic remix/rerendering of video speech by G4from128k · · Score: 4, Interesting

    Another approach is from some work I saw demoed at an MIT conference in Vienna. If you capture enough video of a person speaking, you can remix/rerender video of that person saying anything you want them to say. The software works at the phonetic level so you can even synthesize words that the person has never even uttered before and even make them appear to speak languages that they don't know. They had some visually convincing video showing people saying things that the researchers claimed they never said. Yes, the demo version worked with clean test video and a professional video/image analyst could probably spot a faked/remized video. But if these technology becomes good enough, I can see it making video a nontrustworthy source of data (like skillfully retouched photos).

    --
    Two wrongs don't make a right, but three lefts do.
  5. Re:Finally! by TheGSRGuy · · Score: 2, Interesting

    One step closer to the Universal Translator...

  6. Just a continuation of an older project... by fraber · · Score: 3, Interesting

    Hey,

    Alex Waibel was one of the leading scientists in the Verbmobil project in 1995. The technology was pretty interesting (maintaining probability "graphs" from the Markov speech analysis through the syntactic and semantic analysis).

    However, results were pretty poor due to the structure of the project (just too many people) and because many institutions really weren't interested in the project and went for their favourite research topic with a new name (that's how research in Germany works...). Perfectly possible that Mr. Waibel advanced with the topic, now 10 years after the first major trial...

    Personally, I actually gave up AI completely after the ESSLLI (European Summer School on Logic, Language and Information) and promised not to touch the subject again until there were a "unified" formalism incorporating the old "symbolic" approach (predicate logic etc.) and the new statistical methods (Bayes, Markov, ...). Such a combination would be suitable both to deal with large amounts of data (statistical) and to deal with negation (only available in the symbolic appoach).

    Maybe they've got it this time? It's a pitty they don't talk more about the underlying formalism.

    Btw., the electrodes are probably just an enhancement of the normal speech recognition software to get a better "signal".

    Bests,
    Frank

    http://www.project-open.com/

  7. Re:Nifty but... by ToasterofDOOM · · Score: 2, Interesting

    That's very different. It may seem complex, but the only reason your native tngue seems natural is that you have been exposed to it since you were born. Perl is very structured and strict as compared to a spoken language. I know it was a joke, but it is useful to note this point.

    --
    I am Spartacus
  8. Re:Nifty but... by PakProtector · · Score: 2, Interesting
    That's very different. It may seem complex, but the only reason your native tngue seems natural is that you have been exposed to it since you were born. Perl is very structured and strict as compared to a spoken language. I know it was a joke, but it is useful to note this point.

    Uh, my native tongue doesn't seem natural to me. Atleast, not anymore.

    I grew up speaking English, and at Tweleve I started learning Latin. Now I'm studying Japanese, and I've had extensive exposure to Spanish, Italian, and German, too. I don't know enough German or Italian to ask where the bathroom is, and I don't know a great deal of Spanish, but I understand alot of the grammar and other rules of those languages.

    English no longer seems natural, or even correct. There are several times a month when I will find myself having great difficulty trying to express a concept in English when I can express it easily in, say, Latin, or in a combination of Latin and English, or in Japanese (this happens sometimes for debates on shades of blue), or in pidgeon-Spanish. Or German, for some things.

    But the point is, English no longer seems natural. The more I learn of other languages, and their rules and mechanics, and the more things other than English I get crammed in my head, the more and more I can see English for what it is: A bastard, cobbeled together piece of Linguistic Crap, the Language equivalent of the Jeep sitting down in my parking lot that barely runs, but is easy to add 'functionality' to (mainly because there are so many holes in it it's easy to run new wiring for anything I want to add.

    --

    Edward@Tomato - /home/Edward/ man woman
    man: no entry for woman in the manual.
    "Qua!?"