Slashdot Mirror


Speaking in Tongues

Desert1 writes "Carnegie Mellon's renowned computer science department has developed a system which allows for conversation between two different languages called Tongues. Currently this has been used between Croatian and English, perhaps one day they will be able to develop one that will allow politicians to talk to normal folks and be understood." It's been in development for a while.

264 comments

  1. Now all we need... by jameslore · · Score: 2, Funny

    ...is handwriting recognition that can handle Doctor's handwriting.

    1. Re:Now all we need... by ceejayoz · · Score: 2

      I think that one's up there with teleporters and Dyson spheres on the current feasibility scale... Considering it's usually the first letter and a squiggly line for the rest of the word ;-)

    2. Re:Now all we need... by coryboehne · · Score: 2

      Well they have demonstrated it, so it may be somewhat more feasable than teleporters and dyson spheres, however they do have quite a few hurdles to jump before this is pratical. But just a thought, if this was adapted from politician to normal folk interpreting a one hour speech could be compressed to less then 10 seconds, something like the following:
      Hello and thank you for coming, I have virtually nothing useful to say, however I would like to point out that you do like me and I have the best ideas. We're going to do alot of things that really don't matter, then try to restrict your freedoms a little more, and, trust us, this really is for your own good. Please vote _______ in the coming election and have a wonderful day.

    3. Re:Now all we need... by codeButcher · · Score: 1

      For practical purposes, I keep it even simpler:

      Don't believe anything a politician is saying when his lips are moving.

      Except maybe when he calls a fellow-politician a liar.

      --
      Free, as in your money being freed from the confines of your account.
    4. Re:Now all we need... by Nighttime · · Score: 1

      Don't believe anything a politician is saying when his lips are moving.

      Except maybe when he calls a fellow-politician a liar.


      So, if politicians always lie, and one calls a fellow politician a liar...

      Argh! It's a new form of the Kirk logic used to defeat computers.

      --
      I've got a fever and the only prescription is more COBOL.
  2. How long.. by BlackCobra43 · · Score: 2, Funny

    until it can allow h@x0r5 and non-"l33t"s to communicate?

    --
    I never spellcheck and I freely admit it. Save your karma for more worthwhile "lol erorrs" replies
    1. Re:How long.. by bluethundr · · Score: 1

      when it can figure out that the words h@x0r5 = |-|4xX0rZ=h4x0r5=hAxX0|2z=...ah screw it. 'l33t' folk ain't usually that bright, so why bother? ;)

      --
      Quod scripsi, scripsi.
    2. Re:How long.. by Anonymous Coward · · Score: 0

      Damn it people! A 10 second sentence in 90 seconds?!?! You people wouldn't even give Esperanto a chance, would you?

  3. Take it to a Pentecostal meeting. by marko123 · · Score: 3, Funny

    Then again, you need to understand Holyspiritish before you can write the translator.

    --
    http://pcblues.com - Digits and Wood
    1. Re:Take it to a Pentecostal meeting. by Paradise+Pete · · Score: 1
      Giving x makes x worth more than taking x where x is anything

      Especially where x = "a shit".

    2. Re:Take it to a Pentecostal meeting. by Jogar+the+Barbarian · · Score: 2, Interesting

      Just a bit of info "from the horse's mouth" as it were... :)

      It is VERY RARE that glossolalia (speaking/praying in tongues) is comprehensible to any mortal man. Scripture refers to it as "groanings that cannot be uttered", and that when "[your] spirit prays, [your] mind is unfruitful". I take that to mean you don't understand what you're saying, either. I know I don't when I do it.

      However, there are scattered reports of someone delivering a message in tongues, which was followed up by the interpretation (as God commands there to be), but that the original message was comprehensible by one or more strangers who just "happened" to come to that specific church meeting, and heard speech from their foreign, exotic dialect. (a miracle)

      Messages in tongues are, IMO, distinct manifestations of the supernatural from merely "praying" in tongues. Praying in tongues I believe is was is described as being used to "edify your spirit", and is what Paul was referring to when he said "I thank God that I speak in tongues more than you all." That means he had a extraordinarily vibrant prayer life, one that was immersed in the supernatural. Messages, in contrast, are brought to edify an entire body of believers (and to "wow" the unbelievers), but only when it is accompanied by the interpretation... otherwise, it's just gibberish.

      So, Message + Interpretation functions the same as the spiritual gift of Prophecy, it's just a two-phase form of the same manifestation: a message from God.

      --
      3. Profit!
      2. ???
      1. On Soviet Slashdot, a Beowulf cluster of alien Natalie Portman overlords welcomes YOU!
    3. Re:Take it to a Pentecostal meeting. by ccwaterz · · Score: 1

      FYI: I think "Holyspiritish" is hebrew. Could be wrong though.

    4. Re:Take it to a Pentecostal meeting. by Anonymous Coward · · Score: 0

      >Then again, you need to understand Holyspiritish before you can write the translator.

      I don't remember which one of those TV evangalists it was, but one of them had perfect
      pronounciation of Enochian.

      And a description of that language can be found here: http://www.hermetic.com/browe/etemple.html

    5. Re:Take it to a Pentecostal meeting. by joto · · Score: 2
      No, no, no.

      Speaking in tongues like the pentecoastals do is evil. It's the devil that's behind it. We all know what the bible told us about the apostles speaking in tongues. Everyone could understand them. But when the pentecoastals speak in tongues, usually only 0 or 1 person understand it.

      I'm quite certain that they are all evil, and will go to hell when they die. The devil can take many shapes, and will often present himself as someone else (e.g. a preacher, a beautyful woman, or a politician) in order to lure people into his diabolic schemes. Be careful which sect you are going into!

    6. Re:Take it to a Pentecostal meeting. by Anonymous Coward · · Score: 0

      You obviously haven't been reading Romans, then. Or did St Paul go to hell as well?

    7. Re:Take it to a Pentecostal meeting. by dillon_rinker · · Score: 2

      It is VERY RARE that glossolalia (speaking/praying in tongues) is comprehensible to any mortal man.

      Read Acts 2. I would suggest that it's not incomprehensible to any mortal man...it's just that we rarely pray in the presence of groups of people from every nation under heaven.

    8. Re:Take it to a Pentecostal meeting. by dillon_rinker · · Score: 2

      Email your scripture-based argument to me.

    9. Re:Take it to a Pentecostal meeting. by marko123 · · Score: 1

      Damn straight, bro! Not like head.

      --
      http://pcblues.com - Digits and Wood
    10. Re:Take it to a Pentecostal meeting. by marko123 · · Score: 1

      Didn't realise this would be as contentious as vi vs. emacs.

      --
      http://pcblues.com - Digits and Wood
    11. Re:Take it to a Pentecostal meeting. by Anonymous Coward · · Score: 0

      This is getting out of hand ...

      And having said that, oughtylv blirth frawsttmiev gulash uitz!!! Sarava Ogum-lele Exu-cavera!!! ;-)

  4. Next step -- sign language? by blackcoot · · Score: 3, Insightful

    Looks like a fascinating project --- I wonder if their Vision and Robotics boys are working on recognizing sign language which, for all intents and purposes, seems to be a very much more difficult problem (don't believe me --- see how well the facial recognition packages do in production environments :-P). I wonder if this is at the stage where it could be attached to a something like a virtual {insert sign language of your choice here} "translator"... hrm, sounds like a summer project ;-)

    1. Re:Next step -- sign language? by Anonymous Coward · · Score: 0
      I wonder if their Vision and Robotics boys are working on recognizing sign language

      How do you know that all their researchers are male?

    2. Re:Next step -- sign language? by Anonymous Coward · · Score: 0

      Hi blackcoot. You are sexist.

    3. Re:Next step -- sign language? by Anonymous Coward · · Score: 1, Informative

      This has been done already I believe, at least to a limited extent, and with excellent accuracy. One of my Professors at GaTech, Thad Starner, was involved in a project for this and iirc the project worked with a better than 90+% accuracy.

      http://www.google.com/search?hl=en&ie=UTF-8&oe=U TF -8&q=Thad+Starner+Sign+Language

    4. Re:Next step -- sign language? by Anonymous Coward · · Score: 0
      How do you know that all their researchers are male?

      If you're going to be so fussy, I'll point out that he didn't actually say they were. Taken literally, he asked if the the subset of researchers who are male adolescents are working on it.

    5. Re:Next step -- sign language? by Anonymous Coward · · Score: 0

      You go/went to GaTech and can't handle a fucking HTML link? Shame on you!

    6. Re:Next step -- sign language? by Anonymous Coward · · Score: 0
      Taken literally, he asked ...

      Nah, it's just that the only girl on the project really gets on with everyone is regarded as "one of the boys"

    7. Re:Next step -- sign language? by CowboyBob500 · · Score: 1

      It's being done the other way around. A project I was involved with for my Masters in Comp Sci - http://www.visicast.co.uk/

      Bob

    8. Re:Next step -- sign language? by blackcoot · · Score: 1

      Neat project :-) My main interest is in the forward processes --- I'm hoping to do something involving gesture recognition for my M.S. and take a crack at recognizing sign language for my Ph.D., assuming that I survive that long. We seem to be a long way away from reliable pose guestimation, largely because it's really hard to reliably find joints (like knuckles) in images. The guys over at University of Maryland have a neat system called GHOST, but I'm not certain how well their technique will extend to detecting finer features (like fore-fingers), particularly when there are important details that are occluded.

  5. Brute Force by chill · · Score: 2, Interesting

    I've long wondered why someone doesn't just brute force translation.

    Create a human translated database of damn near EVERYTHING in two languages, like English and Spanish. Then, just do fast lookups.

    Computing power is such that this would be possible.

    --
    Learning HOW to think is more important than learning WHAT to think.
    1. Re:Brute Force by egomaniac · · Score: 2

      ...except that the exact same sentence can be translated differently, depending on the context.

      Speaker 1: Where are we going?
      Speaker 2: To the bank.

      If you're on a river, the meaning of "bank" is different than if you're in a grocery store.

      --
      ZFS: because love is never having to say fsck
    2. Re:Brute Force by PD · · Score: 1

      In English it is very uncommon to say "go to the bank" in the river context. The program could have a simple rule to display the most common option, unless a context rule was triggered, such as a mention of a river in the past N sentences, in which case it would also display the alternate/less used version.

    3. Re:Brute Force by Keck · · Score: 1

      I sorta doubt that computing power is such that brute forcing translation is possible for anything but very simple exchanges... Humans can infer context that a machine just *can't*, and likely won't ever be able to. Things like the time of day, current events, the weather, where the exchange is takign place, etc.

      Languages in general are filled with inconsistencies; they evolve so long as new generations learn them. So in the time it would take to create a 'brute force' data base, the language would probably have changed enough to make such data worthless.

      This is ignoring of course, the brute stupidity of wasting computing power on something that obviously doesn't lend itself well to brute force..

      --
      A computer without Microsoft is like ice cream without ketchup.
    4. Re:Brute Force by Anonymous Coward · · Score: 0

      Humans can infer context that a machine just *can't*, and likely won't ever be able to.

      Maybe...

      I think the soln is to build a generic language learning engine, (not unlike a small childs brain) and give it enough stimulus + feedback that it "works it out" by itself.

      OTOH, how do you know it got it right? Well, you don't - at least no more so than a person...

    5. Re:Brute Force by Tackhead · · Score: 2

      >..except that the exact same sentence can be translated differently, depending on the context.
      >
      > Speaker 1: Where are we going?
      > Speaker 2: To the bank.

      And in the vein of the submitter who wrote " perhaps one day they will be able to develop one that will allow politicians to talk to normal folks and be understood."...

      ...regardless of context, if a politician is Speaker #2, you're in deep, deep trouble. :)

      Besides, all political speech boils down to (a) brute force, and (b) the non-politician going to the bank to pay for it.

    6. Re:Brute Force by madman2002 · · Score: 0

      The main problem in translation is not individual words, but instead grammar. Ever wonder why people who learned English as a second langauge occasionally seem to mix up the words in a sentence? It's because thats how the idea is expressed in their langauge. A simple example would be the phrase "I want to go to Madison Square Garden" in spanish this phrase would be (roughly...I never was to good at Spanish) Yo tango ir el jardin de madison caudrado (yes I left out all punctuation). Which when translated word for word is I want to go to the garden of madison square. True we can infer the meaning of the word for word translation but in many cases it is not as simple.

      --


      http://www.gamedev.net/reference/articles/article1 015.asp A spin on the old, if Microso
    7. Re:Brute Force by Paradise+Pete · · Score: 1
      Create a human translated database of damn near EVERYTHING in two languages, like English and Spanish. Then, just do fast lookups.

      You can't just do a word for word translation, you need some context. The number of combinations is enormous.

    8. Re:Brute Force by Paradise+Pete · · Score: 1
      The program could have a simple rule to display the most common option, unless a context rule was triggered

      If you have context rules then it's not brute force, which is the idea to which he was responding.

    9. Re:Brute Force by ObitMan · · Score: 0

      you obviously do not boat or fish much.
      I and others I know have many times used "to the bank", "over by the bank", "put it on the bank", etc...

      --
      Who run Barter Town?
    10. Re:Brute Force by ragnarok · · Score: 1
      OTOH, how do you know it got it right?

      absence of puzzled looks from the natives is a good start.

      --
      Search first, ask questions later.
    11. Re:Brute Force by quinto2000 · · Score: 1

      Ha ha. You forget that human language is productive. This means that the set of well-formed sentences can be put in a one-to-one correspondence with the integers, if I remember my linguistics correctly. What this means: there are an infinite number of sentences that you would need to translate. This is a very tough problem to solve with a lookup table approach (incidentally, this is part of why Searle's Chinese Room Problem is a ridiculous construction). I think that you'd find it hard to even get a useful set of sentences to carry 90% of human conversation. Think about how useless those phrase books are when you're asking even the simplest of questions in another language.

      --
      Ceci n'est pas un post
    12. Re:Brute Force by Anonymous Coward · · Score: 0

      "...this is part of why Searle's Chinese Room Problem is a ridiculous construction"

      Not that I disagree with your sentiments, but technological/material constraints are not a factor in philosophical thought problems.

    13. Re:Brute Force by Anonymous Coward · · Score: 0
      In English it is very uncommon to say "go to the bank" in the river context.

      I would say that roughly 100% of times someone uttered 'lets go to the bank' when I was on a boat in a river, it was said in 'the river context.' Nearly (if not) all those times, it meant lets go to the bank of the river.

    14. Re:Brute Force by forgoil · · Score: 3, Insightful

      And how about words that doesn't even exist in the target language? How about three of them in the same sentence? Would you like to insert a few sentences of explaination, including a few paragraphs of cultural references?

      The only translation softwares that I have seen has either been very faulty (babelfish) or very simple (ordering tickets). Every time I have spoken with a linguist they have given me reason after reason why it would be very hard, if not impossible, to translate from very different languages (English->Swedish is probably possible, even though you would sound like a complete moron after a while, but Japanese->English would be much harder).

      I think that the science and research is important, but I will retain a healthy sceptisism towards any "perfect" systems popping up anytime soon.

    15. Re:Brute Force by chill · · Score: 1

      I wasn't thinking word for word translation.

      In your example, the translation of the phrase "I want to go to " stays the same in the target language regardless of the noun, almost all the time. [You actually translated it is 'I have to go to', btw]

      "[Yo] quiero ir a " works for almost all cases of "I want to go to ".

      Phrases would change slightly depending on context, but that could be compensated for.

      What I am thinking of would doubtlessly work much better for non-interactive written words than interactive conversation.

      Finally, to the person who claimed there would be an almost unlimited number of sentences to translate. In theory, yes but in reality, no. Remember the article on the conversation bot? The researcher discovered that 90% of conversations use only a few words in very similar structures. There may be 100,000+ words in the English language but most people use less than 10,000 on a regular basis. It wouldn't be as massive a job as you suggest.

      --
      Learning HOW to think is more important than learning WHAT to think.
    16. Re:Brute Force by dillon_rinker · · Score: 2

      To the bank.
      "Either to the river's edge or the money lenders."

      That ambiguity exists in English, too...it just takes longer if you make it explicit. The listener should still be able to figure it out; context interpretation is only a problem for machines. I think a human being would have no more problem with my translation that with the original sentence. Although the typical American is going to look at you a bit strangely if you use my translation.

    17. Re:Brute Force by dillon_rinker · · Score: 2

      How about three of them in the same sentence? Would you like to insert a few sentences of explaination, including a few paragraphs of cultural references?

      Yes. Why is this a problem? FWIW, though, if I'm speaking with someone who doesn't speak English well, I try to formulate sentences to use only simple words. Compare...

      Well, barkeep, some of your finest nutty brown ale.

      I buy beer?

    18. Re:Brute Force by Koyaanisqatsi · · Score: 1


      The listener should still be able to figure it out; context interpretation is only a problem for machines

      But what if the word "bank" have a different translation on the target language depending on the context? In English bank can be used to refer to the edge of the river or the money lenders, no problem. But what if these two distinct concepts are expressed by different words on the other language?

      The same problem happens the other way around. I speak Portuguese and there are so many things like that, you have to apply context when translating something to English. Quick example: the work "casa" can be both the noun house/home or the verb "to marry". Go figure it :-)

    19. Re:Brute Force by kalidasa · · Score: 2, Interesting

      I've long wondered why someone doesn't just brute force translation. Create a human translated database of damn near EVERYTHING in two languages, like English and Spanish. Then, just do fast lookups. Computing power is such that this would be possible.

      Because natural languages don't work like that. Natural language translation is an AI problem, it cannot be solved with brute force. Hell, translation by human agents is nearly intractable. This is a problem which has been worked on since the 1960s (if you think Babelfish is bad, you should understand that it's superb compared to the early efforts).

      The syntaxes of natural languages do not map 1:1. For instance, some languages are ergative - they use agents rather than subjects to explain who is doing what. Though it sounds like all you'd have to do is map Tamil::Agent = English::Subject, and restructure the rest of the statement, it doesn't work that way. Vocabularies are the same way: each word has a semantic field which overlaps the semantic fields of other words in the same language, and overlaps the semantic fields in other languages, but almost never is the semantic field of one word in one language the same as that of another word in a second language. Try to translate the word "know" to French or "love" to ancient Greek and you'll understand the problem (French uses a different word depending on whether what you know is a person or an idea, e.g., while in Greek the words eros, philia, and agape all refer to different concepts that English speakers describe as "love").

      It can't be translated the way suggested, either, with "bank" being translated as either "river's edge" or "money storage place" - leaving aside that the second definition is culturally contingent, there could be connotations in either phrase which are distinct from those in the source phrase. There are also problems with ambiguity - natural languages are by definition laden with ambiguities and contingencies, which require the ability to reason to find some corresponding structure in a target language.

      This is a problem which will continue to be worked on for pretty much all our lives. If it's ever solved, HAL won't be far around the corner.

    20. Re:Brute Force by quinto2000 · · Score: 1

      Searle asserts that a person in a room doing lookups in an enormous book would not be intelligent, but then claims that a computer that could pass the Turing test is equivalent to the Chinese room. Well, if it takes an infinite amount of time for a response, I think we would conclude that whatever was on the other end of the conversation was not intelligent. Time constraints/feasibility are VERY important in this case. I would argue that the "Chinese Room" couldn't exist, and couldn't possibly fool a person on the other end, so comparisons to the Turing test are completely invalid.

      --
      Ceci n'est pas un post
    21. Re:Brute Force by dillon_rinker · · Score: 2

      Since "bank" has two meanings in English, you don't translate it; instead, you translate "river's edge or moneylender" into the other lanaguage. When translating "banco" from Spanish to English, you'd translate it as "bank (place where you save money)."

      Either way, if the ambiguity is known, it can be explicitly worked around and the listener can decide which of two ambiguous meanings to use. A good machine translator ought will point out ambiguous words and suggest synonyms.

  6. Universal Translator? by Binome · · Score: 1

    Only when the program can quickly and seamlessly convert a hitherto unknown language into everyday English will I be satisfied. The differences between Star Trek and reality must continue to dwindle.

    But hey, they could at -least- program Klingon into it.

    --
    In Soviet Russia, Beowulf cluster imagines you!
    1. Re:Universal Translator? by Anonymous Coward · · Score: 1, Insightful

      It always amazed me that the Star Trek univeral translators could translate a half completed sentence to the same meaning in all languages, even though each language would certainly arrange sentence structures very differently.

    2. Re:Universal Translator? by ObitMan · · Score: 0

      It's modern technology William!

      --
      Who run Barter Town?
    3. Re:Universal Translator? by Binome · · Score: 1

      Plus, there's the fact that all languages move their lips exactly the same way to pronounce their words as they would to speak the english translation. Absolutely amazing, eh?

      --
      In Soviet Russia, Beowulf cluster imagines you!
  7. technical link by madenosine · · Score: 4, Informative

    if you arent satisfied with the pc magazine summary, you can read this

  8. Esperanto... by Average · · Score: 5, Interesting

    In the pre-computer days, some folks noticed that a neophyte (basic idea, needs dictionary)translation into Esperanto was much more comprehended at the other end than a neophyte translation to the destination language or a neophyte translation by the recipient.

    The reasoning was that the process of translating into a more formal mechanical language clarified and codified ideas.

    Once again, it's the dividing line between human and machine that's the problem. Millions of people train themselves to C or the shells. Fewer to assembly. But it takes some wetware work to push the human/computer boundary closer to the computer.

    Like most programming has a learning curve, usually less than ASM, leaving language translation completely to the machine will be fraught and ambiguous. Good translation requires some push from normal speech, but maybe not so far as mastering every other possible language...

    1. Re:Esperanto... by Anonymous Coward · · Score: 0

      Are you suggesting that we learn esperanto? Or that machines translate into Esperanto and then into a foreign language? Would this really be more beneficial?

    2. Re:Esperanto... by Etcetera · · Score: 2

      Yes, he's suggesting that the machines use Esperanto as the middle language.

      I think it make's sense. Esperanto was pretty much designed exactly for that purpose.: a "middle ground" that would be (relatively) easy for any person who currently speaks any language to learn and use decently. The fact that it's longer and more cumbersome to use (and that there was no native populace using it) seems to have killed its use in the public domain, but a machine doesn't care about that.

      May not be the most efficient -- like converting a C program to Turing-machine codes and then converting that back into a Pascal program -- but it's at least a valid approach given sufficient HW is thrown at it.

    3. Re:Esperanto... by Anonymous Coward · · Score: 0

      Pardon my ignorance, but wtf is esperanto?

    4. Re:Esperanto... by Anonymous Coward · · Score: 0

      If you are so ignorant that you can't take the 1.63 seconds it takes to run a Google search then you don't deserve to know the answer.

    5. Re:Esperanto... by Anonymous Coward · · Score: 0

      It's good to see people making the analogy between human languages and computer languages. I think it works the other way too in that a lot of what computers do should be seen in a human context. We should code systems that address things like nouns, verbs, adjectives and so forth. I've argued with many in the field who think there is no relation though.

    6. Re:Esperanto... by Anonymous Coward · · Score: 0

      Searched the web for esperanto. Results 1 - 10 of about 739,000. Search took 0.05 seconds.

    7. Re:Esperanto... by Anonymous Coward · · Score: 1, Informative

      I volunteered for the past year full-time in the central office of World Esperanto Association (an internationally recognized NGO in official relations with the UN and UNESCO), and in such a place you can easily see that Esperanto is very much alive and in use. Working in its youth department, my job involved travelling quite a bit, going to various Esperanto events throughout the U.S. and Europe, and it's clear that many young people are learning it, keeping its population of speakers fresh.

      As far as the Tongues project goes, I know a U.S. army chaplain who was picked to test the technology (the first people to use this will be the U.S. Army, especially chaplains and medics), and he said it's pretty rusty, but shows promise. Maybe a middle step in Esperanto would improve such a project, though.

    8. Re:Esperanto... by BrookHarty · · Score: 2

      Slighty off topic, but the first sci-fi books I read, Stainless Steel Rat series. The character Slippery Jim diGriz traveled to other planets, and the main spoken language was Esperanto.

      BTW, the series rated rather high, so if you like Sci-Fi, you might really like these books.

      A young kid wants to be a famous thief, how do you do that in a computer, data driven society?

    9. Re:Esperanto... by Anonymous Coward · · Score: 0

      Assembly language is not more difficult to learn than other programming languages - in fact it is easier - it is just much more work to use in practice.

      Knowing assembly language is essential for programmers who want to know what they're doing, whether they use it or not; otherwise too many fundamental concepts are lost in the abstraction provided by other languages.

    10. Re:Esperanto... by Latent+Heat · · Score: 1

      Spanish-sounding words with Finnish grammer. If you saw the movie Gattica, you heard Esperanto spoken on the PA system where the main character worked.

    11. Re:Esperanto... by kalidasa · · Score: 2, Interesting

      The problem with this is that Esperanto doesn't have enough resources to map syntactical and vocabulary fields from all possible source languages (and all parts of any given source language) to all possible target languages (or every part of any given target language). And the adding the resources would be a relatively intractable problem. This is an AI issue, not something that can be solved entirely within the machine.

    12. Re:Esperanto... by Anonymous Coward · · Score: 0

      If you saw the movie Gattica...

      oh christ. GATTACA. There is no DNA base starting with "i". Does this need to be pointed out every time someone mentions that movie? (actually, no, i guess it probably doesn't. owell).

    13. Re:Esperanto... by cosyne · · Score: 1

      ohmigod. not to be a whiny bitch, but maybe you could have tried that in esperanto first? or whatever it takes to write in readable english. your sentences make my head swim, dude.

    14. Re:Esperanto... by tgeller · · Score: 2

      AC writes:

      >I volunteered for the past year full-time in the
      >central office of World Esperanto Association

      What a coincidence! I volunteered there for six months, 1991-1992. Malgranda mondo, c'u ne? :)

      --
      Tom Geller
    15. Re:Esperanto... by Etcetera · · Score: 2


      I believe in Phillip Jose Farmer's Riverworld Series first book To Your Scattered Bodies Go*, the populace eventually settles on Esperanto as a universal language.

      If you're not familiar with it, it's a surreal sci-fi book involving the mass simultaneous resurrection of all 36 Billion people who have ever lived on Earth onto a strange world composed of one very long river valley.

      Rather interesting concept story (though the last two books were horrible).

      *Affil. link goes to a non-profit.

  9. Not quite real translation... by Theaetetus · · Score: 5, Interesting
    The team built the system's translator using a technique known as example-based machine translation. Essentially, they created a database that holds a massive list of English phrases and their Croatian equivalents, culling data from bilingual Internet sites and university textbooks. When the engine receives a text phrase in one language, it provides the equivalent text in the other.

    So, basically, it's a lookup function, translating the incomming speech and then comparing in a database... So, while they could have a huge dictionary that could cover most situations, they aren't really doing a 'translation' per say...
    Although, then again, for anyone who has taken language classes, but are not fluent in the second language, isn't that what we do? I know that while I was taking French and Latin, to come up with phrases I would do phrase translations because I was still thinking in English. I wasn't fluent enough to think in those other languages, so I couldn't formulate phrases directly properly.

    I suppose, in essence, this will work as a translator, but it is neither a babel-fish type universal translator nor is it any replacement for fluency.

    Still cool, though. Now, can they get it to run on a Palm?

    -T

    1. Re:Not quite real translation... by Anonymous Coward · · Score: 0
      they aren't really doing a 'translation' per say...

      Per say?! Sheesh - I hope you don't think that means something like "each time it's said."

    2. Re:Not quite real translation... by charlie763 · · Score: 1

      He got a mod 5 because he said "babel-fish"

      --
      Welcome to the land of the free...pay toll ahead...no photography...please open your bag...
    3. Re:Not quite real translation... by KewlPC · · Score: 1

      It was a typo. Most likely he meant "per se" (which is pronounced the same as per say).

    4. Re:Not quite real translation... by Entrope · · Score: 1

      I did some GUI and integration work for an earlier version of the same technology (the DIPLOMAT project at CMU's LTI). It uses a multi-engine translation approach; some translation is done by looking up whole (simple) sentences, some by pure dictionary lookup, some by morphology (un-declining nouns and un-conjugating verbs, basically). Once all of those are done, it runs the output through a Markov model to figure out which "looks" the best (in comparison with human-generated text in the target language). It doesn't always generate perfect output, but it's generally easy to understand.

    5. Re:Not quite real translation... by scruffy · · Score: 2

      It is probably not just lookup, but lookup plus some kind of pattern matching. If phrase X is close to phrase Y, then a translation of phrase X is probably a good start for translating phrase Y.

    6. Re:Not quite real translation... by Dephex+Twin · · Score: 1

      This is pretty much the way things are done in translation these days. The translator uses lookups in large databases, proximity of words to one another, frequency of words, etc. Doing anything more would essentially require a computer to be able to understand the meaning of what you are saying (and probably have social and historical knowledge as well), which would be a great achievement in and of itself.

      mark

      --

      If you want to make an apple pie from scratch, you must first create the universe. -- Carl Sagan
    7. Re:Not quite real translation... by kalidasa · · Score: 1

      So in other words the damned thing isn't actually doing any translation; it just matching column A with column B. Worthless. It can only "translate" something that's already been thought up and translated by a human. Which means it can only be used with a very limited repetoire. And the nature of human language is that it's repetoire is unlimited; it is theoretically finite, but it is a finity that approaches infinity (see Borges' story "The Library of Babel" to understand what I mean).

    8. Re:Not quite real translation... by Theaetetus · · Score: 1
      True, true... So how about creating a language database online? This was attempted a few years back as a knowledge database (Does anyone know what the website for this was? Is it still active?) wherein people would ask questions which were then sent to multiple 'experts' (made up of other members who are knowledgeable in certain areas). The experts would reply, and then the asker would rate the responses as being helpful/not helpful, etc.
      The plan was to eventually create a knowledge database in which asking a question would first search through the archives for past matching questions and return the most helpfully rated answers... If that didn't solve your problem, your question would be sent on to be answered by real people.

      Couldn't something like this be done with translations, where bi-fluent or tri-fluent people get handed phrases and make translations for them, including local dialects, colloquialisms, etc., and rate the value of others' translations?

      It would take a while to build the database, but eventually, you'd have a really kick-ass translator.

      -T

    9. Re:Not quite real translation... by kalidasa · · Score: 1

      It would take decades to get something usuable, I'm afraid (I'm working on a project doing an online searchable annotated translation of a single dictionary, and after four years of distributed volunteer work, we're almost 1/3 done; and this is just 1 book!).

    10. Re:Not quite real translation... by Theaetetus · · Score: 1
      True, but don't you think you could get a larger, world-wide base of volunteers for a translation machine? Particularly since you don't need dedicated volunteers... Even if they only do 10 entries each, and you only get a quarter of one percent of the world's population (still 15 million), that's 150 million entries.

      I wonder how many bilingual people there are on slashdot alone. ;)

      -T

    11. Re:Not quite real translation... by kalidasa · · Score: 1

      Even if they only do 10 entries each, and you only get a quarter of one percent of the world's population (still 15 million), that's 150 million entries.

      That's the math we did at first, figuring we'd get about 200 volunteers to do 30,000 entries. In the end, though, over 75% of the work is being done by half a dozen volunteers.

    12. Re:Not quite real translation... by Theaetetus · · Score: 1
      200 volunteers to do 30,000 entries is 150 each, though... That's a lot of entries. Now, if you could get 3000 volunteers...

      It's tough to keep people interested and into the work for extended periods of time, but it's not that hard to get people into it for a few minutes. Hell, I think with /. effect and a good stable server, we could get tens of thousands done in the first day...

      Then we just keep reposting the story every few weeks. :D

      -T

    13. Re:Not quite real translation... by kalidasa · · Score: 1

      200 volunteers to do 30,000 entries is 150 each, though... That's a lot of entries.

      Well, you have to remember that you need people who are sufficiently fluent in both languages. For the project I'm talking about, that's less than 10K-20K people. (Hint - the project I'm talking about is a translation of a work you mention on your homepage.) We figured getting 200 or so would be about right.

      Remember, not everyone who is bilingual has the langauge skills to do translation properly. And you have to understand how translation works.

    14. Re:Not quite real translation... by Theaetetus · · Score: 1
      Well, you have to remember that you need people who are sufficiently fluent in both languages. For the project I'm talking about, that's less than 10K-20K people. (Hint - the project I'm talking about is a translation of a work you mention on your homepage.) We figured getting 200 or so would be about right.

      Heh. It's all Greek to me. ;)

      Yeah, to farm this out to more people would require some sort of peer review process to be sure that the translations are accurate ones. I still think that a distributed translation would be valid on a global scale, though the processing to handle checking and re-checking would be intensive.

      We can hope, no?

      Good luck with the translations.

      -T

    15. Re:Not quite real translation... by kalidasa · · Score: 1

      Yeah, to farm this out to more people would require some sort of peer review process to be sure that the translations are accurate ones. I still think that a distributed translation would be valid on a global scale, though the processing to handle checking and re-checking would be intensive.

      We've got that.

      Good luck with the translations.

      Thanks

  10. my hovercraft is full of eels by Anonymous Coward · · Score: 0


    "Drop your panties, Sir William; I cannot wait 'til lunchtime"

    'Tis 'bout time that 'twas translated into Croatin, aye!

  11. I don't need one. by Mr2cents · · Score: 1, Informative

    I have a fish in my ear.

    --
    "It's too bad that stupidity isn't painful." - Anton LaVey
    1. Re:I don't need one. by PaleBoy · · Score: 0, Offtopic

      boo that the parent of this post got modded down. Hitchhiker's Guide was a great book, great game.

      --
      ------ What's sadder than realizing you've filtered out your own comments?
    2. Re:I don't need one. by Anonymous Coward · · Score: 0

      Because inevitably someone will think that this is overrated, because they don't "get it":

      Read The Hitch-hikers Guide to the Galaxy by the late Douglas Adams. All language translation is taken care of by inserting one (1) Babel fish inside the ear canal of the noncomprehending listener. The Babel fish feeds off of excess brain waves of the host, and in return, emits the brain wave patterns necessary to comprehend the incoming speech.

      This, coincidentally, is the cornerstone of the proof that God does not exist. The Babel fish is so incredibly bizzare and inexplicable that it could not have arisen from evolution; it is evidence that there is a supreme Creator. But because there is proof, there is no need for faith; and without faith, God is nothing. Therefore, because of the Babelfish, God can not exist. QED.

    3. Re:I don't need one. by ObitMan · · Score: 0

      Do not forget the end of the argument:
      Man seeing how easy that bit of logic was goes on to prove that Black is White and gets himself killed at the next zebra crossing.

      --
      Who run Barter Town?
    4. Re:I don't need one. by Anonymous Coward · · Score: 0

      Here's the full quote from The Hitchhiker's Guide. Enjoy!

      "Ford," he said.

      "Yeah?"

      "What's this fish doing in my ear?"

      "It's translating for you. It's a Babel fish. Look it up in the book if you like."

      He tossed over The Hitchhiker's Guide to the Galaxy and then curled himself up into a foetal ball to prepare himself for the jump.

      At that moment the bottom fell out of Arthur's mind.

      His eyes turned inside out. His feet began to leak out of the top of his head.

      The room folded flat about him, spun around, shifted out of existence and left him sliding into his own navel.

      They were passing through hyperspace.

      "The Babel fish," said The Hitchhiker's Guide to the Galaxy quietly, "is small, yellow and leech-like, and probably the oddest thing in the Universe. It feeds on brainwave energy not from its carrier but from those around it. It absorbs all unconscious mental frequencies from this brainwave energy to nourish itself with. It then excretes into the mind of its carrier a telepathic matrix formed by combining the conscious thought frequencies with nerve signals picked up from the speech centres of the brain which has supplied them. The practical upshot of all this is that if you stick a Babel fish in your ear you can instantly understand anything said to you in any form of language. The speech patterns you actually hear decode the brainwave matrix which has been fed into your mind by your Babel fish."

      "Now it is such a bizarrely improbable coincidence that anything so mindboggingly useful could have evolved purely by chance that some thinkers have chosen to see it as the final and clinching proof of the non-existence of God."

      "The argument goes something like this: 'I refuse to prove that I exist,' says God, 'for proof denies faith, and without faith I am nothing.'"

      "'But,' says Man, 'The Babel fish is a dead giveaway, isn't it? It could not have evolved by chance. It proves you exist, and so therefore, by your own arguments, you don't. QED.'"

      "'Oh dear,' says God, 'I hadn't thought of that,' and promptly vanished in a puff of logic."

      "'Oh, that was easy,' says Man, and for an encore goes on to prove that black is white and gets himself killed on the next zebra crossing."

      "Most leading theologians claim that this argument is a load of dingo's kidneys, but that didn't stop Oolon Colluphid making a small fortune when he used it as the central theme of his best-selling book Well That About Wraps It Up For God."

      "Meanwhile, the poor Babel fish, by effectively removing all barriers to communication between different races and cultures, has caused more and bloodier wars than anything else in the history of creation."

    5. Re:I don't need one. by Anonymous Coward · · Score: 0

      I can't hear you. I have a banana in my ear.

  12. Just begging for an infection vector, aren't we? by ruhk · · Score: 2, Funny

    Oh great, just what we need: a machine/program that makes it easier for us to snow crash. I'd like to play with this a bit, and find out where it's rough edges are--especially running translated output back through, a la the Babelfish.

    --



    404 Error: .sig not found.
  13. I am Sam! by Anonymous Coward · · Score: 0

    I would not, could not, in a boat
    I would not, could not, with a goat

    Sam I Am you let me be!

  14. Similar tech... by Radish03 · · Score: 1

    ...perhaps one day they will be able to develop one that will allow politicians to talk to normal folks and be understood.

    I don't think there's that much problem understanding what politicians say; it's just a lot of times they aren't very "accurate" with what they say.

    1. Re:Similar tech... by Anonymous Coward · · Score: 0

      Translating from politician-speak to normal english would be easy. Politician-speak is zero-content, so:

      char *political2Normal( char *politicianSentence )
      {
      return "";
      }

  15. Long way to go by puto · · Score: 4, Informative

    I dunno if computer translation is going to be up to par for a long time.

    I speak both Spanish and English. English is native and Spanish is due to 3 years in South America. And my grandparents are from Spain. I did not really know anything until I lived in Colombia and my granny who has Phd in her own language was a pretty harsh mistress. I was 21 years old when I learned. Of course living with a Colombian sysadmin girl for two years was a big help. She liked the Penguin.

    Languages differ too much from location to location. Justlike English in regions in the US. I am from New Orleans and the english changes from neighborhood to neoghborhood.

    Word meanings and expressions might be exactly the same in spelling and sound but mean different things to different people.

    To build these variables into software would be a *HUGE* task.

    I think the best we could hope for is software that does a decent brute translation and then a human does the final edit.

    The problem is one word might be ok to use in Puerto Rico(well they are confused about which language they speak) but socially unacceptable in Colombia. Software cannot know the difference.

    People will always do the translation gig better.

    Puto

    Course my handle is pretty bad in any Latin country.

    --
    The Revolution Will Not Be Televised
    1. Re:Long way to go by Theaetetus · · Score: 1
      The problem is one word might be ok to use in Puerto Rico(well they are confused about which language they speak) but socially unacceptable in Colombia. Software cannot know the difference.

      Id est - during the Gulf War, American soldiers were cautioned not to use the word "dude" when referring to each other or their Arabic allies... Apparently, in Arabic, 'dude'='worm'.

      However, this could be built into the system, at the expense of efficiency... Translate from English to Arabic... Then translate back for 'proofreading' by the English speaker... When he hears "Hey, worm, thanks a bunch!", he can cancel the translation, and try a new phrase instead.
      This would take a hit to "real-time" communications, however.

      -T

    2. Re:Long way to go by achurch · · Score: 3, Interesting

      People will always do the translation gig better.

      Oh, absolutely. I'm bilingual as well (Japanese and English), and particularly with Japanese, the language itself is so ambiguous that even native speakers don't always understand each other--you can imagine how difficult that made it to learn the language. ;)

      But I don't think the point of machine translation is necessarily to get a perfect translation out; for that, the machine would have to be able to think like a human, and that would bring up all sorts of difficulties I don't even want to touch. But if the computer can do a good-enough translation, then the humans involved can figure out the rest. For example, another poster suggested the ambiguity of "bank"--a place where you store money vs. the edge of a river--but even if the machine translation got it wrong, the humans involved could figure things out in the end. (You could say "the edge of the river" instead, for example.)

      I'm personally looking forward to progress in machine translation. While there will never be any substitute for actually learning and understanding a foreign language, realtime translation could go a long way toward improving intercultural understanding, and could help stem the loss of languages due to the spread of English and other "core" languages.

    3. Re:Long way to go by Dominic_Mazzoni · · Score: 2

      However, this could be built into the system, at the expense of efficiency... Translate from English to Arabic... Then translate back for 'proofreading' by the English speaker... When he hears "Hey, worm, thanks a bunch!", he can cancel the translation, and try a new phrase instead.
      This would take a hit to "real-time" communications, however.


      Actually I saw a demo of Tongues at CMU a couple years ago and that's exactly what it did.

      Person A speaks a sentence. The computer displays the phrase, translates it to language B, translates that back to language A, then displays that sentence on the screen. If person A approves, he/she passes the computer to person B, who can then hear the sentence in language B.

      I'm assuming that's why the article says that it takes more than a minute to convey a ten-second sentence.

    4. Re:Long way to go by Anonymous Coward · · Score: 0

      living with a Colombian sysadmin girl for two years was a big help

      Yowza. It would solve all my problems anyway!

      I'm jealous, you big ... puto :)

    5. Re:Long way to go by sasha328 · · Score: 2
      Word meanings and expressions might be exactly the same in spelling and sound but mean different things to different people.

      I agree with both Theaetetus and Puto about the differences in word meanings within the same language or language group. I sometime do some consulting for a multiligual publisher, and he reckons that translastions for a certain audience must be done by a person from that audience. The differences don't become obvious until you do a back translation by a different translator. Some of the mistakes are very funny.

      Besides, I think using machine generated translations as a basis for proofreading by a real translator is just as time consuming and resource intensive as a real translation;

      I think we have to wait for contact with the Vulcans before we can have good real-time translators. At least this is a good start though.

    6. Re:Long way to go by Paradise+Pete · · Score: 1
      People will always do the translation gig better.

      Always is a long time. For a while, yes, but not anywhere near forever.

    7. Re:Long way to go by Anonymous Coward · · Score: 0

      Justlike English in regions in the US. I am from New Orleans and the english changes from neighborhood to neoghborhood.

      But.. you speak American, not English.

    8. Re:Long way to go by Brainchild · · Score: 1
      Id est - [...]

      "That is"? I think you meant exemplum gratia, "for (the sake of) example". (Impressive, however, that you actually knew what i.e. expands to).

      This illustrates another problem with automated language translation: languages change over time. New words appear, old ones change their meaning, and really old ones are sometimes forgotten completely.

      Even "dude" used to mean something quite different (male who tends cattle from horseback) from it's current use today as pal, chum (as a term of address, e.g., "Phreak, dude-dude-dude-dude-dude, i gotta---" --"Get ahold of yourself Joey! One more 'dude' and i'm gonna have to slap the shit outta you!") or male of at least the age of puberty (in the third person, e.g., "She's buff ... ballsy." --"Let's keep her ... waste the dude").

      Note how i prove my sophistication by demonstrating proper use of the abbreviation e.g.. ;)

      --

      :: "I am non-refutable." --Enik the Altrusian ::

    9. Re:Long way to go by quinto2000 · · Score: 1

      The point is, human translators may be better, but they're a lot more expensive. This is meant to fill in the gap in emergencies or urgent situations. Allowing nurses to talk to wounded soldiers, etc.

      --
      Ceci n'est pas un post
    10. Re:Long way to go by scot4875 · · Score: 1

      Man -- never refer to a granny as a harsh mistress. *Especially* when it's your own.

      --Jeremy

      --
      Jesus was a liberal
    11. Re:Long way to go by twinpot · · Score: 1

      Problem exists a lot in "English" too. Fanny is one sure to create a problem between US English and UK/Oz/NZ English.

  16. Idiomatics by nfras · · Score: 2

    I think it is very interesting that it works by using phrases rather than individual words. Most translators in the past have used words and that leaves room for error with idiomatic phrases such as "window shopping" (the french equivalent translates as "window licking").
    Maybe it would be a good idea to put something on the web and let us test it, at least without the speech components.

    --
    You call me a pedant? I prefer the term "correct"
    1. Re:Idiomatics by Theaetetus · · Score: 1
      Excellent point... You know, it could be done with phrase tables made by bi-fluent speakers... and if your phrase doesn't match one in the table, it could query you to rephrase, or select the closest match... Then, you wouldn't have to worry about mis-translations.

      -T

    2. Re:Idiomatics by evalhalla · · Score: 1

      There are some problems in this approach: first of all you would need an huge amount of phrases in your table, and even worse you can't simply check for exact match, but you should look for similar matches, because in most languages there are at least two similar ways to say something, whith the same translation in another language.

      I don't think that we're going to have enough computational power to do so in the few next years, at least not in some device that can be easily carried everywhere.

    3. Re:Idiomatics by Theaetetus · · Score: 1
      Well, then again, you can trim down your phrase table somewhat by not including professional or regional dialects and just focus on the core language (do you really need to include the word 'monitor' to mean a) a video screen, b) an audio speaker, c) to observe a signal, d) an ironclad steamship from the Civil War?)

      Best yet, would be a phrase table that could learn from you, in which it would query back if it doesn't have an exact (or very close) match for your phrase, and ask you to rephrase, and then add that to the table as a comparison for the original:

      "What's up?"
      [rephrase please]
      "How are you?"
      ["what's up?" added to database]
      Translation: "Ca va?"

      -T

  17. Translations by bloatboy · · Score: 2, Funny

    One of the most useful ones, now with all the scrutiny in the business world will be the translation from any kind of management speak/weaselease into english.

    Corp officer: We are commited to stringent compliance with accounting rules and will not tolerate anything less than the pure truth.

    Translation: We're covering our rears as fast as we can.

    Or to steal one from Dilbert...

    Management: Employees are our most valuable resource.

    Translation: (nothing)

    1. Re:Translations by Tackhead · · Score: 2
      > Corp officer: We are commited to stringent compliance with accounting rules and will not tolerate anything less than the pure truth.
      >
      >Translation: We're covering our rears as fast as we can.

      Close, but wrong. That one means "Sell your stock now, because while we're committed to compliance, we haven't achieved it.

      It's what's not said that counts.

      > Or to steal one from Dilbert...
      >
      > Management: Employees are our most valuable resource.
      >
      > Translation: (nothing)

      Again, close but not quite "nothing".

      Translation: "We're laying some of you off. Go to fuckedcompany.com to see if you need to start looting now, or if you can wait a week to start looting."

  18. already available in handheld units by js7a · · Score: 5, Informative
    A St. Petersburg, Russia company called Ectaco has been selling bidirectional handheld speech recognition-based translation systems called Universal Translators.

    They have them in English-Russian and English-German at present, but apparently plan to add more languages all the time. Their unidirectional models ("UT-103") handle about eight languages currently.

    1. Re:already available in handheld units by Anonymous Coward · · Score: 0

      The Ectaco translation is also very bad, has extremely limited vocabulary and set of phrases, and misrecognizes the input constantly.

  19. Tower of Babel? by Anonymous Coward · · Score: 0

    Nothing like technology undoing God's old testament work!

  20. Low Expectations by limekiller4 · · Score: 1

    Heck, I'd be happy if it would just let me understand my girlfriend.

    --
    My .02,
    Limekiller
    1. Re:Low Expectations by pantherace · · Score: 1
      > It's much easier to mod me down than to post an intelligent reply.

      naw, not for me (ok, ok, it is because I have no mod points right now. :) )

      And for those of you like some people I know, that was a joke. or perhaps a rejoke because it is joking on a joke, or maybe I am just tired and rabling on. probably, time to sleep or have caffene, though sleep would be better.

      This post of a higher INTELecual level than most dotslash comments. Personally, I prefer the ALPHA comments.

    2. Re:Low Expectations by Anonymous Coward · · Score: 0

      Understanding women is reducible to the halting problem and thus undecidable.

  21. Repost!! by Anonymous Coward · · Score: 0

    Another repost. This one's from yesterday, people. http://slashdot.org/articles/02/08/10/033231.shtml ?tid=126

  22. ..well. by Archon-X · · Score: 1

    it's better than californication and expecting the world to speak english.

    1. Re:..well. by Anonymous Coward · · Score: 0

      English-speaking people do not force English upon non-English-speaking people.

      Computers force English upon non-English-speaking people.

      Unless computers are intelligent enough to "expect the world to do something", then I'd have to say that your statement was completely out of line.
      Californication????? You're either kidding, or have listened to too much RHCP.
      -kw

    2. Re:..well. by Anonymous Coward · · Score: 0
      ... expecting the world to speak english

      Yes, but the world WILL speak English, albeit with an American accent.

  23. Re:Ugh, bad English! by ahaning · · Score: 1

    Surprisingly, the editor is quite correct.

    it's Pronunciation Key (ts)

    1. Contraction of it is.
    2. Contraction of it has. See Usage Note at its.

    Source: The American Heritage® Dictionary of the English Language, Fourth Edition
    Copyright © 2000 by Houghton Mifflin Company.
    Published by Houghton Mifflin Company. All rights reserved.

    --
    Withdrawal before climax is very ineffective and those who try this are usually called "parents."
  24. 0nly th3 l33t... by I+Love+this+Company! · · Score: 2, Funny
    m4y us3 th1s tr4anS1at0R...

    For example,

    I am a law-abiding citizen, speaking perfect English!

    becomes:

    eye 4m th3 Gr34t z3r0-k3wL, 3y3 w1ll 0wnxZ0r j00 w1nd0z3 b0x!$##@
    --

    "All art is quite useless." -- Oscar Wilde
    1. Re:0nly th3 l33t... by Anonymous Coward · · Score: 0

      This is a bit off topic... but who is Zero Cool? I've read a reference or two to him, but can't find anything about him on the web or in the Jargon files. Anyone care to enlighten me?

    2. Re:0nly th3 l33t... by qqtortqq · · Score: 1

      Go to blockbuster, rent "Hackers." I don't care what anyone around here says, its a damn good movie, well worth watching.

    3. Re:0nly th3 l33t... by BrookHarty · · Score: 2

      Google is ahead of the game with the Hax0r Google version of its search engine.

  25. Err. . . by Com2Kid · · Score: 2

    From what I have seen (spoken to?) speech recog pretty much sucks now days, unless you are one of the lucky ones to have one of those 'special voices' that computer speech recog likes. . . .

    As I have stated before in these types of articles, until speech recog can get over 95% or so recog on untrained voices, (or heck, I would like it if it could get 90% recog on my voice /trained/. . . .) these sorts of applications of technology are going to be very limited in scope.

  26. I hear it can also do Hungarian by Copperhead · · Score: 5, Funny
    Here are some sound clips...

    "I am looking for the tobacconist."

    "I need some matches."

    "How much do I own you?"

    The entire dictionary can be found here.

    --
    Your reality is lies and balderdash and I'm delighted to say that I have no grasp of it whatsoever. - Baron Munchausen
    1. Re:I hear it can also do Hungarian by Anonymous Coward · · Score: 0

      "How much do I own you?"

      You own me like the bitch I am. Now give me back my root password!

  27. You can ... by Anonymous Coward · · Score: 0

    ... consider my ass presented for biting.

    that was seriously fucking lame man.

  28. Why bother? by Malcs · · Score: 1

    No matter how you slice it, you'll never be able to make a machine do what a translator does. Why is it that these things are always made by people who aren't multilingual?

    --
    My name is Carlos Montoya. You share files of my music. Prepare to die.
    1. Re:Why bother? by mariube · · Score: 2, Insightful
      No matter how you slice it, you'll never be able to make a machine do what a translator does. Why is it that these things are always made by people who aren't multilingual?
      Really? Keep in mind that the human brain is a terrible computer - not designed for linguistics at all. Languages have very precise definitions, and it is possible to make programs that translate any language into logic, see aristotle for an example. Of course, the tricky part is to make such a program aware of all local variations. In Norwegian, the direct translation of "foot" can mean anything from "foot" to "below the hips".
    2. Re:Why bother? by Bazzargh · · Score: 2

      Nobody cares if it doesnt turn out to be as good as a human translator, because not everyone can afford to retain a translator on staff. Or a decent butler for that matter.

      Secondly, it matters not a jot if the creators are multilingual, since the problem is not that you don't know 'many' languages, but that you and one other person don't know a language in common. doh!

    3. Re:Why bother? by kalidasa · · Score: 2, Insightful

      Languages have very precise definitions, and it is possible to make programs that translate any language into logic, see aristotle for an example.

      No, languages do not have very precise definitions. Take this from a published translator: they do not. The definitions in the dictionary are at best approximations to a particular range of any given word's semantic field; precision with human languages is impossible. Read up on some linguistics before you start posting things about linguistics.

    4. Re:Why bother? by kalidasa · · Score: 1

      Secondly, it matters not a jot if the creators are multilingual, since the problem is not that you don't know 'many' languages, but that you and one other person don't know a language in common. doh!

      Hey, I know PERL, so of course I can write a program that converts PERL to LISP, even though I don't know LISP.

      But it's worse than that. Because PERL and C are logical symbolic languages, and natural languages are by nature ambiguous and contingent, and noone learns that until they are at least bilingual.

    5. Re:Why bother? by 0x0d0a · · Score: 2

      The real computer science work was in developing a metalanguage represenation and a method of mapping the language to that metalanguage and back. Filling in the actual mappings is something that you can hand off to translators or native speakers.

  29. Salvation by j2gEEk · · Score: 1

    Funny, I've spoken in tongues, and didn't need any computer assistance whatsoever...

    Creation threads, eternal life threads (ala cryo), now tongues...

    Seriously, Michael: do you want to know that your going to heaven when you die?

    Jake

  30. Politician-speak by guttentag · · Score: 1, Flamebait
    perhaps one day they will be able to develop one that will allow politicians to talk to normal folks and be understood
    You don't need a new invention to translate politician speak. Simply pipe the text of the politician-speak to the following shell script:

    #! /bin/sh
    echo "All I'm saying is that I keep my options open."

    A "good politician" (good as in "successful," not "responsible") is intentionally vague whenever possible because that allows him to keep his options open. A vague statement that is commonly interpreted one way can later be interpreted a different way. The more details the politician provides, the greater the number of people who will disagree with him.

  31. This is old by Patik · · Score: 1

    A smaller-than-PDA version of this was used by that goofy roommate on the TV show Undeclared to speak to his Japanese girlfriend a number of months ago.

  32. Do we really want the digital Babel fish? by Pac · · Score: 2

    Do you remember what the Guide says? I quote: "Meanwhile, the poor Babel fish, by effectively removing all barriers to communication between different races and cultures, has caused more and bloodier wars than anything else in the history of creation".

    Do we really want Pierre Parisian to be able communicate his exact feelings to Lin Chinese?

  33. Fictional character from the movie 'Hackers' by hackwrench · · Score: 1
    1. Re:Fictional character from the movie 'Hackers' by martyn+s · · Score: 1

      Hackers (1995)

      Synopsis:
      Zero Cool--real name Dade Murphy--is a legend among his peers. In 1988, he single-handedly crashed 1,507 computers on Wall Street, creating worldwide financial chaos. Eleven years old, Dade Murphy had a record with the F.B.I.--forbidden to finger the keys of so much as a touchtone phone until his 18th birthday. It's been seven years without a byte... and he's hungry. Kate Libby, handle Acid Burn, has a souped-up laptop that can do 0 to 60 on the infobahn in a nanosecond. When Zero Cool collides with Acid Burn, the battle of the sexes goes into hard drive. But all bets are off when they must pool their resources to battle The Plague, a master hacker employed by a corporate giant and using his considerable talents to worm his way into millions. Worse yet, he has hidden his own scheme by framing Dade, Kate and their friends in a diabolical industrial conspiracy. The young band of renegade hackers sets out to recruit the best of the cybernet underground to clear their names.

  34. this is neat by 6OOOOO · · Score: 1

    This is neat and all, but why did they choose croatian? Why not start with something more useful?

    1. Re:this is neat by davidmccabe · · Score: 1

      I suppose that there aren't millions of people in Croatia who can't speak English but would be able to get a much better job if only they could, and there isn't anyone to teach them English, and their language happens to be very difficult to learn for many people which shortens the supply of teachers. And the whole nation isn't bombed out from the war. And by the way, it's usually called "hervastky".

    2. Re:this is neat by 6OOOOO · · Score: 1

      As the stereotype goes, you are a typically obnoxious apple user.

      Of course there are uses for this. Millions of people in a country do not overshadow the billions of people in the rest of the world.

      What about Chinese to English? Surely that would be of more value?

      Either way, you can make your point without betraying your lack of social skills, and I suggest you try it in the future.

    3. Re:this is neat by Anonymous Coward · · Score: 0
      why did they choose croatian? Why not start with something more useful?

      Is is useful, with this gizmo, US soldiers can tell "it was wonderful" to the 11-years-old girl they just gang raped.

    4. Re:this is neat by xtremex · · Score: 1

      If one speaks Russian, one can converse in practically any Slavic language. Actually, more people speak Fulani than Croatian. (10 points to the first person who can tell me where they speak Fulani!)

      --
      If you're not a Liberal in your 20's, then you have no heart.If you're still a Liberal in your 30's you have no brain.
  35. Insert obligatory... by gilroy · · Score: 2
    ... Hiro Protagonist reference here.


    And this time, I think I spelled his name right, dammit. :)

  36. don't stop there by Anonymous Coward · · Score: 0


    "... perhaps one day they will be able to develop one that will allow politicians to talk to normal folks and be understood."

    we already can. it's just no one can believe what they're saying could be that stupid, so they insert a delusion. to help out there, Carnegie Mellon will have to develop a systems which enables belief. Once again, Douglas Adams leads the way.

  37. I've been wondering when someone would have enough by hackwrench · · Score: 1

    sense to realize something like that (besides me of course)... but then I believe that if speaking in tongues is of God it must be able to be translated like other languages.

  38. Sample output from project ... by Mind+Socket · · Score: 0

    Tongue? TONGUE! Tongue tongue Beowulf cluster tongue.

    Tongue, tongue belong to us. Tongue.

    Tongue tongue #$^@ DMCA tongue RIAA tongue double plus bad tongue.

    It's the strangest word after typing it a few times.

  39. Politician translators by Vrallis · · Score: 2

    Two exist already...grand juries and impeachment hearings. Obviously, they don't work very well yet. Too much politician involvement in the translation matrix.

  40. The technology must be applied right, that's all by zorander · · Score: 2, Insightful

    So long as this stuff stays on the recieving end, this is all a step in the right direction. You don't want to deprive the people you're sending information to of any information. Let them decide whether to use a human or a computer. Sending a computer based translation that you can't understand only increases the chance of offending someone/misrepresenting something.

    Giving it to soldiers in the field so they can "speak" the foreign language is bad. Instead give one-way devices to both sides and let them use those to translate what's told to them. That way if they need a human translator to clarify that's still an option.

    It would be terrible if information started flowing between countries that had been passed through a computer translator first. Please, let me use babelfish to translate that spanish document, don't use it for me (heck, I have friends from south america who can help me clarify it if I need to but that's *no good* without the original spanish)...

    Translation through tounges is a lossy process. Not translating it at least prevents compromising the information. It's all still there...just a wee bit harder to get at.

    Brian

  41. Politician translator by elixx · · Score: 0, Flamebait

    perhaps one day they will be able to develop one that will allow politicians to talk to normal folks and be understood

    I made one of these, it loops and plays a wav file explaining that it doesnt help because it made claims solely for the reason of getting elected...

    --
    No, Beowulf clusters can't imagine in Soviet Russia.
  42. Re:Ugh, bad English! by Paradise+Pete · · Score: 1
    "It's" != "It has"

    Of course it does. It's a simple contraction representing the way some people say it. Just because it can also mean "it is" means nothing. Many words are spelled the same as others.

  43. Better yet... by Dr.Evil · · Score: 2
    [P]erhaps one day they will be able to develop one that will allow politicians to talk to normal folks and be understood.

    Better yet, how about one that let normal folks talk to politicians and be understood?

    --
    Right...
  44. The next step... by Michael+Woodhams · · Score: 2, Funny

    Dialect output. Soon, you won't have to listen to some Croatian nun discussing free will translated into Bostonian English, you'll be able to listen to a Croatian nun discussing free will translated into Jive.

    This reminds me of a story told to me long ago by a friend of the family. She was of Dutch descent, and the story is about a well bred Englishman who went on a working holiday to Holland. He got work on the docks, and that is where he learned to speak Dutch. The result was that in a refined English accent he spoke obscenity-laden gutter Dutch, apparently unaware that he was doing so.

    --
    Quattuor res in hoc mundo sanctae sunt: libri, liberi, libertas et liberalitas.
  45. What's even more amazing... by Deal-a-Neil · · Score: 1

    .. is that the article on PCmag.com is dated September 3, 2002. Slashdot is going all Minority Report -- we know about news BEFORE it happens.

  46. Dictionary Translations by Bios_Hakr · · Score: 2

    The prob I have with a dictionary translation is the whole is often greater than the sum of its parts. For instance, how do you say "How are you"? Que Tal? What's up? How's it hanging? Come stai? A dictionary can, literally, translate any of these into any language imagineable, but would the listener understand?

    --
    I'd rather you do it wrong, than for me to have to do it at all.
    1. Re:Dictionary Translations by erpbridge · · Score: 2

      If you're gonna try to post a translation of something, at least get the spelling right, or else it screws the WHOLE meaning up.

      Come stai? = Do you eat stai?
      Como estas? = How are you doing?

      I'm not even going to think what stai is, or why someone would eat it.

      Sorry to be an interlingual grammar Nazi.

    2. Re:Dictionary Translations by Max+von+H. · · Score: 2

      You, Sir, are an idiot.

      "Come stai" is Italian and "Como estas" is Spanish.

      They both mean "How are you?".

      Or did I miss a sarcasm?

      -max

      --
      -- It's always darker before it goes pitch black.
    3. Re:Dictionary Translations by Bios_Hakr · · Score: 2

      "Come stai" is "How are you" in Italian. It may be Com`e or something like that. I spent 20 years in the USA and I can't spell english words for crap...I spent 6 years in Italy and I am proportionally worse in that language.

      --
      I'd rather you do it wrong, than for me to have to do it at all.
    4. Re:Dictionary Translations by Anonymous Coward · · Score: 0

      If you're going to be an interlingual grammar nazi, at least learn more languages. "Come stai?" is italian for "How are you doing?".

    5. Re:Dictionary Translations by Anonymous Coward · · Score: 0

      Just plain "come". The apostrophe in Italian is used to signal the omission of the last vowel in a word that is followed by another word that starts with a vowel. For example, "Come è il tuo caffè?" ("How is your coffee?") should be written (and pronounced) "Com'è il tuo caffè?". The answer, from an Italian sipping American Coffee might be: "Fa schifo!" ("It sucks!").

    6. Re:Dictionary Translations by seann · · Score: 1

      don't feel too bad, I'm on my 19th year in this world, and I still can't spell worth shit. :>

      --
      I'm a big retard who forgot to log out of Slashdot on Mike's computer! LOOK AT ME.
    7. Re:Dictionary Translations by Anonymous Coward · · Score: 0

      I hope you learned an important lesson today.

      If you're going to be an obnoxious ass, make sure you at least know what the hell you are talking about.

    8. Re:Dictionary Translations by erpbridge · · Score: 1

      I stand corrected. It looks very similar to the Spanish spelling.

      Just goes to show what problems lie in determining one language from another, especially when there is no spell check available or context to determine what language is being spoken.

  47. Married with Children by Raul654 · · Score: 2

    Your comment reminds me of one of the all time funniest moments from that classic of american TV, Married with Children. The scene: Al Bundy is at the DMV. He asks (in english) to take the written exam. The clerk asks him what language.
    Al: I speak the language that everyone in this country speaks

    Clerk: Ah, spanish it is

    Al: No. This is america. I speak american.

    Clerk: American, eh? (Looks in the file cabinet)
    Aha, here it is. American. Wow, I hope you know a lot about trucking.

    --


    To make laws that man cannot, and will not obey, serves to bring all law into contempt.
    --E.C. Stanton
  48. Actually.... by dalangalma · · Score: 1

    It was meant to be used by field chaplains serving in Croatia, so they had it translate Croatian. They figured it would be a good test, and the chaplains weren't doing anything where a mistranslation would kill someone.

    1. Re:Actually.... by NickB2 · · Score: 1

      Especially since Croat is the language spoken in Bosnia and Serbia (it's ussually called "Serbo-Croat").

      That means this'll let our troops be better understood in Kosovo, and the Bosnian mission. Most other places the Army has longstanding relationships so they don't need a computer translator.

      Basically the Army doesn't need a Chinese translator and they're paying the bills...

    2. Re:Actually.... by kiberovca · · Score: 1

      That is so uninformed and not true. Croatian language is spoken _only_ here in Croatia. In Bosnia, there is their own variant, and in Serbia too. There was a "bastard" language called Serbo-Croat which was in forced use while there was a socialist Yugoslavia. There was always a difference in Croatian and Serbian language, as in for example a difference between English that was spoken in Shakespear's time and today's official Brittish English. Yes, we can (mostly) understand each other, but it still isn't the same. Please, just because my country is much smaller than yours, do not confuse my language with someone else's.

      --
      Eric: "What're quantum mechanics?"
      Rincewind: "I don't know. People who repair quantums, I suppose."
    3. Re:Actually.... by NickB2 · · Score: 1

      Language is always debateble. The basic point stands even if they are two seperate languages: If two different labnguages are mutually intelligible a device used for translating one works on the other, and is therefore more useful to the US Army then a device for translating French.

      Here's the other side:
      http://www.sit.wisc.edu/~spkraus1/catalyst/ languag e/serbocroat/
      http://www.ethnologue.com/show_lang uage.asp?code=S RC

      They have some excellant points, most importantly that people who speak one language frequently can't understand each other; so if Serbs and Croatians can it doesn't make much sense to call them seperate languages.
      For example, English speakers can't understand each other much of the time -- I just watched Gosford Park with someone who's wasn't into shit like Monty Python, the only non-American accent he understood was the American with a fake Scottish accent -- so it makes little sense to claim that Serbian and Croatian are completely different languages.

  49. Discovery channel. by jahead · · Score: 1

    this was on the discovery channel about a year ago, already being used by the US military in the UN peace keeping mission.

  50. Alan Black (the researcher) by Bakajin · · Score: 1

    Wow... just look at this guy. Strongly indicates someone trying to be outside the box.

    Expanded picture from the article.

    Another pictures is on his homepage that even has some information on "running Unix on IBM PC110 palmtop computer" and "a Casio E-105 Palm-sized PC"

  51. Would that be,,, by Scrab · · Score: 1

    Mellon like the Sindarin (elvish) for friend, as seen in Lord of the Rings?

    Cos if so, that's either very cool or very geeky.

    --
    RoseColor red={0, 0xffff, 0x0000, 0x0000};VioletColour blue={0, 0x0000, 0x0000, 0xffff};find / -name *mybase*|chown you
    1. Re:Would that be,,, by Anonymous Coward · · Score: 0

      --Mellon like the Sindarin (elvish) for friend, as seen in Lord of the Rings?--

      No, that would be Andrew Mellon, the aluminum magnate.

      (MCS--YES!)

  52. Useless for politicians by Anonymous Coward · · Score: 0

    - perhaps one day they will be able to develop one that will allow politicians to talk to normal folks and be understood.

    A translator would be useless; there's no way to understand who doesn't want to be understood. A vulcan mind meld could do the job much better.

  53. Even worse yet by Bastian · · Score: 3, Funny

    How are we /ever/ going to get it into a package that is small enough it fit in your ear and watertight enough to let swim around in a bowl of water when you're not using it?

  54. Serious Sam by roushi · · Score: 1

    Is this how they came up with all of the bad english puns in Serious Sam?

  55. It's called the "interlingua" approach... by PontifexPrimus · · Score: 2, Insightful

    I'm majoring in computer linguistics, and currently we're examining different computer translation models; the one you're suggesting is called the interlingua approach.
    The idea is, basically, that you need an "in-betweener" language that can carry all the meaning and connotations of both source and target language. Then you only need translations rules for both sets and then let it run.
    The main drawback is that you always have some loss in both translation steps, which sometimes adds up to quite a difference in meaning. The main advantage is that you can modularize - once you have a working English-to-Interlingua module, you can use Interlingua-to-French, Interlingua-to-German, what have you. For further information, google for interlingua "machine translation"...

    --
    -- Language is a virus from outer space.
    1. Re:It's called the "interlingua" approach... by martyn+s · · Score: 1

      That's really cool! I don't have any formal background in that stuff, but I've always said that would be the best way to translate stuff (create an interlingua translation every 100 years, so they can read our stuff 1000 years from now). I didn't realize that they're actually doing this. Thanks for making my day. Where are you studying?

  56. Explanation by superyooser · · Score: 1
    The Jewish festival of Pentecost occurred seven weeks after Jesus' resurrection. On this day, Jesus' apostles were given the gift of speaking in tongues.

    Here's the reference.

  57. A page involving linguistics.... by blackcoot · · Score: 1

    ... and not a single soul karma whoring with references to Noam Chomsky? I'm a little worried... of course, I once had a professor explain that if you're ever at a party with all sorts of academics and you want to look smart, wait for someone to say something that sounds reasonably intelligent and say, "but doesn't that follow from Chomsky?" Apparently nobody will stop to question you ;-)

    1. Re:A page involving linguistics.... by xtremex · · Score: 1

      I love when Linguistic neoPhytes "quote" Chomsky. I have disagreed with many of his theories, especially the poo-poo theory. According to his theories, Amerindian tongues derive from the Mongols, however, there is yet any evidience that they are even REMOTELY related. He never took into account the South to North migration of Amrerindian Langauges...put that in yuor pipe and smoke it! ;)

      --
      If you're not a Liberal in your 20's, then you have no heart.If you're still a Liberal in your 30's you have no brain.
    2. Re:A page involving linguistics.... by blackcoot · · Score: 1

      Like I said, what do I know? The professor in question was teaching a class on compiler theory and construction... somehow Chomsky came up when we were talking about context free grammars (I forget how --- I think it was the jackass in the front of the class trying to score cookie points). Anyways, Chomsky is famous for a lot of things outside linguistics too, not the least of which are his politics...

    3. Re:A page involving linguistics.... by xtremex · · Score: 1

      True. Chomsky is a name that most non-Linguists will know. He is also famous for his Communist ideals, which is why most college professors know his name :)
      If you're actually interested in Linguistic Theory, look up Swadesh. He's the King of Comparitive Linguistics

      --
      If you're not a Liberal in your 20's, then you have no heart.If you're still a Liberal in your 30's you have no brain.
  58. Waste of Technology by Anonymous Coward · · Score: 0

    Like Esperanto, lang2lang translators threaten to be rendered useless by the real world. After all, most people in the world speak some level of English now. In the next 25 years almost everybody in the world will speak fluent English, and the people who can't, well you really wouldn't want to talk to them. What use are 'universal translators' then? But as much use as Esperanto ever was.

  59. fortran IV - to - Cobol... by Anonymous Coward · · Score: 0

    then Cobol -to - Pascal

  60. Re:Ugh, bad English! by Anonymous Coward · · Score: 0
    "It's" != "It has"

    ("It has".contractedForm() == "It's") => true
  61. Is this really useful? by invckb · · Score: 1
    "Half the time, they could actually carry on a productive conversation."

    "Two people can exchange a 10-second sentence in about a minute and a half,"

    I don't think this project is quite ready, yet.

  62. This reminds me of a business idea ... by torpor · · Score: 2

    ... involving extremely cunning linguists creating *brand new* languages, completely from scratch, for corporate clients who need to communicate freely and yet still keep something relatively secure.

    A per-transaction language, in other words, with a complete new lexicon for each speaker. Of course, the individuals would have to learn the language quite quickly, so this would also be another service realm in this plan.

    Sort of like Kings of old, who used to use language differences to obfuscate and control various parts of court, only in this case it would be a commercial service, and available to all.

    Something like this would be a good tool in the modern corporate environment, I think.

    Well, I'm off to register Babylon, Inc...

    Oh, D'oh!

    --
    ; -- the corruption of government starts with its secrets. a truly free people keep no secrets. --
    1. Re:This reminds me of a business idea ... by rogerz · · Score: 1

      ... involving extremely cunning linguists creating *brand new* languages

      Boy, I can't tell you how long I've been waiting for a post containing those two words.

      --
      If humans are mostly water, and beer is mostly water, then humans must be mostly beer.
  63. Things would be easy... by codexus · · Score: 2

    ... if only everyone learned to speak Klingon.

    --
    True warriors use the Klingon Google
  64. Re:The technology must be applied right, that's al by Anonymous Coward · · Score: 0

    Would it be OK if the soldier and the Croatian gave the two-way translator to the other guy before saying something? This way it would always be the person at the receiving end who's using a translator. As you so elaborately explained, this would be a big improvement over the soldier keeping the two-way translator all the time.

  65. Re:Politician-speak[MOD UP] by Anonymous Coward · · Score: 0

    this is not a flame This is a reasonable post

  66. sounds impressive... until I read the article.... by Anonymous Coward · · Score: 0

    "It includes a speech recognizer, which turns spoken words into text (aka dragon naturally speaking); a machine translator, which converts the text from one language to another (aka babelfish); and a speech synthesizer, which turns the text back into audible words (um... it exists, but forget the name...)."

    Sounds to me like all they did was get the programs to communicate with each other ("the speech recognizer, known as Sphinx, and the speech synthesizer, known as Festival"), but other than that it doesn't sound like a major break through to me.

    Actually I'm shocked this doesn't already exist considering we already had software to do each of those for at least the past 5 years, it's only now that someone thought of putting them together?

  67. Politicians? by Anonymous Coward · · Score: 0

    They lie, cheat, take bribes, and does everything to save their hides. I already know how to translate 99% of the politicians, just do "> /dev/null".

    Sorry for being so pessimistic, but we are closing in on an election over here, and it sickens me how stupid people are who can't see the reason behind what the politicans say and can't remember from past elections how little (almost nothing) they keep of their promises...

  68. Not the 'other' tongues by SystematicPsycho · · Score: 0

    *Phew*, for a second there I thought it was the other type of 'speaking in tongues' also referred to as glossolalia which it a totally bogus. If someone comes up to you asking you to attend there church group to learn how to speak in tongues tell them to get lost.

    --
    Analytic & algebraic topology of locally Euclidean meterization of infinitely differentiable Riemmanian manifold
    1. Re:Not the 'other' tongues by j2gEEk · · Score: 1

      Yeah, going to a church to *learn* to speak in tongues would be completely bogus- it's not learned- It is a gift of the Holy Spirit, but it is very much real (1Corinthians14).

      Jake

    2. Re:Not the 'other' tongues by SystematicPsycho · · Score: 0

      how is jibberish a language? Most of the bible is written in codes so what you read is not what it seems, 'speaking in tongues' _is not a gift and is _not real but a state of mind.

      --
      Analytic & algebraic topology of locally Euclidean meterization of infinitely differentiable Riemmanian manifold
  69. The Redneck Soulation by thales · · Score: 2
    If them damn furreniers ain't smart enuff ta speak American, who gives a damn what they is saying?


    Hell if we need ta hear from 'em we'll jus kick thier asses and make 'em learn ta talk American instead of all that gibberish!

    --
    Quemadmodum gladius neminem occidit, occidentis telum est
    1. Re:The Redneck Soulation by Anonymous Coward · · Score: 0

      Gotta love the irony of your latin .sig!

  70. My God, it *is* the Chinese Room. by dmorin · · Score: 2
    Remember this argument in AI circles? A computer that has a big stack of rules that says "When I get sentence X in English, respond with sentence Y in Chinese"? Done. Next.

    But can it beat Kramnik in chess? Ah, now *there* is the question!

  71. How about... by fizban · · Score: 1

    ...allowing normal folk to talk to politicians and be understood...

    --

    +1 Insightful, -1 Troll. What can I say, I'm an Insightful Troll.

  72. hahaha by Anonymous Coward · · Score: 0

    Yet another funny moderator strikes! +1 Informative :-)

  73. Normal People by Anonymous Coward · · Score: 0

    "... perhaps one day they will be able to develop one that will allow politicians to talk to normal folks and be understood."

    If it would allow 'normal' people to understand what the slashdot crowd is saying, THAT would be an accomplishment

  74. they didn't really do anything by dollargonzo · · Score: 1

    sphinx and festival? dear lord, these have been around for a while, i use both daily. festival used with the mbrola phoneme databases are actually pretty decent, even in multiple languages. sphinx is speech recognition, which is obviously more complicated, so naturally it doesn't work quite as well. it is actually tailored very well for this sort of thing, because it is built around recognizing phrases, which is exactly what they need since they have a huge lookup table. so, if anything in the table is recognized, it makes things that much easier. i don't really see why carnegie mellong deserves so much credit for this? i could have done the EXACT same thing. in fact, i even have MORE. i have a natural language processor, known by some as CHAT BOTS, although i beg to differ, because mine is not meant to fool anyone, instead i am actually trying to teach it to say something intelligible (not from a database of phrases)..with mixed results.

    a lookup table for a translator + sphinx + festival is not exactly worthy of an overblown article. sorry, that's just it. no aim to flaim.

    QED

    --
    BSD is for people who love UNIX. Linux is for those who hate Microsoft.
    1. Re:they didn't really do anything by Anonymous Coward · · Score: 0

      Okay, sure, anyone could setup sphyinx + festival, the translator is what is the important part. but even more important was the input for the "lookup table" - the knowledge base/database whatever you want to call it.

      What it important is that they did it, and it works.

      Heck, I worked for them two summers ago, when they were building the data base. I called the croatians, got them to come in, worked with them, helped them use emacs (yes, mule made it possible) They translated bunches of stuff, and Ralph Brown did the work on the translator. Alan worked with them to do recording, and Bob handlesd all the admistrivia. It was fun to work there, but still, it was an accomplishment, because it actually worked, heck I've used an earlier version, and it worked. The biggest limitation was that it worked better for some people than for others, and I wasn't one of the people it worked well for.

      As for things they didn't tell you in any article, it works best for croatian spoken by a female native - why? because that is all the had to do speaking to build the croatian speech recognition model. stupid? yes, but we couldn't get croatian men to come in. oh well. oh, and it also speaks croatian with a female voice rather than male, because once again, went building the festival model, we only had croatian females...

      The really bad thing about this project, at least when I was there, was that we were limited on the number of croatians, we had 3 main females, and one guy who did all the work. So it was heavily biased for translating male spoken english, to female spoken croatian. remember kiddies, males and femals speak differently.

      anyway, enough ranting and raving.

    2. Re:they didn't really do anything by dollargonzo · · Score: 1

      i realize that, but the article seemed to highlight the fact that it is able to use speech recognition + voice synthesis. most importantly, i don't see why u need a "revolutionary computer science" department to do this. is there some innovation here?

      QED

      --
      BSD is for people who love UNIX. Linux is for those who hate Microsoft.
    3. Re:they didn't really do anything by 0x0d0a · · Score: 2

      Well, probably because CMU has done a lot of speech recognition stuff that was used in this. The translation table stuff is just dumped on top -- happens to be the latest work. You talk about how festival is crucial -- Alan Black, of festival fame works at CMU. You mentioned sphinx, I believe.

      Don't knock CMU -- they're an international leader in this area.

  75. Damn... by Anonymous Coward · · Score: 0

    I still picture _Snow Crash_ whenever the "speaking in tongues" phrase comes about... someone heal me!

  76. Oblig Babelfish ref. by Insightfill · · Score: 1
    "by effectively removing all barriers of communication between different races and cultures, has caused more and bloodier wars than anything else in the history of creation."

    -Douglas Adams

    Some would say that people enter into more conflict than ever when the barriers sep. them are removed (distance, language, etc.) Is a "universal translator" a good thing?

  77. Snow Crash? by Anonymous Coward · · Score: 0

    I wonder if this is the nefarious work of Ashera....

  78. Heh. by dasmegabyte · · Score: 2

    I love it when somebody invents something that isn't new.

    The first thing you learn about in psycholinguistics is the concept of the pidgin -- a common language which develops between two or more peoples who must interact but share no lingua franca. These simple languages, which sound like baby talk bastardizations of both languages, eventually turn into what's called a creole, such as that sexy patois spoken by fortune tellers on cable.

    All these chaps have done is built their own version, and as the case of esperanto shows, manufactured language is very difficult to gain acceptance and adoption of. They'd have been better off locking a Croat and a Brit in a large office building with big gulps and no marked bathrooms. These guys would develop a pidgin pretty quick.

    --
    Hey freaks: now you're ju
  79. Similar approach used in speech synthesis by Latent+Heat · · Score: 1
    There is a similar issue in text-to-speech systems with regard to using larger units. Smaller units (phonemes) require much less memory storage but result in much more mechanical speech. Intermediate units would be diphones (like ha, pa, sa, tee, kee, ree), and larger units would be entire words or phrases -- each step is more natural sounding at the expense of much more memory, but memory is cheap so what the hey?

    The classic example in language translation without language understanding is Jesus admonishment to his sleeping disciples (Jesus can't sleep because he knows he is about to be executed): "The spirit is willing but the flesh is weak." Jesus is essentially calling his disciples wusses for nodding off. A round trip to Russian and back supposedly put it as "The meat is rotten but the liquor is holding up." A phrasebook as comprehensive as Bartlet's would recognize this as a highly-quoted Bible passage and use the correct quote from the Bible in Croatian.

    The correct translation of "The spirit is willing but the flesh is weak" requires a great deal of context -- one needs to know that people in the Bible talk in symbols and metaphors all the time. The only way I even know what it means is that I remember that it is from the Bible, so for the computer to use a brute-force phrasebook without trying to understand what something means is not far off the mark -- I doubt most of us in the pews understand half of what is in the Bible anyway.

  80. Why is this needed? by The+Variable+Man · · Score: 1

    If people don't understand me, rather than learning their native language, I simply shout loudly and aggressively in my own.

  81. Politicians? by macdaddy · · Score: 2

    I'd rather have one for lawyers. I don't know anyone that can speak legalease.

  82. That's an easy one by Anonymous Coward · · Score: 0

    perhaps one day they will be able to develop one that will allow politicians to talk to normal folks and be understood

    a simple lex scanner will do...
    "$" {
    printf("I'll draft the legislation myself!");
    };
    "can you" {
    printf("I'll see what I can do.");
    };
    ".*" {
    printf("Certainly. Vote for me.");
    }

  83. Uhh... Why Croatian? by I_R_Che · · Score: 1

    The article fails to mention why did they choose to build a Croatian-to-English translator first. Croatian is a complicated language... Imho a lot more complicated than English. Trust me, I live in Zagreb. When I see a foreigner on TV that knows Croatian, I think to myself, 'Wow...'. And they never know it perfectly. I don't mean the pronounciation, but the grammar.

    Not that I'm complaining (hey, it's free publicity for Croatia, visit for your holidays :p), but wouldn't a language like German be more suitable for the first prototypes? I know a bit of German too, it seems closer to English than Croatian to me.

    1. Re:Uhh... Why Croatian? by twinpot · · Score: 1

      Few foreigners (hell, few natives) know English properly either. English is good, in that it is an easy language in which to get started, and to make yourself understood. An extreme amount of bastardisation can take place, and the listener can still make a good fist of understanding it. But knowing it properly is much harder.

      (BTW, I do speak other languages)

  84. Legalese by Hoi+Polloi · · Score: 2

    I will know this tech is mature when they are able to translate a legal document into English.

    --
    It is by the juice of the coffee bean that thoughts acquire speed, the teeth acquire stains. The stains become a warning
  85. Re:Ugh, bad English! by Anonymous Coward · · Score: 0

    Nope sorry. It is wrong grammatically.

  86. There are hundreds of sign languages by markgriffith · · Score: 1
    There are hundreds of different sign languages round the world, but I guess the technical challenges may well be be similar.

    But are we so sure the challenge underlying the thousands of spoken languages are totally sorted then? English and Croatian are both in the same family [Indo-European] that Chomsky mistakenly generalises all languages from.

    Croat and English structures are pretty similar.

    Whereas English in and out of Sioux or English in and out of a West African Bantu language like, for example, Yoruba would be a bit more of a serious test.

    I'd like to see the developers tackle English-Yoruba translation, and then come back a little more modest!

    1. Re:There are hundreds of sign languages by Anonymous Coward · · Score: 0

      No, no! Croatian and English are very different.

      "Croatian belongs to the family of synthetic inflexional languages which abound with word forms complicatedly formed of various morphemes with substantial alterations of phonemes (letters) on junctions of the morphemes.

      Although the information theory proves all languages of almost equal information content, the way how this content is used, and what structure enables compression to the same content varies substantially from analytical languages like English where it suffices to store words as they are to heavily inflected languages where one has to find out what and how to store in order to map to the abundance of word forms."

      This leads to the fact that Croatian dictionary, if it's implemented in the spelling checker the same way is English implemented, can have over 4 milion words, instead of 70 thousands!

      P.S. I am Canadian of Croatian origin.

    2. Re:There are hundreds of sign languages by blackcoot · · Score: 1

      From what little Zulu that I remember (also a Bantu language), Zulu itself wasn't very hard to learn --- the grammar and vocabulary are fairly easy to pick up; however, it's the idioms that are the real bitch.

  87. Totally agree Re:Not quite real translation... by markgriffith · · Score: 1
    I totally agree. I'm a translator myself [Hungarian into English] and I'm sure look-up tables can get very smart [they certainly help us translators to work faster and better], but as for understanding texts, you've really hit on the weak point in existing systems there.

    Not to be unkind, but I think the optimism of some people in computing about automating things like translation is due to them never having learned a second language and never having translated anything.

    There is a massive problem with missing context between languages, for example, that is quite hard to explain to anyone monolingual.

  88. Serbocroat - English by imipak · · Score: 2

    I'm British, and speak only poor schoolboy French. However since hooking up with my half-Russian, half-Serbian girlfriend, I've found that by learning a dozen or so basic words and phrases by rote, then trying to use them conversationally, I've been able to pick up a surprising amount. Serbo-croat was always supposed to be a nightmare to learn, but it's waaay easier than English... for instance, a pnoneme(?) a group of three or four letters will always be pronounced the same way (cf eg "ain" in English.) I'm rather hoping the Babel Fish is never released; by learning the language you start to subconsciously pick up something of the target language's cognitive assumptions, and (in a small way) to "think like" a native speaker. Now /Russian/, there's a tricky language... but we both play chess which is a good middle-man ;)

  89. The problem everyone misses Re:Long way to go by markgriffith · · Score: 1
    Right, agreed - I certainly look forward to steady progress in machine translation too, but I agree learning other languages is still a good idea.

    The problem I think most posters have missed so far in this discussion [though I have overlooked lots, I'm sure] is that a lot of computer developers assume translation is about different ways of saying the same thing -- and that basically we are all talking about the same core topics. [Hence the discussion about resolving the amibiguities around 'river bank' versus 'money bank']

    The real problem is that in fact translation is about different ways of saying different things. This problem is fairly trivial between any two European languages because they have such similar structures [eg languages as close as Croatian and English], and most of us in Europe and North America only get taught one or two of those languages, so most of us have no idea just how different different languages can be. But bridging between Japanese and English, achurch will have a much better idea than most of us of the problem of translating something there is not yet a way of saying in the other language.

    Almost every language has a feature no-one else bothers with, and the key question is what do you do when you try to translate a type of statement into a language where it doesn't exist?

    Example: non-Bantu-language-speakers' bewilderment at information like whether you are speaking to women, to men, among family, about something you saw with your own eyes, something you saw alone and more being encoded into the choice of verb. Bantu-language into English, OK - you can explain those features in the English translation. But the other direction? How? Going from an English text, how do you choose the right verb form in the Bantu language based on information the English words simply don't provide? Of course you choose a neutral or weak verb form to cover yourself, but then you [or the machine system] isn't really translating, and you really see the difference. A group of West African speakers of one of these languages [for example Yoruba] will rightly regard a machine translation as lame, because it leaves so much stuff out because that stuff simply isn't in the English text as presented. They will choose a Bantu-speaking human to explain what really happened, properly, putting back in all that missing context that a person can know but, until we have seriously intelligent machines, a machine does not know about a live, real-time situation [or even a situation presented in a written text].

    That's the real problem. Not finding ways to match up different ways of saying something, but deciding what to do when information one language expects to be in any text or conversation simply isn't in the other language. Sophisticated look-up tables can work well between two languages in the same family [like English and Croatian] because you're looking up the same kind of thing. But outside the same family, and you have a real obstacle - asymmetry. One language wants you to look up something that's simply not there to look up in the other language.

    1. Re:The problem everyone misses Re:Long way to go by kiberovca · · Score: 1

      Since I'm a Croatian native, I'd have to say that Croatian and English language aren't that similar, not in syntax, and especially not in grammar. That example of Bantu language is similar. In Croatian, there are different verb forms depending on the gender of subject or object, also, English uses mostly indirect forms, while Croatian uses mostly direct forms. Also, some linguists compare complexity of our grammar and syntax to Latin. True, it's not so different when you compare English with Japanese, but English and Croatian are distinctivly different.

      --
      Eric: "What're quantum mechanics?"
      Rincewind: "I don't know. People who repair quantums, I suppose."
  90. Croatian and English very different by markgriffith · · Score: 1

    OK, fair enough. Then if Croatian and English are very different, then Yoruba and English [or Sioux and English] are very very very different.

  91. Right Re:Universal Translator? by markgriffith · · Score: 1
    Right -- and that assumes the two languages even recognise the same kinds of things to talk about.

    The point about sentence order is good, but it is even worse. In plenty of human language pairs, one language as given [in text or speech] simply lacks information the other language regards as crucial. There are basic asymmetries.

  92. Re:Idiomatics not so serious an obstacle by markgriffith · · Score: 1
    Yes, lots of idioms and phrases need to be stored and looked up, but until developers show impressive results with two languages in different families [so not both Indo-European languages like Croatian and English] they haven't even begun to scratch the surface of the real problem.

    Different language families are context-dependent in different ways, and creating an intelligent lexicon that can work between two languages that are context-dependent in the same way is far from proof of feasibility.

    Idioms are pretty minor compared to structures that are regarded as important in one language but are totally missing from the other language.

  93. Re:Ugh, bad English! by Paradise+Pete · · Score: 1
    Nope sorry. It is wrong grammatically.

    You're loony. "It has" is perfectly fine. Your quarrel is with the spelling.

  94. Brute problems with Re:Brute Force by markgriffith · · Score: 1
    Nice idea, assuming that both languages contain roughly the same kinds of thing to look up [Spanish and English might not be too hard since they're so closely related compared to any given pair of human languages.].

    But with two languages not in the same language family there's a pretty insurmountable asymmetry to do with context.

    Putting an English text into one of the Bantu languages, for example, will just lead to enormous amounts of information important to the Bantu-language speaker being missed out because it simply isn't in the English. Stuff like the social context of the event, how many people saw it, whether they were men or women, young or old -- if that kind of linguistic data is not mentioned in the English text it simply can't be decided on in the Bantu side of the lookup table, leaving nothing like a real translation.

  95. DARPA has BABEL by Anonymous Coward · · Score: 0

    eom

  96. Croatian Re:The problem everyone misses by markgriffith · · Score: 1
    I'm afraid that English and any West African language, equally English and any native American language, are at least as different from each as English and Japanese.

    Gender and case agreement in languages like Croatian [most Indo-European languages have gender and case agreement - Latin is an Indo-European language] cause problems for English speakers, but however large these differences seem, the aspect system of, for example, the Bantu languages, is much more different. With West African languages it is not simply a case of modifying a noun or adjective to make it agree, as is the case with most Indo-European languages, but choosing from a battery of distinct verbs to express context. So there can be eight totally different [ie. no letters or sounds in common] verbs to express 'steal' or 'give' or 'kiss'. It's not a question of inflecting a wordstem, it's another thing altogether.

    The language families are described that way by linguists because they have major structural differences between families bigger than any differences inside the family. So for example Croatian, English, Norwegian, and Persian/Farsi are classed as being closer to each other than any one of them is to Turkish or Hungarian, for example.

  97. oh, the possibilities... by benson+hedges · · Score: 1

    lawyers talking with humans.
    reps talking with democrats.
    women talking with nerds!
    ... well, then again, maybe that's too sci-fi... ;)

    --
    Karma : Soylent Green (Mostly due to eating junk food and mocking religion)
  98. not Serbocroat - English by shallot · · Score: 1

    I don't disagree with your idea that actually learning foreign languages has good uses, but you can't use "Serbo-Croat" here, because for the purposes of a translator like this, I'm fairly sure Serbian and Croatian are indeed different enough not to be lumped together.

    If you write a phrase both in Croatian and in Serbian (with the latter using the Latin alphabet instead of Cyrillic) on a piece of paper, they might be very similar or in some cases even identical. But if you get two native speakers to pronounce these phrases, no matter how similar they are on paper, the machine will get two fairly different recordings to deal with.

  99. ERRR by 0x0d0a · · Score: 2

    Actually, (believe it or not, I didn't read the story), CMU's had a system that sounds exactly like this (speech->computer metarepresentation->speech) that gets 95% accuracy.

    Plus, research speech recognition is well ahead of most consumer-available speech recognition...but also requires custom hardware or more resources.

    1. Re:ERRR by Com2Kid · · Score: 1

      but also requires custom hardware or more resources.

      Err, after the (insert number here) Gigahertz systems with up to 3GB of RAM, what the hell are they running that cannot be decently emulated on a high end consumer PC?

  100. My open-source translator on SourceForge by dos+equis · · Score: 2

    I guess I ought to mention that I have a project
    on SourceForge called Linguaphile. It handles
    about 50 languages currently but only about 4 of
    them are remotely useful. The Spanish and
    Swedish are probably worth playing with. It's
    early days and needs lots of work but it does
    actually do something now. I'm really interested
    in finding people who would like to work on it.
    You can try it online or download it if you have
    Perl. Apologies in advance that there are no
    docs at all since I've had little interest:

    Linguaphile online

  101. Bantu languages Re:hundreds of sign languages by markgriffith · · Score: 1
    It's West-African Bantu languages which have the fearsome reputation among linguists for being festooned with grammatical aspects. Bantu is a big group of languages, even compared to Indo-European.

    I envy you knowing an African language - is it Xhosa (Winnie Mandela's mother tongue I believe) which has that clicking sound in the throat?

    1. Re:Bantu languages Re:hundreds of sign languages by blackcoot · · Score: 1

      I know that Zulu has 3 clicks which aren't used nearly as frequently as Xhosa's; however, it's the Khoisan languages that really take the cake --- the entire langauge is essentially one long sequence of clicks (incidentally, this is how the Hottentots came to earn their name --- it derives from the Dutch words for "stammerers" or "stutterers" around the early-mid 17th century). I don't know that the Khoisan languages are Bantu derivatives.... As for the Mandelas, *thinks* they may speak Sesotho, but I'm not sure --- I left SA just before The Elections.

  102. Zulu, Xhosa, Sesotho by markgriffith · · Score: 1
    Wow, I would really like to hear those sounds you describe. Actually, it's probably pretty simple if I stir myself -- I could probably find some South African web radio stations.

    Last year I met a German restaurant manager in Budapest who had picked up some Zulu in SA. Did your time in the country leave you optimistic/pessemistic about their future?