Slashdot Mirror


A Universal Networking Language for the Internet?

Anonymous Coward writes: "The United Nations University is developing a Universal Networking Language for the Internet, which is designed to allow effective communication between people writing in their native languages, with automatic conversion through an intermediate Meta-language (perhaps a precursor to Star Trek's Universal Translator.) They will be holding a symposium on the technology on 18 November in Brussels, Belgium, where they will publicly announce their achievement. They claim that the initial stage of UNL will support 16 languages: Arabic, Chinese, English, French, Russian, Spanish, German, Hindi, Italian, Indonesian, Japanese, Latvian, Mongol, Portuguese, Swahili and Thai." An interesting idea, but this is one of those "the devil is in the details" things. It'll be interesting to see how/if this can work.

35 of 291 comments (clear)

  1. DLT project did this with Esperanto by limako · · Score: 2
    The Distributed Language Translator (DLT) was a project in the Netherlands that made a first pass at this 10 years ago. They started with Esperanto and then made some changes to disambiguate words (even more than Esperanto already does). It worked pretty well, but suffered from the same kinds of problems anyone who's used translating software has seen before. Here's a nice article about it -- in Esperanto, of course.

    What's evil about these projects, of course, is that they don't let people just talk to one another. It would be neat to be able to have access to the literuture of other countries, but that pales in comparison to having access to the people in other countries. If you just learn Esperanto you can really converse with people without needing technology or anything. It just works.

  2. Bablefish crap by SpinyNorman · · Score: 2

    Why would anyone want their web page to read as if it's been run through a bablefish? A translation from netspeak into, say, English is always going to suffer some mangling, and most likely is not going to allow idiom, metaphor, etc.

    Machine translation will improve, but the best oranization is still going to be browser or proxy based translation. If that translation package internally uses an intermediate semantic representation, then fine, but the day /. reads like bablefish crap is the day I find myself an English web site.

    You have to admire the democratic thinking though (NOT!) - rather than just foreigners seeing your web page as crap, you can (must) see it that way too! Designed by politicians, no doubt.

  3. What shape will it be? by stx23 · · Score: 3

    I'm hedging my bets it will be fish shaped, and will fit into the inner ear.

  4. Language support by Ledge+Kindred · · Score: 2
    What, no support for Esperanto?!

    -=-=-=-=-

    --

    -=-=-=-=-
    My mom's going to kick you in the face!

  5. And by eliminating all communication restraints... by handorf · · Score: 2

    it will allow more and longer flamewars than anything else since the invention of SNTP!
    (with a nod to Douglas Adams)

    But still a very cool idea!

    --
    -- IANAEG - I am not an elder god.
  6. Re:Sounds possible.. by Cuthalion · · Score: 2

    Is this what happens inside the head of a bi-lingual person?

    Uh, no. Typically, I believe bi-lingual people internally switch back and forth, or represent some concepts in one language and others in the other. That is to say, when they actually are using internal linguistic representations of things at all.

    --
    Trees can't go dancing
    So do them a big favor
    Pretend dancing stinks!
  7. Re:it will fail by dillon_rinker · · Score: 2

    You're right, but any decent "universal translator" will not stop at translating individual words. Its dictionary would extend to phrases of the sort you mention. Perhaps it would define "stand at window" as "stand .1m - 1m away from window, while normal vector from plane of body intersects window." Regardless, it would be quite a chore to accomplish this. Context is everything. The more subtle the meaning, the more context you need For example, if I said "That's really smart", you don't know if I'm being complimentary, self-deprecating, ironic, or insulting.

  8. sounds difficult by TheCodeMaster · · Score: 4

    I'd think it would be difficult to make an abstracted meta-language out of human languages. There's lots of grammatical issues which would be particularly difficult to deal with well.

    For example, in the case of inflected languages, how do you get the declensional case information into the metalanguage? In many languages, there are grammatical cases have overlapping declensions, so there's ambiguity about what would be intended with meaning. And mapping between languages would be really tough.

    Verbs would be really tough. Like in Russian, you have three tenses (past, present, and future) as well as two verb aspects. So you have pairs of verbs, one expressing action that occurs once, the other expressing habitual activity.

    Sounds like the project would be lots of fun to work on, though. It's a really neat idea, linguistically.


  9. The "meta-language" by Keelor · · Score: 2
    It seems to me that the strength of the meta-language will be the entire strength of this system. The question is, will the meta-language be skewed towards one language (*cough* English *cough*) or will they manage to create a language that does not impose biases toward one language.

    Overall, I agree strongly with the idea. From a testing standpoint, with the development of an effective meta-language, all one would need to do test the translation for the most part is go from language x->meta language->language x. If that works, than presumably the meta language did not slaughter language x.

    One question I have is how the language engine will handle words it does not know--or, more likely, abbreviations, misspellings, and slang. From what I've gathered, this is where other translators fail. If the translator doesn't understand half the sentence, than it generally has too much trouble finding context for the rest for anything to make sense. Just a thought.

    -Keelor

  10. We already have a "Universal Language" by Blackheart · · Score: 2

    We already have a "Universal Language." It's called English.

    I'm not trying to be facetious; I'm not saying English is better than other languages; and I'm not saying that English will serve you best, or even tolerably well in all places; but it is an inevitable conclusion you must come to after spending any reasonable length of time abroad: if there is anything resembling a universal language in this world, it's English.

    English is already a lingua franca in technical and many academic fields. Many universities in non-English-speaking countries actually demand that graduate students write their theses in English, because that is the best way to ensure its diffusion. Some such schools even conduct their classes themselves in English.

    The Hollywood movie industry has also no doubt played a large part in helping to making English (not to mention Western culture) palatable and popular the world over. Dubbed versions of films are hardly ever as popular as subtitled ones (exception: kiddie films).

    Is English the best choice for a universal language? Definitely not from the point of being easy to learn. Esperanto would be much better. But realistically Esperanto doesn't have a chance. If English ever encounters a contender, it will probably be Chinese, if only because 1/5 of the planet speaks the language.

    BH

  11. Confused by Urmane · · Score: 3

    I'm a little confused ... does "Universal Networking Language" mean Esperanto or TCP/IP?

    --

    --
    "I find your lack of faith disturbing." -- Darth Vader
  12. Re:Sounds possible.. by Stephen+Williams · · Score: 2

    Seems to me that it will be easier to write a translator from your native language to a very well defined and documented intermediate language than trying to understand the fine details of a non-native language.

    Though I know nothing about natural language parsing and translation, it seems to me that, from a software engineering point of view, translating from a spoken language into the metalanguage will be the hardest part of the exercise. This will be especially true with languages such as English that have inconsistent grammar and more than one way to do everything (makes you wonder if English is the spoken equivalent of Perl :-)

    Once you've got your text translated into the regular, simple metalanguage, it should be an easier task to convert it into a natural language than conversion to the metalanguage was.

    (Incidentally, the parallel with Star Trek's universal translator was a good one. In Star Trek, outgoing communications from Starfleet vessels are translated into a metalanguage called Linguacode which is supposedly easier for the alien's translation computer to process.)

    -Stephen

  13. Re:UNL? Yeah, right! by Noryungi · · Score: 3

    OK, here are some more answers.

    Watch out this is very, very long...

    Don't think about it as "automatic" translation, it's much more likely to work out as semi-automatic. I expect that the process would be something like this:

    1.Run automatic converter from natural language to intermediate.
    2.Have an expert in the intermediate language review the translation.
    3.Run automatic converters to the target natural languages.
    4.Have linguists review the output.

    Compare and contrast with a "traditional" translation process:

    1. Ask a translator to translate from language "A" to target "B". Ideally, the person in charge of the translation should be fluent in language A, a native speaker of B and have at least basic knowledge of the subject at hand (for instance: Open Source).

    2. Ask a linguist, (ideally fluent in language A, native speaker of B, etc.) to review the translation produced at step 1.

    The point is that the intermediate language should be designed to be free of the ambiguities that plague language translation.

    And how exactly can you do this? Either your intermediate language is "limited" (that is to say: misses many of the subtleties of the original language), which eases step #1 but certainly introduces many errors down the line. Or, it is an "advanced" language, that is able to translate many of the finer point of your "start" language -- but then, the interesting thing is the translation engine itself. Not the intermediate language. If your translation engine is good enough to translate, say, Spanish into UNL with little/no loss of meaning, it is also good enough to translate Spanish to English with no intermediate step!! If this is true, what's the point of UNL.

    Another point is, how can you be an expert in an "intermediate language"? Either the language is "human-readable", but probably produces an output compared to sludge and correcting this sludge may introduce additional errors. Not to mention the pain it represents to check on something that borders on the unreadable. Or it is machine readable -- but in that case, who is going to read it?

    Final point is productivity: using UNL, computers and machine translation may take longer than a simple translation "by hand" with human grey matter. A Windoze95 machine with MS Word and some good "paper" or digital dictionaries is, in many cases, more efficient and cheaper than going through the pain of machine translation.

    The hope is to minimize or eliminate step (4).

    Good luck! Frankly, this has been the "Holy Grail" of machine translation ever since it started. And I do not think we are any closer. So, far, every large, international institution that I am aware of (UN, UNESCO, EU Commission, EU Parliament, NATO, IMF, etc) either use tons of translators or have standardized on a couple of languages at most (English being, of course, the "Lingua Franca"). All the large international institutions mentioned above that use machine translations ahve discovered that, even on simple subjects, the 4th step you describe above is the one that consumes the largest time.

    It would be a big win if you could get to the point where all the hard stuff is done just *once* instead of repeated over and over again for all of your target languages.

    Again, this is the "Holy Grail" of machine translation. I don't believe that we are any closer to it than we were, say, 30 years ago. At least not judging from the output of some of the software available out there...

    And no, this will not work for poetry or humor, but there's no good way to translate poetry and humor in any case. The idea would be to get it to work with technical, legal, and business language.

    Sorry to say this, but this does not work very well either for legal or technical language. It may work with Business, since PHBs are so limited intellectually =). Legal translation can be horrendous: I have translated many legal documents in the past and I can tell you there is nothing worse than that, because legal terms are incredibly complicated and old-fashioned and also since legal trivia has to be rendered in a very exact manner. Legal terminology (in almost every language) is one of the most confusing and complicated one. Plus, lawyers and legal people are a major pain in the neck when it comes to Once you get the terminology right, I agree the rest of a legal document is usually a matter of "filling the blanks". But getting the legal terms right is enough to drive you nuts.

    Technical translation is another problem: I think some technical areas may be the best bet for machine translation yet. The problem, as far as the technical field is concerned, is that in fast-moving areas (computer science is one) the technical vocabulary is changing and evolving so fast it's hard to keep up pace. I read up to 5 computer magazines a week (not to mention a daily dose of Slashdot =) just to keep up-to-date with the latest evolution in language and technology. Keeping a UNL database of terms and translation could prove to be a daunting task...

    >What's so special about UNL? Theoretical translation of language A into a universal language and from there to language B is almost as old as "machine" translation itself.

    The fundamental argument is that it hasn't worked before so it isn't going to work now is stupid. It has been demonstrated how difficult it is to do this, but not that it is impossible.

    Please note that I never said (in the sentence you quote above) that this is not going to work. I just said that, as far as I am concerned, using an "intermediate" language is old news. This may be a new and interesting idea to you, but, frankly, for someone who has worked in translation, you could very well trace back this concept all the way to Volapuk and Esperanto. And these two were invented in the 19th century.

    As far as I am concerned, I think you could prove that correct translation is impossible. All you would have to prove is that a "human" language is a chaotic complex system, which usually follows unpredictable rules and has several strange attractors, inducing a runaway complexity.

    Case in point: English. Roots: Saxon dialects, Norman dialects, Old English and Old French. Latin. A little bit of Greek. Maybe German and Old Dutch. Evolution influenced by French and a myriad of other languages. Now divided into several branches (US English, British English, Irish English, Australian English, Indian English, International English), all of them influencing each other and countless other languages. Reducing the English language to a set of neat little equations and computer routines is left as an exercice to the reader... =)

    Please understand me: computer translation of "basic" English into UNL and from there into Chinese, French, Spanish, Italian, Japanese, etc... is no big deal. Computer translation of highly technical/scientific papers may be achieved. But even then, due to the inherent complexity of English (or any other human language), a human will have to review the machine translation and correct it.

    I therefore suppose that perfect translation does not exist (or is impossible). Translation (like programming) is an art, not a science. You can have a certain number of "artistic" rules, but you cannot have a "perfect", scientifically proven, solution.

    Example: give a problem to be solved to two good programmers, and they'll probably come up with two different and equally valid solutions. Which solution you pick has to be determined by other factors (speed of implementation, maintenance and evolution of the system, optimization, resources used, etc).

    Give a translation to be done to two good translators and they will probably come up with two rather different and equally valid translations. Which one you pick is then determined by other factors (length of translation, speed of said translators, price of translation, style, etc). Complex systems, like languages, cannot be reduced or predicted. They can be analyzed and more or less "solved" -- the quality of the solution being dependent on many factors, such as the experience of the specialist, his choice of tools, etc. This is true even in reductive or limited systems, where, for instance, the vocabulary to used is small (see technical translation above).

    Remember the butterfly in Brazil that creates a storm at the other end of the world? I suspect translation (especially multiple language translation) may well be the kind of complex system that is so hard to solve using computers.

    Perfect translation, like perfect programming, is only possible in a very limited scope. A "DO ... UNTIL" loop is the perfect solution for certain problems, and "dinero" is the perfect translation of "money" into Spanish. A TCP/IP stack, no matter which OS it is running on, will always have some sort of ACK/NACK test. But these are all very limited examples.

    >For a good example of the total and dismal failure of machine translation,
    >try translating this text into French (or Spanish, or Italian, or whatever)
    >with Babelfish and back to English. Then do it a few times. Then try
    >English to Chinese and back a few times. Case closed.

    Hardly, Here's why that is not a valid test

    1.Babelfish doesn't use an intermediate language.
    2.Babelfish doesn't even achieve loseless translation from
    language A to B and back to A. This is the simplest case and
    one which can be improved the most with a good definition for UNL.


    Answers:

    1. A intermediate language should introduce even more bugs into Babelfish translation. See above.
    2. "Lossless" translation is impossible. See above. Complex systems, such as human languages, cannot be reduced easily to a set of equations.

    >It is, in fact, an even better AI test than the Turing test.

    They do not claim perfect translation, but yes computer which could translate between languages and do it perfectly would pass the test. Do you really argue that it is impossible for computer programs to ever pass the turing test? It is only a matter of time till this happens. The only way to stop it is to stop making computers.

    Actually, I thought a computer had managed to recently pass the Turing Test, or some limited version of it. Anyone out there could supply information on this one?

    But: I don't think the Turing test is actually a very good AI test. There is a huge difference between a program that is able to "talk" to you (parrot back what you said) and one which is able to understand you. A computer able to understand human language would probably be the first real AI on this planet. Most Turing test software are based on some variation of Eliza, and this has been around for ages.

    Here we are reading /. At the very heart of the cutting edge. (some text removed) I wouldn't expect your friends to be out of work any time soon. But isn't the job of a professional translator radically different now than it would have been 100 yrs ago? Political change was not the only thing that caused this change... communication technology has had a big role.

    Well, this may be surprising to you, but the work of a professional translator has not evolved very much. Computer and communication technologies have eased their task a lot. Like many other professions, translators are now able to work from home, access the Internet and its wealth of information, send documents to clients by e-mail, and even use some very clever software that ease the translation process (TM/2, Trados, etc).

    Word processing, in particular, certainly is the best thing to happen to translators since sliced bread =). Also, I agree that many new translation fields have been added in the past century: biology, computer science, aerospace, etc.

    But the central fact remains this: to be a translator you have to be fluent in (at least) one language, a native speaker of another, and have a good expertise in one or more field of human activity. That's it. Oh, and you have to have a certain "talent" with languages, just like you need to have a certain "talent" for programming. It's an art, remember? Even the best-trained translator is worth 0 if he/she does not have that special "talent". Exactly like a lot of people work on Linux -- but there is only one Linus Torvald. =)

    We may translate faster, have more tools and information at our disposal, and produce better-looking documents -- but the core skills remain the same and the work process is exactly the same. You could train a translator today in the exact same way they were trained 100 years ago: with a pen and a piece of paper. Sorry to disappoint you, but Computer technology is not always the perfect solution it prides itself to be...

    That's All Folks!

    --
    The right to offend is far more important than the right not to be offended. (Rowan Atkinson)
  14. Poor website by Tet · · Score: 3

    For a project that's supposed to allow effective communication, they could at least have designed a web site that works well in all browsers. No alt attributes for images... Sigh. Those of us using lynx just have guess, based on the image names :-(

    --
    "The invisible and the non-existent look very much alike." -- Delos B. McKown
  15. Interesting, but maybe off the mark by Shin+Dig · · Score: 2

    I will admit to not having read all of the UN documentation, but what I can tell about it from what I have read, they are attempting to create a abstraction of language in general.

    Although this is an interesting idea, it makes an assumption that all language is based off of one abstract "map". IMHO different languages have different maps. Having spent a fair ammount of time learning ancient greek in high school and college, I can say that the map for that language is quite different from english, and those are both Indo-European languages.

    The concepts that exist in one language may not in many other languages, which is often very problematic. Eventually, to learn any language, you must actually just start thinking in it, and not doing translation to your native language. Contemplating the 3 voices in greek (active, passive, and middle) is something I rather enjoy doing, as it is very foreign to english.

    I am just afraid that they will have to produce a Least Common Denominator language which won't be useful for anything beyond technical specifications and instructions. I will have to admit that that would be useful on many fronts, but may not be the dream that we were all hoping for.

    --
    There is no silver bullet. Plus, werewolves make better neighbors than zombies or vampires anyway.
    1. Re:Interesting, but maybe off the mark by YellowBook · · Score: 2
      Although this is an interesting idea, it makes an assumption that all language is based off of one abstract "map". IMHO different languages have different maps.

      Actually, the dominant paradigm in formal linguistics, generative grammar, implies that all languages are generated from one abstract "map", the so-called Universal Grammar. Now, actual grammars vary a lot, but the idea is that they can be generated from the Universal Grammar by tweaking various parameters. The main evidence for this is the specialized language-learning ability of human children, and particular evidence about how that ability works and doesn't.

      Now, as to whether this will make universal automated translation via a metalanguage possible, that depends a lot on the metalanguage. I envision the metalanguage looking a lot like "glosses" in syntax papers, rather than an actual language, so that you preserve all of the language features of the original in the metalanguage. The more languages the metalanguage is supposed to accomodate, the larger it will be.

      Even if the metalanguage is perfect for all the supported languages, there will be problems with idioms, probably with slang, and certainly with cultural concepts. But in general, how important those failings are will probably vary depending on the conversation. On the whole, I think that both the most enthusiastic and most critical posts I've seen in the comments to this article are underinformed.


      --
      The scalloped tatters of the King in Yellow must cover
      Yhtill forever. (R. W. Chambers, the King in Yellow)
      --
      The scalloped tatters of the King in Yellow must cover
      Yhtill forever. (R. W. Chambers, the King in Yellow
  16. Who is gonna patent this first? by dieman · · Score: 2

    I think this is a very neat idea. My worry is, who will patent the technology first and screw the world.

    Amazon does it with ecommerce 1-click. Microsoft does it with style sheets. Hell, if its a good.. Interesting technology why not, lets take it and pantent it to death! Then we can charge everyone for it and make a zillion-and-one dollars. Perhaps I should send in my application today!

    There needs to be limits on patents. Yes, I believe they do foster invention, but they also can stop community work on a really-good-thing.

    Perhaps a community-patent-agency and a easy, low cost effort to setup patents that are held by some sort of group for the explicit reasoning of keeping some basic ideas *free* for us geeks and the rest of the world.

    Really, it shouldn't have to come down to this tho. But someone will patent the implementation of this and we will all be screwed.


    My $0.02

    --
    -- dieman - Scott Dier
  17. It won't work. by jd · · Score: 2
    At least, not in general. Regional expressions, local terminology, written accents, cultural mannerisms, and all sorts of other fiddly details, might not HAVE a direct translation, into the meta-language OR into any other language.

    Yorkshire, UK, for example, still uses "thee" and "thou". If you translate this into some kind of meta-language, it's either going to barf, or lose details. Those details may be important to meaning. God only knows how it'd cope with Cockney slang, or even common phrases (eg: "from the horse's mouth", "a sticky wicket")

    As I see it, this can ONLY work for formalised documents, using a formalised subset of the various languages. eg: Legal papers, UN treaties, etc. It'll NEVER work with informal, written language.

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    1. Re:It won't work. by jd · · Score: 2

      I shall continue to believe in Compo, no matter what thee says, lad!

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  18. Re:sounds difficult - not as you say by MobyDisk · · Score: 2

    I will be difficult, but I don't think for those reasons:

    Verb tenses are not the problem. Every language can express every tense, just in a different way. Hard yes, impossible no.

    Additionally, approximations work well enough. Ex. Most English readers couldn't tell you the difference between past tense and preterite(sp?) tense.

    Grammar is easily defined. 90% of language could be described in a BNF. adv-adj-noun in one, noun-adj-adv in another. So what. That is probably the simplest part.


    My interest would be in the meta-language design. Words by number? string? Grammar by parsing into a std format, or classifying each word? Are there multiple ways to organize a statement? What about this "word hierchy" they talk about. Quite cool there.

  19. This is a very good idea by Cuthalion · · Score: 2

    This allows the semantic extraction to be MUCH more computationally intensive than systems like babelfish can afford. When you make a document, it's okay to spend an extra 15 seconds to extract a pretty good representation of the gist of it, so long as it doesn't need to happen every time the page is viewed. (babelfish doesn't even cache translations, does it?)

    Okay, so some of the idioms and convoluted sentences will be improperly converted, and will need some manual tweaking. Hopefully this system will allow this tweaking to take place. By providing multiple different conversions back into the author's native tongue, they may be able to see some of the translational oversights, and fix them.

    This won't be good for poetry, but will allow people who only know one language (English speakers seem more likely to fit this category than other people) to publish documents readable by people who do not speak English - that's a substantial breakthrough.

    It would be nice if this standard would allow segments to be set to fixed translations, so that if I really wanted the English to read a particular way, I could enforce that particular idiom, without loss of generality. ("Normally translate 'it has a low probability' but if you ARE translating to english, substitute the literal string 'fat chance'")

    --
    Trees can't go dancing
    So do them a big favor
    Pretend dancing stinks!
  20. it will fail by jilles · · Score: 3

    Though at first sight the idea of translating to an intermediate language seems interesting, I can't help but note that similar projects in europe have all failed so far.

    Automatic translation between languages in the EU is something that could save a lot of money. So there have been a lot of research projects funded with loads of EU money to accomplish this. All of these projects have failed (as far as I know).

    This seems to be a similar effort, this time by the UN which is an equally burocratic organization. I think the goal of this project is probably too ambitious to work. Even translations between two related languages (english and german)are troublesome (babelfish for example is not exactly perfect), so I can't see why translations to an intermediate language would change things (ever tried to do that in babelfish? the result is not pretty).

    So, it will probably fail and loads of money will be wasted on it.

    --

    Jilles
  21. Spelling errors by scumdamn · · Score: 2
    Imagine the translation errors when people spell words like "you're" "lose" "its" and "too" wrong.
    A holy war could be started because of the sentence:
    Your wife is going to lose, but you will win.

    If it's spelled incorrectly.
    Use your imagination.
  22. UNL? Yeah, right! by Noryungi · · Score: 3
    All right, all right, all right...

    Several points -- for full disclosure, let me just state that I am a localization engineer, with a 5+ years of experience in software localization (read: adaptation into different languages) and a 7+ years experience in translation. If that does not makes me qualified to comment on this, I don't know what does.

    • First of all, I do not really believe the UN can produce anything remotely interesting, technically speaking. I like the IETF motto: "we believe in rough consensus, and working code". Show me the money^H^H^H^H^Hcode first, please.
    • What's so special about UNL? Theoretical translation of language A into a universal language and from there to language B is almost as old as "machine" translation itself. As far as I remember, early EU research into machine translation were based on a similar idea -- and they were dismissed as a failure.
    • For a good example of the total and dismal failure of machine translation, try translating this text into French (or Spanish, or Italian, or whatever) with Babelfish and back to English. Then do it a few times. Then try English to Chinese and back a few times. Case closed.
    • People, Star Trek is nothing but TV! Don't misunderstand me: I love spending an evening with Cap't Kirk and Mr Spock, but this not reality! The Universal Translator is, in my opinion, a perfect (read: impossible) dream. It is, in fact, an even better AI test than the Turing test. The day a computer can perfectly translate a text from language A to language B is (a) the day I'll be out of a job and (b) the day I'll begin to seriously worry about that glowing red camera and calm voice saying: "Would you like a nice game of chess, Dave?".
    • Frankly, would you trust somthing as big, bureaucratic and inefficient as the UN to determine the next standard in machine translation?
    • Finally, I have some friends who work at the UN as official translators, and they are doing perfectly fine, thank you very much (and, I should manking some serious money). Why? Because, AFAIK, no machine has ever been able to translate perfectly the multiple meanings, subtle changes in context, double-entendre, puns, cultural and historical framework, regionalisms, etc. that exist in every language on this Earth. Call it the "Curse of Babel" if you will, but a human brain is, and will remain for a long time the best translation machine there is. Machine translation has its place, but only on documents of a very limited scope/vocabulary and of a very repetitive and technical nature. Even then, a human translator is needed to correct the multiple mistakes made by the machine.


    Of course, I may be completely wrong and UNL may be the next best thing since sliced bread. But I doubt it.

    --
    The right to offend is far more important than the right not to be offended. (Rowan Atkinson)
  23. And here's an antecedent... by SEE · · Score: 2

    This discusses a similar project...

    Wow. I've been on /. longer than I thought, to have remembered to look for that...

  24. Do they have a clue? by Jered · · Score: 2

    "enconverter software"? "Inter-Net"?

    Are there any actual computer scientists or linguists involved with this project? Their web site looks like it's either a team of bureaucrats or fifth-graders.

  25. Re:Language support - Esperanto? by Chalst · · Score: 2

    Let me just add something to the above, since I haven't made myself clear in what I have said in the above.

    In German it is possible to use the definite article to refer back to something used in the previous sentence, rather like `it' in English: but with the crucial distinction that what we refer back to must be of matching gender. So if a masculine, feminine and neuter word occur in the sentence it is possible to refer to any of them with the `it'. This ability to refer on the basis of gender must be captured in our syntactic model. Similarly the case system allows one to have multiple indirect objects (one accusative, one datave and one genetive, for example) directly attached to a verb, where in english one would use a preposition.

  26. one big bland language coming right up by tuffy · · Score: 2
    Assuming this scheme can work and they can map some subset of all languages to one another, the result won't be terribly pretty to use. The whole idea behind having different languages is to express cultural diversity - like the old adage that eskimos have 11 different words for snow. There's no way a universal language could capture that level of subtle differentiation.

    On the other hand, they just might be able to come up with a way to map a small subset of natural language, computer-speak for example, for the purpose of easing the creating of internationalized apps and making web sites more navigable. But I don't see how this could be successful in a general case.

    --

    Ita erat quando hic adveni.

  27. Noam Chomsky and the Universal Grammar by Hasdi+Hashim · · Score: 3

    For those of you who think this is impossible because of the variations between languages, Noam Chomsky has something to say to you. I was exposed to his idea back in formal languages and automata class. Basically, his argument is that we have universal grammar (UG) parser built within us when we are born. We 'hardened' the parameters to the UG to conform to our prefered language. Sorta of like guile and perl where guile is a very expressive language but perl, while express less, can express the same thing in a more consise manner.

    Universal grammar is defined by Chomsky as ``the system of principles, conditions, and rules that are elements or properties of all human languages... the essence of human language'' [Chomsky, 1978].

    Thus, all languages that we are accustomed, English, Arabic, Malay, Japanese, and Chinese are special cases of a universal grammar. Chomsky and subsequent linguists are looking for those common elements of all languages.

    Universal grammar and the innateness hypothesis

    Universal Grammar in Prolog

    There are lots of discussion about this... see google.

    Hasdi

    1. Re:Noam Chomsky and the Universal Grammar by the+eric+conspiracy · · Score: 2

      Perhaps there is a universal grammer that is innate to humans. But what makes you think that you can implement it in a Turing Machine?

  28. Key differences from Esperanto by ToastyKen · · Score: 2

    As far as I can tell (just guessing, though) there are 2 key differences between UNL and Esperanto:

    1) It's not Romance-based and thus won't be as Euro-centric and will thus probably translate Eastern languages better.

    2) It's designed as an intermediate language and not as a final end-user language. As far as I can tell, it could even be machine-readible and not speakable. In any case, it will not have as many constraints as a language like Esperanto that is designed for human speech.

    These are just my gueses. I don't know what kind of language they're actually trying to implement. (The website is skimpy on those details.)

  29. Interesting, but... by Millennium · · Score: 4

    It's not going to work very well. The problem is that each language has its own nuances, and in many cases these don't translate very well into other languages. I'll use Japanese honorifics as an example. The list of them is relatively long ( -san, -sama, -kun, -chan, -sensei, -wa, and others). Simply by attaching one to the end of a person's name, I can make the same sentence express immoderate flattery or extreme derision. This can be translated in an extremely limited fashion to romance languages such as Spanish or French (by using familiar vs. formal form of address, but it's still limited). It doesn't translate into English at all (this is why I prefer subtitled anime; get the general meaning from the subtitles, and actually listen to the Japanese for the nuances). And, of course, you still have the problem of inflection not translating very well into written words. This makes English particularly unsuitable for network communications, actually, since so much meaning is left to inflection. What's the solution? I don't know. There probably isn't one. Even Esperanto isn't immune to this problem of losing meanings in translation. I don't think a "universal meta-language" is going to work, though.

    1. Re:Interesting, but... by the+eric+conspiracy · · Score: 2

      AN Whitehead was obviosly not a linguist at all.

      In actuality English is perhaps the most complex language in modern use, for a number of reasons. It has by far the largest vocabulary; it takes root words from many many other laguages; it's rules of grammer are highly irregular; because of the introduction of printing before the great vowel shift the spoken form of English does not agree at all with the written from; the geographic spread is so large that several dialects pigdins and patois forms of English now exist. If you propose that we adopt English as the base language you are going to have to be very specific about WHAT English using what local idioms and rules.

      The richness and complexity of English is perhaps best exemplified by the richness of it's body of great literature and poetry where expression and level of meaning are best brought to form by a language that has a great richness of vocabulary and ability to express multiple levels of ideas in a single word. Of all the languages of the world there are three that clearly have great bodies of literature - Sanskrit, Greek, and yes, English.

  30. Linkages by the_tsi · · Score: 2

    Linkwa, pink dama, arf muzheek. Rintintinambulation. Alla da peepholes enda voold, enda looniverse, cargo a schlong ender hertz. Epp, dat schlog arf Unamunda.

    -Chris

  31. Bahasa Indonesia by Hobbex · · Score: 2


    has, to my knowledge, only one tense. And no articles. And plural noted by saying the noun twice ("orang" is person, "orang-orang" is people).

    Needless to say, there isn't much poetry in Indonesian...

    -
    /. is like a steer's horns, a point here, a point there and a lot of bull in between.