Multinational Machine Translation?
TheLocustNMI asks: "I'm currently employed as a systems analyst for a mid-sized consulting firm, and we have been charged with the task of finding a multilingual solution for an entire enterprise system. After much study, the question remains: Is there an effective multi-lingual machine translation system out there? Could one be built? Would it be a massive distributed knowledge system, akin to Everything2? Could it be free to the net public? Ideas?"
The problem with translation systems is grammatical ambiguity.
An even worse problem is ambiguity due to cultural issues. The phrase "cold slice" in the US almost always means leftover pizza. How to translate something like that into another language for a culture that may not eat pizza, may not eat convenience foods, may not eat leftovers, etc.
Don't forget "plane" as when shaving off a layer of wood. And I'd rather forget the people who just plain can't spell.
Yes, I know the difficulties with translations. But there are tools hither and yon with various skill levels.
Deep Translate - New, from IBM: Speaks 125 more languages than Gary Kasparov!
"It's tough to be bilingual when you get hit in the head."
I think a major problem with the undertaking is that (last time I checked) the fundamentals of how human brains deal with semantic/pragmantic issues (i.e. how we process concepts) was far from nailed down. Half the linguistics community thinks Jerry Fodor is out there, and half of them think he has a point, when he says all concepts are innate.
Now whether or not you believe what he says, the fact that there is so little concensus about something as fundamental as what a concept is and how we process them is a bit worrisome. This is going with the assumption that whatever we build on the software side will mimic *us* on the software side (yet another good question).
You may wonder what all this has to do with machine translation. Well, one of the difficulties mentioned before is known as the "scope problem" -- i.e. if the computer is to use any semantic knowledge (of the world - i.e. concepts) to sort through ambiguity, where does it begin? How do you create an understanding of the world so that there is an understanding of definitions?
I once wrote a PROLOG based Natural Language Parser in college, and thought it was pretty cool until I realized that it was the biggest can of worms I ever opened...
just my blog and pix
the *same* word can have different meanings in different countries that share the same language... For example, in Mexico and most of Latin America, "pollo" means chicken; in Spain, it means "cock," in the less traditional sense.
Moreover, how could you translate the word "boy" into Spanish when in Mexico they may call him "chamaco," in El Salvador "cipote," etc.?
heidi
Thats why I mentioned the Everything2-sorta thing. When it first starts out, the translator will only be as good as current engines are, but if it is developed to allow "learning", and by learning i mean the application of grammatical rules etc from users, it could become rather extensive and deep.
It would be quite the undertaking, but if enough analysis and design was put behind it, it could be done. Of course, that's like saying we could develop a matter transporter in two years if we had all the scientists and engineers in the world working on it, too... hehe.
Ham on rye, hold the mayo please.
thelocust[dot]org
The problem with translation systems is grammatical ambiguity. Computers lack the massive "database" of context which we humans have for resolving ambiguities.
How do you translate "plane"? It could be "plane" as in mathematics, or as in flying...
That's just one simple type of ambiguity. When you have a fluid grammar like English -- for which a BNF grammar cannot be made, regardless of how many tokens of lookahead you have -- the lack of context makes translation all but futile.
Assembling such a database is not a theoretical impossibility. It is however a *practical* impossibility for the present and foreseeable future.
-JF
MrJoy.com -- Because coding is FUN!