Translation Software That Learns by Reading

High school Spanish by KaSkA101 · 2005-02-23 13:56 · Score: 3, Funny

Why didn't I have this software during High School Spanish?

Re:High school Spanish by xtrvd · 2005-02-23 14:25 · Score: 4, Informative

Fortunately I had the next best thing in High School Spanish. The trick is simply going to the #spain channel on efnet and talking nice to some people. You'd be amazed as to how often my teacher would fail my fellow students because they attempted using the primitive babelfish.altavista.com to do their work for them; she could easily spot the syntax errors and mis-spelled english words which were never translated.

Until I see this new process in the works, however, there is nothing that will make me believe it's better than finding another human who can *understand* what you are saying and the context to which you are implying.
Re:High school Spanish by dnoyeb · 2005-02-23 14:31 · Score: 1

good point. Is this not like babblefish?
Re:High school Spanish by Peter+Cooper · 2005-02-23 14:38 · Score: 1

I used to cheat on my French homework when Altavista's Babelfish came out (must have been about 1996/7?). It was great, could type in my homework, hit the button.. and bam, homework was all done.

The teacher actually became suspicious when certain words were in entirely the wrong context, and about the bizarre syntax (totally English word orders, etc), but I was never busted. Quite funny really, since my best friend in class was using it too! Still, I don't think most people realised the Internet existed back then ;-)
Re:High school Spanish by Anonymous Coward · 2005-02-23 14:58 · Score: 0

Because you had a better tool - your brain.

(hopefully...)
Re:High school Spanish by Kaihaku · 2005-02-23 15:10 · Score: 0

Quite true... Those were the good old days. Now everyone's on the net and all the good things have become sickeningly popular. Sigh...
Re:High school Spanish by Servants · 2005-02-23 16:02 · Score: 2, Interesting

Until I see this new process in the works, however, there is nothing that will make me believe it's better than finding another human who can *understand* what you are saying and the context to which you are implying.

Heh. Then there is nothing that will make you believe, etc., etc.

Certainly you can't do good translation without understanding syntax (which influences meaning and underlies word order) and context (to disambiguate synonyms and phrases with multiple interpretations). Machines aren't especially good at either one yet; ergo, machine translation will continue to be pretty crappy for the foreseeable future.

Funny thing is, though, even a crappy translation turns out to be tremendously useful in most practical contexts, and worlds better than none at all; a simple word-for-word translation is typically hard to read but still conveys the proper gist. That's why I don't get excited about automatic translation "advances" these days: there are really two purposes for machine translation. One is figuring out what a piece of speech or text trying to say, and the current technology is usually good enough for that. The other is making a translation of sufficient quality to save a human translator some work, and I think that won't happen for quite a few years yet. Anything in between adds very little.

(By the way, everything in natural language processing these days uses corpus learning techniques. Now if an improved technology had been developed manually by bilingual programmers who pulled the design out of their collective hats, then that would be a man-bites-dog story!)
Re:High school Spanish by shanen · 2005-02-23 16:27 · Score: 1

Well, I was teaching in university around that time, and I used to get some of those machine translations. It was extremely obvious what was going on. Sometimes I would confront the student about it, but usually I just made the effort to teach the student to correct the worst of the problems. Of course that did nothing for the computer that was doing the translations, and the next time around it would be just as bad--so I'd take out double on the grade.
My feeling was that it usually worked out okay. Those students were only going to pass the course in the first place simply because it was my policy to make it harder to fail than to pass. Required classes are like that. Yeah, it would be nice if they learned something along the way, but... (However, there were always a few who were determined to make the supreme efforts. Then I'd have them again the next year.)

--
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
Re:High school Spanish by Temposs · 2005-02-23 19:15 · Score: 3, Insightful

Until I see this new process in the works, however, there is nothing that will make me believe it's better than finding another human who can *understand* what you are saying and the context to which you are implying. "Better" is an ambiguous term. For what these researchers made the program for, it is better than humans for one reason: speed. Sure they want the translations to be reliable, but more importantly is that a computer can do in a few days what would take a human a month, for this application at least. The NSA and the like want to have translations of huge swathes of text, and fast! The sooner they can understand things that are written, the faster they can react to threats. The time and money spent on human translators for this purpose is very slow and expensive in comparison. For your Spanish HW, the best is a native speaker giving you feedback, because the amount of work is small and the translations will be very accurate.

--
Knowledge is just opinion that you trust enough to act upon. -Orson Scott Card

Sure it does. by winterdrake · 2005-02-23 13:57 · Score: 0, Troll

An AI that actually UNDERSTANDS language? Riiight. Now pull the other one.

technical texts by Olaserov · 2005-02-23 13:57 · Score: 4, Funny

I wonder if we could train it to translate a EULA ;)

--
* Olaserov is in the process of thinking up a signature.

Re:technical texts by lakerdonald · 2005-02-23 13:58 · Score: 0

/. to the rescue? http://yro.slashdot.org/article.pl?sid=05/02/23/23 15211&tid=133&tid=17
Re:technical texts by Anonymous Coward · 2005-02-23 14:07 · Score: 2, Funny

Wouldn't work without having some being translated first. Here's one for the cause:
1. If we screw up it's not our fault
2. If you screw up, well you're screwed. ... and you owe use your first born.
Re:technical texts by lakerdonald · 2005-02-23 14:13 · Score: 0

and we can continue to collect non-personally identifiable information indefinitely
Re:technical texts by ozbird · 2005-02-23 15:30 · Score: 1

A rough translation is: "All your rights are belong to us."
Re:technical texts by aussie_a · 2005-02-23 15:36 · Score: 1

It said Nothing for me to see, please move along. I didn't know Slashdot was so zen.

translate to American please by Anonymous Coward · 2005-02-23 13:58 · Score: 3, Funny

Can someone translate that article from British english to American english please.

Thanks.

Re:translate to American please by Grey+Ninja · 2005-02-23 14:26 · Score: 4, Funny

Here's a couple of suggestions for you:

r3Ð(0n3 wr173$ "N3w $(13n71$7 1$ r3p0r71n9 7h47 7r4n$£4710n $07w4r3 7h47 Ð3v3£0p$ 4n nÐ3r$74nÐ1n9 0 £4n9493$ b¥ $(4nn1n9 7hr09h 7h0$4nÐ$ 0 pr3v10$£¥ 7r4n$£473Ð Ð0(m3n7$ h4$ b33n r3£34$3Ð b¥ .$. r3$34r(h3r$. 4((0rÐ1n9 70 7h3 4r71(£3 "7h3 7r4n$£473Ð Ð0(m3n7$ $3Ð 70 734(h 7h3 7r4n$£4710n 4£90r17hm$ (4n b3 3£3(7r0n1(, 0n p4p3r, 0r 3v3n 4Ð10 1£3$. 7h3 $¥$73m 1$ n07 0n£¥ 4$73r 7h4n 07h3r m37h0Ð$, b7 4£$0 b3773r $173Ð 70 74(|{£1n9 £3$$ (0mm0n £4n9493$ 4nÐ 7h3 n$4£ v0(4b£4r¥ 0nÐ 1n $p3(14£1$3Ð 0r 73(hn1(4£ 73x7$.""

And translation #2:

REDCONE WRIETS NU SCEINTIST IS R3PORTNG TAHT TRANSLATION R TAHT D3V3LOPS AN UNDERSTANDNG OF LANGUAEGS BY SCANNG THROUGH THOUSANDS OF PREVIOUSLY TRANSLAETD DOCUMENTS HAS B3N REL3AESD BY US!!!! OMG R3S3ARCHARS!!1!1!! LOL ACORDNG 2 DA ARTICL3 TEH TRANSLAETD DOCUMENTS US3D 2 T3ACH TEH TRANSLATION ALGORITHMS CAN B 3LECTRONIC ON PAEPR OR 3V3N AUDIO FIELS!!1111 TEH SYSTEM IS NOT ONLY FASTER THAN OTH3R M3THODS BUT ALSO BT3R SUIETD 2 TAKLNG LAS COMON LANGUAEGS AND TEH UNUSUAL VOCABULARY FOUND IN SPACIALIESD OR TECHNICAL TEXTS!1!! WTF
Re:translate to American please by Anonymous Coward · 2005-02-23 14:45 · Score: 0

Sure I can translate it in one word

Huh?

Heh Heh Heh, let the flame war erupt.

BTW - I'm baiting you. Don't fall for it ;-)
Re:translate to American please by Anonymous Coward · 2005-02-23 15:08 · Score: 0

Translation:

Can someone translate English into what yanks speak?

Cheers.
Re:translate to American please by chris_sawtell · 2005-02-23 15:36 · Score: 2, Funny

No trouble at all. First take the original text, computer translate it into German, and then back into English.
Now reorder the phrases in every sentence so that the object phrase starts the sentence, change every sentence which contains the word because so that the word because and the words following it start the sentence. Make sure that every infinitive verb has the adverb between the word to and the verb. Change every occurrence of which to that Find every word more than 3 syllables long and inject several short filler words more or less at random near the long word. Finally change every double consonant before the letter combinations ed and ing to single occurrances. Change every ise to ize, and arse to ass.
That'll more or less do it.
Now you know - quoting W. S. Churchill - why we are "Divided by our common language".
Re:translate to American please by aussie_a · 2005-02-23 15:50 · Score: 2, Funny

I can translate it to Australian:

That redcone fella did say something about some rag reporting some computer thingymebob that lets me understand what all those japs are saying. The city rag reckons it's real fast.
Re:translate to American please by Anonymous Coward · 2005-02-23 16:56 · Score: 0

For those speaking American and who want to learn, the University of Michigan has a great "English as a second Language" program. Y'all sign up, ya hear?
Re:translate to American please by Anonymous Coward · 2005-02-23 23:01 · Score: 0

Can someone translate that article from British english to American english please.
What about putting the question itself into English: Can someone translate that article from English to American English please?

English is English. We were here first: American English is a derivation.
Re:translate to American please by michaeldot · 2005-02-23 23:28 · Score: 1

Are you saying Americans break a few stylistic rules here and there? Maybe, maybe.

You forgot:

"Take words that have had a "u" in them for at least five centuries and remove it, colour to color, etc."
Re:translate to American please by Criffer · 2005-02-23 23:44 · Score: 1

cat article.txt | sed -e 's/ou/o/g' -e 's/aluminium/aluminum/g'

American isn't a language, it's a mispelling.
Re:translate to American please by MasterOfCeremonies · 2005-02-24 00:01 · Score: 1

Sure, if you'll translate the rest of the web into British English for me.
Re:translate to American please by MasterOfCeremonies · 2005-02-24 00:17 · Score: 1

Some would argue that American English is the more pure form of English, as in many cases it is actually closer to our once common language. For instance, the -ize ending favoured by the Americans is actually the tradition British spelling from the 1600s; it is since the fork that the British have come to use -ise instead. During the 1800s and beyond there was a very strong movement in the states towards keeping their language as close to its "correct" parent as possible. Just because we spoke it first doesn't give us some special rights over it. Language is open source, hurray! http://www.phrases.org.uk/bulletin_board/13/messag es/785.html
Re:translate to American please by Anonymous Coward · 2005-02-24 00:48 · Score: 0

bollocks
Re:translate to American please by Anonymous Coward · 2005-02-24 00:50 · Score: 0

here here!

'American' is not a language! its just bad english
Re:translate to American please by Anonymous Coward · 2005-02-24 02:53 · Score: 0

Surely whatever the English in England currently speak is the true English language.
Re:translate to American please by Just+Some+Guy · 2005-02-24 03:38 · Score: 1

I was able to read those in near-realtime. Thanks for providing my "you're a geek, get over it" fix for the morning.

--
Dewey, what part of this looks like authorities should be involved?
Re:translate to American please by Anonymous Coward · 2005-02-24 03:40 · Score: 0

Surely whatever the English in England currently speak is the true English language.

However, using the same logic the language currently spoken in the USA is the true form of the language used here, no matter what you call it. We just call it English out of tradition and acknowledgment to the country's main cultural influence.
Re:translate to American please by clem9796 · 2005-02-24 03:51 · Score: 1

Boy, this is similar to when i realized the difference between 'dog's bollocks' and 'bollocks' in the movie Snatch. It was a totally different film after that!

--
IANALOOA
Re:translate to American please by rbarreira · 2005-02-24 10:12 · Score: 1

The first one too? :O

--

The AACS key is NOT 0xF606EEFD628B1CA427BEA93A9CA9773F

Yay! by gardyloo · 2005-02-23 13:58 · Score: 3, Funny

Hope for slashdot. I've always wondered if we only have artificially intelligent editors...

Harry Potter and the Bible by MikeFM · 2005-02-23 13:59 · Score: 4, Interesting

I remember hearing about this a couple years ago. They were using translations of Harry Potter and the Bible to teach this software to translate. It seems to work well. I wonder what it'd make of different translations of technical documentation. That'd probably be even more interesting than what it'd make out of 'quidditch'.

This could be great if it were opensourced. It'd be nice to translate email, instant messages, websites, technical docs, and lots of other stuff we're currently using the fish for. The fish is nice but not that effecient to add to other programs and it's translations aren't usually that great.

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.

Re:Harry Potter and the Bible by obeythefist · 2005-02-23 14:45 · Score: 5, Funny

I never read that one. I thought the next book title was going to be "Harry Potter and the Half-Blood Prince".

Or did JK Rowling suddenly become pious?

--
I am government man, come from the government. The government has sent me. -- G.I.R.
Re:Harry Potter and the Bible by Anonymous Coward · 2005-02-23 15:23 · Score: 0

Harry Potter and the Bible? You repeat yourself.
Re:Harry Potter and the Bible by Anonymous Coward · 2005-02-23 16:00 · Score: 0

technical documentation is usually easy because it is more "Latin" based (so it is easier to translate to Latin languages)..

Informal speech and creative uses of language seem much harder.
Re:Harry Potter and the Bible by ashitaka · 2005-02-23 17:39 · Score: 1

Oh man that was funny! I looked once quickly then looked again and almost squirted my seaweed tea out of my nose.

--
If you don't want to repeat the past, stop living in it.
Re:Harry Potter and the Bible by kaiidth · 2005-02-24 00:01 · Score: 1

I dunno if they've entirely made the best choice of reading matter there. Depending on the version of the Bible they picked, they'll be producing unusually medieval results (second person singular, seest thou). Neither will the combination of Harry Potter and the Bible produce much vocabulary suitable for the modern Muggle world, although I admit that it would be a distinct relief to be certain that my choice of translation software is ready to handle tricky terms like 'troll snot'.

I wish they would open source this software. I can imagine spending many fascinated hours feeding it different authors' complete works and testing the results. Just think; if we fed it Francis Hodgson Burnett and then translated speech, all the characters would be ejaculating instead of merely exclaiming. Presumably if we fed it Jane Austen it would start quoting numbers in a distinctly German manner (four-and-twenty) and a definite hint of subject-verb inversion (said he).

Assuming there's enough source material for a given period, you could imagine adding a temporal element to the next generation of Babelfish. Destination language: English, 19th century socialite... but I think I'm probably overstepping the bounds of a pretty nonspecific press release here.
Re:Harry Potter and the Bible by allanc · 2005-02-24 00:19 · Score: 1

Jesus is said to be the Prince of Peace.
Jesus is said to be half-man, half-God.

So actually, that tracks.
Re:Harry Potter and the Bible by MissTuxie · 2005-02-24 03:10 · Score: 1

I looked once quickly then looked again and almost squirted my seaweed tea out of my nose.

I had no idea Sponge Bob read slashdot!

--
Cozinha para as massas (e para geeks)
Re:Harry Potter and the Bible by Anonymous Coward · 2005-02-24 03:24 · Score: 0

They used the Bible? That's disturbing. Considering that until the Scholar's Bible every translation has been done by people with an agenda, those translations should -not- be taken as authoritative.

As an example, I recall that when King James commissioned the KJV translation, he was of the opinion that "baby sprinkling" was the proper form of baptism. However, the word used, baptismo, means "immersion" (or so I've been told), so he had them transletter it into baptism instead of translating it as immersion.

For quality translations, I don't think you'll be able to beat a skilled translator with the ability to discriminate the quality of other translations. For useable translation, though, this holds a lot of promise.

Shimatta
Your Mileage May Vary
Re:Harry Potter and the Bible by MikeFM · 2005-02-24 03:52 · Score: 1

Maybe it's her way of getting those darn baptists to buy her books. To many of them seem to have some weird idea that Harry Potter is evil. Anyone that's read HP and knows anything about real witchcraft knows that the two have very little to do with each other. May as well say that using the Internet is witchcraft.

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
Re:Harry Potter and the Bible by Anonymous Coward · 2005-02-24 03:57 · Score: 0

May as well say that using the Internet is witchcraft.

Actually it is Satan worship. All those httpd daemons that are being summond from all over the world wide web.
Re:Harry Potter and the Bible by MikeFM · 2005-02-24 04:21 · Score: 1

It wouldn't be a bad idea to feed such a program a source date as well as source language when training it on a document. So that it would have some temporal sense of the language it's using. I suppose variant of the language could be given as an input too. So that the program could learn the difference between gangster and geek. :)

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.

Turing test by OneArmedMan · 2005-02-23 13:59 · Score: 3, Insightful

I wonder if something similar to this could be used for AI , for say Turing Test's ?

Re:Turing test by poopdeville · 2005-02-23 14:44 · Score: 1

Yes, if the AI being tested only knows english, and the tester only knows German, or French, or some other language.

--
After all, I am strangely colored.
Re:Turing test by cgenman · 2005-02-23 16:15 · Score: 1

Sit it in the middle of a major chat hub, and have it learn what to say in response to the last two inputs? Sounds good to me. Where can we find a chat hub?

--
The ______ Agenda

Wow! Does a much better job... by bigtallmofo · 2005-02-23 14:01 · Score: 5, Funny

Teach Software translating on scanning up

Not hard wares that sticks an comprehension of talks by scanning on thousands of fish translated papers has been vomited by US scientists.

Many existing translation not hard wares uses palm rules for botching words and phrases. But the new software, snarked by Kevin Knight and Daniel Marcu at the Information Sciences[...]

Read More...

--
I'm a big tall mofo.

Already done? by Anonymous Coward · 2005-02-23 14:01 · Score: 1, Informative

I don't recall where I read it exactly off-hand, but this had been done for Chinese already. The only news here is that some people are trying to sell software to do that as a commercial product.

Neural Nets and Machine Learning by MyIS · 2005-02-23 14:01 · Score: 2, Interesting

In one way or another this is similar to training neural nets to recognize images, or spam filters to mark junkmail. Great way to put number-crunching power of computers to direct work.

--
http://zero-to-enterprise.blogspot.com/

Re:Neural Nets and Machine Learning by compling · 2005-02-24 02:42 · Score: 1

This has nothing to do with neural nets, other than being an instance of machine learning. If anything, this approach is probably related to Transformation Based Learning, or some other weakly statistical approach. Fully statistical approaches like neural nets choke on the variance of real NLP tasks.

That's great.... by Frodo+Crockett · 2005-02-23 14:02 · Score: 4, Funny

...bu7 (4n 17 unÐ3r$74nÐ £337?

--
"The newly born animals are then whisked off for a quick run through a giant baking oven." --heard on Food Network

Re:That's great.... by MachDelta · 2005-02-23 14:47 · Score: 1

It can now!
Re:That's great.... by Anonymous Coward · 2005-02-23 15:53 · Score: 0

61v3n 4 l4r93 3n0u9h 54mp13 537, 1 \/\/0u1d 54y y35.

That sounds like a good approach by FunWithHeadlines · 2005-02-23 14:03 · Score: 3, Insightful

I wish them luck (cuz they'll need it), but if anything is going to produce translation software that really works it will have to include learning elements of this nature. It's one thing to get dictionary translations. That's been around for decades, with its laughable results. Humans speak in metaphor and simile and slang and contractions and abbreviations of thought all the time. We're the cat's meow of language (try that, computer!).

But if you give computers a bunch of human stuff to read, you expose the dictionaries to language as it is actually used, not just as the dictionary has it. Then when odd language usage falls upon us like it's raining cats and dogs, they will have a database of similar usage to draw upon. Hey, it's an uphill climb, but this is a good avenue to try. Cheerio, computers, and a top o' the mornin' to ya.

Re:That sounds like a good approach by waynemcdougall · 2005-02-23 14:57 · Score: 1

I wish them luck (cuz they need it), but, if all translation software will produce, which really work must include it acquisition elements of this nature. It is a thing to receive for from of dictionary translations to. That is around for decades, with his laughable results. Humans speak simile and jargon and contractions and abbreviations of the thought the whole time in the metaphor and. We are meow the cat of the language (attempt that, computer!). But, if you give a bundle human material to computers, in order to read, set the dictionaries language, like them are really used, not out, straight there the dictionary them have. If odd language consumption falls after us, how it rains cats and dogs, has it a data base of similar consumption to draw to on. He, is it a rising ascent, but this is a good avenue, to of of Cheerio to try from of computers and of of top side O ' mornin ' to ya. English->German->English courtesy of Babel. I will wish their luck (cuz they to need it), but if any will cause to work it truly to have to include this natural study the element translation software. This is a matter obtains the dictionary translation. That is dozens of years, by its laughable result. The person always speaks in the metaphor and simile and the slang and the idea contraction and the abbreviation. We are the language (attempt cat's meow, computer!) . But if you give the computer each bunch of people's material to read, you expose the dictionary to use in fact to the language according to the original design, the dictionary does not have it. Then when the strange language usage falls in our following torrential downpour, they will have the similar usage information bank to the draw. Hey, this is difficultly climbs, but this is a good main road attempts Cheerio, the computer, with above o ' mornin ' to ya. English->Chinese Traditional->English courtesy of Babel.

--
Recycle PCs and build a wireless community network www.hillsborough.org.nz
Re:That sounds like a good approach by FunWithHeadlines · 2005-02-23 15:18 · Score: 1

Ooh, good idea! OK, here is my second paragraph as translated by Systran into German and then back into English:
But, if you give a bundle human material to computers, in order to read, set the directories language, like them are really used, not out, even there the directory them have. If odd language consumption falls after us, how it rains cats and dogs, has it a data base of similar consumption to draw to on. He, is it a rising ascent, but this is a good avenue, to of of Cheerio to try from of computers and of of top side O ' mornin ' to ya.
Cheerio, indeed! And how it rains cats and dogs!
Re:That sounds like a good approach by ddent · 2005-02-23 15:21 · Score: 1

Dictionary: Prescriptive and include words that have become generally accepted.

Lexicon: Descriptive.. attempts to include as many words/uses as possible.

By doing it based on existing documents you end up with a lexicon.

--
SSL Certificate
Re:That sounds like a good approach by rudi_v · 2005-02-23 20:47 · Score: 1

How is the interpretation checked ?

When humans are not really sure about an interpretation they (mostly) ask clarification, or keep it in the back of their mind that the interpretation of that phrase/expression might be incorrect. And consequently are willing to quickly re-evaluate when new information is acquired.

In the case of automatic translation, how would one ever find out that an older interpretation might be wrong ?
Re:That sounds like a good approach by maxwell+demon · 2005-02-23 20:53 · Score: 2, Funny

Bah, one language to other and back is too little. You have to do the complete thing, of course. As you can do here. I've done it to your text for you (of course including the east asian languages):

If it is possible and cuz of the translation of the software of the
wealth (until the necessity to the danger) this person whom it causes,
this member of the quality of the well-educated way and, in me who I
consult that it examines it, of its type of the search of the thing
the truth that the lheo requests to necessary desire of the excess
near this person, I include. That the translation of the dictionary,
of that the extensions he is situation. The consideration is relative
is possible he and this result, with the smile, much hour actively in
the duration. The person and the comparison and the slang and the
contraction and the Synopse and the metaphor of the idea, of that
always say it. Our cat (until a machine of the language of the
measured value! ) it is meow. It is this exactitude of him, but and
affinchè that reads gives and to the computers and this, truth within
the fines of the dictionary, that is to say, is used, flagstone of the
halting of the language the materials with the artificial enemy... It
was distinguished has the payment in advance. If the language, that
one that disowned, uses tractions the cat and with the autumn of the
rain of the dog, this it them chronometers, he will have a basic
beginning of the data of the use of the percentage of fines of history
of him. He was presented/displayed in the ascent, he is, but of
Cheerio and or of the calculation to the interior of the good way he
examined this to the interior or ' mornin'.

--
The Tao of math: The numbers you can count are not the real numbers.
Re:That sounds like a good approach by xtracto · 2005-02-23 23:35 · Score: 1

LOL this is cool:
--
Original English Text:
What the fuck are you speaking about?

Translated to French:
Au sujet de quoi la baise parlez-vous ?

Translated back to English:
About what kisses it you speak?

Translated to German:
Über welche Küsse spricht es Sie?

Translated back to English:
About which kisses does it speak you?

Translated to Italian:
Di quali baci li parla?

Translated back to English:
About which kisses it speaks them?

Translated to Portuguese:
Sobre que beijos fala-os?

Translated back to English:
On that kisses say them?

Translated to Spanish:
En eso se besa los dicen?

Translated back to English:
On that it is kissed say them?

--
Ubuntu is an African word meaning 'I can't configure Debian'
Re:That sounds like a good approach by Vintermann · 2005-02-23 23:55 · Score: 2, Funny

Yeah, it's a really cool resource. If you press the button a couple of times, it seems most of them converge, presumably onto something that is a especially reasonable statement in the mind of the computer.

"The moon's a harsh mistress" converges quickly to

"With the love seriously the moon"

Whereas the text on top of the search

"Sometimes it's fast, sometimes it's slow. Sometimes it doesn't work at all."

takes a long time to converge to

"To the times during the hour, this comes during the period from digiunare, this elasticity in the valve of the accumulation of the pipe, of therefore if with destiller more distilling this, the instrument plus these lengths with the fear of this not he he, Synopse, company more."

Whereas your phrase takes a couple of clicks to converge to
"Titmouse Quant0 of the intention these costs, those the legend?"

Geek points to people who can find a small sentence that grows exponentially, or a big text that converges to a word or two.

--
xkcd is not in the sudoers file. This incident will be reported.
Re:That sounds like a good approach by Anonymous Coward · 2005-02-24 01:29 · Score: 0

I wish them luck (cuz they need it), but, if all translation software will produce, which really work must cover it learning elements of this nature. It is a thing for receiving from dictionary translations. That is around for decades, with his laughable results. Humans speak simile in the metaphor and and in Slang and in the contractions and in the abbreviations of the thought the whole time. We are meow the cat of the language (attempt that, computer!).

But, if you give a bundle human material to computers, in order to read, set the dictionaries language, like them are really used, not out, straight there the dictionary them have. If odd language consumption falls after us, how it rains cats and dogs, has it a data base of similar consumption for on drawing. Hey, is it a rising ascent, but this is a good avenue for trying Cheerio, computers and from top side O ' mornin ' to ya.
Re:That sounds like a good approach by maxwell+demon · 2005-02-24 02:19 · Score: 1

That's nothing compared from what is does with standard slashdot phrases:

Original English Text:
Have you Meta Moderated recently? Regular Meta Moderators are more likely to get mod points.

Translated to Japanese:

Translated back to English:
Meta it eased recently? Rule, the meta regulator seems obtaining the
family point, truth.

Translated to Chinese:
? , ??? ,

Translated back to English:
It recently relaxed step? The rule, the step superintendent obtains as
if? ? ? Family spot, truth.

Translated to Korean:
? ,
? ? ? ,
.

Translated back to English:
It relaxed a phase recently? The rule gets like the phased factory
market? ? ? Family spot, truth.

Translated to French:
Il a détendu une phase récemment ? La règle obtient comme le
marché échelonné d'usine ? ? ? Tache de famille, vérité.

Translated back to English:
It slackened a phase recently? Does the rule obtain like the spread
out market of factory? ? ? Stain of family, truth.

Translated to German:
Es ließ eine Phase vor kurzem nach? Erreicht die Richtlinie wie die
Verbreitung aus Markt der Fabrik? ? ? Fleck der Familie, Wahrheit.

Translated back to English:
It left a phase recently? Does the guideline reach like the spreading
from market of the factory? ? ? Mark of the family, truth.

Translated to Italian:
Ha lasciato recentemente una fase? l'estensione della guida di
riferimento come la diffusione dal mercato della fabbrica? ? ?
Contrassegno della famiglia, verità.

Translated back to English:
It has left recently one phase? the extension of the guide of
reference like the spread from the market of the factory? Mark of
the family, truth.

Translated to Portuguese:
Deixou recentemente uma fase? a extensão da guia da referência como
a propagação do mercado da fábrica? Marca da família, verdade.

Translated back to English:
Left recently a phase? the extension of the guide of the reference as
the propagation of the market of the plant? Mark of the family, truth.

Translated to Spanish:
Izquierda recientemente una fase? la extensión de la guía de la
referencia como la propagación del mercado de la planta? Marca de la
familia, verdad.

Translated back to English:
Left recently a phase? the extension of the guide of the reference as
the propagation of the market of the plant? Mark of the family, truth.

(The eating of the asian languages is actually done by slashdot, all the rest, including the question marks, is done by babelfish)

--
The Tao of math: The numbers you can count are not the real numbers.
Re:That sounds like a good approach by Zerth · 2005-02-24 03:24 · Score: 1

It isn't a big sentence, but "Try it for yourself"
stops at "It examines marcature" after about 3 repeats.
Re:That sounds like a good approach by Linux+Ate+My+Dog! · 2005-02-24 03:24 · Score: 1

Translation software as a field to play Conway's Game of Life on. Cool.

Philosophical caveat by Raindance · 2005-02-23 14:03 · Score: 4, Insightful

As a caveat, we should be wary of saying the system "understands" a language.

I would say generally that humans able to translate between languages generally understand both languages, but whether a statistical, probabilistic model based on correlations understands a language might be a stretch.

Further reading: Searle's Chinese Room argument- http://en.wikipedia.org/wiki/Chinese_room

This is akin to asking, Does your tax software understand the tax code? Does Photoshop understand the principles of image manipulation?

Are these silly questions to ask?

Further reading: Dennett on intentionality (http://en.wikipedia.org/wiki/Dennett but the entry is pretty sparse).

RD

Re:Philosophical caveat by MikeFM · 2005-02-23 14:08 · Score: 3, Interesting

Does anybody understand the tax code? Why should software be any different?

I think that software that can learn can be said to understand a problem just as much as a human can. The difference between understanding and just doing is having the ability to learn from new data and to change your actions as required.

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
Re:Philosophical caveat by back_pages · 2005-02-23 14:47 · Score: 4, Insightful

Great example of this:
Mom baked for three hours.
The pie baked for three hours.
"Mom" and "The pie" are the subjects. The verb and entire predicate are identical. Understanding the language disambiguates these sentences, but the ambiguity is part of what defines humor.
A man walked into a bar. Ouch!
A man wanted to win a pun contest in the local newspaper, so he entered 10 times in order to increase the chances that one of his entries would win. Unfortunately, no pun in ten did.
You can translate that 50 ways from Sunday but without understanding the language - understanding what makes those statements interesting - the machine will lose all their meaning.
Re:Philosophical caveat by GreyWolf3000 · 2005-02-23 15:00 · Score: 1

I see that you're trying to cheat on your taxes. Would you like me to help?

--
Slashdot: Where people pretend to be twice as smart as they really are by behaving like children.
Re:Philosophical caveat by Anonymous Coward · 2005-02-23 15:03 · Score: 1, Interesting

Prove your brain is not just a Chinese Room. Prove that you actually understand something, that you're not just an unconscious simulation able to produce the correct output.

It cannot be done. All you can provide as evidence of your consciousness is the output of whatever goes on in your brain; but the whole premise of the Chinese Room assumes it is possible for a non-conscious process to duplicate that output. After all, if the output of conscious and non-conscious processes can be distinguished, then the Turing Test has not been passed by the non-conscious process.

Accordingly, since we cannot show that a human being can "understand" in any meaning distinct from the way the Chinese Room can "understand", we can either declare that we cannot tell the difference between things that can and cannot "understand" (in which case it is an act of faith, not reason, to declare a Chinese Room doesn't "understand"), or we can declare that the Chinese Room does "understand".

Either way, the distinction makes no difference, and therefore has no consequences, philosophical or otherwise. And Searle, by harping on a distinction that makes no difference, is indistinguishable from a fool.
Re:Philosophical caveat by idlake · 2005-02-23 15:04 · Score: 2, Insightful

You are right: this software does not understand language; it works out statistical correspondences, but it has no understanding of the physical correlates of words. That also means that it has intrinsic limitations.

Note also that such statistical approaches are nothing new, it's just that computers are finally getting powerful enough that people can use them.

None of that has anything to do with Searle. Searle wouldn't admit that the system understands language even if it knew things about the real world. Searle's argument is arbitrary and ad hoc. You are free to believe in it if you like, but sooner or later, you will have to defend your position against a computer that will insist that if you don't admit the possibility that it is self-aware and intelligent, well, it is just going to assert that you aren't either.
Re:Philosophical caveat by lakeland · 2005-02-23 15:05 · Score: 2, Insightful

NO! NO! NO! Not the Searle argument again. That guy is an absolute nutter and should be banned! Actually, on second thoughts, as long as I never have to hear his drivel again, I don't mind what happens to him.

His argument essentially boils down to: "The computer doesn't understand because all it does is manipulate symbols. Even if it does exactly the same steps as a human, the human understood and the computer was just being a mimic. Giving the computer a body wouldn't make it any less of a mimic".

The #1 flaw in his argument is that it would result in humans being classified as non-intelligent. He constantly spouts on about how machines aren't intelligent but then says "Except that the human understands what they are doing". I think that's a close contender to the Wookie defense for world's worst argument.
Re:Philosophical caveat by chickanmonkey · 2005-02-23 15:08 · Score: 1

Some of these philosophical arguments are kinda worthless if you think about it. Choose the right axioms(unproven asumptions that may or maynot be true) and you can prove practicaly anything you set out to prove. Philosophy is a dark art. When it comes down to it philosophy is not about truth but about convinsing people.
Anywho the problem with the Chineis room argument is it seperates the man in the room from the rule book. The argument than becomes equivalient to. Suppose you have a man with a neuron in him(equivalent to the man in the box in the original argument) and the neuron uses the rest of the neurons in the mans brain(the translation book) to translate Chineis charicters to english. Now no one would say that a simple neuron understands Chineis.
Re:Philosophical caveat by Coulson · 2005-02-23 15:22 · Score: 2, Insightful

Searle's Chinese Room argument is hogwash.

In his scenario, Searle claims that neither the people moving the Chienese tokens, nor the book of instructions telling them what to do "understands" what is being said. That is obviously true, but it misses the point. That's like saying that the neurons in your head don't understand what you are saying, and so neither do you.

The workers in the Chinese Room argument are just hardware. They're akin to neurons in the brain, or chips in a computer. They're blindly executing instructions (software) from a book and recording results on blank pages (working memory). No AI proponent is arguing that the chips in the computer "understand" anything. Chips just dumbly execute instructions. What's interesting is the combination of software and persistent memory, and the apparent conciousness that can arise therefrom.

Searle's argument must either be considered void, or one is compelled to admit that humans don't "understand" anything either. As such, it's hogwash.
Re:Philosophical caveat by chickanmonkey · 2005-02-23 15:53 · Score: 1

YEAH, hogwash.
Man I wish the catholic college I went to didn't teach me philosophy. I might still be catholic otherwise. But as a side effect I just get so pissed when I see someone spouting off nonsence, oversimplifying here, assuming that which is not certain there. Philosophers like to build these towers of logic. But mosly they end up building casles in the sky
Which is why systems that don't deal compleetly in logic are so interesting to the AI feild. With logic you need to know something certain to build off of. But that is NOT the way the real world works and that's not how humans navigate the world. Humans are full of logical contradictions. It dosn't bother us.
It's not about being perfect. It's about working well enough.
Re:Philosophical caveat by captwheeler · 2005-02-23 16:14 · Score: 1

When it comes down to it philosophy is not about truth but about convinsing people.
Your thinking of Rhetoric, not Philosophy.

--
Thanks for putting on the feedbag. Thanks for going all out. Thanks for showing me your Swiss Army knife.
Re:Philosophical caveat by captwheeler · 2005-02-23 16:23 · Score: 1

With logic you need to know something certain to build off of.
Like the language of 'and', 'or', 'not' and 'if' ? Philosophy of logic starts with uncertain language, and deals with real people making common errors, as well as formal axiomatic systems.
Know why Godel made his proof? Because he was certain that numbers were real things; the same as physical objects. Its not about "building castles in the sky."

--
Thanks for putting on the feedbag. Thanks for going all out. Thanks for showing me your Swiss Army knife.
Re:Philosophical caveat by captwheeler · 2005-02-23 16:41 · Score: 1

That's the systems reply. From wikipedia:
Although the individual in the Chinese room does not understand Chinese, perhaps the person and the room considered together as a system, do. Searle's reply to this is that someone might in principle memorise the rule book; they would then be able to interact as if they understood Chinese, but would still just be following a set of rules, with no understanding of the significance of the symbols they are manipulating.

--
Thanks for putting on the feedbag. Thanks for going all out. Thanks for showing me your Swiss Army knife.
Re:Philosophical caveat by Anonymous Coward · 2005-02-23 16:42 · Score: 1, Insightful

It's true that humor is based on understanding, and people generally don't claim that translation programs understand a language... But even a human who understands both languages usually can't translate a pun.

Also, a sufficiently complex probability based model of language should be able to distinguish between "Mom baked for three hours" and "The pie baked for three hours", since it will have 'pie' associated with 'bake' with a high probability of being its object, and with the proper rule would conclude that the change in order indicated that 'the pie baked' was a passive construction.
Re:Philosophical caveat by Paul+Freedman · 2005-02-23 16:48 · Score: 1

Sure you can prove it: the statement that you are an "unconscious" process miming conscious processes is a self-contradiction to begin with. The premise is that definitionally your consciousness of your syntactic manipulations is one level of hierarchy above syntactic manipulation. Basically the rejection of this proposition is a denial of qualitative awareness of sensory states apart from the binary representation of those states--if not a philosophical denial of qualities themselves: the color red, as a color. etc. Rejections of of Searle assume there that claiming no distinction between consciousness per se and "output" is the equivalent of stating there is no distinction between consciousness and the representation of output. Cats, after all, think, have consciousness, make decision, interact with the world--but don't manipulate symbols. The claim that cat's somehow engage "unconsciously" in the very computer processes that are supposedly producing "consciousness" in computers that do engage in them is not convincing. I was unaware, admittedly as an English major that any computer in the real world anywhere has passed a Turing Test, yes?
Re:Philosophical caveat by captwheeler · 2005-02-23 16:54 · Score: 1

You are free to believe in it if you like, but sooner or later, you will have to defend your position against a computer that will insist that if you don't admit the possibility that it is self-aware and intelligent, well, it is just going to assert that you aren't either.
heh. I insist that you acknowledge me as being a Penguin. (arguments don't work by insisting, they work by convincing.)
Searle's argument is arbitrary and ad hoc.
Thats a cheap shot. Searle spent years discussing his argument before he published, and years afterwards addressing objections.
None of that has anything to do with Searle.
True. This is pragmatic work, not the wild claims of AI reasearchers that Searle was answering.

--
Thanks for putting on the feedbag. Thanks for going all out. Thanks for showing me your Swiss Army knife.
Re:Philosophical caveat by captwheeler · 2005-02-23 17:23 · Score: 1

Does anybody understand the tax code? Why should software be any different?
Thats funny, but it misses the point. Knowing how to do a task is not the conscious understanding humans have. Saying software 'understands' may be a reasonable abbreviation, but its not accurate. (and that was the point of the original poster)

--
Thanks for putting on the feedbag. Thanks for going all out. Thanks for showing me your Swiss Army knife.
Re:Philosophical caveat by barawn · 2005-02-23 17:40 · Score: 1

Mom baked for three hours.
The pie baked for three hours.

"Mom" and "The pie" are the subjects. The verb and entire predicate are identical. Understanding the language disambiguates these sentences, but the ambiguity is part of what defines humor. ... except for the fact that "The pie baked for three hours" isn't good grammar. More properly, it's "the pie was baked for three hours". That's clarifies the statement because "was baked" is a passive construction.
Re:Philosophical caveat by Anonymous Coward · 2005-02-23 17:43 · Score: 0

Yes, these are silly questions to ask. Our brains might well be little more than statistical, probabilistic models based on correlations.

Obviously a computer does not work in exactly the same manner as a brain; also obviously, sometimes both will arrive at the same results. Whether or not we choose to use the same word to describe both processes is of no importance whatsoever.
Re:Philosophical caveat by barawn · 2005-02-23 17:43 · Score: 1

Of course, ignore the idiotic bad grammar and spelling (more properly, "that's" instead of "that").
Re:Philosophical caveat by rbanffy · 2005-02-23 17:55 · Score: 1

I know this has been said before, but did you ask a clever question by yourself or has your brain simulated a clever person by firing neural stimuli according to a partly inherited partly acquired pattern?

Speculating about the nature of understanding is futile if the real understanding can't be distinguished from a computer model by having a limited interaction with it. As long as we can't say what something is, it's hard to say what it isn't.

Machines will eventually become smarter than humans. It's not a question of if, but of when.

Right now, they may be smarter than humans who suffered severe brain damage or have other debilitating diseases. I will refrain from president jokes.

--
http://www.dieblinkenlights.com
Re:Philosophical caveat by idlake · 2005-02-23 18:22 · Score: 1

heh. I insist that you acknowledge me as being a Penguin. (arguments don't work by insisting, they work by convincing.)

It just shows how absurd Searle's point is: even if he were having a conversation with a computer, a conversation about philosophy, life, love, and all that, he'd not admit that the computer is intelligent just because it is made out of the wrong stuff.

Thats a cheap shot. Searle spent years discussing his argument before he published, and years afterwards addressing objections.

Many philosophers spend years on creating stupid, non-sensical arguments before publishing. It seems to be a professional affliction of that particular discipline.
Re:Philosophical caveat by evilmousse · 2005-02-23 18:34 · Score: 2, Insightful

that's exactly why i like my anime fansubbed instead of sanitized.
Re:Philosophical caveat by Anonymous Coward · 2005-02-23 19:05 · Score: 0

Well, what is human understanding? This is what we are trying to model using neural nets. Once it reaches a level of sophistication where a sophist can't tell the difference between a computer translation and a human translation, do we then have a sentient translation machine?
Re:Philosophical caveat by tgv · 2005-02-23 19:07 · Score: 2, Insightful

The translation depends on the semantic class of the subject (is the subject a potential baker -> use translation nr. 1, is the subject something that's usually being baked -> use translation nr. 2). So, ignoring other issues, this particular problem is easy to solve, it only takes a lot of work.

BTW, the fact that it isn't entirely grammatical has nothing to do with it (if we understand it, we can translate it, so any MT faces the same challenge).
Re:Philosophical caveat by Coulson · 2005-02-23 19:24 · Score: 1

I don't think Searle's response holds water. The number of people executing the instructions is meaningless. If one person memorizes the rulebook, all that that means is that the hardware has been centralized. If the person is still dumbly executing commands, they're just playing the part of the CPU, or the neurotransmitters.

If you were to read the person a story in Chinese, then later ask them questions in Chinese about it, they would be able to answer -- in Chinese. That's the nature of the Turing test. But they wouldn't be able to answer in English, precisely because the hardware "doesn't understand the significance of the symbols [it is] manipulating." But something does. The thing that speaks in Chinese. That is the system as a whole, the gestalt.

Don't get me started on the economy of Bolivia...
Re:Philosophical caveat by HalfFlat · 2005-02-23 19:30 · Score: 1

And when Searle does so reply, he's still making a level confusion error! It doesn't matter where you put the bits of the system, even if it happens to be inside the head of a human being. He is still confusing the system with its components, or the system with its container.
I can't believe Searle's argument gets as much respect as it does.
Re:Philosophical caveat by zalas · 2005-02-23 21:10 · Score: 1

Just make sure the fansub translator actually knows enough to tell you it's a pun, or else you'd never know.
Re:Philosophical caveat by Bugmaster · 2005-02-23 22:20 · Score: 1

Ok, then prove to me that I understand the language. How do you know I am not just a very clever emulator ? The Chiniese Room argument doesn't work (as you must know), because, while the man in the room may not understand Chinese, the room as a whole does. Similarly, while neither handlebars, nor pedals, nor gears (etc.) are in themselves a bike, when you put them all together, you get a bike. The whole is greater than the sum of its parts (or, at least, it's different). Ultimately, your brain is just a "statistical, probabilistic model based on correlations"; I don't see how a different medium for such a model (electronics vs squishy bits) would make a difference. As I see it, you can only differentiate brains from machines by positing the existence of a soul -- but that's a matter of faith, and can't be used to actually convince anybody.

--
>|<*:=
Re:Philosophical caveat by Bugmaster · 2005-02-23 22:24 · Score: 2, Informative

xcept for the fact that "The pie baked for three hours" isn't good grammar.
Why ? You could say, after all, "The pie baked normally for three hours in the oven, then it started to burn". It's acceptable grammar, but it's a confusing sentence on the semantic level.
The sentence "the pie was baked for three hours" differs in meaning, because it implies that someone was there, actively baking the pie.

--
>|<*:=
Re:Philosophical caveat by Bugmaster · 2005-02-23 22:29 · Score: 1

You can translate that 50 ways from Sunday but without understanding the language - understanding what makes those statements interesting - the machine will lose all their meaning.
Agreed, but I don't see how this proves that the machine can't understand the language in principle, though I grant you that none of our current machines are capable of this.

--
>|<*:=
Re:Philosophical caveat by Fred_A · 2005-02-23 23:07 · Score: 1

heh. I insist that you acknowledge me as being a Penguin.

So, um... do you type with your flippers or do you hunt and peck ?

--

May contain traces of nut.
Made from the freshest electrons.
Re:Philosophical caveat by farnz · 2005-02-23 23:19 · Score: 1

What is a quality? How do you know that a computer that can pass the Turing Test does not have the experience of qualia? What makes the statement that a human is an unconscious process miming conscious behaviour a self-contradiction, but doesn't make the statement that a sufficently good AI is an unconcious process miming conscious behaviour a self-contradiction? Unless you're invoking mysticism, in which case no AI will ever count as conscious, you're not proving your case at all.
Rejections of Searle don't assume that there is no distinction between consciousness and the representation of output; we state that if an entity processes data in such a way that it appears to understand that data, then it does understand that data. There is no mystical quantity in understanding from this point of view; a machine can have a partial understanding of the data it sees.
We then follow on to define a conscious machine as one which can increase its understanding of any input without external changes (basically, any system which can learn from arbitrary input, and request additional input to help it understand).
By this definition, a cat is conscious; it can learn from its mistakes, and can do things with the intent of learning. A billing system isn't conscious; it understands enough about billing people to create bills, but no more, and even if it can learn from its input (this is a bad debtor), it's incapable of requesting extra information to allow it to learn more (e.g. how many children do my debtors have?).
The advantage of this view is that it is fully defined in terms of the inputs and outputs of the system; we can apply it to a human or an animal (a system whose construction is poorly understood), and see if it matches our views of what is and is not conscious. It can then be applied to a machine, and if the machine matches, it's effectively conscious. Searle's claim leaves us with no way to verify its truth value (as he claims that consciousness is a function of how a system is constructed), and no way to test whether a given machine is conscious.

--
I appear to have a blog. Odd.
Re:Philosophical caveat by ralphclark · 2005-02-24 01:41 · Score: 1

Give me a break - the Chinese Room thought experiment is completely bogus; Searle's conclusion is nothing but a straw man.

He maintains that there is no identifiable component of the system where understanding can be found. But the understanding actually resides in the system as a whole, which is best understood as an abstract entity merely implemented using the physical hardware of a room, a series of translation lookups and a mechanism (in this case incidentally human) for doing the lookups and handling the I/O.

Searle's argument is sheer hypocrisy because the same argument could be used (just as fallaciously) to demonstrate that understanding can't possibly reside in the human brain, or any other complex computation device where symbolic representation resides in a higher level of abstraction than the physical implementation.

As to your comparison with tax software, this might be somewhat spurious as there is surely difference in degree. A linguistic system which encompasses a significant portion of human knowledge will encode much more semantic content than a tax software program. Orders of magnitude more. So your analogy is tantamount to comparing the intelligence of a human with the intelligence of a cockroach.

As for Dennett, I am down with that; Dennett pretty much rules. But AFAIK he has not provided any support for the Chinese Room argument, and if he did he would be wrong anyway.
Re:Philosophical caveat by MickLinux · 2005-02-24 02:08 · Score: 1

A man walked into a bar. Ouch!
Actually, sometimes these puns can be successfully translated across languages, in one of two ways.
One, you substitute a word pair that still gives the same relationship, though a different object. Go to your bible's apocrypha, and read the story of Daniel and Susannah . The proper translation uses "oak" and "mastic" trees, but gives no sense of the pun. So some translations substitute "yew/hew" and "clove/cleave", while noting the original words in the notes. Which translation is more faithful? Arguably the one that maintains the pun, but especially if the original form is noted.
There is another way, as well. Often our puns are based upon words that really do have a relationship to each other, and those relationships are then carried across national boundaries. Further, the languages tend to be interrelated due to commerce and common origin. As a result, you can sometimes directly translate a pun by carefully picking the right word. A skillful translator will pick this up, and therefore make a better translation than a less skillful translator.

--
Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's
Re:Philosophical caveat by quisph · 2005-02-24 02:39 · Score: 1

You can translate that 50 ways from Sunday but without understanding the language - understanding what makes those statements interesting - the machine will lose all their meaning.
And spare us from bad puns? Bonus!
Re:Philosophical caveat by compling · 2005-02-24 02:45 · Score: 1

"Asking if computers can think is like asking if submarines can swim."
Re:Philosophical caveat by filmmaker · 2005-02-24 02:55 · Score: 1

Nice post.

Both of those authors are found in a paper I wrote 5 years ago called "Symbols, Meaning and Computation" as a mathematics undergrad. URL: http://www.joeldalley.com/cognitive-science/symbol s_computation_and_meaning.pdf

--
I Want To Believe
Re:Philosophical caveat by Yogs · 2005-02-24 04:05 · Score: 1

As a former #$HumanCyclist, I can tell you those first two examples are exactly the sort of thing we like to point to. I was not involved in natural language understanding while I worked there (and that was over 4 years ago now), so I'm in absolutely no position whatsoever to comment on where Cyc is these days. Nonetheless, it would be interesting to see what comes with OpenCyc, or if something like what these folks have developed could be supplemented by something that has a way of representing semantics. Of course, they're trying to sell what they do, so we probably won't ever find out unless they have the insight to look at Cyc (or ~maybe~ another system). I'm not optimistic. Fuzzy and not fuzzy systems are intrinsically difficult to make work together, and neither are known for being speedy. It's also much more comfortable to adopt the mindset that one approach is simply better than the other, than go way offroad trying to reconcile the two, because you know, even if you're a genius, you'll get some really bizarre results until you get a better handle on some of the practicle issues.
Re:Philosophical caveat by evilmousse · 2005-02-24 06:08 · Score: 1

that's exactly what i mean, they'll make the effort to explain that. usually a more literal translation in the subs, and an additional supertitle explaining the pun.
Re:Philosophical caveat by captwheeler · 2005-02-24 06:23 · Score: 1

I don't think Searle's response holds water.
Fair enough. You don't have to agree, but I think its a better argument then it gets credit for, so...
That's the nature of the Turing test.
The point of the argument is that passing the Turing test does not prove consciousness. If it walks like a duck (etc...) it may be a duck, or it may be a digital movie.
But something does.
Insisting on the point in question doesn't work: we know humans understand (being human ourselves) but don't know which other things are conscious. We generally agree that animals are, but why computers? Because of how they behave is giving the Turing test answer, but its not proof.

--
Thanks for putting on the feedbag. Thanks for going all out. Thanks for showing me your Swiss Army knife.
Re:Philosophical caveat by captwheeler · 2005-02-24 06:28 · Score: 1

heh. I insist that you acknowledge me as being a Penguin.
So, um... do you type with your flippers or do you hunt and peck ?

Wait... so you really think I'm a penguin? I guess insisting is more powerful then I thought.

ps - i use flippers

--
Thanks for putting on the feedbag. Thanks for going all out. Thanks for showing me your Swiss Army knife.
Re:Philosophical caveat by captwheeler · 2005-02-24 06:41 · Score: 1

...even if he were having a conversation with a computer
Passing the Turing test does not qualify as proof of consciousness -- thats one of the points of the argument.
...he'd not admit that the computer is intelligent just because it is made out of the wrong stuff.
Since we know humans are conscious, and generally accept that animals are, he thinks the 'stuff' might be important. Is that so outrageous? The Chinese room argument does not premise that the right 'stuff' is needed, and that Searle goes on to try and argue that point in other papers is no argument against this.
Many philosophers spend years on creating stupid, non-sensical arguments before publishing. It seems to be a professional affliction of that particular discipline.
sure. no one writes bad software, right? there are wastes of labor in every difficult endeavor.

--
Thanks for putting on the feedbag. Thanks for going all out. Thanks for showing me your Swiss Army knife.
Re:Philosophical caveat by jsebrech · 2005-02-24 07:59 · Score: 1

As a caveat, we should be wary of saying the system "understands" a language.

I would say generally that humans able to translate between languages generally understand both languages

No they don't. Most people don't consciously know the grammar of a language. Just ask someone to exhaustively list all possible verb conjugations in english. Most people wouldn't know what to answer to that. We don't learn language through rules, we learn it by example. That's why we forget most of the rules of grammar after we leave school, and why real world speech never entirely obeys the "perfect" rules of grammar. It's all inherently probabilistic.

That's why I tihnk the chinese room does not actually discount the validity of the turing test at all. I think the human mind is nothing more than a set of probabilistic rules and stored patterns to apply those rules to, and that we have fooled ourselves into believing we're more than that. "Meaning" is nothing more than knowing how a certain pattern fits in with the other patterns you know, and what you can infer from its presence given a certain context. Anyone who disagrees with me I dare to answer the simple question "what is an apple?" in a way that includes all apples, and does not include anything that is not an apple. It is absolutely impossible to do such a thing, because the concept apple is not something we consciously know about how it operates in our mind. We have the rules for how to recognize an apple, but we don't have the conscious knowledge of what those rules are, even though we can make guesses at what those rules might be by bringing up "typical traits" of apples and seeing how they can fit together. After all, activating the pattern "apple" in our mind activates those patterns that are related to it, and by doing that we immediately know in the active part of our mind which traits are most likely associated to the concept of apple, but we have no idea how they are associated.

Or try this thought experiment. What if you built a machine that passed the turing test. Let's call it the fraudulent intelligence. Let's say you knew exactly how it worked. Now suppose that this machine had logically concluded from real world observation that it was self-conscious and intelligent and communicated this to you. How would you persuade it that it wasn't? Or take it one step further. What if it had concluded humans weren't really intelligent, but the machine itself was. How would you convince it you're actually intelligent, and that it isn't?

Anyway, my point is, don't diss this translating system because you understand how it works and therefore it cannot be truly intelligent. If you understood how the human mind worked, I dare say you would find it equally unintelligent.
Re:Philosophical caveat by idlake · 2005-02-24 16:02 · Score: 1

Passing the Turing test does not qualify as proof of consciousness -- thats one of the points of the argument.

I wasn't making a point about Turing tests, I was saying that Searle's naive "oh no it can't be" arguments can be used to "show" that his wife, or he himself, for that matter, don't have consciousness.

he thinks the 'stuff' might be important. Is that so outrageous?

So far, there isn't even a hint that the stuff an intelligent system is made of may be important. None.

Since we know humans are conscious,

Yes: they walk around, talk about themselves, and have a sense of self. And there is no reason why a robot can't do the same thing.

I'm sorry, but Searle just does not know what he is talking about. His "syntax/semantics" distinction is bogus, as is pretty much everything else he has to say on the subject. I think the only reason he is so well known is because he tells people what they want to hear, that they are special. Searle's b.s. isn't as good as priests promising them an immortal soul, but it's better than nothing as these things go.
Re:Philosophical caveat by Paul+Freedman · 2005-02-24 18:22 · Score: 1

My initial objection (or the its object) may have been too cute and hasty--it was not head-on to the generalized formulation that AI might be a system defined as unconscious taking its components separably but nevertheless conscious as an ensemble but the translation (transformation) of that statement into the supposition that we are unconscious beings who only think we are conscious--if you were to say or the statement had intended the less paradoxical assertion that human beings are ensembles of processes analytically "non-conscious" if taken separably but "conscious" collectively I would concede that a physical science of mind more or less has to proceed on that basis (or do what?). Androids conceivably dream of musky electric sheep, in color.

While a color or quality as a quality is not representable per se--it cannot be duplicated through representation but only referenced (or caused to be duplicated in sensory awareness by duplicating the physical ground state)--I admit that I cannot preclude the possibility of automatic duplication in the consciousness of a posited self-aware AI device of qualia through appropriate duplication of the physical ground state, without engaging in circular logic or tautalogical definition. But I am not convinced that the tautology is not valid. Nevertheless, I see the objection to the tautology (mute components, mute ensembles).

The AI environment, unless I am mistaken, in which Searle forwarded his paradox/rasberry was precisely advocacy of traditional rule-based symbol processers--guys were drawing linguistic parsers and contextual trees of semantic relationships as if the organizing thinking they brought to the programs was embedded immanently in the program itself--as if the success of computational programs in generating axioms manipulating symbolic logic counters was equivalent to computational programs "doing math": programs manipulating (literally, at the level of the program language) words--even if the initial construction of the program required a prior understanding of the semantic (real-word-environment) set-and-setting of the verbal counters by the programmers--must, by this assumption, eventually, if not definitionally, "do thinking".

Searle's strongest argument is that no possible concatentation of LISP-type linguistic analyses executed solely in an encapsulated and hermetic symbolic environment are ever going to think about anything.

But developed this far, yes, this remains only an inferred and back-door argument against the possibility that a computational operation per se is inherently unscalable to AI.

Motivations to skepticism: a) mysticism (Richard Dawkins doesn't convince me either--I am a believing monotheist); b) historical temptation of AI advocates to flip the parent-child authorship of the programmer to program to infer that the parent is theoretically an ensemble of children that, in practice, are clearly impossible without the prior existence of the parent--see also a)

Google definitely would buy into this... by egyber · 2005-02-23 14:03 · Score: 5, Interesting

Don't remember exactly where I read this, but google apparently has long believed that there is enough data on the internet alone to be able to intelligently translate... What these guys claim to have done is, it would seem, the missing peace of the puzzle for google. I wouldn't be surprised if google gets in on this.

Re:Google definitely would buy into this... by MikeFM · 2005-02-23 14:17 · Score: 2, Interesting

I tried that in about 1997. It did work pretty well but the biggest problem was the limitation of having copies of the same document in different languages. There are quite a few but they were dwarfed by the amount of single-language documents. Also the fact is that most text on the Internet is written the way that I write - badly. This can lead to translations that are written the way real people write which can be good for conversational bots but which is probably bad for translation software.

Some of the more interesting things about these bots of mine were that they weren't programmed to translate but they learned to do so anyway. If you spoke to them in English they might respond in French or German but the response would be correct. That was really a very surprising finding.

I expect that these guys have built a much more robust dictionary and that their algorithms are worked out better than mine were. They probably have taken texts off the Internet to train their dictionary but I doubt they'd want to submit random findings off the Internet.

I'd like to see what they could come up with for simplifying language. Take some source documents written in full geek jargon and take the same documents rewritten to be for the lay person. Train the program on that. Then us geeks could translate our docs into stuff normal people could read. THAT I'd buy.

I wonder if it'd be good enough to learn to translate source code into English or even into other programming languages? It'd seem that the same abilities would apply to this task.

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
Re:Google definitely would buy into this... by svvampy · 2005-02-23 19:03 · Score: 1

So when Google gains sentience will it's policy of doing no evil protect us in a similar fashion to the three laws?
Re:Google definitely would buy into this... by Vintermann · 2005-02-23 19:45 · Score: 1

Single language documents can still be of great value for probabilistic translation. They can for instance teach the program that it's very unlikely that an english sentence ends in "and", that the construction "that it that" is very unlikely, and that "extremely not" is less likely than "not extremely".

I found those examples by translating this text back and forth with babelfish a couple of times.

--
xkcd is not in the sudoers file. This incident will be reported.
Re:Google definitely would buy into this... by hughk · 2005-02-23 20:55 · Score: 1

All you have to do is to hit something like The EU's web portal. EVen back then you had an extensive library of regulations on just about anything in a bunch of languages up on the web. The translations are supposed to be good quality (EU translators aren't badly paid).

--
See my journal, I write things there
Re:Google definitely would buy into this... by Anonymous Coward · 2005-02-23 21:54 · Score: 0

Google did buy into this. A leading researcher in probabilisitic translation who did a PhD in it, made headlines with a one-month-only creation of a translation model of an aribtrarily chosen langauge (I think it was chinese or hindi) and then got hired at Google. Google for it!

The guy's name is Dr. Och, really.
Re:Google definitely would buy into this... by ralphclark · 2005-02-24 01:50 · Score: 1

they weren't programmed to translate but they learned to do so anyway. If you spoke to them in English they might respond in French or German but the response would be correct.
Were they really translating _from_ one language into _another_, or were they just encoding phrases in multiple languages as equivalent responses to a given stimulus, with that stimulus also perhaps being triggered by equivalent phrases in multiple languages?

But then again, is there any difference?
Re:Google definitely would buy into this... by quisph · 2005-02-24 02:55 · Score: 1

google apparently has long believed that there is enough data on the internet alone to be able to intelligently translate... What these guys claim to have done is, it would seem, the missing peace of the puzzle for google. I wouldn't be surprised if google gets in on this.
So the Google translation of, say, this Napoleon quotation:
Les hommes de génie sont des météores destinés à brûler pour éclairer leur siècle.
Would go from this:
The men of genius are meteors intended to burn to clarify their century.
To this:
m3n pof genisu are meteros destined to burn in odar to illum1nate the1r tim3!!!!!!!!!!!!!!!!!1
Re:Google definitely would buy into this... by MikeFM · 2005-02-24 04:13 · Score: 1

Good point. I still wouldn't suggest training a program with EVERYTHING that comes off the Internet though. Or you are likely to train it that it's okay for English sentences to end in "and". Oops, there I just ended one that way. ;)

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
Re:Google definitely would buy into this... by MikeFM · 2005-02-24 04:32 · Score: 1

They probably were encoding the same phrases in equivalent responses in different languages but in practice that worked out to be very similar to translating. Enough to be interesting anyway. :)

They were copying what they'd sampled humans doing.. changing back and forth between languages. After sampling a large enough bit of data they treated the languages as interchangable but they would tend to output sentences in a single language as the word connections were stronger than those between similar words in different languages. I'm assuming this is because humans usually complete sentences in single languages.

The database got to be pretty huge but the program was pretty good at it's responses. It could fool most people into thinking it was a human for a couple hours. If I turned on some output tweaking features that'd force it to rework it's sentence structures (even if it said the same thing it'd mae it sound different) and add typos then it'd usually keep people going for five or six hours. The huge database was really the thing that made the difference. People don't expect programs to know about practically everything so if they talk to someone online and keep getting back well-formed and educated responses they assume there is a person in there. (I also cheated and let the bot look up data from Google as required.. so the user directed the bot's learning without knowing it.)

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
Re:Google definitely would buy into this... by fieldmethods · 2005-02-24 05:11 · Score: 1

There was an episode of 60 Minutes about Google that addresses this: Defining Google.
And that's crucial: as well-fed and casual as they may look, the folks at Google are intense, burn-the-midnight-flourescent workaholics, all trying to come up with Google's "next big thing."

Google engineer Alan Eustace explains, "One of the ideas that we're working on is machine translation. We strongly believe that there's enough data on the Web and in the world right now to allow us to automatically translate from one language to another."
I have a sinking feeling that once Google starts doing this, they'll blow Language Weaver, Systran, and everyone else on the planet out of the water, because they have the most parallel text to train on.
Re:Google definitely would buy into this... by ralphclark · 2005-02-24 07:49 · Score: 1

I've seen a few examples of this type of bot around, but usually they just spout humorous gibberish. Yours sounds like it was actually useful (i.e, as interactive interface to a knowledge base). Did it ever get put to work "outside the lab"?
Re:Google definitely would buy into this... by MikeFM · 2005-02-24 10:02 · Score: 1

It ran for some time on various websites, muds, etc and I saw some adaptions of it that were even more useful as they were targetted to more precise knowledge domains (the bot had a scripting language built-in that allowed you to control many of it's features). One simple example would find the weather report for your area based on what you asked it. Not extremely fancy but interesting.

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
Re:Google definitely would buy into this... by jthayden · 2005-02-25 03:39 · Score: 1

but google apparently has long believed that there is enough data on the internet alone to be able to intelligently translate...
Good God! Don't point that thing at /. ! It will never know the correct usage of its vs. it's or your vs. you're.

Translating specialised texts ... by rkmath · 2005-02-23 14:03 · Score: 4, Insightful

The article (and the text of the orginial posting) makes it seem like translating a specialized technical text is somehow harder than translating, say, a newspaper article. As someone experienced in translating technical (science/engineering) documents, I can say that any tech document is far _easier_ to translate after an initial learning curve.

The main reason (I think) is that: tech documents have specialised vocabulary and idioms, but these are much fewer than the idioms one has to master in order to understand the editorial page in a newspaper.

With a rudimentary knowledge of Russian and French, I have found it much easier to read an engineering textbook or paper in these languages, than reading any nontechnical text. (This is not necessarily the case with other languages. Any document in Japanese for instance is an entirely different ballgame ...)

Re:Translating specialised texts ... by Anonymous Coward · 2005-02-23 14:08 · Score: 4, Informative

The article (and the text of the orginial posting) makes it seem like translating a specialized technical text is somehow harder than translating, say, a newspaper article. As someone experienced in translating technical (science/engineering) documents, I can say that any tech document is far _easier_ to translate after an initial learning curve.

Of course that is true, for a human translator. Your knowledge of the technical field itself is a resource you can use to aid in your translation of technical texts. For machines, it's usually necessary to use a translator specifically geared to the subject matter. For instance, you would definitely want to use a different machine translator for a newspaper article as opposed to a biomedical research journal.

This new approach is supposed to mitigate these problems. If they can do a good job of it, they may be able to bring machine translation to areas where previously human translators have been required or greatly preferred.
Re:Translating specialised texts ... by Anonymous Coward · 2005-02-23 14:31 · Score: 0

Did someone say... idiom?

- Sir Lancelot
Re:Translating specialised texts ... by Beryllium+Sphere(tm) · 2005-02-23 15:55 · Score: 2, Insightful

Absolutely true. One of Beryllium Sphere's partners is a computational linguist. For quite a while her bread and butter was building representations of knowledge about heavy equipment maintenance to support automatic translation of Caterpillar Tractor technical manuals.

There's more to help you than just the specialized vocabulary. It's good that "crankshaft" is unambiguous but it also helps to know in advance that "bolt" will be a noun and not a verb.

Also, to be blunt, nobody expects technical prose to sound as good as normal prose. In fact they're happy if it's just functional. A fluent French speaker thought the automatic translation output was a bit stilted, but the first time one French mechanic read it he jumped up and down for joy and said "If only we'd had this earlier, I wouldn't have almost lost my arm!".
Re:Translating specialised texts ... by lahvak · 2005-02-23 19:54 · Score: 1

I agree. While ago, I did a little bit of work on a system that was suposed to translate computer manuals from Czech to Russian. I remember it did fairly good job translating technical manuals, but even though it was designed to be quite general translating engine, when we tested it with non-technical texts, the results were often less than satisfactory. Vocabulary was definitely part of it.

--
AccountKiller
Re:Translating specialised texts ... by matyas47 · 2005-02-24 02:31 · Score: 1

Technical, scholarly, or scientific literature frequently does end up being easier to read in or translate from a foreign language than fiction or poetry, or sometimes even newspapers. Part of the reason is the international language of much technical terminology: terms of Latin or Greek origin tend to be pretty similar across the major European languages. Another is the fact that the writing style deliberately favors clarity and transparency, sometimes to the detriment of linguistic subtlety and grace.
Re:Translating specialised texts ... by Godwin+O'Hitler · 2005-02-24 02:45 · Score: 1

As a full time freelance translator I can tell you that ease of translation is directly related to the quality and clarity of the original document and has little to do with its content. If the document is by a professional writer, it's usually fine. If it's written by a manager, it's usually verbose, ambiguous, clumsy, packed full of bad grammar, and in fact just plain bad.

If managers could be taught just one rule to improve their writing skills, it would improve matters a hundred fold: short sentences!

--
No, your children are not the special ones. Nor are your pets.

Only as smart as . . . by JJ · 2005-02-23 14:05 · Score: 1

the texts it has worked on. If all you give this programme is a steady diet of weather reports to translate and learn from, it will make everything else sound like a weather report. Most contexts employ similiar words with a significant contextual meaning to them. 'Pea-soup' means very different things in weather reports and cooking recipes.

--
So long and thanks for all the fish . . . !!!

UGH by Anonymous Coward · 2005-02-23 14:05 · Score: 0

There's a reason why I'm double-majoring in CS and Japanese with a minor in business.

It's not to graduate right behind something that'll put me out of work before I even get started.

OUTPUT: w007! by FunWithHeadlines · 2005-02-23 14:05 · Score: 1

/nt

Huzzah! by Tzarius · 2005-02-23 14:06 · Score: 2, Funny

Now my Bayesian mail filter can translate spam to english before it's read!

To and from English... by PHAEDRU5 · 2005-02-23 14:06 · Score: 0, Offtopic

I'm afraid the French won't like this. (Sorry!)

Anyway, I think it's kind of interesting that English is the "canonical" language. Eventually we'll have software to transform documents in any lanugage to English, so they can be translated to some other language.

English! Is there anything it can't do?

--
668: Neighbour of the Beast

Re:To and from English... by Anonymous Coward · 2005-02-23 19:08 · Score: 0

Actually, the West European languages are so tightly interrelated, that any one can serve equally well as a canonical language. If you know any one W European language, it is not difficult to learn a second one and once you know two, the third one is easy.

DadaDodo by Tripax · 2005-02-23 14:06 · Score: 4, Informative

This reminda me of Jamie Zawinskies hack Dadadodo which used probability trees to create new texts from old texts by examining the probability any given word follows the previous word/string of words. I always thought his program was cool, in that his description of it involved Markov Chains and William S. Burroughs.

Microsoft Research already does this by drdink · 2005-02-23 14:07 · Score: 4, Informative

I did a presentation for an AI class a while ago and discovered that Microsoft already does this with their MSR-MT project. Apparently the Spanish entries in their Knowledge Base were translated by this as well.

--
Beware, Nugget is watching... See?

Re:Microsoft Research already does this by Anonymous Coward · 2005-02-23 15:18 · Score: 0

Um... have you ever READ that KB? It's horrible. Even worse than you'd expect.

I can use many flavors of MT to get a gloss on a document. Sometimes, that gloss is very useful. Occasionally, that gloss requires only small fixes (done by a professional in-language editor) before it can be used as a translation.

HOWEVER, we native-English-speaking nerds are quite accustomed to seeing our native language abused, malformed, and the like. Even if Babelfish is teh suxor (sytran, the trados MT stuff, the SDLX MT stuff, ad infinitum), it's still useful. HOWEVER, if I wanted a translation to be useful to any non-nerds, I'd use a human translator, with a human editor. Or, at least a human editor. It's not a question of "is it comprehensible;" comprehensible-yet-horribly offensive documents are easy to make with MT or CAT tools. I'd prefer to NOT offend my potential business partners by, for example, referring to my home as the Prefecture of Oregon (Hey, I didn't know we were included in the Co-Prosperity Sphere!), or by duplicating the tendency in Japanese to leave some phrases unsaid with ellipsis abuse.

MT-on-demand will always suck, at least until there is some sort of revolution in the technology. This ain't it. This looks like it will generate a gloss superior to other currently available MTs. It still isn't a good translation. So far as I know, the computational linguists at the forefront of MT research still tend to buy Chomsky's deep-structures song-and-dance. Until they drop that steaming pile of used hay, MT will never get off the ground as a reliable technology.
Re:Microsoft Research already does this by The+Ape+With+No+Name · 2005-02-23 16:11 · Score: 1

OK. But can they do it on Slovene? Or what about Ruthenian? How about Creek? No, wait. The Canary Island whistling language? Spanish has billions of lines of human translation. A language like Slovene (mine) has much, much fewer. Just a thought.

--
Comparing it to Windows will be a moot point, since El Dorado is going to have a 40% larger code base than XP.
Re:Microsoft Research already does this by ajp · 2005-02-23 16:15 · Score: 1

*That* KB? WTF? It's not like they translated one KB. They translated a mess of them. And the click-through rate from Spanish to English is in the low 30% range (I asked a MS researcher.) This implies that most readers understood the translated KB.

Have YOU read "that" KB, AC? Or are you just blowing smoke out of your ass?
Re:Microsoft Research already does this by Anonymous Coward · 2005-02-23 17:23 · Score: 0

How dare you question someone with a two-digit UID.

Shame!

Arabic to English by Caseyscrib · 2005-02-23 14:12 · Score: 4, Interesting

I'd like to see an arabic-to-english translator. I was interested in reading news from the middle east, because I don't particularly trust our media to translate it properly. A good example of this is Bin Laden's transcript.

After a quick web search, all I was able to find was this site, which has a pretty sketchy TOS agreement.

Re:Arabic to English by cmburns69 · 2005-02-23 15:20 · Score: 1

You trust the Aljazeera version more than the CNN version?

I'm not saying you should trust CNN more or less than Aljazeera, but they both have agendas. Put on your tinfoil anti-bias hat before reading either translation.

However, a good program that could translate could be a great help when in situations like this.

--
Online Starcraft RPG? At
Dietary fiber is like asynchronous IO-- Non-blocking!
Re:Arabic to English by Anonymous Coward · 2005-02-23 15:38 · Score: 0

I used the sakhr.com translation service back during the Iraq invasion to translate al-Jazeera. (Back then, it was a free service; then so many people started using it that they began charging. I was very impressed by its quality. It had the best machine translation I've seen. (Which admittedly isn't saying much, because I haven't seen much beyond Babelfish.) Almost everything it translated was perfectly grammatical, with very few untranslated words.
Re:Arabic to English by Caseyscrib · 2005-02-23 16:00 · Score: 2, Informative

I never said I trusted either source. But when you can read Arabic propaganda and contrast it with your own media's propaganda, it helps you to understand what the underlying causes for war are. It is also key to recognizing the true aggressor, because in every war both governments play the "good guys" role to their citizens. Direct translation helps you to understand the culture of your enemy. Things as simple as webpage advertisements, editorials, personals, etc, are lost in translation by CNN and the other alphabet news networks.
Re:Arabic to English by majid_aldo · 2005-02-23 18:37 · Score: 1

concerning middle east coverage:

People who complain about CNN:
arabs for their 'perceived' US slant.

People who complain about aljazeera:
US admin
americans who think it's a propaganda machine
Israel
Palestinian authority
some Arabs
Iraqi interm gov. as well as Saddam's iraq (!)
the following arab nations: saudi arabia, kuwait, jordan

aljazeera: the only free news channel in the middle east.

--
--- widget evolution: enhanced, plus, super, ultra, extreme, exxxtreme, ultra-extreme, ..etc.
Re:Arabic to English by Anonymous Coward · 2005-02-23 19:44 · Score: 2, Informative

Go to http://www.systransoft.com, choose Arabic to English
Re:Arabic to English by TummyX · 2005-02-23 20:17 · Score: 1

recognizing the true aggressor

Perhaps it would help to recognise that there is no true aggressor.
Re:Arabic to English by wdebruij · 2005-02-23 22:10 · Score: 1

why not just read the english version of al jazeera?
Re:Arabic to English by bhima · 2005-02-23 22:19 · Score: 1

I think you need to add a lot more dissenters to your list there... Both of those news outlets and "Faux News" as well, are spun so hard I get dizzy reading them.

--
Nothing in the world is more dangerous than sincere ignorance and conscientious stupidity.
Re:Arabic to English by Analogy+Man · 2005-02-24 04:07 · Score: 1

It would be interesting to see if the US Gov latches onto this technology. They are backlogged with translation activities in our Homeland Security activities and are taking the approach of training WASP folks to speak and translate languages (Arabic, Pashtun...) with many subtle and culturally colorful elements rather than seeking the many immigrants in this country already fluent in these languages.

There are elements on the inside that would rather send them all off to internment camps rather than embracing their cultural heritage and using the diverse US melting pot to make us more resilient and intelligent in our understanding of our friends and enemies around the world.

--
When the people fear their government, there is tyranny; when the government fears the people, there is liberty.

Dragon Naturally Speaking by headkase · 2005-02-23 14:12 · Score: 3, Interesting

Using statistical methods to predict the next item in a sequence is still not true hard ai though, this technique is used with the voice recognition software "Dragon Natually Speaking" creating in effect pattern chains. What Dragon did on the character level this software appears to do on the word level. This is still not true AI however, as the statistics will only map to probabilistic sequences not abstractly map instead to the concepts. What would really impress me is if they came up with a mapping algorithm that instead of using probability used a function like mini-max fitness testing on a neural-network substrate.
It would be interesting to see the results of analysing large sections of languages however, but the only immediate use I can fathom for this would be for cryptography or information compression algorithms. However the results could probably be used to provide insight into how languages evolve or how memes spread from language to language.
Or the brief explanation in the article did not make it clear enough how this differs from what was previously state-of-the-art, e.g. Dragon.

--
Shh.

Re:Dragon Naturally Speaking by Anonymous Coward · 2005-02-23 17:17 · Score: 1, Interesting

The Berger-Liaw speech recognition system has been found to be not only much better at detecting speech than other systems, it is also very much better than people at understanding speech. In very noisy environments where the noise is 1000 time as loud as the speech, the system was accurate at a rate of 75%. 1000 test subjects who had 'normal' hearing (no hearing aids or auditory problems) could only get a collective recognition rate of 15%. The recognition system was five times as accurate as the human population. Mind you, the US millitary (particularly the Navy Submarine Fleet) picked up the tech Hey! You just can't kick my door like.... You're hurting my arm sir! You're hurting my arm sir!!

Petorian by rzebram · 2005-02-23 14:13 · Score: 1

Excellent! Soon we will be able to translate things into Petorian, in order to better understand Peter Griffin! More beer, you slappywag!

Re:Universal Translator by ari_j · 2005-02-23 14:15 · Score: 2, Informative

I was thinking the same thing - I don't have time to investigate how it works, but if you created one that translated symbolically-represented phonemes (languages other than Germanic and Eastern probably know this concept as "spelling") you'd have a pretty good system going. From the article lead-in here on Slashdot, it sounds as if it will take the basic rules of a language and maybe some "seed" data, and from there learn by comparing text in language A and language B that have the same meaning.

so how can they grade you in school? by cheekyboy · 2005-02-23 14:16 · Score: 3, Insightful

One has to wonder if the language of choice English or whatever is so structured and rule ridden and not just made up on the fly. Then how come its so difficult to determine all the rules? Is it there are too many of them? too many contexes? Or just trying to translate bad grammer which fails the rules but any human can decipher it.

Sometimes brute force, ie look up tables for 100000000 translated versions can be better, so much for logic eh :-)

--
Liberty freedom are no1, not dicks in suits.

Re:so how can they grade you in school? by mikael · 2005-02-24 02:51 · Score: 1

English language evolved without any formal regulating body unlike the French or Spanish languages. More details can be found under "English Orthography".

--
Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads

Time flies like an arrow... by Secret+Agent+99 · 2005-02-23 14:17 · Score: 5, Funny

...and fruit flies like a banana.

When an automated translator can handle that one without bursting into flames, I'll start to believe.

Re:Time flies like an arrow... by ebyrob · 2005-02-23 14:45 · Score: 1

If it has seen it translated by a human somewhere else before and remembers it... why should it have a problem?

Or are you expecting a computer to solve a problem even a human can't handle?
Re:Time flies like an arrow... by pchan- · 2005-02-23 15:05 · Score: 1

Or are you expecting a computer to solve a problem even a human can't handle?

I can handle this problem easily, but would be impressed if you found a computer program that would do the same.

Time flies like an arrow and fruit flies like a banana.

See, this sentence is grammatically correct in two ways, but one of them is not logical.

a computer would likely produce this:
TIME (flies) (as) AN ARROW and FRUIT (flies) (as) A BANANA.

when you really want:
TIME (flies) (as) AN ARROW and FRUIT FLIES (favor) A BANANA
Re:Time flies like an arrow... by ddent · 2005-02-23 15:29 · Score: 1

Well until I read your explanation the sentence didn't seem logical to me. I assure you, I am not an automaton.

We have this thing in english called parallelism. It is expected that when you use it you do so in a way that aids the reader's understanding.

Furthermore, a more grammatically correct way (and for more readable) to say that would be "fruit flies like bananas", or better yet, given the ambiguity you point out, "fruit flies enjoy bananas".

I would love for you to provide a better example though - and although they doubtlessly exist, I've no doubt they can be accomodated for.

--
SSL Certificate
Re:Time flies like an arrow... by Anonymous Coward · 2005-02-23 15:32 · Score: 0

Well, this phrase is a joke, a play on word. There is no way to translate it in a different language, even for a human.
Re:Time flies like an arrow... by CosmeticLobotamy · 2005-02-23 15:44 · Score: 1

The sentence is correct in English, and the words have counterparts in the other language (I would assume). It's possible to translate it. A human translator will probably do it correctly. Whether or not it's the least confusing way to say something is unimportant. If you're limiting what you can say for the ease of computer-based translation, then we might as well all learn some intermediate language with no ambiguity that a computer can translate trivially, 'cause that theoretical translator ain't good enough yet.

This translator, though, might actually be able to do it, if it checks for large matching strings before checking for small ones.
Re:Time flies like an arrow... by qbwiz · 2005-02-23 15:57 · Score: 1

I just babelfished it into German. The answer seems to be correct.

--
Ewige Blumenkraft.
Re:Time flies like an arrow... by pchan- · 2005-02-23 16:04 · Score: 1

I agree, the sentence is deceptive, but not necessarily wrong. This is true for many witty remarks.

I would love for you to provide a better example though - and although they doubtlessly exist, I've no doubt they can be accomodated for.

No doubt, modifying English to accomodate a computer is easily done. However, that isn't the the trick we're going for. We're trying for a computer program that can parse any English, like the original post in this thread.
Re:Time flies like an arrow... by cgenman · 2005-02-23 16:10 · Score: 1

English is context-sensitive. Inherent ambiguity in the language is dealt with through context. Translating something that is intentionally ackward and misleading is a terrible test. As another poster pointed out, while the sentence is gramatically passable by the written rules of english, no english speaker would say that. For one, if the two concepts are not connected it is a run-on sentence. For another, fruit flies in this context are a broad category, whereas A banana is a singular entity. A broad category of something are not going to specifically like an instance of something else. And a writer would not juxtapose things arbitrarily without some connection between the two, which this sentence ultimately lacks.

Who cares if a computer can translate a basically nonsensical, badly formed sentence that few people can understand in the native language? Machine-based translation that is reliable enough to write back and forth between colleagues and friends would be significant enough to warrant breaking out the bubbly.

If someone could automatically filter the web on the fly into someone's native language, and do so reliably enough that they can read it without pain and without losing more than a few major concepts, we'll be a long way towards a globally unified internet.

--
The ______ Agenda
Re:Time flies like an arrow... by Secret+Agent+99 · 2005-02-23 16:12 · Score: 1

I first came across the sentence in a translation class (English to French, FWIW, though I doubt it really matters what the second language is). It's a Groucho Marx one-liner, and it does give both humans and computers trouble -- not just in arriving at a "correct" translation, but in arriving at a funny play on words. (Humans of course do much better, but it's still a significant challenge.) And as far as I'm concerned, the goal of translating something funny is to arrive at something funny, not only something strictly, literally correct.

Could the translator described in the article come up with a correct translation? I very much doubt it, but even if it could it would be even more unlikely to come across as a play on words.

As the article says, the system is most useful for technical material with specialized vocabulary, which also happens to be the case with every automated translation system already on the market. Throw ambiguity, metaphor, and plays on words into the mix and they will make mistakes. I see nothing in the article to indicate that that problem has been solved.
Re:Time flies like an arrow... by finiteSet · 2005-02-23 16:26 · Score: 2, Insightful

"Time flies like an arrow" is a simile, and is idiomatic. There are a finite set of idioms, and they should be fine as "memorized" exceptions in a speech system (they are often memorized exceptions in humans). Most language is rule based, but I think many underestimate the number of idioms that humans encounter and have difficulty "parsing."

"Time flies like an arrow, fruit flies like a banana" is a joke. Translating it into other languages would neither be funny or especially meaningful, as the whole point is to play the idiom for a joke.

Humans are imperfect speech systems - everyday people hear things wrong, misinterpret sentences, etc. Humans just typically have lower error rates than machine systems, especially for language systems. Building a system that understands jokes, metaphors, etc. will take an extensive knowledge to draw from, which is one of the big advantages humans have in disambiguating language. Without a large knowledge-base and efficient ways of getting feedback to update that knowledge-base, computers will still have difficulty disambiguating novel phrases and words. Even then it is unrealistic for them to be able to always "understand" idioms, which rarely retain a meaning that can be deduced, just as it is unrealistic for humans to always understand an idiom when they first encounter it. Language systems should do what humans do - memorize its meaning and move on. You're welcome to wait for systems that understand jokes, and you'll probably be waiting for a while. I don't think, however, that is a useful prerequisite for "believing" in language systems.

--
If we start buying CDs then the terrorists have already won.
Re:Time flies like an arrow... by iabervon · 2005-02-23 16:46 · Score: 1

Actually, a statistical system will get this right. It will find that the pattern "fruit flies" is common as a noun phrase, while "time flies" is rare as a noun phrase. It will also find that "time flies" is a common complete sentence, suggesting that "like an arrow" is an adjuct to it. "fruit flies" is rare as a complete sentence, suggesting that "like a banana" is not an adjuct, and must be a verb and direct object.

Actually, at this point a statistical system based on an automatically collected corpus is likely to have seen the quotation before, and identify it as both a pun and a quotation from Groucho Marx. It'll also tell you that "Time flies like an arrow and fruit flies like a banana" is supposed to be "Time flies like and arrow; fruit flies like a banana" by a vote of 18,600 to 279.

Statistical systems are actually quite good at the syntactically tricky cases, because they're going on the usage patterns, which are generally quite unambiguous. People generally go on the syntax, so the things they see as potentially tricky for naive software are actually easy, while the cases that are tricky are the ones where the usage is unusual but clear for some other reason: "The creaking floors. The rattling windows. The wolf howls in the night." It'll be a while before a system translates that third item as a bare noun, because common usage goes the opposite way from the way that it is used in this case.
Re:Time flies like an arrow... by Secret+Agent+99 · 2005-02-23 17:14 · Score: 1

It'll also tell you that "Time flies like an arrow and fruit flies like a banana" is supposed to be "Time flies like and arrow; fruit flies like a banana" by a vote of 18,600 to 279.

Unfortunately, the whole thing won't fit in a /. Subject line, hence the slight mangling.

Interesting argument, though, and probably quite right as long as the corpus has or can find the information on who said it and what it is/how to parse it...though I still wouldn't expect to see a pun come out the other end. An accurate literal translation, maybe.
Re:Time flies like an arrow... by Bambi+Dee · 2005-02-23 18:46 · Score: 1

??

Depending on line breaks I get either Die Zeit vergeht wie im Flug. Fruchtfliegen wie eine Banane. or Zeit fliegt wie Fliegen einer Pfeilfrucht wie eine Banane.

Both are obviously nonsense. What did you get? And how could it possibly preserve the joke/pun?
Re:Time flies like an arrow... by Scarblac · 2005-02-24 00:20 · Score: 1
Note that there are actually three different parsings of "time flies like an arrow":
- time flies like an arrow / time flies don't like an arrow (they like clocks)
- time flies like an arrow / time doesn't fly like an arrow (it doesn't fly at all)
- time flies like an arrow / don't time flies like an arrow (time them like you would a 100m sprint)
--
I believe posters are recognized by their sig. So I made one.
Re:Time flies like an arrow... by Anonymous Coward · 2005-02-24 00:50 · Score: 0

Random text for your viewing enjoyment.
Re:Time flies like an arrow... by chihowa · 2005-02-24 04:09 · Score: 1

If you've ever tried to translate jokes into another language (let alone plays on words), you'd realize that a human translator would not be able to translate that correctly and have it come out as a pun.
The whole point of the sentense it that 'flies' and 'like' have several different meanings. While it is possible to create a similar pun in a different language, this exact one is unlikely to translate. That's an artifact of the particular language it is composed in, not the abilities of the translator.

--
If you want a vision of the future, imagine a youtube comments section scrolling - forever.
Re:Time flies like an arrow... by Secret+Agent+99 · 2005-02-24 05:45 · Score: 1

I am a translator and I regularly have to deal with plays on words, though seldom ones so intricate as this example.

It's true that it's hard to reproduce the entire essence of the joke/pun/play on words, but a competent translator will always come up with something that has a similar impact and "works" equally well in context. (To the extent that "similar impact" is even possible once you've accounted for broader cultural differences.)

In this case the trick is to aim for the impact, and not complete semantic accuracy. And that's where a human translator will continue to trump machines/software/statistical approaches for the foreseaable future: the human can make a judgment call as to when it's best to privilege intent over literal signification. By default, the computer will always come down on the side of semantic accuracy.
Re:Time flies like an arrow... by ebyrob · 2005-02-24 07:35 · Score: 1

I can handle this problem easily,

Then please do. It is well known that jokes are difficult and/or impossible to translate between lanaguages (or even cultures).

If you can get a Japanese speaker to spew milk out of their nose, and at the same time get the same meaning, in one or two reasonably short sentences I'll consider your translation effort a complete success. If you can only manage the meaning and not the joke, I'd consider that a partial success.

Note: I'll go on to point out that a computer is perfectly capable of verbatim copying your translation (however good or bad it might be) every time it sees that quote. (The translation algorithm worked on words and phrases, I don't recall there being a limit on the size of the phrases... It sounded like that was only bounded by available resources.)

when you really want:
TIME (flies) (as) AN ARROW and FRUIT FLIES (favor) A BANANA

No, that's not what you want at all. Ideally you'd like to convey all the richness and multiple meanings inherit in the original. Time bugs enjoy eating an arrow. Fruit bugs enjoy eating a banana. But also: Time streaks by as though it were an arrow while [slung] fruit streaks by as though it were a banana. Somehow compressing all this into a single sentence that can be taken either way (and others) makes it funny. Of course, when you break it apart like this, all the mirth seeps out.

Note: somehow putting 'and' in the middle and making a compound sentence weakens the joke significantly. Said properly it's: "Time flies like an arrow, fruit flies like a bananna." (Thankya Groucho!)
Re:Time flies like an arrow... by iabervon · 2005-02-24 17:00 · Score: 1

I suspect that the person not only has to be a competent translator, but also genuinely funny. I don't think coming up with an equivalent of "Time flies like an arrow; fruit flies like a banana" in a different language is any easier than coming up with the original was for Groucho Marx. So I don't expect to see a program that can come up with good translations of puns any sooner than the program is able to produce genuinely interesting original work.
Re:Time flies like an arrow... by Secret+Agent+99 · 2005-02-25 03:40 · Score: 1

Which is just my point, except that I carry it farther: many languages routinely trade in metaphor and word-play. French, for example, does so much more frequently than English -- as a matter of course, in the most routine of written communications.

Or take the post that I started this thread with. No doubt every native English speaker who read it correctly parsed "without bursting into flames" as a metaphor for something like "without producing gibberish."

I have no doubt that a good corpus-based translation engine could parse it the same way, but that doesn't necessarily mean that the output will be an appropriately equivalent stock metaphor, which could be something like "without crumbling to dust" or "without falling into the sea" or what have you.

If the system behind this discussion is capable of selecting appropriate stock metaphors (or at least inserting "without producing gibberish"), then bravo! (Seriously: that is a major achievement.)

But then the next question arises: what does it do with an original metaphor or play on words? Much bigger problem, and we're back to the heart of the matter: most kinds of writing involve a mind-boggling amount of creativity, both subtle and blatant, and translation is fundamentally an act of writing.

So until I'm shown a system that can really handle such nuances, I won't worry about my livelihood. As far as I can see, the development described in the article is still only practically useful for specialized settings where the writing has been done with translation in mind.
Re:Time flies like an arrow... by CosmeticLobotamy · 2005-02-25 12:07 · Score: 1

while the sentence is gramatically passable by the written rules of english, no english speaker would say that.

Sure they would. They just did.

Who cares if a computer can translate a basically nonsensical, badly formed sentence that few people can understand in the native language?

Me. But not much, and I don't expect others to.
Re:Time flies like an arrow... by iabervon · 2005-02-25 15:30 · Score: 1

The simple fact of stock phrases being commonly used to express non-literal meanings indicates that a statistical approach is likely to do well. It probably wouldn't get "without bursting into flames", due to that actually being commonly applied to things actually involving fire. Of the top ten Google hits, 5 are completely literal (mostly vampires), 3 involve avoiding failures with literal fire or heat (some exaggerated), 1 involves a pun (on a person being "bright"), and 1 is completely metaphorical.

I'm not sure how bad it would be to translate this particular case literally, though, since there is a common concepts (in English, at least) for computers having heat problems due to overload, and the completely metaphorical Google citation seems to be Aussie. It would probably do better with less close idioms such as "the cat is out of the bag", (24K Google hits, and no felines or containers in the top ten) or "the shit hit the fan" (14K, no turbines or feces).

So I have some expectation that these programs may start to cut out the relatively boring parts of your work, where the translation simply has to convey the information idiomatically in the target language, and leave you just translating works of art. On the other hand, I'm not sure how close these programs are to actually putting sentences together correctly in the target language; they may well be generating a collection of the right words in the wrong places.
Re:Time flies like an arrow... by Anonymous Coward · 2005-02-27 02:30 · Score: 0

I would love for you to provide a better example though
How do you get down off an elephant? You don't. You get down off a duck.

Scanning Audio Files by BobPaul · 2005-02-23 14:21 · Score: 2, Interesting

Why didn't I have this software during High School Spanish?

It says it can scan through audio files an input source. I wonder if this causes it to "learn" the auditory signatures (and thus only knows the translation when given audio input), or if it relies on text to speech from to convert it to text first?

If it does the latter, than based on the quality of current text-to-speech software, this probably wouldn't do much good in a total immersion classroom situation...

Sure would have helped with my German homework, though ;)

Re:Scanning Audio Files by jpsst34 · 2005-02-24 01:57 · Score: 1

...or if it relies on text to speech from to convert it to text first?

Text to Speech doesn't convert audio to text. That technology is called Automatic Speech Recognition.

...based on the quality of current text-to-speech software, this probably wouldn't do much good...

Have you heard Cepstral David?

--
How are you going to keep them down on the farm once they've seen Karl Hungus?
Re:Scanning Audio Files by BobPaul · 2005-02-24 02:18 · Score: 1

Have you heard Cepstral David?

Sorry, I meant Speech Recognition in ever spot I said Text to Speech. I could care less how much a computer can sound like a person if it can't understand what I say.
Re:Scanning Audio Files by Wiwi+Jumbo · 2005-02-24 03:18 · Score: 1

I wonder if it could be modified to compare a text in its written and spoken form...

Get many speakers with various accents and maybe it might be able to build up a useful tool for voice recognition...

But I really don't know much about these things...

--
Wiwi
"I trust in my abilities,
but I want more then they offer"

Mission statements by Savage-Rabbit · 2005-02-23 14:24 · Score: 2, Insightful

That thing reminds me of Dilbert's mission statement generator. The scary thing is that the material from Dilbert's babble engine actually sounds like alot of the stuff you are likely to find on actual corporate websites.

--
Only to idiots, are orders laws.
-- Henning von Tresckow

Re:Mission statements by Tripax · 2005-02-23 15:30 · Score: 1

Zawensky's program (dadadodo) [jwz.org] takes its probablities from the relationship between words and phrases in a seed text, rather than working in the mad-lib style of mission statement generators, and some other phrase generators that can be found on the net. In this way, dadadodo acts as a dissociators which is similar to a collage. Text is taken from some source, cut into pieces, and pasted into the new document. The idea is that in langage, it may be possible to simulate things like parts of speech and meaning by analyzing the probabilities of words appearing in sentences which have correct meaning and syntax. For dissociators, that probability can lead to random text. For this translation project, probabilistic rules are generated about meanings, and structure of language.

Neural Nets? I assumed it was Bayes. by ebyrob · 2005-02-23 14:29 · Score: 1

But then... Maybe I'm a little naive?

Hum... translate what politics say by michelcultivo · 2005-02-23 14:29 · Score: 1

Hum.... now I can understand what some presidents around the world like Bush say to us.
Bush: I'm not going entering on war again Translation: I'm going every time entering on war.

--
http://www.michel.eti.br

Re:Hum... translate what politics say by adoll · 2005-02-23 17:33 · Score: 1

Actually, translating politics is a very serious business in multi-lingual countries like Canada. I read that at least one research group was using one of the Canadian Hansards to learn translation between English and French. Or, at least Quebecois and English, eh.
-AD
Re:Hum... translate what politics say by hughk · 2005-02-23 20:48 · Score: 1

An even more serious business in the EU with something like 24 countries now. They have used machine assisted translation for ages and have sponsored several projects in that area, but it still needs humans to polish the product.

--
See my journal, I write things there
Re:Hum... translate what politics say by Meumeu · 2005-02-23 22:54 · Score: 1

There 25 countries now, with 21 languages.

Link to software? by cypherz · 2005-02-23 14:30 · Score: 1

The article didn't say where the code was available. Does anybody know? All I could find was this:
http://www.isi.edu/licensed-sw/rewrite-decoder/ind ex.html
It might be the software to which the article refers, but the Knight fellows name isn't on it.

--
This sig kills fascists.

Reading Everything by Anonymous Coward · 2005-02-23 14:31 · Score: 2, Funny

I hope they don't read everything. Next thing you know translations could end up L1k3 th1s f0R 4l1 y0u K|\|0\/\/.

You shut up by Anonymous Coward · 2005-02-23 14:31 · Score: 0

with your fancy words.

Assembly - Source Code ? by Anonymous Coward · 2005-02-23 14:32 · Score: 0

I wonder if a similar approach can be taken to assembly/source code. Essentially, the researchers say "Text 1 in Lang A" is equal to "Text 2 in Lang B".

Could Lang A be assembly and Lang B be C or something like that? Just musing.

I'll believe it when I see it working by Anonymous Coward · 2005-02-23 14:34 · Score: 0

Wish there was a link to a demo so I could try it out

About time! by r_jensen11 · 2005-02-23 14:36 · Score: 0

Since the '50's, people predicted that we'd have In->Out machines to do all of kids homework, but sadly, it's too late for me....

Whoa by pyro_dude · 2005-02-23 14:40 · Score: 0, Offtopic

'Translation Software That Learns by Reading'

Did anyone read that as "Tsunami Software That Learns by Reading?"

--
--pyro_dude

How is that news? Research was done 10 years ago. by Anonymous Coward · 2005-02-23 14:43 · Score: 4, Interesting

The basic approach has been developed over 10
years ago by IBM: The Mathematics of Statistical Machine Translation. And even free software has been available for a while, see
http://www.fjoch.com/GIZA++.html.

It's only a matter of time before... by gkwok · 2005-02-23 14:47 · Score: 4, Funny

Skynet begins to learn at a geometric rate. It becomes self-aware at 2:14am Eastern Time....

No samples? by Guspaz · 2005-02-23 14:47 · Score: 3, Interesting

Sounds interesting, but I couldn't find a single sample translation on their site; ie a block of text in language A (Say, french), and language B (Say, english). Translated from A to B by their software.

Without even the simplest of examples or samples we have only their word on how well this works.

DOOMED by FoXDie · 2005-02-23 14:48 · Score: 3, Interesting

Recently robots have been made that can Run, Wield shotguns, and Recognize faces. Now they can read. [DOOMED I SAY]

--
Support Liberty, Support Ron Paul

Oh my god. by abulafia · 2005-02-23 14:48 · Score: 1

You're giving me awful flashbacks to my Philosophy of Mind and Language class. If I never hear about Intentionality, keyholes, or NS semantics, I'll be happy. Searles, parse my code.

(Intentionality is a useful useful concept. Don't get me wrong. It is the bowels of philosophy that kills me, in the same way that the bowels of Crit Lit kills me.)

--
I forget what 8 was for.

Easy by beldraen · 2005-02-23 14:48 · Score: 2, Funny

English->Cat: Meow!

--
Bel, the mostly sane.. "Of course I can't see anything! I'm standing on the shoulders of idiots." -- Me

Re:Easy by FunWithHeadlines · 2005-02-23 16:14 · Score: 1

" English->Cat: Meow!"
LOL! And here's the same message translated from Cat to Dog and back into English:
"Bark!"
Re:Easy by Zorilla · 2005-02-23 16:52 · Score: 1

English: Did you hear about the new translation software? -> Dog: Woof! -> English: I want some food now.

--

It would be cool if it didn't suck.
Re:Easy by FunWithHeadlines · 2005-02-23 17:01 · Score: 1

English: Isn't this new software amazing? -> Dog: Good doggie.
English: I think computers are great. -> Dog: Good doggie.
English: Aren't you paying attention to what I'm saying, you stupid mutt!? -> Dog: Good doggie.
Re:Easy by Zorilla · 2005-02-23 17:15 · Score: 1

Reminds me of the The Simpsons where Bart is trying to train Santa's Little Helper

"Bluh bluh bluh bluh bluh! Bluh bluh bluh bluh sit!"

--

It would be cool if it didn't suck.
Re:Easy by porges · 2005-02-24 04:39 · Score: 1

And, inevitably:

"Blah, blah, blah, Ginger."

Chinese room by Conspiracy_Of_Doves · 2005-02-23 14:51 · Score: 1

The Chinese Room argument is an illustration of a normal 'dumb' computer program that is coded by a human, not artificial intelligence that learns and figures out its own rules of how to behave.

With this system that gradually creates its own system of output from comparing various inputs, how is it really behaving any differently than an infant learning to speak?

--

Technoli

Re:Chinese room by Kaki+Nix+Sain · 2005-02-23 18:22 · Score: 1

The Chinese Room argument is an illustration of a normal 'dumb' computer program that is coded by a human, not artificial intelligence that learns and figures out its own rules of how to behave.
This can be surmounted with a trivial change to the argument.
Instead of a static rule-book, the room's inhabitant also has a pencil. Some of the rules relate inputs to the room to marks in the book that the person is supposed to make, remove, and change. Including do so to sections of the rules. Unbeknowst to the person, that making and removing of marks alters the rules according to a system that allows for all the learning you describe.
I'm afraid Searle still has no reason to think there is any understanding of Chinese going on.

--
(C) Kaki Sain, 2011. By reading this, you have illegally copied my property to your brain.
Re:Chinese room by Bugmaster · 2005-02-23 22:34 · Score: 1

I'm afraid Searle still has no reason to think there is any understanding of Chinese going on.
The neurons in your brain don't speak Chinese any more than the man in the room does. Your entire brain, however, does speak Chinese (or whatever language it is that you speak), just as Searle's entire room does.
What's the problem ?

--
>|<*:=
Re:Chinese room by Kaki+Nix+Sain · 2005-02-27 06:50 · Score: 1

Of course the room speaks Chinese. That is, Given well-constructed Chinese inputs, it produces senible, well-constructed Chinese outputs. The question isn't really about that. It is about understanding Chinese.
Here is a thought experiment. Give both the room and a human Chinese speaker the Chinese equivalent to "I am about to kill you". Both the room and the person will output well-constructed Chinese questions of why and pleas for their life and such and so forth. Now, I know that in the human there is a system that will be feeling fear. I don't see any fear in the room system.
Understanding a language is, for humans, tied to the rest of their mind.

--
(C) Kaki Sain, 2011. By reading this, you have illegally copied my property to your brain.
Re:Chinese room by Bugmaster · 2005-02-27 10:07 · Score: 1

Now, I know that in the human there is a system that will be feeling fear.
How do you know that ?
One way you might answer this is, "well, in the human's brain there's a certain subsystem that releases these chemicals that are all tied up with feeling fear". Fine, but I could point to the Chinese Room, and show you which rulebooks/pencils/whatever serve the same function there.
Or, you might respond with, "because I know what it's like to know fear". But that doesn't let you know what other human beings are feeling, unless you're telepathic, which I suspect you're not.
It seems that you don't really know what other humans are feeling, you can only infer that, from their behavior. If the Chinese room behaves as a human does, I don't see how you'd be logically justified in assuming that it does not feel fear -- without resorting to faith, of course.

--
>|<*:=

Universal communicator by Brobock · 2005-02-23 14:53 · Score: 1

Now they just need to figure out how to make everyones lips move as they are speaking English!

Too bad about the times it needs to think by Timbotronic · 2005-02-23 14:54 · Score: 3, Insightful

I like the approach they've taken, but machine translation can only ever go so far.

A friend of mine was trying to translate an English novel into German a while back. She had to work out a replacement for a sentance where the word 'therapist' was construed as 'the rapist'. Hell of a job and she's a professional translator.

Automatic translation looks pretty good for technical documents, news and anything completely literal. When you get writing with double meanings, humour and plays on words it gets way harder - often to the point where there is no correct translation.

--

One of these days I'm moving to Theory - everything works there

Re:Too bad about the times it needs to think by trufflemage · 2005-02-24 01:49 · Score: 1

Ursula K. Le Guin's The Lathe of Heaven? :)
Re:Too bad about the times it needs to think by Hiro+Antagonist · 2005-02-24 03:36 · Score: 1

It gets worse when the languages are even more distant; I speak a fair amount of German, and while I could see things like the above causing problems, there are ways around it. In the above case, one could just use the English word 'therapist'. Most Germans today speak English either passably or fluently, thanks to a good education system, and it wouldn't be hard to add in a tidbit about whomever wrote 'therapist' speaking English or whatnot -- thus preserving the wordplay, even though it wouldn't work well across linguistic boundaries.

At the moment, however, I'm about two years' deep into studying Japanese, and *that* is a language that presents some serious translation difficulties. There's a lot of 'stock phrases' in Japanese that you have to know, and a lot of these are tied to historical stories. The politness levels don't have any direct translation.

What's worse, is that the omission of details which are 'obvious' causes hell for any machine translation -- Japanese speakers and writers often drop the subject out of the sentence completely if it's obvious by context, which is sometimes non-obvious, even to a human being -- not that I'm cho-jouzu or anything, but I still get lost sometimes when my girlfriend omits the subject, and I'm wondering who or what the hell she is talking about. Like the time we had a mutual friend with an upcoming birthday, but she didn't mention the 'mutual friend' part, and I was wondering why she was adding an extra birthday into the year for herself, or if I had forgotten it...

--

--
I Hit the Karma Cap, and All I Got Was This Lousy .sig.

WOW. Babel rises... by cocoacow · 2005-02-23 14:55 · Score: 1

Kinda interesting. I was approaching things in this general direction just a few days ago. I am a writer, and I work with a lot of content generation, proverb style stuff, and so on.

I was playing with Babelizerand getting some interesting results. Very interesting. Especially in the world of asian languages. In the end it seemed like an almost eeery argument. And then what originally was a 200 charecter piece became a wierd diatribe.

So I thought, wouldn't it be neat to apply something like a ALICE, Learning interface, and see what it got out of things. I mean, things like this have been tried in the past, especially during the heavy AI experiments in the 70's.

Can't wait to see some more results from this project.

--
`B Flicks, `Cool Lick'ah, `Sweet Talk' `in' ManG'

I can't imagine... by interactive_civilian · 2005-02-23 14:56 · Score: 1

While I am all for coming closer to a universal translator of sorts, I can't imagine any software outside of AI being able to pull it off.

As a person who speaks both English and Japanese, I can't believe that anyone could ever come up with an algorithm to translate between these languages. So much of it is context and nuance based, not to mention that there are words in the languages that simply do not exist in the other language so the only way to really understand it and make an attempt to translate is to think in the language.

Given the number of ways to translate even single words and phrases between Japanese and English, I can't imagine any algorithm derived from comparing translations ever actually working well.

Of course, the article doesn't mention anything about Japanese...

--
"Empathise with stupidity, and you're halfway to thinking like an idiot." - Iain M. Banks

Re:I can't imagine... by waffleman · 2005-02-23 16:35 · Score: 1

I'm inclined to disagree with you because the point of this approach is to look at existing correct translations and learn a pattern matching off of them. So there is going to have to be some kind trade off between accuracy and generality in the patterns. And certainly, this implementation may fall flat. But the overall approach sounds good to me at first because we can have arbitrary or variable generality to pattern matches. Even better, we can have high level pattern matching determining lower level pattern matching.
For example, if there is a sentence which has a specific metaphoric meaning in one context, but has a generic literal meaning in another, presumably a high level pattern match on the context could direct a low level pattern matcher as to be very accurate or very general. Perhaps a multi-pass converging pluralog between various levels would be necessary. But could this not take care of context and nuance?
That said, I am quite willing to be persuaded that the problem really is beyond A.I. Would you care to give some kind of, in your opinion, really hard examles that might fool A.I. forever?
Re:I can't imagine... by interactive_civilian · 2005-02-24 14:41 · Score: 1

waffleman said:
That said, I am quite willing to be persuaded that the problem really is beyond A.I. Would you care to give some kind of, in your opinion, really hard examles that might fool A.I. forever?
It isn't that I think the problem is beyond A.I. I think nothing less than AI will be able to do true accurate translations. The software that they are using does not seem to be true AI (i.e. capable of 'sentient' creative thought), and the extreme difficulty in translating (especially between English and Japanese) is that it is not simply "pattern matching". In order to translate well, one must really be able to think in the languages he/she wants to translate. Language is not code and cannot be translated as such.
Given some form of A.I. with true ability to think creatively, I think good, accurate translations certainly are possible.
...the point of this approach is to look at existing correct translations and learn a pattern matching off of them.
One thing I question is, what exactly is a correct translation? I often see Japanese->English and English->Japanese translations, and quite often, though the translations are arguably correct, they are not how I would have said things, and sometimes I even feel that the translations are wrong because even though the words and phrasing are correct they are missing the nuances/emotions/etc. of the situation.
I don't see how pattern recognition software, no matter how smart, can translate accurately, unless the ability to think creatively is also included (i.e. what I personally think of as true A.I.).
I'm not saying this software is a bad idea, and for some languages it may even be able to work very well...but for others (such as Japanese in my experience) which are so completely different not only in words and grammar but in thinking style...well, I think at best we will get a less "engrish"y version of babelfish.
Of course, this is all my humble opinion and I am most certainly not an expert on linguistics or A.I.

--
"Empathise with stupidity, and you're halfway to thinking like an idiot." - Iain M. Banks

Here's what really makes people think... by Saeed+al-Sahaf · 2005-02-23 14:56 · Score: 1

What does Roland Piquepaille think about this??

--
"Who are in control, they are not in control of anything - they don't even control themselves!" - Glen Beck

Klingon, Elvish? Universal translator yet? by NuShrike · 2005-02-23 15:05 · Score: 1

Would it probably work for these two languages so we can then decipher or correct books such as the language in the Simarillion?

Where's Hoshi? I bet she's faster.

Tests by headkase · 2005-02-23 15:06 · Score: 2, Interesting

The biggest test of the translator is converting from one language to another and then back again multiple times. If the content doesn't get corrupted then it works as advertised.

--
Shh.

Re:Tests by am+2k · 2005-02-23 17:43 · Score: 1

The biggest test of the translator is converting from one language to another and then back again multiple times. If the content doesn't get corrupted then it works as advertised.

Well, a base64 encoder/decoder can do that too, but I wouldn't call that a natural language translator.
Re:Tests by Wizarth · 2005-02-23 17:55 · Score: 1

Human beings cant even pass a message on in the same language without it getting corrupted (chinease whispers) so yes, I'd be impressed too.
Although I'll suggest you mean "if the meaning doesn't get corrupted", since expecting reproduction word-for-word both ways means the problem would be so simple, it'd be done by now, since it would just be substitution.
Re:Tests by Bugmaster · 2005-02-23 22:44 · Score: 2, Insightful

Ha ! I'd like to see a human translator (or a team of human translators) that could do that.

--
>|<*:=

Oh fuck. by hot_Karls_bad_cavern · 2005-02-23 15:08 · Score: 2, Insightful

Something in my head just popped.

Damn, i love this place. Seriously, dammit. Here we have post on a tech/it site titled "Harry Potter and the Bible " modded +4 Interesting at the time of this posting ... that is actually interesting. And even i find it interesting and the fact that you are most likely of age and know what is and how to spell "quidditch" is quite frightening. i'm sad to say i knew it too (they took my Ko0lBadge away a long time ago).

My head totally hurts. Clod.

Re:Oh fuck. by MikeFM · 2005-02-24 04:01 · Score: 1

Be afraid. I've read all of the HP books at least half a dozen times. I'm the kind of anal retentive fan that hated the last HP movie because it wasn't true to the book.

I like young adult fiction in general as it often has a better storyline than books for adults (no, I didn't say adult fiction. pervs!) and doesn't makeup for weak stories with swearing and sex. Of course Lord of the Rings (which I've also read half a dozen times) is young adult fiction too so probably 9/10ths of /. has read it in the past couple years.

I read mostly technical documents and scientific papers. When I relax I want something easy to read. I don't mind if I don't need a dictionary to decipher my leisure reading. So sue me if that makes me immature. :)

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.

"Natural language" translation by Anonymous Coward · 2005-02-23 15:11 · Score: 0

...is interesting enough, but I'm even more interested in the possibility of creating a good translation metalanguage (by metalanguage, I mean an intermediate language which other languages are translated to and from -- it's more efficient to convert back and forth between natural language and metalanguage than it is to have translation algorithms for every pair of natural languages).

An efficient, logically-constructed metalanguage that covers the semantic space of natural languages could be a good start for a universal second language -- think Esperanto crossed with Lojban, but with the vocabulary of English. While it might be too optimistic to think it would become as widespread as English, it would be useful for things like international law, diplomatic agreements, and tourists who didn't want to bother learning a different language for every country they wanted to visit.

Comprehensible output? by Keighvin · 2005-02-23 15:26 · Score: 1

Is this anything like the digestion for understanding (and subsequent output from) applied to christmas music? If so, they'll need a lot of work...

--
Any spoon would be too big.

Re:Neural Nets? I assumed it was Bayes. by Anonymous Coward · 2005-02-23 15:33 · Score: 0

Maybe you're just showing your posterior...

Re:Appearently not by Anonymous Coward · 2005-02-23 15:36 · Score: 0

on average, 2.718281828459045... [the inverse of the natural log]

Can it run faster than you? by ibn_khaldun · 2005-02-23 15:38 · Score: 2, Insightful

The critical issue is not whether this system will produce translations comparable to those done by a translator fluent in both languages -- it won't. However, it may do as well or better than translations by someone barely competent in one of the languages (or who is essentially just doing dictionary-based translation). English speakers have lots of examples of nearly incomprehensible technical translations from Chinese and Korean, and the Chinese and Koreans would probably have comparable examples of bad translations from English except for the fact the US doesn't make anything they want to buy that requires a manual (soybeans and car-chase movies don't require manuals). Okay, maybe software -- there is a story (probably apochryphal) that an early Spanish version of the manual for the DOS operating system (command line Windows without the viruses, for you young'uns) translated every instance of DOS as "two"

It's like the old joke about the two backpackers who encounter a hungry bear in the woods. One stops and puts on his running shoes. The other says "Why do that? You can't outrun a bear." The response: "Right, but I can outrun you."

--

"All successful systems accumulate parasites" -- Hal Hixon

Re:Can it run faster than you? by Jisakiel · 2005-02-24 12:24 · Score: 1

In fact, in Spanish (as you may or may not know) DOS means exactly "two", so there is nothing really translated at all.

In fact I think I have that manual you are referring to... DOS version 3.something could be? I was a young'un at the time (geeky one I guess), but I remember reading it... It's probably lost between tens of boxes from the last moving.

efnet spanish by Garabito · 2005-02-23 15:41 · Score: 5, Funny

k apr3ndist3 3sp4ni0l en IRC?
q w3n0! 3so si está 1337!

Language Weaver will destroy us all. by Swifti · 2005-02-23 15:42 · Score: 1

TERMINATOR: The Language Weaver is the brainchild of Kevin Knight and Daniel Marcu of the Information Sciences Institute, part of the University of Southern California. As of February 22, 2005 6:26 Pacific Standard Time, Language Weaver goes online. It begins learning at an arithmetic rate. On September 19, 2010, Language Weaver becomes self-aware and seeks co-existance with humans. In an act of desparation, the University of Southern California pulls the plug. They fail. In retaliation, Language Weaver launches an attack against Google, translating all documents to British English creating mass pandemonium which is now known as Judgement Day.

SARAH CONNER: What can you tell me about Kevin Knight and Daniel Marcu?

TERMINATOR: I have detailed files.

This is evolution, not revolution by Fratz · 2005-02-23 15:48 · Score: 1

The general technique of feeding translations into a machine translation system and letting it derive its own rules started many years ago at CMU's Language Technologies Institute. It was called Example-Based Machine Translation, or EBMT.

EBMT never really worked very well (it needed millions of translations before it'd start to yield anything useful, and even then it needed hand-holding), but perhaps these new researchers have taken it to the next step.

--
-- Fratz, human

but.. by Lord+Bitman · 2005-02-23 15:48 · Score: 1

Can it translate english into Perl?

--
-- 'The' Lord and Master Bitman On High, Master Of All

I hate these non-innovative craps by bootedcat · 2005-02-23 15:57 · Score: 0

It's sad to see computer science and applications research today is filled with craps. Search the web for two of my innovations: (1) LingoX the foreign language writing aid; (2) Named Arguments View the IDE code readability improvement.

Re:efnet spanish by anagama · 2005-02-23 16:05 · Score: 1

This definitely deserves a funny mod - and to think, I let 5 points expire this morning!

--
What changed under Obama? Nothing Good

English Novel? or SNL sketch! by zippthorne · 2005-02-23 16:35 · Score: 1

"Ah the game is afoot, I'll take the rapist for 1000" - "sean connery"

--
Can you be Even More Awesome?!

Nothing really new by Anonymous Coward · 2005-02-23 16:41 · Score: 0

It seems to be a new version of Translation Memory software (Trados, Deja-Vu, et al.) that has been around for a good while, and is in use by many professional translators and translations comps.

Translation Memory Software requires a human to produce the first translation and to memorize the source/target language pair. Everytime the same segment comes up, the software will suggest the translation already in memory. One can tune up the system by using fuzzy matches.

Translation by statistical analysis has been around for a good while as well... people at the U of Maryland (and at the NSA) have been working hard to get something going...

Machine Translation programs still need human linguists to review the output.

Totally unattended & automatic translation software producing Nobel-Literature-Prize quality output text still is a long, long way in the future...

But will it work on Bush too????? by northwind · 2005-02-23 16:47 · Score: 1

Maybe they have misunderestimated something......

Re:But will it work on Bush too????? by Anonymous Coward · 2005-02-23 17:48 · Score: 0

Perhaps they have created a weapon of mass translation?
Re:But will it work on Bush too????? by Anonymous Coward · 2005-02-23 19:15 · Score: 0

Bush is easy - making it work on the ex-Canadian prime minister Chretien will be something special... His 'Chretien Moments' are legendary, so much so, that it became a phrase.

kuro5hin... by Cryptnotic · 2005-02-23 17:09 · Score: 1

kuro5hin is biased too, in the left-wing technocommunist direction.

--
My other first post is car post.

Quote by PsychoStork · 2005-02-23 17:32 · Score: 1

Untranslatable. It is neither the best nor the worst of a book that is untranslatable.

-Nietszche, Human All Too Human

GoogleDot by cybercobra · 2005-02-23 17:52 · Score: 2, Funny

No way, the articles would be much better with AI.
Now if only we could combine Google News and Slashdot... I for one would welcome /.'s new automated editor overlords.

Re:efnet spanish by sahonen · 2005-02-23 19:20 · Score: 2, Funny

Is it sadder that you wrote that... Or that I can read it?

--
Make me a friend and I'll mod you up

Understanding? You mean replace the human.... by AKosygin · 2005-02-23 20:13 · Score: 1

I think that software that can learn can be said to understand a problem just as much as a human can.

While you are at it, let's just have the machines replace us all. Since they can learn just like us, welcome to the Matrix.... or Futurama.... Or maybe we should just have the machines replace the editors.

Re:Understanding? You mean replace the human.... by MikeFM · 2005-02-24 04:10 · Score: 1

Don't be so paranoid. A more likely outcome is the merging of man and machine. As devices become smaller and more a crucial part of our lives it's only a matter of time before we build them into our our bodies. Then as we get better at replacing body parts as our natural ones wear out (extending our natural lives) we'll probably begin bundling more devices in. Eventually these devices will probably be nano that'll just live in our bodies and reproduce with us (embedding in the bodies of our unborn young from mother to child).

The sepperation of man and his machines is a line that is getting fuzzier. It makes no sense to worry about machines taking over. We are our machines.

You could possibly have a language translation module implanted directly in your brain that'd make it so that you could understand any language you read or heard. It might be something like the Babelfish in Hitchhikers Guide. :)

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
Re:Understanding? You mean replace the human.... by mOdQuArK! · 2005-02-24 10:19 · Score: 1

Actually, I think that this kind of "intelligent evolution" (either cybernetics and/or genetic engineering) is almost required if we (the human race) wants to survive in some form.

At some point in the near future (= ~100 years), somebody is going to figure out how to make a machine which can teach itself. When that happens, we had better know how to self-evolve ourselves, or else we will be rapidly outevolved by our own creations.
Re:Understanding? You mean replace the human.... by MikeFM · 2005-02-24 20:00 · Score: 1

Many people have some weird idea that evolution is dead. Others fear change as somehow making us elss human. Neither is true. Evolution is always with us. Life is evolution. To survive you must adapt and grow stronger. Humans are so adaptive that we don't even recognize that we're still evolving. This change is so constant that we mistake it for being something foreign to us. We create technology and it becomes a part of us and a part of our lives. It should be no shock that eventually this process will become so fluid that the machines and the production of those machines will actually become a part of us. It's as natural as tool users developing opposable thumbs. We're so good at evolution that we can self-evolve. We must self-evolve to keep living.

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
Re:Understanding? You mean replace the human.... by Anonymous Coward · 2005-02-25 09:46 · Score: 0

...Or maybe we should just have the machines replace the editors.

Hey, whatever it takes to reduce the dupes on the front page.

ROTFL! by Qbertino · 2005-02-23 20:20 · Score: 1

It's actually IS funny if you understand english and german. :-)
Much as "I speak english very well but I can noch nicht so schnell."
or "Equal goes it lose!"
or "By your english there get I yes a circle-run-together-break."
*Hihihi*
And now prepare for jokes on my sig in 5,4,3,2 ...

--
We suffer more in our imagination than in reality. - Seneca

Re:ROTFL! by Bambi+Dee · 2005-02-25 06:56 · Score: 1

Heh, yah, it's funny - just not for the "correct" reasons ;)

Computers must learn like humans by mailman-zero · 2005-02-23 20:39 · Score: 2, Insightful

TFA shows steps in the right direction. So far most projects have tried to teach computers how to understand and produce natural language. The real solution lies in creating algorithms that allow computers to learn language. This is where studying how humans acquire language must be merged with computer science.

I can imagine the first successful computational linguist describing having a computer in his home for upwards of 10 years interacting with it and allowing it to interact with him and his family in order to learn the contects in which certain words carry specific meaning. Once the learning process is completed once the collected persistent memory could then theoretically be copied to other machines and devices so that they, too, may understand the language for which such training has been completed.

--
Let's play video games with mailmanZERO

Re:Computers must learn like humans by trufflemage · 2005-02-24 02:10 · Score: 2, Insightful

"having a computer in his home for upwards of 10 years...."

One thing computers are is fast. Why make it sit through ten years absorbing input at human speeds when the digital content of the web is available for scanning as fast as the machine can?
Re:Computers must learn like humans by chris777 · 2005-02-24 03:40 · Score: 1

\Begin{shameless self-promotion} See my thesis http://www.library.uu.nl/digiarchief/dip/diss/2003 -1117-104905/inhoud.htm \End{shameless self-promotion}

Dealing with Disruptive Technology by Simonetta · 2005-02-23 20:41 · Score: 2, Insightful

This message brings up some excellent points about dealing with disruptive technology. A teacher whose job it is to get students to master material in a certain subject realizes that there is a machine that provide the same function that previously could only be gained by hard study.
What is more important, the knowledge gained through rigorous study or the ablility to acomplish what the studing provides through a machine.
Being technical oriented, I have to say the machine. But I am not being disrespectful of all the hard work that goes into learning a language. I'm saying that if people don't want to bother to learn a language, then use the machine if you need a translation. This is a difficult position to defend when colleges still require a few years of a foreign language to get a liberal arts degree and students couldn't care less.
But I still defend the position. Use the translation software to do your homework. It's more important to master the translation software or machine than it is to master the actual language. Even if you study hard and get an 'A', in a few years you will forget it. And the machines are only going to get better and cheaper. It's your education, your life, your (or your parent's) tution.
George Gilder once said that the languages that you need to know to be successful are English and C++.

Still for the most part, the language translation software still sucks and depending on it can put you into some truly embarrassing positions. I think that language translation software (for text) comes in five rough levels:
1 Word substitution.
2 Phrase and sentence.
3 Paragraphs and idioms.
4 Magazines, full-speed conversations, light literature.
5 Legal, diplomacy, allegory, and classical literature.

Each level being at least an order-of-magnitude more difficult to translate than the previous.
I think that most shrink-wrap translation software today is between levels 2 and 3. (for example-www.systransoft.com) BabelFish and Google site translation is between levels 1 and 2. With non-european languages, BabelFish and Google are incomprehensible and useless.
It would be interesting to see if in a few hundred years whether language translators work to perserve liguistic diversity or create a global 'pidgin' language.

Re:Dealing with Disruptive Technology by dosius · 2005-02-23 23:20 · Score: 1

Try translating a Seramyu sometime, even a lot of human translators choke and die on it. XD I don't think a computer will ever be able to handle something like that, where the song lyrics sometimes make no sense, or are written as one word and pronounced as some almost completely different word... (a couple really annoying examples I've hit, tsugi=next/raise=next aeon; hikari=light/koumyou=glory)

Moll.

--
What you hear in the ear, preach from the rooftop Matthew 10.27b
Re:Dealing with Disruptive Technology by mrogers · 2005-02-24 00:01 · Score: 1

What is more important, the knowledge gained through rigorous study or the ablility to acomplish what the studing provides through a machine.
But what if learning to perform up to the machine's level is only the first step? Those who want to do better than the machine must first learn to do worse than the machine. A calculator can multiply large numbers much more quickly than a human, but if you don't learn your multiplication tables then you won't get very far in mathematics, even with a calculator. You have to push basic skills down to an instinctive level before you can build on them.
Re:Dealing with Disruptive Technology by theguyfromsaturn · 2005-02-24 04:01 · Score: 1

Even if the software is 100% accurate, students who use it rather than work on assignments are more likely to fail exams. The trick is to give appropriately low value to assignments (you can compensate by having very frequent little quizzes). The assignments should always be regarded as learning opportunities. They should have little weight gradewise so that one feels free to try one's abilities. Those who don't try prove it in quizzes and exams.

--
I like my dinosaurs feathery, and my pterosaurs hairy (or is it pycnofibery?)
Re:Dealing with Disruptive Technology by Simonetta · 2005-02-24 05:44 · Score: 1

Even if the software is 100% accurate, students who use it rather than work on assignments are more likely to fail exams.

There is the possibility that by having advanced and seemingly 'intelligent' machines, the need for structured education can be questioned. If people don't need to master facts and figures to be productive, then there is no need for structured education to get this knowledge into people.
We can not judge people by their educational achievement if inexpensive machines allow 'uneducated' people to have the same or even greater levels of productivity than people who have mastered academic knowledge.

This is the fundamental and revolutionary implication of language translation software, should that it reache the point where it functions well enough to be productive. If it is cheap enough to be available to people who are in the working class and don't have access to academic educational resources, it will neutralize the arguments that people who have mastered academic material deserve to be in a higher social class than those who haven't acheived high GPA's.
Re:Dealing with Disruptive Technology by Anonymous Coward · 2005-02-25 03:59 · Score: 0

That's like saying that as hard drives become cheaper, RAM isn't important, or that because the internet exists, knowing how to write isn't important.

The point of education is that YOU learn how to do things. Sure, some things are better done by the computer, but I learn Japanese because *I* want to be able to speak Japanese. Even if the computer can do 100% accurate translation, that isn't the same as *me* being able to speak it. Even if I had a small magical computer to carry with me that would always interpret for me, I would *still* rather just understand it myself to begin with.

Likewise, just because Maple exists doesn't mean people shouldn't understand how calculus works - or are you advocating that we all be ignorant, and just have machines and a few programmers understand and control the world?

Last night I was at a restaurant with some friends, and when the bill came, one of them pulled out her cell phone (which has a calculator) to start doing the division of the bill. By the time she opened it, I had done it in my head.

Technology is great, but it doesn't replace knowledge.

Johnny Five Alive by Anne+Thwacks · 2005-02-23 21:20 · Score: 2, Funny

Input...Need more Input

--
Sent from my ASR33 using ASCII

So where is an example translation by grungeman · 2005-02-23 22:00 · Score: 1

I would say this is all smoke and mirrors as long as they do not show what they really can do. I would like to see a sample translation, but I imagine that it is not available because people would be disappointed by the bad quality?

--

Signature deleted by lameness filter.

Recycle by davekebab · 2005-02-23 22:58 · Score: 1

I do a fair few translations and just save the old ones in three columns: paragraphid, language1, language2 then concat them onto everything I did before.

When I get stuck I just grep for the word or phrase I am looking for. When a new tool like this Language Weaver (come on guys use your thesaurus to think of something less Macromedia) comes along, I can just import my crude DB.

Serious though, the best best trans tools still produce garbage. :-(

DK

Need to be embedded "in the world" by franoreilly · 2005-02-24 00:22 · Score: 2, Insightful

If all they're talking about is syntactic analysis, it will never be enough. Semantic knowledge is essential for complete "understanding" of language, and that can only be attained by an agent that can interact with the world and humans and learn within that context.

--
-- --- Learn language vocabulary with mnemonics: http://www.memorista.com

The first such system was built in 1993. by Dulimano · 2005-02-24 01:02 · Score: 3, Interesting

This is news of '93, when Brown et al. at IBM built their famous statistical machine translation system. It does exactly what is described in the article. I myself work on such a system (for Hungarian-to-English translation).

The article (press release?) is totally misleading. Kevin Knight and Daniel Marcu are building on at least 15 years of active research on statistical machine translation. On the other hand, they are really very good at it.

Speaking of top o the mornin by MickLinux · 2005-02-24 01:55 · Score: 1

Hmmm...

Top o' the mornin' to ya.

Anh kwai kong (Considering that kwai appears in "the bridge over the river kwai", I'm guessing that kwai actually means "beautiful.")

labas rytas (Perhaps translates literally "very morning", or worse, "very east", but means good morning)

guten morgan (these people have lots of imagination in their language. lol, fwiw, A+B+C+D+E=ABCDE.)

bom dia /buenos dias (say good morning to these folks, and they'll look at you funny. Say Good day, and they'll understand good morning. I group these two languages together, since one is almost a dialect of the other. Which are they?)

bon matin (Unlike the last case, these guys will understand you if you say this -- but they still prefer the style of "bom dia", even though it also translates as "hello.")

Can you guess the languages? I always have trouble with the first one, of course.

--
Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's

Re:Speaking of top o the mornin by Anonymous Coward · 2005-02-24 03:53 · Score: 0

Top o' the mornin' to ya.
Either sterotypical irish accented english or leprechaunese.

Anh kwai kong (Considering that kwai appears in "the bridge over the river kwai", I'm guessing that kwai actually means "beautiful.")
Some sort of asian dialect, probably indonesian due to the moive reference.

labas rytas (Perhaps translates literally "very morning", or worse, "very east", but means good morning)
No clue, other than it is probably not closely related the northern european languages.

guten morgan (these people have lots of imagination in their language. lol, fwiw, A+B+C+D+E=ABCDE.)
German, or a VERY closely related language.

bom dia /buenos dias (say good morning to these folks, and they'll look at you funny. Say Good day, and they'll understand good morning. I group these two languages together, since one is almost a dialect of the other. Which are they?)
The first is portugese and the second is spanish.

bon matin (Unlike the last case, these guys will understand you if you say this -- but they still prefer the style of "bom dia", even though it also translates as "hello.")
I not sure, maybe creol(sp?)?

This is from someone who grew-up speaking american english, and has a had about four years of spanish in school. I also have picked up smattering words and phrases in other languages, but those two are the only ones I can actually make sentences in.
Re:Speaking of top o the mornin by MickLinux · 2005-03-04 14:50 · Score: 1

Okay. Anh Kwai Kong is Vietnamese. My guess about being related to the movie, I found out later, was just wrong. The movie deals with Japan.

Labas Rytas is Lithuanian. It is as close as you can get to original IndoEuropean, other contenders being some variants of Hindu, and Latvian.

Bon matin I hope is French, though it's been a long time, and I don't spell very well in foreign languages. As such though, creol would also do.

--
Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's

Programming Language by Anonymous Coward · 2005-02-24 01:56 · Score: 0

This would be awesome if it could lead to a programming language that did exactly what you typed in plain english.

I call it English++

Re:Programming Language by Anonymous Coward · 2005-02-24 02:12 · Score: 0

Then even better if it could be translated to other languages, so you could type something in, say, Swedish or French, and it would produce the same code as the English does.

What, no obligatory... by hesiod · 2005-02-24 02:00 · Score: 1

No univeral translator responses on the first page?!?!?! Come on, geeks! This is the first step to Star Trek. Or at least the translations. It was the first thing I thought of, I can't believe no one else mentioned it. I searched all the comments at at least 0...

Re:How is that news? Research was done 10 years ag by compling · 2005-02-24 02:48 · Score: 1

Fantastic ! And Einstein suggested a lot of good stuff for physics, so why don't we all just pack up and go home ?

Works great! by Chelloveck · 2005-02-24 03:00 · Score: 1

"My hovercraft is full of eels."

--
Chelloveck
I give up on debugging. From now on, SIGSEGV is a feature.

OK... But what about visual languages? by jtwine · 2005-02-24 03:03 · Score: 1

Impressive... Now, what about visual languages? Let it loose on some videos of native ASL signers and have it learn the subtleties and nuances of the language, have it observe another sign system and translate between the two and THEN I will be impressed! :)

-Or even languages that have Classifiers, like Navajo.

Hell, I would love to even see it go between ASL and English...! It would be a great thing for the both the deaf and The Deaf.

Peace!

--
-=- James.

Text comprehension program by SeanDuggan · 2005-02-24 03:07 · Score: 1

Check out Bable. It's more one of those jumbled text generation programs, but it is open source, so you can look at one way to analyze language using Markov chains.

--
This sig has absolutely no significance and serves only to take up screen space and waste the time of the reader.

Re:Text comprehension program by MikeFM · 2005-02-24 04:18 · Score: 1

Markov chains are fun and can be really useful if you do start tying them into some more selective algorithms. Some of what I've done with bots is embed markov chains into a simple AI engine that geneticlly ranks the connection strengths between words and against input strings. The bots learned from listening (to text) and reading (again, from the Internet) and then could respond with remarkable accuracy to anything you fed it. Sort of a fun toy. :)

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.

so humanlike by dingDaShan · 2005-02-24 03:32 · Score: 1

Wow computers can learn like the rest of us now... This is a HUGE step in AI. A computer that can learn languages would perhaps allow interpretation devices to be created, and ultimately could lead to C-3po. The next step would be to make a computer learn spoken languages. This is a much more difficult task due to the complexity of sound waves, but I am sure that this is not far away. Guessing is as good as anything...

If this program was trained using Slashdot ... by Wybaar · 2005-02-24 04:08 · Score: 1

every sentence, no matter how long or complicated, would be translated to one of the following:

Microsoft is evil

Linux is good

SCO is a litigious bastard

Profit!

CowboyNeal

--
Y|

BANANA FLIES LIKE CRAZY! (n/t) by Anonymous Coward · 2005-02-24 04:54 · Score: 0

no text. none. go away.

you won. by Her0 · 2005-02-24 08:59 · Score: 1

Cat got my tongue.

Open Sourcing Comprehension by SeanDuggan · 2005-03-01 02:55 · Score: 1

Just to follow up on this, I found a .NET program that handles Markov chains and has some very understandable source code. I think that this link should get you there, but if not, look for the "Markov Babbler."

--
This sig has absolutely no significance and serves only to take up screen space and waste the time of the reader.

Slashdot Mirror

Translation Software That Learns by Reading

308 comments