Slashdot Mirror


Coming Soon, The Google Translator

compuglot writes "Google gave journalists a glimpse of its next generation machine translation system at a May 19th Google Factory Tour. "Google Blogoscoped" offers an excellent overview of the presentation. The system has been trained using the United Nations Documents as a corpus. This corpus is some 20 billion words worth of content. It uses existing source and target language translations (done by human translators at the U.N.) to find patterns it then uses to build rules for translating between those languages. Apparently it was successful where the current version had failed in translating certain phrases. If anyone were capable of making a serious go of MT, that would have to be Google."

1 of 418 comments (clear)

  1. Re:fascinating by elrous0 · · Score: 5, Insightful
    or at the least pick out pertinent words like "bomb."

    Why do I have a funny feeling that this research isn't being funded by philanthropic foundations?

    -Eric

    --
    SJW: Someone who has run out of real oppression, and has to fake it.