Slashdot Mirror


Microsoft Translator Now Supports Yucatec Maya and Querétaro Otomi Language

First time accepted submitter BrianFagioli writes So, just how rare are these two languages? The Yucatec Maya language is spoken by less than 800,000 people, while the Querétaro Otomi is spoken by about 33,000. These are extremely low numbers in the grand scheme of things, which increases the risk of the languages dying out altogether. With that said, Microsoft's support of the languages in its translator software will essentially preserve it for posterity. Even if the languages end up fading away from actual use, it should live digitally forever.

60 comments

  1. Ben oui c'est ça by Anonymous Coward · · Score: 0

    Pis le calisse de Microsoft Translator comprends toujours pas l'osti de français ben normal.

    Microsoft à marde. /joke

    1. Re:Ben oui c'est ça by ArcadeMan · · Score: 1

      This is as painful to read as SMS-speak in english.

    2. Re:Ben oui c'est ça by mcswell · · Score: 1

      Kush awo'tan, wen lek te k'op Yucatec.

  2. FIrst phrase translated from Yucatec Maya by JoeyRox · · Score: 4, Funny

    "Why did Microsoft take away the start menu in Windows 8? Metro is worse than bat shit."

    1. Re:FIrst phrase translated from Yucatec Maya by fustakrakich · · Score: 2

      No, the first word everybody is looking for is "pelaná"

      --
      “He’s not deformed, he’s just drunk!”
    2. Re:FIrst phrase translated from Yucatec Maya by Anonymous Coward · · Score: 0

      >Metro is worse than bat shit

      On what basis is that comparison fair?
      Obviously the software can't be recycled into gun powder. Maybe it's a suitable replacement for water-boarding??

      Perhaps some almost-lost wisdom will get translated?

      It might be like finding some forgotten papers on electro-fishing from the last hundred years or so, and seeing that it was noticed that fish swam towards the positive electrode. Maybe the Mayans knew that when their bright sky God was in a certain mood, strange things would happen with the fish, dolphins, and whales?

      Bing would say El Nino. I think not.

      http://www.ocregister.com/arti...

  3. And... by danbob999 · · Score: 1

    No one will be able to tell if the translation is valid or not.

    1. Re:And... by chipschap · · Score: 1

      No one will be able to tell if the translation is valid or not.

      You've succinctly described the real issue here, since machine translators are still not very good.

      Preserve the language? Not really.

    2. Re:And... by mcswell · · Score: 1

      833,000 people would disagree with you.

      Ok, if you meant in the future...yes, you have a point. The real way to preserve a language is to preserve genuine human-written or -spoken texts, and preferably a grammar and a lexicon as well. Even better if the grammar is computer-processable. (Disclaimer: that's my job, albeit not for endangered languages.)

  4. Ba'ax ka wa'alik? by fustakrakich · · Score: 1

    Mix ba'al.

    --
    “He’s not deformed, he’s just drunk!”
    1. Re:Ba'ax ka wa'alik? by mcswell · · Score: 1

      Max ka'i atohol.

    2. Re:Ba'ax ka wa'alik? by bill_mcgonigle · · Score: 1

      in aerodeslizador 'Chuup ye'etel u anguilas!

      (English<->Maya doesn't quite round-trip yet at Bing)

      --
      My God, it's Full of Source!
      OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
  5. Number One! by Anonymous Coward · · Score: 0

    This may be the most useful thing MS has done!

  6. Will it translate this cookbook... by Anonymous Coward · · Score: 1

    To Serve Man?

  7. "Fewer than," not "less than" by Anonymous Coward · · Score: 1

    Submitter can't into English.

    1. Re:"Fewer than," not "less than" by Anonymous Coward · · Score: 0

      So? You can't into post Slashdot.

  8. Not the interesting bits by Anonymous Coward · · Score: 0

    The valuable bit about rare languages from a linguistics perspective is the rare and unusual constructions. This does nothing to preserve that.

  9. Test by translating back by chadkennedyonline · · Score: 0

    Test it out by translating to one of the new languages and then translating back to English. I hope it works better than most translation software I've seen. Otherwise we may be fooling ourselves about how much of the language is being preserved.

    1. Re: Test by translating back by ArmoredDragon · · Score: 2

      Some meanings are lost in translation because the target language has no analog to what you're trying to say. Not even the best interpreters can work around this problem when it occurs in some cases.

      That said, I highly doubt a computer program could either.

  10. klingon by Anonymous Coward · · Score: 0

    it won't be useful until klingon is in the mix

    1. Re:klingon by Anonymous Coward · · Score: 0

      The joke is on you, Klingon is already suported, both with the latin alphabet and pIqaD.

  11. In other news by invictusvoyd · · Score: 1, Funny

    Kaspersky lab has identified a critical vulnurability in the translator . One exploited the attackers can replace everyday use words with obscene words like fetituka and kgrojhyakta which could make people speaking these languages chop off each others heads .

    1. Re:In other news by Anonymous Coward · · Score: 0

      I will not buy this Microsoft. It is scratched.

    2. Re:In other news by chipschap · · Score: 1

      I think it also causes global warming.

  12. What's next by Anonymous Coward · · Score: 0

    What's next? Trying to preserve dying languages like French? Sometimes you just have to let these things go.

  13. Two comments. by LWATCDR · · Score: 1

    1 It also does Klingon.
    2. There does not seem to be a word in Yucatec Maya for quark.

    --
    See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    1. Re:Two comments. by LWATCDR · · Score: 1

      Okay it supports Klingon but not Cherokee?

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    2. Re:Two comments. by ChunderDownunder · · Score: 1

      Quark was Ferengi, not Klingon.

  14. Inquiring minds want to know ... by 140Mandak262Jamuna · · Score: 0

    ... if Klingon and pig latin are supported.

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
  15. Digitally live for ever? by 140Mandak262Jamuna · · Score: 1
    All the fortran programs I wrote as a grad student in Indian Institute of Science, back in 1980s live for ever in that 2400 feet of half-inch tape recorded at 6250 Bytes-per-inch lives digitally for ever. The only good thing about that was that I swiped the blank media from my former employer instead of paying half-a-month salary of a gazetted (civilian equivalent of commissioned) officer to buy the blank tape in the open market. Back in those days, in India, imported items were that expensive.

    Then I also have Watfor compiler and ChiWriter in 5.5 inch floppy disks. I have my grad student work at UT in a unix mini tape. I also have some IOmega 100 MB disks. In my basement somewhere I have the backups of my WindowsNT machine in 3.5 inch floppy disks. All of them are digital. All of them are as dead as any corpse buried in a cemetery.

    This is Microsoft we are talking about. I am seriously thinking of buying a new desktop because pretty soon I won't be able to buy a new Windows7 machine. Google will keep it in beta forever. Microsoft will slap a new version number of end of life it in 10 years.

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
  16. They'll live forever, but... by Anonymous Coward · · Score: 0

    ...the only thing people will know how to say in these languages will be "It looks like you're trying to write a letter."

  17. And? by Anonymous Coward · · Score: 1

    What about Zapoteco, Nahuatl, Huichol, Sioux, Navajo, and the dozens of other native American languages? Our continent has had a rich history of cultures and languages, perhaps the most diverse in human history! FWIW, my niece is named after a Mayan goddess, Ixquic and my brother-in-law's ashes are interred in the Zapotec capital of Monte Alban in Oaxaca, Mexico and the largest boulevard in the city is named for him now. Also, my niece's husband is from Queretaro. I don't know if he speaks Otomi, but it is possible. I'll have to send him a link to this article.

    Mexico is a culturally rich and diverse land. I have always loved my time spent there (first visit in 1955). I have a sister-in-law who is a bruja/curandera (a witch/healer also from Queretaro), extended family with a Mescal still in Oaxaca for 4 generations, a brother-in-law who was at one time head of the electrical engineering department at La Universidad Politechnico de Mexico - Mexico's MIT, a sister-in-law who is a retired Catholic nun, another sister-in-law who is a professor of chemical engineering at the Technical University of Oaxaca, and more than I can express. What I am so clumsily trying to say, is that all of these languages, and cultural history, if possible, should be saved. Huichol is still spoken in eastern Mexico, and Zapotec in Oaxaca. These are important languages and cultures. If we can save them, we should.

    1. Re:And? by gwolf · · Score: 1

      If your family has had a mezcal for 4 generations, it just means they don't drink alcohol, or have it really well hidden. Maybe they have had a temazcal (similar to a sauna, but with far more associated rituals to it) ;-)

      Universidad Politécnico does not exist, in fact. There is a long-standing rivalry between "la universidad" (Universidad Nacional Autónoma de México) and "el poli" (Instituto Politécnico Nacional). Nowadays, I teach at UNAM and am a student at IPN :-)

      Huichol is spoken in the West. Náhuatl is the dominant indigenous language in the center of the country, the different types of Mayan in the East. But there are over 60 distinct indigenous languages throughout the country.

    2. Re:And? by jrumney · · Score: 1

      Our continent has had a rich history of cultures and languages, perhaps the most diverse in human history!

      Not to belittle the great diversity of language on the American continent, but sub-Saharan Africa and South East Asia, especially India, Indonesia and the Philippines, are significantly more diverse lingustically even than Bolivia (the most lingustically diverse country in the Americas).

    3. Re:And? by Anonymous Coward · · Score: 0

      It's helpful if you read ALL the words - he said his extended family has a Mescal STILL in Oaxaca. To me it means they distill their own Mescal, so I'm guessing they drink quite a bit of alcohol. Only question is, do they put worms in their bottles?

    4. Re:And? by mcswell · · Score: 1

      There is lots of work going on to "save" these languages, in various ways: recording oral and/or written texts, writing grammars and dictionaries, teaching children the languages in the classroom, translating learning materials into them, promoting literacy in these languages. Some efforts are more successful than others. Do a web search for "documenting endangered languages", or for the individual languages.

      And if you want to look for other languages--many of them endangered--a good starting place is the Ethnologue (ethnologue.com).

  18. No Quechua by cusco · · Score: 2

    Still doesn't do Quechua, which is spoken by over 14 million people.

    --
    "Think about how stupid the average person is. Now, realise that half of them are dumber than that." - George Carlin
    1. Re:No Quechua by Anonymous Coward · · Score: 0

      "Think about how stupid the average person is. Now, realise that half of them are dumber than that." - George Carlin

      Some people are even so stupid that they don't know the difference between average and median.

    2. Re:No Quechua by cusco · · Score: 1

      Think about delivering that line to 1400 audience members, half of whom are drunk, stoned, or both, and tell me which will be understood by more. And in this case the average and the median are so close it doesn't matter anyway.

      --
      "Think about how stupid the average person is. Now, realise that half of them are dumber than that." - George Carlin
  19. Neat but pointless. by Karmashock · · Score: 0

    Translating the top 10 most common languages between each other is the most useful. Anything beyond that is window dressing.

    --
    I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
    1. Re:Neat but pointless. by Anonymous Coward · · Score: 0

      I doubt Microsoft would have invested the time lost on this to anything else. Somwhere very high up an ivory tower a powerpoint presentation was made and ppeople clapped.
      Of course, you don't see this kind of nonsense happening on Linux, because we vote on things. It may turn out like a bush-gore election, but dammit, this is free software. But then again so is windows 10. So i guess what i'm saying is - eat what you are fed and support global warming because if this feature is of zero use to you, you are probably a racist.

    2. Re:Neat but pointless. by tehcyder · · Score: 1

      Translating the top 10 most common languages between each other is the most useful. Anything beyond that is window dressing.

      Quickly checking on wikipedia, how about the 11th (German)? 18th (French)...?

      "Top ten" is a ridiculously arbitrary cut off point.

      --
      To have a right to do a thing is not at all the same as to be right in doing it
    3. Re:Neat but pointless. by Anonymous Coward · · Score: 0

      Waste of time! Let's all just speak Mandarin! Let the majority dictate the minority!

    4. Re:Neat but pointless. by Karmashock · · Score: 1

      Way to miss my point. If the system sits there and is useful for the top ten then it wins. If its useful for dead languages that no one really speaks anymore... then its useless. Do you need to translate your words into and out of Mayan?

      No?

      What was that? I didn't quite understand what you said...

      --
      I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
    5. Re:Neat but pointless. by Karmashock · · Score: 1

      Yes because with mass market products its all about minority appeal... oh wait, no it isn't.

      --
      I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
  20. Apocalypto by bunyip · · Score: 3, Insightful

    I wonder if we can now translate that Mel Gibson movie about the Maya?

    Seriously though, adding another language with a completely different structure will force them to improve their translation engine. I find it interesting.

    A.

    1. Re:Apocalypto by Half-pint+HAL · · Score: 1

      Unfortunately, it won't. They'll produce tripe, like they have always done, and people will congratulate them for "preserving" a language when they are merely producing a badly mangled version of it.

      --
      Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
  21. MS Translator by sd4f · · Score: 1

    MS's translator is generally good, I like using it on my phone, the camera translation is a particularly nice feature, but the one language which I'm constantly having to go to google for translations is latin. Why won't they add it in? For a lot of educated disciplines, science, law, latin is a used, and sometimes it's helpful for some decent translating.

  22. Breadth of Source Material? by Anonymous Coward · · Score: 0

    Given the relative rarity of the language, and I'll assume isolation and technical illiteracy, there can't be a whole lot of material for them to build their translation engine on. How well does it work, and on esoteric topics? This might be a playground for conspiracy theorists looking for Mayan doomsday predictions when it generates strained translations.

  23. TFS according to google translate by Anonymous Coward · · Score: 0

    TFS sent through a machine translation round trip back to English...

    "Less than 800,000 people, Queretaro Otomi 'and talk about it, while 33,000. This increases the risk of completely disappearing language, which, in the grand scheme is very low numbers. He said, Microsoft languages for future generations to maintain this facility allows translation of the program. Will eventually fade from the actual use of language, even if it must be to live forever Digital"

    1. Re:TFS according to google translate by chipschap · · Score: 1

      This is actually much better than I expected. I'll have to (shudder) give Microsoft due credit!

    2. Re:TFS according to google translate by tehcyder · · Score: 1

      I'll have to (shudder) give Microsoft due credit!

      You must be new here.

      First rule of slashdot club is: you never give credit to Microsoft.

      I don't think there are any more rules.

      --
      To have a right to do a thing is not at all the same as to be right in doing it
    3. Re:TFS according to google translate by Half-pint+HAL · · Score: 1

      Well. much more its a bit surprised by the taakbesik of translation.

      That was supposed to say "I agree. I'm more than a little surprised by the quality of that translation." Oh well, back to business as usual.

      I'd also like to add in that the Maya here was "Ma'alo'ob. Ya'ab in más u jump ' íit sorprendido tumen le táakbesik ' u ' le traducción '. " -- the "más" looks like embedded Spanish, so (perhaps unsurprisingly) it seems like their translations are going via Spanish. In fact, you can switch to Yucatec Maya, and write entire sentences in Spanish that will be translated to English (conozco más gente que tú -> I know more people than you) which confirms this (although if you accidentally hit a word with a valid mean in Yucatec, it does get confused). I would suggest that the most likely reason for the GP round-trip translation to be so good is that the generated Yucatec was really just glossed Spanish -- one of the easiest languages there is for translation to/from English.

      --
      Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
  24. Curious by Anonymous Coward · · Score: 0

    Microsoft knows that today, nor yesterday and nor tomorrow are there any living humans who could validate the accuracy and lack thereof of these "translations" will ever be called upon by Microsoft to validate these Translations.

  25. new or only remaining markets by Anonymous Coward · · Score: 0

    Microsoft must be throwing all this time, money and energy into these two rare languages as they are the only remaining market for Microsoft products. There cannot be another explanation, can there?

    1. Re:new or only remaining markets by Half-pint+HAL · · Score: 1

      Simple explanation: someone publishes or delivers a bunch of texts. The computers can churn through them with no further human input to create a crappy translation engine at next to no cost. Newspapers report "wonderful" work MS/Google/whoever is doing to "preserve" the world's languages. It's cheaper than a page in the Times, and has a potentially much bigger reach.

      --
      Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
  26. Speak or write? by Anonymous Coward · · Score: 0

    The translation is only text, there is no spoken translation in audio format, yet.

  27. Fjutek by ZdzichuFjutek · · Score: 1

    To be, or not to be, that is the question— Whether 'tis Nobler in the mind to suffer The Slings and Arrows of outrageous Fortune, Or to take Arms against a Sea of troubles, And by opposing, end them? http://www.microsoftdynamicser... http://www.microsoftdynamicser... http://www.microsoftdynamicser... http://www.nav2015.pl/ To die, to sleep— No more; and by a sleep, to say we end The Heart-ache, and the thousand Natural shocks That Flesh is heir to? 'Tis a consummation Devoutly to be wished.