Slashdot Mirror


Google Launches Endangered Languages Project

redletterdave writes "About half of all of the languages in the world — more than 3,000 of them — are currently on the verge of extinction. Google hopes to stem the tide with its latest effort that launched Thursday, called The Endangered Languages Project. Google teamed up with the Alliance for Linguistic Diversity, a newly formed coalition of global language groups and associations, to give endangered-language speakers and their supporters a place to upload and share their research and collaborations. The site currently features posts submitted by the Endangered Languages community, including linguistic fieldwork, projects, audio interviews, and transcriptions."

28 of 194 comments (clear)

  1. as a petty google hater, let me just say.. by Johann+Lau · · Score: 3, Insightful

    .. kudos.

    1. Re:as a petty google hater, let me just say.. by cpu6502 · · Score: 4, Funny

      I'd rather see one universal language, or maybe a dozen (one from each family). Using Europe/the Mideast/North Africa as example:

      They were much better off when they spoke 2 common tongues (Roman and Greek) and could communicate with one another easily, then one thousand years later when they split-up into a bunch of incomprehensible tongues.

      --
      My AC stalker: " I personally agree with your posts most of the time, but that won't keep me from modding you troll"
  2. At the risk of a flame war... by fuzzyfuzzyfungus · · Score: 4, Interesting

    I can entirely understand why linguists value having as many different language samples to work with as possible, and I am similarly aware that active campaigns against various languages have usually closely accompanied active campaigns against their speakers(anything from harassment and discrimination up to and including wholesale slaughter). However, there is also a lot of language homogenization that occurs quite peacefully, with kids wanting to watch TV or speakers of some fairly obscure tongue looking for access to opportunities, culture, and company in more common languages.

    Given the value of language in communication between people, and the rather dubious history of the various things that make messy tribalism even easier than it already is, is this 'Linguistic Diversity' stuff actually a good thing(beyond the relatively narrow; although certainly important, value as a research sample for linguists and as a useful rallying point for resistance to other flavors of attack on relatively powerless groups)?

    1. Re:At the risk of a flame war... by fuzzyfuzzyfungus · · Score: 2

      Eh, I'd say that linguists wanting a diverse sample pool is more like biologists wanting a diverse sample pool: being able to do comparative work between languages, or organisms, can be very helpful in elucidating their structures and functions.

      Also, while there is some money for translators, linguists in general don't really derive substantial economic benefits from obscure languages(very few obscure languages, after all, have either wealthy speakers or wealthy would-be-listeners: the money in linguistics is concentrated largely where the speakers are). Tax lawyers, by contrast, have a directly symbiotic relationship with the complexity of the local tax code.

    2. Re:At the risk of a flame war... by sventech · · Score: 4, Insightful

      Culture and language are closely tied. Words used to translate between languages are not fully equivalent. When a language is lost, the culture, literature (written or oral), and worldview of that tribe is also lost. The unique qualities of each language teach us about humanity as a whole as we discover the relative weighting of their values. From a social standpoint, people eventually want to know about their ancestral tongue in the same way that adopted kids want to know who their parents were. Having a sense of place is key to being successful as a tribe, and is often a rallying cry. The things that support tribalism also support community -- propinquity is a basis for successful society, because it provides certain checks and balances, even though it keeps out some new ideas that would help. When people share a language they have a powerful tool to maintain shared values.

    3. Re:At the risk of a flame war... by AngryDeuce · · Score: 2

      Part of language is examining and understanding it's evolution. How the hell are we going to do that if the only example we have of the dialect is whatever popular culture managed to survive in the cultural consciousness? There are important references for real historical research locked inside those stories, as well. For instance, The 1700 Cascadia Earthquake, which they're recently examining through study of oral histories passed down among the Quilleute and Hoh Indian tribes. If no one bothered to speak this language anymore, how could we have ever made these discoveries? These stories provided an important record of the effect of that earthquake here (the resultant Tsunami is well documented in Japanese records from the period) This was only within the last 20 years, examining languages that will likely be fully dead within a century.

      I don't think there is any harm in preserving our history, I don't care how mundane it is. Something as stupid as a grocery list today could provide valuable insight into our daily lives 2000 years from now in much the same way graffiti provides insight into Roman culture 2000 years ago.

    4. Re:At the risk of a flame war... by aaaaaaargh! · · Score: 2

      I am a linguist and sort of agree with you. From the point of view of my discipline, it hurts me to know how fast languages die (= last speaker dies) and that even with optimal funding the linguist community will not be able to keep up with cataloguing all nearly extinct languages. First of all, since most languages do not have a writing system, when a language dies often a whole collection of stories and tales dies with it, too. Second, from a more theoretical perspective, it is kind of sad that problematic anglo-centric theories could become relatively adequate in the near future just because most counterexamples have disappeared. Many theoretically important languages like e.g. Warlpiri have rapidly dwindling speaker communities.

      However, as someone who has been changing country several times in Europe, I'd personally prefer if everybody in the world would talk English (which is not my mother tongue) and English was a universal secondary administrative language. Linguistic borders impose artificial cultural restrictions that in my honest opinion do not have any particular benefit in our modern society. (In my experience, the smaller the speaker community, the more nationalistic they become about their culture and language. However, in reality these people are already part of a global culture anyway and their cultural heritage usually just consists of a few strange rites, unusual meals and customes that they themselves are finding mor ridiculous than endorsing them.)

      Speaker communities that became the victims of genocide, colonialism, christianism, or cultural hegemony sometimes fight for the right to teach their language and culture in schools and to preserve their culture. While this is an understandable reaction and sometimes justified, because often languages are discriminated or prohibited only to oppress political and ethnics groups, in many cases keeping your culture and language is not the right way and doesn't work in the long run anyway. That's easy to see in Australia and the US, but also applies to many other places.

      So yes, we should preserve languages, but on a more personal level I don't see much good in trying to preserve linguistic diversity.

  3. Save ALGOL68 before it's too late! by jlv · · Score: 3, Funny

    Save ALGOL68 before it's too late!

  4. Can I by Sparticus789 · · Score: 2

    Can I upload the language that I created to talk to my invisible best friend when I was 6 years old?

    --
    sudo make me a sandwich
  5. Re:why in the hell by staalmannen · · Score: 5, Informative

    Well... in fact having a diversity can be pretty good. A lot of our thought process is based on associations and if you know a couple of languages (for me Swedish, English, Dutch and German - in order of proficiency) you also know that nuances are very hard to translate - there is simply no 1:1 perfect relationship between certain concepts within different languages. Those ambigous meanings and cultural associations are a fundamental part of the thought process. I am all for having English as a "lingua franca" and it should definitely be considered a second official language in most countries (especially within the EU institutions). On the other hand, there is a great strength in having different frameworks to form your thoughts in and, given this perspective, coming from a different language is clearly an advantage.

  6. My language is in the list :( by Cyberax · · Score: 4, Interesting

    One of the languages I know (Udmurt) is in the list :(

    It'd be nice to preserve it, but even I don't see that much value in it.

    1. Re:My language is in the list :( by Cyberax · · Score: 2

      Certainly not, I haven't spoken it for ages. I don't think there are many people out there who have Udmurt as a primary language.

      The total number of speakers is probably around 100-200 thousands, and all of them know Russian as well.

    2. Re:My language is in the list :( by FrootLoops · · Score: 2

      For others like me who wonder about these things, Udmurt is one of the two official languages of the Russian republic (similar to a US state) of Udmurtia (the other being Russian). It uses an extended Cyrillic alphabet and has around half a million native speakers.

    3. Re:My language is in the list :( by Cyberax · · Score: 2
      I also understand Ukrainian (a small feat, it's very close to Russian) and speak a little bit of German (it's incredibly easy to pick up if one knows Russian and English). Obviously, English, Russian and German are useful because of the large number of speakers.

      Udmurt language by itself is not very useful. However, it's a language from a family that is very different from English, German or Russian so it gives yet another 'view' of the world. And that's very useful by itself. Oh, and it also helps me to study the Finnish language - its structure just 'feels' familiar.

      Other than that? It's just interesting!

      I find myself wondering why these particular languages are going extinct (I'd imagine the USSR had something to do with this one, but in general) and if they offer less utility. That is, perhaps they're just not as good (expressiveness, ability to represent complex ideas, etc.), and there's some Darwinian action here among human languages. Or maybe that's not even a primary criteria for why languages go away.

      USSR actually was quite supportive of Udmurt language: Udmurt alphabet had been created, books were printed in Udmurt, etc.

      I think, it's simply the result of the Metcalfe's law applied to languages - the utility of a language grows with the square of the number of its speakers. And there's simply not enough speakers to support small languages.

  7. Easy Come, Easy Go by dmomo · · Score: 2

    ... annnnd immediately upon launch, the project was added to the Endangered Projects Project.

    Great idea, but how long will it be supported? Sadly, I think it will share a fate with Google Health.

  8. Re:Hm... by fuzzyfuzzyfungus · · Score: 2

    I'm more than a bit skeptical myself(and, frankly, suspect some of the 'language diversity' types of basically just wanting to keep the natives from doing the same boring stuff we do so that we can continue to gawk at them); but there is also a major historical... sore point... that makes talk about not preserving languages a bit troublesome.

    Yes, there are the utopian esperantists and various outcroppings here and there of "Oh, we could have peace if only we could understand one another!" flavored optimism; but much of the highly visible work in language homogenization has been directly associated with the pointy end of various imperialist projects. If you are planning on grabbing some savage's land and de-heathenizing them for god and country, it is practically a given that you'll give wiping out their language and culture, usually with good, old-fashioned violence and brutal child abuse, a try.

    This tends to mean that, in practice, various 'save the languages!' pressure groups are, operationally, more of a 'don't allow group X to be shoved off its land for bauxite mining, impoverished, and forced to migrate to some gigantic developing world slum and become day laborers' groups, with language loss being a visible symbol of the more dramatic annihilation of people with the temerity to live on top of valuable resources(let's just say that 'property rights' seem to become steadily more...flexible... as one heads toward the bottom of the heap.)

  9. Re:why in the hell by Twinbee · · Score: 2

    To that I would say; incorporate those 'in-between' words and concepts into English (or other language). Cross pollination of languages has happened throughout history of course, and suffice to say: A universal single language with all the subtlety of each existing language are not mutually incompatible goals.

    --
    Why OpalCalc is the best Windows calc
  10. Tax lawyers and perceived v. actual complexity by DragonWriter · · Score: 2

    Tax lawyers, by contrast, have a directly symbiotic relationship with the complexity of the local tax code.

    This is frequently stated, but not really all that true.

    Actual complexity of tax codes increases the effort tax lawyers must expend to deliver the same value to clients, which is disadvantageous to them.

    Perceived (by non-tax lawyers) complexity of the tax code increases the perceived value of the services of tax lawyers (and, therefore, the prices purchasers of those services are willing to pay), and is therefore advantageous to them.

    Insofar as tax lawyers have a vested economic interested in the complexity of the tax code, its not in maximizing the actual complexity of the tax code, but in maximizing the difference between the perceived and actual complexity of the tax code.

  11. Re:why in the hell by shutdown+-p+now · · Score: 4, Insightful

    down with everything but English

    I was with you until that point. A single world language would certainly do more good than harm, but English is a horrible choice. It's like deciding to standardize on a single OS circa 1999, and then picking Win95 because it's the most common one.

  12. Re:Hm... by wanderfowl · · Score: 4, Informative

    As a linguist myself (working with a few different revitalization projects), you can think about linguistic diversity as being like biodiversity: Examining the differences across many different, unrelated (or nearly so) languages gives better insight into Language (with a capital L) on the whole. Sure, losing an individual language doesn't destroy everything, but each language that's lost is one less (incredibly rich) datapoint which can be used to better understand how people do language, and what other ways things can be done.

    For instance, in Wichita, a language which may or may not be dead based on the health of its last few speakers, one could express "the buffalo ran up and down the village several times while scaring people" using a single, very long, very complex word. There are other languages which act like this ("polysynthetic languages"), but Wichita is really, frighteningly good at it. Don't you think that it'd be fascinating to do some MRI studies to see how Wichita people are parsing words, compared to speakers of, say, Mandarin Chinese, which isolate nearly every concept, grammatical or otherwise, into single words?

    In addition, as other people have pointed out, when you lose the language, you lose the culture very easily (and vice versa). Even if you're not interested in the specifics of how language works in the mind (or just in general), understanding different cultural approaches to the world provides more information on the human condition. If your culture doesn't permit or believe in the idea of "selling land", that's interesting data, and food for thought for most other cultures.

    In short, practically, in terms of trade or war or politics, there's little reason to have a group of 50,000 people speaking three languages rather than one. But if you're interested in how human language, culture, and cognition works, that diversity and those comparisons offer data that a homogenous group would not.

  13. Re:why in the hell by shutdown+-p+now · · Score: 3, Informative

    I'm genuinely curious; why do you consider English to be the Windows 95 of languages?

    Its spelling is horribly mismatched with pronunciation, and its morphology has a lot of irregularities (e.g. irregular verbs).

    I don't think the world should standardize on any existing natural language; a constructed one would be a better fit. But so long as we stick to natural languages, I'd much prefer, say, German over English (but then of course I'm also biased in favor of Indo-European group).

    Of course, in practice, the only way the world might standardize on a single language is by a process similar to Pax Romana. Today, that's Pax Americana, so that language is English, unfortunately. I sure hope Chinese are not going to be the next to run the world, because their writing system is so crappy English looks like Esperanto in comparison.

  14. Re:why in the hell by shutdown+-p+now · · Score: 2

    Bad analogy. Languages don't expire in the same sense as OS's do.

    You misunderstood the analogy. The point wasn't that Win95 is an expired OS; the point was that, even at the time of its dominance shortly after release, it was still crappy from design point of view. Similarly, English, while dominant, is rather poorly designed, largely due to its unfortunate origin as a marriage of two languages from different language groups, and then an uneasy and rather turbulent evolution.

    Also, are you implying that there exists a language that everyone should be switching to? Because there isn't really any 'good' universal language out there.

    It depends on your definition of "good". From my subjective POV, no natural language is good for that purpose, but artificial ones like Lojban might be.

  15. Re:why in the hell by gman003 · · Score: 4, Informative

    English is the exact opposite of "orthogonal". Nothing makes sense.

    Let's just look at the rules for taking a singular noun, and making it plural (paraphrased from Wikipedia's article):

    1. If the noun ends in a sibilant consonant sound, suffix -es
    1a. unless it ended with a silent E, in which case merely suffix an -s and pronounce the E
    2. If it ends with a non-sibilant unvoiced consonant, suffix -s
    3. For all others, suffix -s, but pronounce it as -z
    3a. Unless it ended with -o, in which case suffix -es and pronounce as an S (provided it is not a loanword from Italian)
    3b. Unless it ended with -y, in which case replace with -ies (but ONLY if there is not a vowel before the Y)
    3c. Unless the last consonant was an unvoiced fricative, in which case replace with a voiced fricative. Whether or not you should change the spelling varies by word
    3d. Unless it is one of the special words that do not change at all between singular and plural
    3e. Unless it is one of several Old English words that are suffixed with -en, often changing other parts (ie. brother -> brethren)
    3f. Unless it is one of several other Old English words that change certain vowels (ie. foot -> feet)
    3g. Unless it is derived from Latin and ends in -a, in which case follow the Latin rules and replace with -ae
    3h. Unless it is derived from Latin and ends in -us, in which case follow the Latin rules and replace with -i
    3i. Unless it is derived from Latin and ends in -um, in which case follow the Latin rules and replace with -a
    3j. Unless it is derived from Latin and ends in -[i|e]x, in which case follow the Latin rules and replace with -ices
    3k: Unless it is derived from Greek and ends in -on, in which case follow the Greek rules and replace with -a
    3l: Unless it is one of certain words from Hebrew, in which case suffix -im or -ot as appropriate
    3m: Unless it is one of other certain exceptions that occur for only one or two words.

    Got it?

    Now try to list every possible way to pronounce "gh" in a word. You *will* miss some.

    Now realize that you have to learn the entire nominative/accusative system common to European languages *just* for a handful of pronouns (see: I vs. Me, We vs. Us). At least in most languages that do that, it applies everywhere.

    Yeah, English follows the philosophy of "rules are meant to be broken". *Every* rule has at least one exception. Like how adjectives normally come before the noun, except in weird structures like "notary public".

    There's even more things. You know that "th" sound (or rather, sounds, because there's actually two distinct ways to pronounce it)? Yeah, that's pretty much one of the rarest phonemes on the planet. It's in English, Greek, Arabic, Hebrew, Spanish, Swahili, and *nothing* else of note (no, Arapaho is not notable). That's why so many foreigners can't pronounce "th" properly - it doesn't exist in their language.

    My vote for lingua franca? Esperanto. That's literally what it was designed for.

    It's very orthogonal - that massive list at the start of "how to convert a noun from singular to plural" is just one rule: add a -j. That's it. Auto becomes autoj. Kapo becomes kapoj. Letters are pronounced only one way.

    It's not perfectly culture-neutral, but it at least makes a significant effort. It's already widely-spoken enough to have an "installed base", unlike most other invented languages (I'm looking at you, Lojban!)

    And it's Indo-european enough that anyone who knows English, German, French, Russian, or any of those other related languages, will be able to sort-of understand you. Not perfectly, not even half the full meaning will get through, but if I say "mia komputilo estas rompita", you should be able to guess at least "my computer is ____", and hopefully the blue smoke leaking out will tell you the rest.

  16. Re:why in the hell by AngryDeuce · · Score: 3

    Doubleplusungood.

    "Don't you see that the whole aim of Newspeak is to narrow the range of thought?... Has it ever occurred to your, Winston, that by the year 2050, at the very latest, not a single human being will be alive who could understand such a conversation as we are having now?... The whole climate of thought will be different. In fact, there will be no thought, as we understand it now. Orthodoxy means not thinking-not needing to think. Orthodoxy is unconsciousness.

  17. Re:why in the hell by Anonymous Coward · · Score: 2, Interesting

    A study discussed on NPR recently found that when people think specifically in not-their-first language they are more rational and thoughtful. It was hypothesized that thinking in your primary language made it easier for your brain to take emotional shortcuts, making you less rational and coherent.

  18. Re:why in the hell by icebike · · Score: 2

    A lot of our thought process is based on associations and if you know a couple of languages (for me Swedish, English, Dutch and German - in order of proficiency) you also know that nuances are very hard to translate - there is simply no 1:1 perfect relationship between certain concepts within different languages. Those ambigous meanings and cultural associations are a fundamental part of the thought process.

    I agree with this, but only to the extent the "to be saved" language was a written language.

    There are several languages and dialects that are vanishing which were never written, and the only sample of them is the few examples of recorded speech by people trying to save the language. Most, if not all of the native speakers of these languages have passed, except a very few individuals.

    In this case you have nothing but a curiosity, a museum piece like an old rusty shovel. There is no body of literature upon which to apply it, and (at least among north american native and Eskimo populations) its historical legacy consisted of aural traditions, embellish and modified stories which change with each generation (if not each telling), so at best you have something that bears no resemblance to the original story.

    Most of these 3000 languages are merely dialects of some common root language, and have very little value either to the study of linguists, or history unless there exists a fairly large body of writing.

    --
    Sig Battery depleted. Reverting to safe mode.
  19. Re:Hm... by Mindcontrolled · · Score: 2

    Anyway, even if one would impose one universal language on the whole population of the earth, it would probably fragment into dialects and finally multiple mutually unintelligible languages within 2 or 3 generations.

    --
    Ubi solitudinem faciunt, pacem appellant.
  20. Re:why in the hell by Jesus_666 · · Score: 2

    No. I'm fluent in German and mostly fluent in English (and suck at Romance languages) and there are times when the languages just work entirely differently. Let's take the German "doch". It's the proper answer given to express that the assertion made by a negative question is wrong. For instance, for "Haven't you mown the lawn?" the answer "Doch." expresses that I have indeed mown the lawn. While English can approximate the answer with "Yes, I have.", the subtext is different. "Doch" is a negative answer; it implies wrongness on the part of the asker. "Yes, I have", on the other hand, answers in the affirmative.

    Now, this is a simple language construct that could be imported into the English language (although English is already such a hodgepodge that the relationship between spelling and pronounciation is often unintuitive even to native speakers*). But there are other things where it's not as easy. When you hear the word "cadence" you would think of music or perhaps My Little Pony. A German or an Italian might think of guns. Why? The German "Kadenz" can mean both "cadence" and "rate of fire". Similarly, "cadenza" can be extended to "cadenza di tiro" to similar effect. These relations might give a German or an Italian an idea relating to music and guns that would be entirely unintuitive to an American who might only make the connection through Kurt Cobain.

    I'm not going to go all Sapir-Whorf and declare that speaking another language means that you think entirely differently - but you do make different connections simply because often concepts are related differently in certain languages or even jargons. It's not about speakers of certain languages not being able to understand the color blue or polyglots being hyper-intelligent; it's about speaking multiple languages giving you more opportunities for unusual ideas because you have more conceptual relations to work off.

    And no, I'm not a linguist although I briefly considered studying it. Language is pretty interesting.


    * Including people who should really know better. Ghoti anyone?

    --
    USE HOT GRITS WITH STATUE OF NATALIE PORTMAN (NAKED AND PETRIFIED)