Dying Languages, Fading Formats
utopyr writes "A story on BBC News looks briefly at the problems in preserving human languages in digital formats. The scope of the problem? Of the world's roughly 6,500 languages (of which, fewer than 500 are listed here), half will be extinct within the century, as the last speakers die. However, formats are proving even more ephemeral than human memory."
... but why?
If no one is going to speak it again, and it isn't written anywhere, why should it be preserved?
Reminds me of people that are 'pack rats.' Why must you feel compelled to keep something you don't use?
Good quote, too many chars. Seriously, the slashdot 120 char limit sucks!
An interesting problem. Many laymen think a language can be reduced to raw data simply and easily, as if it were computer code. The problem is that we quite simply don't have any tool that can ecapsulate an entire language. According to modern linguistics, the only real version of a human language is that which comes with a human being who speaks it. That's why we distinguish "dead" languages from the others: We may know how to read and understand Ancient Egyptian to some degree, but there is a vast amount of information about that language that is now irretrievable, because there are no living speakers to demonstrate it.
Of course, the flip side of the coin is that there are no living native speakers of Old English either. That is, languages are born and they die just as species do, and this is a natural process. Trying to preserve them all completely intact is simply not possible, any more than freezing a few condor embryos is going to teach us what ecological role the animal played during its heyday.
Libraries, grammars, lexicons are all the genetic information of a language. But there is so much besides that will be lost...
When speaking of dying languages, you can look on the english language for example. More and more kids use expressions like "U" for "you", and "tnx" for "thanks". In my home country we have the the same problem, and we start to look on it as a serious threat to the language.
I think this will somehow make a change to future languages.
Note to self: get smarter troll to guard door.
You do realize, that even if there was one world language, that everyone understood by tomorrow, given enough time, dialects would sprout, then entire new languages?
-
ping -f 255.255.255.255 # if only
Translating the bible? And here I've been thinking that it helped in translating Egyptian hieroglyphs. Boy is my face red.
You are standing in an open field west of a white house, with a boarded front door. There is a small mailbox here.
Yoda is the only known speaker of FORTH.
(or in FORTHese:
FORTH known speaker yoda only is.
)
Unfortunately, it is a dying language.
(or in FORTHese:
language dying unfortunately it is
)
It must be preserved.
(or in FORTHese:
Perserved, it must be.
)
Most of the languages that are slated for extinction probably don't have any documents that need to be preserved in their language only. Lots of them never developed writing on their own, so there's nothing to worry about translating in the future. The British Celts, for example, had this problem. No writing means you have to have a class of scholars (or something like it) to preserve and pass on knowledge. In the case of the Celts, kill the druids, as Julius Caesar did, and their wisdom is destroyed forever.
This is all part and parcel of the so called "digital amnesia" that is taking place. What memories will people have twenty years down the road if everything was comunicated via email. Digital cameras are great for the present but again, how are they preserved over long time periods. Burning stuff onto CDs will work, but most CDs are quite unstable (Verbatim excepted, they use a AZO dye but are more expensive). Even the first video disks made in the 80's aren't playable by anything today, what's to guarantee that CDs will be playable in 20 - 30 years. Printing out digital pictures is no more archival than CDs, most people will print it out on paper which typically is not acid-free and will yellow quickly. Compare this to Kodachromes which look great 50 years later. Cibachromes will last centuries. There's nothing in our new digital media arsenal that can compare. Enjoy your memories now, cause they won't last...
By storing the Data in open formats, and link them with recordings, it should be possible to preserve the languages and their unique history.
I am working right now with LingoTeach and a US university to add a Native American language that is almost extinct to the Free LingoTeach Database, so that future generations have the choice to revive it. Can't say more here, because we are still working out details.
Any help is of course welcome. http://www.lingoteach.org
get 7 free Japanese lessons.
I agree. Let's start by retraining the 275 million people in the United States to all use the metric system like the other 5.8 billion people on the planet do. Then we'll move on to language.
Fascism should more properly be called corporatism, since it is the merger of state and corporate power.
The basic argument was that preserving linguistic diversity would have the corollary effect of preserving cultural diversity (which is good). I found this indirect logic to be somewhat weak. After finishing the book, I did not feel that the authors had given me a good reason to be concerned about the loss of so many languages.
Note that the book focused more on the problem of preserving the languages in society. The authors considered an archive to be a poor substitute for a living, breathing language, much like a recording is a poor substitute for a concert.
Check out Chad's News
the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff
The world does have a common language. English. Just ask any English speaking person. If you're talking to someone that doesn't appear to understand english you just have to speak slower and louder.
All Space Aliens also speak english.
There are still many ancient texts, from dead languages, that have never been deciphered, and some, not from such a distant past. Maybe you would like to give your best shot at some of them. Here is a list of texts and writing systems awaiting to be understood:
Rongorongo, the hieroglyphic script of Easter Island
The Voynich Manuscript, 200 pages, probably written in the 13 century
Indus Valley scripts from Mohenjo-daro in Pakistan, 4000 years ago
Etruscan
The Disc of Phaistos, from Crete, 3700 years ago
Meroitic hieroglyphs of ancient Nubia
Zapotec script
Have fun!
Everyone should speak Klingon! Why should any one group of people have it any easier than any other group? To be completely fair to everyone, the language we unify on should be Klingon.
I'm trying to teach myself to set people on fire with my mind... Is it hot in here?
Or did anyone else expect to see COBOL on the list?
Oh, wait. Human languages...sorry.
The society for a thought-free internet welcomes you.
Here is the answer. When the issue came up, the US decided to go it alone and do things their own way. Now that it is twenty five years later, the need to change over is greater than ever and the expense involved is exponentially higher than it was then. The money that the USA wastes now on converting all incoming and outgoing products is considerable. Far more than the costs of conversion.
Fascism should more properly be called corporatism, since it is the merger of state and corporate power.
OK, I'll feed the troll.
The issue of what language people should speak was never at hand; your comment is a total non-sequetur, and is off topic.
The issue is the preservation of languages which are fast becoming historical. The reason it is a big deal is that we lose part of history if we do not. The language itself is of significance to historians, but futhermore, all of the literature and linguistic art of that culture is lost to us if the language in which they exist is lost to the knowledge of human kind.
Let me give you a little example. You almost certainly are familiar with the word "troubador". You may have a vague sense that is refers to a sort of medieval minstrel.
What it refers to is an elite of songwriters, "trobadors", in the 12th and 13th centuries, famed for the quality of their lyrics, and for the fact that, unlike the "serious" artists of the rest of Europe at that time who wrote in Latin, they wrote in their vernacular. We now call that language "Old Occitan", though they did not call it that.
For some eight centuries -- right through to the present day -- their fame as lyricists was so great that the word for them has become a common noun. Their craft was legendary for centuries after their home land was conquered in the Albigensian Crusade, and their worldly, sensuous art repressed by the Church.
I'm willing to bet you have never heard a single word of trobador verse, neither in the original nor in translation. This is the single most famous body of literature in the history of Europe, and you have never heard a single word of it.
The reason why is that the trobadors loved word play -- e.g. double-entrendres, extremely tight rhymes -- and invented complicated poetic forms (you have a trobador to thank or curse for the sestina). The result is that while the sense of a troubador song may be translated, translating the form, bringing all the witty word play which was the point of their craft, into another language is pretty close to impossible. They even managed to invent a kind of rhyme (rims derivatatius) which is close to impossible to execute in English, requiring, as it does, a syllable's length difference in congugation of verbs or declention of nouns.
So if you want to appreciate the most famous poetry in the history of Europe, you have to learn Old Occitan and read it in the original.
And that is one example of why it is so important to preserve dead and dying languages. So that, should some weirdo in the future actually care about the bounty of the human artistic acheivement through time, the door to the libraries of the past may yet be unlocked by those crazy enough to learn the keys.
We preserve languages for the same reason we don't burn libraries.
-*- Any technology indistinguishable from magic is insufficiently advanced -*-
My own mother language is in serious risk in the mid term. Most of you are native English speakers, so I think that you can not imagine what does this situation mean for us. It is good that there are "large" languages like English that are used arround the world, but the "small" languages are as rich and respectable as English or Spanish or Chinese, and it is important to protect and preserve them as they are part of the cultural heritage of humanity.
I don't want to say the name of my language because I would like to speak in the name of all the speakers of "small" languages. Every word of my son, who is 16 months now, makes me feel very proud. I hope that the sons of my son will one day learn and be proud of our language (and also learn English to be able to read slashdot !)
The thousandth sequential copy of a data CD will be identical to the original, assuming you're verifying the copies each time. You keep copying to new media every few years, which is annoying, but the data will be identical to the original.
The same cannot be said for any analog based data system such as film. If the original is damaged, you're left with an imperfect copy. Of course, pay enough money and your analog copy will be a close reproduction of the original, but it won't be identical.
Dr Fish
The Summer Institute in Linguistics has a much more comprehensive list of languages in their compendium entitled the Ethnologue (Available for perusing online.
UNESCO, an agency of the United Nations has compiled The Redbook of Endangered Languages listing many endangered languages around the world.
Another source for those interested in endangered languages is The Foundation For Endangered Languages.
For those more interested in creating languages of their own, or "conlangs" like Tolkien created, might I suggest Langmaker, Mark Rosenfelder's excellent Virtual Verduria (including his Language Construction Kit), and for those interested in Tolkiens' tongues (such as Quenya, almost unanimously considered the most beautiful conlang created) there is the very informational Ardalambion.
Hope those links will help people interested in the topics of endangered and model languages.
Yes, there is L'Acadamie Francais, but it isn't like most of 'em really care. They will happily steal words like "le hot dog", and "le weekend", because even they understand that "la fin du semaine" is just too long.
And German has the same thing. They publish the Duden and make the schools teach Hoch Deutsch.
But if you want serious linguistic hardasses, look no further than Iceland. They can still read texts from the 13th century. I met an American who was trying to move there (his wife is Icelandic) and the government was requiring that he adopt a traditional Icelandic name so his name wouldn't polute the language.
I imagine that if you polled the American populace in general, you might find more support for this than you think exists. I think you would find that the only group strongly opposed would be the very old and the poorly educated.
Now, try polling the populace in
utter rubbish
Before starting, I should mention that the given estimate for the number of languages spoken today is just that: an estimate. There are areas in the world such as Cameroon, Papua New Guinea, the Congo basian, and the Amazon basin that are constantly yielding new languages. Compounding the problem of an accurate number is the fact that, unfortunately, records and data are not available for all spoken languages and counting all of them is quite difficult. I have personally seen figures in the range of four thousand to fifteen thousand currently spoken languages so don't take that number as gold. (It is, however, as close to being accepted as any other estimate can be.)
OK. Why should a dying language be preserved? People have pointed out the parallel to preserving endangered animal species through environmental efforts or the scramble scientists made to save Mesopotamian artifacts from Iraq before the war broke out and these are both excellent analogies: just because a language is not a physical thing does not mean it is not worth the time, money, and effort to preserve. Wildlife activists fight for the rights of endangered species because they are unique and part of the natural environment of this world. Archeologists do the same for artifacts of human eras long gone and disappeared. Why shouldn't the same be made for languages? A language and the culture surrounding it are inseparable; a language is a living thing, a product of the unbelievable mechanism of the human mind. Chimpanzees can use basic tools to scrape termites out of their mounds but they are unable to communicate using spontaneous, creative language. Ultimately this is what lifts the human race above the rest of this planet's fauna. Preserve a dying language because it is part of the heritage of the entirety of mankind.
Of course, saving a language for its aesthetic value is not the only reason. Linguists (notably Noam Chomsky and Joseph Greenberg) have been trying for at least decades to document and discover the underlying reasons for the existence of language universals. Using simple examples, every language has the concept of a noun and a verb. Why is that? Is it just to facilitate the processing of communication in the human mind or is it innate? Every language that has evolved naturally is complex in its own manner and can express any concept found in any other language; no language is inferior or superior to any other language in facilitating communication. Is this natural? Are there languages out there that are simply empirically inferior to others and die out as its native speakers learn the value of another, superior tongue? Has every language ever spoken been this way?
There are still untold numbers of questions that cannot or have not been answered by contemporary linguistics. Joseph Greenberg is the father of the movement to uncover linguistic universals by studying large sets of data representative of the distribution of the genetic makeup of the world's languages. This approach has yielded many valuable insights into the human creation of language. If a universal is absolute, then perhaps it reveals part of the inner workings of our own minds. The sad truth, however, is that so many languages have been lost before the advent of the written language and since that no universal can ever be proven to be 100% absolute. Does this mean linguists should give up? No, of course not! Perhaps some unique language in the valleys of Papua New Guinea will manifest some exception to an absolute universal, forever changing our views on the human mind. For instance, the language Hixkaryana, spoken by less than 400 natives in the Amazon basin, has a default word order of Object-Verb-Subject. Before the discovery and documentation of Hixkaryana it was thought this word order was so counter to normal human thinking that it probably did not exist. What would have happened if no efforts had been made to document Hixkaryana? Linguists would have been unknowingly deprived of a valuable insight into language typology.
Vilk, from the ranks of the freaks
I'd much rather have one lanuguage with many dialects than the current state of many languages with many dialects. For example, if necessary, even the most uneducated speakers of two dialects of English (take Cockney and Ebonics) could find a common vocabulary on which to build a conversation. They might have to dial it back to the 2nd grade but they'd still find a way to communicate.
You seem to think English is the end all language of all things. Honestly, that's what others thought of Latin and yet look at how many languages outlived it!! Don't bother counting, the list is huge.
:-)
Do your homework... storing these languages will be a way for some with some interest to research how and possibly WHAT factors influenced the language development of various groups through history. For example, Latin may be dead, but it influenced many languages, and in some cases you could trace invasions via accents borrowed from Latin. (Romania is in the middle of the slavic/gallic area yet their language is based on Latin, quite significantly at that.Hungaria is right west of Romania and they speek a completely different language than all those around them (Huns settled there.) All in all at least some study will at least keep track of where we are coming from.)
It is almost like taking family pictures or writing a family tree, only this time with languages. It may not seem like much to the consumerist point of view prevalent now, especially among those of us here in the USA that have NOT been outside the country...
Destroy variety and you'll be left eating hot dogs for the rest of your life. They're not bad, but if it was all you had you'd soon understand why many seek the unusual and the break from the status quo. Preserving some cultures or parts of cultures other than our own might even count as being civilized. (remember our ancestors commiting genocide of entire peoples when they landed here? you should. it is our heritage and forgetting it will let those in power commit those crimes again)
Plus our studies of evolution have barely begun... we need to record some things that aren't fossilized such as art and language. Even if just to leave to future civilizations digging out our leftovers from the ashes of our own stupidity. (ala A.I.)
-DaedalusHKX
PS - parts of this post were not related completely to the article but more to your rants of "its better if everyone spoke english" but I guess many would also say "its better if everyone agreed with the status quo, even if GW threatens to smash all our rights into the gutter so the RIAA and the rest of the corporate world can fatten its portfolio).
PPS - mod as you see fit
" What luck for rulers that men do not think" - Adolf Hitler
I agree that if a language is dying then it should not be saved to perpetuate its use. If the language is dying then it has essentially shown itself to be an inadequate means of expression for the modern world and that it is unable to adapt itself to express new ideas.
That said, I do believe that languages ought to be preserved for academic study since every language is a reflection of its culture and expresses ideas and concepts that are not easily expressed in other languages. For instance, you'll find in a language like Arabic, spoken by desert dwellers and nomads, figures of speech, proverbs, and other expressions depicting the importance of water which would not be found, for instance, in languages spoken by populations in lush, agricultural societies. Something similar could be said regarding regions that experience an abundance of water like South East Asia which has the monsoon season. Although the preceding example is mundane, what I'm getting at is that letting a language disappear is depriving oneself of novel modes of thought and expression.
I think every language has rich concepts to offer other languages. If we don't preserve the languages we do have, we may very well be shielding ourselves from potentially revolutionary ideas.
Counterpoint: Singlish
Singapore, like most former British colonies, has an education policy to teach its school-kids primarily in English. Curiously enough, it's produced a generation that needs a campaign to speak proper English and another campaign to speak Mandarin, the mother-tongue of more than 70% of Singaporeans. One naive, probably superficial, comment we'd make is that young Singaporeans are neither here nor there; they insist on mixing Mandarin grammar and Hokkien words to produce English sentences. The government, apparently, is so worried that Singapore might lose its "natural advantage", that it has a set of "approved" words to be used in locally-produced English-language television shows.
Clearly, it has been very difficult to teach and sustain a standard, uniform, international language for 30 years in a population of 4 million. Now consider the challenges involved in doing this for the entire world.
Let's face it; even if everyone learns and speaks in English, there will still be geographical differences in dialect. The differences will lead to new languages. Just as it has been happening over the last few millenia.
More than mere navel gazing.
Language is probably a bigger barrier on the internet than any firewall or censorware. You can only search Google with words you know. If there are web sites written in a language no one knows anymore, they are effectively lost.
destroyed the last remnants of the Klamath language. (The Klamath is a tribe of native Americans along the border of Oregon and California). There was a project to document the entire language on a set of CD-ROM's, since the only person left alive did not have too many days left. After the CD-ROM's were recorded, 3 teens from the tribe stole the CD's and destroyed them for whatever reason. As a result, the Klamath language is now lost forever.
There are a huge number of yeast infections in this county. Probably because we're downriver from the bread factory.
"the problem is that a highly US-driven mono-culture is sweeping the world."
Now, if it were French that was the basis of the the mono-culture, then the development of a common language would be considered giving culture to to world...not taking culture from the world.
The really funny thing about what is happening now is that the US is not as actively trying to create a mono-culture as the French, Chinese, Japanese, Arabic, Russian and other imperialist nations did in the past.
When the Europeans were the great imperialist powers, you would find great and glorious writings about bringing culture to the backwards people of the world. Even in Bush's war with Iraq, there is a rhetoric of giving back the country to the Iraqis...the war lacks the imperialist overtones of most historic military excursions.
I suspect that the main reason we hear so much yapping about the issue now is that much of the "mono-culture" is being influenced by a country (the USA) that the French and other Europeans consider to be filled with lesser people. If the world were learning French, the development of a monoculture would likely be considered an enlightenment.
Language is a city to the building of which every human being brought a stone. --Ralph Waldo Emerson
Well we're trying at towerofbabel.com
My name is Carlos Montoya. You share files of my music. Prepare to die.
After reading the posts, has anyone noticed that the grammar, diction and spelling of those who are pro-preservation of dying languages, is, for the most part, better than the grammar, diction and spelling of those who wouldn't mind languages dying off?
Ah! The metric system is really superior. Do you know how many feet in a yard, how many feet in a mile, how many yards in a mile? 12 inches in a foot. Have you ever measured something in thought, is that 3/8 of an inch or 5/6. Which is smaller. 2 cups in a pint, 8 pints in a gallon, but how many teaspoons in a cup? How many ounces in a pound? 16. How many pounds in a ton? 2000. All seems arbitrary to me.
The US system is stupid and as an American I'd love to see that switch. Then I'd know the answer to mostly everything measurement wise is a factor of ten! 10 millimeters to a centimeter. 100 centimeters to a meter. How many millimeters to a meter? Easy! Is there an in between? I think it's a decameter but I could be wrong. Anyway divide by 10. How many meters in a kilometer? 1000! Easy. How many grams in a kilogram? Read the last question stupid! ^.^
The canadians way back when (I think 20's or 30's) switched driving from the left handed side (like the brits) over to the right handed side. How did they do it? Overnight they had their army switch the road signs while not allowing anyone to drive and voila, it was done.
How did the Europeans switch from their currencies to the Euro? Several years planning and public discussion, but when it came time, overnight (I think) they made it official and the actual conversion took place over the course of several months.
That is how metric conversion should be done. Everything from now on is labeled metrically. No ands, ifs, or buts. People will complain, but if forced to use it, they will, and this standard nonsense will be forgotten in less than a generation. Metric education as it is now is stupid, a waste of money, and worthless on my 'use it or lose it' principle on how human minds work.
Some people here say that disused human langauges are better off forgotten. And yet I bet the same people would prize every new computer languages ever invented. Even Intercal. And Parrot.
No matter how cynical you become, it's never enough to keep up.
This is exactly why we do not know our past. The percentage of us who care about it is always just low enough that it does not get done well enough in the end.
Not that we are bad, it's that we have other more pressing matters like survival.
Those languages combined tell us a story we will have a much harder time understanding without many of them.
I have often wondered about religion and why it exists. This question is always tied up with our lost roots.
Since each of us always asks these questions at some point, work done to save these languages makes sense. It also makes their loss real once you think past the purely practical matters.
Blogging because I can...
We have invented SMS txt msgs and l33t-sp34k!
When I am king, you will be first against the wall.