Slashdot Mirror


Archiving Digital Data an Unsolved Problem

mattnyc99 writes, "It's a huge challenge: how to store digital files so future generations can access them, from engineering plans to family photos. The documents of our time are being recorded as bits and bytes with no guarantee of readability down the line. And as technologies change, we may find our files frozen in forgotten formats. Popular Mechanics asks: Will an entire era of human history be lost?" From the article: "[US national archivist] Thibodeau hopes to develop a system that preserves any type of document — created on any application and any computing platform, and delivered on any digital media — for as long as the United States remains a republic. Complicating matters further, the archive needs to be searchable. When Thibodeau told the head of a government research lab about his mission, the man replied, 'Your problem is so big, it's probably stupid to try and solve it.'"

405 comments

  1. Microsoft to help! by UbuntuDupe · · Score: 5, Funny

    I can't wait to hear Microsoft's explanation why the project should use one of their proprietary formats.

    1. Re:Microsoft to help! by Anonymous Coward · · Score: 0

      Using a proprietary format will keep your grandkids from reading private stuff you may have written using your favorite word processor. Easier to use than encryption. .. Hmm.. Well slightly.

    2. Re:Microsoft to help! by blindd0t · · Score: 0

      Irregardless of what their explanation is, they should not need more than 640K.

    3. Re:Microsoft to help! by 19thNervousBreakdown · · Score: 4, Funny

      That's not a word.

      --
      <xml><I><am><so><damn>Web 2.0</damn></so></am></I></xml>
    4. Re:Microsoft to help! by Anonymous Coward · · Score: 0

      It's not even a real quote either :(

    5. Re:Microsoft to help! by Anonymous Coward · · Score: 1, Informative

      Sure it is, it means without lack of regard. :-P

    6. Re:Microsoft to help! by RAMMS+EIN · · Score: 5, Funny

      Our formats are industry standards. They are backed by Microsoft, a robust company which has withstood vigorous competition, lawsuits, the .com burst, and the Bolshevik revolution brought about by Stallman et al. Where other companies have folded, Microsoft has flourished. With a known track record of backward-compatibility, your documents are safe with us. Trust us. We _invented_ trusted computing.

      And remember: nobody ever got fired for buying Microsoft.

      --
      Please correct me if I got my facts wrong.
    7. Re:Microsoft to help! by mrchaotica · · Score: 1

      That's unpossible!

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    8. Re:Microsoft to help! by sunny256 · · Score: 1

      And remember: nobody ever got fired for buying Microsoft.

      If they can afford to buy Microsoft they probably don't worry about getting fired neither...

    9. Re:Microsoft to help! by Jugalator · · Score: 1

      I'm sure there archival product will work great, but it's just that after two years, it will have been rendered incompatible with the archived data format..

      --
      Beware: In C++, your friends can see your privates!
    10. Re:Microsoft to help! by hdante · · Score: 1

      Mod parent funny. ;-)

    11. Re:Microsoft to help! by sentientbeing · · Score: 1

      Its a perfectly cromulent word.

      --

      ------
      beware he who would deny you access to information, for in his mind he dreams himself your master
    12. Re:Microsoft to help! by timmarhy · · Score: 1

      no body ever got fired for buying linux either, because it's free you fucking tard.

      --
      If you mod me down, I will become more powerful than you can imagine....
    13. Re:Microsoft to help! by Anpheus · · Score: 1

      Anyone who pays for Linux (not support, for Linux) deserves to be fired.

    14. Re:Microsoft to help! by strikethree · · Score: 1

      WTF? By the time they get to "Bolshevik Revolution" the mods should have realized you were +1 funny, not +1 insightful. Ah well, if only they would share whatever they are smoking. :)

      strike

      --
      "Someone needs to talk to the tree of liberty about its ghoulish drinking problem." by ohnocitizen
    15. Re:Microsoft to help! by Anonymous Coward · · Score: 0

      Our formats are industry standards. They are backed by Microsoft, a robust company...

      You have obviously never taken one of those old Word for DOS files and checked out the formating losses. And no, I didn't run every upgrade and patch saving those files over the years.

      I prefer the open standards for documents, including plain Jane HTML, PNG ofr graphics, MP2 and MP4 for movies, MP3 for audio. Since much of Microsoft can't really master these, I don't use much in the way of Microsoft tools. Ever try to burn a LEGAL ISO on a XP machine using only Microsoft software?

      The trick for long term storage is simple, keep the formats open, simple and vendor generic as possible. Backup to common media like CD-R and DVD and test the media often, placing copies in at least 2 places. Not rocket science, just basic realism with FFUD.

    16. Re:Microsoft to help! by OldHawk777 · · Score: 1

      "And remember: nobody ever got fired for buying Microsoft."
        And remember: nobody ever got fired for blaming Microsoft.

      I have noticed for years now that buying and blaming Microsoft products can be a career saver for TEK incompetence.
      Microsoft is always a good tech/admin-excuse to management in a non-technology company.

      --
      Unaccountable leaders are masters, and unrepresented people are slaves. How do US and EU fare?
    17. Re:Microsoft to help! by RAMMS+EIN · · Score: 1

      ``By the time they get to "Bolshevik Revolution" the mods should have realized you were +1 funny''

      Yes, I would have thought so, too. On the other hand, it's not without a grain of truth: the open source movement, particularly w.r.t. GNU/Linux, is often depicted as a band of anarchist rebels, fighting the great capitalist, Microsoft. In fact, this image is being spread by both advocates and detractors of Linux and open source, but Microsoft has been particularly colorful in labeling Linux a cancer that attaches itself in an intellectual property sense to everything it touches. I'm not aware of any Microsoft spokesperson actually having called Linux communist or Bolshevik, but if they had, they wouldn't have been alone.

      --
      Please correct me if I got my facts wrong.
  2. Not too long... by Electrode · · Score: 5, Funny
    "for as long as the United States remains a republic."

    So, they're shooting for about 10 years then?

    1. Re:Not too long... by MECC · · Score: 2, Funny
      "for as long as the United States remains a republic."

      So, they're shooting for about 10 years then?

      10 years or the next presidential election - whichever comes first

      --
      "We are all geniuses when we dream"
      - E.M. Cioran
    2. Re:Not too long... by eln · · Score: 5, Interesting

      Your timeline may be a little off (at least I hope so), but you're right that it's a silly goal. Whether the US has 10 or 1000 years left, history shows us it will most likely fall at some point, and that point will be fairly soon when compared to the entirety of human history.

      Making a format that will survive a thousand years so long as our advanced civilization is still around and still cares is pointless, because as long as there is a continuous line of people that care, they will be willing to transfer at least the more important stuff to new media. The trick is coming up with something that will still be readable when archaeologists dig it up 10, 50, or 100 thousand years from now.

    3. Re:Not too long... by Anonymous Coward · · Score: 3, Insightful

      I've been wondering, with our global nature now, will we need archeologists in the future? While I believe cililiziations will surely 'collapse', won't we all be around to immediately take note of it, and update Wikepedia? Seriously, I don't think we're going to be digging for stuff from this time, the global nature of our society leads me to that conclusion. It's not like when Greek society fell.

    4. Re:Not too long... by thelost · · Score: 4, Insightful

      the trick is... hoping that in a hundred thousand years people still care at all about their past. The slow realization as I read Isaac Asimov's Foundation saga about the origins of the Galactic Empire chilled me, mostly because the people of the empire had become so numb to their past as to have made it vanish entirely.

      --
      Promote Charity on Myspace, Show Your Colours!
    5. Re:Not too long... by Anonymous Coward · · Score: 1, Interesting

      Using a book of fiction for anything isn't usually a good idea.

      Granted it's not like most people care nowadays. Look at any slashdot discussion on education, rather sad how people complain about having to take history (heck or any subject they're not "interested in deeply") in school. People want to be ignorant sheep.

      Hell look at Xena or the dozens of other "historical" tv shows out there, I shudder to think of how many people's knowledge of history is probably based on such crap alone. In 20 thousand years they'll have Princess Diana was running around with a lightsaber killing communists or something.

    6. Re:Not too long... by eln · · Score: 5, Funny

      In 20 thousand years they'll have Princess Diana was running around with a lightsaber killing communists or something.

      Are you trying to say she didn't do that?

      Crap, I am so getting an F on my history paper.

    7. Re:Not too long... by nine-times · · Score: 2, Insightful

      As much as anything, it seems like we might worry about people rewriting the past. It'd be hard to edit part of one of the original copies of the US Constitution without anyone being able to tell the difference, because we actually have a really old piece of paper that someone would have to get access to, somehow erase some ink, and write over top with identical ink.

      But a historical document in the form of a text file on someone's hard drive? That can be edited without a trace.

    8. Re:Not too long... by dbIII · · Score: 1
      Whether the US has 10 or 1000 years left, history shows us it will most likely fall at some point

      Rome fell a very long time after it stopped being a republic. We've already had our equivalent of pirates burning Ostia and leaders trying to be king afterwards. I think George has blown his shot at being George III and things will stay as a republic - barring unlikely actions by uncontrolled intelligence agencies going rogue if a future leader tries to reign them in.

    9. Re:Not too long... by jonbryce · · Score: 1

      Well Britain (or more specifically England) has survived in more or less its current form since 1066, and I don't think it is about to collapse any time soon. Part of the reason is that it is flexible enough to evolve with the times, but not so flexable that it self destructs.

      Switzerland hasn't been around for quite so long, about 800 years, but has probably been more stable in that time, and if you look at some of the Cantons that came together to form Switzerland, you probably have political stability going back further than that. Britain came into being in 1707 when England joined with Scotland.

    10. Re:Not too long... by Anonymous Coward · · Score: 0

      http://www.heliograph.com/diana/ Too late, it's already happened.

    11. Re:Not too long... by constantnormal · · Score: 1

      ... to transfer at least the more important stuff ...

      And who decides what is important? And will they make that decision before the facilities to read the "important stuff" are inaccessible except via some long-dead proprietary format requiring some long-dead program to read it, that requires some long-dead operating system to run it, that requires some long-dead chip architecture to run it ...

      And then there is the choice of media used to preserve this "important stuff". If we are preserving it across collapses in civilization, a series of encodings should be used, with the header of each one containing a description of either the technology needed to read the next deeper level, or where to find said description.

      When we talk about accessing digital data, it's a bit different than accessing historical data, as historical data has thus far been readable using readers each of us carries around in our eye sockets. Digital data assumes and requires a certain level of technology to access it. If we only care about maintaining the data going forward, then periodic copying to the then-current media will handle things adequately. But if we make no foolish assumptions about what level of technology will exist a thousand (or even a hundred) years from now, then we need to consider ways to communicate the required technology to those who may be attempting to access the archived materials.

      Design of encoding algorithms should take into account the possibility that chunks of the data may become missing over extended periods of time, along with markers and tags embedded in the material to enable the readers to function. Perhaps a RAID-5 style striping of parity data would be a wise precaution. And of course, nothing electronic or magnetic should be a candidate for a long-term storage medium. Maybe burning pits into a gold disc might be appropriate, assuming that future archeologists do not melt down the platters for coins.

    12. Re:Not too long... by FooAtWFU · · Score: 3, Interesting
      I've been wondering, with our global nature now, will we need archeologists in the future? While I believe cililiziations will surely 'collapse', won't we all be around to immediately take note of it, and update Wikepedia?

      Archaeology is the search for fact. Not truth. If it's truth you're interested in, Doctor Tyree's Philosophy class is right down the hall. So forget any ideas you've got about lost cities, exotic travel, and digging up the world. We do not follow maps to buried treasure, and 'X' never, ever marks the spot. Seventy percent of all archaeology is done in the library. Research. Reading.

      -- Indiana Jones and the Last Crusade
      --
      The World Wide Web is dying. Soon, we shall have only the Internet.
    13. Re:Not too long... by nido · · Score: 3, Interesting

      Granted it's not like most people care nowadays. Look at any slashdot discussion on education, rather sad how people complain about having to take history (heck or any subject they're not "interested in deeply") in school. People want to be ignorant sheep.

      History is interesting, school makes it suck: "In Year ABC, XYZ happened. Test next week - students who regurgitate well will get an 'A'."

      People don't want to be sheep - totalitarian governments need populations to be docile. School is designed to suck the uniqueness out of children so, as adults, they'll take up a spot on a standardized assembly line.

      Kinda cruel how the government has encouraged the shipping of assembly line jobs to China... Dumb down the population, then get rid of the reason for the dumbing-down.

      See Gatto's Underground History, for example.

      --
      Learn the rules so you know how to break them properly.
      www.teslabox.com
    14. Re:Not too long... by tcc3 · · Score: 1

      Even relatively short term archiving is a problem. There are many musical perfomances, political speeches and other historically relevant data that is already or will soon be lost. Just because no one bothered to transfer to to a more durable medium. How many books are out of print?

      These items are snapshots of humanity and its a shame that they are lost to history.

      Sometimes it doesnt take thousands of years to lose things we'll wish hadnt been lost.

    15. Re:Not too long... by Anonymous Coward · · Score: 0

      Xena is a historical show? I wish I could mod you as funny.

      Using a book of fiction for anything isn't usually a good idea? A great deal of what we have today was first thought up in a science fiction book, and people back then thought it was ludicris.

      10,000 years from now there might be a Princess Diana (of the American Empire *nudge* *nudge*) Who actually did wield a lightsaber and go around killing something.

      People are ignorant sheep cause they choose to be, the path of least resistance is the interstate most will follow.

      And like some of your other replies, schools suck cause we are regurgitating the same crap out of college to further teach our children. School has quite literally become a prison facility, run as a despot, with government and management's thoughts focused on their careers and not our children, and that is the major fundamental problem.

    16. Re:Not too long... by Pollardito · · Score: 4, Funny

      quick, let's update wikipedia to say she did, then you'll have a source for your paper

    17. Re:Not too long... by hurfy · · Score: 1

      Good Point... i know you would rather have the mod point but life's not nice ;)

      I found a history book in the alley. I actually took it home and read it :) The book seemed alot better than the class at least from my foggy memory, hehe. Wasn't bad really tho it was in small bite sized chunks. If it didn't fit on one page it must have changed the world ;) The overlapped of history periods cause it focused on geo areas primarily was a bit confusing sometimes tho.

    18. Re:Not too long... by Chimera512 · · Score: 1

      what about the destruction of original manuscripts, errors in transcription or translation and quoting unavailable works out of context didn't occur? seems to me things like what you describe has been happening throughout history, the technology just makes it easier. just because its an obvious example, look at the plethora of un-canonized Christian texts like the gospel of Thomas, the early church largely succeeded in burying documents like those.

    19. Re:Not too long... by timeOday · · Score: 1

      For purposes of this discussion, I would also count the United States as part of England (i.e. part of the English tradition). There has never been such a break in tradition that we abandoned or willfully destroyed our memory of the past, which is really the issue here. It is one thing to preserve information assuming the complicity of future generations, and quite another to preserve it irrespective (or in spite) of them.

    20. Re:Not too long... by nine-times · · Score: 1

      True, but in those times, there wasn't an expectation of completeness. Now, like you said, it's just easier all around. It's easier to make documents, easier to store them, and also easier to alter them. We often don't even keep old copies of documents, but instead alter them in-place and re-save, which isn't really feasible with paper. Still, we have the expectation that everything is being captured and stored, so we don't even bother to remember things ourselves.

    21. Re:Not too long... by daigu · · Score: 1

      What exactly is "the more important stuff"? Egyptians thought the Pharoahs were pretty important - big monuments and all of that - yet for many years no one could figure out what those hieroglyphs meant.

      Indus script is another example, still undecipherable. You could also check out the Straight Dope for other examples, i.e., Linear A (Greece, 1800 B.C.), Zapotec (Mexico, 500 B.C.), Meroitic (Sudan, 300 B.C.), Isthmian (Central America, A.D. 200), and Rongorongo (Easter Island, A.D. 1800).

      I guess the point I am trying to make is that what is most important may not be obvious. 500 years from now people might wonder what daily life was like today and find the gossip columns of our day a treasure trove of information, whereas if we had to consider it we might come to the conclusion its all dreck.

      It's not a good idea to let the present decide what's important for subsequent generations. Because what their needs are and perspective will be quite different from ours.

    22. Re:Not too long... by lymond01 · · Score: 1

      I suppose passing of information is the biggest determination of whether we'll need archaeologists. If things go as planned, we should have a lot more information available about the 20th century in 2000 years than we have about the 1st century now, making archaeology somewhat unnecessary.

      Assuming the Turkey's plans for revenge on the human species don't come to fruition too soon.

    23. Re:Not too long... by markovg · · Score: 1

      "as long as there is a continuous line of people that care, they will be willing to transfer at least the more important stuff to new media..."

      Enter P2P. The most robust distributed archive on the planet... if the file is worth preserving. But I find it amazing sometimes what these "data collectors" are personally willing to shell out for drive space for files/media of questionable merit.

      So let me rephrase that: The most robust distributed archive on the planet... of South Park, the Evil Dead Film Series, and Conspiracy Theory Movies

    24. Re:Not too long... by mrmeval · · Score: 1

      The longest lasting information storage mediums known are animal products and paper, says a lot about modern technology. :)

      I doubt printing it in an encoded system onto acid free paper is viable. I've thought on this a small bit and the best I could come up with was etched aluminum. Then a recent article on aluminum oxide made me think it might be quickly and permanently markable with a laser but without the high temperatures you'd think would be needed to 'etch' the surface.

      Now I cannot find the article, bother.

      --
      I'd go on a Vegan diet but the delivery time from Vega is too long. --brownkitty
    25. Re:Not too long... by Anonymous Coward · · Score: 0

      What kind of two bit high school did you go to? Where I went ( a public school ), History was explained rather well and pretty interesting. Under no circumstances has what you said ever happened in any history class I've taken and that stuff probably doesn't happen anywhere, except maybe in the LAUSD.

    26. Re:Not too long... by Gothmolly · · Score: 2, Insightful

      Wrong. People DO want to be sheep. How many do you know that willingly trade freedom for security? People don't want to think; they've been educated for years not to. Philosophers have removed the cause-effect link, and so the large majority of people cannot distinguish between a man-made disaster and a natural one. My only solace is that I'll be worm-food by the time the end comes.

      --
      I want to delete my account but Slashdot doesn't allow it.
    27. Re:Not too long... by robvs68 · · Score: 1

      And every once in a while, 'X' actually does mark the spot.

      -- Indiana Jones and the Last Crusade

    28. Re:Not too long... by visgoth · · Score: 1
      ...and things will stay as a republic - barring unlikely actions by uncontrolled intelligence agencies going rogue if a future leader tries to reign them in.

      Don't worry, if our modern Praetorans act like their ancient bretheren, we'll have some interesting times up ahead.

      --
      My patience is infinite, my time is not.
    29. Re:Not too long... by cultrhetor · · Score: 2, Interesting

      Mod parent up. The most interesting aspect of history - one that's never taught in high school - is that it is a constructed narrative: the writings and accounts of several pieced into varying "histories." American high schools teach history as frozen in time, according to the same revisionist history that they were taught fifty years ago: Columbus, "Injuns," Pilgrims, the whole nine yards.

      --
      "Tu fui, ego eris" - Virgil
    30. Re:Not too long... by Anonymous Coward · · Score: 0

      well how would anyone be able to read or know how to use, or even interface with this storage device 10k or 50k years down the line?

      like imagine technological advances thus far in the last 50 years, might as well put all of the worlds history on punch cards, its recent enough that in the year 52006 they couldnt tell the difference through carbon dating. why not carve it into stone? that will certainly last, the thing is will anyone be reading or speaking english? or any contemporary language as we know it.

      and thats if we even survive that long, considering the way population increases, there will be over 20 billion people here in 100 years, 80 billion in 200 years and so on.

      yep, better preserve the old web, im sure future generations will be transfixed at thier terminal, reading this very thread. rapt with interest.

      seriously though, what important information do we really have to record, the contestants of american idol during its run on the air. or the annotated ebaumsworld.

      please get real

    31. Re:Not too long... by 0racle · · Score: 2, Informative

      No. he said "X never, ever marks the spot.' and then later, 'X marks the spot.'

      Personally I think it's wonderful that the Romans were so kind as to give us such a great plot device.

      --
      "I use a Mac because I'm just better than you are."
    32. Re:Not too long... by BoberFett · · Score: 1

      That's the real beauty of the ease of making digital copies. Sure, you can change the copy on your hard drive. But what about the million other copies out there? That's one of the reasons DRM has to die, if culture is made inaccessible, it will eventually be lost completely.

    33. Re:Not too long... by ExFCER · · Score: 2, Informative

      Orwell's beliefs about the control of the past, including the recent past, also derived from his experiences in the Spanish civil war, where he found that "no event is ever correctly reported in a newspaper, but in Spain for the first time I saw newspaper reports which did not bear any relation to the facts." http://www.ihr.org/jhr/v06/v06p--9_Bennett.html Just a point on your side...

    34. Re:Not too long... by ExFCER · · Score: 2, Interesting

      Sorry for beating a dead horse. "indifference to objective truth is encouraged by the sealing off of one part of the world from another, which makes it harder and harder to discover what is actually happening. There can often be doubt about the most enormous events... .The calamities that are constantly being reported -- battles, massacres, famines, revolutions -- tend to inspire in the average person a feeling of unreality. One has no way of verifying the facts, one is not even fully certain that they have happened, and one is always presented with totally different interpretations from different sources. Probably the truth is undiscoverable but the facts will be so dishonestly set forth in that the ordinary reader can be forgiven either for swallowing lies or for failing to form an opinion ..."

    35. Re:Not too long... by Anonymous Coward · · Score: 0

      you're right...

      hopefully, when the 'long tail' economy fully kicks in, human culture will finally recognize the VALUE of uniqueness, creativity and non-conformity.

      who wants rupert murdoch ruling the world forever??

    36. Re:Not too long... by Anonymous Coward · · Score: 0

      > While I believe cililiziations will surely 'collapse', won't we all be around to immediately take note of it, and update Wikepedia?

      Hoo boy! I can just imagine examining the history on THAT article.

      "View (previous 500,000) (next 500,000) (20,000,000 | 50,000,000 | 100,000,000 | 250,000,000 | 500,000,000)."

    37. Re:Not too long... by myowntrueself · · Score: 1

      Meanwhile in much of Europe a good chunk of recent history has been frozen into an un-revisable, legally irrefutable image.

      I am, of course, talking about the laws against so-called 'holocaust denial' which effectively make it a crime to uncover any historical data which would refute the official story of what happened.

      Even if it were a matter of historical fact that, say 5 million Jews were killed in the holocaust instead of 6 million it would, in effect, be illegal to tell the *truth*.

      --
      In the free world the media isn't government run; the government is media run.
    38. Re:Not too long... by k12linux · · Score: 1
      History is interesting, school makes it suck: "In Year ABC, XYZ happened. Test next week

      I absolutely hated history class. I thought I hated the subject of history in general and didn't see a ton of value in studying it.

      Now, with the help of things like reasonably accurate documentaries on the History channel and the like I find it fascinating. In class I learned WHAT people did but not WHY. Seeing events fleshed out with background explanations and real human motivations (many times flawed ones) made the difference. Now the saying, "Those who cannot remember the past are condemned to repeat it," makes sense.

    39. Re:Not too long... by DarkAxi0m · · Score: 1
      Seventy percent of all archeology is done in the library
      yup and 52.642 percent of all statistics are made up on the spot ;)
    40. Re:Not too long... by nido · · Score: 1

      I've a copy of 1984., but haven't gotten to it yet. Hopefully when I learn how to read... (Gatto's A different Kind of Teacher pointed out to me that I haven't really learned, at least not yet).

      Mr. Orwell does seem like he was somewhat presentient - he 'saw the future', and recorded his visions in 1984. Just look at how much of his book has come to pass in the nearly 60 years since he wrote it.

      Thanks for the link.

      --
      Learn the rules so you know how to break them properly.
      www.teslabox.com
    41. Re:Not too long... by eno2001 · · Score: 1

      The problem with your suggestion is that you refer to the stuff that's "important" enough to be transferred to newer media/formats. However, I suggest that things like old photos from the 10th and 20th century that WEREN'T "important" in the large scheme of thing survive to this day. Although they're not "important" to the survival of society, the are extremely important to the understanding and relation to the past by future generations. We get to experience a bit of the past through just photos alone. Add to that written text even personal letters and you have unimaginable cultural wealth. The kind of wealth that will be lost forever if we don't preserve this stuff on something human readable or devise a guaranteed readable format. I think the comment that the problem is so big as to be stupid to try and solve is EXACTLY on the mark. I had suggested a while back that we'd be better off taking quantum snapshots of the entire solar system and transmitting the datastream to remote locations in the universe for possible recovery in the future. I even suggested that there be an automatic way of rebuilding the solar system as needed to recover lost eras. Of course I was called a crackpot even though the science is sound. But at least I made a suggestion. I, personally think that the data requirements for processing such snapshots and the space to hold the data would be much smaller than the solar system itself.

      --
      -"...bad old ideas look confusingly fresh when they are packaged as technology" - Jaron Lanier (Digital Maoism on Edge.o
    42. Re:Not too long... by Anonymous Coward · · Score: 0

      The US is a republic???

      My suggestion: rather get started on time travel. That will solve both the data format problem as well as the republic ... errr ... difficulty.

    43. Re:Not too long... by Builder · · Score: 1

      I loathed history at school. I hated it so much that I thought I just wasn't interested in the past and for 8 years after leaving school, I just never bothered to even read about the past.

      Then I discovered that history doesn't _have_ to be boring. If well researched, well argued and well presented, it can actually be facinating. Now I've gone from almost exclusively reading fiction, to pretty much just reading non-fiction. Our past is fascinating, but schools kill it for people!

    44. Re:Not too long... by mattpalmer1086 · · Score: 3, Interesting

      No one seriously working in digital preservation is trying to make a single thing that will last for 50, 100 or 1000 years. The point is not to preserve information in the event of a total civilization collapse, to make it easier for future archaologists, or some such scenario. The point is to keep our historical digital records *currently* readable at any given point in time. If our civilization collapses, it will be up to those who come after to figure out what we were up to.

      There are two basic strategies to keep our digital files *currently* accessible:

      1) emulation. Check out IBM's Universal Virtual Computer project.
      2) migration. Not only migration of storage media, but migration to new and currently readable formats.

      We will need to migrate all of our digital files every 5-10 years or so to keep them current. And yes, information will get lost along the way - everything decays eventually.

    45. Re:Not too long... by ADamiani · · Score: 1

      Actually, I think there's already an pen-and-paper RPG about that, called "Diana, Warrior Princess"....

    46. Re:Not too long... by Anonymous Coward · · Score: 0

      Don't forget early films. All that old, old nitrate stock auto-oxidising, and thanks to the Mouse and his insane copyright durations, no legal way to copy them onto newer more stable media, or to digitise them...

    47. Re:Not too long... by winnabago · · Score: 1

      I'm surprised nobody has mentioned the radioactive warning signs meant to be legible in 10000 years. Every one of these suggestions of digital media, electronic, or solar powered durable signals was considered, but in the end it was decided to go with architecture - simple concrete with engravings.

      --
      Dammit Otto, you have lupus.
    48. Re:Not too long... by Jesus_666 · · Score: 1

      Depends. If they found out that one concentration camp accounted for people that weren't there for some reason they would accept that fact. However, some people do make some rather outrageous claims up to and including that concentration camps never existed (which today can still be refuted easily because some of the camps' former guards are still alive).

      It's not illegal to uncover historical data - it might be controversial, especially if your sources are unreliable, but it's not illegal. However, claiming that things for which we do have lots of solid evidence did not happen is illegal. The stuff might have been entirely doctored, but that's not very probable.

      --
      USE HOT GRITS WITH STATUE OF NATALIE PORTMAN (NAKED AND PETRIFIED)
    49. Re:Not too long... by mrmeval · · Score: 1

      The misinterpretation of such symbols have been used countless times for scifi stories and in animated and live action movies. :)

      --
      I'd go on a Vegan diet but the delivery time from Vega is too long. --brownkitty
    50. Re:Not too long... by ExFCER · · Score: 1

      http://www.fastcompany.com/online/40/wf_gatto.html

      Well said and insightful... //modup+2//

  3. How is this different by zappepcs · · Score: 5, Insightful

    than the previous ages where all information was kept on paper or in spoken words? The problem isn't so much how to invent something that will always be readable, but some way to always have the applications to read it. If it were not for the Rosetta Stone, much of what we know about the ancient world might still be a mystery.

    1. Re:How is this different by quanticle · · Score: 3, Interesting

      Its different because of the sheer volume of information being created today. Ancient cultures were not creating millions of pages of information every day.

      Your Rosetta Stone analogy is inappropriate. We have not discovered any sort of Rosetta Stone for the ancient Maya hieroglyphs but we have had success in deciphering them because we can apply linguistic analysis techniques to figure out what words correspond to what actions/things. Its a little more complicated for abstract concepts, but you can figure out a surprising amount from basic language knowledge.

      --
      We all know what to do, but we don't know how to get re-elected once we have done it
    2. Re:How is this different by 7macaw · · Score: 1

      So we just have to include a language textbook with every pack of archive DVDs!

    3. Re:How is this different by nine-times · · Score: 2, Insightful

      How is this different than the previous ages where all information was kept on paper or in spoken words?

      Paper actually holds up rather well as an archival medium. Plus, you don't need specialized technology to read it.

    4. Re:How is this different by s20451 · · Score: 4, Interesting

      Say western civilization is disrupted for a period of time that is short by historical standards -- 40-50 years would be enough. Electrical power is only sporadically available, and as a result the Internet collapses and PCs become useless. With much more important issues to deal with, such as finding food, people ignore digital data storage.

      The era of restoration comes. However, when people blow the dust off those old DVDs and players, they discover that the DVDs have decayed to the point of unreadability. Massive quantities of archived data and knowledge are irretrievably lost.

      The main problem in our age is thermodynamics -- information is stored so densely that it tends to decay naturally, on its own. By contrast, ancient stone carvings (as well as their keys, such as the Rosetta stone), are sufficiently durable to last (basically) for ever.

      --
      Toronto-area transit rider? Rate your ride.
    5. Re:How is this different by ThosLives · · Score: 4, Insightful

      It's not so much the Rosetta stone, but the fact that a "Rosetta stone" has a built-in context - it's obviously communication or artwork of some kind. If you have a big pile of digital data, what is it? An image? Compressed text? Audio? Just a sequence of numbers? The thing "printed" information gives you is that the presentation of the data gives you an idea of what it is - we don't yet have any digital data formats for which the presentation of the data gives an idea of the content; in fact, most digital storage mechanisms present all types of information in identical manner.

      That's the real challenge - devising a digital storage format in which presentation can be used to apply context to the data.

      --
      "There are a dozen opinions on a matter until you know the truth. Then there is only one." - CS Lewis (paraprhase)
    6. Re:How is this different by Threni · · Score: 1

      > An image? Compressed text? Audio? Just a sequence of numbers?

      Stick the info in a .nfo file. Keep the data on an array of hard drives, using RAID or similar. Keep backups of the data. If a drive fails, replace it.

      Next problem?

    7. Re:How is this different by hairpinblue · · Score: 1

      Use a holographic image embedded in another ceramic of sufficient strength to last the test of time which is placed to use the Hoover Dam wall (or similar large flat surface) as both the display screen and the surface with which to reflect and focus radiation (from somewhere) through the hologram. It is possible that the ceramic used to encase the holographic image will interact with the radiation as it passes through. The most obvious implementation is to polish a section of a large rock wall to use the sun's rays and focus them through a clear ceramic cube containing a clear hologram of standard colors in such a way that the light coming through the hologram is displayed on another large rock wall. A more complicated implementation would capture more obscure radiation, eg. cosmic rays, and, implementing different absorption and emission properties and the refractive indeces of the requisite ceramic materials necessary to have the correct properties, focus them through the ceramic structure of the required shape, through the hologram of the proper composition, and reemit that radiation onto a surface composed of a material which, when excited with the radiation coming through the hologram, would relax by emitting photons in the proper color spectrum of human visible light. This idea is mine. I claim the IP on it.

      Easter Island, Stonehenge, Woodhenge, that sort of thing, but a little bit more high-tech similar to Stargate and SG-1.

      You're right. There's no way the computer platform and the infrastructure necessary to support it is going to stabilize, any time soon, to be near as secure or robust as a one thousand pound obelisk of impenetrable rock achored onto a slab of granite. At the same time the one thousand pound obelisk isn't going to be able to store and actively display near as much information, or be as readily updateable, as a stack of DVDs.

      Decisions decisions...

      --
      Hustlers exist solely through charity. I see their scams, lies, and deceit: I'm too charitable to outright shoot them.
    8. Re:How is this different by profplump · · Score: 1

      Just include a copy of `file` on the disk.

    9. Re:How is this different by hclyff · · Score: 1

      Plus if we know (and care) that the information will be wanted in the future, why not try making it as easily retrievable as possible? How much information was lost since the first written works due to badly enduring materials or linguistic problems? We know how much hassle it costs us to get 4000 years old information, still we are not making it any easier for people who will live in 4000 years from now.

    10. Re:How is this different by Edzor · · Score: 1

      gentlemen i have your answer. punch cards.

    11. Re:How is this different by Marxist+Hacker+42 · · Score: 4, Interesting

      Now that's the right problem. What is needed isn't some mysterious Universal Translator Format- it's storing the read hardware, with programs in ROM that understand the format, along with the electronic copy. Hell, store the whole thing in ROM chips with a well documented interface printed on the outside of the chip. Libraries could be made up of whatever reading technology exists at the time the library is built- with this common pin-level interface.

      --
      SJW: a person who perceives an injustice, and while correcting it, commits a greater injustice.
    12. Re:How is this different by LynnwoodRooster · · Score: 1

      That's why I'm printing out my own copy of the Internet. In case it all goes away, I'll have a full copy at my fingertips. Now if you would all stop adding to it so I can finish spooling my print job, I'll be on my way...

      --
      Browsing at +1 - no ACs, I ignore their posts. So refreshing!
    13. Re:How is this different by salzbrot · · Score: 1
      Stick the info in a .nfo file. Keep the data on an array of hard drives, using RAID or similar. Keep backups of the data. If a drive fails, replace it.

      Next problem?

      Or even easier: put .jpg at the end of the file and everybody will know that it is an image. Or .doc and people know it is a letter. Common people, it is not that hard. My 11 year old nephew knows that. Also, this technology has been around for ages (probably the civil war, or whenever it was the pyramids were built, I forgot), just like RAID, and it will never become obsolete.
    14. Re:How is this different by YrWrstNtmr · · Score: 1

      Next problem?

      Ensuring that you have something that can read that hard drive in 500 years.

    15. Re:How is this different by autophile · · Score: 2, Funny
      Paper actually holds up rather well as an archival medium. Plus, you don't need specialized technology to read it.

      I do. (Adjusts glasses)

      --Rob

      --
      Towards the Singularity.
    16. Re:How is this different by Anonymous Coward · · Score: 2, Insightful

      put .jpg at the end of the file and everybody will know that it is an image. Or .doc and people know it is a letter. Common people, it is not that hard.

      How many years ago were people putting .pcx .ras and .iff on the ends of their files? How much longer will people know what they mean? Ever met an .arc file?

      Obsolescence has been going on forever. Stopping it is harder than you think.

    17. Re:How is this different by thanasakis · · Score: 1

      Hi,

      I am Greek. From our ancient civilization (you know, the one that invented democracy and other pesky things :-), besides statues, buildings, temples, swords etc there is left an incredible amount of literary works. What's funny is that I can read most of it and understand what it is saying. Of course, Ancient Greek is being taught at highschool, but even it it wasn't, I would still be able to get a very good feeling of what I am reading. Modern Greek is very close. At the same time that some other civilizations' remains can hardly be decrypted, our heritage is being studied by thousands of scholars and philosofers around the world.

      So, if, for example, we want to make sure that the heritage of our collective Western Civilization survives, we must make sure that our languages are preserved(*). I know this sounds like a "no shit, Sherlock" argument, but I think it is a matter of basic school education. All children MUST have rigid foundations that will allow them to speak AND write correctly. This also goes to all technical studies advocates too who think that a laptop at the age of 7 or 8 is surely a good thing. If you take up all their time with stuff like that, they'll never want to or have the time to receive such basic education, which in my oppinion is crucial. Well, at least as crucial as learning computers in our modern age.

      I may be wrong, but think about it.

      (*) Note to ourselves: Also, we must make copies of everything like there is no tomorrow. And not encrypted (or DRMed) copies, just plain unencrypted copies. That will make sure that nothing is lost during the course of time.

    18. Re:How is this different by Tim+Browse · · Score: 3, Funny

      If you have a big pile of digital data, what is it? An image? Compressed text? Audio? Just a sequence of numbers?

      That's what MIME types are for. Duh.

    19. Re:How is this different by smoker2 · · Score: 2, Insightful
      Rubbish.

      If you found a bunch of punch cards then what would you make of them ?

      They are obviously some kind of communication, because they have no artistic value. Whether they are designed to communicate with humans or a Jaccard loom is not the point. They convey information. Same goes for digital. Once someone discovers discrete patterns of ones and zeros, then the intention can be deduced. If the repeating pattern is "knit one pearl two" then you're probably reading a knitting pattern. If it says "Four score and twenty years ago ..." then there's a good chance it's a historical text, or maybe fiction, either way, it's human communication. The Rosetta Stone was only useful because there were 3 different languages represented saying exactly the same thing, and they already knew some of the terms in one of the languages. So, maybe if you have a jpeg file, a txt file and an mp3 stored in the same place, then the sequences of data could be related to each other. None of that actually deciphers the context or the actual message. It just shows that the different mediums are equivalent. You still need at least one variable to start decrypting the rest. Context is irrelevant.
    20. Re:How is this different by Dun+Malg · · Score: 3, Interesting
      The main problem in our age is thermodynamics -- information is stored so densely that it tends to decay naturally, on its own. By contrast, ancient stone carvings (as well as their keys, such as the Rosetta stone), are sufficiently durable to last (basically) for ever.
      Of course, preserving the data is only half the battle. Figuring out what it says is the second part. This is, of course, nothing new. We still can't read Linear A. In the case of the Rosetta Stone we were simply lucky to find something relating hieroglyphics to a language we knew. The Rosetta Stone is rather unusual. Normally we have nothing so convenient.
      --
      If a job's not worth doing, it's not worth doing right.
    21. Re:How is this different by Yartrebo · · Score: 1

      Most file formats have easily distinguished headers, usually in the form of a 4-byte long code.

    22. Re:How is this different by The+Good+Reverend · · Score: 2, Insightful

      (IANAA - I am not an archivist, but I do work in an archives library for a media organization)

      Paper can hold up - we have proof of that in centuries-old paper. But when you look at the percentage of paper that's survived the last few thousand years compared to a) the amount of paper produced and b) the amount of information lost, it's staggering.

      There's no one answer, but rather a set of keys that'll help. These include regular backups, widely adopted standards, multiple backup formats, important backups on more durable media, and inclusion of metadata instructions for reading the data.

      Very little will defend against total annihilation of a nuclear or asteroid sort, a technological hiatus (where things can degrade), or not working to do all the above things. But I'd have better hopes for my data kept as above compared to even well maintained paper.

    23. Re:How is this different by elronxenu · · Score: 2, Insightful
      That's the kind of thinking which almost doomed the modern Domesday book. They thought that preserving the retrieval hardware was enough. Wrong! It required a massive restoration effort to get the material off the laserdiscs and onto the web.

      Ultimately, information only survives if it has been duplicated. The Domesday book laserdisc format wasn't easy to duplicate. It wasn't usable on home PCs, only on specially constructed reader machines in libraries. Consequently, it gathered dust in the "cathedral" it had been designed to inhabit.

      The way to keep the information accessible is to keep migrating it to modern media, and modern formats. That, and duplication. Massive redundancy will ensure the survival of at least some copies of the information.

      Although we may be creating too much information to continually transfer it to modern formats, if we at least keep it on modern media we have a chance to use emulation (or structural analysis) to use it in future.

    24. Re:How is this different by Anonymous Coward · · Score: 1, Interesting

      Stored so densely -- but that is the whole point. Now you can store the same information in more than one spot on the same disc for the sake of redundancy. And you still have enough room to store a lot of information. Also, a disc is cheap enough to be copied and scattered throughout the globe. If you want to improve reliability, also bundle your disc with more discs that contain an OS and other application software. As we take the Moore's Law joyride in our computational amusement park, you might bundle your important data media with the requisite hardware to extract it.

    25. Re:How is this different by megaditto · · Score: 1

      Wouldn't you need a truck to move that thing around?

      --
      Obama likes poor people so much, he wants to make more of them.
    26. Re:How is this different by toddestan · · Score: 4, Insightful

      You're assuming far too much. Remember, there are entire written langauges from 2000+ years ago that we barely know how to read. And we have the context of what they were written on, formatting, what the characters look like and things like that. Now, in 2000 years, if someone came upon your harddrive, or flash memory card, or whatever - assuming they could even read it, they aren't going to be able to pop it into a computer and see c:\My Music\ and C:\Documents and Settings\, and the only challenge left is to figure out what the hell an OGG file is. They aren't going to see files. They are going to see 1's and 0's. Lots of them - billions on a memory card and trillions on a harddrive. They won't have a clue know how to interpet the file system, even for something relatively simple like FAT16. They may not even know that a byte is 8 bits. They won't have context, they will be baffled by the fact that most every OS writes files in fragments all over the drive. They likely won't even be tell areas that were marked as deleted but not wiped from the actual data, let along figure out what the swap file is. I seriously doubt that someone in the future, given a working harddisk but nothing else to go on, would be able to pull anything meaningful from the drive. Heck, look at modern day examples - how long did it take Linux to be able to read and write to NTFS, given the number of very smart people working on it who already had a pretty good idea how it functioned?

    27. Re:How is this different by bheekling · · Score: 1

      And how long before the standards for MIME types change? In fact do we even have one right now?

      --
      "..."
    28. Re:How is this different by adrianmonk · · Score: 3, Interesting
      They aren't going to see files. They are going to see 1's and 0's. Lots of them - billions on a memory card and trillions on a harddrive. They won't have a clue know how to interpet the file system, even for something relatively simple like FAT16. They may not even know that a byte is 8 bits.

      They might not know that a byte is 8 bits, but with a little analysis, it shouldn't be hard to figure out. There are numerous statistical properties that can be exploited to figure this out relatively easily. For example, with most types of data, the higher-order bits (in any size byte) are more likely to be 0 than the lower-order bits are. Think about how booleans are stored in most systems. Think about the characters in this message: 100% of them have a zero high-order bit. To put it a little differently, there is more entropy in the lower-order bits.

      So, to figure out how many bits there are in a byte, you take your data, and for all reasonable sizes of bytes (say, from 4 bit bytes up to 36 bit bytes), you compute the function that maps bit position (low- or high-order) to an entropy value for that bit. Then you can tell by the shape of that curve which guess about bits per byte was the right guess. Heck, it should be such a strong trend that you can probably automate it!

      Remember that future civilizations will probably also use digital data as well, at least ones sophisticated enough to try to read the optical and magnetic media. They may not know the FAT32 filesystem, but they will have invented statistics and information theory, and they will be able to make some awfully good guesses at things. And yeah, it might take them 10 or 20 years to be able to read a FAT32 volume correctly if some poor college student of the distant future has to do it on a shoestring budget of grant money, but if they're reading 10,000 year old data, how much does that matter?

    29. Re:How is this different by cultrhetor · · Score: 1

      I think that the whole discussion (and article, if you want to think about it) misses a very large point - probably because of the audience. We're all talking about a permanently available digital format, because that's what we've trained ourselves to do. But even with this mindset, most people I know never have only one, or even two copies. Important documents are backed up to another hard drive, a cd, a DVD, sure, but even then, don't most of us hang on to the most important documents in hardcopy? I know the government does: they're bitching about data loss in transmission; however, even if it's inconvenient, they have hard copies of most artifacts such as blueprints, etc. How many of you keep only digital copies of your tax returns, knowing that if, four years from now, the IRS audits you and you have multiple storage failures/losses, etc., that you're screwed?

      --
      "Tu fui, ego eris" - Virgil
    30. Re:How is this different by Evil+Pete · · Score: 1

      And paper records should survive the 40-50 year hiatus, if they aren't burned. In fact even pulp stuff written on acid containing paper should last more than that period. Though not by a lot. So textbooks etc will survive ... Even if the DVDs survive you need a PC to play it on (talking software here not video). So you have to have a functioning semiconductor / electronics industry. Fat chance that will still be around.

      Also as you say the density of the information is a problem. If I pick up an old book with a spot on one page I can still read around it and surmise what is in it. But if a zip/gz file gets hit by a cosmic ray and flips some bits: checksum error, too bad.

      I have been thinking about this for a while and have thought that what should be done is that some electronics docs, even slabs of wikipedia should be printed onto acid free paper and saved in some special vaults around the place ... must have redundancy. If we have the 40-50 year failure a lot will be lost but some important stuff will survive. Can't win against entropy all the time, so we just have to choose the battles we can win.

      --
      Bitter and proud of it.
    31. Re:How is this different by LoudMusic · · Score: 1

      Sounds like a good plot for a movie. Moderately similar to "Dark Angel", the television show that got canned, but like Firefly I think it would make a good movie.

      --
      No sig for you. YOU GET NO SIG!
    32. Re:How is this different by pq · · Score: 1

      Nah, just tubes.

      --
      "I will take the Ring," he said, "though I do not know the way."
    33. Re:How is this different by SuperMog2002 · · Score: 2, Funny

      That assumes they know which are the zeros and which are the ones. And that in turn assumes they know there are zeros and ones in the first place.

      --
      Sunwalker Dezco for Warchief in 2016
    34. Re:How is this different by Anonymous Coward · · Score: 0

      Let us all pray for your comment to end up on archive.org.

    35. Re:How is this different by multipartmixed · · Score: 1

      I'm making my copy of the Internet out of wrapping paper tubes.

      It's pretty awesome.

      Thank God for Christmas!

      --

      Do daemons dream of electric sleep()?
    36. Re:How is this different by k12linux · · Score: 1

      But after the pulse when those chips have been fried what happens? We'd better encase examples of them in lead tombs underground... along with a CD with instructions on how to create the ancient power form called "electricity." Aww crap.

    37. Re:How is this different by Anonymous Coward · · Score: 0

      What about in 10,000 years when the filesystem or even the concept of a filesystem probably won't exist? And what of this archaic concept of "file extensions" you speak of? In 10,000 years, everything we know about computers will have been obsolete for a very long time.

      Example: Take a typical CD with pictures stored as JPG files. Assuming the CD survived intact for 10,000 years, in order to view the pictures, you must first know how the data is encoded on the CD. In this case, a series of pits and lands are modulated to represent binary 1's and 0's (although not exactly since ECC codes are used as well). Then you'd have to know how the ISO-9660 filesystem works (which includes the very concept of file extensions), how to read ASCII text to decipher the filenames (assuming the normal alphabet is even used in a future language). Then you'd have to know that a JPG extension meant the file was a compressed image file, then you'd have to know how to decompress it and display it. Basically, the concepts of binary computers, pixels, color depth, reed-solomon codes, file extensions, file systems, JPEG headers, ASCII, and anything else we might simply say "duh" to will be completely obsolete. In 10,000 years they won't be able to just put the CD in their CD-ROM drive and double-click the picture files.

      Oh, and your 11 year old nephew will be 10,011 by then, so he will be no help.

    38. Re:How is this different by thogard · · Score: 2, Funny

      Knowing that there are 8 bits in a byte isn't going to do them any favors if they are looking at some of my programs that I have on punch cards.

    39. Re:How is this different by Threni · · Score: 1

      You'll be doing regular backups. As blu-ray or whatever comes along you'll appraise that and use if relevant.

    40. Re:How is this different by yosofun · · Score: 1
      it's obviously different because it's now a conspiracy between the shelf-life of Memorex vs Ridata vs Lena vs TDK

      WHY IS IT THAT 5 CENT DVD'S DON'T LAST FOR MORE THAN A YEAR OR SO?!?!?

      The photos stored on dvd's fade like forgotten memories on a dusty shelf...

    41. Re:How is this different by yayazozzy · · Score: 1

      There is one underlying assumption here that should be looked at more deeply. How long will the storage media last?

      CD Disks: < 300 years due to degradation of plastics and reflective layers - (unless placed in vacuum & chilled to liquid Nitrogen temps)

      Hard Disk: drive mechanism < 5 years due to degradation of heads (oxidation) drying & degradation of lubricants (I have two Apple LISA 5 MB HDD's that are still operational, but that is probably just Apple over spec-ing components and the robustness of the technology). disk platters < 10 years due to thermal & magnetic noise affecting domain orientation

      Scrolls: ink on scraped goat or sheep skin < 10,000 years (earliest Dead Sea Scroll is about 150 BC, or about 2,150 years old), Cuneiform on Papyrus < 10,000 years (earliest writing on Papyrus is about 2,600 BC or about 4,600 years old)

      Clay Tablets: writing indentations in clay ~ practically imperishable (earliest writing on clay tablets is about 3,000 BC or about 5,000 years old)

      We can come up with many ideas for universal data formats, embedded universal virtual machines that can be run on future systems to read, display, play... our data, but the underlying storage infrastructure must change as well. It is a bit difficult to store a 17 MB JPEG image, or a Seu Jorge album on clay tablets.

    42. Re:How is this different by Tim+Browse · · Score: 1

      Johnny Christmas, do I have to explicitly tag every joke I make on slashdot?

      Apparently so. I must be new here.

    43. Re:How is this different by bheekling · · Score: 1

      Damn, my sleep cycle is finally taking revenge on me. Not the humour centre please! Think of the sense of humour!

      --
      "..."
    44. Re:How is this different by phozz+bare · · Score: 1

      More likely, several inconveniently large buildings.

    45. Re:How is this different by old+man+moss · · Score: 1

      Yes indeed. Have you seen this example of something that looks like it should be easy to "translate" but isn't.

      Any idea what the answer is? Is it a hoax/joke? I spent a whole afternoon on it once... and got nowhere :-(

      --
      rt
    46. Re:How is this different by salzbrot · · Score: 1

      Obviously the concept of irony is becoming obsolete even sooner than RAID and filesystem extensions.

    47. Re:How is this different by CagedBear · · Score: 1

      We are also assuming they will have an ample supply of electricity. Without it, magnetic and digital media is completely worthless. Even if they have power after a long period of darkness due to environmental catastrophe, war, etc, where will they get the plans to build machines to read the media?

      I think this needs to be a two step process. 1. Try to decide what is the most important information and make hard copies on a media designed to resist decay, water and fire. 2. Store the rest of it on a single type of media and include hard copy plans of how to build machines to read and decode it.

    48. Re:How is this different by Jellybob · · Score: 1

      There's XML, which while not ideal because some things can be misinterpreted if you don't understand the schema correctly, but it does embed information on *what* the data you've got is to a certain extent.

      If something is unclear, they could also add comments, that can provide some pointers for our future ancestors, although that does rely on certain assumptions, such as the use of the same encodings for characters.

    49. Re:How is this different by Viol8 · · Score: 1

      "To put it a little differently, there is more entropy in the lower-order bits. "

      Not always - text is an exception. For a string of character data going from 0 to 256
      (or 127 , it doesn't matter) for an idealised language where all letters are used
      equally then bit 0 will be '1' 50% of the time and '0' 50% of the time. Similarly
      bit 7 will follow the same pattern and all the bits inbetween. Ok , in real languages
      some letters are prefered over others but this will effect the entropy of the lower
      bits depending where the letters occur in the character set in use , NOT depending
      on how high the actual values are which is what your point is based on.

    50. Re:How is this different by sakasune · · Score: 1

      If something is unclear, they could also add comments, that can provide some pointers for our future ancestors, although that does rely on certain assumptions, such as the use of the same encodings for characters.

      Or the assumption that they speak English...or whatever spoken/written language it is in

      --
      "You're arguing for a universe with fewer waffles in it," I said. "I'm prepared to call that cowardice."
    51. Re:How is this different by danpsmith · · Score: 1
      They might not know that a byte is 8 bits, but with a little analysis, it shouldn't be hard to figure out. There are numerous statistical properties that can be exploited to figure this out relatively easily.

      If this entire civilization goes bottom up, who knows what will find our stuff and if they will know about "statistical properties" or be able to conduct even a "little analysis." Attempting to leave a hallmark card for the future of everything we "know" is silly. We don't know what's to come when we don't really even know that much of what came before us. It's naive to think this way. If civilization goes bottom up, the cavedwellers that could result from the anarchy could be using our hard drives as hand tools with no idea or care as to what they are or any idea or thought as to how they got there. If it doesn't, well then, the modern information will flow from generation to generation anyway, so they won't have to conduct analysis at all.

      --
      Judges and senates have been bought for gold; Esteem and love were never to be sold.
  4. hieroglyphics by IWantMoreSpamPlease · · Score: 4, Funny

    Worked for the Egyptians didn't it?

    --
    So rise up, all ye lost ones, as one, we'll claw the clouds.
    1. Re:hieroglyphics by Shoeler · · Score: 1

      Isn't the solution to at least the format readability problem pretty simple? Print out schematics for a reading device on a format that will last the longest. Store said format with all media.

      Of course that doesn't fix the problem of archive stability. Tapes are supposed to be relatively long-lived compared, say to a simple CD-R, but haven't we all had one or many more fail on us?

    2. Re:hieroglyphics by MysticOne · · Score: 1

      You're also assuming that somebody will know how to read the schematic in the future. They may not look anything like that in the future, or, the parts may be completely unavailable with which to build such a device. It's not a bad idea, but, I don't think it's any better than many of the others.

    3. Re:hieroglyphics by l0b0 · · Score: 1

      Tonk, tonk, tonk, tonk, tonk, tonk, tonk!

      "Aw dammit, I forgot to catch a systemOutOfMemory error!"

      *Fetches another rock*

      And that, ladies and gentlemen, is why Java will be forgotten by history.

    4. Re:hieroglyphics by kilgortrout · · Score: 1

      No it didn't work for the Egyptians. The destruction of the Library at Alexandria is considered one of the greatest losses to our understanding of the ancient world. This library was considered to be the Library of Congress of its day and was totally destroyed in a fire. To be fair though, the Greeks had taken over by this time so it wasn't really hieroglyphics.

    5. Re:hieroglyphics by StarfishOne · · Score: 1

      Ah! It's good to know that at least my handwritten notes are safe! :D

    6. Re:hieroglyphics by megaditto · · Score: 1

      Why are we so fucking full of ourselves?

      Why do we think the future humans will care what some Joe Nerd brogged after watching a porno in his mom's basement?

      The truth is, 99.99% of all our "information" is simply not worth preserving. The rest, print of some acid-free paper.

      --
      Obama likes poor people so much, he wants to make more of them.
    7. Re:hieroglyphics by kimvette · · Score: 1

      My penmanship resembles hieroglyphics after many years of typing and doing very little writing. That does not mean future archeologists will be able to decipher my henscratch. ;_)

      --
      The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
    8. Re:hieroglyphics by kimvette · · Score: 1

      I've found tapes to be horribly unreliable, but I just went through all my CD-Rs I've made since late 1994/early 1995 and every single one was readable; music, data, etc. I tossed about 90% of them after shredding the foil off, but they were definitely readable. I doubt the same could be said of the QIC-80 tapes I have lying around. . .

      --
      The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
    9. Re:hieroglyphics by visgoth · · Score: 2, Funny

      Indeed. I highly doubt that 10,000 years ago Thagnar lamented the fact that nobody would remember his daily struggles.

      --
      My patience is infinite, my time is not.
    10. Re:hieroglyphics by Anonymous Coward · · Score: 0

      On the other hand, today there are plenty of people who lament the fact they know nothing substantial about Thagnar's daily struggles.

  5. I've heard this problem over and over by csoto · · Score: 5, Interesting

    Working at a University, this is not a subject I'm not unfamiliar with. We've had lots of discussions about this. Everyone always talks about how many zillions of "pieces of information" are out there. The number of web pages in existence is always brandied about. My point in these discussions is that most of what's out there is crap. Humanity is not lessened by its loss. Good stuff gets reproduced, reviewed, studied, dissected, etc. and survives. It *is* stupid to try to solve this problem, because the problem doesn't need solving.

    --
    There exists no way of exchanging information without making judgments. --Bene Gesserit Axiom
    1. Re:I've heard this problem over and over by jazman_777 · · Score: 1
      It *is* stupid to try to solve this problem, because the problem doesn't need solving.


      For example, a large commercial airplane manufacturer has this problem, their engineering docs all aren't under constant review and update, but their vehicles stay in service for decades. After a certain time they archive on microfiche.

      --
      Slashdot: Failed Car Analogies. Amateur Lawyering. Anecdote Battles.
    2. Re:I've heard this problem over and over by failedlogic · · Score: 3, Insightful

      Things like music, TV shows, movies, literature, toys, magazines etc are all cultural products. For future generations we need to keep records of there items as much as family trees, great stories, buldings, etc.

      Besides, who's to decide what is 'crap' or not. It might be that to the untrained eye, a clay pot from Egypt might not look interesting. The color, shape, its condition, etc might tell someone who used it, why, what cultural value (symbology, usefullness, etc) the pot actually had. And culture evolves from culture. Keeping a record of everything we product allows future generations to inform themselves of who we were and what we did. Quality of the information itself is really unimportant.

      Only thing I'd have to add: I wish future generations all the luck in sorting through our garbage piles and recycling/salvaging what they can. If anything, this amount of waste - or crap - is a record of us as much as anything. I can agree with you on this point about crap in our culture!!! ;)

    3. Re:I've heard this problem over and over by kfg · · Score: 4, Insightful

      Expanding copyright protection to a term equal to two lifetimes means that now even some of the good stuff is being lost because it is not allowed to preserve it.

      If preservation is outlawed, only outlaws will be preservationists.

      I believe Ray Bradbury had something to say on this subject.

      KFG

    4. Re:I've heard this problem over and over by rbegga · · Score: 1

      Amen to the above post. Let's allow a little Darwinism in the selection of what to waste those storage bits on before we create the official book of useless knowledge.

      --
      A little non-sense now and then is relished by the wisest men. -Willy Wonka
    5. Re:I've heard this problem over and over by nizo · · Score: 1

      Let me guess; we could go read the Bradbury story, except you can't post it somewhere because it would be a copyright infringement? Oh the irony....

    6. Re:I've heard this problem over and over by Trespass · · Score: 2, Insightful

      Yes, exactly. It's the ephemera that tells you what life was like in any given era, not the palaces, official monuments, etc.

      I'll wager you could reconstruct far more about the culture of early 21st century from the contents of a convenience store than that of the White House. There's a big gulf between who a people are and the mask they present to the world.

    7. Re:I've heard this problem over and over by Anonymous Coward · · Score: 0

      Working at a University, this is not a subject I'm not unfamiliar with

      Grammar, however, seems to be terra incognito....

    8. Re:I've heard this problem over and over by mypalmike · · Score: 1

      I wish future generations all the luck in sorting through our garbage piles and recycling/salvaging what they can. If anything, this amount of waste - or crap - is a record of us as much as anything.

      If this is the case, then archaeology will not have changed much. The most useful findings in archaeology are often those found in the waste piles ("middens") of the site.

      --
      There are 0x40000000 types of people: those who understand 32-bit IEEE 754 floating point, and those who don't.
    9. Re:I've heard this problem over and over by s20451 · · Score: 2, Insightful

      Expanding copyright protection to a term equal to two lifetimes means that now even some of the good stuff is being lost because it is not allowed to preserve it.

      Huh. So the FSF will win by default. You gotta hand it to somebody who is willing to play the long game.

      --
      Toronto-area transit rider? Rate your ride.
    10. Re:I've heard this problem over and over by Anonymous Coward · · Score: 0

      Working at a University, this is not a subject I'm not unfamiliar with. It's a bad introduction to say that you are not familiar with the subject.

    11. Re:I've heard this problem over and over by Anonymous Coward · · Score: 0

      That's only half of the truth. What it's worth it will be copied - if possible. Thinking about DRM etc. we may face a situation, even worthwhile digital information fades away or gets dumped for centuries within other junk, just because some people weighing their short term monetary interest higher, than the long term cultural value.

      The next possible argument whould be: A society, which allows this, deserves to be (partly) forgotten - or to be remembered exactly for this stupidy.

    12. Re:I've heard this problem over and over by kfg · · Score: 1

      For the moment fair use still allows this:

      http://en.wikipedia.org/wiki/Fahrenheit_451

      I'd say this line must be held, except the line has already been left far in the dust. The line needs to be rolled back. Do not accept any "concessions" by "industry." They are being made to get you to accept the current position of the line. It's a very old trick.

      KFG

    13. Re:I've heard this problem over and over by inKubus · · Score: 1

      Not to mention whomever controls the library can decide what stays and what doesn't. Which means they can create any picture of history they want. Which, as we all know, means they can control the future.

      --
      Cool! Amazing Toys.
    14. Re:I've heard this problem over and over by Anonymous Coward · · Score: 0

      He didn't say he was unfamiliar with the subject. He said he was not unfamliar with it, meaning he WAS familiar with it. Only he also said that this is not a subject he was not unfamiliar with. So he was familiar with it, just not this subject. So we still don't know whether he was familiar with THIS subject or not.

      Is this typical of the way people speak and reason at a university?

    15. Re:I've heard this problem over and over by blaster151 · · Score: 2, Insightful

      I doubt that a historian would see it your way. How many records, judged by their contemporaries as irrelevant, have helped historians piece together valuable perspectives about times past! Like the monks who deemed it appropriate to copy over Bach manuscripts, isn't there hubris when we declare with certainty what is and is not worth preserving? Perhaps we don't have enough perspective to reliably do that.

    16. Re:I've heard this problem over and over by Anonymous Coward · · Score: 0

      "this is not a subject I'm not unfamiliar with."

      I don't think you ended up saying what you meant. With three negatives in that sentence, I think you actually ended up saying that you aren't familiar with the subject. Although, it's clear from the rest of your post that you do consider yourself familiar with the subject. Or at a minimum, like most posters at slashdot, you're not afraid to comment and hold a strong opinion on something you know nothing about. :)

      Anyway, maybe you should consider a career in politics. You certainly have the writing skills to obfuscate your real intentions.

    17. Re:I've heard this problem over and over by failedlogic · · Score: 1

      That I intended as a sarcastic comment. Though there is probably some archeological value to the garbage as well. Not sure paper and media will be well preserved or readable though.

      Zune? Not much to say. There won't be many to be found. I'm sure they will have more to say about Nintendo Wii and the portable Little Wii. ;)

    18. Re:I've heard this problem over and over by Anonymous Coward · · Score: 1, Funny

      Working at a University, this is not a subject I'm not unfamiliar with.

      Well, I have to say, I'm at a University too, and this is far from being not a subject I'm not unfamiliar with.

    19. Re:I've heard this problem over and over by pclminion · · Score: 3, Funny

      Working at a University, this is not a subject I'm not unfamiliar with. We've had lots of discussions about this. Everyone always talks about how many zillions of "pieces of information" are out there. The number of web pages in existence is always brandied about.

      Where can I attend these meetings, where people speak in triple negatives and much brandy is available?

    20. Re:I've heard this problem over and over by wall0159 · · Score: 2, Insightful

      I think you're wrong (and you use a double negative ;-)

      Most people are disinterested in history, hence there is no guarantee of a verbal knowledge continuum in the event of widespread hardware failure.
      We know that the hardware always eventually fails.
      We know that hardware always becomes obsolete.
      We know that civilisations always fall.
      We also know that these things have happened in the past, resulting in the loss of knowledge (in some cases it was because the language became extinct, and has never been deciphered. In others it was because proper documentation was never made, or was lost, or was destroyed, etc. If you think about archaeology, it only exists because of a _lack_ of documentation. It's trying to piece together data from scattered, incomplete fragments).

      The fact that you so easily dismiss this shows a lack of knowledge of history (point 1), and perfectly illustrates that old adage "if there's one thing we can learn form history, it's that we don't learn from history."

    21. Re:I've heard this problem over and over by Joe+Snipe · · Score: 1

      Working for a University, someone should have informed you that using triple negatives makes for poor sentence structure, and obfuscated readability. Please don't not stop doing it.

      --
      Sometimes, life itself is sarcasm...
    22. Re:I've heard this problem over and over by crabpeople · · Score: 1
      "Humanity is not lessened by its loss. Good stuff gets reproduced, reviewed, studied, dissected, etc. and survives."

      Sure. Thats why we have atlantian anti gravity devices and their FTL spaceships right? Im sure we will one day re-invent them, but having a bit of forsight is worth a pound of hindsight - or something. This is a real problem. I personally have been archiving colbert episodes and have similar issues as TFA. Even raids crash sometimes, and tape is WAY too slow.

      --
      I'll just use my special getting high powers one more time...
    23. Re:I've heard this problem over and over by DragonWriter · · Score: 1
      Working at a University, this is not a subject I'm not unfamiliar with.


      I think you intended a double, rather than triple, negative here, though, in that case, you could have just avoided negation altogether and said, "Working at a University, this is a subject I'm familiar with."

    24. Re:I've heard this problem over and over by glwtta · · Score: 1

      Working at a University, this is not a subject I'm not unfamiliar with.

      I don't think that doesn't make you not unqualified to not talk about it.

      --
      sic transit gloria mundi
    25. Re:I've heard this problem over and over by ogrizzo · · Score: 1

      In the meanwhile, you seem to have problems with latin: it's terra incognita

    26. Re:I've heard this problem over and over by tknd · · Score: 1

      It's funny. I like to watch anime and every other futuristic/space based one always features these new civilizations that are rebuilt yet there is 'old technology' sitting around that nobody understands. But when the old tech does work, it's usually better than the existing civilization's tech.

      Sound familiar? Probably because it is. Many historians and even scientists study old technology and for a good reason. The issue here is we don't want to forget what we happened and why it happened that way so that we don't continuously make the same mistakes in the future. Even on a small scale level of weeks, a business managing a group of people could get much better insight if access to the historical data was available to generate charts and trends.

      The problem isn't stupid and I don't think it is that hard. Life has managed to come up with a pretty good solution of persistence and that is to make copies of itself faster than the rate at which it decays. Why can't we do the same with data? We don't have to make the data live forever, we only have to make it live for as long as humans exist.

    27. Re:I've heard this problem over and over by Darth_Burrito · · Score: 1

      We know that the hardware always eventually fails.
      We know that hardware always becomes obsolete.


      If data is important to someone, it should be backed up and kept available on modern hardware. Disk failure and hardware obsoletion should be non-factors. If it's not important enough to take those safeguards, it's not actually important. Software obsoletion is, imo, a much bigger problem but the same rules apply.

      We know that civilisations always fall.

      For us to lose a critical quantity of modern knowledge, it would not be enough for a civilization to fail, it would have to be a global failure. And not just any global failure, but a shitstorm so great that no amount of preparation would save our collective asses. In the event of such a failure, we'd probably still have more hard copy to fall back on from the last century than was produced in the entirety of human civilization prior to that.

    28. Re:I've heard this problem over and over by lazd.net · · Score: 1

      Working at a University, this is not a subject I'm not unfamiliar with. Hmm... If something IS a subject that you are NOT UNfamiliar with, then it IS a subject that are ARE familiar with. Therefore, if something is NOT a subject that you are NOT UNfamiliar with, then it is a subject that you are NOT familiar with. In which case, why would one respect your opinion on the subject? You mention you work at a university... Are you the janitor? ;)

    29. Re:I've heard this problem over and over by anti-pop-frustration · · Score: 1

      Good stuff gets reproduced, reviewed, studied, dissected, etc. and survives.

      Wrong. Popular stuff gets reproduced, reviewed, studied, dissected

      Look at emulation, it has worked wonders at preserving video games because a lot of people care about it. Do you see the same amount of effort being put in to preserving local or federal records?

      In about 50 years from now, lots of copies of Star Trek movies and TV shows will be available in a variety of digital formats because the stuff is popular. Do you think digital records (emails etc.) about the Iraq war produced by the current U.S. administration will be if we let the problem sort itself?

      Digital preservation is an issue that urgently needs to be addressed, what kind of record would we have of the web if the Internet Archive didn't take matters into their own hands in 1996? Simply nothing.

      Digital preservation is not some kind sci-fi "let's send a message to the 27th century" project. It's about being able to access our digital records in a meaningful way 20 or 30 years from now. Letting the problem solve itself is a recipe for disaster.
  6. A huge problem, indeed! by duh+P3rf3ss3r · · Score: 2, Interesting

    I've seen this very thing happen where I work -- we've lost data over the years because of incompatiblity issues. On the other hand, as with many things, it's a huge problem but not an insurmountable one. The key is in planning an anti-obsoloscence strategy into every IT decision. Store data files in open formats on robust media and put someone in charge of ensuring the archives are maintained and accessible.

    It's not easy, sure, but neither are many of the other tasks we take on as humans.

    --
    Give a man a match: warm him for an instant. Douse him in petrol and set him aflame: warm him for the rest of his life.
  7. Republic? by Anonymous Coward · · Score: 0, Flamebait

    Thibodeau hopes to develop a system that preserves any type of document... for as long as the United States remains a republic

    So he only needs to archive up to November 7th, 2000? That should help him with managing the scope.

  8. Biggest hurdle is legislative by Anonymous Coward · · Score: 0

    Basically with the draconian virtual ban on reverse engineering of formats .. this sort of thing can be expected. Especially since copyrights for even abandoned works will be extended indefinitely.

  9. Keep missing my opportunity by JoeyLemur · · Score: 1

    I've been trying to develop software to do it... unfortunately, my amazing abilities at procrastination and wanting to constant redesign the project have left it languishing for nine years.

    Then I keep seeing articles on archiving projects and think I really should get back to work on it...

  10. My solution for digital photos? by OfNoAccount · · Score: 4, Informative

    Since I shoot RAW, I also burn a copy of dcraw.c onto every disc - so even if the current platforms get lost by the wayside, there will be code to convert them still.

    Storage itself? Currently burning onto Delkin Archival Gold, storing cool and dark, and in two physically distant locations.

    They're also stored on my harddisk, and the best are backed up onto a USB drive.

    If it looks like the DVD-ROM drive is becoming obsolete I'll burn them on to whatever comes along next.

    If you're truly paranoid you can always print them on archival quality paper using pigment based inks ;)

    1. Re:My solution for digital photos? by tomjen · · Score: 2, Insightful

      Wonderfull plan - but what if you cant find a working C compiler?

      --
      Freedom or George Bush
    2. Re:My solution for digital photos? by matt328 · · Score: 1

      If it looks like the DVD-ROM drive is becoming obsolete I'll burn them on to whatever comes along next.

      I think you've nailed it right there. Generally speaking, if information's important enough, it'll persist. Someone will take the time to convert it to the next big storage medium. If it's pretty worthless, such as [insert annoying trend here], it'll be (hopefully) doomed to the oblivion of 1s and 0s and archaeologists will look back trying to figure out why Bono wore sunglasses which a) had no tint and b) it was nighttime.

      --
      Check out the cave on the east side of lake Hylia. Strange and wonderful things live in it.
    3. Re:My solution for digital photos? by mashade · · Score: 1

      You port the C source code to $current_language

      --
      Technology tips and tricks.
    4. Re:My solution for digital photos? by Anonymous Coward · · Score: 0

      I've discovered a new archival format for storing my pictures. It's basically a transparent polyester sheet coated with an emulsion containing light-sensitive silver halide salts. As long as it's processed properly, it's been shown to be archival quality and stores an enormous amount of data. I think it's generally referred to as film.

    5. Re:My solution for digital photos? by markov_chain · · Score: 1

      You underestimate C. There will always be a working C compiler, and everything else will always be written in it ;)

      --
      Tsunami -- You can't bring a good wave down!
    6. Re:My solution for digital photos? by JakartaDean · · Score: 1
      If you're truly paranoid you can always print them on archival quality paper using pigment based inks
      IIRC, Cibachrome on mylar was supposed to last 500 years, so my grand^25children will be able to enjoy my snap of a budha on the top of Borobudur. Original has to be a slide, unless they have updated the technology in the ten years since I gave up my darkroom.
      --
      The subject who is truly loyal to the Chief Magistrate will neither advise nor submit to arbitrary measures (Junius)
    7. Re:My solution for digital photos? by Gothmolly · · Score: 1

      Is all that content actually valuable, other than the pr0n ?

      --
      I want to delete my account but Slashdot doesn't allow it.
    8. Re:My solution for digital photos? by clickclickdrone · · Score: 1

      >dcraw.c
      Hey, there's a funny file on this disk called dcraw.c. Hang on.. it says it needs a C compiler? WTF is C?
      Oh man, they used to use that to write software back in the day. No idea where you find one now.

      --
      I want a list of atrocities done in your name - Recoil
    9. Re:My solution for digital photos? by rbarreira · · Score: 1

      That's so silly. The effort of writing a C compiler (it doesn't even need to be a *good* C compiler, just a working one) is trivial... Plus, it's not actually needed, since any decent programmer should be able to understand what the code is doing and write a new decoder from scratch in whatever language is the current fashion at that time...

      --

      The AACS key is NOT 0xF606EEFD628B1CA427BEA93A9CA9773F
    10. Re:My solution for digital photos? by clickclickdrone · · Score: 1

      Is there some unwritten rule about being on slashdot that you have to take everything literally and disable any humour functions in your brain before posting?

      --
      I want a list of atrocities done in your name - Recoil
    11. Re:My solution for digital photos? by clickclickdrone · · Score: 1

      And what's the betting some idiot replies to my previous msg with 'Oh no, I don't recall ever seeing or hearing about such a rule, I think you must be mistaken on that'.

      --
      I want a list of atrocities done in your name - Recoil
    12. Re:My solution for digital photos? by Anonymous Coward · · Score: 0

      I am an American, you insensitive clod!

    13. Re:My solution for digital photos? by clickclickdrone · · Score: 1

      Heh, heh ;-)

      --
      I want a list of atrocities done in your name - Recoil
    14. Re:My solution for digital photos? by Anonymous Coward · · Score: 0

      Either that or your jokes don't look funny at all :P

  11. Store the files in Notes/Domino! by LibertineR · · Score: 1

    IBM will NEVER shoot that baby in the head, so there will be Notes databases around when my grandkids are long dead.

  12. wrong scope by WickedLogic · · Score: 0, Redundant

    ... for as long as the United States remains a republic.

    So like, what the next decade at most.... no problem.

    1. Re:wrong scope by ericlondaits · · Score: 1
      ... for as long as the United States remains a republic.
      So like, what the next decade at most.... no problem.
      We interrupt this message for announcement from our Lord, Darth Dubya...
      --
      As a Slashdot discussion grows longer, the probability of an analogy involving cars approaches one.
  13. Open, well-used, file formats. by Daniel_Staal · · Score: 4, Insightful

    There are only two ways of doing this: keeping a copy of every program used to create these files (and a system to run them on) or converting them to some open and well-supported format.

    For text documents, HTML is probably the best bet. It is so widely used and supported readers are almost garunteed to exist as long as computers do in their current form. (And if something ever truely supersedes it, a mass-conversion program will be written anyway.) HTML probably works for basic spreadsheets too. Graphics support for GIF, JPEG, and PNG is probably at that level as well, and MP3 for music.

    As a bonus, most of the native programs for the documents to be preserved have translators to these formats already.

    Beyond that I have no idea.

    --
    'Sensible' is a curse word.
    1. Re:Open, well-used, file formats. by John.P.Jones · · Score: 2, Interesting

      Keeping 'a copy of every program' is tractable, 'and a system to run them on' however is not. Data (programs) can be easilly copied to new media and thus live forever (as long as people are around to order new media, install it and copy the data anyways but thats just a staffing problem). But hardware is not so easilly ported, that is unless you have an open, easy to port, emulator that will run your programs. Preferably this emulator should require very little say just a functional C compiler for future hardware. So there you have used a common CS solution, you have REDUCED the problem of saving all your data to the problem of maintaining hardware for which you have a functional C compiler, a much easier task. If you can't find such a machine your solution would then be to implement a C compiler, again a tractable problem.

      I have simplified for the sake of being lazy but the essence of portable emulators + extensive software and data backup and storage is sound, you don't even have to concern yourself with speed if you are willing to accept that future hardware will be fast enough.

    2. Re:Open, well-used, file formats. by Anonymous Coward · · Score: 0

      >There are only two ways of doing this: keeping a copy of every program used to create these files (and a system to run them on) or converting them to some open and well-supported format.

      This option might become more difficult as more systems (eg. XP, Vista, Office) requires 3rd part to validate. What if your system relies on WinXP and office 2003 to work, but MS by then no longer exists or supports and validate your copy?

    3. Re:Open, well-used, file formats. by nine-times · · Score: 1

      I think your general sentiment is worthwhile, but HTML for word processing documents, JPEGs for pictures, MP3s for audio? Geeze, lets at least be thinking more along the lines of ODF/PDF, PNG24, and FLAC.

      However, that doesn't really address the question of medium. It'd be nice to have some sort of nearly-indestructable medium to store all this.

    4. Re:Open, well-used, file formats. by nine-times · · Score: 1

      I think that's why he said "open and well-supported format". The "open" part might preclude the use of many Microsoft formats.

    5. Re:Open, well-used, file formats. by Doctor+Memory · · Score: 1
      Keeping 'a copy of every program' is tractable, 'and a system to run them on' however is not.

      Depends on how deep your pockets are. There's a warehouse in eastern PA that has a MicroVAX, a couple of VT240s and an extensive collection of TK50s holding scads of MOL files and pre-clinical trials data "just in case". Not sure if they'll revive everything and re-package it now that there are VAX emulators available, but if you've got data worth (potentially) several hundred million dollars, you'll go to extensive lengths to keep it available. I strongly suspect that these guys have not only the MV, but enough schematics to enable them to recreate some form of VAX, even if they have to cobble up something from FPGAs.
      --
      Just junk food for thought...
    6. Re:Open, well-used, file formats. by tomjen · · Score: 1

      Hmm - the problem with ODF/PDF is that it cannot be chanced by hand - however LaTeX source code can.

      As for music I agree Either FLAC or Wav depending on what you want.

      Media? Codac used to make some gold cds that they claimed lasted 12 times as long as the average cd. Other than that you should look at something like a good oldfasioned stone. They last a real long time.

      --
      Freedom or George Bush
    7. Re:Open, well-used, file formats. by nine-times · · Score: 1

      If future systems are so wildly different from those we have today that they can't have a PNG viewer written for them, how easy will it be to write an emulator now that will run on such wildly different systems, yet faithfully emulate our existing environments?

    8. Re:Open, well-used, file formats. by Daniel_Staal · · Score: 1

      ODF/PDF are probably about at the required level of use for this; I did consider them for a moment. A little harder to convert to, but they preserve more and are nearly as future-proof. I did mention PNG, though I thought about leaving it out: if MS's support doesn't improve it could eventually fade away as a format. (Another alternative would be TIFF, which is quite well supported.)

      FLAC... Isn't widely enough used, in my opinion. WAV or AIFF perhaps.

      Of course, these are all of the top of my head. Given some time and money I could come up with numbers to back specific choices, but it would take a fair amount of both.

      --
      'Sensible' is a curse word.
    9. Re:Open, well-used, file formats. by nine-times · · Score: 1

      Hmm - the problem with ODF/PDF is that it cannot be chanced by hand

      Neither can FLAC or PNG. Anything stored in any format will assume that you have a computer capable of reading it. Hell, I could have plain text files on a hard drive, but I still can read them without a computer, an operating system, and some program that will open a text file and output the contents.

      So the key thing is, I think, to store everything in open formats, and then focus on preserving enough information as is necessary to build a viewer for each format. Hell, write a full spec on how to read the disk and how to write a viewer for each file format, and print those on plastic sheets that won't degrade. As long as mankind doesn't regress to cave men, they should be able to figure it out.

    10. Re:Open, well-used, file formats. by Daniel_Staal · · Score: 1

      That would work, I don't disagree. Assuming you can get emulators of sufficient quality to run everything. Speed isn't necessarily an issue: access doesn't have to be as fast as it is on a native system after all.

      However, you are just time-shifting your complexity, from data input to data output. Given that data input is a continuous process and output is likely to be burst, having the complexity up front is going to present less of an overall issue to use of the system. It is also probably the cheaper in total cost, especially if you need to output the same data several times at widely spaced time intervals. (Where you would need to do the emulate-and-extract separate for each time, as hardware/software/etc. will have changed.)

      Of course, your solution is cheaper up front. Which is an advantage.

      --
      'Sensible' is a curse word.
    11. Re:Open, well-used, file formats. by Anonymous Coward · · Score: 0
      There are only two ways of doing this: keeping a copy of every program used to create these files (and a system to run them on) or converting them to some open and well-supported format.

      For text documents, HTML is probably the best bet. It is so widely used and supported readers are almost garunteed to exist as long as computers do in their current form. (And if something ever truely supersedes it, a mass-conversion program will be written anyway.) HTML probably works for basic spreadsheets too. Graphics support for GIF, JPEG, and PNG is probably at that level as well, and MP3 for music.


      For text and data documents, text format is definitely best, the only issue is line endings. However the real question wrt to format is binary format - big vs little endian, bit depth, etc...

      Other than that, you all seem to be missing the overall point:
      - Tape lasts a maximum of seven years before it must be copied to a new tape.
      - CD/DVD lasts a maximum of 33-100 years.
      - Hard disks will last as long as the drive motor runs, beyond that I'm not sure how long the data will survive on the platters and if an "alien anthropologist" (Thanks Rog) could recover it.

      However, egyptian hieroglyphs on stone and ancient cuneiform on clay tablets protected from the elements have lasted for millennia. That is the point here. We do not currently have a digital medium that will last millennia without manual renewal according to its expiry cycle. Crystal storage holds the best promise for really long term storage without degradation, it is long overdue.
    12. Re:Open, well-used, file formats. by anthony_dipierro · · Score: 1
      However, egyptian hieroglyphs on stone and ancient cuneiform on clay tablets protected from the elements have lasted for millennia. That is the point here. We do not currently have a digital medium that will last millennia without manual renewal according to its expiry cycle.

      I wonder how long punch cards can last. Put them in well sealed container, and you could probably store them for millennia. The biggest problem with current storage systems is that they pack the data so tightly. It really doesn't have anything to do with digital vs. analog. You can write 1s and 0s on clay tablets, and at the physical level tapes and hard drives are really analog.

    13. Re:Open, well-used, file formats. by John.P.Jones · · Score: 1

      I agree we should make best efforts to prevent the need for drastic measures to read the data but that isn't sufficient to avoid preparing for drastic measures. You need to be prepared for doing things the hard way, because we can't predict how hard it will be in the future we can't assume whatever precautions we have taken will be sufficient. That is why I am really proposing the 'worst case' scenario.

    14. Re:Open, well-used, file formats. by Anonymous Coward · · Score: 0
      I wonder how long punch cards can last. Put them in well sealed container, and you could probably store them for millennia. The biggest problem with current storage systems is that they pack the data so tightly. It really doesn't have anything to do with digital vs. analog. You can write 1s and 0s on clay tablets, and at the physical level tapes and hard drives are really analog.


      Hmmm, punchcards of quality acid free cellulose in a sealed container would almost certainly last longer than tape or cd/dvd. Interesting idea.

      And yes, magnetic storage is essentially "analog" - slewing rates of the read/write electronics are critical and ultimately determine the density possible.

      Yes, it is not a question of analog vs digital, it is a question of media durability.

      It occurs to me that given the prolific nature of modern computer data and documentation the real problem for the "alien anthropologist" would be sorting the "wheat" from the "chaff". Even if we did have a medium of infinite durability, sorting the significant material out from all the inconsequential data would be a monumental task.

      Someone suggested above that the important stuff will survive, I can only hope he is right.
    15. Re:Open, well-used, file formats. by nine-times · · Score: 1

      FLAC and PNG aren't likely to disappear anytime soon. Just because you don't see them for what you're doing doesn't mean they aren't in use. The reason I suggest them is that, unlike MP3 or JPEG, they're lossless. Unlike WAV or BMP, they're compressed. I would think the best file formats for archives would be those which would preserve all the intended data losslessly, but with as much compression as possible.

      So for pictures that leaves a couple of options, like PNG or TIF. Audio really should be FLAC to save some space. There are other options, but AFAIK, none as open, well supported, and commonly used as FLAC. ALAC, for example, is supported by Apple and available to FOSS through reverse engineering, but not exactly open.

    16. Re:Open, well-used, file formats. by NineNine · · Score: 1

      So, you want to use MORE obscure formats? Oh yeah, that's a good idea. There are 1000's of programs today that handle MP3. It's even burned in chips for regular old stereo hardware. I know that in 20 years, I'll easily be able to find a program that reads MP3's. Considering there are only a handful now that read FLAC and the other ones you mentioned, I wouldn't be so sure that any of these formats will survive. (And PDF? Are you kidding? PDF is such a shitty format, that I've gone so far as to formally ban all use of it from my business.)

    17. Re:Open, well-used, file formats. by nine-times · · Score: 1

      I'm sorry to disagree so much with someone who has such a similar name. However, if you read my other posts, I've given reasons why I suggested the formats I did. In 20 years, do you want to go back to your archive and find that you've gone through all this trouble of archiving, only to find you've archived MP3s? If the purpose is to preserve the data, then why on earth would you throw large portions of that data away by storing lossy formats?

      FLAC is not an obscure or unknown format. In fact, I would hazard to say it's the de facto standard for lossless compressed audio. Likewise, PDF is a well supported and well documented open format for storing the layout of printed pages. Adobe Acrobat has its problems as a program, but the format itself is very good (for its intended purpose, which is to store accurate representations of print documents). PDF is commonly used in professional print settings and document archive systems.

      If you really want to research the issues, there are options that I haven't mentioned. However, if you were more knowledgeable about this topic, you'd know that my suggestions are, at the very least, worth listening to.

    18. Re:Open, well-used, file formats. by mariushm · · Score: 1

      Yeah ..well... I would say XML would be much better than HTML, as it would be parsed easily. It would work for both ASCII and Unicode text, data ... PNG, TGA could be used for storing images .. both are relatively easy to decode and enough documentation exists. Some vectorisation format and some layered photo format (not proprietary PSD format) would also be needed. For music, two extensible open source formats would be needed, one lossless (OGG, MP3 when patent expires -though it is limited by the number of audio channels it supports) and at least one lossless format (FLAC is the first that comes to mind, APE the second) For videos, there is already an open source format (container) called Matroska and some lossless codecs could be standardized. It is possible, but indeed, very hard. And would not be possible as long as companies care more for their profits than ...

    19. Re:Open, well-used, file formats. by HighPerformanceCoder · · Score: 1

      I'd bet on LaTeX in this stable - its a widely used plain ASCII format that is more or less understandable to humans without a LaTeX compiler.

      I still have LaTeX documents from 18 years ago that I can read, and could compile with minor modifications if needed. The same cannot be said of the word processor documents from the same era.

  14. The problem is semi-solved already by Oddster · · Score: 1

    There are several companies out there which specialize in Document Imaging Software, specifically for searchable archive purposes. The primary problem is simply the manpower to write the number of conversion filters necessary to import external data formats into the database's internal format; the storage and search/retrieval problems are mostly solved already.

    Disclaimer: I used to be an engineering intern at Laserfiche

  15. Thats stupid. by CDPatten · · Score: 1, Insightful

    This isn't the 80's and almost any file being saved in Archives are in formats that many programs can open. Meaning that the specifications for those formats are known... regardless of whether or not it is legal. Even word files are viewable by a number of applications, and nobody is archiving historical information with advanced macros so don't even post with that macro crap.

    Also to assume that future generations won't have the sense or ability to figure out how to open files we write is silly.

    Because "some" businesses (or the military like the articles suggests) find opening archived information ON THE FLY difficult doesn't mean a (more technolgically advanced) society wanting to learn their past will have the same limitations. This article is just another example of entry level "tech writers" and of how low journalistic standards are.

    PS
    I am not a journalist... so save your grammer and spelling corrections for someone who is.

  16. Popular Mechanics asks... by susano_otter · · Score: 3, Insightful

    From TSA: "Popular Mechanics asks: Will an entire era of human history be lost?"

    Obviously not; Popular Mechanics itself has preserved much of the era in traditional hardcopy formats, making it no less lossy than previous printed-word eras.

    Of course, understanding the era from such incomplete and unreliable records will be a challenge to archaeologists and historians; again, not much different from previous eras.

    In conclusion: doesn't matter, hardly news.

    --

    Any sufficiently well-organized community is indistinguishable from Government.

    1. Re:Popular Mechanics asks... by dccase · · Score: 1

      Popular Mechanics has preserved a history where the future had flying cars.

      If we found them in the future would we think that all ancient humans had personal flying vehicles?

      Pehaps in the future, the 7 wonders of the ancient world will be the lost dreams of the 20th century.

    2. Re:Popular Mechanics asks... by susano_otter · · Score: 1

      You don't seem to have much understanding of how serious archaeologogists and historians go about their business.

      If we found Popular Mechanics in the future, we'd think that some ancient humans once believed that they would have personal flying vehicles in the future.

      Besides, it's not as if Popular Mechanics is providing the only hardcopy documentation of the era.

      --

      Any sufficiently well-organized community is indistinguishable from Government.

  17. Government Area of Expertise by ThatsNotFunny · · Score: 5, Funny
    When Thibodeau told the head of a government research lab about his mission, the man replied, 'Your problem is so big, it's probably stupid to try and solve it.'"


    I'd trust that guy. If there's one thing our governrment knows, it's stupidity.
    --
    "Was it a millionaire who said 'Imagine No Posessions?'" -- Elvis Costello
  18. HD-DVD - dark data by openright · · Score: 2, Funny

    Interestingly, This Slashdot article is shown to me with advertisement for HD-DVD, which has a data format "forgotten" by design.

  19. Come now... by Cauchy · · Score: 1

    We are unraveling history using models of mitochondrial dna genetic drift using data collected across the planet, and archivist as concerned about future generations not having Office 2003 compatible software? Ok, so the making it broadly available and searchable to current generations may be a challenge, but they can't seriously be concerned about future researchers not being able to read our data formats. I suppose we should be concerned as to whether the physical media will survive, but I doubt we need to worry about our computer illiterate progeny being able to figure these things out.

  20. The solution by alexwcovington · · Score: 3, Interesting

    In this era of virtualization, the solution for x86 software is as easy as retaining a copy of the primary partition of a computer originally used to work with the desired files. Searchability could be a problem for proprietary data formats, but the move to open standards in the future will mitigate that.

    The real problem is 60 years of archives of antiquated, proprietary, task-spcific and mainframe computer data cards and tapes whose original programmers are halfway to cedar boxes; if the government can't get their support in time it may as well call all the early stuff a loss and hand it over to archaeologists.

    --
    (It's never too late to join the Renaissance)
  21. IT people. by justkarl · · Score: 1

    'Your problem is so big, it's probably stupid to try and solve it.'"

    Sounds like general end-user hate crime to me. Hey, I've been guilty many times of shunning a user because I didn't feel like fixing his stupid problem.

  22. Easy To Do by Conception · · Score: 1

    "how to store digital files so future generations can access them"

    Quite simply, you don't store them in one format. Just move everything every 10 years or so. In fact, with Moore's Law and all, you will probably be able to store everything you had before in 1 of whatever is new 10 years later. Hire some part timers to move it or something. It's not a hard problem. It's just an inconvenient one.

    1. Re:Easy To Do by Anonymous Coward · · Score: 0

      It's pretty tricky to do that when you're dead and no-one else cares about preserving your writings for history because of the famine/epidemic/civil war/etc...

  23. It's whether it's WORTH it by pclminion · · Score: 4, Insightful

    It really isn't a question WHETHER we will be able to read old digital data in the future. After all, humans invented these formats, flawed as they may be, and humans can decipher them with enough effort. We can crack cryptography -- a deliberate attempt to make it as difficult as possible to decipher certain information. So it's hard to imagine any data format that could not be deciphered in the future with some honest effort.

    Instead it is a question of whether the data is WORTH the effort. From an anthropological standpoint, this is valuable historical data, and its value is not decreased by our inability to interpret it. The benefit of digital data is that it can be copied even if we don't know what it means. It will not erode or decay like other historical artifacts, if we put in the small effort required to preserve it. Assuming humanity doesn't self-destruct, there will be plenty of time in the future for historians to decipher and interpret the data when a need arises for it.

    1. Re:It's whether it's WORTH it by anthony_dipierro · · Score: 1
      It really isn't a question WHETHER we will be able to read old digital data in the future.
      Instead it is a question of whether the data is WORTH the effort.

      I think you hit the nail on the head there. Of course, for the stuff that we *know* is going to be important it's even easier than that. Each time you upgrade technologies, hardware or software, you convert the data. With the advent of cross platform networking things get a lot easier, too. Today you can transfer data from any device to any other, so long as each device can be hooked up to a computer with Internet access.

      The benefit of digital data is that it can be copied even if we don't know what it means.

      That has its advantages, but in some ways that's actually the problem. People copy the bits and think they're done, but if you really care about the information you should convert the file format too (as you point out, not doing so doesn't make it impossible to retrieve the information, but it makes it harder, sometimes much harder).

    2. Re:It's whether it's WORTH it by icepick72 · · Score: 1

      Assuming humanity doesn't self-destruct
      Why, amongst every great post, does there usually exist a reference to humanity destroying itself? The topic could be anything about the future, and then Bang! ... there it is. I just realized now that I see this quite a bit. This is not negative feedback about the parent which I think is a really good post. I just see that nagging reference again.

    3. Re:It's whether it's WORTH it by David+Nabbit · · Score: 1
      Why, amongst every great post, does there usually exist a reference to humanity destroying itself?

      It's probably better than every post including a reference to how humanity will be destroyed by alien invaders from outer space. Space invaders, if you will.

      --
      "Her idea of wit is nothing more than an incisive observation humorously phrased and delivered with impeccable timing."
    4. Re:It's whether it's WORTH it by pclminion · · Score: 1

      I mentioned it only because of the supposition that humanity would have essentially infinite time to decode old data. I really doubt our time in this universe is infinite, and given all the possibilities for the death of humanity I think self destruction has fairly good odds. That might be far, far into the future. But since I was postulating an infinite I figured I would cover my ass.

  24. There is software available to do it! by Nautica · · Score: 1

    NuParadigm is a company that has software that does scanning, indexing and worklflow. I used it at WashU and it is terrific, They call the softare DataFlow.

  25. Open Office is the Answer by fernandoh26 · · Score: 0

    Open Office Docs, FTW!

    --
    Chums up, let's do this!
  26. Open formats by Anonymous Coward · · Score: 0

    I think using open formats are good. And standardized formats and well documented formats.
    Plain text files (.txt) is very safe. :)

  27. Extra irony points. by Kadin2048 · · Score: 4, Insightful

    I believe Ray Bradbury had something to say on this subject.

    Perhaps more ironic -- it's a pretty good bet that whatever he wrote on the subject, it's not available online due to copyright restrictions imposed by his publisher or "estate."

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
    1. Re:Extra irony points. by kfg · · Score: 4, Funny

      Go to the library while you still can and memorize it. Buy camping gear.

      KFG

    2. Re:Extra irony points. by themindfantastic · · Score: 1

      Oh its available online, it just wasn't put there with permission... if there is one thing that piracy will do is in essence preserve stuff for a long time (albeit perhaps lost in some batch of burned optical discs) but its the question what is being saved via piracy that is in question. I am not sure I would care to watch 1,000 year old porn, probably a little too tame.

  28. Luxury problem... by Kjella · · Score: 1

    There's an infinite amount of trivia that could be recorded. We could all go around recording "my life in HDTV" recorded at 900GB/hr uncompressed, but it just wouldn't be meaningful. Sure, a certain sample of "everyday life in $foo" is useful, but on the whole who cares. And with digital media, this should be simpler than ever since you with proper redundancy should never experience data loss. Obscure image format? Find a decoder, store is as PNG. Yes, it'll be a lot bigger but you'll never have to worry about lost data from the original or keeping support for a kazillion old formats. You just have to be slightly critical, and don't stretch yourself so thin you could lose something actually important. I refuse to believe that everything we do now is so much more "important" than people 50 years ago or 100 years ago. You ahould be able to do more than ever before, and perfection can never be reached. For example, I could say "show me an untouched part of the $foo forest untouched by human hands". Biologists and whatever would love it. Suddenly you're not talking about 900GB*8billion and blew right off the scale. What's "important"? News media, encyclopedias, wikipedia etc. all screen stuff (well, you can put *almost* any trivia on wikipedia). If a bear shits in the woods, and noone gives a crap, why keep a record of it?

    --
    Live today, because you never know what tomorrow brings
    1. Re:Luxury problem... by HBI · · Score: 1

      I think "The Collected Bowel Movements of HBI" would be a worthwile addition to any library.

      --
      HBI's Law: Frequency of calling others Nazis is directly correlated with the likelihood of the accuser being Communist.
    2. Re:Luxury problem... by Anonymous Coward · · Score: 0

      We could all go around recording "my life in HDTV" recorded at 900GB/hr uncompressed, but it just wouldn't be meaningful.

      Are you sure? All that data could potentially be combined one day to create a massive totally immersive virtual reality environment, which would be of immense interest to historians and 'time tourists'. Things you saw at the time might appear mundane and repetetive, but assume considerable importance later, or at the very least add to the atmosphere and veracity of the simulation. Some of the data might never get used or be relevant, but you can never be sure exactly which bits that would be.

  29. DRM by Joe+The+Dragon · · Score: 1, Redundant

    This way DRM is bad as it can make data hard to read many years later.

  30. simple by insertwackynamehere · · Score: 1

    Create or choose a lossless, unencrypted format that fits with each type of file. Make sure they are always supported with free libraries and utilities. Also, find a type of format that can shrink the size of files (like zip or something)

  31. CDs by anshil · · Score: 1

    Can somebody explain me, how much CDs decade? I thought they were pretty much sealed... except that exotic muchroom, that eats the silicium layer... (an even then, altough *we* can't read it now, with a laser, the information should still be there in the plastic...)

    --

    --
    Karma 50, and all I got was this lousy T-Shirt.
    1. Re:CDs by lethalwp · · Score: 2, Interesting

      Afaik, cds are the worst media to 'backup' your precious data.

      The first burnable cds you could buy (in the 90ties) were of a decent quality, i still have some burned ones around, and they are still readable (older than 10yrs).
      But some newer ones (cheaper, & mass-marketing 'mode') are of an awful quality: i have plenty that "died" when reading them: it begins with some bad CRCs, and then more & more & more, till nothing valuable can be read off it. This happened in LESS THAN 2 YEARS.

      The problem with cds:
        - They hate sunlight
        - they hate being in a too hot, or too cold place
        - they hate being in a place with too much/not enough humidity
        - and the worst: they react with air (oxygen).

      It's build with a 2mm plastic, the dye is on top, with some 'protective' layer over it. Some are better than others.

      Now with DVDs, they seem to be from a much better quality already, the explanation is simple: the dye isn't on the surface anymore, but between 2 slides of plastic glued together. The reaction with air seems to be insignifiant. Atm, i have no single failing DVDR that i know.
      But some brands are of better quality than others.

      And btw:
      "Real men don't use backups, they post their stuff on a public ftp server and let the rest of the world make copies." - Linus Torvalds

    2. Re:CDs by Doctor+Memory · · Score: 2, Interesting

      Pressed/stamped CDs (like commercial audio CDs) age fairly well, given appropriate handling (well, at least my 20yo copy of Greetings from Asbury Park, NJ is still playable). Recordable CDs, however, aren't stamped. Instead, they use a phase-changing dye. Some of the earliest used a blue dye (cyananaline?) that wasn't stable and degraded after just a few years (10). Even discs with better dyes are sometimes not sealed properly and can go bad.

      That said, there are some newer dyes that are claimed to be stable for a hundred years. I haven't ever seen these in stores, so they may be seriously expensive, or maybe I just don't know where to shop... ;)

      --
      Just junk food for thought...
    3. Re:CDs by Anonymous Coward · · Score: 0

      Standard mass produced CDs will last a maximum of 33 years. I recall reading of outgassing at the foil-polycarbonate interface.

      More detailed thoughts on the subject:

      http://www.clir.org/PUBS/reports/pub121/sec4.html

      http://hardware.slashdot.org/article.pl?sid=05/02/ 05/0024258&tid=198

      http://www.itl.nist.gov/div895/gipwg/StabilityStud y.pdf [PDF]

      http://www.jts2004.org/english/proceedings/Carou.h tm

      http://www.mscience.com/longev.html

      And the Google search that lead to these links:

      http://www.google.com/search?q=CD+degradation

  32. To Be Honest... by swatward · · Score: 1

    How much of this stuff really has such high priority. I'm pretty sure I wont want people looking back and finding old myspace blogs and thinking... "Wow everyone 1000 years ago deserved to die."

    The good stuff will get saved, the bad stuff, who cares?

  33. But... by Anonymous Coward · · Score: 0

    ...XML of course. XML solves all of the world's ills!

  34. Stuff I can't read by Animats · · Score: 2, Interesting
    Media I actually have useful data on:
    • MacOS floppies. (Maybe on an older Mac.)
    • MacOS-only CD-ROMs. (Could be read on a Mac, if I still had one.)
    • 4mm DAT-II tapes from NT systems compressed with HP's hardware compression. (I still have a drive for this.)
    • 1600BPI 9 track open reel magnetic tape, UNIX TAR format. (I managed to get that copied before the last 9 track drives at Stanford died.)
    • 8" floppies for the IBM Series/1 minicomputer controller for the IBM RS-1 industrial robot. (Not really very useful at this point, but it would be nice to look at that work again.)
    • IBM PC/AT 5.25" high-density floppies in compressed Fastback backup format for DOS. (Years of DOS work, now obsolete)
    • 8" floppies for the Marinchip 9900 (A small theorem prover, in Pascal)
    • UNIVAC UNISERVO steel tape, 8 tracks, 200bpi, written on an UNIVAC UNISERVO IIA on a UNIVAC 1107. (A compiler I wrote as an undergraduate, plus some very early 3D graphics software.)
    1. Re:Stuff I can't read by Anonymous Coward · · Score: 0


      # MacOS floppies. (Maybe on an older Mac.)
      # MacOS-only CD-ROMs. (Could be read on a Mac, if I still had one.)


      The former, if 1.44 mb, can be read on a PC using the right software. ARDI Executor (68K mac emulator for x86) was able to do it at some point.

      MacOS CD-ROM could be read under Linux, I believe.

    2. Re:Stuff I can't read by caseih · · Score: 1

      I've always migrated my data every time I changed formats or computer platforms. Thus I still have intact an exact copy of my hard drive from 15 years ago on my hard drive and on CD-ROM. I also have all my data from the beginning of my computing days (1981) copied from floppies (5.25" and 3.5", neither of which are usable on my current PC) which I brought with me. The original floppies are long gone. So the moral of the story is to never rely on any one medium for long-term archival. Always plan to copy the data and bring it with you. One should never expect to be able to go back to some stored media format that's more than a couple of years old and read it.

    3. Re:Stuff I can't read by timeOday · · Score: 1

      The problem of various hardware formats is shrinking, not growing. New technologies start with a flourish of innovation and then settle down. Most of those media you mention were never created in numbers of more than a few million. Compare that to the CD, which is 25 years old and going strong, with millions of new CD drives (now also capable of DVD storage) sold every year. And the total number of media (discs) in the billions.

    4. Re:Stuff I can't read by The+Good+Reverend · · Score: 1

      But you still have it. One important part of archiving involves moving your data to newer media when you think the old media will become impossible to read. This won't solve all long term problems, certainly, but moving all that data to hard drives, DVDs and Tape Drives will make the next time you have to do this much easier, since next decade you'll only need to access 10 year old media instead of 20-30 year old media.

  35. isn't XML supposed to address this problem? by Anonymous Coward · · Score: 0

    i thought XML is the interim cure for this problem.

    also, think of the gee...what do i say, "exobytes" worth of new data that will be created in the next 100 years. does anyone really think there will be a strong interest in dredging up and analyzing all of today's mostly circularly repeatative drivel? some of it, perhaps a snapshot from the past could be preserved, but why everything??

    how often to you think about accessing your great, great, great, great grandfather's love letters?

  36. Speaking of trash... by csoto · · Score: 2, Funny

    I wonder what archaeologists will think of the Zune :)

    --
    There exists no way of exchanging information without making judgments. --Bene Gesserit Axiom
  37. UK/BBC Domesday book by bLanark · · Score: 2, Interesting

    It happened recently. When I was a lad, the BBC and UK schools composed a "domesday book", which was supposed to be a parallel to the original Domesday book, which was a bit more than a cencus from the UK made in 1086.The modern one used the popular home PC the BBC Micro (made by Acorn). It was made on laserdisk, and distributed around the UK to the schools that had compiled the information.

    Well, 15 years on, it was useless. The then-proprietary format was not readable on anything modern, and there was not much of the old hardware around either. You can google for it ("UK domesday bbc data" should do it), the first link I saw was on the Guardian Online.

    I've still got stuff on floppies, but no-one builds PCs with them anymore. I've got two old laptops with floppy drives, the other three computers have none. (OK, I also have two corpses with floppy drives, and the controllers on two of the new PCs will accept floppy drives, but, please take my point - they're going out of fashion.)

    In 20 years time, there will probably be no CD/DVD drives, we'll all be using a new more portable, more backupable, lighter, faster, probably online-only storage medium. Kids won't recognize laserdisks, floppies, or USB ports. They might not recognise keyboards either - who knows?

    --
    Note to ACs: I won't mod you up, even if you are being funny or insightful. So take a chance! It's not real life!
  38. Ummmm... by Anonymous Coward · · Score: 0

    Did you happen to eat any of that exotic mushroom? Your thoughts are barely coherent.

  39. Coming Soon... by Anonymous Coward · · Score: 0

    XML on paper tape.

  40. Aw crap! by csoto · · Score: 1

    The era of restoration comes. However, when people blow the dust off those old DVDs and players, they discover that the DVDs have decayed to the point of unreadability. Massive quantities of archived data and knowledge are irretrievably lost.

    There goes my copy of Just Like Heaven! Oh the humanity!

    --
    There exists no way of exchanging information without making judgments. --Bene Gesserit Axiom
  41. Doesn't seem that hard, but scale is massive by Burnin'+Bush · · Score: 1

    I have a HD specifically allocated for "stuff I plan on keeping forever". I limit it to one of: pdf, tiff, jpg, gif, html, wav, mp3, and plain txt files. The HD is FAT-32 formatted and reads and writes nicely both from my OS-X Mac and Windows-XP PC. On the mac I have a program (graphicConverter) which will, among other things, do batch converts. In a single command, I can convert *.xxx to *.yyy. For example, convert every single tiff on the hard drive to a pdf.

    While it might be many days of crunching, it would seem that should some format be on its way out, or some new format prove itself to be the "way of the future", there will be programs to convert *.one to *.theOther. It might take a lot of cpu time, but that is not a big deal. (For example, I just recently converted 300+ GB of .wav files to circa 30 GB of poor quality MP3s (so that I could take my ENTIRE music library on vacation with me and not lug around the big hard drive, and this took about 4 days of background CPU time)!

    Nevertheless, I cannot imagine there will not be a simple capability to convert *.one to *.theOther, on a giant scale if necessary.

    This is not like the project I did a couple of years ago, where I converted my reel-to-reel tapes to digital format. That required a massive PHYSICAL effort, mounting reels, monitoring the conversion, etc. Once in digital format, converting to new formats, copying to new kinds of storage mediums, whatever I can imagine in the future, will now be as simple as dragging from one icon to another.

    So why are we worried? Is this just FUD?

  42. persistence by dgmrdt · · Score: 1

    I know this doesn't answer the format question, but the media problem can be solved by having multiple copies "in the ether".

    I reproduce all my (and several other people's) data on several different machines in different geographic locations, doing it efficiently with "rsync" (and other free tools).

    Hard disks come and go, optical and magnetic media fail with time, but the strategy of multiple copies keeps things safe. When was the last time you had 4 machines fail simultaneously in 4 different parts of the country?

  43. CVS? by Anonymous Coward · · Score: 0

    Posting anonymously from Redmond. I think CVS is a good answer. Setup a server, do regular backups, and you're done. Sure, the DB grows as documents change (esp. those not in a text format for diff to work), but all your data is there. You will have to buy a new hard drive every now and then for backups, but your data is safe. If you need security as well, use ssh for the CVS connections, and use some partition encryption program. I wouldn't encrypt the whole partition though, as it might be seen as trying to hide something suspicious from the authorities.

  44. Broadcast it! by calzones · · Score: 1

    Why not just broadcast all data out into space. Maybe we can set up a relayer way far away and bounce it back to earth and back again indefinitely.

    --
    Asking people to think is like asking them to buy you a new car
    1. Re:Broadcast it! by rbochan · · Score: 1
      ...Maybe we can set up a relayer way far away and bounce it back to earth and back again indefinitely.

      Dammit if I didn't read that as realplayer and just about puke.

      --
      ...Rob
      The American Dream isn't an SUV and a house in the suburbs; it's Don't Tread On Me.
  45. Reverse engineering by wkitchen · · Score: 2, Insightful

    Open and widely published formats are good, of course. But if you're looking for a really long term solution (as in multiple millennia), then I think the prime requirement other than physical durability should be easy reverse engineering. This way the data has some hope of recovery even if the knowlege of the format has been lost. This generally means that simpler is better. Things like plain ascii text. Uncompressed and unencrypted image and/or audio data. Verbose ascii based vector graphics. Things like that. Put it all on a durable, low density, and simply formatted media that will easily give up its secrets to relatively low-tech and completely non-specialized tools like a microscope. It's not the most efficient way to store data, but it's much more likely to be useable by future archaeologists than things like MS-Word files, WMA files, JPG's, MP3's, etc.

    1. Re:Reverse engineering by Max+Littlemore · · Score: 1

      Don't forget to place your low-density media inside a stone box with an ASCII/binary lookup table carved into the lid.

      --
      I don't therefore I'm not.
    2. Re:Reverse engineering by Jesus_666 · · Score: 1

      Additionally, include descriptions of popular file formats. That way your medium can be used to decypher further information - for example compressed images on a second instance of your medium of choice.

      --
      USE HOT GRITS WITH STATUE OF NATALIE PORTMAN (NAKED AND PETRIFIED)
    3. Re:Reverse engineering by gorodish · · Score: 1

      Check out Norsam's HD Rosetta product (http://www.norsam.com/hdrosetta.htm). An inert nickel plate, etched with a focused ion beam. It's supposed to last 1,000 years, and it's readable with a microscope.

  46. Obligatory quote ;) by kosmosik · · Score: 2, Funny

    Backups are for wimps. Real men upload their data to an FTP site and have everyone else mirror it. -- Linus Torvalds

    1. Re:Obligatory quote ;) by Dachannien · · Score: 1

      This may be a funny quote, but it also demonstrates tremendous insight.

      One of the main reasons we see time and again that X is lost to the ravages of decay is because X is present only in a static medium. Eventually, your vinyl records, your 8-track tapes, your cassettes, and yes, even your CDs, DVDs, and Flash ROM will fall victim to increasing entropy.

      On the other hand, the 'net has become a marvelous dynamic archival tool. Data important to the public interest is continually shifted through a variety of physical media and logical formats through the use of peer-to-peer technologies. While dynamic archival runs the risk of failing upon occasion due to disinterest or disaster, it tremendously increases the likelihood of having some static medium contain a piece of information when the dynamic archival does fail. As long as someone is interested in some part of our culture, and someone is interested in sharing what they have, that part of our culture will be preserved dynamically, transcending format and medium intrinsically.

      This is why librarians and archivists have stood up against DRM at every turn. If only librarians and archivists had the money that the content cabal has.

  47. Open Standards by RAMMS+EIN · · Score: 1

    This is one of the reasons open standards are important. Not that open formats last forever, but at least they are documented, which means there's some hope of deciphering them after the software that does so is no longer maintained. Of course, that doesn't solve the problem of how to make the actual data survive...hard disks and tapes demagnetize, optical disks become translucent or otherwise unreadable, etc.

    --
    Please correct me if I got my facts wrong.
    1. Re:Open Standards by SgtChaireBourne · · Score: 1

      And also one of the reasons why it is important to get said open standards into daily usage. In your page's call to action, it would help to name specific open standards. OpenDocument, iCalendar, IMAP, and POP come to mind from the examples given there.

      --
      Beta is broken and the link to classic doesn't work. Stop wasting our time or there won't be anybody left here.
  48. Provide an example source reader by Kadin2048 · · Score: 1

    I think the best solution currently available, is to include with each copy of your data (or on each backup volume) some source-code implementation of a document reader or parser, in a commonly understood and well-documented language, probably ANSI C (although Ada has all of its documentation in the public domain, so you could include it as well).

    This wouldn't help you if you expect people to lose the ability to read the media that you're storing the data and source code on, but that's a much more complicated problem. At that point, you're really talking about stone tablets or metal engravings, rather than backup tapes or CDs.

    In terms of practical solutions, ensuring that there are source-available readers, written without external dependencies (besides a compiler), for various document formats, is probably the best way to ensure that they'll be readable. Somewhere else in this thread, someone gives an example: storing a source copy of a GPLed RAW-file processor, on each CD containing RAW images. This seems like a very good idea: assuming that your eventual user can read the media, even if their machine architecture is different and readers don't exist, they have a solvable problem: either find a compiler for their architecture and build the program from the provided source, or use the source code as documentation, to build a compiler in a 'modern' language that can be compiled. The only weakness here is that the language might become a 'lost art,' but that's difficult to avoid. (You could provide documentation on the computer language in a natural/human language, but then you have the same problem of indecipherability of the human language; and ultimately I think a computer language is probably easier to puzzle out than a natural one is.)

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
  49. as long as the United States remains a republic by Anonymous Coward · · Score: 0

    At the rate we're going, that's what... about 12 more months?

  50. Store data in humans by Anonymous Coward · · Score: 0

    Humans should become data storage devices - some similar way, as DNA is stored - this way, the data will stay with us, as long as we are here.
    Nothing beyond that really matters (to us).

  51. How Ironic... by Chabil+Ha' · · Score: 1

    "Will an entire era of human history be lost?"

    How ironic that in an age where we have the highest capability to preserve our history, it can become obsolete in a matter of decades. Take the 5 1/4" floppy disc. Assuming that the disk didn't loose it's magnetically bound data, I would be hard pressed in 2006 to find a drive that couuld read it. I don't even have a 3 1/2" drive anymore.

    Another example. My father has a magnetic real from the 30's with a radio recording of my great grandfather. We have no idea how or where we can get a copy of it on a media that we can use, like cassette, CD, or MP3. Who knows what else is quickly evaporating from our ability to use anymore.

    What a shame...

    --
    We're all hypocrites. We all have hidden parts, it's the contrast between them that make us more a hypocrite than others
    1. Re:How Ironic... by mspohr · · Score: 1
      Just the other day I needed to access some files on a 5.25 floppy and couldn't find a drive anywhere... fortunately the customer had a drive in storage to read the files.

      BTW, this customer is still running a copy of software I wrote in 1988 that only needed updating because a required government form changed.

      --
      I don't read your sig. Why are you reading mine?
    2. Re:How Ironic... by smoker2 · · Score: 1

      I have a compaq 8086 with *twin* 5 1/4 " drives, and it still runs ! I even have a copy of Norton Disk Commander, from way back when Norton were relevant. I haven't checked for a while though, so maybe the green screen has died.

    3. Re:How Ironic... by toddestan · · Score: 1

      The other half of the challenge, of course, is to get the data onto a modern machine. Sure, your 8086 can read the files, but if it can't write to a 3.5" floppy disk, the data is kind of stuck on it. You might think you're clever, and try to transmit the data over the serial port - until you find that while you can download an old DOS terminal program off the internet, you have no way to get it onto the old PC. This even becomes a worse problem with machines like old Commodores, and even more with exotic stuff.

    4. Re:How Ironic... by smoker2 · · Score: 1

      Well, I haven't done it for a while, but the last time I did it, I just put a 5 1/4" drive in the new machine. I had a really basic darts game that I transferred to a 3 1/2" floppy and took to work where I ran the game on a much newer pc. Not all games work however, most of them are tied to clock speed, and become unplayable on a modern system.

    5. Re:How Ironic... by ColaMan · · Score: 1

      Never used laplink? You could use a serial cable and keyboard redirect command (using MODE, if I recall correctly) and it could transfer a small bootstrap loader, then copy itself over to the other PC.

      Used to do it all the time - it was easier than copying laplink to a floppy.

      Of course, if you're on some sort of exotic hardware, the same principle applies - pipe data out a port somewhere, even if you're just toggling a data line, then import on the other PC using a serial interface (or some such) to keep an eye on the transitions on that line.

      --

      You are in a twisty maze of processor lines, all alike.
      There is a lot of hype here.
  52. Yes it will be lost. by Anonymous Coward · · Score: 0

    Yes, it will be lost.

    Just like it has been lost before.

    The human race has been more advanced then we are now, and all records have been lost becasue they were stored in a format that became unreadable after the great war.

  53. Well preserved by ambivalentduck · · Score: 1

    Krikey!  This one looks well preserved!  Nameless graduate student, carefully and tediously remove the volcanic ash from this "ard rive."  Yes professor!

    (months later)

    Finally, after 6 long years of restoration and figuring out what the *metal* conductors on the back were for, we can plug the "ard rive" into our nanocomputer and get a glimpse into the life of humans from 2006!!!!

    This is strange, all I can find is a massive and bloated operating system called "Windows" that keeps crashing and a huge folder labeled "pr0n."  I wonder what's in it...ZOMG!  Now we know why they were all killed in 2007 during WW3.  Silly pacifist hippies sat around whacking it and didn't by guns!

    I'm sure to get the Bush Peace Prize for this one.

  54. For as long... by lelitsch · · Score: 1

    "for as long as the United States remains a republic."--So, what does he want to do with data created after 1/21/2001?

  55. An archive already exists... by drgroove · · Score: 1

    It's called "Google"

    1. Re:An archive already exists... by cdrguru · · Score: 1
      Sadly, this is probably a pretty popular answer.


      There are so many things wrong with this that I can only barely scratch the surface of the problem. But, just a few things to think about.

      • Google is an index, not an archive of anything. If the original material goes away, it will be inaccessible. It might not be cached at all, and even if it is, the cache isn't a permanent repository.
      • Google is a advertising service. All of their revenue comes from presenting advertising to people using their web services. Do we really want to place all of the keys to history and knowledge with an advertising company?
      • Google is a commercial enterprise with "permanent archival" nowhere in its charter. The moment they find a good business reason to remove information from their system, they do it. Should they at some point decide that indexing "unpopular" pages is not profitable, they will stop. The moment they decide that indexing things which are otherwise indexed by something else is not profitable or beneficial to their business, they will stop.

      Doesn't sound like any sort of permanent archive that I would trust.

    2. Re:An archive already exists... by drgroove · · Score: 1

      You have a point... I should have specifically referenced the Google Cache in my initial post. Though, I'm not certain what the parameters are around the cache - i.e., how long it's stored for, when it's updated, etc.

  56. Heh by ipooptoomuch · · Score: 1

    Step 1: Hire hundreds of Chinese people that are unemployed. Step 2: Buy 1000's of cheap notebooks (PAPER, not a laptop) Step 3: Write 0's and 1's down on said notebooks with pen.

  57. Digital Archives by yar · · Score: 1

    Thibodeau and the rest of the people at NARA have been thinking about this problem for awhile, as have other researchers around the world. If you're interested in such things, there are a few places to start looking.

    CAMiLEON http://www.si.umich.edu/CAMILEON/
    Cedars http://www.leeds.ac.uk/cedars/
    InterPARES http://www.interpares.org/
    DSPace http://www.dspace.org/

    Lockheed Martin won the NARA contract to develop the Electronic Records Archives.
    http://www.archives.gov/era/acquisition/option-awa rd.html

    After hearing them talk about it at the Managing Electronic Records conference (http://www.merconference.com/), I'd say they have a few things to work out yet... but these are important questions for the preservation of history, culture, and more. These questions also involve authenticity, the value of evidence, and more...

  58. as long as the us remains a republic? by nurb432 · · Score: 1

    Cool, someone got it right for a change.

    --
    ---- Booth was a patriot ----
  59. Online storage by djeca · · Score: 1

    Today's data is different from previous generations' in two ways:

    1. It's digital
    2. The machines that produce it are networked

    1. means that you can copy it as many times as you like without errors creeping in.
    2. means that when you buy a new computer, you just hook it up to the old and copy all your existing data across; Moore's Law means that the old data will occupy a tiny corner of the new computer's hard disk (or whatever the future storage device is).

    Of course, this only works if you /do/ store everything on hard disk. But with storage prices where they are nowadays, a couple TiB is pretty affordable, and I don't see many people generating that amount of data any time soon.

    The problem is, this requires a fairly major rethink from the POV of archivists, who are used to storing things on physical media and hoping they don't degrade. Time to learn - the way to assure long-term survival of data is multiply redundant on-line storage. Physical media are useful solely for sneakernet and short-term disaster recovery.

  60. Books of the Bible were hand copied... by mollog · · Score: 1

    The books from the early days of the written word were hand copied and translated. This was done to preserve them because the books wore out from use. Often, especially in the case of books of gospels, they were edited in an effort to keep them in the current idiom. Today, we struggle to find the earliest works so that we can know more about the original author's intent.

    I suspect we are facing a similar situation with archival of the mass amounts of information/data that we are now making; the very first bits of source code, even the first copies of compiled code, are scarce. And some of it is getting corrupted. Early UseNet stuff is getting harder to dig up.

    For many reasons, I hope this early work is preserved.

    --
    Best regards.
    1. Re:Books of the Bible were hand copied... by arminw · · Score: 1

      .....Often, especially in the case of books of gospels, they were edited in an effort to keep them in the current idiom. Today, we struggle to find the earliest works so that we can know more about the original author's intent.........

      What is interesting about the Bible and what surprised scholars when the Dead Sea scrolls were compared with existing copies of the same material, was the amazing agreement of these with the existing multiply copied ones. In the case of the Bible at least, the copying errors were much smaller than might be expected. None of the errors that did creep in affected the message in a material way.

      Any technology that needs more than our human's built in senses to convey information would have to be used to keep that information current and accessible to our brains. Symbols on paper or other durable medium do not face this hurdle. The problem of deciphering the symbols remains however. We cannot read ones and zeroes, let alone make sense out of them. Conclusion: Print stuff out you want to keep for future generations. Doing this forces us to decide what to keep and what to forget. The sordid e-mails of Mr. Clinton need no print outs.

      --
      All theory is gray
    2. Re:Books of the Bible were hand copied... by Anonymous Coward · · Score: 0

      The older Jewish texts were and there were strict rules about how to copy them exactly without change that was cultural but only applied to those documents. The gospels were a different story since all the early versions of them differ greatly.

  61. USA is not a republic by cbraescu1 · · Score: 0, Offtopic

    USA is not and never was a republic. It is a federation of states.

    --
    Catalin Braescu
    Ofaly.com
    1. Re:USA is not a republic by cybrzndane · · Score: 2, Informative

      republic (plural republics)

      1. A state where sovereignty rests with the people or their representatives, rather than with a monarch or emperor; a country with no monarchy.

      http://en.wiktionary.org/wiki/republic

    2. Re:USA is not a republic by Anonymous Coward · · Score: 0

      The U.S. was a federation of states briefly under the Articles of Confederation and Perpetual Union. Under these Articles, each state made its own decisions; only projects that had support of the representatives of the people got funded and supported. This made central governmental control impossible, as well it should be in a confederation of states.

      The federation of states was dissolved by the treasonous congress of representatives--that body who was responsible for upholding the Confederation, but who wanted to wield that central control--who with their unchecked power to make law, conquered every member nation at a stroke by betraying each: creating a centrally-controlled republic instead. In the words of the dissenting minority ...the new government will not be a confederacy of states, as it ought, but one consolidated government, founded upon the destruction of the several governments of the states."

      The "We the people" rhetoric--that's all it was--didn't last 20 years.

      Several of the former states about 70 years later--for some good reasons, some reprehensible ones--withdrew from the "union of states" that was anything but, and again formed a federation of soverign states, each responsible for its own affairs. However, each of its member states was later invaded--conquered and annexed by force--by that same centrally-controlled government that had previously called each of the bodies "States" and indeed continues hypocritically to do so.

      Nothing seems to have been historically learned from this, and the process is repeating itself with what one would hope would be alarming similarity--for example, in the referenced article, the phrases "independent, democratic member states" and "confederation of independent states" appear prominently despite the body's ability to pass binding federal law upon member "states."

  62. This is hardly a new topic. by jc42 · · Score: 1

    This very problem was, in effect, a major motive for the long development of the SGML standard, and its special cases like HTML and XML. It's also part of the thought behind acronyms like ASN-1 and UTF-8.

    The difficulty of decoding computer files even a few years old is a problem that dates from the earliest days of computers. Programmers have been battling this problem all along. And we know a lot of solutions.

    The only problem is getting people to use the solutions. This means fighting the natural tendency of management to discourage anything not aimed at improving this quarter's bottom line.

    --
    Those who do study history are doomed to stand helplessly by while everyone else repeats it.
  63. Mixed-up title for TFA by Traf-O-Data-Hater · · Score: 1

    I believe the author of TFA has a mixed metaphor there. His title ought to be Digital DARK Age, not ICE Age.

  64. I've solved this problem for my own needs... by rickb928 · · Score: 1

    ... I have about 15 years' worth of e-mail (ok, so I'm wierd), including all that spam until about a year ago (and I'm a packrat, so sue me). Saved in text files, exported from everything from pine to Outlook and many in between. This is all readable for a while.

    I had documents in WordPerfect that I took the time to batch convert to Word for Windows 2, then to Office whatever. I may convert them to ODF pretty soon. Yes, some formatting is lost, but the content is there.

    And I currently have my archives on a USB/Firewire hard drive, CD-ROMs, and 4MM DAT. I'll be going from CD to DVD, and the 4MM will probably go to a DV via camera. And I need some other hard drive soon, this one is 3 years old and due to go 'click'.

    The solutions include:

    - Multiple physical media, and some file management.
    - Converting to new file formats every few years.
    - Occasionally jettisoning the true crap.

    But, perhaps a spam library from the early days is not 'crap'? I got spam from my AOL account, my first FIDO account, and my first 'Internet e-mail' account. I've had my own SMTP server for 10 years, and my account is at least 12 years old. I get a lot of spam.

    There is no single solution. Cover all the bases. Hard drive interfaces will become obsolete, CDs will give way to Blu-Ray, and even 4mm DAT will some day become unknown.

    And I'm glad my wife doesn't know how to add up all this storage. She'd call it a waste and make me delete it all. pfft.

    -rick

    -ps, I already deleted 11 years' worth of old Pr0n. Sorry. I got married.

    --
    deleting the extra space after periods so i can stay relevant, yeah.
  65. I'm doing my part... by dghcasp · · Score: 2, Funny

    I'm doing my part by working on a project where I'm copying every single MySpace page onto stone tablets.

    When future archeologists dig them up and see "LOL Bobby Ray Sucks!" and "D00d 1 pwnz3r3d U!!1!", they'll understand that our civilization didn't just decline; our only choice was to destroy ourselves because we were so lame.

    1. Re:I'm doing my part... by Anonymous Coward · · Score: 0
      When future archeologists dig them up and see "LOL Bobby Ray Sucks!" and "D00d 1 pwnz3r3d U!!1!", they'll understand that our civilization didn't just decline; our only choice was to destroy ourselves because we were so lame.


      http://en.wikipedia.org/wiki/Amusing_Ourselves_to_ Death

      http://en.wikipedia.org/wiki/Amused_to_Death

      And when they found our shadows
      Grouped around the TV sets
      They ran down every lead
      They repeated every test
      They checked out all the data on their lists
      And then the alien anthropologists
      Admitted they were still perplexed
      But on eliminating every other reason
      For our sad demise
      They logged the explanation left
      This species has amused itself to death
      No tears to cry no feelings left
      This species has amused itself to death
  66. It is a solved problem by Colin+Smith · · Score: 1

    It's a process, not a product or solution.

    Keeping old hardware and software round is a non starter. I've been there, it's expensive and unreliable. Instead, convert the data into an open standardised format, ASCII if possible or something simple otherwise. Then put a process in place to move the data from old to new storage media, keep it on the live media.

    Automate it if you can. There are also data lifecycle products which'll manage it for you.

    --
    Deleted
  67. Transmac by arete · · Score: 1

    I don't know about the even older stuff, but for the mac, there was freeware - Transmac I believe - that lets you read MacOS floppies on a Windows machine. (I was using it on NT4, but I'm reasonably certain it or an analog is still around..)

    And I'm reasonably certain a newer mac without a floppy can still read the old floppies if you get a USB floppy drive for it.

    --
    Looking for freelance Actionscript (Flash/Flex) or ColdFusion work and/or freelance developers. Email me, put Slashdot
  68. What I Do by RAMMS+EIN · · Score: 1

    What I Do:

    I run my data through a program that spits out a Whitespace program that generates the original data, then print the program. Nothing beats a paper trail, and there's no need to worry about the ink fading, either.

    --
    Please correct me if I got my facts wrong.
  69. Digital "time capsule" by aktzin · · Score: 1

    I read somewhere that IBM and some other companies were working on a project to keep digital data accessible in the future. A quick Google search didn't bring it up and since I'm still at work I hope one of you kind readers has more details.

    The idea was to create a virtual machine that runs in as many platforms as possible, and within it has viewers or players for all kinds of documents (text, graphics, sound, video, etc.). Development and maintenance of this VM is supposed to continue as time passes, porting it to new OS versions or hardware platforms as they appear, while essentially keeping the internals of the VM the same.

    The catch is the need for maintenance. If it lapses and technology advances too far, the "link" breaks and we're back where we started. Should we start carving bits on stone tablets?

    --
    Quantum mechanics: the dreams that stuff is made of.
  70. Solution in action Down Under by The+Source+(from+the · · Score: 1

    A solution to this problem has been implemented by National Archives in Australia. National Archives is responsible for the care of valuable Commonwealth government records and make them available for present and future generations to use. This organisation has a Digital Preservation area which stores data (extracted from various formats) in XML format.

    This particular project will be presented on at linux.conf.au 2007 by Michael Carden and is relevantly titled, "Digital Preservation - The National Archives of Australia, Open Standards and Open Source". Yet another reason to be seen in Sydney for linux.conf.au this coming January!

  71. Memories of Now by McLuhanesque · · Score: 1
    Vodafone Receiver had a philosophical and cultural take on this very topic a few months back. From the article:
    When a hard drive fails, or the memory card is erased or misplaced, or when a future computer can no longer read today's media, our culture becomes a little more forgetful, and a little more forgotten. The consequence of our technological advancement is that, centuries from now, historians may well look back on our time as a type of dark age. Compared to earlier generations, very little of our cultural history is being recorded so that it will actually exist into the future. We will literally be a forgotten culture, because those who will come after us will have technologically "forgotten" how to read, or even locate, our ephemeral artefacts. If we want to be remembered at all, we must additionally create artefacts that will travel through time, as the writings, art, and photographs of our forebears have come to us.
  72. archiving question by nanodec · · Score: 1

    well one question I will throw out to the slashdot community is people's thoughts on archiving data. Considering how cheap drivespace is now becoming, what is your thoughts on just building a machine which the OS and all files you want to store are placed. Then you just simply keep the machine shut down and unplugged. Basically people are talking about 200-300 (estimated) years with CD-R and DVD formats.. has anyone any insight on just using harddrives that are powered up long enough to store the data on and then powered down? Will the data stored on the platters begin to deteriorate? I'm curious myself about this as I am going to start archiving some documents and mostly treasured family photos and I'm wondering which way would be the best and safest way to store them.. any and all comments are appreciated! Craig

    1. Re:archiving question by TropicalCoder · · Score: 1

      has anyone any insight on just using harddrives that are powered up long enough to store the data on and then powered down? Will the data stored on the platters begin to deteriorate?

      Of course I have pondered this question of how to maintain data, as I am sure most have. I wouldn't count on a hard drive like that. Even if you only fire it up once in a while - I don't think that makes any difference in the end. I don't think it is as much a question about how long the disk can maintain the data as it is a question of the intricate electronic components on the drive. In fact, electronic components seem less reliable that way than if you keep them running ever day. For example - many people have had the experience over the last couple of decades of putting a perfectly good television in storage for a number of months, only to discover that it no longer works later. Maybe that's not such a good example compared to a computer hard drive, but I tried to find something that many people may have actually experieneced or heard of. As well, I believe many people never shut down their computers because they seem more reliable if you keep them running 24/7. It's the in rush of current that will finally kick over a weak electronic component rather than a steady on state. So in answer to what you should use - seems to me DVDs is the way to go.

    2. Re:archiving question by YrWrstNtmr · · Score: 1

      Considering how cheap drivespace is now becoming, what is your thoughts on just building a machine which the OS and all files you want to store are placed. Then you just simply keep the machine shut down and unplugged.

      Consider if someone had done that with his Commodore64+harddrive. Yes, the data is (probably still) there. But what can you do with it? How would you get it off the 64 into a current machine into some viewable form?

      Will a current Pentium machine be able to plug into my kids house network in 2050 and upload the files? Will the OS of 2050 be able to load and run the viewing application? Will there be any way to read that particular digital format?
      I've got about 40GB of absolutely irreplaceable data, mostly family photos, backed up every coupla months on a couple of hard drives.
      Will there be any way to read those drives in the near future?

    3. Re:archiving question by debest · · Score: 1
      I've got about 40GB of absolutely irreplaceable data, mostly family photos, backed up every coupla months on a couple of hard drives.
      Will there be any way to read those drives in the near future?

      Yes. Every time a new method of storing infomation comes along, there is always a transition period where computers have both methods of reading and writing the media supported.


      If some cool optical storage medium comes along that makes magnetic hard drives obsolete, you can bet that there will be at least three years where every new piece of hardware supports both standards. Plus, there will be ways of adding the new technology to old computers, and ways of adding the old technology to new computers for a long time after that. You will have plenty of time to copy your information from one medium to the next.


      It's the same with file formats. You will not be left instantaneously unable to transfer your data (particularly if they are openly documented, like JPEG or MP3) from one format to another.


      The only thing that will prevent you from being able to read your data in the future will be your own neglect.

      --
      Look at the tomato! Isn't it sad? He can't dance! Poor tomato!
  73. As long as the United States remains a republic by Timmy+D+Programmer · · Score: 0

    And why can't we keep the archives after we aren't a republic anymore? Once I'm emperor I'll Be OK with it. So long as you all wear your underclothes on the outside, with the day of the week written on them!.

    --


    (If at first you don't succeed, do it different next time!)
  74. Not a Word? by soloport · · Score: 1

    Not a word? Sure it is, amigo. It's like the word "infamous". It means the same as "in" and "famous"; In-n-n famous. See? So, "irregardless" simply means the same as "ir" and... never mind...

  75. Old stuff. by bgii_2000 · · Score: 1

    I'm generally a lazy person. This fact has always made it difficult for me to keep good back backups. I recently purchased a giant external drive that syncs with my internal one nightly. Up until this point, I had everything stored on CDs (and more recently, DVDs.) A week ago, I began a journey through the stack that was sitting on my desk. It had grown to be over a foot tall, and had begun falling over periodically.

    As I perused the contents of said stack of discs, I found that almost 90% of them were redundant or out of date copies of files I had completely forgotten about. I would estimate that I recovered only about 500MB of files that I had no other usable copy of.

    1. Re:Old stuff. by TropicalCoder · · Score: 2, Interesting

      As I perused the contents of said stack of discs, I found that almost 90% of them were redundant or out of date copies of files I had completely forgotten about.

      Well then I have question that I would like to throw out to Slashdot readers. Like the person who wrote the parent, I have tons of old files on my hard drives. I always run at least two hard drives, using one for backups. Then when I upgrade computers, I bring over one of the old hard drives to the new computer, copy it to the new drive, then continue to use it to backup new material. By now I have files duplicated and triplicated all over the place. After almost a decade of this, I have many gbs of files which would probably condense down to a fraction if all the duplications were eliminated. What kind of software do I need that will analyze all my files and automatically find and remove duplicates? - or do I need to develop such software for myself? ...and if I do, then is there niche for commercialization of such software?

    2. Re:Old stuff. by bgii_2000 · · Score: 1

      Heck, man. It just takes time. Good old fashioned time.

  76. Who cares? by EmbeddedJanitor · · Score: 1
    It is pretty arrogant to assume that our transient information is so important. SOme huge % of email is spam, some huge % of the web is just crap. Surely keeping all this stuff is a complete waste of time?

    --
    Engineering is the art of compromise.
    1. Re:Who cares? by Anonymous Coward · · Score: 2, Insightful

      Doubtless the Anglosaxons felt the same way about their rubbish... and yet archaeologists get orgasmic over the everyday bits and pieces that tell them so much about how those normal people led their lifes.

    2. Re:Who cares? by ericlondaits · · Score: 2, Funny

      Archeologist from the 23rd century going through or email archives: "Wow! These guys must have had humongous penises with all the enlarging going on!"

      --
      As a Slashdot discussion grows longer, the probability of an analogy involving cars approaches one.
    3. Re:Who cares? by c6gunner · · Score: 1

      More likely it'd be something along the lines of:

      "These penis enlargements are so primitive! They just don't compare to my nano-pump(tm)!"

    4. Re:Who cares? by focitrixilous+P · · Score: 4, Funny

      I doubt you'd sell many Nano-Pump (tm) enlargement kits. It's all in the name, even in the future.

      --
      SAILING MISHAP
    5. Re:Who cares? by Filmcell-Keyrings · · Score: 1

      They said that about Microsoft

      --
      Never rub another man's rhubarb
  77. open source software to use those open formats by Matt+Ownby · · Score: 1

    Having mulled this problem over myself, I'd say make sure that all file formats you use can at least be decoded by open source software (written in something very widespread like C). That way, if the platform becomes extinct, you can recompile the software and recover the files.

    So:

    Compression: ZIP (no closed-source stuff like RAR) -- even better, don't use compression at all
    Graphics: PNG (JPG for lossy)
    Video: HuffYUV (MPEG2 for lossy)
    Audio: WAV, FLAC (Ogg Vorbis or mp3 for lossy)
    Documents: plaintext ASCII if you can help it, otherwise I'd go with HTML or PDF.

    And top it all off with .PAR2 files to recover data degradation. Make new full backups at least 4 times a year.

    And do not underestimate the important of good open source emulators :)

  78. Funding by Detritus · · Score: 2, Interesting

    Don't forget funding. I've seen vast amounts of data disappear when nobody was willing to pay for its storage. This is common in large bureaucracies. You've spent years building and maintaining a library, and then it all ends up in a dumpster when the parent organization is eliminated.

    --
    Mea navis aericumbens anguillis abundat
  79. Relax... Google will take care of it... by Panaqqa · · Score: 3, Interesting

    Unless I miss my guess, Google will continue towards its stated objective of making all the world's information searchable and retrievable. Want something archived, Google will take care of it. And if Google fails, my suspicion is the entity that takes their place will take it on.

  80. Haven't we talked about this before? by haeger · · Score: 1
    Yup. Though so.

    .haeger

    --
    You are not entitled to your opinion. You are entitled to your informed opinion. -- Harlan Ellison
  81. The Importance of Historical Records by darrenadelaide · · Score: 2, Interesting

    Just because the difficulties in doing a job isn't easy, doesn't mean its not of importance.

    In the early 1960s a wise man spoke

    / quote

    We choose to go to the moon. We choose to go to the moon in this decade and do the other things, not because they are easy, but because they are hard, because that goal will serve to organize and measure the best of our energies and skills, because that challenge is one that we are willing to accept, one we are unwilling to postpone, and one which we intend to win, and the others, too.

    / unquote

    We Went to the Moon, and all the signals received including a high definition picture quality version (by the technology of the time) was recorded at Nasa (and also I believe at the receiving station of Parkes receiving station in Australia where the signals were received through their deep space network radio telescope), these most important "documents" of our time have been lost, lost and never able to be recovered leaving us purely with the broadcast version which was at a much lower quality standard (eg a poor quality photocopy).

    Its important for the nature of our history and our essence of our technology and who we are as a people to preserve these important events for our future generations.

    When you look at this Planet, we regularly goes on a rampage where the technology is lost and we are thrown back hundreds of years, Take Ancient Egypt, The Technology of the first milenium, The great library of Alexandria, (atlantis etc) so much of the past for which we have lost and are poorer for as a result.

    Cant we get it right this time as we face our possible next destructive surge, whether it be by climate, economic, famine, nuclear war, microbiological warfare / disease (whether natural or manmade), chemical accident causing a chain reaction etc..., so many risks, lets do this before its too late, too late to be done and too late to be able to be done.

    Darren Stephens
    Adelaide, Australia

  82. translate to non-digital by chris_7d0h · · Score: 1
    how to store digital files so future generations can access them, from engineering plans to family photos
    Well, why not transcode it onto a more sustainable format? Just create a device capable of understanding common printing formats (e.g. postscript) and let it store the data onto more proven media, such as, I dunno, stone or some form of metal...

    We have printers for printing rendered data (text/images) onto papers. Why not have a stone or metal printer which uses a lazer with a bot more watts to etch the information into these durable materials? I concede though that searching these etched medias would be more of a problem, but there has been no system that I know of to this date which were capable of accessing vast amounts of information, packed into a small space other than digital/magnetic/photonic storage (comparatively transient in nature).

    After the nuclear winter, the next civilization will simply have to "scan" the recorded knowledge of 21th century from rune stones into computers again.. We did it so why wouldn't they..

    PS. I hear the moon is pretty rocky, so perhaps we can use it for something after all.
    --
    In a society that believes in nothing, fear becomes the only agenda ~ Bill Durodié
  83. Will an entire era of human history be lost? Yes. by Ralph+Spoilsport · · Score: 2, Insightful
    The average slashdot user, as a fan of digital technology, will weep and moan, accuse me of being Flamebait or a Troll, but the fact is, YES, it will disappear. Our digital technology is completely predicated on a vast and complex array of technologies - some super advanced like lasers, others more prosaic such as mining rare and precious metals as well as petroleum out of the ground.

    The question isn't IF it will disappear, the question is really WHEN and HOW. Printing to paper-based hardcopy helps for a few hundred years. It can be recopied from paper to paper easily - it's a very low context solution: ink on paper followed by ink on paper. So, important information about our society can be transferred across generations, even if the generations have no electricity at all. This is how we know Shakespeare, for instance.

    Many people say "Oh, but we'll have some NEW technology that will take care of it". This assumes that the resource base for a new technology will be as generous and dense as our present resource base provides. This is a VERY unwise presumption, as there is categorically no proof that such will be the case. In fact, there are a variety of intense warning signs that suggest quite the contrary.

    From the evidence I have found, and, oddly, I've studied this for a number of years now, I am fairly well convinced that industrial civilisation will simply erase itself from the human record as little more than a horribly polluted stain that destroyed itself through overpopulation and environmental stupidity. All the music you hear, all the shows you watch, all the films you cried at, it will all go away. Poof. This also means that self-absorbed hucksters like Madonna, Britney Spears, Michael Jackson, Tom Cruise, and their supporting technology of TV, Radio, DVD/CD, etc will also disappear - just the flotsam of "entertainment" culture.

    The long term future will be people chasing bison/cows across the prairie or living in small agrarian villages bound by localised population bursts and die-offs. But it will take several centuries to get their. In the meantime we've got our MTV and Orange Crush. The most important thing to remember is this: not getting to that Star Trek future IS NOT A BAD THING. We pissed away the globe's resources on our Xbox's, SUVs, jetset vacationlands, and all the other minutae and ephemera that makes a society "civilised" and provides "leisure activity". All societies have that, to varying degrees. We just had more of it, thanks to our insane and unrelenting exploitation of resources, petroleum, and electrical generation. But it will all go away, and THAT'S OK.

    We will disappear. We Are Atlantis.

    RS

    --
    Shoes for Industry. Shoes for the Dead.
  84. Long Time Gone by Doc+Ruby · · Score: 3, Insightful
    "
    The documents of our time are being recorded as bits and bytes with no guarantee of readability down the line. And as technologies change, we may find our files frozen in forgotten formats. Popular Mechanics asks: Will an entire era of human history be lost?"


    I ask: has this ever happened before?

    Not necessarily in electronic bits and bytes. Not the "Alexandria Library" that was mostly duplicated in other libraries or private collections. Maybe like the Inca quipu, mats of knotted strings that recorded all their empire's operational records, other than the ceremonial records in statues and murals. But some quipu survive, despite Spaniards destroying most of them in the mid-1500s. Enough that we can at least recognize that they did have records of lots of transactions.

    No, something more transient, as transient as our bits, read/written by something more transient than our metal/plastic/glass machines. Maybe songs or other performed stories, like tribal Australians. Maybe woven in more degradable material, like uncured plant matter. Maybe both, like the Pacific star navigation lore taught in temporary woven stics, but carried in the mind. Maybe patterns in some other loseable medium, like animal pelt patterns no longer readable now that the code has been lost, or interbred back into "blankness".

    If it can happen to us, it could have happened before. Our civilization rose from meager beginnings only about 12K years ago, after the last Ice Age that lasted about 12Ky. There was another one before that, with people accumulating knowledge between. And probably a half-dozen or so others since we became as genetically developed as we are today, between 7Mya and 200Kya. We don't even have many records from the first half of the last 12Ky. Could we be reinventing the wheel, literally, every 25 thousand years?
    --

    --
    make install -not war

    1. Re:Long Time Gone by Anonymous Coward · · Score: 0

      Only in a bad Twilight Zone script. Unless you are a believer in Mu/Lemuria or Atlantas, there is no physical evidence for any advanced civilization beyond the ones we alreay know. The pyramids were not built by space aliens or crystal power, but by human sweat. We can see evidence of early metal smelting in the northern ice sheet record. There are climate scientists who think that we would already be in an ice age if it were not for human greenhouse gas effects. Humans leave a mess. If there were advanced group befor us, we would have found their trash by now.

    2. Re:Long Time Gone by Doc+Ruby · · Score: 1

      OK, how about the Iraqi battery urn we only recently found? How about the

      I'm talking about the possibility raised by our own predicament that previous societies have lost records to even more transient media. Like the Inca quipu. How about the rest of the "Linear-B" corpus? Or the rest of the recordings people might have left before the previous ice age?

      How can you be so sure that "writing" of some kind wasn't used before, say, Sumer, when there are 50Ky old cave paintings?

      If all you've got to deny the possibility is some strawman about aliens building pyramids with crystals, when we're talking about how humans could have produced records which could have disappeared, as we now see our own possibly doing, and as much of even the surviving types in the past certainly have, then I'll just take it that the possibility is just as strong as before you popped up your Anonymous Coward response.

      --

      --
      make install -not war

    3. Re:Long Time Gone by mattr · · Score: 1

      Could have happened, but I think the evidence of nothing obvious on the Moon or long orbits probably means that previous generations did not advance as far as we have gotten this time around. Assuming the rise of artificial intelligence within 30-50 years and maybe self-sufficient robots easily within 100 years, I'd expect to see some radio signals from where trojan points or asteroids at least. Of course a physics experiment may go disastrously wrong before then, but I'm hoping we will get our knowledge and genes out of the gravity well before things get much more dangerous.

    4. Re:Long Time Gone by khallow · · Score: 1

      For what it's worth, human society currently has a lot of stuff far less transient than its information systems. Some of the garbage dumps and road systems will probably last millions if not billions of years (well, they'll be buried and preserved that long). The point here is that we haven't found evidence of significantly organized society before the advent of agriculture. If our civilization just ended, future scientists would be able to detect our presence for a long time to come. It's possible that some ancient civilization existed, but that it didn't have a big footprint. But there's no way we'd miss a global human civilization like ours. I honestly don't think such a civilization could have existed at any point in Earth's past.

    5. Re:Long Time Gone by Doc+Ruby · · Score: 1

      We're on one line of extended development, ignoring/neglecting other directions of development from each new development we produce. Even that pattern of development is just our style, not necessarily the way humans develop.

      So while we have reached the Moon, we are standing on the shoulders of giants who reached quite a bit more that we have not.

      For example, recently in Slashdot we read about lost nanotech steel techniques once used in making "Damascus" steel. We have retained the results of the tech revolutions in woven textiles, but how many specific techniques were lost that we reinvented once we industrialized the process, and now nanize the process? Consider the geometry not preserved by Arabs and a few others from the Classical Greek research.

      In the Western Hemisphere, the Inca and some other peoples developed most of the vegetables most familiar to us. Potatoes (in every variety), peanuts, tomatoes, "peppers", most corn... these products of Inca breeding were produced by a science we understand so little that at best we call it a "religion". How much science did they produce now lost? The Incas didn't take our path through the wheel and the arch, but covered their empire in much more extensive roads than did the Romans theirs, over mountains higher than the Alps. Instead they bred the llama and developed the quipu for trained runners.

      The notion of linear progress as a predetermined track along which we travel, progressively increasing our knowledge and sophistication, is a recent idea. It's basically the British teleology developed as a theory of history from the start of the Enlightenment, about 1750. Which is about when Britain gained control over so many previously superior civilizations in Africa, the Americas, Asia. They invented this narrow view of history largely to deny the power of the civilizations they had (mostly temporarily) conquered, to convince everyone (including the conquered) that the British were the natural "new" #1. We believe it, and we fail to see the old #1s who produced so much of what we use to be #1 now. And thereby fail to appreciate so much of what we lost by destroying the custodians of previous inventions and their way of inventing.

      Our space tech is the linear descendant of about 3-6000 years of space research and development, mostly by Mediterraneans and Mideasterners, lately by Euramericans and Russians. But we don't know about much of the space tech (observations/predictions, mostly) lost with the Mayans and other earlier Americans, or even the Egyptians closer to our roots. Or the apparently global civilization that built observatories/markers in Egypt, Stonehenge, Chichen Itza, Angkor Wat, marking the sky as it was about 13.5Kya. And that's to say nothing of the sciences on tracks not our direct ancestors, like the genetic engineering in breeding foods, work animals (including ecosystems) and textiles. Then there's the various psychologies we call "religions" in all the peoples of the world we've missed or destroyed.

      There was about 6000 years from the end of the Ice Age to the dawn of history, about the same period. During which the genetically same people could have had as much, or comparable development. If their investigations were into work that could spurt the way our science has, but at different periods, and with different setback patterns like the Roman collapse, the Aryan invasions of India, etc, they could have reached comparable sophistication in their pursuits as we have in ours, in the same duration with the same world and human genes.

      And they didn't seem to need to abandon the planet. They generally made the world more secure in which to live, until eventually facing our ancestors. We have a lot to learn from them.

      --

      --
      make install -not war

    6. Re:Long Time Gone by Doc+Ruby · · Score: 1

      How about those artifacts of a global civilization still standing in Egypt, Chichen Itza, Stonehenge, Angkor Wat, and certainly elsewhere? They all predate our historical beginnings of agriculture, by several thousand years at least. And they all mark the sky in the state it was in about 13.5Kya, about the end of the last ice age. But though we knew about the buildings, we couldn't read what they'd recorded, as a medium, until maybe a decade ago. There's probably more recorded in them. And probably other recordings from the same global culture we haven't yet recognized.

      And then there's other media. Like the Pacific "stick" charts I mentioned. There's little ancient artifacts remaining in those island cultures, as they're organized to perpetuate transient constructions from the local flora/fauna with which they've coevolved. If European disease had killed them all, in a few thousand years we probably would have lost even the few European records observing their woven records. Especially if we transferred all our own records to electromagnetic. Our successors might find some evidence that we had machines, but not evidence of the transient prior tech we'd recorded before it disappeared.

      There are footprints of plenty of ancient societies. Our own bias in considering whether they're "civilizations", with the persistent artifacts of the actual cities that define "civilization", shows our blind spots. We could very easily just not recognize some very foreign media, like the possiblities I just spouted off the top of my head. Or they could have been as interested in making degradable media as are we in making persistent media.

      A global civilization didn't have to be "like ours" to exist, even to be global - though that's not even the smaller scope of what I suggested. But it does have to be mostly like ours for us to recognize it. However, we don't have to recognize one for it to have existed. When we've got a broader view of how important artifacts can disappear, we'll have a better chance of finding their traces. Facing our own disappearing media will make us more sophisticated in that regard. And, in turn, help us learn from the disappeared media of the past.

      --

      --
      make install -not war

    7. Re:Long Time Gone by talonyx · · Score: 1

      No, because 25,000 years is not enough time for new deposits of untouched surface metals, oil, and everything else to form as it is now. Chances are if we fuck up this attempt at spreading ourselves out in the universe, there won't be enough unprocessed material to jumpstart another society.

    8. Re:Long Time Gone by Doc+Ruby · · Score: 1

      25Ky is enough time to move from ice age to stone age. Without surface metals, oil, and the rest of the materials our current society used to build our current civilization, we'd take another path. Like the paths taken by Pacific Islanders and inhabitants of the Kalahari, also lacking those materials. And others, or some new path.

      Spreading ourselves out in the universe is neither prerequisite not absolute measurement of our success as a species in creating societies. I'm all for it, but I know I'm a creature of this particular society.

      --

      --
      make install -not war

    9. Re:Long Time Gone by khallow · · Score: 1

      Any such "global civilization" is lacking in other forms of evidence in addition to the absence of large scale structures and durable artifacts. There's no indication of human genetic mixing on a global scale. It's odd to have a global human civilization without global gene flow. Anything in the last 20k years or so would probably be detectable. Second, I don't see evidence of common culture or language after that period. If there was a global culture, then where did it go?

    10. Re:Long Time Gone by Doc+Ruby · · Score: 1

      Well, those are good questions. They are questions that we have to answer in light of the existence of the artifacts we have. At the very least we have to accept that a global civilization can leave some artifacts, but not the ones we'd expect. And then start thinking about our own.

      --

      --
      make install -not war

    11. Re:Long Time Gone by khallow · · Score: 1

      I think the genetic data in particular should be considered. If everything except human beings themselves were erased, that is, all artifacts removed, knowledge and past forgotten, geological record "fixed", etc, you'd still be able to figure out 25k years from now that something was up just from the fact that people seperated by vast distances had very similar genetic makeup and histories. Frankly, I think any sort of global civilization would involve global gene flow. That is a fundamental aspect of human societies (at least those before modern times).

    12. Re:Long Time Gone by Doc+Ruby · · Score: 1

      Maybe. Or that society had xenophobia combined with global reach. Or there was a distinct population of "sea people", who either mingled in such small percentages around the world, with no central locus, that their genes can't be detected yet by our stats. The "Malagasy" people seem to have left their marks from North of Malaysia, across India, to Spain and Britain and Libya.

      The fact is that the artifacts almost certainly require a global civilization to explain their consistency. Their indication of very ancient conditions (the sky 13.5Kya) is even more mysterious. But there they are, despite our inability to explain them thoroughly. That shows that we're limited by our preconceptions in ways that the past human developments were not.

      --

      --
      make install -not war

    13. Re:Long Time Gone by khallow · · Score: 1

      Or that society had xenophobia combined with global reach.

      Or something biologically incompatible with humans. I allow this could happen.

      The "Malagasy" people seem to have left their marks from North of Malaysia, across India, to Spain and Britain and Libya.

      Or equally likely is that there were seperate groups on both coasts that adopted similar transportation styles.

      The fact is that the artifacts almost certainly require a global civilization to explain their consistency. Their indication of very ancient conditions (the sky 13.5Kya) is even more mysterious. But there they are, despite our inability to explain them thoroughly. That shows that we're limited by our preconceptions in ways that the past human developments were not.

      Sorry, but you haven't mentioned anything particularly consistent. And while I'm not familiar with the particular source you are using, I'm dubious that there's really any human construction or artifacts that reflect some unique feature of the sky 13.5k years ago. It's too easy to see patterns that aren't there. And there's an industry that specializes in doing just that.
    14. Re:Long Time Gone by Doc+Ruby · · Score: 1

      The Egyptian Giza Pyramids picture Orion, the Mayan Pyramids several constellations, the Uffington Horse Sagittarius and Taurus (and their link), Angkor Wat diagrams Draco.

      Not to mention Stonehenge, and certainly many others, either undiscovered, known only locally, or mistaken for something else, like the world's biggest pyramid, the Visoko Pyramid discovered only this year. Some might turn out to be illusions, once they get the study that mainstream science has mostly denied them so far. But there are enough of them, across vast distances, to signify an ancient global medium.

      I'm not going to anaylze their "meaning". But their global distribution and consistent demonstration indicate an ancient medium now practically lost to reading.

      --

      --
      make install -not war

    15. Re:Long Time Gone by khallow · · Score: 1

      These structures have little in common physically or chronologically and I find the connection to constellations either today or 13.5k years ago to be dubious as well. And that's after I eyeballed some of the evidence given in favor. And as I recall, the Visoko Pyramid hasn't been shown to be artificial. The disparity in ages of the sites is really the most critical problem here. It's one thing to propose there might have been a global civilization 13.5k years ago, and another to claim that buildings constructed less than 3000 years ago are part of that ancient civilization.

    16. Re:Long Time Gone by Doc+Ruby · · Score: 1

      Again, the fact is that those different structures were built in representation of different constellations. Constellations which are arbitrary groupings of stars available to everyone, but which are shared around the world. The different places they stand are unconnected by the official history, but are connected by the star culture that is basic to folklore. Their connections despite their separation in time and space is the evidence for a global culture. Their apparent common reference to a single ancient timeframe, including specific info about that timeframe (the state of the sky), indicates either a longlived culture eventually distributed to the structures' widespread sites, or somehow even a long perpetuation by that culture from that time. Or perhaps some other way to get the old info that is totally inexplicable by our anthropology or history.

      So you're rejecting them as "inconsistent" because they represent different constellations in different styles. But their basic "function" of representing constellations in a consistent "zodiac" is much more consistency than we'd expect from parallel development without cultural consistency among them.

      I'm now repeating myself several times. You've got the same evidence to look at that I do, and the same Web (and international flights). You can explore further for yourself, or drop it if you prefer.

      --

      --
      make install -not war

  85. Shellac Disks, of Course! by CohibaVancouver · · Score: 1
    ...after all, it's the sound of the future:

    http://www.npr.org/templates/story/story.php?story Id=1216161

  86. Re:How is this different - SOLVED by The+Second+Horseman · · Score: 1

    >That's the real challenge - devising a digital storage format in which presentation can be used to apply context to the data.

    I know! ASCII art!!!!

  87. Crazy talk: it don't mean a thing! by M0b1u5 · · Score: 1

    This is crazy talk!

    Look, ever since I began using computers, in 1979, the ability to store data has increased far faster than the data has accumulated - at least on a personal level. That trend has not stopped, and shows no sign of stopping, or even slowing down.

    Ipso Facto, the solution is simply to add more storage. That way you never archive anything. True, the problem then becomes how do you keep a track of all your stored data - and this is where the technical challenge lies I believe:

    "How do I find that bunch of photos I took in about 2009, in 2059?"

    Even if you know on what bunch-of-disks they are stored on, you may not be able to find them without considerable effort. After all, in 2059 I may have petabytes of data stored.

    I wonder if Google will be around to index it all, and keep a good track of it?

    I guess half the solution would just be to have an interface in my brain to allow me to get to it faster.

    --
    How many escape pods are there? "NONE,SIR!" You counted them? "TWICE, SIR!"
  88. Universal Virtual Computer (UVC) solution? by transporter_ii · · Score: 1

    There is a project to help solve this problem. It is called the Universal Virtual Computer (UVC), and it aims to offer: The UVC concept consists of the UVC itself, a logical data scheme with type description, the UVC program (format decoder) and the logical data viewer.

    http://www.kb.nl/hrd/dd/dd_onderzoek/uvc_voor_imag es-en.html

    With the UVC it is possible to read files without adapting them and without the original hardware or software. JPEG images can now be viewed independent of changes in technology. Afterwards, the method was extended for GIF images as well. The UVC project took place between September 2003 and April 2004.

    If you search google, there is a lot of information on this project...and it seems I heard about it on Slashdot.

    Transporter_ii
    --
    Doctors destroy health, lawyers destroy justice, universities destroy knowledge, religion destroys spirituality
  89. I think I'm doing my part. by Upaut · · Score: 1

    Thanks to Lulu books, and the innovation of acid-free paper, I am printing out copies of all of my failing out-of-print documents. From books of my childhood passed on to me from my father, to old documents detailing how to do everything I can think of (Such as purifying insulin from cattle. Should a sudden war destroy my supply I plan on being ready for it...), I will have a new and shiny copy to pass on to the next generation.

    Though it is hard to do on a students budget, as well as slow going... But I will manage. Right now I am formating all of my favorite Royal Society write-ups into a single volume. Good reading in that.

    --
    3 degrees of separation from Vladimir Putin
  90. Forgotten by Colin+Smith · · Score: 2, Insightful

    Look.

    In 100 years, you will be forgotten.
    In 1000 years, your country will be forgotten.
    In 10000 years, your civilisation will be forgotten.
    In 100000 years, your species will be forgotten.

    One thing you can absolutely count on is that you and everything you find familiar will be lost and forgotten. Nothing that you accomplish, no matter how famous, infamous or worthy will be remembered in 10,000 years.

    There is only one contribution you can make which will have any lasting effect at all, and I'll let you work out what that is for yourself.

    --
    Deleted
    1. Re:Forgotten by Fnkmaster · · Score: 1

      There is only one contribution you can make which will have any lasting effect at all, and I'll let you work out what that is for yourself.

      Must.... BREEEEEEEEEDD!!!

    2. Re:Forgotten by JesterXXV · · Score: 3, Funny
      There is only one contribution you can make which will have any lasting effect at all
      Create an article about myself on Wikipedia?
      --
      Yo mama so fake, she failed the Turing Test.
    3. Re:Forgotten by PurifyYourMind · · Score: 1

      Accept Jesus Christ as my Lord and personal savior? Yeah, I'll get right on that.

    4. Re:Forgotten by obtuse · · Score: 1

      1. Let it be forgotten. Read Borges short story Funes the Memorious to know why.

      2. There will be enough data in most formats for archaeological purposes, since archaeologists are interested largely in daily life, and decrypting the data is essentially a linguistic and cryptographic problem, both of which become tractable with enough data.

      3. Most of what needs preserving will be preserved one way or another.

      3. Time capusules are almost always pointless and self serving. "We who dedicate this building want you to have this symbol of our best hopes and intentions."

      What's not there? Real history. There are no extant copies of the city newspapers (Tulsa Tribune) that called for a lynching, and started the Tulsa race riots. All destroyed or thrown away.

      --
      Assembly is the reverse of disassembly.
  91. Not a problem by nodnarb1978 · · Score: 1

    This is not a problem.

    To wit: The NSA currently has a lot of data it has no real use for, and will have no more of a need for in 2100.

    Stop thinking problems, and start seeing solutions.

  92. If you really wanted to archive its basic by CrazyJim1 · · Score: 1

    You get 3 DVDs. You have a system that reads all 3 DVDs. When a flaw is found in 1 DVD, the DVD is destroyed and a new DVD replaced in which then gets data from one of the other 2 DVDs which should be identical. The system would be like a giant juke box, rotating in 3 dvds every so often to check that all the data matches.

  93. This is a good problem. by Gray · · Score: 1

    I agree. The amount of information has increased orders of magnitude but we're certainly not going to lose any *less* information then we did in the past.

    Even if something becomes entrapped in a dead format, if somebody wants it enough a way will be found. Humans are clever.

    The articles first big example pretty much makes my argument.

    The BBC decided to expand The Domesday Book. They stored the new contributions on laser disc. Laser disc died but, "(The multimedia version was ultimately salvaged.)"

    It has has been argued that the wikipedia is the end of archeology; worrying that we'll forget how to read is giving too much credit to Nud Ludd.

  94. A related problem: Digital artworks by robson · · Score: 2, Insightful

    As a game developer, it's profoundly disturbing how casually we treat games just a few years old. Hardware will continue to evolve and OSes will change; we really need a way to secure our ability to play old games.

    Console games are semi-okay because you can at least keep the (static) hardware around, but PC games are in bad shape. PCs evolve gradually, and it only takes one small OS or video driver change to render a game unplayable. Because games are a commercial medium, games simply aren't supported once it's no longer financially beneficial.

    As long as there are programmers out there willing to write emulators, I suppose we're okay... but it still makes me nervous.

  95. Clay tablets by ygslash · · Score: 1

    For physical media, the best solution I know of is clay tablets.

    After thousands of years, there are warehouses full of
    Babylonian cuneiform clay tablets that are still perfectly
    readable.

    1. Re:Clay tablets by jo42 · · Score: 1

      1) Chisel
      2) Hammer
      3) Sound of 1s and 0s being banged out
      4) Profit!!!

  96. That's easy... by FridayBob · · Score: 2, Funny

    ... just print all the ones and zeros out on paper, so that later on others
    can just read it all back in again with OCR! Oh, I know we could use
    punch cards instead, but we don't want our kids to laugh at us, do we?
    Besides, if we print the ones and zeros real small, we can achieve higher
    data densities.

    1. Re:That's easy... by Anonymous Coward · · Score: 0

      You kid, but here's an interesting twist on the idea of making a physical copy:
      dd $OPTS </dev/humanknowledge >/dev/DNA

      Then just make a sexually transmitted retrovirus that copies the set of collected human knowledge onto an extra chromosome or something and then modifies the fertilization process to select the newest available copy of the human knowledge database. Then release a new strand of the virus every flu season.

  97. The Library Machine... by TropicalCoder · · Score: 1

    I have followed the entire discussion, and now am prepared to add my two cents worth. In the first place, the amount of data is growing exponentially, at a rate such that we will never be able to keep up with it. For example, video games are now an important art form, taking a vast mind share away from movies. Every indication is that interactive forms of entertainment will continue to evolve and take over from former "lower-dimensional" forms of expression. Perhaps we are at the dawn of 3D television, if we are to believe recent articles. Certainly interactive television is already here, and may become dominant. Then how do you preserve these art forms? As technology progresses, it is certain that the number of dimensions will only continue to increase. Right now we have multi-channel audio, 3D images - and perhaps it won't be too long before we will have images with scent synthesis, touch, who knows?

    Much of the discussion so far has been about archiving documents, images, and sounds, but as time goes by, these mediums will present a very low-dimensional view of contemporary culture - only the tip of the iceberg so to speak.

    I believe the answer must lie in computers themselves as the storage medium. Perhaps there are ways of building computer chips that will last for centuries or millennium. What we need is a storage machine, that is self replicating and self repairing and capable of constantly updating itself. Then an archaeologist of the future, even a future where technology has take a huge backslide, will run across one of these machines in a cave somewhere, someday. It will sense the presence of the archaeologist and begin to project an image on the cave wall, with accompanying sound track, and tell the story of our history. It will also be capable of teaching others how to regain lost knowledge and technology.

  98. Creating an archive by JoeCommodore · · Score: 1

    I would think whatever the data source is it would have to be sealed from the elements at the very least 4 times, each container high endurance with a buffered feed to the next outer casing - inside each casing would be some sort of written guide (pictures, symbols, etc.) of what is inside (to let discoverers know it isn't treasure in the physical sense) and how to access the archive (Don't know about having displays, etc as those would be way too fragile and may be considered 'valuable' even if broken to future generations.

    A hopeful tong term scenario would be assuming at the discovery at leas one casings may have been broken or worn away with a couple still to go till you might contaminate the core.

    It should definately not be small or light, big and (enviromentally) impervious would probably be best.

    That's what I came up with on my walk home. :-)

    --
    "Enjoy what you're doing! If it becomes drudgery, you're doing it wrong!" - Jim Butterfield
  99. Data Persisting = Data Translation by ShadowC_ar · · Score: 1

    New file formats seem to be more human readable and therefore easier to produce over time. I recently wrote a little Clipper Summer 87' program to translate old .dbf files to .sql statements so they could be imported into modern databases. It is not difficult at all to do data translation in order to ensure data persistence, the real issue here are the media in which data is stored.

  100. 2 words by Tablizer · · Score: 1

    Punched Cards! So what if it takes 5 boxes to store a pic of Fluffy.

  101. Carve wikipedia in stone by Anonymous Coward · · Score: 0

    If the whole of wikipedia was carved onto a large stone plaque today...it would be bigger than 200 libraries of congress. [cite source]

  102. Solutions - EMC, par2, tape, optical media by Anonymous Coward · · Score: 1, Interesting

    Ok, this has been happening since the beginning of computers.

    I still have some punch cards that I need to convert. It doesn't matter that IMSL solved all the same problems better.

    I have some:
    - 5.25" floppies
    - RLL/MFM hard drives with data
    - parallel port QIC80 tapes (250MB ea)
    - 1/4" tapes (not so important)

    All of these need to be converted to useful media. Or, I figure they are probably corrupt by now.

    Hence, the addition of par2 http://www.par2.net/ which provides parity protection against partial media failures. Corruption can be handled by home users. For enterprise customers, EMC, Veritas and STK have handled this for years. For home users, the extra effort that par2 requires, http://en.wikipedia.org/wiki/QuickPar, may not be worth it. But if it is your wedding pictures, RAID on disk, off-site storage and optical media are **all** required to save your marriage. That 1GB of GMail space might be worth it?

    If it is your corporate date - go ahead, take your chances. How much can it possibly be worth if it is missing? YOUR JOB perhaps?
    The cost of time to find all the data and work interuption is nothing compared to gracefully handling a disk failure without a production impact. Where I work, we do backups **very** well and remote vaulting in alternate data centers.

  103. We can answer the question now. by ancient_kings · · Score: 1

    How hard is it to read disks and punch cards from the 1950's and 1960's? and that's just 30 years ago! I don't think those 10 1/2 DEC floppies is going to fit in anything, but, hey!, at least you can read old Mac, C64, Amiga, PC 3 1/2 and 5 1/4 inch disk (even protected) with Cat weasel! http://www.jschoenfeld.de/products/cwmk3_e.htm

  104. "Plays for Sure" vs Zune for Office? by msobkow · · Score: 3, Insightful

    Yep. Microsoft's commitment to their "Plays for Sure" campaign with the Zune really instills confidence in their backwards compatability.

    At least with OpenOffice I can legally archive the source code and install images needed to access the data for that period (say, every year or six months.) Sort of like dropping a copy of TrueCrypt on a DVD full of crypto archives.

    With the new DRM keys and license enforcement policies, I dread someday trying to resurrect an old image so I can access data archives, only to find it wants to register with a DRM verification service that no longer runs or is no longer compatible with a 4-5 year old install image.

    --
    I do not fail; I succeed at finding out what does not work.
  105. The flip-side by grahamsz · · Score: 1

    Right now i've got ~300GB of digital photography. That'll likely grow closer to 500 by the time i finish digitizing all my film.

    Right now it's kept on an external disk and i'm slowly uploading it to an online storage system.

    Within 10 years i'm sure i'll be able to keep the entire thing on one recordable optical disk. What is today a difficult to manage quantity of data will become easier in the future.

  106. Do you know what that means? by woolio · · Score: 1

    Could we be reinventing the wheel, literally, every 25 thousand years?

    Maybe... I think we can expect the USPTO to start extending the lifetime of patents a few millenia.

  107. ascii by Spazmania · · Score: 1

    ASCII text has been going strong for 40 years with no signs of becoming inaccessible in the foreseeable future. If you want your text to be retrievable 40 years hence, just make sure that one of the forms you save it in is ASCII.

    As to the rest, its worth noting that none of the major proprietary formats has become unrecoverable in 20 years. Even the old DOS Word Perfect files are still readable in modern office programs. Lots of minor formats have gone by the wayside, but that should have been obvious up front. It should be similarly obvious that the major compression formats (pkzip, gzip) will be around for the rest of your life while minor formats (zoo, arj) and newcomers (rar, bzip2) may or may not survive the next decade.

    What it boils down to is this: If you use common sense when choosing how to archive your documents, you should have every expectation of being able to retrieve your documents with the software then available for the rest of your life. If you succumb to the urge to use the latest greatest format then you'll probably lose the data.

    --
    Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
  108. The Waste Isolation Pilot Plant site marking by frdmfghtr · · Score: 3, Interesting

    This reminds me of the study done for the Waste Isolation Pilot Plant (http://downlode.org/Etext/wipp/#executivesummary) . The study looked at how to mark the site in such a way that the purpose of the site would be indicated for 10,000 years.

    While the WIPP site won't have the benefit of constant updating of the media (it's designed to be survive on its own for 10,000 years) it does address some of the same points; longevity of the media, a format that will be usable into the future, and ability of future civilizations to understand the message.

    Off-topic perhaps but an interesting read.

    --
    Government's idea of a balanced budget: take money from the right pocket to balance...oh who am I kidding?
  109. The stick in your own eye by Anonymous Coward · · Score: 0

    by UbuntuDupe (970646) * on Monday November 20, @04:56PM (#16921476)

    I can't wait to hear Microsoft's explanation why the project should use one of their proprietary formats.


    Yes, because Open Office has already solved the problem... right?

    BTW... I have an answer. It's called "Portable Document Format". It's not perfect, but it's there and, as long as you scan it as text, it's searchable.

  110. stupid problem by drDugan · · Score: 1

    information that gets used, gets saved.

    create systems on cots technology that are searchable and transferable to new media - and the parts people want to keep they will, either by paying for them or creating groups to preserve them, like museums

    we can not save all our information, and trying too is foolish. no one cares about all the people in ancient Egypt - just some of them to get a good idea what life was like. The same token, we won't care about every photo and every bit of information.

  111. Tier-1 storage by Anonymous Coward · · Score: 0

    If you use commercial-grade storage systems - you will be guaranteed data migration to new hardware and the continuation of your data.

  112. Ironically, just an hour ago... by A_Non_Moose · · Score: 0, Flamebait

    I had the displeasure of trying to pull up some mp3's I'd ripped a few years ago (2000/2001)
    that was likely done on a cendyne cd burner at a max of 4X on imation media.

    Put it in a DVD-/+RW DL and...and...zip, zilch, nada...unreadable.

    Oh, FSCK!

    Fired up son's computer with a (boo!) Sony dvd-rom...and, there it is (yea!).

    So, IMO/E 100+ years is nothing, how about 10?

    Hardware, software, os, format, media as variables to b0rken along the way?

    And to the game dev: Amen! fat32 (maybe ntfs) borked tomb raider and a few other games
    for a while, 2k was rough at first, Heretic2 has a 25% chance of working under XP and very
    few have figred out why (but works flawlessly under 2k).

    Upgrade-itis drives even the most stoic of IT lemmings over the cliff with data one step
    behind.

    Microsoft's OS is partly to blame, but Linux ain't so innocent, either (think breaking wordperfect 8, which I am still sad about).

    New, improved, faster and shiny and with blinkin' lights (ooooh)...but will it last?

    Crap shoot, russian roulette, IMO, and the current stand on virtulazation by MS says about how
    much of a flying fsck at a rolling doughnut they care.

    --
    Have you read the moderator guidelines? Well, have you, PUNK? (and I want a Karma: Gnarly option)
  113. All this is transitory by myowntrueself · · Score: 1

    There was once a king who asked of his wise men and advisors that they provide him with some object which, when he was happy would make him sad and when he was sad would make him happy.

    They researched and worked very hard and finally came up with a simple ring.

    Inscribed on the ring were the words: "This, too, shall pass".

    Then theres the wonderful story of the Opening of the Eye of Horus, as told by that great sage and fool, Aleister Crowley (pronounced to rhyme with 'holy' not 'foully').

    I shall paste the whole darn thing into this /. comment, for posterity thereby archiving it for future generations :) mod me off topic if you dare!

    1. This is the Book of the Opening of the Eye of Horus, of which the symbol in the profane world is the eye in the triangle, and of which the meaning is Illumination.

    2. Thou who readest this doth not read; thou who seeketh shall not attain; thou who understandeth doth not understand. For attainment and understanding cometh only when thou art not thou, yea, when thou art nothing.

    3. Once there was a monk, a disciple of that great Magus of our Order whom men name the Buddha which signifieth He Who Is Awake. For men asked the Lord Gotama, Are you a God? And he answered, No. And they asked again, Are you a saint? And he answered again, No. And they asked then, What are you? And he answered: I am awake. Thence is he known as the Buddha, the Awakened One.

    4. And the monk, in order to awaken himself, practised the Art of Meditation as taught by Buddha, which in its original form before being distorted by False Imaginings and Elaborations of Theologians, was but this: To look upon all incidents and events and Remember to Say Unto Thine Soul of each: This is transitory.

    5. And the monk looked upon all incidents and events, Reminding himself always: This is transitory.

    6. And the monk came close to Awakening, and therefore was he in great peril, for The Lord of the Abyss of Hallucinations, whom Buddhists call Mara, the Tempter, cometh quickly to one near Awakening, to hypnotize him again into the Sleep of Fools which is the ordinary consciousness of Men.

    7. And Mara did sorely afflict the monk with death of offspring, and insanity of loved ones, and eye-troubles, and slander, and malice, and the great curse of Law Suits, and diverse sufferings, but the monk thought only: This is transitory. And he was closer to Awakening.

    8. And Mara, the Lord of the Abyss of Hallucinations, then caused the monk to die and reincarnate as an almost Mindless creature, a Parrot, which flitted from tree to tree deep in the jungle; and Mara thought, Now he has no chance of Awakening.

    9. But a brother Monk of the Buddhist order came one day through the jungle, chanting the Teachings, and the Parrot heard, and repeated the one phrase over and over: This is transitory.

    10. And Mental Activity began in the Parrot, and the memories of his past life came to him, and the meaning of the teaching, This is transitory; and Mara cursed horribly in frustration, and caused him to die again and reincarnate as an Elephant, even deeper in the jungle and further from the languages of men.

    11. And many years passed, and there seemed no chance of Awakening for that soul; but the effects of good karma, like those of bad, continueth forever; and eventually Men came to the jungle, and took the Elephant captive, to sell him to a great Rajah.

    12. And the Elephant lived in the courtyard of the Rajah, and many years passed.

    13. And another monk of the Buddhist order came to the Rajah, and taught in the courtyard, and his teaching was: This is transitory. And memories awoke in the Elephant, and meaning was understood in the memories, and Awakening again came close.

    14. And Mara cursed wrathfully, and caused the Elephant to die; and this time Mara took good care that reincarnation would recur at the furthest possible remove from all chance of Awakening, for Mara caused that the monk be reb

    --
    In the free world the media isn't government run; the government is media run.
  114. physical data by itzdandy · · Score: 1

    data from the past is readable because of 3 or 4 things.

    first: the data is physical and that means lasting. head and sun and rain take a very long time to destroy words is stone

    second: oral history and evolution of languages allows us to extrapolate much of a language based on similar languages.

    third: context keys like a rosetta stone that allows us to put actions and words together a build our knowledge upon that.

    fourth: pure analysis of structure and reoccuring symbols. taking just the smallest parts of language and searching for them by how common they are and building sentance patterns from that data.

    as long as we provide some physical medium then the data can be recovered in time. our best bet would be to print our own rosetta stones on stainless steel or other medium that would last for ages.

    these keys could allow future generations to build workable mechanisms to read the data. images could be used to show direction and the corrisponding words to build vocabulary. each translation would allow the next level of data to be read. even the organization on the data mediums like tapes or dvds could be shown so the significan bits and bit counts would be known and the ansii character set would be known from previous keys.

    since most of this data would still be stored digitally, it wouldnt take very many keys to get your started.

    the only problem is that dvds and cds dont store data indefinitely as the dies degrade in time. tapes loose thier magnetic charge and the plastics decay.

    we need a more permanent medium. something more like laser disks where the data is burned physically to the disk and not just a die being altered. maybe have the disks also be steel and have them laser etched when technology can do it for a reasonable price(which is not far off) steel isnt going to degrade anytime soon and by the time it does people wont care what happened in politics as they would probably only be interested in the genomes present at that period in history and climate, solar flare activity etc etc.

  115. Use a virtual machine by Spacejock · · Score: 1

    Set up a virtual machine with the appropriate readers installed and configured. Then, save the VM with an installer & key for VMWare (Linux version, if you prefer.) In future, a Windows or Linux emulator will be able to install VMWare and then load & run the VM containing all the installed apps prepped and ready to go. Now, the problem is indexing and searching within VMs, but at least the data will be readable.

  116. ancient followers by the_mustang_man · · Score: 1

    To solve this problem, I forsee a chisel, a hammer, a stone wall, and about 200,000 slaves (Jewish or equivalent value). After they are done, they can start on my pyramid!

  117. Vintage porn by harmonica · · Score: 1

    I am not sure I would care to watch 1,000 year old porn, probably a little too tame.

    You obviously have no idea what was going on in good ol' Rome back in the day.

  118. Not as much unsolved as not completed... by dduck · · Score: 1
    There are several current projects that address all or part of this problem, e.g. Fedora (not the Red Hat distro...) and DSpace, as well as a myriad of ongoing projects at various state, national and federal archives and libraries world-wide, such as my workplace The Royal Library in Denmark.

    Somebody mentioned Microsoft. They actually participate in such a project, namely PLANETS, where they do in fact work towards open formats and preservation of our ability to access past, current and future file formats.

    It's exciting work. Relevant too ;)

  119. Long Now by Wellington+Grey · · Score: 1

    The long now foundation had an excellent presentation by Clay Shirky on just this topic some time ago. Well worth watching.

    -Grey

  120. Copyright is only part of the problem by tkiehne · · Score: 1

    Let's not forget DRM (i.e.: copyright enforced by code / encryption). As the content industries move towards greater control over digital media they effectively sidestep the need for stringent copyright. In this absolute control scenario, access to information becomes a matter of market forces, whereby only those works that have some market value and/or are dutifully maintained by their owners will survive.

    Here's a shameless plug for a paper I wrote on this very subject:
    http://thomas.kiehnefamily.us/technologies_of_acce ss_and_the_cultural_record

    --
    -- t_kiehne
    1. Re:Copyright is only part of the problem by kfg · · Score: 1

      Let's not forget DRM. . .

      I wish I could, but if there's a wasp in the room it's best to know where it is.

      KFG

  121. Beyond destruction by v4vijayakumar · · Score: 1

    Somewhere, I read about the event that happened in china. That article also stated that those books (thoughts) came out later in some other name/form. Some works are beyond destruction (say, E = mc^2). To avoid the loss, all information (data and knowledge) can be transformed to as simple as possible (wisdom).

  122. Copyright? by atomicstrawberry · · Score: 1

    Given the trends recently in copyright term extensions and law changes (read: tightening the thumbscrews) it won't really matter, because those Archaeologists a century or more from now will probably be hunted down and shot for even thinking about trying to access digital media without the copyright holder's permission.

    Even assuming that the future is not some 1984-esque dystopia, there are a number of large, annoying and extremely rich and vocal organisations which would be very much against the concept of storing a lot of our data for extended periods in an accessible, searchable and unencrypted format. Google already ran into this problem with their plan to archive books. The only data that could be archived like this at the moment without someone complaining about copyright violation is pure facts and figures and anything that's been put in the public domain. And unfortunately, a lot of this sort of information is not really what Archaeologists look for. The tangible aspects of our society's culture are things like art, writing and music, all of which are coincidentally copyright protected (ostensibly to encourage the development of those cultural aspects by allowing those who expend effort to create them to be able to be compensated for their work).

    So in short, either the publishers of art, music, movies and books need to all band together behind this initiative or copyright law in itself needs to change. And since neither of those look likely to happen, I think this whole archival concept can be written off as unfeasible. After all, why allow a hypothetical future generation to have insight into our culture if you can make piles of money instead?

  123. Not with today's copyright law by jesterzog · · Score: 1

    Good stuff gets reproduced, reviewed, studied, dissected, etc. and survives.

    It doesn't, always, if it's copyrighted, and in today's world all works are copyrighted automatically for more than a hundred years unless the holder explicitly waives that right. Otherwise it'd be easy for me to get hold of a lot old (but very important) textbooks in fields where it's not economic to republish, but which only come on the market second-hand when someone dies.

    Copyright law significantly blocks the distribution of much information, to the point that the remaining stores of it are often lost or destroyed by the time it's legal to reproduce and distribute them to the populace.

    Unfortunately today, this problem has become as much of a legal one as a techinical problem.

  124. stupid question by Eivind · · Score: 1
    This question is ass-backwards. This ain't a new problem, on the contrary, this is a smaller problem now than it ever was in the past. True an individual book is more durable, and more future-proof than a DVD or a hard-disc. But that ignores the one GIANT advantage that digital media have: the ease with which they can be copied.

    My fathers wedding-pictures are film-negatives, stored in a secure-against-fire safe in his apartment. He just has to hope that no fire burns too long or too hot (the safe has limits) that noone breaks in and breaks open or steals the entire safe, that the area is never flooded, etc. They are pretty secure, but there's only one of them and it's terribly expensive to secure against everything, in the end it's just a risk he has to accept.

    My wedding-pictures are digital. I don't have a safe. But the pictures exist on around a dozen different physical hard-discs and around a dozen different individual DVDs. 3 of these hard-discs are in professionally run and daily backuped raids, standing in environmentally controlled mountain-halls. There's copies in 4 continents, and in probably 20 different buildings.

    It'd literally take a collapse of civilization, global thermonuclear war or similar to even have a chance of wiping all these. Each individual copy ain't much secured (if at all), but the added security that comes automatically with multiple geographically dispersed copies means my pictures are a *LOT* more secure than my fathers pictures.

    Lots of information that nobody cares about will be lost. Some of that information will later turn out to have been important, and we'll curse ourselves for not having saved it. But lots of information that peope *do* care about is saved, and will be saved. Since data-storage grows exponentially, the cost of storing old data falls exponentially with time, so there's basically no reason to ever stop saving something once you've saved it in the first place.

    If you've taken the time and expense of saving a set of data from 1980 to 2000, you migth aswell save it forever, if it was savable for reasonable cost in 1980, it's savable for trivial cost in 2000.

    Yes, saving digital works requires active maintenance. Multiple copies, regular moving to newer storage-media. Documentation of file-formats. (or conversion to file-formats that are well-documented) But the cost of this is more than offset by the gargantuan capacity and the dirt-cheap copying.

  125. Archival medium for more than 10K years by take5 · · Score: 2, Interesting
    If you are serious about archiving, print your stuff on thin (0.3-0.5 mm) high grade ceramic plates the size of A4 paper, using a laser to remove ceramic material in order to form letters. Then put the plates in large pyramids, with several copies in various parts of the world.

    Not every piece of digital info can be saved that way, or needs to be saved as others have pointed out. Current college textbooks, some history books, literature and music and an encyclopeadia will go a long way to create a useful memory of our times for the future.

    Some years ago, in California, they opened up an 100 year time capsule. I do not remember the suff that was in it, but it was mostly useless junk by our standards today. If we could send an e-mail back in time, we would ask them to include totally different things. It is easy to make the same mistake now as to content.

  126. Dead Media Project by megabulk3000 · · Score: 1

    This seems like a good place to mention a repository of media that didn't make it. Hundreds of promising (and not so promising) formats that are now unreadable.

  127. Another solution by master_p · · Score: 1

    Another solution is to abort using filesystems and start using databases at O/S level. Therefore future civilizations will be able to read the data because the type information will be in the database; and since there is much printed text about databases, the chances information will be unreadable in the future are minimal.

  128. Aleister Crowley by igb · · Score: 1
    Aleister Crowley (pronounced to rhyme with 'holy' not 'foully').
    Opinions vary. It certainly doesn't rhyme with foully, but there's a school of thought that rhymes it with truly, rather than holy. From odds and ends I've read I wouldn't say it was as clearcut as the Wikipedia entry makes out.

    ian

    1. Re:Aleister Crowley by myowntrueself · · Score: 1

      To be honest, I've never read the Wikipedia article on old uncle Al. Maybe I should. I've just studied him (and many other occultists) for about 30 years...

      He himself came out with "My enemies pronounce it 'Croulley' hoping to treat me foully, my friends pronounce it 'Crow-ley' to show that I am holy'".

      --
      In the free world the media isn't government run; the government is media run.
    2. Re:Aleister Crowley by igb · · Score: 1
      He himself came out with "My enemies pronounce it 'Croulley' hoping to treat me foully, my friends pronounce it 'Crow-ley' to show that I am holy'".
      Yeah, now you mention it I recall that quote, which sounds pretty convincing. But I'd be fascinated to hear a recording of Crowley's voice --- they exist, I believe. Leamington Spa, Malvern School, Cambridge at the turn of the century would be an interesting stew of influences.
    3. Re:Aleister Crowley by myowntrueself · · Score: 1

      But I'd be fascinated to hear a recording of Crowley's voice --- they exist, I believe.

      They certainly do; I've got some mp3s.

      They were recorded on wax cylinders. There is some poetry and some Enochian as well :) Its very interesting hearing the man himself intone "IAO"

      --
      In the free world the media isn't government run; the government is media run.
  129. HTML by Teancum · · Score: 1

    At least HTML v. 1.0 Other "enhancements" have really screwed things up and killed backward compatability.

    The point here is that HTML is an "open" and "widely used" data format, so it is widely used. Plain HTML (and now even some variants of some XML) are now being used by groups like the Gutenberg Project as reliable enough to be used for permanent archiving, where the HTML and XML offers superior text formatting information that is not preserved with plain ASCII text documents.

    And before you go off and say that HTML is ASCII, it is not. It is a mark-up language that does muck arround with the text and do things that sometimes make the text hard to read with just a plain ASCII text reader. Of course this is making a distinction between plain vanilla ASCII text files and something marked up like a web page.

    As for propritary formats... the "conversion" subroutines that reformat to more modern data format: they "mostly" work correctly. Sometimes there are bugs in the conversion process, especially (as was common with older Microsoft Office formats) when there were "undocumented" features in the data format due to the propritary nature of the software using it. In other words, the documentation was in the source code in the form of algorithms and nowhere else. I've seen that far too often among programmers who write these file formats for this and many other file types where only a single application is assumed to use the data generated for that format. While you might be able to recover the raw text of the document through this conversion process, often the formatting is shot to hell at best, or even unusable.

    If you avoided fancy formatting of the document and simply used "default settings" for most of your older documents, the conversion process is usually pretty good. It is only when somebody decided to get fancy and use some of the more obscure formatting styles that you get into some real problems. Unfortunately, those are often the most important documents that you want to access as well.

  130. Pre-IBM Compatible by StarWreck · · Score: 2, Insightful

    To most people, any of the files they used on computers before their first "IBM Compatible" is probably lost forever already. Think of how many files are "frozen" on 5.25" floppy disk for the Commodore 64 alone!

    That dosen't have to be the case though, you can retrieve files from disks of hundreds of different 80's era computers on a modern PC using a Catweasel card. http://www.vesalia.de/e_catweaselmk4.htm

    With the catweasel, a standard 5.25" PC floppy disk drive (hello, ebay), and a 3.5" PC floppy disk drive there's hardly a floppy disk you won't be able to retrieve your petrified files from.

    Finding a program that can do anything with those files is another subject entirely.

    --
    ... and in the DRM, bind them.
  131. Reality in the work-place ... Bosses are not .... by OldHawk777 · · Score: 1

    "When Thibodeau told the head of a government research lab about his mission, the man replied, 'Your problem is so big, it's probably stupid to try and solve it.'"

    The above quote, perfectly expressed the epitome of USA Leadership for the past 30 years in business, religion, and government. You already know the rest of the story "Stupid is as Stupid does (Vietnam, Iraq, IPR, DMCA, TIA, broken-education, healthcare nightmare, Enron, Global Crossings, E-voting Diebold, Opt-Out Privacy, Security_by_Obscurity ... far too many more)!"

    No problem (except stupidity and death) is too big to fix. It is multi-degreed, highly certified, and probably stupid Big-Chicken (White_Collar_Trash) people that can't try and won't solve problems. Government research labs (many government offices, businesses, religions) need leadership not a "PhDoctorate of Career Management" in do nothing wrong or right.

    Politicians, Corporatist, Televangelists ... are "White_Collar_Trash (WCT) Career Management" specialist bent on destroying the USA as a Great Nation, Culture, and People. WCT Career Management specialist will always be able to convince the semiliterate public "probable stupidity is smart" (also it is lethal, but why tell the pitiful semiliterate public).

    --
    Unaccountable leaders are masters, and unrepresented people are slaves. How do US and EU fare?
  132. Playing it safe by vikingpower · · Score: 1

    This is about safety, or "sureness" as French has it. The article, very much to the point, made quoted the accounts from the American Civil War that are still readable. Why are they ? Because they are in text format. Written by human hand, but there is no basic difference between a hand-written letter and, say, an ASCII or Unicode text file, as soon as you keep the ASCII table or the Unicode tables somewhere, for reference. Even "complicated" things like UML diagrams, and entire RDBMSes, can be saved or exported in text format. What we really need, given this, is spectacularly performing text compression and transmission protocols. Who feels up to the task ?

    --
    Religous speak to God. Insane are spoken to by God. When all shut up, one can finally hear Shostakovich in peace
  133. RFC 1 by Anonymous Coward · · Score: 0
  134. You're thinking too literally by csoto · · Score: 1

    The "seven wonders of the world" don't all exist any longer. But we know about them, because "popular" reference to them survives. The dinosaur is long gone, but it's impression in the mud lingers...

    --
    There exists no way of exchanging information without making judgments. --Bene Gesserit Axiom
  135. Fairly simple by Anonymous Coward · · Score: 0

    Build a massive structure, environment-proof as far as we can make it (down to earthquakes and flooding). Make it visible from orbit. Surround it, if necessary, with statues showing humans pondering objects or simply thinking deeply.

    Inside, have a straightforward progression of information in multiple written and pictorial languages, all leading to the same location, all at least tripled at other locations in the complex, and all showing/telling how to decipher a very simple graphical metalanguage. Instructions in this metalanguage should start at stage 2 in the complex, and get physically smaller and smaller, from large attention-attracting start points down to very small 'font' size which explains (and possibly also shows in pictures) how to construct a tool to view the sub-microscopic storage format which all the rest of the archive is stored in.

    Part of the data in the archive will be multiply redundant copies and translations of all the large-size instructions in the structure, plus cross-language dictionaries between as many of them as possible, plus easy-to-understand information and instructions on various redundancy and compression coding schemes, plus instructions on how to read ASCII (Or Unicode, or whatever other digital format the bulk of the data is stored in).

  136. Who cares? by Blackknight · · Score: 1

    We'll all be dead by then any way.

  137. Stop bragging. by jotaeleemeese · · Score: 1

    If the data was useful you would have it in a current format.

    --
    IANAL but write like a drunk one.