The Digital Dark Age
zygan wrote to mention a Fairfax Digital article about the possibility of a digital dark age, as a result of the increasingly short-term lifespan of digital storage. From the article: "It is 2045, he suggests, and his grandchildren are exploring the attic of his old house when they come across a CD-ROM and a letter, which explains that the disk contains a document that provides directions to obtaining the family fortune. The children are excited. 'But they've never seen a CD before - except in old movies - and, even if they found a suitable disk drive, how will they run the software necessary to interpret the information on the disk? How can they read my obsolete digital document?'"
Scary article. But probably too true.
In my opinion data archival screams to be handled in as simple an lowest-common-denominator a way as possible. For me, that means text for documents, and picture formats that would seem guaranteed to be around for a long time, if not forever. I'm guessing a good candidate for pictures would be something like jpg. I can't imagine jpg going away or ever being a non-decipherable picture format. Video might be a tougher nut to crack but I would guess some flavor of mpg.
Note that none of these flavors: text; jpg; nor mpg, include or imply any reliance on vendor proprietary formats (yes, I know there's a certain proprietary tinge to the picture and video forms, but they're pretty universal). So, storing and archiving for historical purposes rules out Microsoft and all of their formats. This would especially make sense considering there are already huge compatibility issues with Microsoft documents among their various versions of their products.
Also, for retrieval assurance it no longer makes sense to me to use "dead" or "inert" methods for storage, e.g., tapes, cds, dvds, etc. Instead, at least for my purposes I maintain multiple physical and current storage devices for all of my important data. This has been a recent (last three years) development for me when I started reading about early failures of the supposedly rugged storage.
So, that being the case that introduces (introduced) the need to devise a strategy for forward migration of all of may data so nothing got left behind. Fortunately, this has been mostly easy since right now the "active" storage du jour seems to be hard disk drives, and the capacity has grown sufficiently with each new generation of drives I have been able to simply roll my data forward onto the new drives with the new data with plenty of room to spare.
This shouldn't be an approach foreign to comapanies with reasonably competent data shops either. But maybe a philosophical change. All is not lost, and hopefully all will not be.
Just my $.02. ~
Nevermind the equipment needed to read it, what about the rights they'll need to read it?
... the Powerhouse Museum's Matthew Connell with an ancient clay tablet that will probably outlive the 1980s tape in his right hand.".. Probably? Definitely more like!)
And even that's ignoring the fact the CD will long since have self destructed, decaying away..
(From TFA: "Dark age
I am a viral sig. Please copy me and help me spread. Thank you.
Maybe they can buy all the necessary components on ebay!
Seriously archeologist have decoded all sorts of dead languages, decoding digital (assuming you can still pick out the bits) would be easier.
In the second box is a player, if the fellow had any real fortune.
Besides, I'd have drawn the map on parchment, and tied it up with a string.
Arrr! Ye Mateys...
You can't talk about Wikipedia's flaws on Wikipedia
They'll take it to that crazy old guy in the corner house with uncut grass in his lawn, for he was once a great programming guru and has a ton of still functioning archaic equipment that requires insanely large amounts of power.
Linux is to the internet as Duct Tape is to the Universe.
perhaps the same way I would read a wax cylinder today
i nder.html
visit a specialist
a good place to start would be here :
http://www.bl.uk/collections/sound-archive/wtmcyl
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
Whatever is worthwhile to keep will be migrated to new media. Even if 90% of it is lost odds are 10 times more information will be preserved from this decade than the last. Digital media is cheaper to own and operate.
How can they read my obsolete digital document?
The same way we do it today: emulators. Of course, your cdrom is not going to survive that long, so there's no need to worry about that. Have you considered leaving your legacy carved into stone tablets?
Hopefully someone isn't stupid enough to store their will on a CD rom...would you?
To each his own.
Really, who knows what the future holds? And who says we won't be able to trace history back to these days and even further? And just because we don't use a media anymore means it is forgotten and no one will ever be able to read the media again. I mean, if one did some digging, I bet he/she would find information to be able to read punch-cards even. Just my 2 cents.
Sometimes I comment just to hear myself typing.
\/\/H47'$ 4 L3773r?
Each moment arises out of the moment before - call it 'dependent arising'. No object exists in perpetuity - even black holes evaporate over long time spans.
This being said, our digital storage systems, in a collective sense, are becoming more like a brain and less like an archive. 'Memories' of some importance are in multiple locations and accessible via different search methods. They're also being changed, just as memories of our pasts acquire a patina as we age. Someone took something I wrote in the early 90s on Usenet and added it to their humor site. My flickr content is spreading if the hits are any indication, as are my contributions to YouTube.
Public records are an important thing, but understand the other, positive things that are happening in the background as the the internet acts less like a database and more like a neural net with each passing day.
I am very easy to get along with, but I don't have time to waste being nice to people who are being stupid. -Theo
I have a bunch of old DSDD 40-track hard-sector TRS-80 5.25" floppy disks (NEWDOS/80v2 format) that I'd love be able to read.
... uh ... well, OK, twenty to twenty-five years old.
Unless I want to build custom hardware, I don't believe it can be done...
And those are only
Eloi, Eloi, lema sabachtani?
www.fogbound.net
Subject of a Cowboy Bebop episode. This is why I watch anime. They actually take some time to examine an idea like where to find a Betamax player 150 years from now. http://rfblues.aaanime.net/Sessions/session18.htm
Sorry about the writing. Robot fingers, you know? Cliff Steele in DOOM PATROL #23
Just give me the document. I'll print off a hard copy today, that new fangled paper technology looks promising (Assume acid free paper, no sunlight, etc, for you picky individuals). Just leave them a cd with my contact info. I will give them the directions to the family fortune, I promise. You can trust me, I'm a [insert political party of choice here].
as the CD probably couldn't be read regardless. CDs do not last forever. http://www.warehousephoto.com/How_Permanent_is_you r_CD-R.htm In fact many will be unreadable in as little as 2 years. If you want to archive, print it with good ink on acid free archival paper.
I read an article about 10 months ago about the "death of history" due to the electronic age.
In a nutshell, as we've moved to more digital forms of communication (phone and email), one of the primary methods historians use to piece together older eras is going extinct - the written correspondence from one person to the next.
It was an excellent article; my google-fu sucks apparently because I can't find hide nor hair of it. Curses. No +5 Informative for me.
You better watch out, there may be dogs about . .
..a more likey outcome is that patents and DRM will lead to a digital dark age.
Coder's Stone: The programming language quick ref for iPad
people migrate their data to new technologies all the time.
Think about it. A person gets a new computer with the latest technology, then they transfer their data to the new machine.A contant upgrade cyscly.
Same with lerge businesses, they may be using a tape library, but they upgrade there tapes regularly. And if some came out with a 1000 terabytes in a cubic inch of crystal storage device, they would also ahve a way to migrate there clients data. If they didn't they would have a hard time selling any.
The Kruger Dunning explains most post on
Don't forget about CD Rot. While you'd like to believe that if you put that Treasure Map on a CD so you can find the treasure years from now, chances are... your map will have disspeared on you.
This is why I still get my digital photos developed. Last thing I want is all my treasured memories to become suddenly un-readable someday.
0110100100100000011000010110110100100000011000100
Make your problem Google's problem: Mail yourself all your archive files to your Gmail account
Reminds me of a discussion I once got into about analog vs. digital storage. Some of the people on the analog side argued that the myth of digital media being everlasting is false -- which it is. Digital media, on their own, should be seen as temporary storage. The true virtue of digital media isn't even the media itself -- it's the content. Content is what can be copied over and over again with no degradation.
;P
Like oral traditions, the chain of copying needs to remain unbroken for any information to truly last forever, outliving "mere mortal" media. As long as P2P networks continue to exist, I can die happily knowing that the sum of mankind's knowledge will be floating around there somewhere... even if it is buried under millions of terabytes worth of lesbian porn.
Skype is too convoluted... Now I'm reverse-engineering the Kyoto Protocol.
So we're going to lose our information. Who cares? Proton decay will eventually destroy all of it. Sure, that's a long time in the future. You know how things go: it's 10^1032 years away today, but before you know it the kids have moved out and the end of the universe is right around the corner.
Just try and keep those bits in line.
--I'm so big, my sig has its own sig.
-- See?
Why do people keep saying CD's die in 2 year/5 years/x years? Has anyone actually had a CD die on them? I have CD's in front of me at this very moment that are over 10 years old and still work great (yes I did in fact test them). Is there some conspiracy by the blank CD manufacturers to make you think all your CD's are going to die so you need to keep transferring the contents from one disk to another forever?
All too often these are literally rotting away in storage, because the originals are decaying, and the movie companies are unwilling to invest money to rescue them, even though they would sue you for millions if you published these on your own.
"It is a greater offense to steal men's labor, than their clothes"
Media evolution and subsequent obsolescence is what keeps may photographers from adopting digital cameras. Slide film images, though not "forever," are certainly more enduring and readily adaptable via scanning to whatever digital storage medium is the current state of the art.
Ignorance is curable, stupid is forever.
... except for stuff that has copy protection on it...
why? because anything anybody wants to preserve they will either copy it over to newer larger space media or the archiologist will build the device to read the old media.
if there is any concern its with teh ability of the media to hold data... but we were all told how much better cds are to tape and floppy at holding information....
so its on the media industry to be sued when the truth is exposed....????
cd's are to last at least 100 year???
of course there is always writing it out and storing it in some cave at the dead sea site...
Heh, thats something I didn't understand in the scenario mentioned in the summary, why would someone create a paper explaining a document on a cd, but then not bother to print out the document itself? Seems a bit weird to be combining "formats" like that if you will. More than likely what would happen is that the grandchildren would find a spindle of cds that may contain old family photographs and throw them out not knowing what they contained(priceless family memories or they could just be leisure suit larry games)
Monstar L
paper. Use it. Maybe give a copy to the family lawyer who handles grandad's will and shit, also on paper. The stuff isn't going away any time soon. Like, ever.
In 2045 there will still be functional CD drives around, because they are a mass market item. There are still Edison wax cyclinder "record" players around too. Data loss due to format changes are the biggest problem where rare or custom machinery is used, such as at NASA.
There is a huge difference between losing some data and losing all, or nearly all, of it, as in a Dark Age. Yeah, losing the key to the family fortune is a bit tragic for the family, but it already happens all the time without any reference to digital storage, and nobody declares a "Dark Age" over it.
In any case, dark ages aren't even defined by the loss of data, per se. They are the defined by the loss of data because reading and writting itself is lost and/or denigrated. It was not so much the burning of the library at Alexandria that created a dark age, it was the lack of social importance placed on recovering and preserving what had been lost.
People ceased to backup.
At the time backing up was labor intensive and expensive. Now it is quick, easy and cheap. Even, comparitively, for obsolescent/obsolete data storage devices. If the family fortune is really anything substantial it will be recovered because the knowledge of how to recover is maintained and the CD itself still exists.
Nothing has actually been lost, it's just a cost/labor issue to recover.
KFG
The example's contrived. I don't like contrived examples unless they illustrate an important principle, which this one doesn't really do. Such data loss has already started happening even in my own life, but I don't think that's a bad thing. The fairly minimal effort required to keep data up-to-date is a natural impediment to a policy of keeping everything. Data which isn't worth a new hard drive and an rsync dies. Data which isn't worth the effort of importing and re-saving in a newer format dies. This isn't bad. It's not new either.
Data goes the way of the dodo not because of technological obstacles, but because of a decision made or not made to preserve it. We don't know how the great pyramids were built, the obelisks shaped and erected, etc. not because there was no way to preserve that information, but because it wasn't important enough to justify the effort. The same is true of 10-yr-old WP documents I made to bill people when I mowed lawns for spending money, or a million other things that get saved or trashed every day.
If you're serious about the problem, then it's not a technical hurdle. Data storage is cheap. Emulators are good. Batch document conversion is possible. The problem, if you're willing to call it that is that the benefit has to outweigh the cost. Lowering the cost of data preservation only increases the cost of data searching and real information retrieval. And very quickly it becomes a philosophical argument about the value of preserving irrelevant knowledge in a world that has moved on. Yet the argument is couched in terms of data storage and manipulation which is really the tiniest corner of the issue.
High-speed Road Trip (18.000KPH)
It's not something new. There are lots of projects around the world trying to solve it. As with all issues of preserving knowledge, you'll find that even the simplest things can turn to great pains. For instance - you have your great hole in the mountain for storage of nuclear waste. Now, you don't want people in the future walking down there. Assuming they dont speak your language, what do you do? Paint something, like pic 1: human outside, pic 2: human going in, pic 3: human dead. Sounds good eh? Try reading it backwards, as some cultures do. Human dead, goes inside, comes out alive? Not really the same thing, but it's an interesting problem. What will we leave behind? Not many things are written in rock any more.
All this makes me think of Stanislav Lem's "Memoires found in a bathtub", which is a grat book for those of you who haven't read it.
This is already happening with analog recordings. The Piano Paper that you put into a piano and the piano plays the music. The old drums that were originally used to record sound. Records (45s, 33s, 78s), 8 Track Tape, Reel-to-Reel, dictaphone, Cassettes (becoming this way).
Want picture/video? My father has some negatives that are 3 inches by 5 inches. Back before the days of 35mm film. Then there are those old home movies that predate VHS.
The only difference between that and digital is that digital is newer.
Fly me to the moon Let me sing among those stars Let me see what spring is like On jupiter and mars
Only if you expect to be in the situation of having no software to read JPG, and no specification. That's a slightly extreme scenario? Since your data has been, obviously, carried forward. You could always carry forward source code or specifications too, along with your JPG corpus. Or am I missing something?
you had me at #!
Idea #1
What about a semi-intelligent expert system daemon that, given two document formats, could figure out how to convert one to the other?
Consider this: I would like to archive a set of CAD documents, but they're in archaic format X. Modern CAD formats are A, B, and C. CAD programs typically have ancestors that can convert from past versions for migration purposes.
So consider an interlinked set of CAD converters:
#1 can convert formats F, G, H to formats D and E.
#2 can convert formats W, Y, X, and Z to formats I, J, K, L, and F.
#3 can convert formats D and E to formats A, B, and C.
Consider then a daemon that continuously monitors a filesystem looking for documents that aren't in a current format. It then fires up the converters and performs the conversion while archiving all past versions.
So in the example, the daemon fires up converters 2, then 1, and finally 3.
It could also cryptographically sign the files to provide a chain-of-custody.
It also maintains a set of applications and an emulator for different operating systems. When one needs to open an archaic dataset, one can either look at the converted files or call the daemon directly to seamlessly pass an emulated application session to the user if you want to look at it in the original form.
Idea #2
Documents could contain their own viewers. Yes, I know that's a bad idea making document objects executables, but hear me out. The document custodian daemon could also maintain a sandbox for document viewers to run in - it could even be a standardized virtual machine written in something like Java. This is getting a little out of my area of expertise, but I'll ask my girlfriend about it. It would get interesting after several levels of emulated virtual machines.
This year, hard drives became cheaper than tape for the first time in terms of $/GB. RAID with NFS should be way better than tape backup in terms of retention and nearline access, but I'm not really an IT guy.
I'm sure there's a business model in there somewhere.
'Be always mindful, even when ditch-digging.' --D. T. Suzuki
The way I see it is, in 2045, they'll have computers powerful enough to look at any binary data and accurately assess what the data represents, and how to extract any useful information from it.
Copy it to a new format. That is the real beauty of digital. Since it can be perfectly duplicated easily and quickly it's no problem to move it to a newer format. I have data on my drives now that was orignally on 5.25" floppy. It has just been recopied many times. Some of it has been converted to new formats, some of it is unmodified. Either way, it's still here despite being decades old.
I don't know where this silly idea comes from that somehow digital is really fragile and we'll just lose all of it later. Sure, we lose tons of it all the time, but it's worthless, by and large. The by product of the information age is that we produce so much of it, it is not only impossible to archive all of it, it's undesirable. To have more information than you could ever sift through would be almost as bad as having none at all.
Also what's the this stupid notion that we'll forget how to read things? That's like saying that we'll forget how to build sailing ships, now that we have motors. Of course that's not the case, the knowledge is preserved, in the case of sail boats, they are still made.
This is even more clear for computers since emulation is a major protect for many people. We have emulators for all kinds of old systems. Means if you find data for one of them, you just load up said emulator and it'll get at it.
Digital actually seems to be the ultimate prevention against a dark age. The ease of copying information and archiving it in multiple spots means that it's difficult for a single catastrophe to wipe out large amounts of data forever. There was a lot of work in teh past, for example the Mayan Codexes, that was destroyed and is totally unrecoverable. It was fragile precisely because it was hard to copy and thus there wasn't much of it around. Now, of the orignal hundreds of thousadns of Codexes, we have but 3.
I think it's just a bunch of alarmism.
I was thinking of how you could store data that would really stand up to the test of time. History provides us some examples: things cut into stone seem to do pretty well. Paper isn't bad, providing you store it well. Animal skins, not so good. Celluloid isn't either (evidenced by the old movies and cartoons that are degrading).
However glass is really good, and while it might not have the proven track record that stone tablets to, it can also support a much higher data density. For example, Ansel Adams original glass plate negatives are in some cases just as sharp as the day they were shot, and they should stay like that for the foreseeable future providing they're well taken care of. But even they are dependent on the chemicals used in processing -- whether the silver sticks to the glass over time, etc.
So here's what I was thinking: what if you used some sort of photographic process to physically etch a pattern of bits into glass: use a fairly strong acid and get the etching pretty deep, or maybe etch the bits at the bottom of phonograph-like grooves so that light surface touching wouldn't destroy them. If you could make something like this that could be read with a regular CD Rom, that would be even better.
I think some sort of process like this is used on metal (or is it actually glass?) to make the dies for stamping CDs. Basically I'm suggesting just make and retain the masters, but don't degrade them by stamping anything.
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
Periodically rolling over data from its existing format/media to the latest new fromat/media before the media or format life expires (whichever comes first) is the only way to perpetually save data. The problem is not in having a reliable method to preserve data. It is an unwillingness on the part of the guardian of the data to spend the resources (operational overhead) necessary to do it. The perceived value of historical data is often viewed by how often it is accessed as opposed to what future insights might be derived from it or what commercial value it might have in markets that do not yet exist. In other words we lose data because operational management doesn't see its contribution to the bottom line on the profit and loss statement. The reality is that these judgements evintually do have to be made because we just don't have enouth bandwidth to store all of the data that ever existed even if we wanted to. Until everyone recognizes that maintaining your data is no different than maintaining any other valuable asset such as highways and city buildings the situation wont change.
.......Just my 2 cents worth
So, if you want to have the map to the family fortune readable in 50 to 100 years, store it in the highest linear (uncompressed) resolution available to you and every time there is a paradigm shift in digital media or formats transfer the data again to the new format linearly at the same resolution is the original. If you compress the data and then transcode to a new compression format some data will be lost in each iteration and eventually the data will be unusable.
Check this link regarding the Electronic Records Archive http://www.archives.gov/era/index.html
And don't even think about it if this is in a post-technological or post-apocalyptic scenario. That's when you want hardcopy! Old-fashioned printouts and photographs... with all their attendant preservation headaches. That should be in the bunker too.
Writing software or even reverse engineering formats looks much easier by comparison.
you had me at #!
While file formats and media have presented a problem, there is plenty of evidence to suggest that digital information has an extended lifetime, and the most valuable information will be converted into newer formats as well as more simple and fundamental formats. Simple formats like ASCII text have handled the test of time. I'm more concerned by the potential lockdown of information through overzealous use of DRM technology backed by overbroad intellectual property laws. Just like the last dark age, the next one will be the result of people trying to control other people.
-- $G
For optical media, it's very easy... assuming the media actually survives, it's the same way this guy plays vynil LP's using a flatbed scanner:
7 769,00.html
http://wired-vig.wired.com/news/digiwood/0,1412,5
Obviously, in the future, ultra-high resolution optical input will put the current scanning/video technology to shame; they will just need to scan the thing in and run a program against the data to get the contents of the media back.
I work with a bunch of library science and archvist types who worry about this all the time.
It's such a pain taking care of books that are a few hundred years old. But they miss the point when it comes to digital.
For example, data I had on 5.25" floppies was moved to 3.5" floppies, then to a 20MB hd, then to a CD-ROM, then onto my current system.
If it's that important you transition it to new media.
I'm not sure about Wordstar, but I'm fairly certain with MS Works that if you open up that file in your favorite text editor that your data will be in there, and you should be able to just copy it out. Naturally you'll lose all the formatting, and probably have a lot of crappy unreadable characters introduced, but I'll bet you'll recover the bulk of it.
.txt files now. Anyway, I wanted to say that I can definitely feel your pain with a problem like that, but that there are solutions to at least one of the problems you described.
I had a similar issue once with a very nice (but very dead) word processing program that I used to use called WriteNow -- where the developer has stopped work and sales on the product but refuses to release it even in binary form to the public -- I had a bunch of disks of documents, but none of the original program disks and no computer with it installed anymore. I was pretty stuck until I just opened up the files one day and searched through them until I found the plain text. A few minutes of cleanup in BBEdit later, and I was ready to go.
Needless to say, they're all stored as
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
Dateline: February, 1890
New "photographs" may be useless for archival purposes
Scientiests say that the dyes used in so-called "cameras" may not provide the kind of lasting record that traditional stone tablets have provided. In fact, left in bright sunlight for 50 years or more, photographs may be completely unreadable by even the latest 1890 technology.
This will surely mean the demise of modern civilization, since future generations are very unlikely to care enough about the past to devote any energy at all into preservation and reclamation of older information. Anything that can't be read by 1900 is likely to be lost forever.
It's yet another sad commentary on the state of modern civilization, and one more reason why manufacturers of stone tablets and chisels shouldn't throw into the towel too soon.
Cheers
-b
If I wanted a sig I would have filled in that stupid box.
I'm sureI have read the same article several years ago,I cannot remember were, maybe on Scientific American or such. After a search on sciam.com I have found this dated January 1995, more than ten years ago. Are we reading the older news ever posted on slashdot?
this post contain no useful information, no need to mod it down
...back and find a computer to read it on. I mean that's what they'll do when they need an IBM 5100 isn't it? :)
This question is akin to somebody in 1900 asking what the world would be like in 2000 when the population kept growing and everybody had horses on the street - "think of all that manure accumulation - how will we walk without stepping in crap?"
The point is - the question is irrelevant. In 100 years, assuming the continued growth of storage mediums, the average personal user will have access to terrabytes, if not more, for personal use. I imagine that the most basic of ISPs (if such an entity continues to exist separately from other existing utilities) will provide users with gigabytes of personal space online to keep store/back-up their data. The only reason to put things on physical mediums will be for short-term backups.
I think a more pressing question is "will we be able to find the needle in the haystack?" Sure - Google does a decent job of indexing the internet now but even they are not 100%. Also the fact that while they may not be 'EVIL' today, it only takes 1 CEO change for them to become what most other companies are and then it's up to the next do-gooder to start an index from scratch. Then, assuming you can find stuff, you'll have to break the 200Mb encryption key. Luckily, the local Kinkos will have a quantum computer that you can use for $7.50/hour.
copy it from one media format to the next BEFORE the old one dissapears or keep your data on your hard drives and copy it to your new ones each time you upgrade.
Low capacity removable media like floppies and to some extent CDs is the enemy of data preservation because it makes the job of copying stuff to fresh media require far more human labour.
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register