Avoiding a Digital Dark Age
al0ha writes to recommend a worthwhile piece up at American Scientist on the problems of archiving and data preservation in an age where all data are stored digitally. "It seems unavoidable that most of the data in our future will be digital, so it behooves us to understand how to manage and preserve digital data so we can avoid what some have called the 'digital dark age.' This is the idea — or fear! — that if we cannot learn to explicitly save our digital data, we will lose that data and, with it, the record that future generations might use to remember and understand us. ... Unlike the many venerable institutions that have for centuries refined their techniques for preserving analog data on clay, stone, ceramic or paper, we have no corresponding reservoir of historical wisdom to teach us how to save our digital data. That does not mean there is nothing to learn from the past, only that we must work a little harder to find it."
Our landfills will provide all the info they need.
For justice, we must go to Don Corleone
How much of it is really worth saving? Except The Goatse image and a good RickRoll video I mean...
How many of you have digital files from 15 years ago that you can read today? 20 years? There was no DMCA back then, now just imagine the future....
DRM? No thanks, I'll just get it somewhere else...
to ensure this never happens. This is the same reason why DVD's and Bluerays will never work in 100 years time.
DRM will destroy any record of our current culture, but looking around at the abyss, I really have to say its for the better.
But I already feel bad for the eventual people that will spend far too much time trying to recover "scary movie part 15" or some other 'gem' from our time. But much like 'abandonware' and other areas of trying preserve machine code, lawyers will always race in to make sure all copies are lost forever.
Support things like SIMH while you still can!
The main way ancient writing reached us is because someone copied it. Lots of copies. Sometimes translated into another language and back, for example, a lot of Greek learning went into Arabic and came back out into Latin or Greek. With all the copy protection and encryption on our media today, can we ever copy the data and be able to decipher it again?
The bitter lessons of a veteran coder: http://bitterprogrammer.blogspot.com
Everybody print out all their emails!!!
Those that forget history are doomed to repeat it - but these days, it seems that there's more and more effort put into actively avoiding learning from history.
Or maybe I've just hit that age when The Kids ought to get off My Lawn.
About storing data will change. Historically, we've stored on paper, stone, or whatever could be inscribed. The 'backups' for data has been more about attempting to 'inscribe' media with the digital info.
Perhaps we're entering an era where we'll be trying to keep information 'live' perpetually, with the internet the first attempt at having an active library (though there are currently lots of cracks for information to be lost).
Many of the laws that overly stymie information flow (DMCA etc.), I think, are just a knee jerk reaction in the way printing presses were suppressed, and controlled until everyone realised the benefits of having them opened up.
Still, having the long term offline stores is no bad thing..
It is indeed a big problem. The problem was illustrated recently when Yahoo suddenly pulled the plug on Geocities, wiping out a vast cultural archive that went back to the early days of the internet, a lot of valuable information was lost as a result of that. Yahoo's blatant arrogance caused me to refuse to ever use any of their products again. Geocities was actually a fairly nice service, often people criticised it because of the ads, but how do you pay to continue to offer a free service. The loss of geocities was a perfect example of the need for a permenant store or online archive of information, personal websites and so on that can be maintained as a cultural legacy and informational resource.
Seriously, Slashdot.. until there's a revolutionary insight into this matter.. quick posting these stories ad nauseum.
For further commentary, see previous stories... here's one.. it's from september 2009 and -nothing has changed-.
http://ask.slashdot.org/story/09/09/29/1646251/Archiving-Digital-Artwork-For-Museum-Purchase
You'll have to go to .wav (not FLAC)--just straight bits. This does away with both copy-protection and compression.
http://archive.org/
They've already got a copy of your Geocities sites from the first Digital Dark Age.
IMHO we'll find that our problem is that we drown in a sea of useless information because we can't find the islands of relevance. Trying to archive everything will only lead to failing to archive anything. On the other hand I doubt that we'll lose much important information despite failing at organized preservation attempts, because important information is copied all the time, which is the only way for information to survive quickly changing technologies and file formats anyway.
In a more philosophical light, I think that forgetting is good for us. It frees us from the constraints of our past and makes way for new ideas. Archives are backwards-facing, but we all live in the future, all the time.
I think that many people are failing to appreciate the longevity of information preservation
that cloud computing (more specifically, redundant, geographically distributed network storage) can bring.
If we get the protocols right, and insist on open standards for data interchange, we can obtain
properties such as:
Data bundles that know how to move themselves to more recently commissioned, and/or more
reliable hosts.
Data bundles that know how to check in with copies of themselves, to make sure there are enough of
them alive, and that they are adequately geographically distributed, at every given moment.
If not, then more baby copies of the same data would be produced and stored elsewhere automatically.
There are other issues to longevity of course, like maintenance of software that understands different
versions of data etc. Not trivial but very doable.
How long an individual disk or SSD or stone tablet lasts is COMPLETELY IRRELEVANT to
the prospects for information longevity, given the network, and new levels of automated distribution
that will take place on it going forward.
Where are we going and why are we in a handbasket?
"Movable Type is a dead end. In the long run, the utility of all non-Free software approaches zero. All non-Free software is a dead end."
http://diveintomark.org/archives/2004/05/14/freedom-0
Far outlasts stone, and if you did it right I'll bet you could get nearly 1Mbit per card without running into the problems of Lace Cards
SJW: a person who perceives an injustice, and while correcting it, commits a greater injustice.
We will naturally make multiple copies of everything we consider important, continually transcribing important data onto the latest generation data storage media. (Consider what was the very first publication printed on Gutenberg's big invention.) Unfortunately, that's not necessarily what will be considered important many generations into the future.
I have every confidence that, far into the future, we will have or be able to develop the capability to read any media we preserve today. The problem then becomes how to determine what data we should should preserve now rather than how to preserve it. What do we know now that will be important and useful to someone 10^n years from today?
Ahh - My eye!
The doctor said I'm not supposed to get Slashdot in it!
Just put a massive data server in a spaceship and accelerate it near the speed of light. Data loss would be slowed enough that it would be negligible, and if we have to retrieve anything it should have a fast enough processor to respond to a request in a timely fashion and send off a pre-made copy of the needed data (as it may take too long to copy petabytes at near light speed).
This should work out perfectly- by the time we have the technology to do this, today's worthwhile material should finally be coming out of copyright.
My webcomic
especially that the main insight is that 99% of digital records are useless crap. Just like it won't matter if archaeologists never find 99.9999999% of our cities, when you've seen one Starbucks next to a McDonalds next to a Walmart, you've more than seen them all. The ditto mark will be the most used character recording our drivel... don't even get me started on our mostly devoid of talent "music" and "art" (is a frontal lobotomy prerequisite to being a rap star?)
The Domesday Book was commisioned in December 1085 by King William (aka William the Conqueror, who invaded ngland in 1066). The first draft was completed in August 1086 and contained records for 13,418 settlements in the English counties south of the rivers Ribble and Tees (the border with Scotland at the time). It is a detailed statement of lands held by he king and by his tenants and of the resources that went with those lands. It records which manors rightfully belonged to which estates, thus ending years of confusion resulting from the gradual and sometimes violent dispossession of the Anglo-Saxons by their Norman conquerors.
In 1986, at a cost of £2.5 million, the UK compiled the contents of the Domesday Book into electronic form that was stored on laserdiscs. The information stored on the laserdiscs, which is the equivalent of several sets of encyclopedias, is now unreadable because the equipment needed to read the discs is no longer available. Meanwhile the original book is still readable after more than 900 years.
In my own quest to preserve my digital photos, I've created multiple backups on hard disk including a remote backup which gets updated every few months. I use different disks created by different manufacturers and buy new disks every couple of years (but do not throw away old copies).
I've recently come across another aspect that isn't addressed by the article. Data that is in use in an online copy can be modified (including corrupted).There is no point in copying/propagating data if the data you are copying is damaged. Typically this has happened when I've tried DAM software like Lightroom which will modify the original file despite claiming to be non-destructive I have no proof that photos were re-encoded or quality was reduced but I do know original files were altered, and I want an original unaltered file preserved
Most people when they backup files do very little verification to ensure the files they are copying today are the same files that were created 5 or 10 years ago. They rely too much on backup software to do this for them, with no attention paid to what's happened to the data between copies. To keep this under control I've started putting checksums on all my photo files, which I check when I create a fresh copy.
Of course where my photos are captured in a proprietary format I copy to an open or at least well documented format (typically jpg, sometimes also tif). This is done as soon as I transfer the photos, which are not removed from the camera card until i have 2 additional copies. So I shouldn't have the same issues that the author had assuming jpg can still be read throughout my lifetime.
--
Sammy
These posts express my own personal views, not those of my employer
we are generating data far, far faster than we can save. We have for some time, and while trends for storage are catching up, we will always be able to generate more than we store, as a function of how computing and communications work.
So what to save? The Director of the NLM had a unique insight on this exact question: [paraphrasing] "What is used, is saved." Basically, its the utility of information, that information that people find useful and actually use is the best proxy for long term value. The good thing is that all people are motivated to store and maintain the data they find useful, or their constituents or customers desire. As long as people keep wanting data, it will be stored and available.
This is a very different situation to real-world archeology. In the digital, connected world we can access data today once it's publicly available, evaluate it and use it if we want. There is no dust that covers old data, it does not get buried...
Forget CDs, DVDs, magnetic media, etc. All data should be stored in solid state devices. Google knows.
Rebel Science News
The bigger danger is there is some major event like a plague and not enough people are left to maintain a technical society. Who would know how to make a microprocessor, or even refine gasoline? Tan leather? Grow crops? We could be back to the stone age in a single generation.
In group behavior: 'because they're evil/morons/sheep/crazy' is not 'insightful' it's 'oversimplified'
Archives.org isn't enough. The digital problem is more than just web sites. Every state records legal files with various methods, platforms and formats. Tiff's, JPEG's, WAV's, PDF's and more are the real heart of this issue along with web sites.
The Washington State Digital Archives (http://digitalarchives.wa.gov/) is already taking on this issue and has been for five years. Hopefully, more than just a few state will get serious about this issue.
We may have already lost files from 15 years ago, but that doesn't mean we have to loose files from this year.
(Full disclosure, I work for there.)
At how much Farenheit the digital records combust? Wont be so dark that ages, at least while the fire last.
A century later, we will still will find buried snapshots of wikipedia on devices like WikiReader.With paper books making copies is expensive,to one kind of device usually, and takes a lot of space. Digital records,in the other hand,could be put in a lot of ways, but what must be preserved is how to decode or interpret it (using open formats for it could help a bit there).
The Cranberry DiamonDisc is a 1000 year option already on the market. They aren't cheap but they should come down in price if they're able to get a enough costumers to bring the supply up.
This is what counts for science nowadays?
http://www.americanscientist.org/include/popup_fullImage.aspx?key=vo50G9YwnF6SwlOk2usL5R9EyqRLsNX+YiPzweX/0ZsH0IeSOOXIBip7qwN2/ZRY
Look carefully at the 'digital encoding' of the "simple tone" sine wave. ??? Really? What encoder is that?
cf. http://en.wikipedia.org/wiki/Fourier_Transform
Best way to avoid a dark age: prevent a fundamentalist cult from exercising its fantasies of domination / cultural supression.
Includes: Christianity, Islam, Scientific Materialism, among others (these happen to be the most successful to date).
This topic involves two vastly different things:
- File formats - easy - just make sure everything is stored in an open format, or something so ubiquitous its as good as an open format (odt, txt, jpg, pdf, csv, ogg, etc) and it will be readable forever.
- Physical media - this is the risk - most new machines these days can't read 3 1/2" floppies, let alone anything older, but so long as you migrate contents of your old physical media onto new media formats - AND you have multiple copies of important stuff - that shouldn't be a problem.
- Chuq
With all the pushing by law enforcement for permanent archiving of everybody's web use the problem will solve itself!
Rah! Rah! for terrosists - they hate our freedom but they have saved our culture from fading from history!
When information is power, privacy is freedom.
Virtual machines really eliminate a lot of those concerns. But what we really have to worry about is silent bit rot. I've found a few old files of mine that are corrupted. Not cool. ZFS and drobos... I don't really see a good end-user ready backup system that verifies data integrity.
Cwm, fjord-bank glyphs vext quiz
bad choice! jpeg is lossy format, information is deliberately dropped to make an approximate reproduction!
Many cameras only capture in a lossy format such as jpg. Even those that have RAW sometimes use lossy RAW. Losses only occur once per save. So to mitigate you don't modify files repeatedly. If there is a need to do this, go back to the original, save to TIFF and edit from there. So long as you have the original preserved you can always reapply any edits.
you're like the guy in the India Jones movie who drinks from a fancy chalice, has the flesh and guts dissolve and burn from his bones: "...he chose....poorly....."
I can't tell if you're trolling or just being melodramatic.
really, if you value your work do a little research, maybe standard such as "TIFF Revision 6.0 Final" or similar should be used, and perhaps with widely known and well documented lossless compression.
TIFFs are a poor choice unless multiple edits are going to be made. They slow down current hardware immensely and aren't widely supported. Try flipping through 12MP tiffs using Windows fax and picture viewer on XP.
I think you ought to have some idea what you're on about before you criticise so floridly.
These posts express my own personal views, not those of my employer
Anyone who remembers some of the old Sierra games knew how they occasionally had glitches or bugs that would cause the game to crash completely. Though Sierra would sometimes drag their asses about support, at one point or another the fans would patch the games themselves and publish the unofficial patch to the web. Quest For Glory IV was a perfect example of this; a fantastic game, but it had a few very memorable glitches that would cause the game to crash. The problem was that these fan patches were most often hosted on Geocities or Angelfire sites, most of which have since vanished from the world, leaving the games unplayable, unable to be completed, and unfixed hosts to whatever problems plagued them at the time of their publishing. Something that would have been a "popular" fix even just a few years ago is now irretrievable; forums point to dead links, users have long-since abandoned the posts, and the files themselves are nowhere to be found because of the death of the hosts. Obviously it's minor in the grand scheme of the history of the nation, but it *is* a good indicator of just how much can be culled within a very short period of time.
Gonzo, War on Terror, First Person Shooter...
It would indeed be a tragedy for civilization if such data were lost and the mind of the early 21st century American went unrecorded.
Why do "Al Qaeda" bulletins allegedly authored by Osama Bin Laden sound as if they were authored by Oliver North?
This is something that I've seriously taken a look into on the personal side of things. I look at all the digital data I've collected (and lost due to a drive failure, virus, corruption, disaster, ect.) over the years and it really makes your head go foggy. I only hit this realization putting together a wedding anniversary party for my parent's together in the last few months. My parents brought over bucket loads of photos and keepsakes that I have to rummage through for an overhead slideshow. On top of them being (thankfully) highly organized with their personal keepsakes, it far superseeds what I have for my own family. My wife and I went back and we literally have a 'digital divide' in the last decade for any tangible photos. Most of our memorable moments were done with a digital camera, which is great, but we have SO many dribblets of photos here and there on this burnt CD or that external storage device, ALL of which can get lost much easier, broken or misplaced FOREVER than a big ass, heavy rubbermade toat of pictures my parents have (negative included, I might add).
So I ask myself, what if my copy of my copy of my copy is corrupt? I'm screwed. What if I have something in an unsupported format that I can't find any support for? I'm screwed. What if I have a photo at 320x240 resolution and I want to make a 8x10 photo of it and put it on my wall? I'm screwed. We've successfully stove-piped ourselves for a high rate of non-reproduction of our valued items along with a staggering rate for failure on the mediums we've chosen for them.
I've come to the conclusion that tangible is becoming an obsolete word when it comes to anything I like anymore: music, movies, photography, books, news, conversations, ect. I don't think there is a way getting around it that I can see.
The future generations will just not care about us. Just stop thinking about this already.
I'm not worried. We are pretty soon going to have a bunch of people that are heartbroken about their data from 10 years ago being lost. The travel photos, the e-mailed love letters, the brilliant blog posts. And these people will create demand for longer-term storage and data collection techniques we don't have now. Why should it happen in the near future if it hasn't already? Because we first needed a generation of people that use computers and the internet as the primary way of expressing their life. Nobody was in that boat ten years ago. Now anybody reading this is. So consumer-grade "lifetime" storage options will enjoy a more prominent place on the market. And if you can get some old data to stick around for a half century or more, the value of it bumps up to "time capsule" status. Which means somebody might think to archive your mess of media around the time you die. Maybe some younger cousin of yours will take care of it. Heck, funeral parlors might offer data archival as a service 20 years from now.
If you want to preserve your data, backup your data yourself, and keep it on its own storage medium. There seems to be a growing impetus where "cloud computing" and "thin clients" are envisioned to replace traditional architectures where data is stored and decoded by the individual who owns/created it. I'd rather store my data myself than ask permission to access it through the equivalent of a 1980's green screen dumb terminal from some corporation who's interests run contrary to mine.
The main way ancient writing reached us is because someone copied it. Lots of copies. {...} With all the copy protection and encryption on our media today, can we ever copy the data and be able to decipher it again?
(And as another example of copies being important for preserving : Fritz Lang's Metropolis got recently another 30 minutes of its missing part recovered from a copy located in Argentina)
After a long enough time, virtually any DRM measure end-up being broken. What only matters is time, resources and some clever tricks (to avoid waiting until universe heat-death while bruteforcing a 4096bit key).
So DRM has only 2 direct effects :
- it annoys legitimate users everywhere with no practical reason.
- it forces the basement-dwelling teen with too much free time on their hand to wait until 2 weeks before official launch date, instead of 3 weeks before, because it took 1 week to the pirates to find a way to break the DRM.
This implies 2 results :
- That the 99.99% of pirate users, will never ever interact with the DRM nor be affected by it in any way.
- The important part : DRM protected piece of data will get copied, eventually and a lot. Lots of copies will exist and virtually 99.99% of these copies will be the "pirated" copies. Be it legal backup or unlicensed copies.
So in the end, the DRM-protected data will survive, only not the DRM version itself, but the DRM-free version as found on The Pirate Bay and similar. Case in point : Classics emulation.
Most of the companies which produced the game we played as children are now belly. Of the few remaining, few of them have kept the assets of their old production. Few of them are interested in doing anything with these old assets. The few who do, generally do modern re-imaging and re-interpretation, rather than re-issuing the old.
So in short, if you ever wanted to pull back some of your children memories out of the grave, don't count on the original companies. ... as image of pirated disks. It's practically sure that, if in 2010 you want to play the same game as in 1985, you'd probably see a cracktro in the beginning.
Some time you can find still working vintage equipment and media - but these will eventually break.
Today, the biggest part of these oldies are available
All your Commodore C64, Amiga, etc. favourite games are currently best sourced from download site which contain warez copies that were carried over back from that era, while at the same time the companies went belly up and/or let their assets rot.
So, in 25 years, when most of the current media companies have either disappeared, or completely forgotten about today's media, your children's best way to find a copy of them to remember fond memories, would be finding a copy which will be the digital descendant of what's today on pirate bay.
Yes, **AA, today's EVIL pirate, might be tomorrow's heroic archivist.
In 25 years, when the current maker of
"Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
Was this article sponsored by HP, Epson, Canon, Lexmark, and Xerox perchance?
...to chisel all my ascii porn
Preserving digital data is inherently hard.
Not only do you need to preserve the bits, but you also need to preserve the knowledge about what the bits mean.
So...instead of addressing this issue as important, the content owners have decided to add another layer...
Now, they encrypt the data, to prevent copying.
This makes the problem A LOT HARDER!
The content owners are the ones to blame if we lose entire decades of art and culture.
Seriously. Many copies. Multiple, ad-nauseum uber redundancy. And, so what about that DRM crap? Is it _that_ important to preserve pop music? If so, when did DRM ever stop us? Burn a CD/DVD/Blu-Ray and then RIP it, upload the thing. Put it on a hardened RAID, what...ever.
True, technology is an ever more complex cycle. I guess we should try to get the info/code down to lowest common denominator. Text? If so, what language? Boggles the mind..but if the unthinkable happens then maybe we can assist the great minds millenia hence rediscover what we did. What we did right, and how to avoid what we did wrong.
So I repeat, whatever the content is, tech records can be fragile. To paraphrase, "We need copies. Lots of copies."
I'm not really a web designer, I just play one on the Internet.
Last I checked, Microfilm (and Microfiche) was guaranteed to last 500 years by Kodak. Unlike JPEGs, you can read Microfilm with a magnifying glass, and speaking from experience both writing and reading, the quality isn't bad. It's also not horrendously expensive once you have the writer (I believe a Kodak i9600 Series Archive Writer sells for somewhere around 35-40k, depending on the model you get).
I wouldn't fool myself into thinking Microfilm is some magic solution to our digital storage problems, but it does go to show that there are ways we can save really important data. Given current technology, we could make something similar to Microfilm that didn't require professional development...
what of this idea: an opt in mirror as you browse option for web browsers
allowing anonymous mirroring via tarballs sent to a database and hashed.
some type of plugin could offer this so everything you browse is mirrored
automatically. there could be an option for the web browser's plugin to
also include this feature in the background, similar to how distributed
computing projects work, using a user set predefined amount of cpu time.
this could also be done through a screensaver, the plugin could work
through a fork of tor or something like it, where the plugin works in
such a way where the data is encrypted from the user and anonymized without the user or the collection bank at the end knowing who it came from, only hashes are generated and stored from each of the browser's anonymized submissions. the central databank for all this information could also work in such a way somehow offering torrents of hashed tarballs, allowing for further spread and storage of the data
This story was posted an hour and a half ago, and not a single one of you has made a snarky reference to Foundation.
... but not by much. You'll still see
10101000101010010101010101100100100010010101000101001
and have no clue.
In soviet Russia, God creates you!
Google does one better: Gmail Paper
... and how are you storing the checksum information?
It would be nice to have checksum information, eg MD5, on *all* files in any OS (in the file metadata).. but given that this is not a critical requirement for today's desktop systems I don't see it happening any time soon.
What I'd like is to have all files hashed natively by the OS and have that information available when required.
You have a sick, twisted mind. Please subscribe me to your newsletter.
me too!
The exception will be the Adult Superstores. I expect a lengthy and protracted archeological excavation at each site with all of the items meticulously cataloged and considered for their 'function'.
Well according to that last round of climate data falsification there has already been a digital dark age. The claim is the original source data has already been digitally lost. So no one can prove their numbers wrong, we just have to take their word for it, even though these same people have been caught fudging data that can be proven and not victim of digital dark age. I wonder if the tax man will believe me if I said it was a digital dark age so please take my word for those huge tax deductible donations I made.
And it's a big one. I've spent the better part of 30 years working administrative applications in IT. Every time you change from, say, one accounting system to another, the problem of how to preserve and be able to look at meaningfully old data become significant. Yes, you can have backup takes and disks running out your ears: but you also have to have the formats for this data, you have to have all the progrmas which accessed and massaged this data, you probably have to have the OS these programs ran on, and you have to have the compiler(s) used in the old system. Sounds pretty daunting, doesn't it. Or you can convert all the old data from old system(s) to new one. Do you know how much resources management will allot to that? But all of a sudden, someone has written an article about it, so it's a hot topic again. Sigh ... there's nothing new ...
MK
I thought this sounded familiar. I found a comment I wrote over 2 years ago for a similar story: http://slashdot.org/comments.pl?sid=207582&cid=16922754
Quantum mechanics: the dreams that stuff is made of.
I can still play the copy of Bionic Commando I bought for my C64 20 some years ago, but the ability for me to legally play the copy of Bionic Commando Rearmed I bought two years ago is in jeopardy since the company folded and the software needs to phone home on install. They never released a patch to clear the DRM. If a clearing of DRM on "dissolution" is not in the software license, then don't expect their to be one.
I know the answer to this one! Put librarians in charge of IT!
As someone who has done quite a bit of work in the genealogy line of software development I know what a problem this can be. The best alternatives we have currently for storing digital media last
There is however a company out there that is working on a project they call the "Millenium disc" apparently it can hold roughly as much data as a CD but it has an expected life of ~1000 years. It works by meshing the ideas of hieroglyphics and a traditional CD. This was an idea that was presented at the Family History Technology conference at BYU a few years back. I am not sure where the project is now, or even if it is still alive. But either way, we need a way to come up with a long term storage solution.
I think that some intrepid information preservationists on slashdot are doing their best to preserve a copious volume of our pr0n for future historians! You just know that it what they'll all want to study!
Significant amounts of data have been lost due to:
# Loss of physical mechanisms to retrieve data from media
# Programs and Operating Systems that will not retrieve and convert for use stored data
# Inability to convert documents as they were produced, used, read and received into a modern or permanent archival format (what, it lost the Italics / Umlauts)
But the main problems are:
# Good records keeping policy not applied to data bases / electronic data
# Good records keeping policy not applied to electronically generated documents
# A general decline in the quality of filing and records keeping policy with the computerisation of many office correspondence tasks and with the decline in pay and skill levels for white collar menial labour employees.
At the heart of the problem is a general decline in the quality of filing from about 1965 onwards. As far as I can tell from documentary records, the increase in office duplication technology encouraged this as records no longer became difficult to copy individual items, but became roneo'd and xerox'd pieces of disposable paper.
And that story had a comment just like yours linking to an even earlier story, and so on. Slashdotters LOVE recursion.
Say hello to my little sig.
If human race is alive 5000 years from now it will be in either of the following scenarios,
1. we learned to live within means of this planet and most likely colonized other planets and solar systems, or
2. we failed and a few handful of surviving humans live on a world devoid of easily accessible resources (eg. copper, nickel, silver, oil, etc.)
In first scenario, most of today's knowledge would survive as an interesting history project(s). In the latter, today's knowledge will mean very little.
So yeah, there ain't going to be any future civilizations. This is IT! This is the only shot we have. It is too late to think of "do overs". The last do-over possibility happened before humans discovered industry and antibiotics propelling our population to what it is today.
Apple is the Roman Catholic Church of the Digital Dark Age.
That, and wide-spread usage of open standards and free software, of course.
Free Manning, jail Obama.
It's by no means the most important topic out there (and there's a personal plug here, of course), but online gaming communities have thrived for well over a decade (even longer for some forms) but most of that information has been lost and forgotten. I decided about a year and a half ago to post my old demos of my days playing QuakeWorld Team Fortress (QWTF) on YouTube, a little out of vanity, a little out of preservation, and also because there just wasn't any real footage of QWTF on YouTube. After I got done with my old demos, I decided to start posting other demos that were publicly available on the internet. Some old players found my site and made their own contributions, but I was saddened that a lot of guys would say that they lost their old demos on a hard drive crash or something. Again, not the most important stuff ever, but I think QWTF was a pretty significant part of gaming history because it pretty much defined class-based multiplayer FPS (or at least popularized it), and, as far as I know, there isn't any sort of archive like mine anywhere else. If you're interested, http://qwtf.digitaljedi.com/ or http://www.youtube.com/user/Tickenest.
This is the NFL, which stands for "Not For Long" if you keep making those bulls*** calls.
typically kdawson posts it, what a tard.
If you mod me down, I will become more powerful than you can imagine....
"personal crap" could be just as interesting and as important to future researchers.
Indeed it is. My great-grandmother was left living in the Channel Islands throughout the German occupation during WWII. Over those years she kept a series of diaries, handwritten with a fountain-pen on paper that is now disintegrating.
My mother has since transcribed those diaries, so although they are not yet available online (my mum is for some perverse reason opposed to that), they are at least preserved for posterity.
My Wife is very suspicious of all technology so all our photo's, important Doc's are printed This helped alot when my hard drive crashed
as it is eaten so it shall pass
I would really love to see the expression on their faces when they find that Rick Roll video.
"Always backup in hard copy."
I tried posting the following reply on the website the original article appeared on, but their comment system kept having an error so I'll post it here and expand upon what I would have said.
There is nothing new here; the problems of electronic data deteriorating or becoming unreadable because of proprietary lock in of various closed-source applications is well-known going back more than 20 years that I'm aware of, and certainly a lot longer than that. The use of wire recorders, player piano rolls, 78-RPM records, phonorecords now, 8-track tapes, laserdiscs, 8" floppies, 5 1/4" floppies, Jazz discs, Zip discs, and now 3 1/2 inch floppies, (and lots of other media I know I've forgotten) are all now obsolete storage media, some of which may have data which can no-longer be retrieved because the hardware and/or software to read them is unavailable, lost or forgotten.
RMS on Digital PDP minicomputers running RSX and RSTS and VAX machines and OS . ISAM and PAM on Univac VS/9 OS on 90/60 /70 and /80. VSAM on IBM mainframes (except the few places continuing to run z-System). The Control Data Cyber systems and their data file formats. Gould, Goodyear, Harris and RCA mainframes. All of these are basically obsolete, most if not all are gone, and data stored on media from those systems, if developed by a proprietary application, is probably, for all intents and purposes, lost forever even if the data is still present. The media may have deteriorated, and the systems to read them are essentially nonexistent.
Mechanisms for regular conversion as technology changes have to be provided for. This, however, requires that as the older media ages, that there be budget and personnel available to provide the conversion while both old and new media types are available. As the case of NASA cited in the article (an employee scrounged equipment and tapes on her own in order to keep the data alive until a means to retrieve it could be found), sometimes either or both may not be available.
Libraries have mentioned how their resources are stretched thin as it is, they may not have the funds or trained personnel to export old data to new media. And at the rate media keep changing this is happening more and more frequently. 30 years ago is 1980, 250K 8" disks are still in use. The 5 1/4" 360K disc is popular because of MS DOS machines. 20 years ago is 1990, and then, the 5 1/4 was still and 3 1/2 inch floppies were becoming popular, 15 years ago a reasonable medium for high-capacity storage were 100-meg zip disks. Now I don't even have a 5 1/4" drive, my computer still has a 3 1/2 but I don't have any floppies or use them, because I have a 4 gb jump drive I wear on a lanyard around my neck, and cost ten bucks.
We've gone to digital storage because it's orders of magnitude cheaper than analog. I've pointed out in previous articles that with a digital camera and 4GB SD cards, I can take thousands of pictures at an effective cost per picture that effectively rounds to zero. A single photo might take 1/2 to 1 meg, which means, without changing media, I can take upwards of 3,500 photos. Net cost is $10 when the media is bought; nothing more unless I print an image. When I take pictures, I don't take one, I take 3, or 5, or 20 because the extra pictures are essentially free and I can delete the ones I don't want later. When I was using 35MM film, each photo, with film and developing, was about 30c. A couple hundred pictures would set you back over US$50. Today, for $50 I can take more than 20,000 images.
But my sister still has an older digital camera that uses Smartmedia, She has to be careful to copy her images to hard disc when she uses it because you basically can't buy smartmedia any more and even when you could, the maximum size was 128 meg. Her photos were in the 150K size range so she can still take more than 500 photos on a 64M chip, and also the cost is effectively zero.
And for current media it's still near-zero per image. I bought a 1 te
The lessons of history teach us - if they teach us anything - that nobody learns the lessons that history teaches us.
Big Brother will certainly enjoy this lack of history. It's much easier to fill an empty space than to replace an existing history.
Back in the old days, entertainment media was not seen as something worth preserving. Many media companies junked, trashed, or wiped content because they saw no need to preserve it for the future (or indeed were even contractually obligated to do so). As a result, there are hours upon hours, nay weeks upon weeks, of material that simply does not exist anymore. The story of the classic Doctor Who junkings comes to mind, where if a certain person had been stopped merely a year or so earlier, there would be many more existant stories on film than currently exist.
Nowadays, the problem is quite similar, although on a different scale. Many media organizations currently have excellent archival policies, such that even if material is not currently aired or sold, it is at least maintained in an archive somewhere. The problem lies with smaller-scale content (publically available and private), which doesn't demand the same archival treatment. Admittedly, a vast majority of this content is shit. However, for the benefit of future generations, we should strive to retain as much data as possible, and let them determine what is shit and what is worth keeping.
I'm all for letting future generations sift through my data. They'll mostly find crap, but if they see a few gems in there, all the better.
FC Closer
When I started using computers I had 1 5"1/4 floppy drive, when I got a 3"1/2 drive I made backups of my old floppies, then I got a HD, then a ZIP drive, archived all my floppy and HD inf on ZIP disks, at the same time as I kept getting bigger HD, I kept a backup of my old HD on the new one including backups of my ZIP drives, cds/dvds. It's true today I have ~6TB of data where ~1/3 of them are backups (of backups of backups), and I got those 6TB backuped as well, but I still can get my 25 years old floppies! Virtual Machines & emulation should also ensure being able to access/use those informations, it may require in the future to run emulators through emulators....
Only problem I see: in case of strong EMP/solar flare, all the word's data may be destroyed (would optical media be destroyed as well like when you put them in the microwave?)
Classic stupidization of society. What to do? How about organizing yourself? How about solving your own problems? How about thinking what data you really want to preserve and keep accessible until you die, and organize yourself accordingly? It doesn't have to be digital to be lost: NASA lost the Apollo 11 recordings even if they were in a really quirky and analog format, they could be read today, only they 'misplaced' them in their own terms.
Sheesh people. Build a ZFS array, OpenSolaris is out there, keep everything online 24/7 for 60 years until you no longer need them, let the next generation decide what to keep. If a disk dies, replace it promptly. Have backups. What more do you need besides getting off your ass?
Still, in previous stories I didn't find any reference to PaperBack.
It just lacks a textual description of the matrix format to attach at your centuries-lasting data.
5,000 years from now people or aliens can just Google it.
I am only half joking. Google does (or is capable) doing what in some respect government offices use to do like the library of congress. The only problem is that it is not clear just how long Google keeps its data. Does it really not have a backup of what it crawled and cached of any sort from say 2 years ago or 5 years ago somewhere in a bunker under a mountain?
Living in Chile
The problem nowadays is the excessive amount of data, there is so much that we have no information at all.
This is a
The first part I laughed at was that he had forgotten to make a backup of the program he used to make and read the backups... ......wonder why.... : P
then I sort of forgot to read the rest,
Funny that. I think our forefathers did just that i.e. they (unknowingly?) created backups when knowledge was written by hand...
Certainly not unknowingly. Many, if not most literate monks would have been perfectly aware of the fate of the Library of Alexandria. There is also evidence that there were also scribes who were not really literate in any language, but who were capable of reproducing a text without fully comprehending it. The only possible reason to do this would be to create a backup copy.
But in any case, it is worth mentioning that copying texts by hand has also been done in recent times in the form of Samizdat literature, a means of publication not uncommon even in the 1980s.
they are available on ebay, craigslist, etc. I have bought several and they work fine!
Ask Me About... The 80's!
It's a legal and social issue as well. Unlike something written on paper, works on the web are prone to disappear when payments to the service involved stop. Proposal: Advance Directives for our digital legacies
And a fireplace or wood burning stove.
Junk mail is great. Free heating.
Deleted
LOCKSS - Lots of Copies Keeps Stuff Safe:
http://lockss.stanford.edu/lockss/Home
Paper.
http://ronja.twibright.com/optar/
Stone ain't a medium, it's a hard.
Deleted
And nothing of value was lost.
Hallo, Welt!