Slashdot Mirror


Most Digital Content Not Stable

brunes69 writes "The CBC is running an article profiling the problems with archiving digital data in New Brunswick's provincial archives. Quote from the story: 'I've had audio tape come into the archives, for example, that had been submerged in water in floods and the tape was so swollen it went off the reel, and yet we were able to recover that. We were able to take that off and dry it out and play it back. If a CD had one-tenth of one per cent of the damage on one of those reels, it wouldn't play, period. The whole thing would be corrupted'. Given the difficulties with preserving digital data, is it really the medium we should be using for archival purposes?"

19 of 353 comments (clear)

  1. That's nothing, think of DRM by iamacat · · Score: 4, Insightful

    That content can not be preserved at all. We'll be a civilization without written history, like American Indians.

    1. Re:That's nothing, think of DRM by Rude+Turnip · · Score: 5, Funny

      And if they didn't insist on DRM in their smoke signals, they might still be a pretty formidable group today.

    2. Re:That's nothing, think of DRM by vertinox · · Score: 5, Insightful

      The Romans Conquered the Greeks, the Normans conquered the Saxons, etc. The list goes on and on. The case has ALWAYS been that if some other nation wanted your land and you couldn't stand up to them in a military confrontation, then you were gonna loose that land.

      As a person who loves to study European antiquity I would point out some flaws in this thinking...

      1. When the Romans conquered the Greeks they actually adopted Greek culture and didn't kill off the Greeks.
      2. When the Normans conquered the Saxons they didn't kill off the Saxons nor really conquered their land as much as just intermarried with them (Hence Anglo-Saxon Culture)

      The only whole sale Genocides that history can come up with is the Crusaders massacre of Jerusalem (which wasn't really as much as hatred of Muslims as it was starving Europeans killing off everyone in the city regardless of religion out of rage of having to starve in the desert for several months) and then the Mongol sack of Baghdad which wasn't over so much as land, but out of spite of the execution of Mongol diplomats (considering they burned and salted the lands made the "take your lands" point of conquering sort of a non-issue).

      The genocide and seizure of lands in this scale was never really seen before until the colonization of Americas. It wasn't as much as the Indians could not defend them as much as it was that the westerners thought they were subhuman.

      Which sadly we saw again in the European theatre in WW2.

      --
      "I am the king of the Romans, and am superior to rules of grammar!"
      -Sigismund, Holy Roman Emperor (1368-1437)
  2. Multiple identical copies? by WinterSolstice · · Score: 4, Insightful

    Isn't that the point of digital? Lossless copies are possible (depending on format obviously). Why have one plastic cylinder that can be lost when you can have it in 5 or 10 locations?

    --
    An operating system should be like a light switch... simple, effective, easy to use, and designed for everyone.
    1. Re:Multiple identical copies? by t00le · · Score: 5, Informative

      Any good backup strategy will have multiple media types, so CD/DVD should not be your primary backup media type. If you prefer to have an medium for fast access, then it is still viable. As long as it is not your primary media type, which should be something with tried-and-true longevity.

      --
      When the only tool you have is a hammer, every problem looks like a nail
    2. Re:Multiple identical copies? by gad_zuki! · · Score: 4, Insightful

      The cost of multiple backups is very real. The real issue here is that this is a frivolous complaint. First off, wet tape being readable is an artifact of the medium. The rosetta stone in the british museum is pretty readable but we arent exactly throwing out our modern media to go back to stone. Also, lets consider a reel to reel tape is about 90 minutes (7inch). 650 megabytes on a standard disc at encoding similiar to the quality you get out of a reel to reel tape is something like 1,500 minutes. And its smaller. So lets not go a little too crazy with idealizing the past.

      Also I'm certain for every analog horror story there is a digital lucky story (and vice versa). Not to mention digital encodings usually have some kind of redundancy. A small scrach does nothing but the same scratch on an lp forever destroys some part of the track. I wont even go into the magic of data restoration (which the author ignores). There's really no 'tough medium for the ages' out there that can do it all. Just complaints and blind-luck stories.

    3. Re:Multiple identical copies? by Red+Flayer · · Score: 4, Funny

      So (-1, redundant) should now be (+1, redundant) for posterity's sake? And dupes are posted for archival reasons?

      I'm confused.

      --
      "Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
  3. Stone tablets by IckySplat · · Score: 5, Funny

    Stone tablets. Just drill a hole for a zero and your away and laughing
    Now we just need a large enough area to store them :)

    --
    Help! help!, the termites are eating my DRAM!!!
    1. Re:Stone tablets by kalirion · · Score: 4, Funny

      Hey, it worked for Moses...the 10 commandments are still around.

      That's out of the original 15.

  4. 3.5" by otacon · · Score: 4, Funny

    At the enterprise level we use 3.5" 1.44MB Floppy drives in an elaborate redundant array. It consists of roughly 70,000 Disks, each changed nightly. We haven't had any problems yet. Hopefully the rest of the world will play catch up soon.

    --
    In a world of acronyms, the words are the real victims.
  5. It's the messanger, not the message by Anonymous Coward · · Score: 4, Insightful

    Ridiculous. It's not the fact that content is digital, it's the fact that the media being used to store the information (CDs etc) is fragile. If these mythical audio tapes had been digital tapes, recovering the signal from them would have been just as easy.

  6. But what you got off the tape... by dave420 · · Score: 4, Insightful

    ... wasn't *exactly* what you put on. You have the appearance of stability, that you can retrieve something off a damaged tape, but the truth is something different. That's the beauty of analogue. The same simplicity and fault-tolerance of the format also means the format will naturally degrade over time. The contents may be retrievable, but they've degraded, and as such are not the same contents as when first written. Digital fails, but when it doesn't fail, you have exactly the same content as you did when you started. Archivists will not run from digital - their techniques will improve instead. or something.

  7. wring recovery method by Red+Flayer · · Score: 4, Informative

    If a CD had one-tenth of one per cent of the damage on one of those reels, it wouldn't play, period.
    That's because you're trying to optically read through the damaged part. It is possible to recover data from damaged discs, as long as only the coating (and not the reflective surface) is damaged. It is quite possible to polish the surface and read the data, or even to fill in some of the damage and repolish for reading.

    Just because it's harder to recover the data doesn't mean it's impossible.

    Of course, anyone using CDs or DVDs for large data backup must have a lot of interns to do the disc swapping.
    --
    "Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
    1. Re:wring recovery method by Criffer · · Score: 4, Informative
      Exactly. If you try to put a bent CD into a CD drive, you're obviously not going to be able to read it. But that doesn't mean its not recoverable.

      To recover data from a CD, you can simply photograph it at high enough resolution. Even with huge scratches, even with parts of the disc physically missing, you can recover the data exactly as it was encoded. How? Reed Solomon code .
      Quoth wikipedia:

      The result is a CIRC that can completely correct error bursts up to 4000 bits, or about 2.5 mm on the disc surface. This code is so strong that most CD playback errors are almost certainly caused by tracking errors that cause the laser to jump track, not by uncorrectable error bursts
  8. They could try harder by Waffle+Iron · · Score: 4, Interesting
    The CD wouldn't play with an off-the-shelf CD player. That doesn't mean that a special "archaeological" CD player can't be built that would perform extensive microscopic image analysis of the disk surface in order to read the data in the face of extensive corruption.

    Some analog technologies, like old color films, have also degraded and need image enhancement to recover the original content.

  9. Every Superman has his Kryptonite by elrous0 · · Score: 4, Insightful

    Yes, analog tape is durable. But let's take it and that "CD" and put them in front of a large electromagnet and see how each fares.

    --
    SJW: Someone who has run out of real oppression, and has to fake it.
  10. have people already forgotten? by phantomfive · · Score: 4, Informative

    Have people already forgotten the advantage of digital? If you have an analog tape, every time you make a copy of it, the quality will be degraded. But with digital, you can make a million copies and the final copy will be the byte by byte equivilent of the original. So what if CDs only last 10 years before becoming unusable? You can make another copy! So what if this guy wouldn't have been able to recover after physical damage to his media....if it was important, he should have had digital offsite backups! And those backups would have been 100% equivelent to the originals.

    --
    Qxe4
  11. Umm.. by phasm42 · · Score: 4, Insightful

    If a CD had been submerged in water, it would've been fine. There's no point in making the comparison if it wouldn't have been damaged in the first place. They need to find a better example.

    --
    "No one likes working in a hamster wheel, and your shop smells of cedar shavings from here." - TaleSpinner
  12. It's already happened/happening. by Kadin2048 · · Score: 4, Insightful

    This also is the response to the other big cry-wolf thing, "What happens when the data is in a format that's too old???!!11one" The answer is we just keep copying it to new formats. I have digital copies of papers that I wrote in high school. They were written on an old copy or Works for Windows 3.1 and usually saved to floppy. I don't have a floppy any more but it isn't a problem. I long ago transferred them to a harddrive and I just keep transferring them to new drives when I get them. I also periodically load the old documents in to whatever my current word processor is, convert them, and re-save them as a new format.

    I think you're missing an important element here. As you move along in time, the volume of data that must be converted to the format du jour only gets bigger and bigger.

    For a single person, it's probably not too bad. I, too, have pretty much everything I ever wrote since I first got a computer, and every few years I've committed to rolling the whole thing onto new media. So I've gone from offline backups on floppies, to Zip disks (in retrospect a mistake), to CDs, to DVD-R, and now to DVD+R (the -R discs were crappy and I've since heard that +R is a superior format anyway). This isn't much trouble, because the amount of data I have to backup hasn't really grown that much faster than the data density of available media. I'm probably up to a couple of DVDs for the stuff I really, really care about, maybe a binder if I include all the photos and video.

    But what's a basic Saturday-afternoon copy-and-burn job for an individual is a Sisyphean task for a large government agency or library, particularly one who is constantly generating new content. I've seen places that could barely keep up with archiving the stuff they were producing, much less roll their vast archives forward onto new media. So they'd have vaults of hard drives, sitting next to DLT cassettes, next to IBM 3480, next to racks of old half-inch open-reel tapes. Probably back in some dark corner there were piles of punched cards; it really wouldn't surprise me. The problem of data loss due to unreadable formats isn't some abstract 'maybe,' it's already happened in a lot of places (but nobody really wants to talk about it, so it mostly gets buried and whatever's on the tapes gets written off).

    The reason why there's so much interest in preservable formats is because while it may not be strictly impossible to constantly roll old backups and archives forward, it's very hard, and requires vast amounts of effort and expense. If you have a backup that's being written into a format that you know is going to be readable for a long time, even if it's more expensive to write initially, you can save a lot of money and time down the road by not having to copy it forward as often.

    People may get a little shrill when they're talking about these issues, but they're quite real.

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."