Slashdot Mirror


Ask Slashdot: Keeping Digital Media After Imaging?

New submitter rogue_archivist writes "I'm an archivist at a mid-sized university archives, trying to develop a policy for archiving computer files ('born-digital records' in archival parlance). Currently old floppy disks, CDs, and the occasional hard drive are added to our network storage. Then the physical media is separated from archival paper documents and placed into storage. My question for all you slashdotters out there is: should these disks be imaged and then the physical copies discarded? Is there any benefit for keeping around physical copies of storage media long since rendered obsolete?"

122 comments

  1. Is there any benefit? by Anonymous Coward · · Score: 0

    No. Image and discard.

    1. Re:Is there any benefit? by cusco · · Score: 4, Insightful

      As an archivist, I would think you might want to:

      1) keep multiple copies of each type of media, preferably from different manufacturers, all written with identical data
      2) Separately, a copy of the data contained on the media

      Occasionally check the media to see at what rate their integrity is decaying. As readers for the media become increasingly difficult to encounter develop alternative methods to read the data, checking it against your reference copy. Eventually someone is going to appear on your doorstep with something like the Pioneer spacecraft data tapes or the Nixon Oval Office recordings, and if you can pull the data off it you'll be the hero of the day.

      --
      "Think about how stupid the average person is. Now, realise that half of them are dumber than that." - George Carlin
    2. Re:Is there any benefit? by cusco · · Score: 4, Interesting

      Just to reply to myself . . .

      Target headquarters in Minneapolis gets VCR tapes from security systems all over the country, and they work with the FBI to read them and export the video. Security equipment manufacturers are notorious for using proprietary equipment or file formats to limit interoperability with the competition's systems, and they apparently have a lab that specializes in decoding them to extract the usable data.

      --
      "Think about how stupid the average person is. Now, realise that half of them are dumber than that." - George Carlin
    3. Re:Is there any benefit? by Anonymous Coward · · Score: 0

      Eventually someone is going to appear on your doorstep with something like the Pioneer spacecraft data tapes or the Nixon Oval Office recordings

      The Nixon oval office recordings were analog; there was no such thing as digital sound recording back then.

    4. Re:Is there any benefit? by jc42 · · Score: 1

      Security equipment manufacturers are notorious for using proprietary equipment or file formats to limit interoperability with the competition's systems, ...

      You should have written " All manufacturers are notorious for using proprietary equipment or file formats to limit interoperability with the competition's systems." This problem has been with us from the start of the industrial age, and possibly earlier.

      This was one of the primary reasons why, back in the 1960s, the US military's ARPA (Advanced Research Projects Agency, now DARPA) created the R&D project that led to the ARPAnet, which evolved into the Internet. The military folks were using more and more electronic gadgetry, and had learned that it was impossible to write military specs so precisely that manufacturers couldn't find subtle ambiguities and create data formats that were incompatible with their competitors. The manufacturers' reps always used the same argument: "If you'd bought only our stuff, you wouldn't have problems of incompatibility."

      So they faced the fact that such incompatibilities are a permanent fact of life, and developed a solution: They'd plug electronic gadgets into those newfangled "computer" thingies, where they could write software that would decode a gadget's signals and data formats, translate them to a standard format, and transmit them to another computer with attached proprietary gadgets, where the software would translate the standard formats into their manufacturers' formats. This allowed remote military sites to use whatever equipment they had on hand (which hadn't been destroyed by some enemy ;-), and it could communicate with other equipment anywhere else that they could send the standard-format data.

      This has always applied to "obsolete" equipment, too. Manufacturers want customers to continually upgrade to the latest stuff, and encourage this by making the current stuff unable to communicate with stuff more than N releases old. But if you still have the software that talked to the old models, you can use it and the ARPA/Internet system to communicate with the newer models, using the same translation-to/from-standard scheme.

      It can be amusing to read comments implying that this sort of incompatibility is something new. It's not only not new; it was one of the prime motives for the development that led to the Internet and its protocols half a century ago. Without it, hardly any electronic gadgets would be able to communicate with anything not from the same manufacturer (and "upgraded" fairly recently).

      If you investigate, you may be surprised to learn how much of the Internet's infrastructure is running on ancient PCs that will no longer run MS Windows (or DOS ;-). I've helped build a number of "server centers" that were made up mostly of free PCs whose previous owners just wanted to dump them. It's now 10 or 15 (or 20) years later, and the server software is still sitting there running just fine, talking to any equipment that it can physically exchange bits with. It's all open-source software, so the software is easy to upgrade indefinitely. The vendors aren't very happy with us, though. ;-)

      I've also worked on a number of software projects that can be summarized as "cracking" a company's data, typically from old backups, and sending it to modern computers in formats that they understand. This hasn't been for law-enforcement or military agencies; it has been for the companies themselves who find that their old data is either unreadable or misinterpreted by current IBM/Microsoft software. It's an old story ...

      And to steer back to the original topic, the same comments apply to whatever digital media you may have. If you want your family photos or videos usable 20 or 50 or 100 years from now, you should be translating them to (preferably several of) the current standard formats. Keep multiple copies of each.

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
    5. Re:Is there any benefit? by cusco · · Score: 1

      Personal conversation with a former boss who had been an intern at the inception of ARPANet indicated an origin slightly different than the 'official' version. She said that her boss was tired of having four different terminals on his desk to communicate with the four different project groups that he worked with. He and some of the other sysadmins slapped together a method that would allow the local mainframe (some DEC monster) to talk to the HP mainframe in (IIRC) Colorado. They guys in Colorado adapted that work to let the HP talk to an IBM in Palo Alto(?). Now her boss could communicate with the IBM and the HP simultaneously, and get rid of two of the terminals.

      Salescritters saw what they had done in, essentially, their spare time and said, "We should be getting paid for this!" The official history starts there.

      --
      "Think about how stupid the average person is. Now, realise that half of them are dumber than that." - George Carlin
    6. Re:Is there any benefit? by jc42 · · Score: 1

      Actually, I've read a number of similar stories, dealing with both the original ARPAnet and the origins of unix at Bell Labs. What they all have in common is the problem of dealing with incompatible gadgetry that use different data/message formats, with a solution that involved connecting the incompatible stuff to a computer running software that acted as a translator. Acting as a remote terminal was part of about half the stories, and translating file formats was often involved..

      I get the impression that the general incompatible of electronic stuff, especially computers, was a common complain in the years around 1970. That was a time when computers were spreading rapidly, and smaller computers were coming out that were cheap enough that they could be the center of a small lab rather than in an organization-wide computer center. So it was feasible for people to consider uses that wouldn't be permitted in the batch-oriented computer centers of the time. Using a computer as a "middle-man" between stuff that couldn't quite communicate was likely a widespread application of such smaller computers.

      It's too bad that a lot of this was so poorly documented, so that often all we have is after-the-fact anecdotes from a lot of different sources. But the people involved probably just thought they were working on personal annoyances with their equipment, not inventing important new things or fomenting some sort of revolution. ;-)

      --
      Those who do study history are doomed to stand helplessly by while everyone else repeats it.
  2. DVDs only live for 7 years by elabs · · Score: 1, Insightful

    For born-analog content, always keep the original physical copy. You never know when you will need to rescan at a higher quality or when you will discover errors in your digital copy. DVDs are not born analog. In fact the only have a shelf life of around 7 years. You need to get everything off DVDs and make several digital copies of it. You should keep the DVDs as long as possible but eventually you will not be able to read them anymore. Make sure your digital copies of the DVDs are error-free because there will come a time when you cannot go back to the DVDs.

    1. Re:DVDs only live for 7 years by Russ1642 · · Score: 2

      You're thinking of burned DVDs. Most professional video DVDs are stamped.

    2. Re:DVDs only live for 7 years by Em+Adespoton · · Score: 2

      You never know ... when you will discover errors in your digital copy. DVDs are not born analog. In fact the only have a shelf life of around 7 years. You need to get everything off DVDs and make several digital copies of it. You should keep the DVDs as long as possible but eventually you will not be able to read them anymore. Make sure your digital copies of the DVDs are error-free because there will come a time when you cannot go back to the DVDs.

      Hmm... DVDs only live for 7 years, eh? In an archive?

      I was just watching a DVD last night that I bought in 2000; it still works fine, with no scratches or degradation. I was also pulling data off a DVD-R the other day that I recorded in 2003. This DID have a slight bit of degradation, so maybe there's an issue here. Never had a problem with properly stored pressed DVDs though.

      For that matter, I've still got 5 1/4" floppy disks that have readable data on them from 198, and Audio CDs from 1990. Got rid of all my cassette tapes though; both the digital and analog ones degraded really quickly with use.

    3. Re:DVDs only live for 7 years by Russ1642 · · Score: 1

      Well we all know that floppy's from 198 were made of stone so of course they're going to last.

    4. Re:DVDs only live for 7 years by PRMan · · Score: 3, Insightful

      Not this tired argument again. I have burned CDs from 1995 that still work perfectly fine. Sure, they "estimated" that they would only last 7 years. Guess they were wrong, since unless I can see physical scratches or other damage, 99% of my discs from my life still work perfectly. The only ones that didn't last and had no physical damage were a cheap brand I got where the dye turned cloudy, but that happened within the first 2 years.

      --
      Peter predicted that you would "deliberately forget" creation 2000 years ago...
    5. Re:DVDs only live for 7 years by SuricouRaven · · Score: 1

      The lifespan of a burned DVD is highly unpredictable. Some can last decades - but don't count on it.

    6. Re:DVDs only live for 7 years by cusco · · Score: 1

      Don't leave them laying in the sun. I was able to say the same thing until I left a burned CD from the late '90s on the window sill a couple of years ago. Four of the five CD players that I put it in couldn't read it at all, and the fifth could only make sense of about half of it.

      --
      "Think about how stupid the average person is. Now, realise that half of them are dumber than that." - George Carlin
    7. Re:DVDs only live for 7 years by Anonymous Coward · · Score: 0

      I'd love to see the stack of digital cassette tapes though...

    8. Re:DVDs only live for 7 years by Anonymous Coward · · Score: 0

      And I have or had commercial DVDs from 1994 that were beginning to show visible cracks and had become unreadable in 2001 even though they had been stored in dark, cool places normally considered ideal for preserving them.

      Lesson: Your mileage may vary.

    9. Re:DVDs only live for 7 years by Miamicanes · · Score: 5, Interesting

      > I have burned CDs from 1995 that still work perfectly fine.

      If they're redbook audio CDs, and your definition of "work perfectly fine" is "I can stick the disk in, hit play, it spins up at 1X, music comes out, and the player doesn't totally gag", you might be right. Now try ripping the disc using software that can monitor the realtime bit error rate. You'll probably be *horrified* to see how high it is.

      Redbook audio CDs are very robust, even when their bits are rotting all over the place. They were designed in an era when hardware couldn't do much in realtime, so they bent over backwards to make sure they had a "plan B" to make sure the show would go on after the disc got scratched, dirty, or whatever else happened to it. They were designed so the audio data is interleaved in a way that when a read error occurs, the left and right channels get merged for 1 sample. A redbook audio CD has to be nearly *destroyed* (cracked, melted, fried, whatever) before it literally won't play, as long as the player is able to find the lead-in and sync up to the spiral track.

      It'll start to sound "rough" and lose channel separation, but things have to be pretty bad before it will LITERALLY stop playing. At least, as long as the player itself is faithfully following the original redbook audio specs, and isn't trying to realtime-rip the audio to a ram buffer and play it back from there (which is what some, if not most, new optical-disc media players do TODAY). I have plenty of CDs that new players choke on and refuse to even try playing, but yet my 25 year old antique CD player that cost something outrageous like $600 or $800 when new, can play just fine. Apparently, it's because first-generation CD players were precision hardware that could blindly track a CD spiral as long as the disc itself was 100% within spec, whereas new players depend upon realtime error-analysis to stumble and wobble around, and make up for the fact that discs no longer spin precisely, and worm-gear optical assemblies no longer track with precision measured in microns.

      That said, my experience has ALSO been that CD-R discs manufactured in THIS century are less likely to rot and become unplayable in new drives, but are more likely to have major problems with old players. The old players were precision hardware, and assumed the discs themselves were manufactured to precision specs. The first-gen CD-R media had dye that deteriorated over time, but their spiral tracks were spot-on, just like pressed discs. As drives got better at handling sloppy tracking, the discs themselves became sloppier.

      Net effect: first-gen redbook audio CD-R media is likely to play with acceptable audio quality on an old CD player from the 80s or early 90s, but be unplayable on many modern drives & be un-rippable on most drives (some will allow you to spin down to 1X & emulate the playback mode of a legacy player if you're running a sophisticated ripping app). Newer discs that are still old will probably skip and have problems playing on an old player, but might still be equally bad on a new one. When today's bargain-bin CD-R media is 10 years old, it will probably be unplayable on anything, the same way my old VHS tapes from the 80s still play fine, but VHS tapes recorded after ~1998 are largely unplayable on anything I can find.

      TLDR point: the storage life of "last-gen" CD-R media is likely to be better than first-gen CD-R media was at the same age, but enormously WORSE than that of the best "turn of the century" CD-R media (the golden era when quality standards were still high, and the worst faults of the first-gen media were addressed. Any box of CD-R media you buy TODAY is probably shit of the worst kind. The best media you can buy TODAY for long storage life? Non-LTH BD-R single-layer discs. But MAKE SURE they aren't LTH... most manufacturers don't go out of their way to scream, "These discs are LTH garbage!"

    10. Re:DVDs only live for 7 years by vux984 · · Score: 2

      You should keep the DVDs as long as possible

      Bottom line, if you have N digital copies then what is the benefit of keeping the original DVD over one N+1 digital copies of the DVD?

      Near as I can tell. Zero benefit. And massively increased storage requirements. So make one extra digital archive and discard them. Better still donate them! to public libraries? independent / private archivists? You don't have to "destroy" them -- which is surely about as counter-instinctual as it gets for an archivist. :)

      eventually you will not be able to read them anymore

      Odds are that if there were errors reading from it today, you won't get a better copy from that disc 50 years from now. Better to make copies from 2 different discs or exchange back ups from another center. 2 different rips of the same disc is better than 2 copies of a rip from the same disc in terms of ever being able to restore missing information from a rip.

    11. Re:DVDs only live for 7 years by vux984 · · Score: 1

      er... "2 different rips of the same disc"

      I actually meant rips of the same title from different discs.

    12. Re:DVDs only live for 7 years by Anonymous Coward · · Score: 0

      They look like standard audio cassette tapes (at least in some incarnations).

    13. Re:DVDs only live for 7 years by Anonymous Coward · · Score: 0

      Can I borrow the time machine you used to get DVDs in 1994? They were invented in 1995. They became commercially available in Japan in 1996, and in the US in 1997.

    14. Re:DVDs only live for 7 years by ganjadude · · Score: 1

      about 1/2 of my C64 and C128 5 1/4 floppies are still working in the C128 (the 2 64s i have no longer boot, i think a cap popped) the other half are unreadable, at least by the handful of 5 1/4 drives I own. Some of these disks are almost 30 years old. Kept in temp controlled room for the past 13 years (when i aquired them from a school). I cant say ive tested all disks (i have over 500 pounds in weight, just in floppy disks and sleves) but I have to admit that some disks that I know worked in the past are not workng as of 6 months ago when I had the urge to set everything back up. But the point im getting at is that there is no way DVDs, DVD-rs or even DVD-rws have a 7 year life.

      --
      have you seen my sig? there are many others like it but none that are the same
    15. Re:DVDs only live for 7 years by Em+Adespoton · · Score: 1

      about 1/2 of my C64 and C128 5 1/4 floppies are still working in the C128 (the 2 64s i have no longer boot, i think a cap popped) the other half are unreadable, at least by the handful of 5 1/4 drives I own. Some of these disks are almost 30 years old. Kept in temp controlled room for the past 13 years (when i aquired them from a school). I cant say ive tested all disks (i have over 500 pounds in weight, just in floppy disks and sleves) but I have to admit that some disks that I know worked in the past are not workng as of 6 months ago when I had the urge to set everything back up. But the point im getting at is that there is no way DVDs, DVD-rs or even DVD-rws have a 7 year life.

      I was just watching a DVD last night that I bought in 2000; it still works fine, with no scratches or degradation.

      Commercially pressed DVDs are a different beast than the ones you write yourself. The foil is etched and pressed into the plastic instead of inks being hit by lasers after assembly. The ones you write yourself seem to have about a 7-10 year life if you treat them well.

    16. Re:DVDs only live for 7 years by Em+Adespoton · · Score: 1

      Can I borrow the time machine you used to get DVDs in 1994? They were invented in 1995. They became commercially available in Japan in 1996, and in the US in 1997.

      Indeed... I figure the cracks were due to the stress of said time travel. When I started collecting DVDs in 2000, video rental stores hadn't even heard of them, and I had to order them online.

    17. Re:DVDs only live for 7 years by Miamicanes · · Score: 1

      > For born-analog content, always keep the original physical copy.

      And if, for whatever reason, you CAN'T retain the original copy, oversample the bejesus out of it, store it with the most lossless compression you can, and sample it multiple times, in multiple ways, with multiple scanners/digitizers/capture cards.

      Case in point: videotape. If your goal is to merely capture a copy to casually watch years later, and you don't mind having a digital copy that's demonstrably worse than the analog original, just about any cheap one-click capture card will do. BUT... if you want to stop the clock and capture a copy that doesn't visibly look worse than the original, compress it to MPEG-2 it at a MINIMUM of 6,000kbps VBR, at 720x480 (or 540, if it's PAL). Yes, we all know that VHS is low-res. The problem is, it's also noisy, and noise doesn't compress well. If a 720x480 video encoded with VBR and the maximum DVD-compliant bitrate available looks as good as the original VHS, consider yourself LUCKY. Regardless, never downsample the vertical resolution. Horizontal resolution with VHS is open to debate, but your vertical resolution is known and definite -- 480 (525) if it's NTSC (or 60hz PAL-J), and 540 (576) if it's PAL.

      Now, we get into my #1 beef with Blu-Ray -- its despicable lack of compliant encoding options for 480i60 and 540i50 source, so there's really no way to profoundly oversample VHS, yet retain casual "stick in the disc and hit play" usability that can be watched directly in a Blu-Ray player, but someday used for restoration efforts.

      If you REALLY want to capture a videotape and preserve every possible nuance for future restoration efforts before the tape degrades further, horizontal oversampling of VHS is mandatory. If Blu-Ray supported a hypothetical 1280x480 resolution with 4:4:4 chroma (remember, with VHS, your vertical resolution is your "good" resolution... it's 480 or 540. It's your horizontal color resolution that completely sucks. Chroma-encoding variants that split color between 2 scanlines will ALWAYS visibly-degrade VHS captures. Don't do it. )

      In a perfect world, there would be a near-line archival video format designed for videotape captures that directly sampled the full-bandwidth FM signal coming off the tape (preferably, at 3 slightly different trackings... spot-on, plus one slightly "off" each way). A few times, I've tried to work out the math and figure out what kind of insane sample rate would be necessary to try magnetically sampling the tape straight-on, with multiple overlapping rows of microscopic read heads, and sufficiently-high density, to literally map the flux of every flake of oxidized metal on the tape and allow future offline reconstruction by digitally simulating VHS's diagonal read path, so that even if the original tape were destroyed, a future restoration could be performed against the digital copy. I have a hunch that something like this is actually do-able now (and probably would have been do-able a few years ago, had multi-terabyte hard drives been affordable), but I've never even read speculation online about such an archival device.

      The point is, if your goal is to someday do color restoration, resolution-enhancement, or whatever, taking advantage of analog's tendency to keep surprising us with ghostly hints of lost detail that we'd always assumed were gone, you need a profoundly-oversampled copy with basically NO lossy compression (or oversampling so profound, you can downsample away the artifacts). Why do you need so much oversampling depth and detail? True hardcore restoration and after-the-fact re-synthesis is going to depend upon what are known as "higher-order artifacts", and those are precisely what get mangled by any kind of lossy compression. Remember, lossy compression is designed to throw away bits in areas where your eyes won't notice detail. An algorithm looking for subtle field-ripples as evidence that a pixel nearby is supposed to be intensely red instead of brownish-orange won't see them if some earlier compression algor

    18. Re:DVDs only live for 7 years by Anonymous Coward · · Score: 0

      480/525 is correct for NTSC, but PAL has never been 540/576. It's 576 visible lines, 625 total. As such I suspect you've never actually used or been anywhere near it.

    19. Re:DVDs only live for 7 years by unitron · · Score: 1

      By 198 I think they were already up to parchment and sheepskin, although papyrus hadn't totally fallen out of favor at that point in some parts of the world.

      --

      I see even classic Slashdot is now pretty much unusable on dial up anymore.

    20. Re:DVDs only live for 7 years by WillKemp · · Score: 1

      Not disputing what you say, but video disks were around in the early 80s. When i was working (as a games programmer) for Thorn EMI in London in 1982, they were working on interactive video disks. These were 10 inch disks as far as i remember.

    21. Re:DVDs only live for 7 years by robogun · · Score: 1

      Never gave a second thought to storage of CDs. I have a very old CD collection - many discs from the 1980s. After reading this, I ran in a panic to see if they still play. Hmmm. they all still play and are rippable. I am a photographer. Have backed up images onto several standard file boxes of CDRs, from 1999-2003, then copied to DVDRs (just filling the first standard file box), then large HDs. They are jammed in sleeves, not jewel cases. Random samples all play. It is the hard drive backups that are miserable failures.

    22. Re:DVDs only live for 7 years by Anonymous Coward · · Score: 0

      Those Laservision disks were analog.

    23. Re:DVDs only live for 7 years by Miamicanes · · Score: 1

      Another big problem that bites lots of people with CD-R specifically... CD-ROM/XA Mode 2, Form 2... and the willingness -- if not eager default -- of early software like Nero to allow users to disable error correction to gain a few more megs of storage space without making it abundantly clear to those same users what the consequences of doing it were. Lots of people went for YEARS burning discs that would develop hard errors at the slightest scratch without realizing WHY it was happening. They just attributed it to drive/media flakiness, and didn't realize that it was their burning software setting them up for disaster.

    24. Re:DVDs only live for 7 years by Miamicanes · · Score: 1

      > Bottom line, if you have N digital copies then what is the benefit of keeping the original DVD over one N+1 digital copies of the DVD?

      The original DVD was pressed. Digital copies depend upon either brittle spinning hard drives that are cost-prohibitive to repair and recover data from, flash media that's the equivalent of a leaky bucket (especially MLC), organic-dye based optical media that shift over time, and manufacturing processes whose lifetimes are absolutely 100% speculation to begin with.

      The more diversely-redundant your backup media, the more likely it is that you'll have enough of it survive to do high-quality archival restorations later. If you have a pressed disc, a non-LTH BD-R, and a hard drive backup, the pressed disc is the most likely out of all of them -- barring some bizarre manufacturing or design flaw -- to still work with then others have failed.

      Hard drives are convenient and cheap (now), but they are NOT long-term passive storage media. You CAN NOT just throw a hard drive in a drawer for 20 years and expect it to work, assuming you can find something compatible to attach it to and software to interpret it with. Ironically, a modern version of the old SyQuest cartridges that was used only under cleanroom conditions would probably be the most robust of all.

      What kills hard drives is mechanical failure combined with the near-impossibility of affordable repair. If Western Digital & Seagate did their own in-house low-cost repairs using their own parts, employees, and tools, hard drives would actually be VERY repairable, and permanent data loss would be rare barring total destruction. Unfortunately, they don't, and what we HAVE is a hard drive data-recovery industry that's basically built around reverse-engineering, parts-cannibalism, and black magic. Companies like WD could EASILY design drives to be serviceable, so that in a worst-case, you'd send in your drive for a warranty repair, they'd do some quick software tests to confirm hardware failure, remove the platters, mount them in their own recovery unit, image them to your new replacement hard drive, and ship it back to you. But no, hard drives are simultaneously engineered to be as cheap & likely to fail as possible, with no cost-effective means of data recovery when it happens.

      Higher-end drives come with longer warranties, but warranties are worthless for data. The one thing the industry refuses to do is make drives that are expressly DESIGNED to fail in repairable ways. I have rather strong feelings about this matter, having gotten bitten HARD by multiple drive failures in a row that left me one drive away from lifetime data loss (well, ok, I did have older scattered backups, but that one drive was the closest thing I had left to a coherent, semi-ORGANIZED backup copy that wouldn't have left me playing data-archaeologist for 20 years).

      BD-R (non-LTH) is a phase-change medium, which is pretty much the best thing we have now. Even better, the media is decoupled from the read mechanism, so failure of the read mechanism won't render the disc itself orphaned.

  3. The contents, not the container by innocent_white_lamb · · Score: 4, Insightful

    Your interest is in the contents, not the container. Therefore, once you have a known-good copy of the data, you're all set.

    Remember to keep a few of the old tapes/drives/whatever for the museum display, of course.

    --
    If you're a zombie and you know it, bite your friend!
    1. Re:The contents, not the container by Rob+the+Bold · · Score: 2

      Your interest is in the contents, not the container. Therefore, once you have a known-good copy of the data, you're all set.

      Remember to keep a few of the old tapes/drives/whatever for the museum display, of course.

      You might be interested in the "container" if it was itself interesting for some reason. A floppy disk owned by a particular person and labelled in their hand comes to mind as an example. Maybe something was crammed in the disk envelope with the disk. If it's of interest, you'd probably want to keep this piece of ephemera with the original item that contained it if it's safe to both items to do so.

      OTOH . . . I would think that this would be avery unusual case.

      --
      I am not a crackpot.
    2. Re:The contents, not the container by doti · · Score: 1

      once you have a known-good copy of the data AND made a backup of that.

      ALWAYS have at least two separated copies, peferably more.

      --
      factor 966971: 966971
    3. Re:The contents, not the container by Genda · · Score: 1

      Indeed... an original floppy disk with a note penned by the hand of Galileo himself, might have real historical value ;-)

    4. Re:The contents, not the container by rk · · Score: 1

      Nah, I have lots of those. Would you be interested in some? $9.99 each, but I'm running a special: 3 for $20!

    5. Re:The contents, not the container by NoobixCube · · Score: 2

      Responsibility for backups should be handled by the Department of Redundancy Department.

      --
      Admit it. You post strawman arguments as AC so you get modded Insightful for refuting them, rather than Troll
    6. Re:The contents, not the container by skids · · Score: 1

      Other things to consider are whether, on rewriteable media, the media may contain shadows of deleted data that may be of historical interest, and even on write-once media, whether the software you are using to copy it is copying everything it can, if there might be, e.g. stenography in a redbook CD Q channel.

      I expect such concerns would only be relevent to certain special cases.

    7. Re:The contents, not the container by westlake · · Score: 1

      Your interest is in the contents, not the container. Therefore, once you have a known-good copy of the data, you're all set.

      assuming nothing happens to the new container.

      and assuming the new container can store all the data recoverable now or in the future from a primary source.

      the "known good" copy from the geek's point of view may not mean the same thing as having everything you need to authenticate the record in a court of law.

    8. Re:The contents, not the container by Anonymous Coward · · Score: 0

      You may be also interested in *how* the data was stored, not just the data itself. The retro community is scrambling to create exact archives of original 5.25 floppies from the 80's because some of the copy protection techniques are still not understood today. If you only grab the "useful" data, you may throw away other valuable info - valuable to anyone interested in history anyway :)

    9. Re:The contents, not the container by RealGene · · Score: 1

      I have several CD-ROMs and floppies where the software's installation key is either printed on the media label, or on another label affixed to the jewel case.
      I could have a pristine digital copy of the contents, and still not be able to access it...

      --
      Mission: To provide products that consume time and energy as entertainingly as permitted by the laws of thermodynamics.
  4. In my archivist job by Sparticus789 · · Score: 4, Insightful

    I work on a team which does archiving. We have multiple layers of data storage. First, we keep all copies of media in a library. The media is imaged and stored on a SAN. The SAN is backed up to an off-site NAS. And once a year, we copy the data to hard drives and ship the drives to another site across the country. If you have the capability, put the originals in an archival storage area. I have never known a single archivist to get rid of anything, so you must be new to this community.

    As an FYI, there is no such thing as obsolete media, as evidence by this project. And trust me, you can usually find a way to image most old media formats.

    --
    sudo make me a sandwich
    1. Re:In my archivist job by rogue_archivist · · Score: 2

      You make a good point about no medium being truly obsolete. As long as there's enough funding, that is. Also, archivists get rid of things all the time through deaccessioning or weeding, and with physical storage space always being at a premium, it prompted my question.

    2. Re:In my archivist job by Em+Adespoton · · Score: 1

      Just remember that imaging the bits is only half the equation -- you're also going to want to document the file format unless you really want someone to have to reverse engineer those .abx documents from 1985 from scratch every time they want to make sense of the contents.

    3. Re:In my archivist job by Spazmania · · Score: 1

      Floppy disks go bad pretty quickly. Few of yours disks from 1993 still work. Tough to find any working disks from 1983. Unless there's something inherent to the disk itself (the "original" software with the artwork and sleeve) there's not a whole lot of point in keeping it after securing the data.

      And God help you with tapes.

      Hard disks have better longevity. If you can find a working PC-AT with a working MFM controller you can probably still boot that 40 meg drive from 1988. But... why? You can fit thousands of those on a modern thumb drive. Physical storage costs money. You can spend it better places than storing obsolete hard drives.

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    4. Re:In my archivist job by Anonymous Coward · · Score: 0

      Are you from an Eastern Bloc nation? Reasonably stored floppy disks last decades. I am a vintage computer collector and coming across unreadable disks is the exception, not the rule. I have literally hundreds of disks from 1983 or earlier and they all work fine and thousands from the later 80's through early 90's.

      My original question was not tongue in cheek. Soviet-era floppies really only could be counted on to work for months to a couple years. That was not an inherent issue with the floppy disk though. I could understand that someone from that time and place who never experienced Western floppies could have that misconception to this day.

    5. Re:In my archivist job by Anonymous Coward · · Score: 0

      I could understand that someone from that time and place who never experienced Western floppies could have that misconception to this day.

      western floppies.

    6. Re:In my archivist job by Anonymous Coward · · Score: 0

      There is a *true* digital archiving system under development.

      Read the "BACKGROUND ART" section of US Patent 8,085,304.

    7. Re:In my archivist job by Sparticus789 · · Score: 1

      You are just completely wrong. I have boxes and boxes full of software from pre-1993 that is readable and that I am able to collect data off of. Just the other day, I booted an Apple SE and read 20 different 3.5 inch floppy disks.

      --
      sudo make me a sandwich
  5. image, don't discard. by drabbih · · Score: 1

    Old media will become obsolete and degrade ofer time. It is best to copy to modern media. The files should be stored based on their SHA hash code, so that duplicates need not be stored. You can't have too many copies.

    1. Re:image, don't discard. by mlts · · Score: 2

      What would be ideal is a file format that stores data with some error correction, so if a block got corrupted on older media, the corruption wouldn't just be detectable, but possibly correctable.

      It isn't really "archival grade", but I've used the WinRAR utility for this. Archives made in 1999-2000 with error correction are still readable, check-able, and repairable, and can be moved from old CD-R to DVD to Blu-Ray, possibly to whatever the next generation of optical media will be. In fact, multi-volume archives that might have one CD or DVD go bad in a set are recoverable because I usually had one recovery volume for every four others, which might add 20% more disks to a set, but it seemed to be a fair compromise for restoring.

      Analog media like photos? Keep. Who knows if there might be a better scanning technique to find more information from a photograph, similar to how one finds info about paintings.

      Digital media? At least make a hash file that goes with the stored data at the minimum so corruption can be detected as the items pass to different storage media over time.

    2. Re:image, don't discard. by SuricouRaven · · Score: 1

      PAR2 files.

      That's exactly what you need. I used to stick 200MB of par2 data onto every DVD-R I burned - if a file was found damaged years later, that was almost always enough to recover the lost blocks.

    3. Re:image, don't discard. by JesseMcDonald · · Score: 1

      PAR2 files.

      That will help if it's merely a bit of file contents which were lost, but what do you do if the error is in the filesystem metadata? You need to be able to access the filesystem to read the PAR2 files, along with the rest of the disc's content. If you lose an inode or superblock then a PAR2 file would be out of reach, even if you can recover the remaining blocks from the disk image.

      Is there a way to add PAR2 data to a raw disc image while still allowing the disc to be read with standard tools?

      --
      "The state is that great fiction by which everyone tries to live at the expense of everyone else." - Bastiat
    4. Re:image, don't discard. by Anonymous Coward · · Score: 0

      dvdisaster

    5. Re: image, don't discard. by nbritton · · Score: 1

      Do we have a digital archival container format? I.e. a tar archive with built-in error correction to identify missing bits, lets say with a SHA hash, and a method to recompute the lost bits, lets say with an XOR operation? Do we have anything like that?

      If you have a good checksum hash and known file size, but the data is corrupted, you could semi-randomly replace bits in a brute force fashion until you produced anther file with the same hash. The possiblity of reconstructing the bits in an alternate fashion that would produce a hash collision with the original and yield useable data is zero for all intents and purposes. If you segmented the file and computed hashes for each segment, you could do distributed parity XOR / Reed-Solomon calculations to recompute any bad segments.

      I'd like to see a container format like this, given enough CPU cycles you should be able to recover just about anything, much like an infinite number of monkeys can produce the works of Shakespeare.

    6. Re:image, don't discard. by SuricouRaven · · Score: 1

      There's no need to keep the PAR2 files on the same media, but even if they are you can identify them by their magic bytes and recover them even if the filesystem is unuseable, so long as they are not fragmented. And once that's done, you can in turn use the par2 files list of slice hashes to conduct a brute-force search of the raw device for matching data. I wrote a program to do just that, many years ago.

  6. What is 'Obsolete,' Anyway? by CanHasDIY · · Score: 1

    OK, keep in mind that I'm being rather abstract here:

    What makes a thing obsolete? That it isn't a commonly used item anymore, or that its usefulness has become non-existent?

    Take, for example, the carrier pigeon - once considered 'obsolete' due to the invention of telecommunications equipment, I can see the medium coming back into vogue in wake of the new knowledge that governments the world over are monitoring our every word over the aforementioned modern channels. Today, you can't send a message along electronic media without it being intercepted, somewhere, by someone other than the intended recipient; however, you can tie a coded message to a bird's leg and be reasonably confident in the message reaching it's intended recipient without interception and decoding (international and relay flights notwithstanding).

    Thus, that which was obsolete becomes useful again, bringing us back to the initial philosophical quandary: What makes a thing obsolete, anyway?

    --
    An enigma, wrapped in a riddle, shrouded in bacon and cheese
    1. Re:What is 'Obsolete,' Anyway? by bonehead · · Score: 1

      OK, that's all well and good, but what scenario do you propose that would make the 5 1/4" floppy disk a useful tool again?

    2. Re:What is 'Obsolete,' Anyway? by lightBearer · · Score: 1

      In addition to the point about carrier pigeons, even modern technology is re-embracing what others might consider to be obsolete.

      See: IPoAC

      --
      - No Bounce, No Play -
    3. Re:What is 'Obsolete,' Anyway? by CanHasDIY · · Score: 4, Funny

      OK, that's all well and good, but what scenario do you propose that would make the 5 1/4" floppy disk a useful tool again?

      Wobbly tables.

      --
      An enigma, wrapped in a riddle, shrouded in bacon and cheese
    4. Re:What is 'Obsolete,' Anyway? by bonehead · · Score: 1

      That's what parking tickets are for.

    5. Re:What is 'Obsolete,' Anyway? by operagost · · Score: 0

      however, you can tie a coded message to a bird's leg and be reasonably confident in the message reaching it's intended recipient without interception and decoding (international and relay flights notwithstanding).

      And drones... anyone for squab?

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    6. Re:What is 'Obsolete,' Anyway? by CanHasDIY · · Score: 1

      That's what parking tickets are for.

      Hmm.. shuriken, perhaps?

      Some neat ideas here

      --
      An enigma, wrapped in a riddle, shrouded in bacon and cheese
    7. Re:What is 'Obsolete,' Anyway? by CanHasDIY · · Score: 1

      Ah, yes, a classic.

      That one ranks up with the internet-enabled toaster.

      --
      An enigma, wrapped in a riddle, shrouded in bacon and cheese
    8. Re:What is 'Obsolete,' Anyway? by sabinelr · · Score: 1

      Used to be you could find piles of old asphalt roofing lying around and you could make flying saucers with it. Demands much more proficiency than Frisbees. 5-1/4 floppies are equally good for this, with the added menace of the sharp corners. 3-1/2 floppies might work too. MFM drives are good targets on the shooting range, and you have the added benefit of erasing sensitive data.

    9. Re:What is 'Obsolete,' Anyway? by Anonymous Coward · · Score: 0

      Not if you like to be barefoot... or your child does... or your dog...

    10. Re:What is 'Obsolete,' Anyway? by Miamicanes · · Score: 1
  7. Take a picture. by Anonymous Coward · · Score: 0

    Scan the container for labels and nostalgia. Keep a few samples. Hope you have solid backup policies, and you test your backups. Otherwise, well....

    Another major problem is reading the original file format later. Or even that some media (forth floppies) come without an actual file system. Archivists have been working on that too. So instead of (or, in addition to) asking a bunch of nerds, see what your fellow professionals have been able to come up with.

    Also, "media" is plural, thanks.

  8. if you don't care about the content by gl4ss · · Score: 4, Insightful

    ..or can check all of the content to be perfectly read, then yeah, sure, no loss in destroying the originals.

    however.. if you have the space, why destroy? another issue is sw where you in theory might have to prove ownership of a legit copy or the originals might have some other curiosity value. another thing with paper records is that if you destroy the old ones, what was stopping you from introducing new data like a record for your uncles graduation from said university and with you having destroyed the paper records no way to go check them.

    so my question is, is it really that expensive to store them, just for posterity's sake? even then you could just destroy them via sloppy storage rather than intentionally burning energy for destroying them..

    --
    world was created 5 seconds before this post as it is.
    1. Re:if you don't care about the content by TubeSteak · · Score: 2

      so my question is, is it really that expensive to store them, just for posterity's sake? even then you could just destroy them via sloppy storage rather than intentionally burning energy for destroying them..

      There's no practical difference between an item you can't find and an item that's been destroyed.

      So in reality, you don't even have to destroy the stored items, just go ahead and lose the manifest.

      --
      [Fuck Beta]
      o0t!
  9. I don't get it. by Anonymous Coward · · Score: 0

    Then the physical media is separated from archival paper documents and placed into storage

    What does that mean?

    Those DVDs you burn are stored and the paper is ....what?

    If you really MUST archive stuff, then store it in multiple media - paper, DVD, original, etc ...

    For example, if I were archiving Da Vinci's paintings, I'd keep the original, photograph the original in the highest def digital camera I can get, and photograph it in film - preferably slide film because then you don't have to worry about the second layer. (analog sucks for archive, btw.), and have the most talented copy artist ever dupe it.

    1. Re:I don't get it. by walshy007 · · Score: 1

      Why does analog suck for archiving? sure you can't just get a hash of the data and tell at a moments notice whether it is exactly as it was, however you also can't store a hard disk in a vault for 70 years and have a high expectation of it working.

  10. Why not? by jedidiah · · Score: 1

    Unless you are paying Manhattan real estate prices, why not keep the originals? They serve as another backup. They will likely not be too much of a burden. Most "obsolete" media is still perfectly usable and may be so for quite some time.

    There is simply no need to rush into destroying something you already have and can serve as an alternate form of backup.

    Originals always have some value in being the definitive version of something.

    --
    A Pirate and a Puritan look the same on a balance sheet.
  11. For display along side the files... by HockeyPuck · · Score: 1

    It's always interesting to see the files and what they were kept on. Floppy disks, whether 3, 5 or 9in variety. Old tape reels, large disk platters... "This file took up 3 of these..." or..

    An entire windowing system (macos) PLUS MS-Word fit on two floppy disks.

    My phone currently has more storage than the enterprise datacenter that I used to work at in the 80s. And it was a LARGE datacenter...

  12. The Medium Can Hold Secrets by TechyImmigrant · · Score: 5, Interesting

    If you had a 1979 copy of Wizardry on an Apple ][ floppy disk, you could images the contents. But if you wrote them back to a disk and tried to run it, it would fail.

    This is because as a means of copy protection, Wizardry used track arcing. Part of a track was written on a track. Another partial track was written half a head-step away. The timing of the writes was synchronized so the partial tracks didn't overwrite. Anyone doing a naive read and write, or even a not-so-naive scan of the half tracks would fail, because they would get the timing of the writes necessary to prevent collision and to meet the consistency checks in the program.

    Obviously people reverse engineered this and wrote adaptive copy programs that you could direct to do the right thing, but how is an archivist going to know that?

    If you can get this level of deviousness on a primitive floppy disk, I imagine that there is plenty of deviousness to go around on other formats.

    Keep the media.

    --
    I should use this sig to advertise my book ISBN-13 : 978-1501515132.
    1. Re:The Medium Can Hold Secrets by rogue_archivist · · Score: 1

      That kind of creative copy-protection is infamous, but for largely institutional or personal records, most people wouldn't have gone to that level of trouble just to obfuscate data. If they had, then we may miss out on some hidden data, but I don't think that argument alone can justify keeping around any type of storage medium that may have a timing trick, hidden filesystem, or other form of protection. At the same time, folks have developed functional technology to record timing information as well as bitstreams and sector information when creating a disk image, so if there were the type of tricks you describe they could be reverse-engineered later.

    2. Re:The Medium Can Hold Secrets by Anonymous Coward · · Score: 0
      Ahhh. Those were the days. Wizardry had one of the toughest copy protection schemes out there. Sadly, even my 3.5" disks from the late 90's were unreadable about 5 years ago. If I still had my 5" disks I doubt there's be much left.

      If you can get this level of deviousness on a primitive floppy disk, I imagine that there is plenty of deviousness to go around on other formats.

      The primitiveness of Apple's floppy drive was the key to much of the copy protection used. Everything was software controlled. The track servos as you mentioned, but also the bit encoding/decoding was done entirely in software. Later systems with better hardware abstraction are much easier to copy.

      I doubt copy protection or strange encoding plays a roll in this Ask Slashdot.

    3. Re:The Medium Can Hold Secrets by tibit · · Score: 1

      Other formats are usually readable by a device with fairly closed firmware that is designed to spec. Nobody gives you a CD-ROM drive with access to head positioning servo loop and access to the raw bitstream after clock recovery. Yet this is almost what you has on Apple ][ - that's why the floppy controller hardware was so simple (a couple stock TTL chips, maybe a PAL or two). The magic was in the software (firmware). Same goes for a modern CD-ROM drive, but the firmware is not really easily amenable to hacking, and you can't really replace it as a matter of normal operation of the equipment. On Apple ][ it was trivial, since the firmware ran on the main CPU. On a CD-ROM, or really any other media access device, there's a dedicated CPU - these days it's most often a proprietary SoC with documentation available under an NDA only if you commit to a certain purchase volume. Never mind that the amount of code is orders of magnitude beyond what was routine in Apple ][ days.

      --
      A successful API design takes a mixture of software design and pedagogy.
  13. ask the duraspace.org mailing list by Anonymous Coward · · Score: 1

    I implemented that archive system and I got great support, knowledge and experiance from their community : https://wiki.duraspace.org/display/DSPACE/Discussion

  14. digitize your Gutenberg Bible & toss the origi by Anonymous Coward · · Score: 0

    In the same way a Gutenberg Bible has something a modern reprint can never match...

    loading an 8" floppy into a drive and waiting several minutes to access a text file has something a file on a NAS can never match.

    Not all old documents should be preserved in their original format if they are duplicated elsewhere, but a representative sample of each generation should be kept for posterity. Of course, idiots will damage those 8 inch floppies over the years. So, when in doubt, save more than you will ever need.

    Readable disks are far more useful than museum-relics that can be displayed but not used.

    I am constantly amazed at how well (very) old computers work. My Grid Pad 1 (early laptop) boots just fine.

  15. Image-Discard-and back-up the image by Anonymous Coward · · Score: 0

    Image->Discard->and back-up the image

    Don't forget the last step, make back-ups of your digital copies.
    My family pics reside on >6 hard drives including one in a safety deposit box.

    1. Re:Image-Discard-and back-up the image by doti · · Score: 3, Insightful

      image->backup->check image and backup->discard

      --
      factor 966971: 966971
    2. Re:Image-Discard-and back-up the image by wonkey_monkey · · Score: 3, Informative

      image->backup->check image and backup->discard->sign in triplicate->sent in->send back->query->lose->find->subject to public inquiry->lose again->bury in soft peat for three months and recycle as firelighters

      --
      systemd is Roko's Basilisk.
    3. Re:Image-Discard-and back-up the image by Lonewolf666 · · Score: 1

      Seconded regarding "check image and backup". Only after you have successfully tested the restore process, you know that you actually have viable backups.

      Also, think about the nature of your images. Are they easily migrated to another format, if the original hardware is no loger available?
      For instance, I have encountered one or two floppy "imaging" programs that simply store the contents of all sectors on a 3.5" floppy into a 1.44MB binary file. Good for getting hidden information in seemingly unused sectors too, but if you need to access the information 20 years from now, there may be no more floppys to restore to.

      Depending on your archiving goals, it may be better to copy the contents of the media to a directory on a larger medium.

      --
      C - the footgun of programming languages
  16. From one archivist to another.... by Anonymous Coward · · Score: 1

    Get rid of the darn things. Make sure you have the proper emulation and other tools you need, be sure to reformat, but absolutely get rid of the disks. They will fail (magnetic impulses cannot be captured forever) and you will be left with a goodly-sized stock of unreadable media (in fact, IIRC, latest NDSA suggestions are to remove all files from optical media ASAP). Save yourself the trouble and the expense and dump them from the start.

  17. They won't last by nine-times · · Score: 1

    I don't know what your goals and requirements are, but I wouldn't bet on old floppies, CDs, or even hard drives lasting for very long. There's an essential problem with old physical media in that the readers are becoming more scarce. You may have a lot of floppies, but how easy is it to find a floppy drive? It's not always easy to find adapters for old IDE or SCSI formats as newer interfaces have been developed. Personally, I don't expect CD/DVD drives to be around in 10 years.

    But beyond that, there's an even bigger issue: media goes bad. Of course, how quickly it goes bad depends on quite a few things, including how it was manufactured, and how it's stored. Even if you store a bunch of CDs and floppies under good conditions, I'd expect at least 10% to go bad within 6 years. I'm completely pulling that number out of my ass and I have no science to back me up, but my point is, this stuff is not reliable. I think my 10% number is too low, even, but I'm trying to make sure I don't exaggerate.

    1. Re:They won't last by PRMan · · Score: 1

      I have never had a CD "go bad" except for one cheap batch I got where the dye turned cloudy within a year. Every other CD-R in my possession still reads perfectly unless it is obviously damaged.

      --
      Peter predicted that you would "deliberately forget" creation 2000 years ago...
    2. Re:They won't last by jedidiah · · Score: 1

      Then save your appetite for destruction for when the original media is genuinely unreadable. There is no need to hasten that which is only perceived as inevitable.

      Trash it once it actually is trash.

      Don't bother until then.

      I've stored CDs in pretty harsh ways and managed to get far more than 6 years of shelf life out of them. They aren't nearly that fragile. Some are subject to manufacturing defects but that's another sort of problem and it's hardly universal.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    3. Re:They won't last by hedwards · · Score: 2

      I disagree, keeping the original only makes sense when the original is in a stable format and you have plenty of room.

      I've been dealing with this problem on a much smaller scale, and if you aren't extremely careful it can be hard to keep track of which disks you're keeping because you can, and which ones you're keeping because you have to.

      Dump it to disk, verify the contents, back it up and chuck the original media. In the long term, 1 CDROM is going to last better than 400 or so floppies will.

      Now, if you're dealing with paper, those tend to be incredibly durable provided decent paper and ink was used, those you're generally best keeping if you're archiving and have space.

    4. Re:They won't last by Anonymous Coward · · Score: 0

      This attitude is what leads to hoarding. It's creating a whole raft of problems, while solving none.

      the data is what you care about - arguing that "we must keep the disks as well!" is like saying "we must have all the fragments of marble that were chipped off the block of marble that Michelangelo carved David out of!"

      Don't confuse the useless crap with the content.

    5. Re:They won't last by Anonymous Coward · · Score: 0

      Personally, I don't expect CD/DVD drives to be around in 10 years.

      - You can still get new, cheap vinyl record players even though CDs have been around for 30+ years. For less than $100 you can even get one which plays 78rpm records, which (essentially) stopped being produced 45 years ago!
      - Lots of people have huge CD/DVD collections. Many may have ripped the CDs; not many have ripped the DVDs.
      - Lots of people still buy DVDs (and some blu-ray discs, often a mixture)
      - Many older people (and those who want to rip to flac) still buy new CDs; this will still be a significant (although much smaller) market in 10 years.
      - Blu-Ray players can typically play DVDs and CDs; most people's broadband connections are still not capable of downloading Blu-Ray quality films in any reasonable time; at the current rate of progress this will still be the case for a significant proportion of the market (outside of large towns/cities) in 10 years time.

      If you'd said that in 20-40 years it might *start* to get more difficult to get new and affordable optical disc players (capable of Blu-Ray/DVD/CD playing) you might have had a point.

    6. Re:They won't last by nine-times · · Score: 1

      Well I'm not sure what to tell you. You've been lucky...?

      I've seen plenty go bad. From what I've read on the subject, it supposedly depends on the quality of the manufacture, and the chemicals used in the dye. Though CDs can theoretically last something like a few hundred years, much of what's being sold isn't expected to last more than a few years. Again, that's from what I've read. I know for a fact that I've seen many of my own CDs and DVDs go bad without any physical damage.

    7. Re:They won't last by nine-times · · Score: 1

      - Blu-Ray players can typically play DVDs and CDs; most people's broadband connections are still not capable of downloading Blu-Ray quality films in any reasonable time; at the current rate of progress this will still be the case for a significant proportion of the market (outside of large towns/cities) in 10 years time.

      I think this is the only one of your points that I find compelling.

      For clarification, I'm not claiming that all CDs and DVDs will cease to exist within 10 years. I just think the market will by pretty tiny, and most people won't even have equipment to read them. And also, for clarification, I'm talking less about the media as used by home audio/video equipment, and more talking about having drives in computers being used to store data. Already you're seeing more and more computers being built without optical drives. Everything is being distributed by Internet. In 10 years, I think having an optical data drive in your computer will be comparable to having a floppy drive or a parallel port.

      If I'm wrong, I suspect that it's because you're right, and broadband will still suck in 10 years because the US can't be bothered to build infrastructure. Still, I don't expect that we'll be using optical media so much as flash media, or something newer than that.

      As to this point:

      - You can still get new, cheap vinyl record players even though CDs have been around for 30+ years. For less than $100 you can even get one which plays 78rpm records, which (essentially) stopped being produced 45 years ago!

      I think that records may well outlive CDs. People who like records and care about the physical medium will continue to buy records, but people aren't attached to CD as a physical medium in the same way. It's already digital, and as long as you're getting the same bits, nobody is going to care if it's on a flash drive or a CD or downloaded from the Internet.

    8. Re:They won't last by nine-times · · Score: 1

      Then save your appetite for destruction for when the original media is genuinely unreadable. There is no need to hasten that which is only perceived as inevitable.

      I can think of some reasons to "hasten that which is... inevitable." First, there's the issue of clutter. There's no need to keep 1000 floppies when the data can all be stored on a single hard drive. If there's sensitive/confidential information on the floppies, having lots of disks is harder to keep track of and therefore potentially less secure. It's technically possible for accessing data on unreliable media to lead to some form of silent data corruption. Perhaps as important as anything else, having the old media may lead to a false sense of security, assuming that you have a good copy when maybe you don't.

      They aren't nearly that fragile. Some are subject to manufacturing defects but that's another sort of problem and it's hardly universal.

      Well no, that's not another sort of problem. There are many that are subject to manufacturing defects, in that many manufacturers have used cheap dyes that break down within a few years. Therefore, depending on when you bought the CDR and who manufactured it, it may have a very high failure rate. Ultimately that's the same problem as with floppies, and even old hard drives. Though optical media has the strength of having no moving parts that could break, all media is subject to the danger of being part of a "bad batch".

  18. If you are 100% you have everything off of it. by WarlockD · · Score: 1

    I was recently did the same thing. I had about 2 old OnStream 30 GiG tapes and a hand full of old QIC-80's. Not even mention the CD-R pile in my room.

    During the years I never had the space to just extract everything and sort though it all. Not to mention I would move backup data from tape, to CD, back to tape so I have copy's of the same things all over the spectrum. I have recently started consolidating it all, finding an old OnStream tape drive and old QIC floppy drives to restore everything to a single drive, get rid of all the duplicates and save the important stuff on archived DVD media and/or "the cloud" It was a nightmare but now I don't have to worry about trying to get hold of a bankrupt tape drive company's hardware in another 10 years.

    I will then delete the tapes and burn them.

    If it was hand labeled by a professor he liked or someone famous like Bill Gates I could see that. But there is no other reason to keep it around once it's contents have been properly indexed and stored. The only exception is when you need the obsolete media to be used in another obsolete computer. AKA making a disk for a cC64.

    Let me put it this way. What do you think will happen in 10 years, when someone else finds that box of media. Even if he was told that it was all indexed and stored, he might question it and do it all over again "just to be sure" :P

    1. Re:If you are 100% you have everything off of it. by bluefoxlucid · · Score: 1

      He thinks in 10 years he'll have a Hoarder's Crisis.

    2. Re:If you are 100% you have everything off of it. by hedwards · · Score: 1

      I'm in a similar situation, just not to that extent. One of the tricks is to categorize things and decide roughly where things belong before you start. Then move things to the correct place, verify the copy and back up. After that, I generally destroy the original, especially if it's in a weird format. (And yes, I consider 3.5" floppies to be a weird format, and really anything other than CDROM or DVDROM at this point)

  19. Re:digitize your Gutenberg Bible & toss the or by Anonymous Coward · · Score: 0

    "loading an 8" floppy into a drive and waiting several minutes to access a text file has something a file on a NAS can never match."
    "when in doubt, save more than you will ever need."
    "Readable disks are far more useful than museum-relics that can be displayed but not used."

    Fuck Off

    I've dealt with imaging thousands of floppies and they are all shit and deserve to be in the landfill. The same goes for mag tape.
    The sooner the shelves are cleared of these, the better.
    The DATA is what you care about, not the carrier.

  20. The Medium Can Have As Much Value... by Anonymous Coward · · Score: 0

    ...as the contents.

    Depending on what the contents are, and your reasons for keeping it, the medium may be just as valuable. Or, said another way, the contents may lose their value when divorced from the medium.

    For example, I'm thinking specifically of old copies of MacOS. The primary reason for keeping old versions of MacOS around would be to boot old Macs. If you discard the medium, you'll never boot that hardware again.

    And before you jump and say "I'll just write another copy", it's nowhere near that simple. Original Mac drives spun the discs at variable speeds, while PC drives spun at a fixed rate. You cannot write a Mac floppy disk with a "modern" commodity floppy drive. If you can get your hands on an image (the contents), it will still require multiple generations of Mac hardware and software to backtrack far enough to write a usable physical copy.

    Of course, a fair rebuttal here is "Why would you want to boot an original Mac?", and for that I have no other answer besides "Nostalgia".

    1. Re:The Medium Can Have As Much Value... by hedwards · · Score: 1

      That's good in theory, but it's unlikely that your version of MacOS from the '80s is still going to work. Floppies are terrible in terms of reliability over long periods of time.

    2. Re: The Medium Can Have As Much Value... by hunterkll · · Score: 1

      i wouldn't be so sure about that - my floppy copies of MacOS 6 and 7 worked well on my classics and such. even my powerbook 140, and this was only 5-6 years ago. worked perfectly, original factory disk sets.

      http://www.hardwaresecrets.com/printpage/How-to-Generate-Floppy-Disks-for-Old-Macintosh-Computers/1713

      anyway, for the curious, a rundown on writing old mac floppies.

      (note: if you do them as 1.44MB floppies, they read/write at the same speed, so no special hardware needed as long as you have macs that can read 1.44s. easy way to avoid the hassle if you have the option.)

  21. image, verify replicate job done by maliqua · · Score: 1

    see title

  22. MOD THAT UP! Re:The Medium Can Hold Secrets by Ungrounded+Lightning · · Score: 2

    I have no mod points at the moment. But that's a VERY important point: A straight copy may not be good enough, due to outside-the-standards copy protection schemes.

    Other floppy-based commercial games used a number of other techniques.

    (One, for instance, had track 3 deliberately corrupted, by scratching the medium with a pin. No error on reading it - or writing and re-reading it - and the game would load, erase the disk, and play. This let the person who made the copy think he had a good copy - when in fact he had a blank disk. Let's see you make a good archival copy of THAT. B-b )

    You get the same thing on other media as well - even analog. (Example: Macrovision, which plays with the sync and saturation levels, so that analog TVs intended for over-the-air reception (usually) correct the distortion as if it were a fading signal, while videotape machines copy the "fading" picture and regenerate a non-fading sync, so the copy isn't corrected when viewed.)

    One of the several copy protection schemes for DVDs includes hidden modulation in sync information, decoded by the drive's hardware and detected by its firmware, so you can make a perfect copy of the bits and it still won't play.

    Wikipedia has a long list of such copy-protection schemes, any of which would make archival copying difficult to impossible (without special equipment that would expose you to arrest and federal prosecution if you possessed it).

    --
    Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way
  23. Re:digitize your Gutenberg Bible & toss the or by Anonymous Coward · · Score: 0

    loading an 8" floppy into a drive and waiting several minutes to access a text file has something a file on a NAS can never match.

    You obviously don't know how shitty my NAS is... :-(

  24. Don't image, copy raw files, keep old media by drnb · · Score: 2

    Re:Is there any benefit?

    No. Image and discard.

    Untrue. Backups get lost, go bad, or otherwise screwed up. At a previous employer old directories no longer accessed got backed up to tape to free up server space. We used a lot of storage at this company. On multiple occasions some years old files were needed. About half the time we would be told that these files were no longer recoverable by IT. After one of these backup failures I recalled that I had made a backup DVD of one old project we were trying to get a copy of. I went to our archivist and she found the DVD, it was readable.

    Save the media. Buy a USB based floppy drive. It makes sense to copy the files on these legacy format to a server for future use but keeping the originals around as a backup is a good idea.

    Do not image the media. These image formats may fall out of favor and not be recognizable in the future. Look at the various problems NASA has had with some of its old tapes using formats no longer supported. Create a folder on a server for a particular piece of media and just copy and verify the files from the legacy media to the folder on the server.

  25. the app store ideas and apples lack of ports is ba by Joe_Dragon · · Score: 1

    the app store ideas and apples lack of ports is bad for archiving.

    We may get to the point where the app store sand boxing makes it so that an archiving app can't put the files in an place where other apps can see it. and we may have hard time reading an outside data source as well.

  26. Keep software and hardware to read it by jfdavis668 · · Score: 1

    Though it is not my primary business, I offer my services when people have difficulties accessing archives. I am often surprised when people come to me to rescue data off floppies, both 3 1/2 and 5 1/4. These are mostly legal documents or contracts. Why people keep floppies but not drives to read them is beyond me. I have a 3 1/2 USB drive I keep for routine work. I saved a Pentium 90 with a 5 1/4 inch at home I use for those rare occasions. The other problem is reading the data. I have Office 97 on the Pentium to convert and read old formats. I have another machine with Windows and Office 2000 to bring documents to somewhat modern formats. I helped one company update their union contract. They had it printed as a small book, and had been giving it out for 20 years. When they finally ran out of copies, they wanted to incorporate the changes over the years and reprint it. They handed me a pile of floppies. Each chapter was a separate document, which I finally figured out were in Wordstar for DOS format. Luckily, there is an Office 97 converter for that. The lesson is, without software to read it, your archives are useless to keep. Save old versions of your software, and if necessary, hardware to run it.

    1. Re:Keep software and hardware to read it by tibit · · Score: 1

      Wordstar is pretty much a text file. All you need to do is mask out the upper bit. You'll have some control characters left, but those are easy to remove. That's what I remember from 2.5 decades ago...

      --
      A successful API design takes a mixture of software design and pedagogy.
  27. The Basics by westlake · · Score: 4, Informative

    I'm an archivist at a mid-sized university archives, trying to develop a policy for archiving computer files ('born-digital records' in archival parlance).

    Get Your Bits Off (Old Storage Media)

    Demystifying Born Digital Reports

    Working Draft of the Levels of Digital Preservation Chart

  28. Keep a sample by Peter+(Professor)+Fo · · Score: 2

    The actuality of bit-rot in media is uncertain. Many documents 500 years old are readable-ish if you have the skills and accept that some parts may have decayed. That tells us a lot about te exact media people used way back then.

    The trouble with digital records is this:-
    Searchability is a requirement (even though we don't expect that with written records). The reason is that there is so much of it when compared with the sparse records of times past. So you need a 'good' copy for data analysis and some original media to inform historians of the future how we looked upon the information, or what 'ordinary people' or 'ordinary businesses' had at their disposal.

  29. DVDs are inherently flaky by gatortom · · Score: 1

    DVDs, even commercially stamped, can suffer from bit rot. Optical disk technology is inherently flaky. Use multiple HD backups and make sure you have offsite storage.

  30. Comment removed by account_deleted · · Score: 2

    Comment removed based on user account deletion

  31. Archiving Digital Data by Anonymous Coward · · Score: 0

    The real question is how long do you need this data to be archived.
    If it more than a few year, print is out, and make a mico-fiche of it.
    From where I stand, digital data only has a life of a few years.

  32. Keep them or pay the price. by Anonymous Coward · · Score: 0

    First, you may have legal issues. You may not be allowed to copy the data but you may also someday prove that you did buy the 'data' and show proof of
    the original. This is a big fear for anyone that has had the Internet Nazis, err The Media Industry come after them.

  33. Print out the best prints and store the prints by jools33 · · Score: 1

    As a photographer myself I would recommend to print out the best choice prints and store them physically, as photographic prints still has the best record for preservation when compared to any / all types of digital media. By all means take running copies of all your data, on and offsite backup. A physical copy of the best prints though is likely to be preserved longer.

  34. Re:Don't forget the physical data by magic+maverick+ · · Score: 1

    This. I logged in to say this.

    An archivist should keep the original as much as possible. Otherwise, what's the point? Would it be acceptable to photocopy a letter and than discard the original because it's too old? No. You photocopy (or actually non-destructively scan these days), and then you keep the original in a climate controlled environment. People can work off the copy, but sooner or later someone will want to look at the original.

    At a minimum you take high-resolution photos of the media and record those photos (and the type of media, etc.) along with the copy of that you are putting into your fancy database.

    What's the point of archiving anything? It's too keep it for posterity.

    Geeze, what do they teach archivists these days...

    Disclaimer: I'm not an archivist, but I have done a lot of study and work in the digital sector (including archiving).

    --
    HELP MY ACCOUNT HAS BEEN HACKED BY AN ILLIBERAL ART STUDENT SET TO DESTROY THE INTERWEBZ!
  35. Re-think the question by danknight48 · · Score: 0

    Should i keep my old vinyls as the CD has taken over?
    It really is down to you, only you know what you really want to achieve.

    - If your only interested in the content, just copy the files.
    - If you only interested in the source container, keep it.

  36. Re:MOD THAT UP! Re:The Medium Can Hold Secrets by tlhIngan · · Score: 1

    You get the same thing on other media as well - even analog. (Example: Macrovision, which plays with the sync and saturation levels, so that analog TVs intended for over-the-air reception (usually) correct the distortion as if it were a fading signal, while videotape machines copy the "fading" picture and regenerate a non-fading sync, so the copy isn't corrected when viewed.)

    Actually, Macrovision played havoc with the sync pulses - it would produce a deliberately too weak one (but enough that most VCRs could lock on) and then produce a strong one, etc..

    What happens is the TV is looking for a sync pulse and sees it, even though it's non-standard (it's a case of basically seeing what you expect - the TV is looking for a pulse, it sees the beginnings and locks on).

    A VCR though wants to ensure that what gets recorded on tape (which has a poor dynamic range) is as strong as possible, so the weak pulse causes the VCR's AGC circuits to kick up the gain for the frame. The strong pulse causes the AGC to kick it down. The fact that the video signal is otherwise normal means the AGC has now made it clip in the first case, and suppressed it in the second.

    And in fact, it was technology (and pressure from Macrovision) that really resulted in it working - older VCRs had slower AGC circuits and could play it just fine as the AGC never messed with the signal before the pulses reset themselves. But as technology and Macrovision pressure increased, the AGC circuits got better and produced this artifacting.

    It's why the simplest cure was a signal regenerator - it normalized the pulses back to standard.

    Other formats are usually readable by a device with fairly closed firmware that is designed to spec. Nobody gives you a CD-ROM drive with access to head positioning servo loop and access to the raw bitstream after clock recovery. Yet this is almost what you has on Apple ][ - that's why the floppy controller hardware was so simple (a couple stock TTL chips, maybe a PAL or two). The magic was in the software (firmware). Same goes for a modern CD-ROM drive, but the firmware is not really easily amenable to hacking, and you can't really replace it as a matter of normal operation of the equipment. On Apple ][ it was trivial, since the firmware ran on the main CPU. On a CD-ROM, or really any other media access device, there's a dedicated CPU - these days it's most often a proprietary SoC with documentation available under an NDA only if you commit to a certain purchase volume. Never mind that the amount of code is orders of magnitude beyond what was routine in Apple ][ days.

    You can still do enough to mess with the disc - a modern CD-ROM or DVD-ROM supports standard cd-ripping, and on data CDs, the first track can be reset back into playing as audio for ripping. Or you could embed a fake audio track on a CD that actually has data on it. There's enough flexibility in the CD-ROM spec because it was designed for audio first - data was a hack.

    DVDs are a bit trickier, but I'm sure all the copyprotection tricks movies use can be employed for regular data as well. Blu-Ray likewise, though it has some tricks of its own embedded in the firmware.

  37. 3-2-1 by pprboy · · Score: 1

    3 is 2
    2 is 1
    1 is none.
    best way to make sure you have it
    3 copies
    2 different media, at least (hd, ssd, thumb, cd, dvd, tape, cloud)
    1 is offsite.

  38. Never discard the original data by Anonymous Coward · · Score: 0

    How many times much this be drilled into your heads?

  39. What really is a digital born record by Anonymous Coward · · Score: 0

    Born Digital Records are born, live, and die in RAM. Anything else is a copy. But we must distinguish among the various types of copies. Most programs will allow the "saving" of data in RAM when it ends. This is the Working Copy (w-copy) of the Born Digital Record. It is common practice to create, maybe several, Backup Copies (b-copy) of the w-copy. It is not guaranteed that any b-copy is the same as the w-copy, however, we rely on a b-copy to recreate a w-copy when needed.

    After I have finished the Great American Novel, my w-copy gets published. It now need to be archived (a-copy). This is a read only copy and is highly protected. In fact, I may have several a-copies in several locations. It is never changed! It must always be available! Hence, another copy, a Library copy (l-copy), where it can be read, analyzed, written about. If this is not desirable, why save it at all?

    This means that all copies are, in a sense, live. They all are easily available, usually online. There is no concern about the creating programs being abandoned, since the software to process all copies is current level. This may mean reprocessing a format X a-copy to a format Y a-copy in the future.

    About the game with the hardware error. That is not a digital born record. You need a specific computer to play the game and they both must be archived.

    As for the my Great American Novel, why I print it out on acid free paper or maybe papyrus or etch it in copper.