Slashdot Mirror


Copyrights and CD-Rs Endanger Audio History

SEWilco writes "A study by the Library of Congress has found that many audio recordings are being lost due to copyright restrictions and temporary media. Old audio recordings are protected by a various US state copyrights, so it's hard for preservationists to get and copy material. Recent data is threatened by being put on writable CDs, because CD-Rs begin to lose data after a few years, so recordings from as recently as 9/11 and the 2008 elections are already at risk."

49 of 202 comments (clear)

  1. Vanishing People by denshao2 · · Score: 5, Insightful

    We will be a mystery to archaeologists of the future.

    1. Re:Vanishing People by Doctor_Jest · · Score: 3, Informative

      It reminds me of the Dr. Who episode (the David Tenant series) where the doctor is aboard a space cruise liner called "The Titanic".... their analysis of humanity was suspect, having cannibalistic rituals after going to war with Turkey or something like that.

      And Kylie Minogue looks fabulous for a 40 year old.... :)

      --
      It's the Stay-Puft Marshmallow Man.
    2. Re:Vanishing People by Inf0phreak · · Score: 3, Informative

      Well... to be fair the guide had faked his degree in earthonomics.

      --
      ________
      Entranced by anime since late summer 2001 and loving it ^_^
    3. Re:Vanishing People by icannotthinkofaname · · Score: 2, Insightful

      I just re-watched "Voyage of the Damned" several hours ago. :)

      However, I disagree that that's anything like this problem. In that episode, Mr. Copper wasn't from Earth to begin with. A closer match to this story might be the misinformation possessed by Lady Cassandra O'Brien.17 in the episode "The End of the World" (2005, ep 2) (although in Cassandra's defense, she's separated from the 20th century by about five billion Earth years).

      --
      Let q be a radix > 1. I am in ur base-q, killing 10 d00ds.
  2. TheLibrarianBay.org by Anonymous Coward · · Score: 5, Insightful

    Rip those CDs, create a torrent, and share that torrent on thelibarianbay.org. Problem solved!

    1. Re:TheLibrarianBay.org by smi.james.th · · Score: 2, Informative

      "librarian" is in this case, an euphemism, I think.

      If you can't figure out what for, what are you doing on slashdot?

      --
      One thing I know, and that is that I am ignorant...
  3. Depends on the Discs by Oceanplexian · · Score: 5, Informative

    I have some optical media that's from ~2001. Most of it's just fine, even after a tortured life. I trust high quality optical media more than anything else.

    CDs are rarely an all-or-nothing affair. Even if you do lose data, you tend to not lose it all in one freak accident, not to mention solid state and magnetic media make fantastic paperweights after a solar storm.

    1. Re:Depends on the Discs by iluvcapra · · Score: 4, Informative
      Since CDs have Reed-Solomon and parity for error correction, and even if samples fail the player will interpolate, you can have a pretty ruined disk before it won't play anymore. It is all or nothing once it starts to fail though-- at the point the interpolation can no longer repair a dirty section, the CD will simply drop out.

      I also recently (yesterday actually!) opened an old DVD+R (with an HFS volume) from 2002 and rearchived it to a new DVD. It still read perfectly, but it's been stored in a cool dark place, and has been mounted maybe 10 times.

      --
      Don't blame me, I voted for Baltar.
    2. Re:Depends on the Discs by aaa_zzz_ccc · · Score: 2, Insightful

      Since CDs have Reed-Solomon and parity for error correction, and even if samples fail the player will interpolate, you can have a pretty ruined disk before it won't play anymore. It is all or nothing once it starts to fail though-- at the point the interpolation can no longer repair a dirty section, the CD will simply drop out.

      I also recently (yesterday actually!) opened an old DVD+R (with an HFS volume) from 2002 and rearchived it to a new DVD. It still read perfectly, but it's been stored in a cool dark place, and has been mounted maybe 10 times.

      I agree

    3. Re:Depends on the Discs by fahrbot-bot · · Score: 4, Funny

      ...but it's been stored in a cool dark place, and has been mounted maybe 10 times.

      Sigh. I know how that feels.

      --
      It must have been something you assimilated. . . .
    4. Re:Depends on the Discs by SupremoMan · · Score: 3, Funny

      I think exposure to sunlight has a detrimental effect on them. So us basement dwellers are safe! I mean... you basement dwellers

    5. Re:Depends on the Discs by X0563511 · · Score: 3, Informative

      Cool and dark are the really important parts. On write-once optical media, UV will fade the dye making it harder to distinguish pit/land transitions, while high temperature will melt the unmelted dye, making the pits/lands closer together (thus also making the transitions more difficult to discern).

      Using RW media will alleviate some of this problem, as this uses a phase-change mechanism instead which is more "digital" than the dye used in write-once.

      --
      For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...
  4. Conspiracy theorists were right! by uzyn · · Score: 2, Funny

    The government must be using this as an excuse to destroy evidences on 9/11.

  5. Did they forget what year it was? by LBt1st · · Score: 3, Insightful

    Don't worry, there'll be a torrent ;)

  6. Library of Congress by sycodon · · Score: 2, Funny

    Does not the Library of Congress make it a habit to acquire as much of this kind of material as possible? Isn't seen as a mark of success to have your recordings in the LOC?

    --
    When Fascism comes to America, it will call itself Anti-Fascism, and tell you to give up your guns.
    1. Re:Library of Congress by Artifakt · · Score: 4, Interesting

      The Library of Congress used to have a goal of including complete hard copies, at least for items of US origin and 'good grade' (that is, they aimed to have copies of things such as hardback books that were intended to last, more than, say, ephemera such as the pulp magazines). However, that goal has become an obvious impossibility due to sheer volume. After about 1960, the library began being more selective.
                  That's bad enough in some senses, but unfortunately, there's also a secondary effect. Pick a subject you know well, and go to the library, and examine the LOC page at the front of the book for a few dozen volumes of varying ages. That information will tell you if the book has been archived in the LOC, but it will also include other details, such as what topics it is indexed under. For example, a biography of Supreme Court Justice Thurgood Marshall might be indexed more specifically under 'Biographies of Prominent Americans' and not just 'Biography', and it might also be indexed under "Non-fiction', 'Legal Commentary', and "20th Century History". Many of these index terms were developed as a standard system, but that system seems to have more and more glitches with time. In general, you'll see more and more errors, both of accuracy and by simple omission, for the newer books. I don't know if there's any real explanation of why the indexing seems to become worse after the LOC gave up trying to have physical copies of all significant works, but many people think they have noticed a certain 'sloppyness'.
              For works such as audio or video recordings, it could be very hard to get any useful information if the same pattern holds. Imagine for example, researching video and 30% of all the westerns aren't indexed as westerns, while some documentary footage about life in the old west has been miss-classified as 'fiction' and 'western'. Then add there was also once a rule that anything shorter than 8 commercial reels was considered a short, but somebody forgot that rule about 1976 and started thinking it was anything under 30 minutes running time. Whatever the subject, problems such as these are likely to crop up.

      --
      Who is John Cabal?
    2. Re:Library of Congress by ChrisMaple · · Score: 2, Insightful

      There are some difficulties in classifying that have become more prominent in recent decades. Authors put lies or obvious fictions in what are nominally non-fiction books, such as Bill Bryson's travelog "A Walk In the Woods". In the other direction, large sections of Frederick Forsyth's books, nominally fiction, are detailed and insightful descriptions of current events. What's a classifier to do?

      --
      Contribute to civilization: ari.aynrand.org/donate
    3. Re:Library of Congress by TheRaven64 · · Score: 3, Interesting

      Many of these index terms were developed as a standard system, but that system seems to have more and more glitches with time

      It's called ontology drift. It was a big problem for the cyc project. They started entering all human knowledge, and after 20 years found that they were entering the same stuff again because the index terms had changed over time. A large amount of semantic web and AI research is devoted to combatting this problem.

      --
      I am TheRaven on Soylent News
  7. Old news.... by Bomarc · · Score: 2, Insightful

    The slashdot article "How To Verify CD-R Data Retention Over Time?" covered this a LONG time ago...

  8. holy shit REALLY? by Junior+J.+Junior+III · · Score: 5, Funny

    so recordings from as recently as 9/11

    Jesus Christ, that was just last month!

    --
    You see? You see? Your stupid minds! Stupid! Stupid!
    1. Re:holy shit REALLY? by Mogusha · · Score: 3, Interesting

      Actually, it's 11 months in the future, September, 2011.

    2. Re:holy shit REALLY? by citoxE · · Score: 2, Funny

      If you run on a certain date format, your future endeavors are put as risk as well!

    3. Re:holy shit REALLY? by Yacoob+Al-Atawi · · Score: 2, Insightful

      Over here it is next month!

    4. Re:holy shit REALLY? by noidentity · · Score: 2, Funny

      What do you mean? It's next month, the 9th of November!

  9. Not quite right by cappp · · Score: 4, Insightful
    Copyright doesn't have that effect at all. Infact the Digital Millenium Copyright Act specifically creates the option for libraries and archives to create copies for preservation. Check out the actual law which includes

    it is not an infringement of copyright for a library or archives, or any of its employees acting within the scope of their employment, to reproduce no more than one copy or phonorecord of a work, except as provided in subsections (b) and (c), or to distribute such copy or phonorecord, under the conditions specified by this section, if—

    (1) the reproduction or distribution is made without any purpose of direct or indirect commercial advantage;

    (2) the collections of the library or archives are
    (i) open to the public, or
    (ii) available not only to researchers affiliated with the library or archives or with the institution of which it is a part, but also to other persons doing research in a specialized field; and

    (3) the reproduction or distribution of the work includes a notice of copyright that appears on the copy or phonorecord that is reproduced under the provisions of this section, or includes a legend stating that the work may be protected by copyright if no such notice can be found on the copy or phonorecord that is reproduced under the provisions of this section.
    (b) The rights of reproduction and distribution under this section apply to three copies or phonorecords of an unpublished work duplicated solely for purposes of preservation and security or for deposit for research use in another library or archives of the type described by clause (2) of subsection (a), if—

    (1) the copy or phonorecord reproduced is currently in the collections of the library or archives; and
    (2) any such copy or phonorecord that is reproduced in digital format is not otherwise distributed in that format and is not made available to the public in that format outside the premises of the library or archives.

    If you're referencing personal preservation rights then you should read this article from the Standford Libraries on copyright and fairuse.

    1. Re:Not quite right by Anaerin · · Score: 5, Insightful

      Okay, great. So where is the (completely legal under US law) software that the Library of Congress can use to back up Blu-Rays that have been released recently? Or, indeed, legal (Under US law) software that they can use to back up DVDs? Nowhere, because such software is in direct conflict with the DMCA, and thus is illegal.

    2. Re:Not quite right by hedwards · · Score: 2, Interesting

      The Library of Congress decides what is an is not allowed circumvention under the DMCA every several years. And if they're finding that the works are disappearing due to DRM the likelihood of backups being legally recognized by the Library of Congress increases drastically.

      Which is ultimately good news. It's not going to help with Blu-Rays in the short term, but it would make it legal for companies to sell backup software as they'd no longer have to violate the DMCA to do it. And considering that the software needed to intercept the signal going to the video card is going to be protected under the 1st amendment protections, it looks a lot better.

    3. Re:Not quite right by JesseMcDonald · · Score: 2, Informative

      The software itself may be legal in such cases, but it remains illegal to discuss how to create such software, or to distribute such software to others, which means it's still illegal unless you can break the encryption and write the software all by yourself. How many people do you think are capable of that?

      --
      "The state is that great fiction by which everyone tries to live at the expense of everyone else." - Bastiat
  10. Current archive / backup systems are silly by porkThreeWays · · Score: 2, Insightful

    For longevity, current backup systems are just silly. They are simply just not abstracted enough. For REAL archival what's needed is an active system like the Internet but one that guarantees n redundancy. Perhaps a p2p like system with nodes backing up files. This abstracts away whether they are going on SATA, IDE, SCSI, Tape, whatevs. The local machine handles all the hardware details. When newer, better, cheaper technology comes along, the old data is automatically able to propagate onto the new storage mechanisms. I see this all the time working in the IT industry. I have backups from 10 years ago I can not read because we no longer have a working tape drive to read it. We need to separate ourselves from the hardware.

    --
    If an officer ever threatens to taze you, say you have a pacemaker.
    1. Re:Current archive / backup systems are silly by westlake · · Score: 3, Insightful

      For REAL archival what's needed is an active system like the Internet but one that guarantees n redundancy. Perhaps a p2p like system with nodes backing up files. This abstracts away whether they are going on SATA, IDE, SCSI, Tape, whatevs. The local machine handles all the hardware details. When newer, better, cheaper technology comes along, the old data is automatically able to propagate onto the new storage mechanisms. I see this all the time working in the IT industry. I have backups from 10 years ago I can not read because we no longer have a working tape drive to read it.

      You haven't lifted a finger to track down, replace and restore the tape drive you need.

      Why then should we be trusting our data to an (allegedly) fully automated - autonomous - system which is equally likely to be neglected and ignored?

  11. Short term CD-R by JWSmythe · · Score: 3, Insightful

        This is kind of funny.

        I warned people about depending on floppy disks for long term storage. After a few years, the media degrades and the data is impossible to retrieve. They didn't listen until they went back to floppies from years ago that no longer work.

        I warned people that home recordable CD's and DVD's had a shelf life of less than 10 years after they were burnt. I've seen CD's burnt, verified, and then put away in a good climate controlled environment, where a few years later they couldn't be read. For those who have listened to me, I've told them, make at least two copies, in different places, (like their hard drive and a CD), and burn new disks once a year. It sucks to have years of research on something, just to find the old information is lost.

        This isn't exactly news, but every so often someone finds out, writes a story, and it makes the news again.
       

    --
    Serious? Seriousness is well above my pay grade.
    1. Re:Short term CD-R by JWSmythe · · Score: 2, Interesting

          Oh, I know exactly what you mean.

          I'm trying to gather my old digital photos into one place. I've migrated servers several times, and had a couple disaster recoveries along the way. I found some pictures from the World Trade Center 09/02/2001 from about 7am to 11am. Some other pictures that were left in other places, like various workstations and company servers, were lost forever.

          I remember working in a computer store years ago, a customer brought in their PC with a RLL drive. He wanted his data. We didn't have a controller to attach it to, and his was already fried. If you were to bring an old PC into a store now with an RLL drive, you'd just get a blank stare from the tech, followed by a "what is that thing?". As time goes on, things that didn't follow the migration become harder and harder to use. I went through some hell a while back trying to convert some old letters, stored in some ancient format, to something that they could use today. They were important, so I took the time to do it. That was they were legitimately important, not the normal customer "Oh my god, everything on there is essential, I'll die without it", just to find out that they're pictures of their cat from a few weeks ago. :)

      --
      Serious? Seriousness is well above my pay grade.
  12. 300 years... by Freddybear · · Score: 2, Informative

    Memorex claims 300 year life for their fancy (expensive) archival CD-R and 100 years for DVD-R.

    http://www.cdrinfo.com/sections/reviews/specific.aspx?articleid=17324

    Take that with a grain of salt, of course.

    1. Re:300 years... by Nkwe · · Score: 4, Funny

      Memorex claims 300 year life for their fancy (expensive) archival CD-R and 100 years for DVD-R.

      Take that with a grain of salt, of course.

      I would recommend keeping salt and your archival CDs separate.

    2. Re:300 years... by maxwell+demon · · Score: 2, Insightful

      But will Memorex still be there in 300 years, to sue them if their claim proved false?

      --
      The Tao of math: The numbers you can count are not the real numbers.
    3. Re:300 years... by GrumblyStuff · · Score: 2, Insightful

      Salt would be good to keep moisture down.

  13. The essential forgetting by symbolset · · Score: 5, Insightful

    It is essential to the people who will sell us our culture in the future that we forget all that has gone before. If we remembered our heritage it would be necesary to innovate new things. If we can't, then recycled things will suffice - which cuts down the production cost.

    The goal therefore of the media giants is to make us nye culturne. A people devoid of culture. They're having great success at this.

    An opposing project would be Musopen.

    --
    Help stamp out iliturcy.
    1. Re:The essential forgetting by Idbar · · Score: 2, Funny

      It is also important for humanity not to repeat their same mistakes. Justin Beaver recordings NEED to be protected/archived!

  14. OOh. You've got media that lived nine years by symbolset · · Score: 5, Insightful

    That's like, forever, man.

    Kid, the Library of Congress was founded in 1800 - longer ago than your grandfather's grandfather's dad could remember. 210 years ago. Most of the stuff they had then, they still have now. They're not worried about preserving the top40 from your middle school days until you're disrespecting it in college. They want to be the repository for our culture forever. They're sort of like preemptive anthropologists and archaeologists. They know that you don't care but they're expecting that someone, someday will because cultural sensitivity is a cyclical thing.

    It's customary that new generations forget what has gone before and then rediscover it as if it were a new thing. This forgetting is not required. If we can quit forgetting then artists can stand on the shoulders of giants once again and build things of great and complex beauty like they once did.

    Given the current state of copyright though, you can't whistle any four notes in a row in public without getting sued. Anything like a symphony is right out.

    --
    Help stamp out iliturcy.
  15. Hardly by m50d · · Score: 2, Interesting

    Previous generations weren't even trying to preserve anything. Plenty of stuff will make it to the future; it only needs one copy of a CD or whatever to survive

    --
    I am trolling
    1. Re:Hardly by jedidiah · · Score: 2, Interesting

      ...actually the Rosetta stone itself is an example of the problem of decoding old information. It was only due to a stroke of luck that Champollion realized that he had a means to decode it.

      --
      A Pirate and a Puritan look the same on a balance sheet.
  16. Re: even 20 year old hard drives by qubezz · · Score: 2, Interesting

    At least you had the controller to get an idea what to hook the drive up to to make it work. That might give you a better idea if it was formatted RLL or MFM. After you get the drive hooked up with a replacement controller, then there's the challenge of determining the interleave and inputting the bad sector table (hopefully no more were added that weren't printed on the drive).

    The problem would then be how to transfer the data off the computer, mount the drive in something else, etc. At least storing the ultimate data wouldn't be a problem, I could back up 1000 of these hard drives on my keychain fob.

    You might actually find someone that can restore that data, but yes, there are many 'techs' that wouldn't immediately disqualify themselves from touching one of these and would destroy the disk data in attempting. Then try giving a Geek Squad tech a 9-track tape to back up if you really want to see a head explode (and those can be used in modern operating systems too).

  17. Use a camcorder by tepples · · Score: 3, Informative

    So where is the (completely legal under US law) software that the Library of Congress can use to back up Blu-Rays that have been released recently?

    It's called the analog hole, and the MPAA has endorsed it.

  18. Actually, no by abulafia · · Score: 2, Informative

    You are correct about federal law. State copyright law is a very different kettle of snapping turtles with regards to audio copyrights. Due to weirdness in the way federal copyright law is constructed, audio recordings made before 1972 are not covered, and so federal copyright law does not preempt state law, and so audio works made prior to then are covered by state common law copyright. In most states, this affords protection until 2049. Some states passed anti-copying and other laws, making it a huge minefield to figure out what the exact legal status is.

    There's an excellent paper explaining this available, if you want the details.

    --
    I forget what 8 was for.
  19. Repeated but flawed assertion by Dogtanian · · Score: 4, Interesting

    We will be a mystery to archaeologists of the future.

    No we won't, and I'm tired of hearing this trite assertion repeated as a truism. This is one of those things that has become a meme because it sounds plausible, but under analysis it's flawed because it (a) disregards the massive proliferation of digital data and (b) misapplies digital fragility.

    To start off with, most artifacts and information from previous cultures have likely perished too. On top of this we're producing a staggering amount of information- or at least data- in general compared to previous generations.

    It's true that any given piece of data stored on a given digital medium is arguably at higher risk of being lost. But this disregards the fact that there may easily be multiple copies of that information stored elsewhere.

    However, the primary flaw is that it focuses on the fragility of any *specific* piece of digital information, e.g. that photo of your dog in a funny hat you have stored on a mouldering old CD-R is at serious risk of being lost forever. While that's true, it doesn't apply to this situation, because our future archaeologists or historians probably won't require specific pieces of information to have a decent idea of our culture- they'll merely require an adequately large arbitrary selection of such data to get a decent picture of who we were.

    And because there's so much data out there, we could probably lose 99.999% of the stuff at random and it'd still probably be far easier to reconstruct our culture than those that have gone before.

    So yeah, if one is worried about a particular hilarious photo of their dog, or any given film, or whatever... digital fragility is an issue. But using it to asssert that our culture is going to become a digital "black hole" to future generations is fundamentally flawed.

    We will not disappear from history- at least not for those reasons.

    --
    "Slashdot - News and Chat Sites Deviant". (Click "homepage" link above for details).
  20. Re:software error recovery by iluvcapra · · Score: 2, Interesting

    This is effectively what the error codes are for. On a raw level there's no such thing as a "missing bit," you either have a zero or a one, but error codes can tell you if it's the correct value or not. If enough of the data and redundant Reed-Solomon codes are on the media, the incorrect value can be corrected, and if there isn't enough, for small errors the player can interpolate. Because a Red Book CD carries a very specific kind of high redundancy data, PCM audio, the reader can then use various strategies to recover something that sounds remotely like what was their originally, if not exactly.

    A big drawback of the "interpolation" scheme is that CDs can sound excellent and then suddenly start to fail catastrophically; with tapes and records they would slowly wear down over time, but CDs are much more all-or-nothing. A colleague was telling me recently about engineering sessions in the 80s and having to work with DASH machines, which were big 24 track digital audio tape machines, that used Reed-Solomon codes to allow you to edit the digital tape with razor blades. You had to remove the front panel if you wanted to see the LEDs for the error correction system, and you'd watch those very carefully over the course of the session to make sure these weren't working too hard, and if they were, you'd take the tape and run off a clone before you started having dropouts.

    This is very different than a data CD-R -- data CDRs still have error correction and redundant data coding, but if these fail the original file will be corrupt, period, which is why, for added safety, you might create parfiles or something similar.

    I guess the upshot of this is that CDs, as they currently operate, particularly CD-Rs but even glass masters, aren't such a hot medium for archival. The failure modes are too severe and can leave you with a moldy loaf instead of half a loaf.

    --
    Don't blame me, I voted for Baltar.
  21. Re:OOh. You've got media that lived nine years by Reziac · · Score: 2, Insightful

    On that note, I was appalled at what I saw on the discard table at my university's library. Huge pile of old technical and research volumes, some dating to the mid-1800s. Outdated? Yeah. Often wrong? Sure. But a snapshot of the state of science at the time, which is itself a valuable historical resource.

    We no longer believe in (most of) the gods and demons our ancestors did, but it's still culturally useful to have information on the beliefs of the era. We no longer practice the styles of government, the human sacrifices, and whatever else our ancestors did, but it's still valuable to know where we came from. Add more examples as the spirit moves you.

    --
    ~REZ~ #43301. Who'd fake being me anyway?
  22. What about PRINTING the data? by AlejoHausner · · Score: 3, Interesting

    Short of carved writing on stone tablets (eg, the Behistun monument), the longest-lasting medium I can think of is printed paper. Libraries know how to archive it: it's called a book.

    There are ways to take digital files and convert them to bitmaps (eg www.ollydbg.de/paperbak). You can print the bitmaps, and read them back reliably with a scanner. About 500K can fit on one page of paper, so a one-hour MP3 recording (about 60MB) would take up 30 sheets of paper. If printed on acid-free stock, this should last for centuries. The pages could be bound in a book, whose introduction would describe the encoding, and provide an algorithm to extract the data.

    Why rely on currently-fashionable media like the chemical dyes in a CD-R when good old reliable natural-fiber materials like paper are known to last centuries?

    Alejo Hausner

    1. Re:What about PRINTING the data? by AlejoHausner · · Score: 2, Interesting

      Of course it's space-inefficient. But if you're the Library of Congress, you're probably willing to endure the low bandwidth. You certainly won't be able to retrieve the information quickly, but if you're archiving the data, you can tolerate slow retrieval.

      It's not quite as bad as you think, though: if you've saved a 4.3 GB DVD onto 2200 pages of paper, and you placed the printed stack onto a sheet-fed scanner which does about 1 page/second, it would take you about half an hour to do the scanning.

      That's less time than it takes to play the DVD!

      Physical space inefficiency would be an issue. DVDs are small, but 2200 pages takes up as much space as a box of files, about one cubic foot (about 30 liters, or 0.03 cubic meters). Not to mention that paper is heavy.

      That's the cost of permanence.

      Alejo