Slashdot Mirror


Saving Digital History

Gavinsblog writes "The Washington Post is reporting that the Library of Congress in the U.S. plans to initiate the $100 million National Digital Information Infrastructure and Preservation Program (NDIIPP). It is hoped that the project will lead to the preservation of data that is constantly changing on the Internet. But I wonder who will choose what is worth saving?" This may remind you of the LOC's effort to preserve and digitize the audio collection in the National Recording Registry.

133 comments

  1. one persons trash... by trefoil · · Score: 5, Insightful

    is another person's treasure.. I'd say just save it all and allow others to sift through and decide what is worthwhile and what is worthless.. just like the library..

    1. Re:one persons trash... by NotAnotherReboot · · Score: 2, Insightful

      From the article:

      On top of the $5 million the library received for planning the initiative in 2000, the plan approved yesterday releases another $20 million of funding to develop a system for evaluating and storing digital information. Just as the library receives more than 20,000 printed pieces each day but keeps less than half, it now faces the herculean task of deciding what digital information should be saved for future generations.

      --
      The library doesn't keep all of the printed information it receives, and keeping all of the information online is an enormous, if not impossible, task. Archive.org has terabytes upon terabytes of data, and they don't even come close to having everything that was on the web at any one time. With the budget they're talking about, keeping all of this information would most definitely not be possible.

    2. Re:one persons trash... by whereiswaldo · · Score: 3, Insightful
      Repeat after me:

      Disk space is cheap.
      Disk space is cheap.
      Disk space is cheap. ....
      Save everything. ;)

      "The Navy has both a tradition and a future--and we look with pride and confidence in both directions."

      Admiral George Anderson, CNO, 1 August 1961.
  2. Will There Be a EUian Counterpart? by tealover · · Score: 0, Flamebait

    EUians hate to have America take the lead on everything so I imagine France will try to create an EUian consortium to do their own version.

    "Bonjour, you cheese eating, surrender monkeys"

    :)

    --
    -- You see, there would be these conclusions that you could jump to
    1. Re:Will There Be a EUian Counterpart? by Anonymous Coward · · Score: 0

      You are so mean! What would your mother think of you?

    2. Re:Will There Be a EUian Counterpart? by Pharmboy · · Score: 1, Funny

      EUians hate to have America take the lead on everything so I imagine France will try to create an EUian consortium to do their own version.

      "Bonjour, you cheese eating, surrender monkeys"


      With Belgium and Germany, and call it the World Digital Information Infrastructure and Preservation Program (WDIIPP), right? ;)

      --
      Tequila: It's not just for breakfast anymore!
    3. Re:Will There Be a EUian Counterpart? by Cookeisparanoid · · Score: 1

      I hope so; the world doesn't revolve around the United States, you know. Oh, and it's not EUian, it would be an EU counterpart.

    4. Re:Will There Be a EUian Counterpart? by Anonymous Coward · · Score: 0

      No, the world revolves around the sun, and if you'd like, we can make France feel like one.

    5. Re:Will There Be a EUian Counterpart? by Cookeisparanoid · · Score: 1

      No thanks, I don't think the fallout would be too nice for the UK, or the US for that matter. Plus, it's one thing to take on stone-age powers, but I think the US might not find taking on fellow nuclear powers like France (or China, for that matter) such a trivial occurrence.

  3. What about the... by ksheka · · Score: 1

    ...DMCA?

    It's nice that the government can ignore it at will, at least till someone in Hollywood notices...

    --
    alias uptime="echo '5:33pm up 22342352324 days, 6:28, 2124315623 users, load average: 2432.40, 12312.31, 123123.19'"
    1. Re:What about the... by wastaz · · Score: 1

      New exciting lawsuit brought to you from the RIAA/MPAA: "The government ate my copyrights, so we sued it and won!"

  4. skip slashdot. by Anonymous Coward · · Score: 5, Funny

    No need to add slashdot as one of the websites. They keep reposting stories here as an initiative to preserve their own history.

    1. Re:skip slashdot. by Scud_the_disposable_ · · Score: 1

      hey, I thought that was funny... Why mod it down as flamebait???

    2. Re:skip slashdot. by whovian · · Score: 1

      What the....?

      $RecursionLimit::"reclim": "Recursion depth of 256 exceeded."

      --
      To-do List: Receive telemarketing call during a tornado warning. Check.
    3. Re:skip slashdot. by Froobly · · Score: 1

      I know that was meant as a joke, but skipping sites like Slashdot or Kuro5hin, which tend to maintain their own archives quite well, wouldn't necessarily be such a bad idea. Perhaps the sites most worth saving wouldn't be the Slashdot stories themselves, but the sites linked to by them. The slashdot effect tends to make people want to take their content off-line, which means that the most interesting content lasts the shortest. Archiving the contents of those sites ought to be the top priority.

  5. Except... by Anonymous Coward · · Score: 0

    Their archive will only contain French language portions of the internet to keep their language pure.

    1. Re:Except... by Anonymous Coward · · Score: 0

      Sacré bleu!

  6. Saving Digital History by Anonymous Coward · · Score: 0

    Saving Digital History!!! more like celda

  7. New Media Doesn't Last by spun · · Score: 4, Insightful
    It all degrades faster than plain old ink on paper. There are plenty of books that last hundreds of years if kept in appropriate conditions. Film decays pretty rapidly. Tapes don't last, even CDs and DVDs wear out pretty quickly. Gopher is all but gone. Web pages disappear daily.

    The irony is that, while digital files could be preserved indefinitely in absolute perfection, many are being completely lost in much less time than it would take a book to turn to dust.

    Kudos to the folks at the Library of Congress, and other projects like the Wayback Machine, who are working to preserve a surprisingly ephemeral medium.

    --
    - None can love freedom heartily, but good men; the rest love not freedom, but license. -- John Milton
    1. Re:New Media Doesn't Last by Xzzy · · Score: 4, Interesting

      > There are plenty of books that last hundreds of
      > years if kept in appropriate conditions.

      My suspicion is that punch cards will make a return at some point. ;) No, really. Not only does it resolve the longevity issue, but it could also solve the issue of obsolete reading hardware (seems to me it'd be easier for a distant generation to rig up a punch card reader than a CD-ROM drive). Punch cards are in a rather obvious format as well, if worst came to worst and humanity nuked itself back to the stone age.. in ten thousand years a disc that looks like a mirror is probably harder to translate than a piece of paper with regularly spaced holes.

      I think the only difference will end up being the material used; how many centuries could a stainless steel plate with pin sized holes last in a library's basement?

    2. Re:New Media Doesn't Last by ebbomega · · Score: 2, Interesting

      The difference being that this archiving is _digital_, though...

      Didn't you pay attention in that IT class when they were explaining the difference between Digital and Analogue? Digital's main advantage is its reproducibility. So if, say, the CIC Lib^H^H^H^H^H^H^HLibrary of Congress were to refresh the information once every five years or something like that, then you've got an indefinite storage period. The problem with it is that it needs constant maintenance. The reason this is better than analogue archives is pretty simple... when analogue decays, it's pretty much never going to achieve its original quality. You can do things to try and make it similar, but you're never going to get it as pure as the original.

      With digital archives, you can avoid the decay simply by transferring. This isn't an option really with analogue because once you transfer, you tend to lose quality. But bits are simply 1s or 0s, and digital transfer can be perfect. Throw some md5 checksums in there to make sure that you don't corrupt the data, and boom... you've got perfect digital copy.
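The refresh-and-verify idea above can be sketched in a few lines of Python. This is only an illustration, not anything the Library of Congress is known to use: the function names `file_digest` and `refresh_copy` are made up for the example, and MD5 is used because the comment names it (it is adequate for catching accidental corruption, though a stronger hash such as SHA-256 is the more usual choice today and can be selected through the `algorithm` parameter).

```python
import hashlib
import shutil


def file_digest(path, algorithm="md5", chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in chunks."""
    h = hashlib.new(algorithm)
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def refresh_copy(src, dst, algorithm="md5"):
    """Copy src to dst and confirm the copy is bit-identical.

    Hypothetical helper: raises IOError if the checksums differ,
    otherwise returns the verified digest.
    """
    before = file_digest(src, algorithm)
    shutil.copyfile(src, dst)
    after = file_digest(dst, algorithm)
    if before != after:
        raise IOError(f"checksum mismatch copying {src} to {dst}")
    return after
```

Calling `refresh_copy("old_media/file.dat", "new_media/file.dat")` (paths invented here) on each refresh cycle, and keeping the returned digest alongside the file, would let a later cycle detect corruption that crept in between copies.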

      --
      Karma: Non-Heinous
    3. Re:New Media Doesn't Last by anubi · · Score: 1
      It all degrades faster than plain old ink on paper. There are plenty of books that last hundreds of years if kept in appropriate conditions.

      What scares me too is a lot of the stuff today is not only on very ephemeral media, it's also encrypted so that it is readable only under very special circumstances.

      It seems that content is doomed once the technology used to decrypt it is gone.

      --
      "Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]

    4. Re:New Media Doesn't Last by Cookeisparanoid · · Score: 1

      It's so true. Take the Domesday Book as an example: the original text survives (albeit in a dead language, Latin), whereas the BBC project from the 1980s needed to be rescued only two decades after it was compiled.

    5. Re:New Media Doesn't Last by spun · · Score: 2, Insightful
      Yes, I understand the difference between digital and analogue. I didn't learn that in IT class, I learned it when I was 10, on my own, building a robot from scratch using a Z-80 microprocessor.

      That is why I said, "The irony is that, while digital files could be preserved indefinitely in absolute perfection, many are being completely lost in much less time than it would take a book to turn to dust."

      Did you even read my comment before firing off a snide reply?

      --
      - None can love freedom heartily, but good men; the rest love not freedom, but license. -- John Milton
    6. Re:New Media Doesn't Last by Mostly+a+lurker · · Score: 5, Insightful

      Sorry, but this idea will not fly on a number of grounds. Consider how many punch cards would be needed to save even 4.7GB of data (contents of one DVD). It would take over 50,000,000 cards (even if they did not contain sequence numbers). The creation and storage costs would be astronomical and reading them back in to find any data you wanted would take weeks -- just for a single DVD's worth of data. Further, much of the most useful data (images and sound recordings) is more difficult to store on punch cards than on almost any other alternative medium.
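The card count above is easy to check. The sketch below assumes a standard 80-column card holding one byte per column and no sequence numbers, and an illustrative reader speed of 1,000 cards per minute; both figures are assumptions for the arithmetic, not numbers from the comment.

```python
BYTES_PER_CARD = 80          # assumption: 80 columns, one byte per column
CARDS_PER_MINUTE = 1_000     # assumption: illustrative reader speed


def cards_needed(num_bytes, bytes_per_card=BYTES_PER_CARD):
    """Number of cards needed to hold num_bytes (rounded up)."""
    return -(-num_bytes // bytes_per_card)  # ceiling division


def days_to_read(num_cards, cards_per_minute=CARDS_PER_MINUTE):
    """Days of non-stop reading at the given reader speed."""
    return num_cards / cards_per_minute / 60 / 24


if __name__ == "__main__":
    dvd_bytes = 4_700_000_000            # 4.7 GB, decimal gigabytes
    cards = cards_needed(dvd_bytes)
    print(cards)                         # 58750000
    print(round(days_to_read(cards), 1)) # 40.8
```

Under these assumptions a single DVD comes to 58.75 million cards and roughly six weeks of continuous reading, which is consistent with the "over 50,000,000 cards" and "weeks" estimates.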

    7. Re:New Media Doesn't Last by cshirky · · Score: 2, Insightful

      An indefinite storage period is only part of the problem. Even if you keep the 1s and 0s by copying them every five years, file formats go out of scope, and even if you keep the software the file was saved in, the OS that ran it may well be dead (most are, after all) and even if you save a copy of the data _and_ the application that can read it _and_ the OS, what hardware are you going to run it on?

      So it's a nested set of problems, with no one solution -- copying, conversion and emulation will all be required.

      There are two major advantages of analog over digital: the first is that inaction over a period of years does not destroy analog material. If you put a stack of paper in a box in the early 90s, it's probably fine. That degree of inaction, however, can be the death knell for digital material. If you put a stack of CD-ROMs or disks away in the early 90s, chances are at least some of that material is gone.

      The second is that while analog degrades slowly, bit-sensitive digital data (encrypted, compressed or executable files) degrades extremely quickly. If you make a mistake handling a book, say, you may end up with one torn page, but if you lose even a small piece of a bit-sensitive file, the entire thing vanishes forever.

      -clay

    8. Re:New Media Doesn't Last by Xzzy · · Score: 2, Interesting

      I didn't say it was practical, or suggest that the density would be anything to write home about.. but the fact of it is, we haven't yet been able to develop a digital storage format that is longer lived than punch cards. ;)

      I'm sure something better that's got a life of a thousand years or more will come along eventually, but speaking in the here and now the only way to get that is with holes in a piece of paper.

    9. Re:New Media Doesn't Last by JustaGiga · · Score: 4, Insightful

      It's not only a concern that physical media may become obsolete, but also the algorithms with which data is encoded on the media. We have lots of old backup media (reel-to-reel tape, 8mm tapes) at work that are probably still readable, but no one knows how the data was encoded on that media or, more importantly, what information is on which tape.

      Most commercial tape backup solutions have proprietary encoding solutions, and who knows if that company is going to be in business/supported in 50 years. In fact, for true(r) long-term storage, it's recommended to copy the data from the commercial tape backup solution copy to plain old tar.

      Keeping an archive on media that will be around in 50 years seems like a minor point compared to finding the exact tape with the right data you need in a format you can still decode.

      -JG

    10. Re:New Media Doesn't Last by tcr · · Score: 1

      Perhaps it's time to create a digital version of the Rosetta stone!

      --


      Information wants to be beer.
    11. Re:New Media Doesn't Last by tcr · · Score: 1

      What's your definition of "digital media"?

      Want to see some punched cards from the 1700's ?

      --


      Information wants to be beer.
    12. Re:New Media Doesn't Last by Cybrr · · Score: 1

      Maybe future archeologists will use plastic food wrappers to translate books and RFCs so they can fire up a copy of Babelfish. ;)

      --
      Why did GEAR crush RDP?
    13. Re:New Media Doesn't Last by ecloud · · Score: 1

      How about punched tape? It would be more compact, I think. Probably best to use some kind of long-lived plastic rather than paper (mylar maybe?). The holes could be really close together and maybe 32 or 64 bits across the width of the tape.

  8. WaybackMachine by ChunKing · · Score: 5, Informative

    Isn't this already being done by the WaybackMachine (http://www.waybackmachine.org)?

    --
    cogito ergo sig...
  9. DMCA??? by attobyte · · Score: 1

    I am sure this would violate the DMCA somehow. Just thought I would point out that the government has no clue what they passed.

    Mike

    --
    I didn't use the preview button, so get over it!!!!

    Mike

  10. Internet archive already exists by l33t-gu3lph1t3 · · Score: 0, Redundant

    It's called Wayback ^_^. But agreed, we need a more comprehensive method of archiving for posterity...but how do we go about doing that? Hard Drives don't last forever, nonvolatile memory is frikkin expensive, and optical media dies after 10-15 years...

    --
    ------- "From bored to fanboy in 3.8 asian girls" ----------
  11. Related News by LongJohnStewartMill · · Score: 1

    It is hoped that the project will lead to the preservation of data that is constantly changing on the Internet.

    In related news, the Library of Congress has also purchased a subscription to Playboy.

  12. Off-site backup... by Anonymous Coward · · Score: 0

    It's obviously a good idea to fortify the storage site, but what kind of arrangements will they have for off-site backup?

    If these are the most important recordings, it would be a tragic loss to have a natural disaster or similar event destroy what may be the only complete recordings.

  13. That's good. by Black+Parrot · · Score: 4, Funny


    I deleted all my porn, and I was afraid I wouldn't be able to get it again when I need it.

    --
    Sheesh, evil *and* a jerk. -- Jade
    1. Re:That's good. by ubugly2 · · Score: 2, Funny

      <stonerchick>i was on slashdot and all of a sudden beepbeepbeepbeep and all my porn was gone....it was really good porn</stonerchick>

  14. Something Old, Something New by SparklesMalone · · Score: 5, Funny

    How much energy should humanity spend remembering its past? I love history, but frankly I'd rather they fund more discoveries (i.e. NASA) than archive drivel like my slashdot musings.

    1. Re:Something Old, Something New by Anonymous Coward · · Score: 0

      Yay! Same mistakes again.

      History does have a good function.

    2. Re:Something Old, Something New by cyril3 · · Score: 2, Funny

      I have no doubt there will be a special filter for your material.

  15. So the US Gov is setting up a mirror? by dubiousmike · · Score: 0, Redundant
  16. Already doing this in Sweden by Anonymous Coward · · Score: 0

    Just wanted to point out that this has already been happening in Sweden for a few years, since 1994 I think. The Royal Library of Sweden is constantly archiving the Swedish part of the internet, which afaik covers the .se tld.

    1. Re:Already doing this in Sweden by Anonymous Coward · · Score: 0

      So, will they have to buy a new hard drive for the computer soon, or does the 20 megger still have some space?

  17. IA by Anonymous Coward · · Score: 0, Redundant

    What about the Internet Archive? Aren't they already doing this?

  18. Tell them CD's scratch. by zymano · · Score: 0

    oh oh.

    1. Re:Tell them CD's scratch. by Anonymous Coward · · Score: 0

      zappa333@hotmail.com

      spambot food.

      also: what a crappy email address.

  19. What about the DMCA? by kfg · · Score: 3, Interesting

    Good question. Why not sue them for infringement for reproducing your post and find out?

    KFG

  20. Quality not quantity by wiggys · · Score: 3, Insightful
    We already suffer from information overload as it is. Why bother to save the hundred million Geoshities webpages anyway? What's the point of keeping all the data when it's boring and irrelevant?

    Plus not all the data can be saved anyway... sites such as the Internet Movie Database, Amazon.com, and even Multimap are database-driven. Even assuming you get access to the underlying database you still need to preserve the code which gets used to generate the pages. And for what purpose?

    Add to that the problem of accessibility. If the data isn't laid out in an easy-to-browse fashion then it's as good as dead anyway. I prefer to browse a library by topic, not searching for keywords and hoping a nice book pops out.

    --

    Sorry, but my karma just ran over your dogma.

  21. Off-topic by Anonymous Coward · · Score: 0, Offtopic

    So nice to see -any- mention of the many protests that took place this weekend around the world. There were -huge- turnouts against the potential war with Iraq. Inspiring, to say the least.

    It was good to see the public finally have a chance to voice their disapproval over this entire matter.

    Too bad the same can't be said of /.

    1-2 million showed up in London. It was fantastic!

    1. Re:Off-topic by Anonymous Coward · · Score: 0

      Ignorance, fear and lack of respect for Arabs - these were the most obvious traits on display in yesterday's demonstration against a war in Iraq. Could so many people really think that it is better to leave Iraqis under Saddam Hussein's vicious tyranny than to liberate them from it?

      Their protests suggest that it is not worth risking anything at all to free Arabs. To risk spilling a single drop of blood to liberate Iraq would be futile - not merely because it would be "destabilising" or "kill children", but because the Arabs have no capacity for "Western" freedom anyway. Behind the demonstrators' slogans lies the assumption that Arabs should be left alone: they don't mind being brutalised, tortured and murdered by a fascist thug like Saddam. Where they come from, it is the natural order of things.

      That line of thought is nonsense. More than that - it is racist nonsense. No one knows better than the Arabs the horror of being oppressed. No one knows better than they that tyrannical oppression is all that they will get so long as Saddam and his family are in power. Saddam's despotism is not a denial of "Western" freedom: it's a denial of the freedom that every person needs to be able to live a worthwhile life. To imagine that the Iraqis don't want to be freed, or are not entitled to it, is simply to suppose that they are less human than us.

      It is shocking to discover how deep lies the prejudice against Arabs being able to enjoy freedom. It is to be found in some surprising places other than the demonstration in Hyde Park: the CIA, for example, and the US State Department have long taken the view that Iraq is so tribal and retrograde a country that only a brutal dictator like Saddam could control it.

      For them, the problem with Saddam is not that he is a murderous, tyrannical son of a bitch. It is that he isn't any longer our murderous, tyrannical son of a bitch. They had to be persuaded by the supposedly militaristic Donald Rumsfeld, the Defence Secretary, and the Pentagon, to give democracy in Iraq its chance. Ahmad Chalabi of the Iraqi National Congress and other exiles are now preparing to take over. Kanan Makiya, one of the most brilliant among them, has been drafting a new constitution for sharing power among Iraq's disparate elements. Since they cannot liberate themselves, others have to do it for them. That is the point of our invasion.

      What is more depressing than the ignorance and fear of yesterday's demonstrators, or even than the prejudice of the State Department, is the opposition to the liberation of Iraq voiced by some of Britain's most distinguished public servants. Sir John Killick, a former ambassador to the USSR, Sir Andrew Green, recently retired as ambassador to Saudi Arabia, Sir Timothy Garden, a former air marshal, and General Sir Michael Rose, have all come out against an invasion. These men must know that the effect of not going to war will be to prolong the rule of Saddam. They nevertheless oppose any attempt to topple him because, they say, the consequences will be dire. There will be untold numbers of casualties, and there will be "explosive instability" in the Arab world.

      Their claims simply do not stand up. Before the last Gulf war, there were many similar predictions of doom and disaster. In the event, the number of casualties on the allied side was less than 200. Half of those were victims of friendly fire. The number of deaths on the Iraqi side was certainly much greater, but even so, the numbers have been greatly exaggerated. This time, Iraq is much weaker after 10 years of sanctions than it was in 1991. American technology is much better: laser-guided bombs are now more accurate and will form a higher percentage of the ordnance. Saddam has no air force of any significance. It means that the moment his troops come out of their bunkers, they will be destroyed by the coalition. As a result, we can be pretty confident that they will not come out.

      It is unlikely that the war in Iraq will consist only of a land invasion. Rather, teams of special forces will be used to seize and secure strategic positions, such as the oilfields and the dams on the Tigris and Euphrates, so that they can be protected from any attempt to blow them up. If this can be done quickly, there may well be no civilian casualties at all: the regime may simply implode, leaving Saddam to the fate of Ceausescu - a dictator barking orders that no one obeys. Saddam is known to be highly conscious of that possibility. According to defectors he keeps a tape of the toppling and execution of Ceausescu and watches it regularly.

      Far from leading to an "explosion" in the Arab world, the removal of Saddam would do much to encourage stability in the Middle East. Baghdad would cease to be a haven for terrorists, particularly the Palestinian suicide bombers whom Saddam has subsidised. The majority of Arabs long to see Saddam removed. A number of Arab governments are tyrannies only marginally less brutal than that of Saddam Hussein. They view his removal with anxiety, for they know the precedent it will set: if a democratic Iraq flourishes, it will be an inspiration to - among others - the peoples of Saudi Arabia, Syria and Egypt. It will encourage all of them to get rid of the corrupt dictators who have oppressed, stultified and impoverished their countries - just as the fall of the Berlin Wall encouraged the whole of eastern Europe to replace tyranny with democracy and socialism with private enterprise.

      When he was in Rome recently, Barham Salih, the Prime Minister of Kurdish Iraq, said that he saw around him a parliamentary democracy in a country liberated by America from the fascist Mussolini. So it would be with Saddam. Salih's implication that a democratic, prosperous Iraq is the most likely outcome of an American invasion is absolutely right. It is a testament to the power of ignorance and prejudice that so many people in Britain cannot see it. Anyone looking for evidence of the decline of this country's moral and intellectual authority will find it in the thoughtless stampede with which the peace party has assembled.

  22. the big red dot !?!? by Brigadier · · Score: 2, Interesting



    This may sound like a joke but I really hope they save the big red dot. I don't know if the website is still in existence but a while back there was a website that had a big red button. When you clicked it, it said you have clicked the big red dot. The counter had some ridiculous number. This was back when it was en vogue to show off your hit count.

  23. Finally... My Dream job... by ebbomega · · Score: 1

    ... Of Being a Freelance Hacker and Concert Promoter has come true... All I need is a couple of swords and a Pizza Delivery Job and I can make tons just gathering Intel...

    Stephenson's a Genius... We're basically looking at the first instance of the CIC Database....

    Now we can start looking at the Metaverse and nanodrugs.... I seriously can't wait...

    --
    Karma: Non-Heinous
    1. Re:Finally... My Dream job... by AndroidCat · · Score: 1

      But .. watch out for that pool. That one! Right in front of .. oh well, too late.

      --
      One line blog. I hear that they're called Twitters now.
  24. Content by hdparm · · Score: 1
    But I wonder who will choose what is worth saving?

    Well, for a start they may as well mirror Google cache and go from there. A panel of recognised authorities should not have too much trouble deciding the standards for the worthiness of existing material. They will need a high level of independence, perhaps total autonomy, to be able to do a fair job.

  25. sloshdat and Mod Point for history by QEDog · · Score: 3, Funny
    "But I wonder who will choose what is worth saving?"

    Well, maybe they can come up with a system where people post what they think is important in history and then some of the same people moderate that up or down, using a unit called Mod Points, to decide whether or not it is worth saving... maybe call it sloshdat.

    A mechanism would be devised to protect the figures that make history against the people reading the history, an effect that could be called Sloshdatted.

    I'm sure that with a system like this, historic figures such as many of the presidents would be Modded Down, while anyone who trashes an established monopolistic corporation would appear in the history books.

    A system like this, would, without any doubt, save and Mod Up a comment like the present one for future generations.

    --
    "There is no teacher but the enemy."-Mazer Rackham
    1. Re:sloshdat and Mod Point for history by cybercuzco · · Score: 1

      The only problem is that what people think is historically significant usually isn't significant to historians. The most information about a people is found in the garbage dumps of ancient civilizations. Who knows what future historians will want to look at?

      --

  26. database driven by SparklesMalone · · Score: 1

    Could this be just the back-door excuse John Poindexter needs to get his Information Awareness Office to mirror databases?

    One Noid; not a pair.

  27. Knowing the current administration by Anonymous Coward · · Score: 0

    ... they'll just change the project to archive everyone's e-mail instead ;)

  28. Demi Moore Is My Cousin by Acidic_Diarrhea · · Score: 1
    Save it all? I don't think there would be an article if that was an option. You see, there are time and space limitations imposed by the problem that you simply have not considered. Constantly "saving it all", even those parts that have not changed since the last instance, is a poor idea.
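The point about not re-saving parts that have not changed can be illustrated with a content-hash check. This is a minimal sketch under stated assumptions: `SnapshotStore` and its `record` method are hypothetical names invented for the example, everything is held in memory, and SHA-256 is simply one reasonable choice of hash. It is not a description of how any real web archive works.

```python
import hashlib


class SnapshotStore:
    """Toy archive that stores a page only when its content changes."""

    def __init__(self):
        self.blobs = {}    # digest -> content, stored once per unique content
        self.history = {}  # url -> list of (timestamp, digest)

    def record(self, url, timestamp, content):
        """Record a crawl of url; return True if a new version was stored."""
        digest = hashlib.sha256(content).hexdigest()
        versions = self.history.setdefault(url, [])
        if versions and versions[-1][1] == digest:
            # Unchanged since the last crawl: nothing new to keep.
            return False
        self.blobs.setdefault(digest, content)
        versions.append((timestamp, digest))
        return True
```

With this layout, crawling the same unchanged page a thousand times costs one stored copy, and identical content served at different URLs is also stored only once.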

    --
    I hate liberals. If you are a liberal, do not reply.
  29. sounds like a task for trec by Anonymous Coward · · Score: 0

    This seems like a pipe dream as it is. Perhaps they should add a track at TREC for such a large scale search / archive project.

  30. As a society. . . by kfg · · Score: 2, Interesting

    we have an incredible fascination with spending today looking at where we were yesterday instead of where we are or where we're going.

    I'm not talking about history. I love history. My shelves are well stocked with various dead trees delineating history.

    I'm talking about our own lives. When we go on vacation we tend to spend most of our time *documenting* our trip rather than living it. Then we live it "in absentia" as a kind of recreational post mortem.

    It's a fascinating thing to observe, but I admit it puzzles the hell out of me.

    This point was driven home to me a while ago when someone pointed out how odd it was that I only have one photograph of my SO of 10 years. I only have it because my mother took it. In my mind why would I want a photograph when I could just look at *her*?

    KFG

  31. Dudes by Exiler · · Score: 1

    That is A LOT of pr0n.

    --
    Banaaaana!
  32. Please tell me..... by Scrab · · Score: 2, Funny

    that the goatse man will NOT be preserved in this way......

    *shudders*

    --
    RoseColor red={0, 0xffff, 0x0000, 0x0000};VioletColour blue={0, 0x0000, 0x0000, 0xffff};find / -name *mybase*|chown you
    1. Re:Please tell me..... by ymgve · · Score: 1

      Please tell me that the goatse man will NOT be preserved in this way......

      Oh, but he MUST be preserved! How else can future historians understand just how much we fear that site today?

  33. National Security by handy_vandal · · Score: 2, Interesting

    It is hoped that the project will lead to the preservation of data that is constantly changing on the Internet.

    One possible reason: because the OIA and Company might need the data to track down terrorists, etc. (Much the same way that the FBI keeps a collection of outdated phone books.)

    After all, when the events of Iran-Contra blew over, Congress quietly passed a bill authorizing the CIA to use any Federal agency for cover. Why not the Library of Congress? Indeed, where else? Makes perfect sense.

    --
    -kgj
    1. Re:National Security by tjic · · Score: 2, Insightful

      Do you think that the intelligence agencies are only now realizing that this is a useful idea? This article isn't about the black archives - you can assume that they've existed for years and have no such funding constraints.

  34. Choosing what should stay.... by MosesJones · · Score: 2, Insightful

    The answer is simple... what represents the government mindset of the day will be chosen to represent that mindset in the future. Cynical? Of course not. Why would they be even-handed? Will they store what Al Jazeera says rather than what the Washington Post says? Why would the views of Palestine be represented over the views of Israel?

    Or of course they will steer clear of politics and pick only science and absolute news, thus making it pointless for future historians.

    Saving what is said OVER what is already saved is an interesting idea, but will this be targeted beyond those people who already retain everything (like CNN and the BBC) or will it include them ? The BBC store everything, "Just in case", will this money record that information yet again, or will it concentrate on other fields after ensuring that the BBC information is already available?

    Historians of the future will have more information than historians of any other generation. Their problem will be that the myriad of views reflected via this information doesn't mean an increase in the spectrum of political opinion, but the ability of everyone to be opinionated.

    Their worst problem is that the leaders of the day (Bush, Blair et al.) don't stand out like the leaders of previous years. Will anyone rate the speech of Powell or Bush against Churchill or Kennedy? Nope. So how to judge the politics of today, how to judge what should be stored? We have no leaders of merit, we have only rhetoric. So choose what to store, and realise that history will judge as much what you choose to save, as what you saved. This is a different problem to that which has faced historians up till now.

    --
    An Eye for an Eye will make the whole world blind - Gandhi
  35. Don't put it on a floppy drive. by AintTooProudToBeg · · Score: 1

    Our descendants won't be able to read the data if you put it on a floppy -- Dell and Apple are trying to rid the world of them.

  36. Open plea by grub · · Score: 3, Funny


    Dear U.S. Library of Congress,

    Although not a U.S. citizen, I implore you to retain redundant backups of the website goatse.cx. Losing this website to a disaster would be tantamount to losing the collective works of Shakespeare, DaVinci and Picasso. The goatse.cx guy is an artist in the truest sense of the word.

    Yours very truly,

    grubby

    --
    Trolling is a art,
  37. Time will tell by Timesprout · · Score: 1

    It will be interesting to see how this pans out. Currently I suppose many of the items we have preserved from the past exist because they had some inherent monetary value or were of sufficient quality that time and effort was taken to preserve them. This is not the case with digital media, as basically it's all 1's and 0's to store the content, which tends to have highly subjective value. What will the future judge to be historically important I wonder, the first recorded blog ?, the first mail promising increased penis size? web services? (insert anything you can think of here)?

    I was wondering also about how they actually plan to physically store this information for extended periods of time. I was going to post a question about it until something occurred to me. In 500+ years time I can't really imagine many people will give a crap about much of the digital material that is being churned out today. It will most likely be a case of viewing something like AOTC, falling on their asses laughing at the "special effects" but reaching male consensus that Natalie Portman was a babe.

    --
    Do not try to read the dupe, thats impossible. Instead, only try to realize the truth
    What truth?
    There is no dupe
    1. Re:Time will tell by zcat_NZ · · Score: 1

      ..the first recorded blog ?, the first mail promising increased penis size?

      When Google Groups went up, they did specifically mention the first major 'spam' (C Greencard) in their press release.

      It all went to shit after that.

      --
      455fe10422ca29c4933f95052b792ab2
  38. No material can be ignored. by Ignorant+Aardvark · · Score: 3, Funny

    We need to take extra precautions to preserve some "movies", because, ahhh, they contain certain "positions" unlikely to be witnessed before or since outside of their "industry." I will therefore generously donate 500 burnt CD's of such movies to the people compiling this digital library.

    1. Re:No material can be ignored. by Anonymous Coward · · Score: 0

      vze23tnc@verizon.net

      spambots... mmmmmmmmmmkay?

  39. IT'S A CONSPIRACY by skinnydskitzo · · Score: 1

    begin sarcasm string:>:/ They abandoned the TIA program where the government saved every record made ever because people cried "big brother". Now they want to "preserve" every record and all data ever made, and people cry "hell yea" end sarcasm string

  40. Actually.. by NotAnotherReboot · · Score: 3, Informative

    From the article:

    On top of the $5 million the library received for planning the initiative in 2000, the plan approved yesterday releases another $20 million of funding to develop a system for evaluating and storing digital information. Just as the library receives more than 20,000 printed pieces each day but keeps less than half, it now faces the herculean task of deciding what digital information should be saved for future generations.

    --
    The library doesn't keep all of the printed information it receives; keeping all of the information online is an enormous, if not impossible, task. Archive.org has terabytes upon terabytes of data, and they don't even come close to having everything that was on the web at any one time. With the budget they're talking about, keeping all of this information would most definitely not be possible.

  41. Wonder what Disney will think by UTPinky · · Score: 3, Funny

    So what I want to know is, if one of Disney's movies gets archived, will they sue the Library of Congress?

    --
    I'm only paranoid because everyone is against me...
  42. How do we know its real by Timesprout · · Score: 2, Interesting

    I noticed in the article that one of the topics on which information was being preserved about was 9/11 and that got me thinking.

    On a broader scale news media love the internet because they can make outlandish claims when a story first breaks and then modify it as the facts become available. How do we know whats being preserved is accurate ?

    Secondly, do we trust the people controlling all this nice, easily modified information not to change it to suit some political whim ?

    They say the victor writes the history book. Digital storage will allow the victors to run a few drafts by their spin doctors first.

    --
    Do not try to read the dupe, thats impossible. Instead, only try to realize the truth
    What truth?
    There is no dupe
  43. Geocities by Detritus · · Score: 3, Insightful

    Geocities web pages may be exactly what a future historian is interested in. They tell you something about the common culture and people. Why do you think archaeologists are so fond of ancient trash dumps?

    --
    Mea navis aericumbens anguillis abundat
  44. From the viewpoint of meme theory... by asparagus · · Score: 3, Interesting

    The important information will save itself without outside help.

    For example, if talkorigins.org was wiped out of existence tomorrow, the theories it has created will live on in the minds of those who have read them. These essays can be easily recreated by re-reading the various creationist works. On the other hand, if the various creationist works were destroyed, they would probably not be recreated because they have already been refuted.

    The history of information is the history of massive portions of it being eliminated, but then either re-printed, re-discovered, or re-invented centuries later.

    The Catholic church 'knew' the earth was the center of the universe.

    Along came Copernicus with his helio-centric theory, and the popes tried to lock him in his house for his entire life.

    Now, if the modern versions of these men were to make the same claim, they would be soundly laughed at.

    So, while this is a noble effort, it is merely a collection of data. Time itself is the Bayesian filter that will determine which parts of the internet are important.

    -Brett

    1. Re:From the viewpoint of meme theory... by JanusFury · · Score: 0, Offtopic

      I agree with your point, but one thing to note...

      People have been saying Christianity is 'dying' or 'going to die' for thousands of years.

      It hasn't happened yet.

      Just some food for thought.

      --
      using namespace slashdot;
      troll::post();
    2. Re:From the viewpoint of meme theory... by asparagus · · Score: 1

      Christianity is a meme, just as science is.

      It has managed to survive by constantly evolving itself through appropriation of new theories. A hundred years from now, I believe that fundamentalist preachers will be espousing DNA from the pulpit and damning those who believe in quantum mechanics.

      The more things change, the more they stay the same.

      -Brett

    3. Re:From the viewpoint of meme theory... by cshirky · · Score: 3, Insightful

      "The important information will save itself without outside help."

      That's whistling past a pretty big graveyard.

      The problem is that time changes the definition of interesting. Would you be interested in the ads from a copy of the NYTimes.com from 1998? Probably not, unless you wanted to chuckle at the 667Mhz Pentia selling for $2500.

      Would you be interested in the ads from a copy of the New York Times in _1898?_ Those ads are a view into a world you never inhabited, and expose the preoccupations of the era in a way that the articles don't.

      We can look at the 1898 ads, not because the important information saved itself, but because archivists did. Someday the ads from 1998 will have the same interests for historians and anthropologists. Who will do the archiving there?

      If we leave it to the present to sort the good from the bad, the future will never know what we considered unimportant. If you'd asked anybody in 1960 what that era's biggest technological revolutions of the time were, they'd have all said atomic energy and space travel. The real answers turned out to be the transistor and the birth control pill.

      We are just about the worst possible people to ask what's important now, because we're too close, and it would be hubris to pretend otherwise.

      -clay

  45. great by Anonymous Coward · · Score: 1, Interesting

    Sounds great, why is it going to cost 100 million dollars? Can we say pork?

  46. CVS for the entire internet? by Mustang+Matt · · Score: 1

    Seems like they should hire Google to create a CVS-type cache of sites. Can you imagine the amount of storage that would be necessary to back up "the internet?!"

    --
    The man who trades freedom for security does not deserve nor will he ever receive either. - Benjamin Franklin
    1. Re:CVS for the entire internet? by mvdw · · Score: 1

      I dunno, I'll donate that box of floppies I have around here somewhere. Maybe if we all looked behind our collective couches we'd find enough floppies to take over the ... ahem ... back up the internet.

      The internet can't be that big that it can't fit on a couple of floppy disks, surely?

  47. Nineteen Eighty Four by Anonymous Coward · · Score: 3, Insightful



    In Nineteen Eighty Four, The Party embraced the digital revolution because they could easily control what the news said about them. (Who controls the past controls the future...)

    Anyway, the point is the government may not be the best to be in charge of this.

    </rant>

  48. TRBBTDDA by trmj · · Score: 2, Informative


    I believe you are talking about The Really Big Button That Doesn't Do Anything.

    A novel concept in its time, it was a strangely addictive big red button on a website. Established in 1994, and linking back to itself, it was more repetitive than Taco's story postings.

    As interest in it waned, though, they added a message board-ish thing that let people comment on the button. As it was quickly misused, the best comments were left and the worst deleted.

    There, the very first MS bashing in large amounts began with comments like, "Huh? A button that does nothing? Must be a new Microsoft product..."

    Although dead at the age of 5, its final resting place is in its original home, Spatula City.

    --
    Work sucked, until it became unemployment, when it became slightly more tolerable. -Tet
  49. How New? by t0ny · · Score: 1

    Things that NDIIPP catalogues from the distant past will one day be reported as 'news' on slashdot.

    --

    Manipulate the moderator system! Mod someone as "overrated" today.

  50. Oh kwell! by Anonymous Coward · · Score: 0

    Is the Library of Congress going to collect the world's largest pr0n archive?

  51. Wtf ? by IanBevan · · Score: 0, Redundant

    the $100 million National Digital Information Infrastructure and Preservation Program (NDIIPP). It is hoped that the project will lead to the preservation of data that is constantly changing on the Internet...

    1. Who is to be the judge of what is worth saving ? I mean, let's be honest, there's a *truckload* of 'internet' out there !!

    2. Wouldn't $100 million be better spent on a new hospital or two ? Just a thought...

  52. Come on!! In the era of distributed storage... by Anonymous Coward · · Score: 0

    Ok, might sound far-fetched but... Imagine a world where this library is distributed among a number of different locations. Imagine some googlistic magic that indexes the most accessed files in that timeframe and mirrors them for easier access. The fact that they are the most requested makes them optimal candidates as mementos for eternity. The fact that they are distributed makes them easier to access... and store. If only this happened for movie trailers...

    1. Re:Come on!! In the era of distributed storage... by anubi · · Score: 1
      AC, I think you are onto something there.

      If the content is worthwhile, people will hold a copy on their systems worldwide. If it's just junk that no one is interested in, and nobody in the world thinks it's worth a hoot, then it will fade into oblivion.

      That was my great hope in P2P networking. If it's decent, even if only one person thought so, it would be kept. Even though the Library of Congress may try to keep everything, there is much variance in demand for the data kept. The fact is that some data may be accessed far more or less than average. The data most desired will be most plentiful, the data least desired will be least plentiful, and the data nobody wants gets dropped into the bit bucket. The neatest thing is that no one in particular is on the critical path. The production of an entire civilization is maintained within that civilization by the civilization itself.

      Well, it was a star-trekish dream of mine that the public as a whole begins thinking as one organism, keeping the good stuff, excreting the junk, sharing useful stuff for all.

      --
      "Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]

    2. Re:Come on!! In the era of distributed storage... by hcdejong · · Score: 1

      If the content is worthwhile, people will hold a copy on their systems worldwide.

      IDK about you, but I rarely copy anything I found on the WWW to my local system. I just create a bookmark so I can find it again. When the original site disappears, I'm toast.

      If anything, hyperlink technology has made information less plentiful: where you have to buy a book if you want permanent access to it, for digital media a link will suffice (for most purposes). Few people will think about the possibility of the original site disappearing.

  53. Preservation vs DRM by dpilot · · Score: 4, Interesting

    Since the public domain died back in the 1920's, and since this is about digital content, it stands to reason that pretty much all of the content that LOC is talking of preserving will be covered by some sort of copyright, and an increasing portion will be protected by some sort of DRM. What will the LOC stand be on this?

    Since the LOC seems to hold some of the strings over implementation of the DMCA, they can obviously craft a loophole for themselves. But it will be interesting to see what that loophole is, and how it will work. Will they simply leave the stuff under DRM, and have their own copy of keys, or will they manage to have an unprotected copy?

    Enquiring minds want to know.

    --
    The living have better things to do than to continue hating the dead.
  54. CDs/DVDs don't degrade rapidly by CGameProgrammer · · Score: 1

    Well I don't know this for a fact, but they're NOT magnetic or even electronic. A laser beam is shone on the surface and it is diffracted and reflected back in patterns corresponding to the data on the CD. But the mirrored part isn't on the surface; clear plastic protects it, so I can't imagine any way this can degrade.

    DVDs are like CDs but the data is layered. The layers reflect different wavelengths of light so that allows the format to take advantage of depth. DVDs are thus slightly thicker than CDs but not larger.

    --
    ~CGameProgrammer( );
    1. Re:CDs/DVDs don't degrade rapidly by hcdejong · · Score: 1

      The mirrored part IS on the surface, or about one layer of paint away from it (it's right underneath the label). Which means the data is vulnerable.

      Also, the materials decay. There have already been reports of early CDs becoming unreadable because the aluminium started corroding. Who knows what will happen in 50 years?

      Yes, CDs are relatively stable, but even the manufacturers aren't promising a CD will be readable in 100 years.

  55. Actually cheaper to save everything by Mostly+a+lurker · · Score: 3, Insightful
    I think the practical solution with online data will be to save everything and worry about indexing and selection decades hence when we have much better technologies to carry out these tasks.

    The actual cost of storage is not that high. The highest costs are involved when human intervention enters into the equation.

  56. Re:Huh by Anonymous Coward · · Score: 0

    What is SO supposed to stand for?

  57. Backing Up the Internet by crashnbur · · Score: 1

    Can you imagine the computational power required for such a task? Now that's what I want on my desktop! (Where's the link to TheOnion's PlayStation 5 story when you need it?)

  58. good content will always persist. by jdkane · · Score: 1

    I think it is the responsibility of the creators of the content to deem whether it is important enough to keep on the Internet or not, or else to archive it. If somebody else tries to archive the Net then I believe we'll end up with 95% fluff, and 5% good stuff. The Internet is now so large at a single point in time that it's sometimes hard to find something current, let alone wade through years of archives. I say forget spending the money on archiving the Internet, which is already being done to some degree by the Wayback Machine, and leave the responsibility in the hands of the content creators/publishers. The good content will continue to survive.

  59. Re:Huh by Anonymous Coward · · Score: 0

    "significant other" you dope. damn newfangled gender-neutral shit.

  60. Google already does this by mrm677 · · Score: 2, Informative

    Google does not evict anything out of their cache. They just keep adding capacity. Hence Google can already see changes to websites. Granted I'm sure that this data isn't durable though.

  61. What about google? by 4_Scythe · · Score: 1

    I wonder if Google could do some sort of archive using its cache system? That is, a snapshot of a page's cache is permanently recorded at regular intervals.
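
    A rough sketch of what such interval snapshots might look like follows. It is not a description of how Google's cache works; the `SnapshotStore` class, its method names, and the in-memory dictionaries are all hypothetical, chosen only to show the idea of keeping every timestamped capture while storing each distinct page body once.

```python
import hashlib
import time


class SnapshotStore:
    """Hypothetical store: timestamped snapshots, each distinct body kept once."""

    def __init__(self):
        self.blobs = {}    # sha256 hex digest -> page body (bytes)
        self.history = {}  # url -> list of (timestamp, digest), in capture order

    def record(self, url, body, timestamp=None):
        """Record one snapshot; return True if the content differs from the last one."""
        if timestamp is None:
            timestamp = time.time()
        digest = hashlib.sha256(body).hexdigest()
        # Identical bodies share one stored copy, so unchanged pages cost little.
        self.blobs.setdefault(digest, body)
        versions = self.history.setdefault(url, [])
        changed = not versions or versions[-1][1] != digest
        versions.append((timestamp, digest))
        return changed

    def as_of(self, url, timestamp):
        """Return the body of the latest snapshot taken at or before timestamp."""
        candidates = [(t, d) for t, d in self.history.get(url, []) if t <= timestamp]
        if not candidates:
            return None
        return self.blobs[max(candidates)[1]]
```

    Calling `record()` on a schedule and `as_of()` with a past timestamp would give the "how did this page look then" view the comment asks about, under the assumption that fetching the page is handled elsewhere.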

  62. MGS:SOL by DonFinch · · Score: 1

    Oh no, metal gear solid sons of liberty comes true!

    --
    -- Insert wisdom here:
  63. Not just gender neutral by kfg · · Score: 1

    In fact it was first coined as a substitute for posslq. Now often used as a substitute for "look, it's none of your business just what our relationship is and I'm not prepared to talk about it."

    Perfectly normal, married couples use it in this sense.

    It's a nasty and vulgar bastardization of social language, but it has no real substitute I'm afraid.

    KFG

  64. Archive.org, and its limitations by Animats · · Score: 3, Interesting
    There is, of course, archive.org. That's a surprisingly small operation for what it does. A few volunteers work on the server farm (less than a thousand commodity PCs), and there's a little office at the Presidio of San Francisco. The web crawl is done at Alexa, and the Archive is filled from Alexa's backup tapes, which is why it runs so far behind.

    There's a live backup of the Internet Archive at the Library of Alexandria in Egypt. Thus, no single government can censor the archive. More duplicates may be established in other countries.

    Perhaps unfortunately, it's easy to remove material from the archive. Just put a "robots.txt" file on your site, and not only will it not be captured again, the archive will immediately refuse to display copies of the blocked site. This seems to be enough to keep the militant copyright holders happy.

    Most text is saved, but not all pictures, and very little video. This is good enough for most historical purposes.
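
    For illustration, the robots.txt exclusion described above can be checked with Python's standard `urllib.robotparser`. This is only a sketch: the helper name `may_archive` is made up, and the default user-agent string `ia_archiver` is an assumption about which crawler name a site would block, not something stated in this thread.

```python
from urllib.robotparser import RobotFileParser


def may_archive(robots_txt, url, user_agent="ia_archiver"):
    """Return True if the given robots.txt text allows user_agent to fetch url.

    user_agent defaults to "ia_archiver", which is an assumed crawler name.
    """
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

    A site that serves `User-agent: ia_archiver` followed by `Disallow: /` would make `may_archive` return False for every URL on that site.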

  65. web archiving of websites by chrisranjana.com · · Score: 1

    Yes it is already here.
    click here if you want to see how slashdot.org has changed over the years.

    --
    Chris ,
    Php Programmers.
  66. How to save digital information? by broothal · · Score: 2, Insightful

    It's always a good idea to save a piece of history. Traditionally, it's been done by writing a book. As we've seen, a book can be read thousands of years later. But what about digital information? The media types change rapidly and today's storage is obsolete tomorrow. So, how will the historians read a "Seedee" 100 years from now? Ok, assuming they actually managed to build a device that can read the data off a CD, the data will most likely be corrupted, since CDs have a limited lifespan.

    Now, the only way to accomplish this is to make it a dynamic storage. That is, go with the flow and when a new sooper dooper storage device is invented, copy the data to that, thusly ensuring two things. 1) The data is "refreshed" 2) The data can be read by the contemporary hardware.
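
    The "copy it forward" step above could be sketched roughly as follows, assuming both the old and the new storage are mounted as ordinary directories. The function names `sha256_of` and `migrate` are hypothetical, and SHA-256 is just one reasonable choice of checksum for confirming that the refreshed copy matches the original.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def migrate(old_root, new_root):
    """Copy every file under old_root to new_root and verify each copy.

    Returns a manifest {relative path: digest} that can be kept alongside
    the data and re-checked at the next migration.
    """
    old_root, new_root = Path(old_root), Path(new_root)
    manifest = {}
    for source in sorted(p for p in old_root.rglob("*") if p.is_file()):
        relative = source.relative_to(old_root)
        target = new_root / relative
        target.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(source, target)
        before, after = sha256_of(source), sha256_of(target)
        if before != after:
            raise IOError(f"copy of {relative} does not match the original")
        manifest[str(relative)] = after
    return manifest
```

    Keeping the manifest matters as much as the copy: it is what lets the next generation of hardware tell "refreshed" data from silently corrupted data.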

  67. ....what is worth saving? by Anonymous Coward · · Score: 0


    You sound like a fool. If you're smart, you'll save it all.

    Let me explain. It's just like backups. If you don't backup everything, you'll eventually have to explain to your CEO(fill in any pissed off person with power here) why "I didn't think we would ever need that." And it will be sooner than you think. Maybe it's later. Maybe it's 5 or 10 years from now. Maybe it's data that YOU will want. Anyway...bend over and take it like a man(in prison).

    To each his own I guess.

  68. Preservation vs Storage by bier · · Score: 1

    I think that a lot of these posts are missing a major point. There is a big difference between preservation of digital media (what NDIIPP and LOC are doing) vs storage of digital media (what GOOGLE and P2P systems do). When you preserve digital media you have to try to make it so that in the future (100s of years or more) people will still be able to access and view/hear, etc. this data. This might mean continually updating the file format ("format transformation") or it might mean trying to create hardware/software systems that can play back this media ("emulation"). This is a MUCH harder task than just storing the media and many schools and research centers are trying to figure out the best way to do this.

  69. Media by Anonymous Coward · · Score: 0

    There are several media/hardware questions: 1) Which media/hardware will it be stored upon? 2) Which media/hardware will you have to keep around for it be played/run upon? 3) How much upkeep will you have to retain for the media/hardware to be kept functional and to have enough parts lest something break down? Floppies become dysfunctional after four-five years, and even CDs and DVDs have their limits as well. And, how many redundant copies will you keep of the data (in case something goes wrong) ???

  70. Physical format? by yerricde · · Score: 1

    In fact, for true(r) long-term storage, it's recommended to copy the data from the commercial tape backup solution copy to plain old tar.

    GNU tar doesn't help if the physical format of the tape (before the operating system even gets to it) is unknown.

    --
    Will I retire or break 10K?