Slashdot Mirror


Vint Cerf Warns Against 'Digital Dark Age'

An anonymous reader writes: Vint Cerf, speaking at the American Association for the Advancement of Science, said we need better methods for preserving everything we do on computers. It's not just about finding better storage media — it's about recording all the aspects of modern software and operating systems so future generations can figure out how it all worked. Cerf says, "The solution is to take an X-ray snapshot of the content and the application and the operating system together, with a description of the machine that it runs on, and preserve that for long periods of time. And that digital snapshot will recreate the past in the future." Cerf is also pushing for better data preservation standards: "The key here is when you move those bits from one place to another, that you still know how to unpack them to correctly interpret the different parts. That is all achievable if we standardize the descriptions."

34 of 166 comments (clear)

  1. Already happened to a good portion of the early we by Anonymous Coward · · Score: 4, Insightful

    A lot of the original web is gone, whats left is crowded out by seo bs. And I'd rather not have and ad company decide what part of the web is relevant to me.

  2. That is what VM's are for by Anonymous Coward · · Score: 3, Funny

    just put the system in a VM image, save it, and there you go. Problem solved.

    1. Re:That is what VM's are for by scsirob · · Score: 4, Interesting

      I actually bought a license for VMWare 1.0 when it came out, dates back to 1/7/1999. My V2.0 license is from 2000. I also still have the executables from back then. Hmmm. Wonder if I can get them to run on today's Linux kernels.

      Which actually shows a prime issue with modern data and executables. There are a growing number of external dependencies. Stuff that is only accessible if DRM keys are available online, or when license activation servers are up and running. I have some stuff that only runs on Windows-XP. Even though I have a valid license, I may not be able to re-install it in 2025 or beyond, even in a backward compatible VM environment.

      --
      To Terminate, or not to Terminate, that's the question - SCSIROB
    2. Re:That is what VM's are for by lgw · · Score: 2

      Good luck on reading that VM image in 15 years.

      Just this week I ran a Commodore 64 emulator inside a Win95 emulator inside Windows server inside a VM. And it was still blazingly fast compared to the old C64 floppy drive!

      We're getting to the point now where the exponential increase in speed that made emulation so easy has slacked off, but we're also taking more care to properly document formats and archive software, so that less emulation is needed. I'm sure in 20 years I'll still be able to run that C64 software somewhere.

      --
      Socialism: a lie told by totalitarians and believed by fools.
  3. Re:Vint Cerf worried no one will remember him... by cheesybagel · · Score: 2

    A lot of people have been thinking about this problem of digital preservation for a long time. archive.org has a library of software which can run under emulation in Javascript on a browser. Basically the answer to his question is to work on emulation and archival.

    Google has done several steps backwards in their digital preservation projects lately.

  4. Welcome to the 90s! by Megol · · Score: 2

    Or whenever the first person warned about the same thing. Which was a loong time ago...

    1. Re:Welcome to the 90s! by ShieldW0lf · · Score: 2

      As we become more sophisticated, we design things that are more delicate. The more advanced we are, the less likely our creations will be accessible to those who come after we fall.

      Which, considering that we've demonstrated these capabilities once already, and considering how long we or bipeds like us have been around, implies that it's happened before.

      If there were more advanced civilizations before us, there's no reason to think we'd know about them.

      --
      -1 Uncomfortable Truth
  5. Re:Not everything is worth saving by ColdWetDog · · Score: 2

    Not everything needs to be preserved for future historians. Mortality and the oblivion of time are fundamental aspects of the human condition; therefore, the things that do escape oblivion, like better literature and song and monuments, serves as a kind of immortality for men who achieved something worthwhile. Your tweets don't deserve that kind of glory.

    Pretty much this. It is going to take millions of years to sort through the crap that is already here. Just wait until Web 3 or whatever is coming down the pike.

    We need to hit the delete key even harder.

    --
    Faster! Faster! Faster would be better!
  6. Our local time capsule... by bazmail · · Score: 4, Interesting

    I remember our Mayor presided over the opening of a 25 year old time capsule put there by the local schools. Inside was a lazer disc. When he asked to view the contents of it, nobody could find a device to play it. Vint is right. And its not just a DRM thing, its a lack of standards thing too.

    1. Re:Our local time capsule... by solios · · Score: 3, Interesting

      Part of my old job (in a museum Exhibits department) was upgrading interactives and videos from the 80s and 90s to modern equipment - that included "transferring" laser discs the old fashioned way - plugging one of the still-working players from the floor directly into the capture hardware.

      The thing is, I was transferring LD to DVD, which is actually a step *down* in quality. Kind of but not quite like how VHS is a step down from Beta (which I also dealt with).

      The great thing about standards is there's so many of them!

    2. Re:Our local time capsule... by Kjella · · Score: 2

      Well, luckily the trend has been away from tying one specific format to one specific media. Instead of the Audio CD you can have an MP3/OGG/AAC file that'll play from a HDD, SSD, USB drive, burned to a CD, DVD, BluRay, stored on a tape and so on. That eliminates the need for ancient equipment and media. Of course that doesn't really make it any easier for a time capsule, but the way to preserve is to copy forward. Along with integrity checking those photos you take today can be just as pristine in 100 years, unlike the photos of me as a kid. Maybe you could do better from negatives but the paper copies are all washed out and terrible after 30 years. And I doubt you'll have trouble finding a JPEG decoding library even if the RAW format has been lost in the mists of time.

      --
      Live today, because you never know what tomorrow brings
    3. Re:Our local time capsule... by Bing+Tsher+E · · Score: 2

      I remember our Mayor presided over the opening of a 25 year old time capsule put there by the local schools. Inside was a lazer disc. When he asked to view the contents of it, nobody could find a device to play it. Vint is right. And its not just a DRM thing, its a lack of standards thing too.

      In the case of the time capsule you describe, it was a dork thing, not a lack of standards thing.

      Whomever put the laser disk in the time capsule thought they were being all futuristic and stuff. They should have put a cassette in the capsule, or a vinyl LP. Or a message on fricking punched paper tape. Those were all common recognized standards 25 years ago. A Laser Disk was a 'woo-woo' futuristic bullshit thing. It didn't represent what things were like 25 years ago, it was just someone being stupid and buying hype about what would exist in the future.

    4. Re:Our local time capsule... by Bing+Tsher+E · · Score: 3, Informative

      Last week I made a fresh copy of my 'archive' of everything computer related from my 20's. I copied everything off the 5 DVD-R disks that I burned in the early 00's onto a USB hard drive.

      What was on the 5 DVD-Rs was what I copied then off about 30 CDR disks.

      What was on the earliest few of the CDR disks was what I had copied there off DC2120 tape cartridges.

      There is even one of the DVDs arranged with folders called 'CD4, CD4, CD6' and some of the CD folders have subfolders with names like 'Tape7, Tape8, Tape8.;

      I might still have the original CDs in a cakebox somewhere, the DS2120 tapes are long gone.

      I still have all the Windows 1.0 apps that I downloaded off BBSes back in the day. I still have every version of PC-DOS. I still have Microsoft Word 5.0 and all the Borland programming languages and all that stuff stored away. Linux install sets with 0.99.x kernel versions. And all my personal files, email, etc.

  7. Re:Not everything is worth saving by bazmail · · Score: 5, Insightful

    Not everything needs to be preserved for future historians

    With respect, what is and what isn't worth preserving is up to future historians to decide. A bit chicken and eggish but there you are.

  8. Re:Sheldon Cooper sniffs Penny's pooper! by BarbaraHudson · · Score: 4, Insightful

    Well, I must say, that's a better argument for NOT preserving everything that's on the web than I was going to make,

    That being said, from the second link:

    The obsolescence of data is a real problem. Much of my old digital art is on Jaz discs, which are obsolete and very expensive to get transcribed.

    Couldn't have been THAT important if you didn't make a copy to other media when you saw that Jaz disks were obsolete. Just like there were probably 1,000 floppies (5-1/4, 3-1/2) that I tossed while going through my "archives." Anything important was long moved to other media.

    --
    "Transparent" is a shit show that trades on every stereotype going. A man in drag is NOT a transsexual.
  9. So far so good by Dan+East · · Score: 2

    I can fire up a TI-99/4A emulator or an Amiga emulator and run all my old software from three decades ago. Can someone name a general purpose computing platform (IE not including mainframes or supercomputers or other exotic low-volume hardware) whose software we cannot execute using an emulator?

    --
    Better known as 318230.
    1. Re:So far so good by perpenso · · Score: 2

      What mainframes lack in hardware volume they make up for with users. :-)

      An interesting list: http://en.wikipedia.org/wiki/L...

    2. Re:So far so good by ihtoit · · Score: 2

      XBOX Classic. ...

      "Xbox is just like a PC, it's easy to emulate!"

      Yes, we've all heard this silly and pointless argument a million times and it usually ends in the same, and rather ignorant conclusion (or should I say assumption) that just because the Xbox is PC similar, it's hardware should be relatively easy to emulate. That's a very wrong frame of mind. How hard can it be? Very. Xbox's hardware is very complex and still poorly documented to this day. This requires some explanation.

      1. Is a PC easy to emulate? Well, I wouldn't say so myself. Take a look at the source code from bochs. A lot of source code/work isn't it? The 486 emulator for the Archimedes RISC architecture was a: limited in scope and b: REQUIRED an Archimedes 3000 or faster, RISC OS 3.x AND an internal hard drive.

      2. Emulating an x86 CPU is a lot harder than it sounds. I don't know where this mindless assumption comes from. Yes, there's loads of documentation on how the x86 processor *works*, but that doesn't exactly make it easy. First of all, the x86 instruction set is M-A-S-S-I-V-E! There can be at least 20 different versions of one instruction (i.e. There are many different versions of the MOV instruction, as well as INC, DEC, ADD, SUB, SHR, SHL, AND, OR, XOR etc.) and it takes time to implement them all. Of course, that's not exactly difficult. The real problem is that any modern x86 processor including the entire Pentium line from III/Coppermine onward can execute multiple instructions at once. So it's not like emulating a Z80 doing one instruction at a time. The actual algorithm and how x86 does this is undocumented and still unknown. In short, the Xbox's CPU can be emulated, but not accurately and not in real time - which is the whole point of a useful emulation.

      3. Emulating any hardware by NVIDIA is not a walk in the park! The Xbox's GPU, the NV2A is often assumed just a GeForce 3. It's not! It's similar but not identical. It has some GeForce 4 capabilities too, so it's more of a cross between an NV20 and NV25. This is by no means easy to emulate either. NVIDIA's GPUs have very large register sets and afaik not even half of them have been discovered, and a large portion of known registers have unknown purposes. There is little to no documentation on how NVIDIA GPUs work. The best thing to do is to look at similar GPUs such as RIVA, TNT, and older GeForce cards. Some registers are similar, but not identical. The best place to look for information is in open source drivers available on the net. Adding to the dificulty is that no one has ever discovered how pixel shaders work on NV2x cards, vertex shaders yes though. The Xbox GPU also has exclusive registers that are not found in other GeForce cards. Information on the NV2A's GPU registers were just hitting hacker boards three years ago. And yet, there's still a long way to go. The GeForce 3 series is the most mysterious of all NVIDIA GPUs (G7x and G8x aside) and the NV2A is alot worse. "But can't you just directly execute the NV2A instructions on another NVIDIA card?". No, it is impossible. Its MMIO addresses are different and the exclusive registers must be emulated. Plus, in windows, we don't have ring 0 access anyway, so you all can scratch that idea now. Then comes the NForce 2 chipset. This is where it get easier. The NVIDIA MCPX is the control centre for things such as audio, USB for input, Network adapters, PCI, AGP, etc. These things are not really that difficult to emulate IMO except for the audio.

      4. The Audio system is rather complex. Xbox's audio consists of at least 4 DSPs, an audio codec (AC '97) and an NVIDIA SoundStorm APU. The DSPs shouldn't be a problem (just figuring out what they all are is) nor should the AC '97 but the NVIDIA SoundStorm APU is the really difficult part. Information is scarce to nonexistent.

      5. The Xbox BIOS isn't fully understood. The basic execution process of the BIOS is understood, but details on the process are at a loss. What we do know gives us hints, but before the BIOS can be emulated, w

      --
      Political debates have me rolling my eyes so much I think I got optical whiplash. I should sue. - Foamy The Squirrel
  10. Re:Cerf by bazmail · · Score: 2

    Nothing to do with DRM. If the DRM scheme/keys are preserved then whats the difference. And besides in 50 years time refrigerators will have more computing power than todays data centers and devices will crush todays DRM like it was nothing.

  11. Re:Already happened to a good portion of the early by Anonymous Coward · · Score: 3, Funny

    A lot of the original web is gone, whats left is crowded out by seo bs. And I'd rather not have and ad company decide what part of the web is relevant to me.

    Luckily I saved it to a floppy.

  12. The Apple business model. by solios · · Score: 4, Insightful

    Anyone who's used Apple software for more than five years has been burned by forced format obsolescence - ClarisWorks, AppleWorks, old QuickTime codecs, the PICT format, SimpleText, Font Suitcases, the list goes on. And on. And that's just *one* platform and set of formats off the top of my head. I lose data to software "upgrades" so often that it's the single biggest determining factor in my upgrade cycle and a huge determining factor in the uptake and use of new software. We aren't heading for a digital dark age - we're in one already.

    1. Re:The Apple business model. by Bing+Tsher+E · · Score: 2

      A long and twisty conversion path. Just like with Microsoft. You need Word for Windows 2.0 to read the Word for MS-DOS files. You need Word for Windows 6.0 (Office 4.3) to read the Winword2 files. You need Office 97 to read the Office 4.3 files. It's even worse with Apple, because they can.

  13. Probably short sighted. by dmomo · · Score: 2

    If we survive as a society, in 500 years, our technology will be so advanced there will be systems we cannot even conceive of that capable of analyzing pretty much any data or bytecode you throw at it. Documentation or support systems will most likely serve a more historical than practical purpose.

  14. Re:Our time capsule...Library of Congress by BoRegardless · · Score: 2

    Library of Congress seems like the logical place to set up to archive old OS's, hardware, emulation and other items needed to read, archive, restore & recover old media. That is what the LoC does for 'documents.'

  15. Already happened to a good portion of the early we by jordanjay29 · · Score: 3, Informative

    I think this is Mr. Cerf speaking as the man who was instrumental in the Internet's creation, not as a Google employee.

  16. FUD ? by swell · · Score: 2

    This is a great idea. Preserve previous generations of software & data, emulate old hardware . Then we will be able to enjoy all the goodies from the Apple ][, IBM 360 and Commodore 64 era!

    But wait, we can already emulate just about any old equipment. Most of what was worthwhile on floppy disks or tape is now online, available to most of us. Even our government, slow though they may be, has found ways of bringing old software & data to modern machines. Cloud storage and networking brings more interoperability over time and the future looks bright. Movies from the 1920s are available on modern media as well as Edison's cylinder recordings. So what's the problem? Oh, your dad's home movies. Sorry about that.

    --
    ...omphaloskepsis often...
  17. only ancient encryption not breakable by fast comp by raymorris · · Score: 2

    >. You obviously do not understand encryption. Unless a weakness in the underlying algorithms is found, "a faster computer" will never be sufficient to break modern encryption.

    Indeed you seem to be completely ignorant of the subject. The whole science of encryption is all about finding operations that a) can be done quickly by a smart phone yet b) cannot be undone slowly by a cluster. That's far from a solved problem. In fact it's funny you mention "modern encryption" because ALL modern methods of encryption have been broken within about 10-30 years. The ONLY unbreakable encryption is an old method, the one-time-pad. It's unbreakable because the key is at least as long as the combined total of all of the messages it will ever encrypt. That makes it not particularly useful in most cases. Any and all other methods of encryption are subject to at least brute-force attack, which means they can be broken almost instantly, given sufficient computing power.

      A strong cipher is one which takes a lot of computing power to break. That can calculated as (resources required to brute force) / (shortcuts known). Both of those factors always get less secure over time. The cost of the computing resources required to break it drops quickly, while at the same time new methods are discovered to break it with smaller amounts of resources.

  18. Stop using non-Free Software by jbn-o · · Score: 5, Insightful

    Quite right, in fact most of what gets posted to /. including this story could be responded to with a phrase Eben Moglen has been saying for years in his talks: "RMS was right". Richard Stallman had it right years ago and, equally importantly, for the right reasons. Not "Open Source" (the younger movement Brad Kuhn rightly points out is built to greenwash proprietary-supporting non-copylefted Free Software (copy 1, copy 2) but strongly copylefted Free Software released and developed for freedom.

    The Affero GPL version 3 or later will keep software Free as in freedom and meet the needs of the future. Users will undoubtedly want to know how things work and benefit from software written by programmers allowed to understand how things work. This will help us avoid the very trap the grandparent post referred to (and you wisely advised against).

  19. Re:Legacy Support by itzly · · Score: 2

    There is no good excuse for Apple, Microsoft and others making the OS's not compatible with legacy software.

    Sure there is. It costs a lot of money to keep everything compatible.

  20. Re:only ancient encryption not breakable by fast c by Dutch+Gun · · Score: 4, Interesting

    Any and all other methods of encryption are subject to at least brute-force attack, which means they can be broken almost instantly, given sufficient computing power.

    I think you may be underestimating how much computing power is required to brute-force attack modern encryption, especially when using a sufficiently long key. At the moment, we're talking about a modern PC operating until the heat death of the universe timeframe to break the encryption of a 2048-bit SSL certificate. Many of the early encryption schemes were broken because of flaws in the algorithms which allowed massive shortcuts to be taken or were weakened with very short keys (remember 48-bit keys?). Remember, with every bit added to the key, we double the inherent strength of the encryption, and cryptologists have gotten much, much better at creating incredibly secure algorithms as well.

    It really isn't just a matter of waiting for hardware to catch up. Even with exponential speed increases in computing power (which isn't happening anymore, btw), in 30 years, we'll still be nowhere close to breaking today's state of the art encryption unless breakthroughs have been made that allow us to shorten the compute time via a weakness in the algorithms. It would take an unbelievable leap in computational efficiency (say, quantum computing) before we can even dream of brute forcing keywords of today's most secure algorithms, even within our lifetimes.

    --
    Irony: Agile development has too much intertia to be abandoned now.
  21. Re:Not everything is worth saving by Greyfox · · Score: 3, Funny

    Yeah! 200 years from now, all the music of today will be considered classical and be played on future-NPR with the same pretentious tones they play Mozart in today. "That was Dr. Dre's 'All My Bitches' in D-Minor. After the break we'll be playing 'Smell Yo Dick' and 'What What, In The Butt?'"

    --

    I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

  22. I would suggest vinyl disks by fustakrakich · · Score: 2

    Some of mine are over 50 years old and still work perfectly. Reproduction doesn't even require electricity. They are very low maintenance, but not very space efficient.

    --
    “He’s not deformed, he’s just drunk!”
  23. Re:Not everything is worth saving by ganjadude · · Score: 2

    reading thaqt post in a low monotone NPR voice just made my day

    --
    have you seen my sig? there are many others like it but none that are the same
  24. RIP iPod click-wheel apps by CODiNE · · Score: 2

    I used to really like the Tempest game for that, and the card game wasn't too bad.

    I wanted to get Monopoly and Mahjong for it and some other games but by then they stopped selling them on the iTunes store. There was a pirate torrent going around but the apps were encrypted and no way to install them on another device. Finally they just stopped making them.

    Apps like that... gone forever.

    iOS 3.0 apps that got the App store going... gone forever. I still remember playing the unofficial lights off game, beat all 150 levels. (Wrote a program to solve them)

    Unless someone wants to make an emulator for the original iPhone you could do it by downloading the ROMs just like old Apple emulators, but how would you approve the apps without a 3.0 app store around?

    In theory someone could crack Apple's old signing keys and have a local "FakeAppStore" program that validates them and allows installation on the emulator.

    The "cease and desist" letter would probably arrive less that one minute after putting such project online.

    --
    Cwm, fjord-bank glyphs vext quiz