Slashdot Mirror


Long-Term Personal Data Storage?

BeanBagKing writes "Yesterday I set out in search of a way to store my documents, videos, and pictures for a long time without worrying about them. This is stuff that I may not care about for years, I don't care where it is, or if it's immediately available, so long as when I do decide to get it, it's there. What did I come up with? Nothing. Hard Drives can fail or degrade. CD's and DVD's I've read have the same problem over long periods of time. I'd rather not pay yearly rent on a server or backup/storage solution. I could start my own server, but that goes back to the issue of hard drives failing, not to mention cost. Tape backups aren't common for personal backups, making far-future retrieval possibly difficult, not to mention the low storage capacity of tape drives. I've thought about buying a bunch of 4GB thumb drives; I've had some of those for years and even sent a few through washers and driers and had the data survive. Do you have any suggestions? My requirements are simple: It must be stable, lasting for decades if possible, and must be as inexpensive as possible. I'm not looking to start my own national archive; I have less than 500GBs and only save things important to me."

669 comments

  1. Amazon S3 by bokmann · · Score: 0

    Amazon S3. dirt cheap, there forever.

    1. Re:Amazon S3 by Anonymous Coward · · Score: 1, Funny

      Yes, and it's better than dirt.

    2. Re:Amazon S3 by Anonymous Coward · · Score: 5, Informative

      Its not really that cheap, and not that simple to use for personal backups. Unless you are willing to write your own backup scripts, its going to be a headache.

      Querying S3 for a list of stored files is *very* slow, and you only get 1k results per query. This means you have to index what files you put in S3 in a local db. This allows you to ask the db what files are there (and how to grab them).

      If you only have a few files you can use the S3 browser extension for Firefox (or one of a many file system mounting, ftp simulating, etc tools). Just keep in mind the 1k file limit per query and box things in folders of no more than 1k items. Otherwise you will have a very slow browsing experience.

      I have around 120 GB of family photos and purchased mp3s that I would like to store. To store 120 GB at .15 per gigabyte/month for 1 year would cost me: $216 (at $18 a month).

      We use it where I work, with great success, but it would be much to much work for me for a personal backup system.

      Considering the cost, I would go with a consumer targeted app (there are LOTS of them). A number of them charge a flat flee for "unlimited" storage. Beware of how you interface them. Some support windows only.

    3. Re:Amazon S3 by TheOtherChimeraTwin · · Score: 5, Informative

      That is an interesting suggestion, but 500GB would cost $900/year (plus transfer costs) which I don't call "dirt cheap". As far as being there forever... who knows if there will be an Amazon in 10 years? Amazon might be more stable than most hosting options, but forever is a long time.

    4. Re:Amazon S3 by Anonymous Coward · · Score: 0

      Amazon S3. dirt cheap, there forever.

      Explain "dirt cheap", and "forever". The OP said he didn't want to pay yearly fees, but perhaps a one-off payment would be acceptable to him.

      So, with S3, can you pay once for 25 years storage? How much per gig?

      Come to think of it, is there any website anywhere offering pay-once, 25-year guaranteed storage and access?

      That would be great for "store and forget" applications, such as leaving things for (near) posterity.

    5. Re:Amazon S3 by Albanach · · Score: 1

      Well it's not dirt cheap for 500GB - that's $75/month. But who really has 500GB of critical data.

      Many folk here probably make a living out of making sure company data stays around 'forever'. It can easily be a full time job. The lesson from that is that if you want data to stay around forever you need to look after it yourself or pay someone else.

      S3 is a perfect solution for this. BeanBagKing needs to decide what data he really needs preserved forever then pay accordingly.

      The other alternative is doing it yourself with RAID arrays at different locations. Sync your 'must be preserved folder' nightly to both locations using rdiff-backup or similar. Regularly test recovery and make sure all the disks in both arrays are working without error. This is more challenging these days as domestic broadband sees caps being introduced, but for moderate volumes of data it should work fine. If they have a garage separate from their house they may be able to host a RAID array in the home and in the garage, and hope that any fire/flood/tornado/earthquake doesn't take out both.

    6. Re:Amazon S3 by Anonymous Coward · · Score: 0

      There forever, really? So if Amazon fails or gets bought out there's no chance of losing it? Yeah right, I believe that.

    7. Re:Amazon S3 by Anonymous Coward · · Score: 4, Informative

      Jungledisk takes care of all the tedious backup stuff for you, and it is only a one time charge for the app.

      But you're right, S3 isn't cheap. To store 500 GB of data would be about $75 a month, plus the $50 to put it on the server in the first place.

    8. Re:Amazon S3 by Anonymous Coward · · Score: 1, Interesting

      Well it's not dirt cheap for 500GB - that's $75/month. But who really has 500GB of critical data.

      If most of that 500GB are photo's then for $24 a year, one can get a pro account at flickr with unlimited storage and retrieval.

      The rest can then be backed up to Amazon's S3 (which is only $0.15 a GB, very reasonable for small amounts)

    9. Re:Amazon S3 by bcrowell · · Score: 3, Insightful

      Well it's not dirt cheap for 500GB - that's $75/month.

      The OP specified "decades if possible." So we have two problems here. One is that 30 years at $75/mo comes out to $27,000, which is a little pricey. For that amount of money he could probably hire someone to come to his house once a year, verify the readability of his media for him, and transfer them to new media as the old ones become obsolete. $1,000 for a few hours' work? I'd take it.

      The other problem is that the probability that Amazon S3 will exist in 30 years is very low. This is basically the problem with any possible answer to his question. There isn't any computer-related service or equipment that you can be sure will still be there in 30 years. A more realistic goal would be to do it in 10-year steps; if that's all he wants, then the shoebox full of flash drives should work fine, and then 10 years from now he can transfer those data to something else.

      But who really has 500GB of critical data.

      I initially misread that as 500 Mb, which is about the amount of critical data I have that needs backing up. 500 gigabytes is kind of a crazy amount of data. One way to get that much might be that he has a gigantic collection of mp3s, or possibly a moderately huge collection of music in a less lossy format. But then that's not critical personal data, it's just a music collection. And the chances are that as the decades go by, he'll realize that the music he thought was so important and wonderful in 2008 no longer seems so important to him. I know plenty of people who still have their Kool and the Gang LP's from the 1970's, but it's not like they're willing to spend a thousand dollars a year to obsessively maintain them.

    10. Re:Amazon S3 by Gilmoure · · Score: 1

      I have about 800 GB of ripped movies (replacing stupid Sony DVD juke box with Apple TV setup). Now, sure, I have the original disks (330 or so movies) so yeah, I wouldn't lose the data but I would lose the 5 months time it took (ripping 3-4 disks a day) to rip. For now, I'm swapping out 1TB external drives when I add movies (post Christmas will suck) but yeah, I could use some kind of decent backup system.

      --
      I drank what? -- Socrates
    11. Re:Amazon S3 by Jeppe+Salvesen · · Score: 1

      Nobody knows where Amazon will be in 10 years. However, it seems unlikely they will go under in such a way that all files are lost and cannot recover.

      I've uploaded close to 100gb of RAWs onto amazon s3, and the bill is not frighetning.

      However, I also keep a local copy on my current harddrive.

      For me to lose all my data, I will have to both experience local failure and amazon failure at the same time. Though this is technically possible, I think that nuclear holocaust will leave me with larger problems than the loss of my files.

      --

      Stop the brainwash

    12. Re:Amazon S3 by Duckie01 · · Score: 1

      Except that I'd rather spend the money all at once and actually have something, like a nice NAS sollution with 2 raid disks in it. Could be as simple as this. Way faster, failing disks covered by raid.

      Only advantage I'd see for Amazon is that it's offsite, so you get to keep your data in case your house burns down. Yay!

    13. Re:Amazon S3 by tomhudson · · Score: 1

      If most of that 500GB are photo's then for $24 a year, one can get a pro account at flickr with unlimited storage and retrieval.

      The rest can then be backed up to Amazon's S3 (which is only $0.15 a GB, very reasonable for small amounts)

      ... or store the rest of the data in images using teganography ...

    14. Re:Amazon S3 by maxume · · Score: 1

      A lot of people shoot home movies and then become obsessed with preserving the footage. That adds up fast.

      It seems that deciding what isn't important is a hard part of backing up.

      --
      Nerd rage is the funniest rage.
    15. Re:Amazon S3 by malefic · · Score: 1

      I would figure that it's video... specifically, probably family home movies. If that's the case, I could see it getting up to 500GB taping birthdays, Christmas, family get togethers, vacations, etc... And those are the type of things that become almost more precious with time as opposed to less, so it's doubtful he'd look back on it and decide it wasn't important.

    16. Re:Amazon S3 by Anonymous Coward · · Score: 2, Informative

      What the OP needs to understand is that someone has to do periodic maintenance over those decades. Either he does it himself or hires someone (either directly or as subscription to a service). All archiving requires monitoring of the storage media and preventative transfer to newer media before the information is lost. Given that media is not perfectly predictable, you need redundancy and frequent monitoring too, so you can discover a data loss and restore redundancy before all media sets are lost.

      Using flash/solid state drives is probably going to improve the shelf-life of a storage copy. But you still have to monitor for loss. It doesn't reduce the chances of theft, fire, flood or simple misplacement.

      I do it myself, with cheap Linux software RAID boxes in separate locations, piggybacking on existing Internet services to do rsync mirroring between the sites. The machines are kept alive, running disk scans. The rsync mirror process is also periodically accessing the files at application level. I may add some cron job to do checksum verification some day.

      The maintenance cost is the power supply, internet service, eventual hard disk replacements (for failure or size increase) and my time. It is the cheapest solution for me that has the level of reliability I can understand and control myself. It is affordable because I have a friendly site letting me colo my remote box for free/good will. I don't pay their internet or electricity bill. I am geeky enough to consider the several hours per year of effort to be part of my computer hobby.

      My solution has evolved to this point and run stable for the past 5 years or so. Prior to that I did similar concepts but with less reliable equipment for each mirror, e.g. depending on fileservers at school and work that I could piggyback on with a few tar files, and just using single-disk machines locally. I've propagated my important date in "online" form for about 15 years now. Anything from before that was floppy disks and is lost to me, but also doesn't matter because we're getting back to teenage years by then.

    17. Re:Amazon S3 by grumbel · · Score: 1

      Well it's not dirt cheap for 500GB - that's $75/month. But who really has 500GB of critical data.

      Who can tell what data what data will be critical in 25 years down the road? Especially with random stuff from a home user things that you might consider random junk today might very well turn out to be the things that you want back most a quarter decade later.

    18. Re:Amazon S3 by mfnickster · · Score: 2, Insightful

      > I would figure that it's video... specifically, probably family home movies.

      Digital video has opened a HUGE new can of worms. We have problems even today viewing video created just yesterday (especially over the Web) because of all the myriad codec standards.

      Imagine what it'll be like in 20 years-- anything other than NTSC, PAL, or SECAM will be effectively extinct.

      --
      "Slow down, Cowboy! It has been 3 years, 7 months and 26 days since you last successfully posted a comment."
    19. Re:Amazon S3 by Myrddin+Wyllt · · Score: 4, Funny

      Of course Amazon will always be there, just like Ford and GM....

      --
      [ ]Half Empty [ ]Half Full [x]Twice as big as it needs to be
    20. Re:Amazon S3 by Bandman · · Score: 2, Interesting

      I have trouble with this in my mind, since so much of my work is devoted to making sure that information always exists and is accessible all the time. I look at these personal solutions for backup, and I'm so used to evaluating enterprise-type products that I scoff.

      I guess that I don't know anymore what is appropriate for 'home users' when they say they want to keep data long-term. The submitter stated that tape drives were inadequate when that's still the most reliable method that enterprises use.

      Sure, there are VTLs, but to not keep your data offsite as well would be counterproductive. I guess I just don't know what the submitter wants.

    21. Re:Amazon S3 by Mutant321 · · Score: 0, Redundant

      Jungle Disk is also very useful for S3. Allows you to mount drives in Windows, Linux or OSX. You have to pay, but only once per S3 account (so can use on multiple machines).

      But as you say, it's too expensive for personal use once you hit the hundreds of Gigs. I'm sure it'll come down over time, but the reality is it's just expensive to store things online.

      I really think the best solution is to use external HDDs (or even internal since he's not worried about access). Failure is a possibility, but they're pretty cheap, so just buy two and back it up twice.

    22. Re:Amazon S3 by erc · · Score: 1

      "I'd rather not pay yearly rent on a server or backup/storage solution"

      Amazon S3. dirt cheap, there forever.

      Either you can't read, or you're a shill for Amazon, which is it?

      --
      -- Ed Carp, N7EKG erc@pobox.com PGP KeyID: 0x0BD32C9B What I'm up to: http://intuitives.mine.nu
    23. Re:Amazon S3 by Anonymous Coward · · Score: 0

      Troll? Since when is a reference to a large stash of porn on Slashdot considered trolling? +1 funny damn it. Or maybe +1 insightful as that would be a great way to reduce the amount of storage necessary. But troll? You should lose your moderator points around here :)

    24. Re:Amazon S3 by vux984 · · Score: 4, Insightful

      A lot of people shoot home movies and then become obsessed with preserving the footage. That adds up fast.
      It seems that deciding what isn't important is a hard part of backing up.

      After a loved one dies, even the lowest quality outtake with a thumb covering half the shot can be a priceless memory.

    25. Re:Amazon S3 by hobbit · · Score: 1

      Amazon S3. dirt cheap, there forever.

      For unpredictably short values of forever.

      To the OP... your question reads like this:

      "Yesterday I set out in search of a way to eat my cake, pastries and sweetmeats without ceasing to have them. What did I come up with? Nothing."

      Just keep good backups of your current data, and every time you buy a new hard drive, create a folder called "contents of old hard drive" and copy everything into that. You'll still have the other 90% of the new drive's capacity to fill up with new stuff.

      --
      "Wise men talk because they have something to say; fools, because they have to say something" - Plato
    26. Re:Amazon S3 by paganizer · · Score: 1

      it's kind of crazy, anyway.
      I have a lot of digital media I keep backed up, plus source code, records. all told maybe 150gb.
      but, guess what? follow these simple steps:
      go to pricewatch. order a cheap barebones system, $150.
      Order 3 1TB SATA drives, and a 80gb drive, $300
      Install debian on the 80gb drive. make the 3 1TB drives into a 2TB software RAID 5 array.
      Hook into the network, in your garage preferably.
      Tadaa! 2 terrabyte of storage, costing $450, that will last you for years and years.
      note: dust it occasionally. I would also suggest using wake on LAN to keep your power bills low, unless you live in a place where there are temperature extremes that might effect it.

      --
      Why, yes, I AM a Pagan Libertarian.
    27. Re:Amazon S3 by toddestan · · Score: 1

      500 gigabytes is kind of a crazy amount of data. One way to get that much might be that he has a gigantic collection of mp3s, or possibly a moderately huge collection of music in a less lossy format. But then that's not critical personal data, it's just a music collection.

      I have about that much in slides I have scanned into TIFF files, so it's not unreasonable. In my case, I'm not too worried about it, as it's stored on several computers, and a few external drives too, plus burned to a set of DVDs. Harddrives don't last forever, but if you have enough people storing the data on their computers and transferring it as they upgrade, there is going to be copies around.

    28. Re:Amazon S3 by tverbeek · · Score: 1

      The only thing that has to happen for those files to vanish is for someone at Amazon to say, "We're getting out of this business now." They don't have to actually go out of business themselves.

      --
      http://alternatives.rzero.com/
    29. Re:Amazon S3 by Anonymous Coward · · Score: 0

      So.. you need a local db - where do you store that safely? I guess you can follow me with this recursive paradox...

    30. Re:Amazon S3 by DontPanic6x9 · · Score: 1

      I think the moderator was referring to the grammar mistake. He used than when he should have used then.

    31. Re:Amazon S3 by SanityInAnarchy · · Score: 1

      Unless you are willing to write your own backup scripts, its going to be a headache.

      Many of the scripts are written -- I kind of like Duplicity. But I agree, more needs to be done.

      Querying S3 for a list of stored files is *very* slow, and you only get 1k results per query.

      How slow is "very" slow?

      And "only" 1k results -- sounds like you need to reread the API docs, specifically the "prefix" option. If your files are stored hierarchically, you could use a single query to return everything matching a prefix of some path -- when was the last time you had more than 1k files in a single directory?

      And that isn't even that limit. From the API docs:

      You can iterate through large collections of keys by making multiple, paginated, list requests. For example, an initial list request against the dictionary bucket might only retrieve information about the keys 'quack' through 'quartermaster.' But a subsequent request would retrieve 'quarters' through 'quince', and so on.

      In other words: Your complaint is that, if I somehow needed to view more than a thousand items, it would take two HTTP requests. I consider that an acceptable performance hit.

      This means you have to index what files you put in S3 in a local db.

      *sigh*

      Or, you could put them in a DB file -- maybe an sqlite database -- which you store at some preset S3 URL. Or you use Amazon's SimpleDB. No need for it to be local.

      Or, as above, you use S3 properly, and you don't have that problem. This post is already too long, or I'd design one right now...

      I have around 120 GB of family photos and purchased mp3s that I would like to store. To store 120 GB at .15 per gigabyte/month for 1 year would cost me: $216 (at $18 a month).

      True -- and that's just storage. It would also cost you $144 to upload it all, plus a few pennies in requests. And another $244 to download it all.

      Still, I absolutely would recommend it in small quantities, for things like financial data, or things which you've created yourself -- in other words, things which would be irreplaceable, and are relatively small. As far as I know, S3 has occasionally been inaccessible, but it's never actually lost data.

      However, I would also challenge you to find something with as much redundancy as S3 for the same price. You mention a consumer-targeted app -- do you have one which doesn't require Windows, and also doesn't include vague clauses in their terms of service like "excessive amounts"? It would kind of suck to upload those 500 gigs to the backup equivalent of Comcast.

      --
      Don't thank God, thank a doctor!
    32. Re:Amazon S3 by SanityInAnarchy · · Score: 1

      There's also the fact that when the first disk fails, someone has to notice, buy a new one, and replace it. This leaves a period during which, in a typical configuration, one more disk failure will obliterate all data. If you don't notice the disk failing, it could be a considerably long period.

      So, the additional advantage to Amazon is that it truly is "set it and forget it" -- upload it once, no need to monitor it until you need it.

      And yes, there's the possibility of your house burning down. Or getting struck by lightning. Or you spill Mountain Dew on the NAS. Or any of a large number of things.

      I would say, prioritize your data. Most of the time, there's a small amount of data which cannot ever ever ever EVER be lost -- stuff like financial history, that book you're writing, etc. Even if you throw in, say, /etc on Linux, it's still a small enough amount of data that it would cost pennies to store on S3.

      Put the rest of it -- your porn, whatever -- on that NAS. This is stuff that it would suck to lose, but it's not exactly irreplaceable -- or it is, but you're not willing to pay a premium to protect it.

      --
      Don't thank God, thank a doctor!
    33. Re:Amazon S3 by afidel · · Score: 1

      DLT is 24 years old and tapes written on 14 year old DLTIV tapes are still readable both physically and logically today (assuming tar format for backups). If you had tapes from 1989 onward you would need 2 drives to read every tape, something that can read DLTIII and a new SDLT 320 drive to read DLTIV onwards. It's not the cheapest solution for home backups but if you are serious about longterm archives it's about the only solution that doesn't require refreshing data.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    34. Re:Amazon S3 by afidel · · Score: 1

      Yep, over Thanksgiving we watched 32 year old wedding footage with my wife's grandmother and parents, the glimpses of her grandfather and great-grandfather brought up a lot of emotions for them all.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    35. Re:Amazon S3 by SausageOfDoom · · Score: 1

      You're all ignoring the fact he said he didn't want to pay for remote storage. RTFQ.

    36. Re:Amazon S3 by Peter+van+Hooft · · Score: 1

      Actually, I think what is needed here is data management. Storing your data for a long time is one problem (which can be solved by re-storing your data on other, and, very important _the at that time newest format_ devices), the other problem is you need to re-store your data in the file formats du jour, ie import it in an application and save it in the then current file format.

    37. Re:Amazon S3 by ThePromenader · · Score: 1

      "Well it's not dirt cheap for 500GB - that's $75/month. But who really has 500GB of critical data."

      Photographers! Or any image-oriented media for that matter.

      My company (a photo-equipment rental company) is starting a long-term secure-storage service as an answer to the very questions asked of this thread, not to mention the dozens of data-loss stories we hear each day from our photographer clientelle. It is a huge investment for a company our size, but one of the reasons we began is that - no-one is doing it yet. At least not in a way adapted to our (and our clientelle's) needs.

      In the many months of research preceding the development of our service, one of the most frustrating things I saw was a total lack of transparency; very few sites disclose how they 'secure' data (if they do at all). Also, after having reviewing and testing the many storage architecture/media solutions out there, and the cost/capacity ratios of each, I tend to doubt the 'security' of a system that only charges $75 for 500gb a month.

      --

      No, no sig. Really.

      ThePromenader
    38. Re:Amazon S3 by Duckie01 · · Score: 1

      There's also the fact that when the first disk fails, someone has to notice, buy a new one, and replace it.

      True, with a NAS you'll have the burden of hardware maintenance. How often would that be tho?

      This leaves a period during which, in a typical configuration, one more disk failure will obliterate all data.

      Ehh... I thought the question was about a backup sollution... so I meant that the NAS is the backup, not the primary place to store data. So if one disk would fail, the data would still be redundant. If two disks would fail, you could still access the data (either the original or the limping RAID) but have no redundancy. Only if 3 disks would fail at once you'd lose all data.

      If you don't notice the disk failing, it could be a considerably long period.

      So, the additional advantage to Amazon is that it truly is "set it and forget it" -- upload it once, no need to monitor it until you need it.

      True. It's amazing tho what kinda software these NAS devices have these days. If you get the "set it" part of S3 right you *sure* can get the failure notification of these NAS devices going.

      And yes, there's the possibility of your house burning down. Or getting struck by lightning.

      Yeah I already acknowledged that. That hasn't changed tho now we have digital data. I havent seen anyone store the negatives of their family photos offsite... It's gotta have something to do with statistics, the chance of your house burning down.

      Which, now that I'm thinking of it, might actually vary greatly depending on where you live... I don't concern it to be a big issue here...

      ... above water level is more important to me ;-) I live below sea level, and I've seen pics of a flooded home theater basement where they also had a server and some backup storage... I'd be really really sad ;) My small pos server and backup storage is all on the second floor for this very reason.

      Or you spill Mountain Dew on the NAS. Or any of a large number of things.

      Yeah well, if you'd take a look at them, your idea would quickly turn into "shake a bottle of mountain dew till you're afraid it will explode and spew the dew into a vent hole because otherwise I'll just wipe the thing clean without even turning it off".

      Also, let's just assume that if you're smart enough to install a NAS for the just-in-case backup you'll also find a suitable spot to place it safely. I've got one between 2 shelves, just used two lathes and a couple of screws to mount it. I fixated the cables with simple clamps I also screwed onto the shelf.

      It won't drop down short of an earthquake... but we don't have quakes like those. You'd really have to try hard to damage it with liquid.

      I would say, prioritize your data. Most of the time, there's a small amount of data which cannot ever ever ever EVER be lost -- stuff like financial history, that book you're writing, etc. Even if you throw in, say, /etc on Linux, it's still a small enough amount of data that it would cost pennies to store on S3.

      That's great advice of course :-) Perhaps I'm just too paranoid about trusting my personal data to third parties :-(

      Put the rest of it -- your porn, whatever -- on that NAS. This is stuff that it would suck to lose, but it's not exactly irreplaceable -- or it is, but you're not willing to pay a premium to protect it.

      I'd turn that into "put *everything* you want to backup on the NAS anyways". If it'd cost pennies to store on S3 it won't get in your way on the NAS, and S3 *has* had downtimes, tho not a lot... I also don't have a triple redundant pipe at home... ISPs have their downtimes as well.

      There's also a lot stuff I don't backup at all... like gigabytes of stuff I got

    39. Re:Amazon S3 by Duckie01 · · Score: 1

      I have trouble with this in my mind, since so much of my work is devoted to making sure that information always exists and is accessible all the time. I look at these personal solutions for backup, and I'm so used to evaluating enterprise-type products that I scoff.

      So why bother replying to a thread if the level is so far below your own you can't say anything but "don't know"? If it's to advertise your blog... well it worked... i read some of it, and it represents a nice view on a person just learning how to tunnel some traffic thru ssh... or reading "interesting" stuff on the wikipedia page about system administration...

      I guess that I don't know anymore what is appropriate for 'home users' when they say they want to keep data long-term.

      They're confusing keeping the data long-term with keeping the data carrier long-term.

      The submitter stated that tape drives were inadequate when that's still the most reliable method that enterprises use.

      They're reliable and cheap so enterprises tend to use them a lot. They don't make the data easily accessible though.

      Sure, there are VTLs, but to not keep your data offsite as well would be counterproductive. I guess I just don't know what the submitter wants.

      My requirements are simple: It must be stable, lasting for decades if possible, and must be as inexpensive as possible. I'm not looking to start my own national archive; I have less than 500GBs and only save things important to me."

    40. Re:Amazon S3 by SanityInAnarchy · · Score: 1

      True, with a NAS you'll have the burden of hardware maintenance. How often would that be tho?

      More often than "none at all", which is what the OP was asking for.

      That's great advice of course :-) Perhaps I'm just too paranoid about trusting my personal data to third parties :-(

      So encrypt it. Then you have to store your crypto keys somewhere -- but crypto keys are relatively tiny, and if they're sufficiently secure, unchanging. You could, for instance, fill a DVD with copies of the same tarball, of that key, any script you wrote to retrieve the data, and a text file of instructions. Or use Paperdisk.

      If it'd cost pennies to store on S3 it won't get in your way on the NAS, and S3 *has* had downtimes, tho not a lot...

      Yes, store it on the NAS, and also on S3, so if the NAS blows up for whatever reason, you have that critical stuff. Remember, this is stuff on the level of "I am not willing to ever lose this."

      And S3 has had downtime, but I don't remember it ever losing data. Downtime just means you try again later.

      The main reason to not completely trust S3 -- to have your script periodically check it, and maybe to have a NAS of some sort anyway -- is because even if you assume Amazon is trustworthy, anyone who gets a hold of your keys can delete your data -- even you, in a moment of stupidity (think rm -rf /). But then, that's also a reason to not completely trust a NAS.

      There's also a lot stuff I don't backup at all... like gigabytes of stuff I got from torrents... If anything happens to those I'll just d/l again what I want at that time.

      Right -- my point is, when you ignore everything you can just re-download (but that it might be painful to have to find again), you're left with tiny amounts of data that S3 makes sense for.

      --
      Don't thank God, thank a doctor!
    41. Re:Amazon S3 by Anonymous Coward · · Score: 0

      Mozy is $5/month/computer for unlimited storage. If you EOL a computer, you can get it's files switched to another machine.

      It's got Mac and Windows clients. It's not perfect, but it's not bad.

    42. Re:Amazon S3 by Jeppe+Salvesen · · Score: 1

      In addition, my harddrive must crash-n-burn at that exact time. I keep local copies, as I'm not a teenager nor a Republican.

      --

      Stop the brainwash

    43. Re:Amazon S3 by pAnkRat · · Score: 1

      Don't write these scripts yourself,
      tools like duplicity: http://duplicity.nongnu.org/
      bring S3 as a target out of the box.

      --
      we need an "-1 Plain wrong" moderation option!
    44. Re:Amazon S3 by Duckie01 · · Score: 1

      True, with a NAS you'll have the burden of hardware maintenance. How often would that be tho?

      More often than "none at all", which is what the OP was asking for.

      That doesn't make it a non issue... the OP was asking for the impossible, like it should last for decades, easy setup, zero maintenance and be inexpensive at that. I mean, he isn't gonna get what he was asking for anyways, so some tradeoff needs to be made, just as Amazon S3 has its downsides as well. To find out if the tradeoffs are acceptable or to optimize them you should be able to quantify them somewhat somehow.

      When I've had trouble with hard drives, it was either due to my own stupidity (physical shocks while they were running) or with pretty old hard drives. You can prevent the stupidity part by mounting the NAS in a safe place, and by only doing something to it physically when it's turned off ;-)

      Apart from the stupidity I've had 2 harddisk crashes in my server in over a decade, both hard drives were living their second life after at least 3 years of being desktop whores.

      Hard drives will fail sooner or later but personally I don't think it'd happen so often that it'd become a chore, I think it'd be an acceptable sacrifice.

      That's great advice of course :-) Perhaps I'm just too paranoid about trusting my personal data to third parties :-(

      So encrypt it.

      Yeah... I'm seriously looking into S3 now to use it in addition to the NAS... it really does make sense to put the highest priority data offsite as well. I see that this is going to be one of the downsides for S3... it's gonna cost a lot of time getting something going that'll encrypt and store the data periodically in a reliable way. Muuuuch more than it took to configure my NAS and some rsync jobs ;-).

      Unless of course you happen to know some free (speech) software that'll do the trick easily.

      More S3 pitfalls ahead... I didn't have an amazon.com account and created one. Obviously I had to create an AWS account as well. Apparently I need a credit card to pay for S3 which I don't have and actually would rather not have... so now I either gotta get me one anyways or forget the S3 plan alltogether... feh.

      Then you have to store your crypto keys somewhere -- but crypto keys are relatively tiny, and if they're sufficiently secure, unchanging. You could, for instance, fill a DVD with copies of the same tarball, of that key, any script you wrote to retrieve the data, and a text file of instructions. Or use Paperdisk.

      The keys I got so far are so small that even just printing them on paper and having to type them back in in the unlikely scenario of me needing the offsite backup would be very acceptable ;-)

      If it'd cost pennies to store on S3 it won't get in your way on the NAS, and S3 *has* had downtimes, tho not a lot...

      Yes, store it on the NAS, and also on S3, so if the NAS blows up for whatever reason, you have that critical stuff. Remember, this is stuff on the level of "I am not willing to ever lose this."

      And S3 has had downtime, but I don't remember it ever losing data. Downtime just means you try again later.

      True. It's at least as annoying as replacing a hard drive though. When I'm arriving at that stage I've lost the data in my desktop, my backup on the NAS, my critical stuff backup on my server, and probably replaced a lot of hard drives. By that time I really really want at least that data... and I mean not maybe in 5 minutes, or maybe 2 hours, or whatever scenario my spinning head will come up with at that time... but NOW damnit... hehe.

      Or perhaps I'd just shrug it off with a "oh maybe tomorrow" and return to the smoking remains of my home to see if there's any *real* stuff left...

    45. Re:Amazon S3 by Hognoxious · · Score: 1

      But that was funny to.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    46. Re:Amazon S3 by SanityInAnarchy · · Score: 1

      Unless of course you happen to know some free (speech) software that'll do the trick easily.

      Look into Duplicity. I haven't used it -- this is something that's interesting enough to me that I'll probably write my own. But the software definitely exists.

      The keys I got so far are so small that even just printing them on paper and having to type them back in in the unlikely scenario of me needing the offsite backup would be very acceptable ;-)

      Sounds like a passphrase, not a key. And I'd be using a large-ish, random key.

      True. It's at least as annoying as replacing a hard drive though.

      Really?

      S3 downtime: You get an email, and by the time you go to look at it, S3 is probably back up and your script has taken care of everything.

      Replacing a hard drive: You get an email, and you order a new drive from Newegg, unpack it, open up your NAS, and push the drive in, assuming it's hotpluggable. If not, you'll have to carefully power it down, probably open it, hook up the drive, close it, lock it, and boot it back up.

      Or perhaps I'd just shrug it off with a "oh maybe tomorrow" and return to the smoking remains of my home to see if there's any *real* stuff left...

      Probably. But since the things I've created are digital, I would still be pretty worried about that -- if my house burns down, there's a lot of nice things I've lost, and probably a fair amount of money. So I go to work, make some more money, get a new house, and buy more things to fill it -- just takes time, but it's all replaceable.

      If I lose that killer app I was working on, or that great American novel, I have to start over. I may be able to recreate it, but most likely, I will never get it back.

      I searched and couldn't find what they're actually doing to ensure my data will be there always, apart from a vague "99.99% availability - or you'll get service credits" claim.

      Read up on Dynamo, among other things.

      More relevantly, that is now their job, not yours -- which means you can relax. It's hard, I know :)

      --
      Don't thank God, thank a doctor!
    47. Re:Amazon S3 by Duckie01 · · Score: 1

      Look into Duplicity. I haven't used it -- this is something that's interesting enough to me that I'll probably write my own. But the software definitely exists.

      That's cool tho... because I've used duplicity for a while before switching to plain rsync. Apparently they've added an S3 backend since then, and even a gmail one as well... yay :-) It's fairly simple to setup too.

      It didn't pop up in my google/freshmeat searches for S3 stuff... perhaps they'd need to advertise that some more ;-) Thanks!

      The keys I got so far are so small that even just printing them on paper and having to type them back in in the unlikely scenario of me needing the offsite backup would be very acceptable ;-)

      Sounds like a passphrase, not a key. And I'd be using a large-ish, random key.

      Those were the AWS key and secret key, as amazon calls them. You might just call them random passphrases. They're like 20 or 40 characters long. I don't have any S3 keys yet... no credit card yet... might not get one either depending on how this duplicity/gmail thing is going to work out. I just stored and restored some data with it. S3 seems more suitable to dump a couple hundred megs of photos on tho, while even a couple of gigabyte still amounts to pennies per month :-)

      What would be a sufficient keysize? Would it really need to be more than 128 characters?

      True. It's at least as annoying as replacing a hard drive though.

      Really?

      S3 downtime: You get an email, and by the time you go to look at it, S3 is probably back up and your script has taken care of everything.

      Yeah that might be a more likely scenario than the doom I had in mind ;-) It'd just be annoying to have to wait for it and not being able to do something about it.

      Replacing a hard drive: You get an email, and you order a new drive from Newegg, unpack it, open up your NAS, and push the drive in, assuming it's hotpluggable. If not, you'll have to carefully power it down, probably open it, hook up the drive, close it, lock it, and boot it back up.

      Well usually i just pick up my hardware on my way to or from work... so it'd cost like 10 minutes to replace a hd I guess.

      Or perhaps I'd just shrug it off with a "oh maybe tomorrow" and return to the smoking remains of my home to see if there's any *real* stuff left...

      Probably. But since the things I've created are digital, I would still be pretty worried about that -- if my house burns down, there's a lot of nice things I've lost, and probably a fair amount of money. So I go to work, make some more money, get a new house, and buy more things to fill it -- just takes time, but it's all replaceable.

      If I lose that killer app I was working on, or that great American novel, I have to start over. I may be able to recreate it, but most likely, I will never get it back.

      Yeah... I don't really have anything digitally at home that's making me money... there's just stuff I'd hate to lose... so I've got the NAS and do some backups on my server as well. Of course when the house burns down I'd still love to get at least some digital stuff back ;-)

      Read up on Dynamo, among other things.

      I just read the paper ... cool stuff :-)

      More relevantly, that is now their job, not yours -- which means you can relax. It's hard, I know :)

      Heh.. nah.. I still sleep fine ;-) It's just that when you're creating offsite backups it's for the worst doom scenario ;-)

    48. Re:Amazon S3 by Bandman · · Score: 1

      Well, obviously I want people to check out my blog, since I put the link in my signature, but that's definitely not why I posted the comment. If it sounded like I thought the subject matter was below me or something, I gave off the wrong vibe, and I apologize.

      When I said 'I guess I just don't know what the submitter wants', it was sort of rhetorical. You quoted the sentence. He might as well have been asking for unicorns. Anything stable and lasting for decades is a tape at this point. Or cuneiform.

    49. Re:Amazon S3 by Bandman · · Score: 1

      Also, I'm open to ideas on improving my blog. I generally cater to people who admin small infrastructures, because most people work on smaller networks (around 70% of admins have a network with less than 100 machines), and that's an area lacking documentation and vendor support. That being said, I try to provide something of interest to everyone, so like I mentioned, I'm open to suggestion and improvement. You can email me at standalone.sysadmin@gmail.com if you want.

    50. Re:Amazon S3 by SanityInAnarchy · · Score: 1

      Those were the AWS key and secret key, as amazon calls them. You might just call them random passphrases. They're like 20 or 40 characters long.

      I assume those are sufficient -- and I believe there's some actual x509 certificates you can use, if not.

      No, I'm talking about encrypting your data so Amazon can't see it. Those keys are just authenticating you to Amazon, they don't encrypt your data.

      Well usually i just pick up my hardware on my way to or from work... so it'd cost like 10 minutes to replace a hd I guess.

      I live in a small town -- nowhere I can go and just buy a hard drive, certainly not one I expect any reliability from.

      --
      Don't thank God, thank a doctor!
  2. Hard drives kept online by matts-reign · · Score: 2, Insightful

    Hard drives, while they may fail, are still probably your best chance. Using RAID-1 or -5, you can keep the drives running (possibly intermittently) and can avoid failure. With the rate of hard drive growth, you can just replace them with bigger drives when the time comes you need more space. It isn't exactly the same as throwing them in a cold room and forgetting them, but it isn't too expensive either.

    --
    Waffles rock.
    1. Re:Hard drives kept online by Annymouse+Cowherd · · Score: 1

      Since he won't care about this stuff for years, theres no reason he should even have the drives running. Drives that are off should have a data retention of at least 100 years.

    2. Re:Hard drives kept online by boner · · Score: 4, Informative

      I recently built my own cheap backup server using OpenSolaris and ZFS. I used my old SATA drives (6x400GB), a $75 motherboard and AMD Athlon X2 combo, 4GB of DRAM ($69) and an old tower case. I did add two SATA 5-bay hot-swappable disk bays ($110 each) so that I can easily replace/upgrade my disks. Once a week I update data from my main server (also Solaris) to the backup server using ZFS incremental snapshots.

      My PC's and Mac's all mount their user directory from my main server, and I rsync my laptop every day. The main server also serves as a SunRay server so I do most of my daily chores on a SunRay. I run Windows inside VirtualBox and I rarely ever turn on my windows PC anymore (the Windows instance in VBox also mounts from my main server). Inside my main server I have 2x 1TB drives, in a ZFS mirror setup, for the user directories and 2x400GB for the OS and scratch directories (all drives are SATA).

      I'm very confident in this setup, also because I can yank out my drives in under 30 seconds in case of fire. The only thing I still have to do is put my backup server in a different room from the main server - that is a todo project for the near future.

    3. Re:Hard drives kept online by ewilts · · Score: 4, Informative

      I recently built my own cheap backup server using OpenSolaris and ZFS. I used my old SATA drives (6x400GB), a $75 motherboard and AMD Athlon X2 combo, 4GB of DRAM ($69) and an old tower case. I did add two SATA 5-bay hot-swappable disk bays ($110 each) so that I can easily replace/upgrade my disks. Once a week I update data from my main server (also Solaris) to the backup server using ZFS incremental snapshots.

      My PC's and Mac's all mount their user directory from my main server, and I rsync my laptop every day. The main server also serves as a SunRay server so I do most of my daily chores on a SunRay. I run Windows inside VirtualBox and I rarely ever turn on my windows PC anymore (the Windows instance in VBox also mounts from my main server). Inside my main server I have 2x 1TB drives, in a ZFS mirror setup, for the user directories and 2x400GB for the OS and scratch directories (all drives are SATA).

      I'm very confident in this setup, also because I can yank out my drives in under 30 seconds in case of fire. The only thing I still have to do is put my backup server in a different room from the main server - that is a todo project for the near future.

      Problem 1: If you are not home and your power supply decides to catch fire, you have lost everything.

      Problem 2: If you are home, you better be spending those 30 seconds trying to get your butt out of the fire, not running after hard drives.

      If you think your DR plan relies on yanking drives out, you're in serious trouble. One B&E or a fire and your data is gone. Now this may be perfectly acceptable to you. It is to a lot of small companies, until it happens to them.

      Personally, I vault offsite on a daily basis as well.

      --
      .../Ed
    4. Re:Hard drives kept online by ColdWetDog · · Score: 2, Interesting

      Since he won't care about this stuff for years, theres no reason he should even have the drives running. Drives that are off should have a data retention of at least 100 years.

      [citation needed], please. Really, my understanding is that non powered hard drives are NOT good long term candidates due to 'stiction' (maybe not so much an issue now, maybe it is) and perhaps other problems.

      (Warning: Anecdote time) I've had several previously good HDs in my junk box fail to start when I decided I wanted to play with them again. Any real data?

      --
      Faster! Faster! Faster would be better!
    5. Re:Hard drives kept online by boner · · Score: 1

      I hear you, my biggest worry isn't fire itself, its fire after earthquake. In addition to my backup solution described above, I keep rotated drives with snapshots at work.

      If my house burns down completely and all data is unretrievable I will have lost at most 6 months of data. Not all.

    6. Re:Hard drives kept online by WhatAmIDoingHere · · Score: 1

      And this is the real answer to the original question. Do everything you can to make backups and copies at home and rotate those disks with alternates kept off-site.

      It won't be cheap to set up, but once it's going your only cost is going to be replacing drives that fail. It would be a lot cheaper than 500GB with Amazon's S3.

      --
      Not a Twitter sockpuppet... but I wish I was.
    7. Re:Hard drives kept online by theaveng · · Score: 2, Interesting

      What's "striction"? I'm asking because I just switched-off my USB drive in order to save power. Maybe I should turn it back on & let it run continuously?

      My suggestions:

      - Home videos - nothing lasts as long as analog videotape. Twilight Zone episodes recorded in the 1960s are still viable today, whereas DVD-Rs have a nasty habit of self-erasing themselves in just 2-3 years. I think Super VHS tape is the safest way to store family memories.
      - For files smaller than 20 megabytes, I upload them to yahoo mail and Gmail.
      - For larger files I dual-duplicate them across both my c: and usb: drives. If one fails I know I'll still have the backup of the other.
      - For source code, printing to paper is also an option. Good quality paper will outlive you.
      -
      - And finally: Ask yourself if you really need that stuff? Do you need to save your old CSE 101 project? Probably not. I used to save that junk but then I realized it was pointless. Nowadays I save very little because most of it is not worth keeping or will never be used again.

      --
      FOX NEWS.com should be BANNED from television and internet. Have the Congress take it over and give us Truespeak.
    8. Re:Hard drives kept online by Toll_Free · · Score: 0, Troll

      You have a backup server and a main server?

      For you HOUSE?

      You really don't WANT to get laid by a member of the human race, do you? :)

      --Toll_Free

    9. Re:Hard drives kept online by ColdWetDog · · Score: 4, Informative

      What's "striction"?

      I'm not sure what "striction" is (and I suspect I don't really want to find out). "Stiction" on the other hand, refers (among other things, check out the Wikipedia link) to the tendency of the lubricants inside the HDD to increase in viscosity over time. After a long enough period of time, the motor can't rotate the platters. The fun thing about this problem is that the 'repair' involves brute force - banging the HDD to move things a bit.

      Supposedly was a problem on older drives. My brief Google search did not yield any useful info. The problem with HDDs is that they are designed to be on and NOT designed for long term, powered off storage. That said, turning your USB drive on and off shouldn't be much of an issue. Remember the thing was probably made 6 months before it was installed in your housing and then sat in a warehouse for a while. The question is - can you expect a random HDD to fire up after sitting in storage for x years (x being larger than 2 or 3)?

      --
      Faster! Faster! Faster would be better!
    10. Re:Hard drives kept online by confused+one · · Score: 4, Informative

      Until you have a power supply failure take out multiple disks or a controller failure corrupt them all. Then your data is GONE!!!. Don't rely on a single RAID array, of any kind, or any combination in one chassis, to store data you want to keep long term.

    11. Re:Hard drives kept online by Anonymous Coward · · Score: 1, Insightful

      I'd argue though that out of all my data the last 6 months of it is exactly what's most valuable to me.

    12. Re:Hard drives kept online by tomhudson · · Score: 1

      I doubt stiction is that big a problem. I had a computer that literally sat for a decade before I decided to boot it up "just for the heck of it" - ahh, the good old days of 17-second bootups

    13. Re:Hard drives kept online by Ihmhi · · Score: 1

      Could you compensate for stiction?

      What about an archival PC? Low power. Turn it on one day a week to spin up all the drives. Leave it on for a bit and then shut it down. Wouldn't this solve the problem without greatly compromising the life of the drives?

      Hell, you could probably use the BIOS and some simple programming to have all of this automated.

    14. Re:Hard drives kept online by QuantumLeaper · · Score: 1

      I have an IBM XT which has been sitting in storage for almost 2 decades, and the HD started up fine. DOS has those nice fast boot times, but my C64 would boot in 2 seconds.

    15. Re:Hard drives kept online by iamhassi · · Score: 5, Informative

      "Hard drives, while they may fail, are still probably your best chance."

      I tend to agree, however I'm a bit confused over what exactly is being requested.

      "I've thought about buying a bunch of 4GB thumb drives....I have less than 500GBs and only save things important to me."

      At first glance I thought you had 500gb you were trying to store, but then you mentioned "buying a bunch of 4GB thumbdrives" and I can't imagine someone buying 125 4gb thumbdrives to use for backup. So exactly how much data are you trying to store?

      If less than 50gb, I'd suggest a few SD cards. 8gb SD is ~$11, or 16gb for $30. While more expensive than hard drives per gb, SD cards are remarkably resilient, surviving a week in the ocean, and a few in a ziplock bag stored in a safe deposit box would probably last close to forever.

      SD will probably still be around at least for the next decade or longer. SD has already been around since 1999 and all modern card readers read SD cards by default. SD slots are in nearly every form of consumer electronic device, and every manufacture of digital cameras uses SD except Sony and Olympus, almost guaranteeing the card readers will be around for many years to come.

      I would suggest against USB anything since they're already discussing cutting the cord on USB and going wireless USB. While I don't predict that will happen overnight you wanted a solution that would be available decades from now, and wired USB might go the way of the parallel port, which was the standard external port in the 80s and 90s but was replaced by USB late 90s. Parallel port only had a lifespan of about 20 yrs and is no longer on modern PCs, and USB has been out just over 10 years so it's feasible in 10 years PCs will no longer have USB ports, everything will be wireless USB.

      --
      my karma will be here long after I'm gone
    16. Re:Hard drives kept online by Anonymous Coward · · Score: 0

      How about simply sending files to one's Gmail account?

      If one wishes a redundant backup, it is possible to do so with, say, a Yahoo mail account, using a command like:

      $ fetchmail --smtphost smtp.YourISP.com --smtpname YourYahooAccount@yahoo.com -k -a --ssl -P 995 -u YourGmailAccount@gmail.com -p pop3 pop.gmail.com

      It is unlikely that both Gmail and Yahoo (or Hotmail or Lycos mail) disappear at the same time

      Alternatively, e-mails could also be downloaded from Gmail to a local drive (instead of being redirected to Yahoo)

    17. Re:Hard drives kept online by the_womble · · Score: 2, Insightful

      You can still get USB to parallel port adaptors.

      The same thing will happen with whatever replaces USB.

      I really hope we do not move to wireless USB. It will just be an extra set of security holes and other problems.

    18. Re:Hard drives kept online by boner · · Score: 1

      I'm married...

      nuf said.

    19. Re:Hard drives kept online by LWATCDR · · Score: 1

      Why not archive quality DVDs? They are more expensive then standard DVDs but cheaper then SD cards.
      Make copies just to be sure.
      http://www.mediasupply.com/mamgold.html

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    20. Re:Hard drives kept online by Anonymous Coward · · Score: 0

      A few years ago I fired up an old PC that had been sitting in the basement for around 7 years, and the hard drive worked fine.

      This is, of course, ancedotal evidince with no statistical significance.

    21. Re:Hard drives kept online by mazarin5 · · Score: 1

      The question is - can you expect a random HDD to fire up after sitting in storage for x years (x being larger than 2 or 3)?

      To be fair, that hard drive should get a little time powered up when he syncs. (Hopefully, that would be more often than every 2 or 3 years.)

      How long does a HDD have to run to reset the stiction clock?

      --
      Fnord.
    22. Re:Hard drives kept online by Bandman · · Score: 1

      You'd have to replace the power supplies occasionally, but I suppose it might work.

    23. Re:Hard drives kept online by Bandman · · Score: 1

      It's a bit costly, but Iron Mountain provides physical (and online) backups.

    24. Re:Hard drives kept online by decavolt · · Score: 1

      An "archival PC" - or anything else locally stored - is useless in the event of a fire, theft, flood or some other calamity.

    25. Re:Hard drives kept online by decavolt · · Score: 1

      500 gig of data on Gmail? For small amounts of data, sure, but not for anything substantial.

      Further, Gmail is not and should not be relied upon as a long-term storage space. It's still just a free account on a service that's still "beta" with no guarantee whatsoever that data will be retained.

    26. Re:Hard drives kept online by decavolt · · Score: 1

      I'm very confident in this setup, also because I can yank out my drives in under 30 seconds in case of fire...

      That requires you to be home. Utterly useless in the event of theft, or any other calamity while you're away - which is the most likely time for such things to happen.

    27. Re:Hard drives kept online by Molochi · · Score: 1

      I first heard of stiction in reference to very very old hard drives. Something about not parking the drive heads off the platter when they were powered down would cause the drivehead and platter to stick together. When you powered them back up it would damage the platter or even rip off the head. I never had to deal with anything like that and, true or not, modern drives shouldn't be able to fail this way.

      Driveheads now are automatically parked and don't touch the platter, headcrashes excepted. I would think if you bought a new drive, used it for a single large backup, and then stored it offline safely, that it would probably have about the same shelflife as a drive sitting in it's box new and unused.

      --
      "The Adobe Updater must update itself before it can check for updates. Would you like to update the Adobe Updater now?"
    28. Re:Hard drives kept online by Kessler · · Score: 3, Informative

      Really? Taken apart any modern hard drives lately? "Parking" is simply the act of moving the heads to an area of the platters not used for data storage. The heads are still very much in contact with the platters, but if the heads get bounced around, there's no data under them to damage.

      Stiction is caused by centripetal force of the spinning platters gradually drawing spindle bearing lubricant across the surface of the platters. This happens gradually over a long period of time. Once that happens, if you leave the drive powered off (parked or not) for a significant period of time, the contaminants can bond the heads to the platters.

      Newer drives take steps to prevent this such as using better bearings and parking the heads close to the spindle so they can generate more "break-free" force from for a given amount of motor torque.

      If you copy data to a brand new drive (preferably a lower capacity unit with a single platter and only a pair of heads) and then take it offline and store it in a climate controlled environment, stiction is not likely to ever become a problem.

    29. Re:Hard drives kept online by toddestan · · Score: 1

      Likewise, I've have a bunch of drives that have been sitting on a shelf, some for many years. I've never had a problem with stiction. Though what does happen is the drive will work fine for 1-2 days, then will crap out. Usually just enough time for me to get it set up with whatever project I'm doing, then I have to start all over again.

      I would say store the drives in a cool, dry place. Try to remember to run them a few hours every few months. Replace them every 3-5 years whether you need to or not.

    30. Re:Hard drives kept online by Seraphim_72 · · Score: 1

      I am not a server person so forgive the ignorance, but can you make a RAID of RAIDs? (assuming that you had more money than Bill G himself)

      --
      Slashdot, where armchair scientists get shouted down and armchair theologians get modded up.
    31. Re:Hard drives kept online by Idiomatick · · Score: 1

      I had two servers for a while and i'm getting married...

    32. Re:Hard drives kept online by corerunner · · Score: 2, Informative

      You can indeed make a RAID of RAIDs. It's called using nested levels. Really RAID should be used for high availability and performance though, and not as part of a disaster recovery policy.

      --
      "Don't hate the media, become the media." -Jello Biafra
    33. Re:Hard drives kept online by Anonymous Coward · · Score: 0

      "[citation needed], please. Really, my understanding is that non powered hard drives are NOT good long term candidates due to 'stiction' (maybe not so much an issue now, maybe it is) and perhaps other problems."

      I'm going to counter your anecdotes with my own, I have a driver nearing 20 years old that is never used that still turns on, as I recently checked to see what was still on it. I also have many drives from the mid to late 90's that have been off for near a decade and still work just fine thanks. I've used them infrequently to test really old hardware.

    34. Re:Hard drives kept online by thermowax · · Score: 1

      Can you yank out the drive in 30 seconds when the fire starts and you're not home?

      Can you yank out the drive in 30 seconds when lightning strikes and fries every single component in every piece of equipment in your house? And don't think any "surge suppressor" is going to stop it- lightning travels several thousands of feet to get to your house; that 1/16" MOV isn't going to do squat.

      I didn't think so.

    35. Re:Hard drives kept online by DigiShaman · · Score: 1

      Wireless will never replace Wired USB as devices still require the host PC to power them. It's not just about data transfer.

      --
      Life is not for the lazy.
    36. Re:Hard drives kept online by a_nonamiss · · Score: 1

      wired USB might go the way of the parallel port

      I can still purchase both a USB/Parallel adapter and a parallel port PCI card at my local computer store for less than $10. People keep saying that every form of media is going to be obsolete in 20 years. They cite examples like "How many 8" floppy drives have you seen lately?"

      This is a poor example. How many 8" disks were ever produced? Maybe a couple hundred thousand total. (Shot in the dark, but not THAT many.)

      Contrast that to how many CD's have ever been produced. I don't know that number, either, but I'd guess billions easily. At least several thousand times more than 8" floppy disks. Truth is, I don't think there will EVER be a time (before the rapture, nuclear war or a massive tribble invasion) when something that reads CD's isn't readily available, because the medium is so ubiquitous. Never in human history has a single medium held even a small fraction of the data that are stored on CD/DVD's.

      Now, in 10,000 years, long after we're dead and the apes have evolved into intelligent beings, and they find our CD's, they might have a real challenge getting data off of them, but I don't believe in the forseeable future, this will be an issue.

      --
      -Arthur
      Cave ne ante ullas catapultas ambules
    37. Re:Hard drives kept online by ThePromenader · · Score: 1

      Yes, you can, through 'mirroring' - either through a second level software raid (say, making a raid-1 out of two raid-5 arrays (or call it raid-51)), or you can do it on the server level by 'mirroring' two computers each managing their own raid array. The first method assures simultaneous synchronisation (but counts on the managing computer's uptime and health); the latter provides a "HA" factor (High Availability), as should either an array or a server go down (even for maintenance/recuperation), the other server will take over, thus giving you uptime.

      HA is open source, as are more elaborate HA programs such as DRDB.

      --

      No, no sig. Really.

      ThePromenader
    38. Re:Hard drives kept online by theaveng · · Score: 1

      After reading all this I've come to the conclusion that a Flash ROM probably is the best way for archival storage. No moving parts so lubrication is not an issue, and they are relatively compact so you can store them in a fireproof safe.

      --
      FOX NEWS.com should be BANNED from television and internet. Have the Congress take it over and give us Truespeak.
    39. Re:Hard drives kept online by theaveng · · Score: 1

      IBM, Amiga, and other disk-based OSes can do the same thing if you go directly to a command line (which is essentially what a Commodore 64 does).

      --
      FOX NEWS.com should be BANNED from television and internet. Have the Congress take it over and give us Truespeak.
    40. Re:Hard drives kept online by stanjam · · Score: 1

      I have an 8" floppy or two around somewhere. Significantly more of the 5 1/4 inch (and who uses them anymore?). Of course I have the 3 1/4" disks around as well. Some of my personal computer builds even still have the 3 1/4" drives! THey are all dead tech though. So in essence he is right. You can not count on ANY tech storage device being around in 20 years. I have seen companies run into this problem. They HAVE the backup data, but no ability to actually SEE the data. Still, there are a few choices available to him. USB drives are one he is considering, but I don't like it. They can only be used a limited number of times before they lose their ability to store and retrieve data (most people don't ever get to that point). It also would take a LOT of drives. A solid state drive might be a better answer. That way he can have his hard drive solution, without his fear of the hard drive failing. Still, I would make multiple back ups if he is truly concerned, and one of those would be on high quality DVD.

      --
      Open Source: Eroding the Digital Divide
    41. Re:Hard drives kept online by Anonymous Coward · · Score: 0

      > Of course I have the 3 1/4" disks around as well. Some of my personal computer builds even still have the 3 1/4" drives!

      I have never seen a 3 1/4" drive in my life... 3.5", yes!

    42. Re:Hard drives kept online by Molochi · · Score: 1

      That's good to know and, no, I haven't torn apart a HDD made in this century, nor for any reason other than magnet collection before the drive got tossed.

      Does a 20+ year data retention figure for a properly stored HDD sound about right? Or do you think bit rot would set in sooner? I'd tend to think multiple platters would be better for this type of use (reducing data density) though I'm assuming you meant a greater number of heads would increase the chances of incurring stiction.

      Do you think notebook drives would be a better choice? They tend to be designed for more abuse and for having higher use and static shock limitations. Do they also leave the drivehead over the platter?

      I wouldn't mind using 2 or 3 500GB HDD for archival video footage (maybe 8-10 years, then move the data to something newer and better when it comes along). They aren't all that expensive. Maybe use different brands to minimize exposure to design flaws. Getting data off them shouldn't be a problem.

      --
      "The Adobe Updater must update itself before it can check for updates. Would you like to update the Adobe Updater now?"
    43. Re:Hard drives kept online by Xygon · · Score: 1

      As someone who works in the industry... don't use these storage devices if you really hope something will last. SD cards, USB cards, and all get the worst quality memory. Unreliable, unready, pre-production... the main reason is that it's still good enough for 99.9% of people's use for these types of drives. You copy the data off pretty soon, maybe rewrite the drive ten or twenty times before you've gotten one four times bigger to replace yours, or lost it, or broken it, or moved on to a camera with a new device style. They're great for what they are, but they're not meant for long term storage. If you care why: The majority of drives use MLC technology, and one manufacturer (and soon to be most) use a three-level cell. NAND memory stores data in a floating gate, where electrons are trapped in the gate of a transistor. In time, some of these electrons will dissipate. In higher-reliability SLC, there's more space between a 1 and a 0 for these bits to fade, and not change the data value. In MLC, there are four levels... thus each are closer together, and moving from a 00 to a 01 is going to happen. In a normal system, wear-leveling and basic use model designs will refresh the data, so this won't happen. But if you unplug that system for years, that's not happening. So couple a technology that is designed to last ten years on a fresh drive, with low quality versions of the memory (based on the needs of USB/SD markets), and I DO NOT recommend this for long-term storage.

    44. Re:Hard drives kept online by Anonymous Coward · · Score: 0

      Have the backup system at a different off-site location. Then take the incremental snapshots and apply them to the backup system. You can use a USB thumb drive to transport the snapshots or even send them over the internet depending on data volume. If use the internet then definitely use SSH or another form of securing your data. I use a similar solution but use ZFS's RAIDZ2 for the pool and generate large snapshots such that the internet solution is not feasible.

  3. EASY! by The+Yuckinator · · Score: 2, Funny
    1. Re:EASY! by Gilmoure · · Score: 1
      --
      I drank what? -- Socrates
    2. Re:EASY! by darkpixel2k · · Score: 1

      ClayTablet

      Good idea--plus if you 'encrypt' it in some old, dead language, you can have the scientists of the world recover your data like they did with the Rosetta Stone. I suggest backing up in French.

      --
      There's no place like ::1 (I've completed my transition to IPv6)
    3. Re:EASY! by Gilmoure · · Score: 1

      Gotta' include a Klingon version of some stereo remote control instructions though, so they have a key.

      --
      I drank what? -- Socrates
    4. Re:EASY! by jacobsm · · Score: 1

      I have box (2500) of unpunched 80 column punch cards that are as good as new after being in my possession for 15 years. As long as they don't get wet I expect that they would last for another 15-20 years without any problems. Finding a card reader might be a problem however.

  4. Not enough history by mgkimsal2 · · Score: 5, Insightful

    We don't have enough history on this tech to know what, if anything, will "last for decades". Possibly "paper" and "microfiche" might fit in that list, but those aren't the sort of things you're talking about. Best option I can think of right now would be to get a couple 500gig drives, put everything on both, and then put them in different areas. In 3-5 years, back them up to something newer, and repeat that every 3-5 years. Maybe in those intervening years, we'll have more data and newer tech that's demonstrably suited for what your needs are.

    1. Re:Not enough history by PincusJr · · Score: 0

      Do hard disc drives really fail that quickly? Say right now you buy a 500gig drive, then pop it in your computer, copy all the data, then remove it from your computer and store it in another box/container. You're telling me that in 5 years time the hard disc drive could just die? Exactly why would anything like that happen? "Nothing" is happening to the drive, yet it is able to stuff itself up somehow. Excuse my ignorance :)

    2. Re:Not enough history by calmofthestorm · · Score: 1

      They'll want to be spun up every so often (6mo or so).

      --
      93rd rule of Slashdot: No matter how obvious my sarcasm is, my comment will be taken seriously by someone.
    3. Re:Not enough history by PincusJr · · Score: 0

      Ah okay. I didn't know that. Thanks :) P.s. I didn't know hard disc drives were sentient.

    4. Re:Not enough history by sjames · · Score: 4, Informative

      The spindle bearings can seize up.

      The problem isn't that the drive will inevitably die after 5 years, it's that it won't inevitably last longer.

    5. Re:Not enough history by fracai · · Score: 2, Informative
      --
      -- i am jack's amusing sig file
    6. Re:Not enough history by gad_zuki! · · Score: 4, Insightful

      One drive should be "live" and the other archived. Considering we all own computers, throwing a 1tb drive into a box isnt so difficult. Hell, you could write a script to power it up once a month and then power it down, if people are worried about energy costs but dont want to keep it spinning 24/7. It doesnt need to be ever mounted.

      Better yet both disks should be running in a RAID 1 array. This is a cheap solution, but its not a "toss in the closet and forget" solution. If this guy actually cares about his data I dont see why he cant spend 200 dollars or so for two drives and a raid 1 card.

      I see this question at slashdot every couple of months. The answers are still the same. Keep it live on a disk until a better solution is found. Upgrade the disk every so often. That's it. Mods, stop posting the same damn question every month.

    7. Re:Not enough history by Anonymous Coward · · Score: 0

      Not sentient, just anthropomorphic ;-)

    8. Re:Not enough history by value_added · · Score: 1

      The problem isn't that the drive will inevitably die after 5 years, it's that it won't inevitably last longer.

      So you're saying an extra drive is not entirely unlike a real backup solution?

    9. Re:Not enough history by Anonymous Coward · · Score: 1, Funny

      This "ask slashdot" question is like masturbation. It is kinda fun but, in the end, pointless. The answers haven't changed much since last month (or whenever it was posted last) but it will still probably get 500 responses. fapfapfap

    10. Re:Not enough history by tylerni7 · · Score: 1

      Well, it really depends. I'm not a hard drive expert, but if you just leave a very complex mechanical device sitting around in an uncontrolled environment, it will be subject to things like thermal stress (which is bad for magnetic things and small, calibrated mechanical parts).
      Then of course you have to worry about magnets not holding their state forever.

      Probably the best bet would be to get a really high quality, slow spinning drive (or one that can be made to spin up slowly), and hooking it up to a tiny embedded controller that will spin it up gently (accelerating a disk to 7200 RPM also isn't too good for mechanical longevity) every once in 6 months or so, as calmofthestorm suggested below. Then run some sort of checksum on the data, and do some drive diagnostics, and if the drive looks like it's failing, send an email to someone. And keep a couple of these setups in different locations with the same data in temperature controlled environments.

      Keeping a hard drive powered off isn't bad for a drive, but no storage system can last forever. And, once bits are stored in such a small space, quantum effects will start to degrade data visibly over time no matter what the medium.

    11. Re:Not enough history by darkpixel2k · · Score: 1

      If you absolutely must store your data on-site or can't afford off-site backups, consider something like this hard drive safe from thinkgeek.

      Either that, or make arrangements with a good friend to purchase 4 identical hard drives.

      He holds on to one of your drives, you put one in your backup server.
      You hold on to one of his drives, he puts on in his backup server.

      Every week, go visit your friend and take the drive from your backup server. Give it to him for storage at his house, and you do the same for him.

      After the switch, you can both run off the second set of drives until next week when you meet and swap drives again.

      It's a good idea, because you have an on-site backup, and an off-site backup at your friends house--plus it forces me^H^Hyou out of the basement to go have human contact every once in a while.

      The down-side is if you store your data unencrypted, your friend has access to all your porn.

      --
      There's no place like ::1 (I've completed my transition to IPv6)
    12. Re:Not enough history by Anonymous Coward · · Score: 0

      Say right now you buy a 500gig drive, then pop it in your computer, copy all the data, then remove it from your computer and store it in another box/container.

      Ummmm, they also have this cool thing now called a USB Drive, available in 500GB and larger sizes.

      Just plug it into a USB port on your computer and it works just like an internal drive. Copy all your data to it, unplug it and store it in a safe place.

    13. Re:Not enough history by sjames · · Score: 1

      So you're saying an extra drive is not entirely unlike a real backup solution?

      It's not entirely unlike a backup solution. However, an active storage system where the drives are used gives a much better chance of detecting the failure when there is still time to do something about it. Otherwise, you're quite likely to discover that both drives are dead for the same reason if you just stick them in a vault somewhere for 5 years.

    14. Re:Not enough history by simplu · · Score: 1

      And you can backup everything on DVDs also once in a while. And you can keep them elsewhere.

      --
      L.
    15. Re:Not enough history by tomhudson · · Score: 1

      Ummmm, they also have this cool thing now called a USB Drive, available in 500GB and larger sizes.

      Just plug it into a USB port on your computer and it works just like an internal drive. Copy all your data to it, unplug it and store it in a safe place.

      What is this 500GB and larger USB drive you speak of ...

      Certainly not flash memory. More likely an external hard drive with a USB connector. More problems than an internal hard drive, due to lack of ventilation, etc.

    16. Re:Not enough history by Anonymous Coward · · Score: 0

      Can you point at the line on that page saying *ANYTHING* at all about hard drives? I couldn't find it.

    17. Re:Not enough history by who's+got+my+nicknam · · Score: 1

      I don't see the downside to that at all, as it means I have access to all his porn too!

      --
      "Apparatus dignosco occultus, satis non supernus."
    18. Re:Not enough history by S.O.B. · · Score: 1

      How about the first sentence:

      Bit rot, or bit decay, is a colloquial computing term used either to describe gradual decay of storage media...

      The article describes a phenomenon that applies to many types of storage media of which hard drives are one.

      --
      Some of what I say is fact, some is conjecture, the rest I'm just blowing out my ass...you guess.
    19. Re:Not enough history by Anonymous Coward · · Score: 0

      For the love of god don't use a raid card if you ever want to use your data on another system or potentially different raid card. Use Linux MD raid.

    20. Re:Not enough history by mollymoo · · Score: 1

      Do you really want to maximise the probability that they'll all have the same design or manufacturing defect and will all fail closely spaced in time? I'd go for different makes and models.

      --
      Chernobyl 'not a wildlife haven' - BBC News
    21. Re:Not enough history by icepick72 · · Score: 1
      Mods, stop posting the same damn question every month.

      That's what you said last month.

    22. Re:Not enough history by ramjambam · · Score: 1

      Buffalo make a 1Tb network attached drive containing 2 500Gb disks. Configure it in RAID 1 (with the config wizard)and mirror the info. Plug in once a month to stop the disks sticking. Done. Cheap and simple.

      --
      Artificial Intelligence stands no chance against Natural Stupidity
    23. Re:Not enough history by Anonymous Coward · · Score: 1, Insightful

      Well, thankfully they DID post it another month. Because I just reached critical mass with my storage needs and this very question has been in my mind the past 3 weeks. There is no such thing as a stupid question, and just because You have it sorted out doesn't mean everyone else has. That is the reason this forum exists.

    24. Re:Not enough history by darkpixel2k · · Score: 1

      Do you really want to maximise the probability that they'll all have the same design or manufacturing defect and will all fail closely spaced in time? I'd go for different makes and models.

      I'm not talking about using Hitachi drives... ;)

      But seriously, I never trust drives purchased in bulk for being part of a backup system or RAID set. If 10 drives all came off the assembly line one after another, there's a good chance some proximal event in time affected all 10. For all I know, someone sneezed when the drives rolled by, and every single one has a snot dropplet that's messing with the read/write heads. I don't want them to all fail within hours, days, or even weeks of each other.

      (I know that's a very over-simplified way of looking at HD manufacturing.)

      --
      There's no place like ::1 (I've completed my transition to IPv6)
    25. Re:Not enough history by Anonymous Coward · · Score: 0

      [needs citation]

    26. Re:Not enough history by Bandman · · Score: 1

      I've got all kinds of external drives > 500GB. One for each day of the week to do dailies onto. We use La Cie drives for it. http://standalone-sysadmin.blogspot.com/2008/05/backup-scheme.html

    27. Re:Not enough history by NormX · · Score: 1

      This is clearly an institutional problem. The technical solutions proposed here are part of the solution but you need an institution to store your data and need that institution to have a adequate incentive and financial means to really keep it. Here is a long and complex description of such an institution: http://cap-lore.com/BigStore/DataBank.html . I have 100 year old photographs. I think the data bank would likely keep your bits that long. I wish someone would steal my idea!

    28. Re:Not enough history by tomhudson · · Score: 1

      Many external hard drives are made from batches of hard drives that failed QC at their full capacity. They get reformatted at a lower capacity (so the heads aren't stroked across the whole platter, for example, or the drive's firmware modified so it doesn't use a defective platter). Your LaCie could very well have maxtor drives in it, in which case it's a real piece of shit - the maxtorgates (seagates from what was Maxtors' China plant) are absolute crap. Out of 12 drives so far, I've had ONE that lasted more than a month.

      Those "usb backup solutions" are cheap for a reason.

    29. Re:Not enough history by KenSeymour · · Score: 4, Interesting

      I let a lot of my early programs go. I had them on 9 track digital tape, at 6250 bpi. I had converted some of them from the lower bpi rates before. It used to be that each employer I worked for had one of those tape drives and I could access the files
      when I wanted to. But not anymore. Besides, would I really want to convert FORTRAN programs?
      In theory, I could have kept converting them to newer and newer storage media, but I didn't.
      Later on, I had one of those QIC tape drives that could hook up to a floppy disk controller.
      I think I have those files backed up on a hard disk somewhere -- I think. When I was going through old backups one year, I noticed I had the same directories appearing many many times.
      I spent hours selecting the best version of everything and then backing up that too.
      I haven't digitized all my vinyl record albums either, even though it is theoretically possible.
      I have not scanned all the 35 mm slides or prints I took going back to the 1970s. At some point, the new stuff going on in your life gets more interesting than converting the stuff that used to get you excited 25 years ago.

      Imagine how hard it is going to be to preserve people's papers after their dead. Do you want to keep converting someone else's stuff when you have a few terabytes of your own to attend to?

      --
      "We can't solve problems by using the same kind of thinking we used when we created them." -- Albert Einstein
    30. Re:Not enough history by setagllib · · Score: 1

      This is vastly overkill. I find it's a lot simpler to just keep your most valuable data all organized and checksummed, calculating the checksum as soon as you have the data, and replicating this wherever you have space. If you don't trust the privacy of the storage you're replicating to, archive encrypt the data as well, and then checksum that too. Digitally sign it if you're extra paranoid.

      If you just keep re-replicating, even just as you buy new computers and external disks, you'll find that by the time any single disk fails, you already have 5-10 copies. By the time the next disk fails, you might even have 15 or 20. They're all still checksummed and it's virtually impossible for all of them to be corrupted in the same ways. Even if one copy loses one file, the other copies still have it.

      If you're trying to design a hardcore backup solution for copied movies, games, music, etc. it's a waste of time and money. It's probably cheaper to re-acquire whatever is corrupted or lost, than to prevent its loss through highly redundant and contrived archival methods.

      --
      Sam ty sig.
    31. Re:Not enough history by jwhitener · · Score: 1

      Agreed. I see this question far too often. No one can say "do this one thing for 20 years".

      Hardrives, backed up locally or remotely depending on your paranoia, using any number of raid, zfs, or other redundant methods, and upgrades every few years.

      You'll need to babysit any solution, keeping it relatively modern by upgrading media/filesystems/hardrives, etc..

    32. Re:Not enough history by thegrassyknowl · · Score: 1

      You could do what I am setting up.

      1. Keep everything locally on a mirrored or striped RAID for reliability. That helps get back up quickly in the case of a single disk failure (most common failure mode). Periodically I copy everything from one array to a new one as I increase my available space.

      2. Keep a removable hard disk on site and about once a week plug it in and back up to it. That helps secure your data in the case of power supply or controller failure of your main rig.

      3. I am building a low power box to leave at a mate's place. We both have decent Internet connections and a new Wireless connection is on the way. Leave said box at mate's place and connect to it with a VPN. Write a bunch of scripts to encrypt everything that's changed since the last backup with my public GPG key (ie encrypt it to me) and upload the changed .gpg files to the remote server. This keeps my data safe in the case of theft from my house or fire or some other bad thing. The only thing I need to keep out of the backup regime and ensure that I don't lose is my GPG private key, which can be stored on 2 or more USB sticks and carried around with you on your key ring. It also pretty much negates the need for step 2.

      --
      I drink to make other people interesting!
    33. Re:Not enough history by thermowax · · Score: 1

      You're screwed. Why?

      Fire.
      Theft.
      Lightning, if you RAID it up and leave it online.

      I've been thinking about this for a while myself- this is for absolutely irreplaceable data. Offsite, secure storage is required. It can be online, but that's vulnerable to hacking. What we're really talking about is something you can put in a safe deposit box and ignore for 20 years.

    34. Re:Not enough history by Anonymous Coward · · Score: 0

      MicroFILM is much more common than microfiche. However, both have a very long life if properly stored.

      Microfilm (16mm) is the ONLY federally recognized "archive" technology. It's been in used for more than 100 years, and more often than not, you can still recover about 95% from film 100 years old.

      I worked in the field converting film and paper to digital images.

      Every land record since incorporation is stored on microfilm somewhere (in the usa anyway). Most modern counties use a dual system. They use a fully digital access and storage system but also microfilm everything because the fed requires it.

      Advanced microfilm even includes indexing and fast search capability. Although, that usually requires it be made correctly, and it rarely is.

      Kodak makes a gizmo that burns a stream of images to microfilm. Very cool. WAY to expensive to use at home.

      Another advantage of micro film is that you can read it with a basic magnifying glass and a light source. Technology not required.

    35. Re:Not enough history by Whiteox · · Score: 1

      Yes but he may have a different fetish to yours.

      --
      Don't be apathetic. Procrastinate!
    36. Re:Not enough history by Anonymous Coward · · Score: 0

      Punch cards.

      Anonymous Coward...

    37. Re:Not enough history by Anonymous Coward · · Score: 0

      Hell, you could write a script to power it up once a month and then power it down, if people are worried about energy costs but dont want to keep it spinning 24/7.

      Or you could write a script that backs up all your data whenever this question gets asked on slashdot. Depending on which slashdotter you believe, your data will be backed up once a month/year/eon.

    38. Re:Not enough history by Whiteox · · Score: 1

      There is a saying for modern historians that when you are faced with too much data, your job is to select representative sets.
      No-one will ever be able to keep everything. I had to discard collections of 50 years of hard to get subscription papers, newsprint etc because no-one wanted them, no storage possible and no digital scanning available at the time. It got thrown out, with only some items kept. Even those are getting damaged over time, but I don't want to spend hours scanning them, storing the data or making them public. No-one cares for that sort of stuff and your own collections will eventually be thrown out. I'm faced with the same prospect and I hope someone will keep a representative set of what I had for a few years.
      Ultimately, you backup, then convert, then backup as a cycle until you die.
      Then, out of the blue, someone in the future will come across parts of your life represented by these collections, either in a 2nd hand shop, the garbage tip, deceased estate, relative or what/whoever and some of it may be kept.

      So you're approaching 60 and you still have 1000 LPs? Are you going to spend 45,000 minutes resurrecting them? Maybe some rarities.
      Photos? 8mm film stock? I have 1000's of photos of dogs in dog shows in the 60s and 70s. Most of the people are dead, the dogs certainly are and as they are slides, there is no captions or any indication what they represent.
      So the decision is - Is it worth archiving? Who will eventually be the custodian? What value has it for future generations?
      When you look at it like that, then there's very little you want to keep.

      Here's some links for everyone's consideration.

      Are you interested? http://www.cs.arizona.edu/patterns/weaving/lace.html

      What about this? http://dsr.nii.ac.jp/toyobunko/index.html.en

      Or: http://www.oucs.ox.ac.uk/ww1lit/gwa/document/8685?REC=6

      --
      Don't be apathetic. Procrastinate!
    39. Re:Not enough history by linhares · · Score: 1

      This "ask slashdot" question is like masturbation. It is kinda fun but, in the end, pointless. The answers haven't changed much since last month (or whenever it was posted last) but it will still probably get 500 responses. fapfapfap

      Or, precisely because we get it every month, there just may be a business opportunity in here...

    40. Re:Not enough history by Anonymous Coward · · Score: 0

      True comment. Upon the death of my mother, I'm going through all her collected stuff. Just focusing on pictures and slides that she and my father had. Most mean squat to me. I don't care about scenery pictures except those that my dad took with my mom in the foreground. The only real pictures of interest are those of family. I'll keep, scan and share those with my siblings and relatives; the rest get tossed. The letters I'll keep and pass on to my kids. Let them figure out what to do with them.

      My kids will probably toss most of the stuff that I've accumulated (digital and paper).

    41. Re:Not enough history by sydb · · Score: 1

      Except:

      I'd rather not pay yearly rent on a server or backup/storage solution.

      I think this is just a case of "cheap, reliable, easy - choose any two". The problem is we have a refusal to eliminate one of these.

      --
      Yours Sincerely, Michael.
    42. Re:Not enough history by Anonymous Coward · · Score: 0

      Create a G-mail account and mail your documents to yourself. As I understood, these items will be archived "forever" by Google.

      Jan Bettink

    43. Re:Not enough history by speculatrix · · Score: 1

      Do you want to keep converting someone else's stuff when you have a few terabytes of your own to attend to?

      When my uncle died, we inherited photos in boxes, photos in albums, and negatives, some 50+ years old. We don't know who many of the people are. I have scanned many of the significant photos - I assume they're significant because they're enlargements which would have been costly. The chances of the people being identified reduce every year as relatives die off.

      The above problem is relatively small, as we're talking less than a hundred photos. One day someone might inherit many thousands of photos and hundred of video snippets that I created. The only information on them is the date and EXIF data. That means in 50 years the person inheriting the collection, if it were preserved, would have a massive problem deciding what's important or not! Even if the camera had a GPS for geo-tagging it would only give a small clue.

      There's a reason we change jobs, a reason we get new friends and lose old ones, a reason we grow old and die: it's because it's important to not be burdened by the past, burdened by our own legacy, to forget, to move on...

    44. Re:Not enough history by Anonymous Coward · · Score: 0

      the answer is: in a little while cloud services will be good and cheap enough so you put it all in the cloud, and whoever is holding it will back it up several times, and you'll be happy snappy; all this shit is only going to be done at home for a little while longer

    45. Re:Not enough history by OhHellWithIt · · Score: 1

      I let a lot of my early programs go. I had them on 9 track digital tape, at 6250 bpi. I had converted some of them from the lower bpi rates before. It used to be that each employer I worked for had one of those tape drives and I could access the files

      I had a similar problem. I brought home a bunch of programs from college on punch cards. Eventually I threw them out when I realized that not only could I not read them on anything, but I would have absolutely no use for them if I did.

      Now, if I could only get myself to do the same thing for the rest of the stuff in my basement.

      --
      "Who controls the past controls the future. Who controls the present controls the past." -- George Orwell
    46. Re:Not enough history by jgrahn · · Score: 1

      When my uncle died, we inherited photos in boxes, photos in albums, and negatives, some 50+ years old. We don't know who many of the people are. I have scanned many of the significant photos - I assume they're significant because they're enlargements which would have been costly. The chances of the people being identified reduce every year as relatives die off.

      [...] There's a reason we change jobs, a reason we get new friends and lose old ones, a reason we grow old and die: it's because it's important to not be burdened by the past, burdened by our own legacy, to forget, to move on...

      That's one opinion, but ... don't you feel you're losing something valuable which cannot be recreated?

      We've gone through my grandparent's papers in recent years (although they died in the early 1980s). There were photos: my grandfather working in the fields, his brothers as dressed-up immigrants in the USA, my uncles playing soccer or fishing, school photos from first grade where I knew some of the children as old men ... These are valuable to me. I don't want to live as if there was nothing before me.

  5. Well by areusche · · Score: 1

    No storage medium is perfect. Remember paper? That actually roots after a period of time.

    Personally a data CD is probably the best most long term solution. Not a DVD mind you, a 700 or 800mb data cd. Get some cases for it and call it a day.

    Mind you, there isn't a perfect way to store any type of information for long periods of time. Personally I think a CD would be the safest bet.

    1. Re:Well by Anonymous Coward · · Score: 5, Funny

      Remember paper? That actually roots after a period of time.

      That's why paper is specifically prohibited on my network.

    2. Re:Well by Eravnrekaree · · Score: 1

      for 500 GB that would be a lot of CD-Rs. While CD-R lasts longer than DVD-R, i am still skeptical. Buying 3 500 GB hard drives every few years and using them in RAID might still be the best solution.

    3. Re:Well by eclectro · · Score: 1

      It takes a long time before paper takes root though.

      --
      Take the cheese to sickbay, the doctor should see it as soon as possible - B'Elanna Torres, "Learning Curve"
    4. Re:Well by WhatAmIDoingHere · · Score: 1

      If you keep the CDRs in a dark air-tight safe, maybe. But any sunlight or air will eventually destroy those CDs. I had a couple of burned discs sitting around on my shelf for 2 years, and when I went to use them they were totally destroyed.

      --
      Not a Twitter sockpuppet... but I wish I was.
    5. Re:Well by synthespian · · Score: 1

      Acid-free paper (used in all quality book, e.g., Springer Verlag) is a very durable medium.

      In fact, Gartner research recommends paper if storage is supposed to last over 10 years.

      http://www.computerworld.com/hardwaretopics/storage/story/0,10801,64684,00.html

      Of course, digital content management peddlers will disagree.

      Check out the following story: a University of Southern California neurobiologist wanted to read the records of the Viking mission but couldn't read the data on the magnetic tapes. He had to find the paper records and pay people to type everything that was in them:

      http://www.usatoday.com/tech/news/2003-01-17-digital_x.htm

      Photos are also stable if they weren't made with the new inkjet printers (all those fast photo services), but the old-fashioned way. They last for many decades, as we know from family photos and museums.

      --
      Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
    6. Re:Well by Anonymous Coward · · Score: 0

      Good paper and pigments last for centuries. I have a print from about 1680 on the wall. Some texts outlive their civilizations.

  6. Magnetic Tapes... by tgatliff · · Score: 1

    Even-though longevity of magnetic tapes has not been explored extensively, it is understood that if they are kept in a stable humidity and temperature environment with no light, that they should last for at least 20 years.

    1. Re:Magnetic Tapes... by ewilts · · Score: 5, Interesting

      Having a tape around for 20 years doesn't do you any good. 20 years ago, I was writing to 1600 and 6250bpi tapes. Today, my data center doesn't even have a drive that can physically read them.

      Today's tape technology is no different. 3 years ago was writing to SDLT tapes. By next year, I won't even have an SDLT drive in my data center, having migrated everything over to LTO.

      Yeah, I have round tapes in my offsite storage. I have 4mm and DAT tapes out there. We're just wasting money storing the media, since we have nothing that can read them.

      If I could read the old media and extract a really old database, would today's database app be able to read it? Probably not. And could I install that app on today's OS? Probably not. And could I install the OS from many years ago on today's hardware? Probably not. Could I compile source from 20 years ago with today's compilers? In many cases, actually I can't. And if it really did all magically get compiled, is anybody around that can still knows how to run the app?

      Don't forget that 20 years ago, many systems didn't have TCP/IP installed. In 1988, mine didn't - it was a combination of RS232-attached terminals and XNS-attached graphics workstations. Drive sizes were 80-160MB. A couple of MB of memory was a lot.

      For those of you not still in school, ask around and see how many folks in your IT department can name the server that held your financial data 10 years ago.

      --
      .../Ed
    2. Re:Magnetic Tapes... by klashn · · Score: 0

      And could I install the OS from many years ago on today's hardware?

      You can definitely do that today with DOS and Windows 3.1 on todays hardware. If we move away from x86 architecture, then everything is out the window, but for now, yes, we can install OS from many years ago on todays hardware.

    3. Re:Magnetic Tapes... by maxume · · Score: 4, Informative

      So start building VM's of operating systems and software that are in use. Archive those. Far from perfect or complete, but it should narrow the scope of the problem a little bit.

      As far as personal stuff, I think the best solution is to have 2 or more live copies of all important data and just migrate them to whatever makes the most sense at a given point of time, and then also have backups of stuff. That doesn't work with the question, but there isn't really a cheap answer to the question at this point.

      --
      Nerd rage is the funniest rage.
    4. Re:Magnetic Tapes... by jonaskoelker · · Score: 1

      20 years ago, I was writing to 1600 and 6250bpi tapes.

      Does anyone else smile when they think of capacity measured as "bits per inch"? ;)

    5. Re:Magnetic Tapes... by Gilmoure · · Score: 2, Interesting

      At my work (18,000 desktop Nat'l research lab), we have a special group to keep old hardware around, just so they can go and pull data from old archives.

      --
      I drank what? -- Socrates
    6. Re:Magnetic Tapes... by jimicus · · Score: 2, Interesting

      You can definitely do that today with DOS and Windows 3.1 on todays hardware. If we move away from x86 architecture, then everything is out the window, but for now, yes, we can install OS from many years ago on todays hardware.

      Tried that lately? Not sure about DOS and Win3.1, but certainly Win95 had a boatload of timing issues on newer hardware.

    7. Re:Magnetic Tapes... by Anonymous Coward · · Score: 0

      And so, 20 years from now, will you have the VM software that lets you mount your VM?

    8. Re:Magnetic Tapes... by Duckie01 · · Score: 1

      Even if we move away from x86... there'll be x86 emulators.

    9. Re:Magnetic Tapes... by nine-times · · Score: 4, Insightful

      I think your post is very insightful, and I have an additional problem to throw into the mix: sorting through all the crap you've archived, even assuming you can read it all.

      I don't know about you, but I've run lots of different backups on lots of different systems, and one of the problems that always comes up is just finding the revision of the file you want. People say, "I want the copy before I made this revision-- I think I did that about a month ago." Check the backups and there are no revisions from a month ago, but there are 20 from the month before. Next thing you know you're checking 20 copies by hand, and none of them are what you're looking for-- and that's even when your backup/archive system is working.

      So when devising any kind of archive, I think it's at least worth considering, "How am I going to find what I'm looking for in 20 years?" Imagine yourself in 20 years, and you have every piece of data you've ever generated stored on some kind of media that holds hundreds of terabytes of data. You want to find some spreadsheet you made today (20 years ago). Maybe you don't remember exactly when you made the document-- you think about 15 years ago, but it's actually 20. You can't really remember what the filename was. You can't remember if you made it in Excel or OpenOffice, so you're not even sure what filetype you're looking for. What's going to be your method for finding that file?

      I'm not suggesting it's an insoluble problem. It might be that it's not even a problem in 20 years because indexing/searching has become so good that your AI will be able to sort through terabytes in a couple seconds and make some good guesses about what you're looking for, but do you really want to rely on that happening?

    10. Re:Magnetic Tapes... by ewilts · · Score: 1

      Your post sums up exactly why a long-term backup is not an archive. I have to explain this to my user community several times per year.

      I help manage hundreds of terabytes of storage at work - so many I don't even regularly keep track and don't blink an eye when we place an order for 10-20 additional terabytes. I also helps manage the backups for that data. And get the requests for restores of the form "it was named something like this, existed somewhere around this time, and might have been on this server or in transit to this other server, might have been processed and renamed, but it's really, really important and I need it back in an hour". And why don't we do full backups of everything every day and keep the media onsite forever?

      --
      .../Ed
    11. Re:Magnetic Tapes... by maxume · · Score: 1

      If you are really serious about it you will; it might be running in a VM though.

      --
      Nerd rage is the funniest rage.
    12. Re:Magnetic Tapes... by ion.simon.c · · Score: 1

      And why don't you do daily full onsite backups? Sounds like you'd save a ton in shipping costs and software complexity!

      NB: NOT SERIOUS.

    13. Re:Magnetic Tapes... by Blue23 · · Score: 1

      Today's tape technology is no different. 3 years ago was writing to SDLT tapes. By next year, I won't even have an SDLT drive in my data center, having migrated everything over to LTO.

      After going through a large effort recently to migrate old tape formats to new tape formats, I hear you. Now I try to take that into consideration. Which doesn't mean sitting on tapes for 20 years, but migrating them over time.

      For instance, we've moved from LTO1 to LTO3 tapes, and the LTO3 can read all the LTO1s, and TSM can fairly easily migrate without needing a lot of my time. Planned in 2010-2011 is to go to LTO5, which can read LTO3.

      The other half of this is that with copy pools I have (at least) two copies of the data, so media failures (and there will be some across that many TB of data) can be overcome.

      To get back to the OP, it takes regular migration to new media types and enough redundancy to overcome mathematically inevitable corruption in order to have long term storage.

      Good luck.

      --
      LITTLE GIRL: But which cookie will you eat FIRST? C. MONSTER: Me think you have misconception of cookie-eating process.
    14. Re:Magnetic Tapes... by beezel · · Score: 1

      Do you really think that he's trying to backup and restore apps or databases? He's trying to make sure his diary and nudie pics don't get lost. 10-15 years ago i was storing naked pics of girls in gifs, and they're still readable (tho maybe not tasteful). Tape would work for longevity, and if it comes down to the hardware - just dont throw it away. We're not talking about a crazy enterprise situation where the farm is getting cycled. We're talking about some boxes in this guy's garage. If the data is important to you and the storage mechanism holds it - dont upgrade. just keep that box around w/ the tapes.

    15. Re:Magnetic Tapes... by Anonymous Coward · · Score: 0

      he could consider Tapestry (http://www.inphase-technologies.com/products/default.asp?tnn=3)

      50 year disk life, and their company is focused on professional archivers.

    16. Re:Magnetic Tapes... by Anonymous Coward · · Score: 0, Interesting

      to be perfectly honest people think their data is so important as to be wanted or needed in '20 years time'. Perhaps there is some data that might, but mostly it's just a digital form of hoarding, nothing more.

      Just look around you and wonder what you don't have now that you wish you could get back from 20 years ago. If you're honest with yourself, I doubt there is much really, and what there is, is probably not appropriate for a digital world (an old favourite cuddly toy, or a sentimentally valuable lamp that was lost in the last move).

      I'm always surprised for instance how much email people keep, my archived inbox is has never really been more than a couple hundred MBs. I've never in the 15 years or so I have had an email account (and moved/iported inboxes) ever regretted anything I have deleted or inadvertently lost.

      Perhaps with the rise of MP3s and movies on line this might be more relevant, but I think not that much. People just buy it again. Think Vinyl, cassette tape, CD and now MP3.

      It's mostly about ego or 'my backed up stuff is bigger than your backed up stuff' with people measuring their personal worth in the amount of bits they keep.

    17. Re:Magnetic Tapes... by HiThere · · Score: 1

      Wrong, I think. MSWind95 wouldn't install on recent hardware in 2002. It couldn't handle the CD drive. There are ways around it, but the only ones that I have seen proof of are illegal.

      One *OUGHT* to be able to run MSWind95 under an emulator, but the emulators I'm familiar with don't emulate an old enough CD drive.

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    18. Re:Magnetic Tapes... by raddan · · Score: 1

      Tape is underrated. I don't understand why people say that it's not practical. Look-- if you value your data, keep your drives with your tapes. It's worth the cost. Have at least one spare drive. Linux and BSD have excellent legacy tape drive support. I have SCSI devices from the early 90's that still work on my linux machines! But if you're paranoid, periodically move your data to newer formats.

      Data density on modern tape drives is excellent. LTO-4 has a native capacity of 800GB, and tapes are in the $60-$70 range. They have a shelf life of 30 years. Sure, drives are expensive, but you pay that cost once, and otherwise, it's an outstanding bargain. IF, that is, you actually assign any value to your data. Where I work, our IP actually is the source of our revenue (publishing company), so I can pretty much fix a cost to the data itself, and for us, it's a no-brainer.

      I typically use my bonus pay at work to pay for things that are normally expensive necessities: new tires, repair work, etc. Assuming that we get bonuses this year, I'm putting it toward a better archival system. Right now, my strategy at home is: make many, many copies, and store them in different places. A tape system would be a nice improvement.

    19. Re:Magnetic Tapes... by ewilts · · Score: 1

      Assuming that we get bonuses this year, I'm putting it toward a better archival system. Right now, my strategy at home is: make many, many copies, and store them in different places. A tape system would be a nice improvement.

      I've got a tape drive at home. A Colorado T3000 that connects to a floppy controller and holds 1.6GB of uncompressed data. Yours if you want to come get it. I think it's about 15 years old. Great archive device with about zero chance of me getting anything off of any of the old media I may have lying around in a fireproof lockbox.

      Personally, I'm using a combination of multiple copies of my data in my house (automatic backups via Windows Home Server), daily live backups to the "cloud" (KeepVault) and an occasional off-site copy sitting in my office desk drawer when I remember to bring the drive home and update it.

      The people who think that storing a tape drive offsite with their tapes for long-term archival may also get an unpleasant surprise when they come to realize that systems 10 or 20 years from now won't even have the physical interface necessary to connect that tape drive. How many of your systems have a floppy interfaces to connect that T3000?

      --
      .../Ed
    20. Re:Magnetic Tapes... by Anonymous Coward · · Score: 0

      ...People say, "I want the copy before I made this revision-- I think I did that about a month ago." Check the backups and there are no revisions from a month ago, but there are 20 from the month before. Next thing you know you're checking 20 copies by hand...

      There actually is a software solution to the problem you just raised.
      It's called BOS, the Backup prOxy Server, and it solves the exact same problem you mentioned.
      It uses a simple HD as a storage medium and keeps a complete database of all the files it keeps a backup of, revisions and all.

      The magic in all of this is that there is no archiving at all.
      all your files are ready for use as soon as you can find them on the filesystem.
      The downside is that its proprietary and Windoze only software.
      Well... you win (no pun intended) some and you loose some I guess...

      http://www.chief-group.com/?doc=150017

    21. Re:Magnetic Tapes... by harry666t · · Score: 1

      I've got an original Windows 95 installation CD (from ~1996) somewhere around, I think I'm gonna check it out.

    22. Re:Magnetic Tapes... by synthespian · · Score: 1

      What LTO-4 tape drives do you recommend that work with FreeBSD?

      --
      Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
    23. Re:Magnetic Tapes... by redhog · · Score: 1

      I have seen a working PDP11-emulator for Linux, with full OS running. Ok, there might not be a real PDP11 around since many years, but popular hardware gets emulated :)

      --
      --The knowledge that you are an idiot, is what distinguishes you from one.
    24. Re:Magnetic Tapes... by Reziac · · Score: 1

      DOS (by which I normally mean M$DOS7 from Win95 or Win98) has been fine on everything I've tried it on, up into the P4-3GHz range. However, Win95 (OSR2.0b, the good one) threw up all over my now-antique P3-550/BX440 chipset. IIRC it installed fine, but would not run.

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    25. Re:Magnetic Tapes... by Jorophose · · Score: 1

      I think that's why we need "proper backups". No more of this silly business where every day a new backup is sent out...

      Rather, you've got RAID1 or RAID5 on your workstation/server, and anything considered "finished" (movies, music) should go onto CDs or DVDs (with whatever quality you want/need). And then you make a second copy and send it off to a friend for safe keeping. Your typical office crap? I really couldn't give less of a damn if I lose the documents I work on, unless it's something I haven't finished or sent to its proper place or printed (whatever applies).

      I mean, if I ever lost my music collection, well, it wouldn't take me very long to fire up a P2P client and get it all back. Same for movies, just longer. Only pictures are fairly important; and even then, I have relatives in Northern Europe I send photos to, and they send me photos back, both deadtree and digital. That's pretty good for backups, don't you think?

      And it's not like any of this really matters. Photos often ruin the moment; sure, proffesional photography is nice... But I'm not going to cry too much if I lose a picture of me at some party. And did it really warrant taking the time to snap a picture right in the middle of something? If it was important, I'll remember it. I remember faces, places. And I don't really have a spectacular memory. But I remember. I don't need photos; they're nice but not essential.

      I don't get how you can get 500GB or 2TB or more's worth of stuff and start fretting about how to keep it all together. P2P is your friend. I don't need tapes; I have the internet.

    26. Re:Magnetic Tapes... by toddestan · · Score: 1

      I think most people realize that they'll only ever use a small fraction of the data they keep around ever again. The problem is trying accurately predict which fraction is the fraction they are going to need again. So it's safest just to keep it all around, and with storage so cheap, why not?

      The cheap storage also has another, compounding effect: People find it cheaper and easier just to keep everything rather than spending the time and effort to try to sort through it all.

    27. Re:Magnetic Tapes... by centuren · · Score: 1

      I'm sure my parents remember their wedding, but I'm personally very appreciative that they had a friend take some photos.

    28. Re:Magnetic Tapes... by nine-times · · Score: 1

      Well first, RAID is not a backup. It's just hardware redundancy. If you delete a file or overwrite a specific revision, it's gone. If a file gets corrupted, it can get corrupted on your mirrored drive just as fast.

      CDs and DVDs are decent backup media, but better in terms of archives. There's a difference. Archives are probably what you're talking about when you say things are "finished", backups are those "silly" things that we do on a regular basis.

      P2P might be fine for popular movies or music, but it doesn't really work for personal stuff or confidential/sensitive business information.

      If you don't have any data you care about, then fine, don't back anything up. Some of us have >10 TB of business data, and god forbid, if the server room catches fire and all the servers melt, we'd like to be able to get back all the business-critical data as it was the day before the fire.

    29. Re:Magnetic Tapes... by SanityInAnarchy · · Score: 1

      How many of your systems have a floppy interfaces to connect that T3000?

      Two out of three. Before I sold the last one, three out of four.

      Granted, I don't use a floppy. I kind of wish they'd put something more useful there instead. But people still do use floppy interfaces, and floppy drives. It's still (sometimes) more convenient to put disk drivers (RAID, etc) on a floppy for XP, rather than finding a trusted Windows machine to use nLite on.

      Oddly enough, I'd be worse off if it was a SCSI tape system...

      --
      Don't thank God, thank a doctor!
    30. Re:Magnetic Tapes... by Anonymous Coward · · Score: 0

      So you're problem is you don't keep the hardware you use to read your backups? Wow, that's a real pickle you're in with a complete lack of options</sarcasm>

    31. Re:Magnetic Tapes... by jimicus · · Score: 1

      There was a timing issue which caused exactly that on anything faster than about 350MHz.

      A patch was released for it but personally, I found it wasn't very reliable and you were better off installing Win98.

      I have had some success running Win98 under virtualisation on modern hardware.

    32. Re:Magnetic Tapes... by raddan · · Score: 1

      I really like the Quantum Scalar series (we have a Scalar 24). We use fibre channel to attach them. We're presently using Linux with this device, but it works fairly well.

      FreeBSD is "Quantum Certified" to work with many of their drives, so I suggest having a look at their compatibility matrix to see if you can find something with your interface of choice and in your price range.

    33. Re:Magnetic Tapes... by Reziac · · Score: 1

      Win98 seems to do okay on 3GHz machines, far as I've seen. What issues have you seen?

      Considering that in consumerspace, the P60 was still newfangled and the P100 was bleeding edge when Win95 came out, guess we shouldn't be too startled if it falls over on vastly faster hardware :) I preferred the utterly slick performance and relative simplicity of Win95, but was forced to W98 by this issue.

      Tho sometimes what gets blamed on Windows ain't wholly its fault. Frex, the 47 day rollover bug *doesn't* happen on about half of all hardware. Seems it needs a matching system timer (hardware) bug to manifest. -- Myself, I've never had an everyday system with this bug, so I think it's normal for Windows to run for months on end. :)

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    34. Re:Magnetic Tapes... by jimicus · · Score: 1

      This was Win95 and about 9 years ago, but on my K62-400, it generally crashed horribly towards the end of the boot process. Disabling L1 and L2 cache in the BIOS enabled it to boot fine - provided you didn't mind it being unusably slow ;)

      Win98 was fine.

      I stand by my original point, though. Just because the hardware is in theory compatible all the way back to the original 8086 doesn't mean it's safe to assume that you can still install DOS and fire up your 20 year old application.

    35. Re:Magnetic Tapes... by Reziac · · Score: 1

      Some of the K6-2 CPU family, notably 400MHz chips made in (IIRC) Sept 98, had a bug that crashed 32bit code. (Linux wouldn't run on 'em either.) Win95's installer is 16bit.. so it'll install but not run. Win98 seems to have included a workaround for this CPU bug. You seem to have discovered a new and different workaround for Win95. :)

      A friend encountered it and raised hell with AMD until he got to a tech dude who knew what the problem was, and who confirmed that it was indeed a CPU bug. But AMD refused to warranty these chips, and the bug also affected one of my customers ... which is one of several reasons I've become an Intel CPU bigot. :)

      As to your original contention... I agree, you can't count on compatibility across the ages. And even if DOS runs perfectly on 2025AD's 30GHz CPU, that's no guarantee that your DOS *apps* will still run -- witness the many old Pascal apps that die with "Runtime Error:200" on 200+MHz CPUs, and the available patches don't fix all of 'em. Emulators are not perfect. Media adapters are quirky. The reasons why digital data can become orphaned are as endless as the varieties of digital storage!

      Most of my irreplaceable data is essentially text... and I'm reminded that I ought to update my printouts. Time to drag that out old cheap-to-run pin-impact printer and find that case of perforated paper :)

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    36. Re:Magnetic Tapes... by Jorophose · · Score: 1

      That's a wedding; it's not your birthday that you can barely remember...

    37. Re:Magnetic Tapes... by Jorophose · · Score: 1

      I don't know, I just find it unusual how you can accumulate TBs worth of critical information...

      I've personally considered just getting an external hard drive and making daily backups to it, and then burning any videos/music I might have to CDs and DVDs...

      I know RAID isn't backups. It's only there to stop you from losing all the data off a hard drive... But how many times do you delete a file and decide you want it back?

    38. Re:Magnetic Tapes... by Terje+Mathisen · · Score: 1

      Magnetic tapes are used to store the raw data of seismic (oil) survey missions: Here in Norway we keep them in an underground temperature and humidity-controlled vault in the central part of the country.

      Every 5 years all the pallets of tapes have to be taken out, re-read and copied onto newer technology tapes, but each time they find that this interval is sufficient to suffer permanent read errors on at least some tape files, which means that that particular piece of data is gone forever.

      Terje

      --
      "almost all programming can be viewed as an exercise in caching"
    39. Re:Magnetic Tapes... by nine-times · · Score: 1

      I don't know, I just find it unusual how you can accumulate TBs worth of critical information...

      I assure you it's not hard if your company deals with lots of audio/video files. To take an extreme case, uncompressed 1080p video can bitrates as high as 3Gbps. Given that, 1 TB can hold as little as 45 minutes of video.* Is it really so hard to imagine that someone somewhere might have multiple terabytes of critical data?

      I've personally considered just getting an external hard drive and making daily backups to it, and then burning any videos/music I might have to CDs and DVDs...

      That's a fine backup strategy for your personal data. When I'm talking about video, though, I'm not talking about archiving my iTunes video collection.

      But how many times do you delete a file and decide you want it back?

      Me personally? It's happened a couple times, but it's very rare. But how many times has one of my users accidentally erased or overwritten data that they or someone else needed? That happens all the time. And it also happens sometimes that one of our scripts purges something it shouldn't have, or that someone changes data in one of our databases that shouldn't have been changed.

      * (1sec/3Gb)*(1min/60sec)*(8Gb/1GB)*(1024GB/1TB)=~45 minutes per Terabyte

  7. Flash drives by devman · · Score: 1

    For important stuff I use memory cards in my safe deposit box at the bank. I could see flash being a viable long term storage, some of them coming out with 10 year to lifetime warranties.

    1. Re:Flash drives by timeOday · · Score: 3, Interesting
      I'm excited about write-once (WORM) flash. All sides seem to agree it will be more stable, and preventing overwriting is just as important as hardware failure or format obsolescence. The only problem is this product was announced in June and still isn't available, even at sandisk's own website.

      By the way, I *have* had an SD card fail. It was in my digital camera the whole time, worked fine for a couple years, then quit. The camera itself showed no sign of damage, so I don't think it was abused. It was a Kingston, too, which I consider reputable.

    2. Re:Flash drives by boner · · Score: 2, Informative

      Actually Flash/memory drives are sensitive to radiation. Long term storage without regularly accessing the drive can lead to situations where blocks go bad beyond the ECC/CRC capabilities of the drive to fix. If you intend to store valuable data on memory devices for the long term you should (a) use multiple redundant drives (b) use a file-system with block-level ECC/CRC error correction and redundancy (like ZFS) (c) write each block to the device twice in different location (i.e. an mirror on the drive).

      The future of Flash memory is such that unless they extend the ECC/CRC capabilities of the controller, the susceptibility of these devices for radiation will increase when the cells get smaller.

      In case anybody doubts the impact of radiation on electronic devices, here is an interesting experiment you can do: take your digital camera, put the lens cap on and do timed exposure with increasing exposure times (1,2,4,8, ... seconds). Then analyse these pictures for bad-pixels, or better, subtract the pictures from each other. The random bits scattered around on these frames are impacts of cosmic rays. Now apply the same principle on memory devices with much longer exposure times...

      To cut my somewhat rambling post short: use memory devices as long term storage? No. Not without thought about the required data reliability.

    3. Re:Flash drives by TeknoHog · · Score: 1

      Looks like somebody reintenved PROM. Besides, the word "flash" in flash drives refers to erasing whole blocks at once so it doesn't make sense in a write-once medium.

      --
      Escher was the first MC and Giger invented the HR department.
    4. Re:Flash drives by Anonymous Coward · · Score: 0

      I have also seen flash memory drives fail. They do have a finite life, affected by the number of read/write operations. Leaving one permanently plugged into a computer will lead to its early death.

    5. Re:Flash drives by Anonymous Coward · · Score: 0

      Random bits, no streaks from rays coming in on a shallow angle? That's more like noise generated inside the camera.

    6. Re:Flash drives by Anonymous Coward · · Score: 0

      Flash memory is known to fail after so many read-writes. Its likely thats why the card failed. If he only wrote to it once it might stay there. The issue with memory spoilage, mainly dvds, is that there are organic components that literally spoil. Flash memory just has not been around long enough to know what will happen if its left for long periods of time.

    7. Re:Flash drives by HiThere · · Score: 1

      I think a lot of what you're seeing is thermal noise. Not certain, though. This doesn't mean that radiation isn't a problem over longer spans of time, it's just that I don't trust your example.

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    8. Re:Flash drives by boner · · Score: 1

      noise generated in the camera is a statistical fluctuation (0-aT) T exposure time, a is temperature dependent noise coefficient, cosmic particle hits act like a stuck pixel. Cosmic rays definitely do not streak in digital devices. They most commonly dissipate their energy in one cell only.

    9. Re:Flash drives by boner · · Score: 1

      True, thermal noise is uniform over the detector (N=aT), N= noise level, a=Temperature dependent noise coefficient, T exposure time. Cosmic particle hits act like stuck pixels

    10. Re:Flash drives by synthespian · · Score: 2, Funny

      Ok, dudes, if he's got radiation problems, he's got bigger problems than data storage.

      --
      Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
    11. Re:Flash drives by mok000 · · Score: 1

      1. In 50 years, the supplier of your memory cards most likely will no longer exist

      2. If your data's gone, what good will the warranty be?

    12. Re:Flash drives by devman · · Score: 1

      I imagine over the course of fifty years I will be changing out the actual device as I update or add data to them. I also think its tough to say what will be around in 50 years and what won't.

    13. Re:Flash drives by Reziac · · Score: 1

      I've understood this to be a problem with insufficiently shielded system RAM too (ie. plastic-sided cases).

      Got any sample images to share? My digital camera doesn't do exposure times or lens caps (it's old and cheap :) but it sounds like an interesting experiment.

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    14. Re:Flash drives by SickLittleMonkey · · Score: 1

      No bigger problems than yours or anyone else's.

      E.g. from Wikipedia:
      "Cosmic rays constitute a fraction of the annual radiation exposure of human beings on earth. For example, the average radiation exposure in Australia is 0.3 mSv due to cosmic rays, out of a total of 2.3 mSv."

      You're being hit by these right now, and more than the above if you're reading this on a plane.

      SLM

      --
      main() {1;} // zen app
    15. Re:Flash drives by HiThere · · Score: 1

      Yes, but isn't *most* of the thermal noise auto-suppressed. Which should mean that you only get a statistical outlier now and again. And isn't this why they need to cool really sensitive cameras with liquid hydrogen to just a few Kelvins? (I'm not sure of this, as it may be that it's because they need to camera to be cool enough to not significantly emit in the wavelength being photographed.)

      But the thing is that thermal noise is *noise*. Which means that it follows a statistical distribution. Which means that you can't filter it perfectly, without filtering out what you're trying to capture. But you may be able to do a "good enough" job that it won't usually be significant. However if you take a timed exposure with no incoming signal, then it wouldn't be surprising if you got a certain amount of noise / time-interval that wasn't filtered. And the same filter that's suppressing most thermal noise is suppressing cosmic radiation (again, imperfectly). So what I'm proposing is that most of the "signal" detected with the lens cap on is from thermal noise. (P.S.: Thermal noise *also* degrades memories...so this isn't a really significant argument, except that perhaps it argues for storing the stuff in a cold dry place...at which point one needs to worry about static electricity, though that can usually be handled.)

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    16. Re:Flash drives by boner · · Score: 1

      The point is that the thermal filter cannot adjust for cosmic radiation because the cosmic ray hit is far above the thermal noise level.

  8. Not being answered by suso · · Score: 4, Insightful

    Obviously this question hasn't been answered for the general public because this is like the 4th year in a row that this question has been asked on Slashdot.

    1. Re:Not being answered by Albert+Sandberg · · Score: 5, Funny

      well the backups storing the questions have been lost.

    2. Re:Not being answered by sjames · · Score: 4, Insightful

      That's because there is no fully satisfactory answer. We'd all like a just do this, throw it in the corner and when you come back for it in 50 years it'll all be there sort of solution, but there is no such beast within the realm of affordability.

      It's a problem with several aspects to it as well. Let's say there is a SATA drive out there that absolutely CAN sit in a safe deposit box for 50 years and then work perfectly every time. In 50 years, all computers will have whatever the successor to whatever replaces SaS and when you mention SATA, the old timers will all get nostalgic and go on about tying onions to their belts (which was the fashion at the time). You'll then have to take the decidedly NOT affordable step of having someone build you a one-off SATA controller that can interface with a computer of that time. That is, if you can get the old-timers to stop reminiscing about the Vista debacle of aught eight long enough to recall the specifications of SATA. Be sure to duck, some of them might throw a chair for ilustration.

    3. Re:Not being answered by tomhudson · · Score: 1

      Obviously this question hasn't been answered for the general public because this is like the 4th year in a row that this question has been asked on Slashdot.

      Nah ... it's like a quiz on economic theory - the questions don't change, but over time the answers do.

      Hardware changes, software changes, the economics change. 20 years ago, hard drives were so expensive that they weren't practical for backups. Nowadays, hard drives are throw-away items.

    4. Re:Not being answered by crazyvas · · Score: 1

      well the backups storing the questions have been lost.

      The backups weren't lost. This /is/ the backup.
      The OP is extremely smart. What he did was to compress and encrypt his data, and hide it inside a seeming ask-slashdot submission. This guarantees a long life for his data, as /. will regularly dupe it.

    5. Re:Not being answered by synthespian · · Score: 1

      No, we'are conducting a hive-mind back up of this story.

      You just don't get it.

      --
      Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
    6. Re:Not being answered by xactuary · · Score: 0

      The parent comment may be true, but for me the scarier truth is just how many of the unwashed masses never do a backup. Hence, ANY effort to backup files practically screams l33+, except of course on slashdot. I've read through most of the comments on this topic and would only add that getting a copies off-site is very important. Stuff happens, locally.

      --
      Say hello to my little sig.
    7. Re:Not being answered by Anonymous Coward · · Score: 0

      Good answer.

      I still have data from 20+ years ago that started on a Commodore VIC-20, that I've migrated to the different computers, and media throughout the years. That first migration was through a null modem cable.

      Keep it moving. Or put it on paper

    8. Re:Not being answered by SanityInAnarchy · · Score: 1

      when you mention SATA, the old timers will all get nostalgic and go on about tying onions to their belts (which was the fashion at the time). You'll then have to take the decidedly NOT affordable step of having someone build you a one-off SATA controller that can interface with a computer of that time.

      This seems unlikely.

      First of all, just look at what's happened with IDE -- came out in 1986, lasted until SATA appeared in 2003, at which point many computers still came with IDE instead of SATA, and it's certainly not difficult to find an IDE drive, or a brand new motherboard with IDE built-in.

      So, it won't be easy, but you probably can find some old computer which has the correct interface. But how to get it from that old computer to a newer one? It seems incredibly doubtful to me that we'd have completely abandoned Ethernet, or USB...

      In fact, that might be the better solution -- USB is ten years younger, but new versions of it keep coming out, and these new versions are always backwards compatible. It'd be slow, but you can plug a USB 1 drive into anything -- including a USB 3 port.

      --
      Don't thank God, thank a doctor!
    9. Re:Not being answered by sjames · · Score: 1

      First of all, just look at what's happened with IDE -- came out in 1986, lasted until SATA appeared in 2003, at which point many computers still came with IDE instead of SATA, and it's certainly not difficult to find an IDE drive, or a brand new motherboard with IDE built-in.

      Yes, IDE has made it for 22 years so far. Do you really expect it to go another 28? (to reach my example of 50 years). Go back just a couple more years (nearly half as long as the 50 years I spoke of) and see if you can find an old RLL , ST506 or ESDI interface that fits in a PCI slot. Do you happen to know where I can find a PCI interface for a magnetic drum memory? (THAT is 50 year old storage tech.)

      If you have an original PC, your best bet for interfacing with it today is good old RS232.

    10. Re:Not being answered by epine · · Score: 1

      You're comparing an inception point against a saturation point. I'd like to think that all the crap around now will vanish as efficiently as you suggest, but I suspect not.

      If you have an original PC, your best bet for interfacing with it today is good old RS232.

    11. Re:Not being answered by sjames · · Score: 1

      I have no doubt that things being specified today will likely stick around much longer than they did in the past, but in a 50 year span, it's still best to assume at least one or two interim storage and format conversions.

      Even very long lived standards like Ethernet have seen changes over the years that could lead an old machine to get terribly confused. How often does an old machine with a 10base2 interface get included in a bake-off these days when new specs are tested, even when backward compatibility is part of the test?

      I can easily imagine an old ethernet card locking up as the newer machine is in the process of negotiating back to half duplex 10 Mbps. There's also a tendency once enough years pass to start dropping backward compatibility with the most ancient specs. I don't see copper going away terribly soon, but I can easily imagine in 10 to 20 years someone working on 30Gbps ethernet wondering if it REALLY needs to support CSMA. After all, when's the last time anyone used a hub? Evan if the compatibility is 'maintained', what are the odds anyone will dig up a media converter, hub, a machine that actually has an ISA bus and a 10base2 or 10base5 card to test against?

      Things like that are just likely enough that long term archiving MUST be assumed to be an active process rather than fire and forget even in the unlikely case that longevity of the physical media can be assured.

  9. Quality DVDs, archival storage, repeated backups. by wiredog · · Score: 3, Informative

    Long term:

    Use quality DVDs. Redo the backup on a schedule such that everything is re-backed up every three years or so. Every month, say, you make one DVD. Keep the backups in a climate controlled, dark, secure place, such as a safe deposit box at the bank.

    Short term:

    Back up everything you want to save to an external hard drive weekly. Every three months swap it with a drive kept in the safe deposit box.

    Daily:

    If you have a Mac, use Time Machine. If Linux, some sort of cron job running a Python script that copies /home to an external hard drive. If Windows, I dunno.

  10. 2 drives by Anonymous Coward · · Score: 0

    Just get two drives and put the data on both of them.
    Controllers can fail,2 drivers failing at the exact same time is unlikely.
    Also put your most important data in a location different than where the original is kept.

  11. Ask Slashdot AGAIN by Dun+Malg · · Score: 5, Informative

    How many times has this question been asked on Slashdot? I swear, it shows up on the front page at least three times a year.

    As for the question itself, the answer is pretty simple, but unhelpful. Basically what it comes down to is that there is no safe place for your data. You're asking for the best type of basket to put all your eggs in. If you look at it that way, the solution is easier to arrive at. Your choices are A) spare no expense and build/buy the world's strongest basket and pray no flaw arises, or B) start copying your eggs around to all sorts of cheap baskets and continuously add more baskets in the expectation that the oldest baskets are going to fail.

    Copy all your stuff to all your computers. Burn to DVD and/or CD ROM. Buy SD cards and USB flash drives. High capacity storage devices are so cheap now that you can keep all your valuable pictures of your vacation to Cleveland quite safe by constant duplication. That's the value of digital. Copies are perfect. Make lots.

    --
    If a job's not worth doing, it's not worth doing right.
    1. Re:Ask Slashdot AGAIN by eclectro · · Score: 5, Funny

      How many times has this question been asked on Slashdot?

      It needs repeated backing up.

      --
      Take the cheese to sickbay, the doctor should see it as soon as possible - B'Elanna Torres, "Learning Curve"
    2. Re:Ask Slashdot AGAIN by Anonymous Coward · · Score: 0

      I went to cleveland and all i got was this shoddy tshirt and a 500gb maxtor

    3. Re:Ask Slashdot AGAIN by Anonymous Coward · · Score: 0

      EMC Centera via Mozy

    4. Re:Ask Slashdot AGAIN by dubl-u · · Score: 1

      That's the value of digital. Copies are perfect. Make lots.

      This is similar to the answer that a lot of people interested in data preservation have come to over the years, including archivists and Kevin Kelly, one of the people behind the Long Now Foundation.

      However, one thing that people might not think of from your advice is to keep things hot, to keep them moving. You can't tell when a bunch of burned DVDs start to go bad. Knowing whether your data is good requires constant integrity checks, to make sure that you can still get your data back.

      Personally, the way I handle this is by running two hot servers in different locations with all of my critical data on both, with rsync updating nightly. With continuous checksumming turned on, that forces reads, so I know all the data is still safe.

      But for those who aren't quite that geeky, then I think a service like Amazon's S3 is the way to go. You keep one local drive hot, and one set of data in the cloud. And you regularly verify that all the data you want is still there. Yes, this costs more money than another local disk. But S3 data sits in at least two spots in a secure data center on an infrastructure designed by people who really get reliability.

    5. Re:Ask Slashdot AGAIN by Lost+Race · · Score: 1

      That strategy almost always leads to version reconciliation nightmares. Which of the N copies of this directory tree has the latest version of which file? Often different parts of a single file can be more current in different copies. Maybe in photos_collection copy #3 this photo has been cropped; in copy #7 it's been color-balanced.

      I'm not saying it's a bad strategy; I'm just saying you need to have a good policy and strong discipline about which copy is "live" and which is "backup". It can be very tempting to make "temporary" edits on a backup when that's the only copy you have handy.

    6. Re:Ask Slashdot AGAIN by Dun+Malg · · Score: 1

      I'm not saying it's a bad strategy; I'm just saying you need to have a good policy and strong discipline about which copy is "live" and which is "backup". It can be very tempting to make "temporary" edits on a backup when that's the only copy you have handy.

      Yeah.... I guess... I haven't had that much trouble with it though, and I am by no means disciplined. Keep one "live" version on the main system, continuously copy it out to all three servers, the other desktop, and the laptops and periodically burn it to DVDs. It's pretty easy to not edit a backup copy when it's stuffed into a sub directory under /var/z000z/backups/.

      --
      If a job's not worth doing, it's not worth doing right.
    7. Re:Ask Slashdot AGAIN by Agent+ME · · Score: 1

      This is why files have 'last modified' dates saved on them.

  12. Re:Quality DVDs, archival storage, repeated backup by Soul-Burn666 · · Score: 1

    What I have in my home system is a small back-up drive sitting inside the PC. Every night, the drive spins up, personal/irreplacable data gets rsynced to it (therefore very little work) and is spun down.
    Easy, cheap, and lets the HD work for the minimal time needed.
    Also serves as a sort of recycle bin if I mistakenly delete something.

    --
    ^_^
  13. Not the media that's the problem by jwilkins13 · · Score: 5, Insightful

    Here's the thing. Flash drives will *probably* last long enough. I wouldn't at all be surprised if they were still readable in 20 or 30 years. But a) what's the odds of your current WinUx or iHoloTablet having a usb connector in 30 years? and MUCH more importantly, what's the odds of having anything capable of reading those historic Word 2007, Acrobat 5, or any other type of file format in 20 years? Yes, there are some folks technical enough that they can still read and readily interact with Geoworks, Wordstar, Xywrite, etc. stored on 8" floppy disks. But if you ain't one of them, and I'm happy to admit I ain't, the fact that the flash drive is physically capable of being read in 30 years simply won't matter. That's why I crack up reading various vendors' claims of CDs, DVDs, BDs etc. lasting 50 or 100 or more years. The disks will be readable but you'll have no mechanical or logical way to read what's on them.

    1. Re:Not the media that's the problem by jwilkins13 · · Score: 1

      Someone reminded me in another forum that you could always microfilm everything - microfilm only requires a light source and a magnifying source, and a drop of water in candlelight would work in a pinch. Then again, who wants to film 2 TB of data? :)

    2. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      Psh, its not like he's gonna wake up 30 years later and go "Oh yeah! My data! I need it right now!" And then feverishly try to push the USB slot into the wireless receiver, and then start screaming "OH THE FOLLY OF MAN"

      If USB really started disappearing, you would go get your USB drives and copy the stuff off of them

      Besides, people used to back things up on 5.25" disks, a technology that was invented in 1976 according to wikipedia, and now, 32 years later i don't think you'd have to work too very hard to find a 5.25" drive to attach to your computer if you had to.

    3. Re:Not the media that's the problem by C_L_Lk · · Score: 4, Interesting

      There really is a simple way around this - and it is what I've done - I've got data 25 years old and it's still relatively easily manipulated with a little work. I've found floppy disks are relatively resilient, and old hard drives seem to keep their data for a long time. I've got a computer, display, keyboard, and associated peripherals stored for every generation of data that I kept:
      1.I have a Commodore 64 with floppy drive and cassette drive stored in a box with the floppy disks and cassettes from that generation (late 70s/early 80s).
      2.I have an IBM PC/XT with keyboard, a 5 1/4" floppy, 3 1/2" floppy, internal 20MB hard drive, and CGA monitor stored in a box with a load of 5 1/4" floppies filled with data from that generation (Mid 80s).
      3.I have an IBM RS/6000 with display, keyboard, and mouse and internal 500MB hard drive loaded with all my docs and projects from that generation (early 90s).
      4.I have a Pentium 2/300 PC * 15" monitor with windows 98, CD R/W drive, 3 1/2" floppy drive, and USB ports - and a crapload of CD's and 3 1/2" floppies full of stuff from that generation (Mid/late 90s).

      When the current generation looks like it's going to be moving on, I'll put away a Core 2 Duo system with 1 TB of hard drive full of stuff with the different OS's I used loaded on it with boot manager (Ubuntu, XP, FreeBSD), a crapload of USB keys full of documents, along with burned DVDs etc. That'll take care of the "'00" generation.

      The answer lies in not only archiving your data "of the generation" but the essential equipment needed to access it. I may have a heck of a time moving data off of my Commodore 64 - but I can at least see it and access it - I believe I stored a modem with it - so at worse I could set up a terminal server that it could dial into and dump data to. All the other systems I'm pretty sure I could recover stuff from - even if the PC/XT does have an MFM hard drive, etc.

    4. Re:Not the media that's the problem by jwilkins13 · · Score: 2, Insightful

      And what happens when the spring that pops the C64 floppy open finally gives out? The MFM drive? Etc. The hardware won't last forever, and even in a world of eBay goodness that's not really a viable solution for the majority of folks. I further submit that most of us don't have the familial/spousal support to keep 5 old clunkers around and operating.... :)

    5. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      Here's the thing. Flash drives will *probably* last long enough. I wouldn't at all be surprised if they were still readable in 20 or 30 years.

      And then again, the may not. I've had a flash drive fail on me while plugging it into the computer. Static had built up while walking across the room with it, put it into the usb port, arcness happened, and what do you know, the drive no longer worked.)

      But a) what's the odds of your current WinUx or iHoloTablet having a usb connector in 30 years?

      Probably pretty good. One can still get RS-232 standard on laptops if one looks, I'd imaging that there will be usbX to usb 2 converters.

      and MUCH more importantly, what's the odds of having anything capable of reading those historic Word 2007, Acrobat 5, or any other type of file format in 20 years?

      You may have heard of this up and coming technology called "Virtual Machines", which lets you run two or more different OS's on the same machine! Of course, maybe this VM thing is just a fad and in 30 years no one will remember this modern marvel

      Yes, there are some folks technical enough that they can still read and readily interact with Geoworks, Wordstar, Xywrite, etc. stored on 8" floppy disks. But if you ain't one of them, and I'm happy to admit I ain't, the fact that the flash drive is physically capable of being read in 30 years simply won't matter. That's why I crack up reading various vendors' claims of CDs, DVDs, BDs etc. lasting 50 or 100 or more years. The disks will be readable but you'll have no mechanical or logical way to read what's on them.

      With a bit of forethought, and some due diligence this can be avoided. Either store your data in the cloud and let someone else take care of the problem, or do a refresh of your data onto new media every 5 years or so.

    6. Re:Not the media that's the problem by jwilkins13 · · Score: 1

      I agree with your first point - in fact it kinda supports mine. "if one looks" - the vast and overwhelming majority of laptops AND their users will not look; if you think there will be RS-232 connectors on those 30-yrs-from-now laptops you have much more faith in manufacturers than I do. Shopped for a laptop with 3.5" lately? WRT VMs, my point above holds equally well - unless you move in different circles than I do ain't a lot of people loading multiple OSs and versions into VMs. And by a lot I mean more than the .01% technically inclined to do so - I mean the folks buying their laptops at Best Circuit FryMart. What about pesky licensing, pesky apps that don't run in VM because they phone home regularly for a license, pesky issues around providing VM that can fully support something as complex as Vista, etc. As to your final point, storing data in the cloud is good WRT storage media but still raises the question of readability. It's not useful for me for Google Docs or S3 to store my PDFs if I have nothing to read them in 30 years. Or is the intent to refresh all of your file formats every 5 years or so? B/c nobody does that today. You may. I may. But that same overwhelming majority doesn't. It also raises the problem of what happens to all that data when Microsoft buys Google, or Google goes broke, or gets nationalized and that service shut down, or or or....On the other hand, refreshing the data onto new media assumes a) you do it while the media and formats on it are still readable and that you refresh the data formats and that there are no things to intrude while that 500GB of data and media and format migration is taking place like a job or life. And that number's only going to increase for most of us.

    7. Re:Not the media that's the problem by Cyberax · · Score: 1

      No. Flash drives will ALMOST 100%-likely die.

      Flash cells are essentially small capacitors. And they slowly leak. So 3-5 years are probably the maximum life for USB drives.

      Also, I can read a CD printed in 1985. That's 28 years already. So I'm fairly certain that most media can survive 20-30 years.

      And after 20-30 years it'll be easy to re-encode it into different format.

    8. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      In what regards Word 2007 and Acrobat 5, the only solution is to avoid proprietary software.

      There's no vendor lock-in if you stick to open source solutions.

    9. Re:Not the media that's the problem by jwilkins13 · · Score: 2, Insightful

      Again, missing the point. I *know* that CDs - particularly printed or stamped ones as opposed to burned or RW ones - will last plenty of time. What I question is whether you'll have anything that can read that decrepit code in 30 years or that can play that physical media in 30 years. I also take issue with your point that "it'll be easy to re-encode it"...that's only if the disk and software on it are still readable and that's only in isolation. It is not at all easy to re-encode 500GB into a different format whether we're talking about file format OR media; put the two together and there is no easy answer today. NARA's ERA ain't it, the lofty encoding schemes like OAIS and EAD ain't it.

    10. Re:Not the media that's the problem by grumbel · · Score: 1

      But a) what's the odds of your current WinUx or iHoloTablet having a usb connector in 30 years?

      I don't know, but the odds of being able to connect USB will be close to 1 I would say. USB is all over the place today and it is so far backward compatible all the way down to the serial port or 3.5" drives, thanks to adapters. It won't just blip out of existence because Apple or whoever says so. USB also is used by tons of stuff, keyboard, mice, webcams and whatever, which will make it even harder to get rid of. And of course it doesn't really have a successor these days, which given how slowly PCs evolve will mean at least a decade of native USB slots in the box. Even if it becomes one day incompatible, you can bet that adapters will be available. And you also have to keep in mind that today almost every device speaks TCP/IP, which makes it very easy to copy stuff around, so even if the USB port is missing on your main device, it will very likely be present on another device and thus allow you to copy stuff around.

      The whole incompatibility data format thing is easily solved by emulation and really not much an issue if you just want to access the data. If you want of course transfer the data into whatever new system or format you are using you have to do some manual work, but that is hardly avoidable.

      Overall I find the whole compatibility issue to be really blown out of proportions. NASA certainly had issues reading old tapes, but that wasn't so much because they were old, but because the readers were *rare*. USB is not rare, there are likely more USB ports then people on the planet, they won't die out anytime soon.

    11. Re:Not the media that's the problem by jwilkins13 · · Score: 1

      Right. And that's workable for how many organizations today? And it's not just that - Acrobat is now an ISO standard (32000). Even if you could move everyone over to e.g. OpenOffice, ODF, etc. (and don't get me started about OOXML), it still assumes there is enough interest in 30 years for someone to have created a reader for them. Yes, you or your organization might be able to do that. What about your local municipality, who "standardized" on OOXML? Or the older version of ISO 32000 or ODF that is now deprecated b/c the standards bodies found a critical issue? Just saying.

    12. Re:Not the media that's the problem by Cyberax · · Score: 1

      It's funny, but just a month ago I helped to decode a program in Fortran from old 8" disks. It used an ancient Russian charset encoding in comments and that required several minutes of Yandexing to find a translator program which supported this encoding.

      Apart from that, we recovered ~35 years old program just fine. We were even able to run it in another 40 minutes!

      As for my data, I don't store anything in proprietary formats. Plain text and PDFs for documents, JPEG/PNG for pictures, etc.

      If you're very paranoid then store technical specs in plain ASCII text for these formats on your media.

    13. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      But a) what's the odds of your current WinUx or iHoloTablet having a usb connector in 30 years?

      The RS-232 connector is over 30 years old, but most new PCs still have one. And for the laptops that don't, you can buy a USB->RS-232 adapter. 30 years from now you'll be able to by a whatever->USB adapter.

      and MUCH more importantly, what's the odds of having anything capable of reading those historic Word 2007, Acrobat 5, or any other type of file format in 20 years?

      The CD music format is over 20 years old, but current PCs can still read it. The FAT-12 filesystem format is over 20 years old, but current PCs can still read it. Vinyl records have been around over 60 years, and you can still buy a device to transfer the content to a current PC.

      Basically, if it is a common consumer item right now (with readers available from a variety of vendors), you'll be able to buy a reader 30 years from now.

    14. Re:Not the media that's the problem by jwilkins13 · · Score: 2, Insightful

      You've got to be kidding me - and you didn't respond to my issue, which is file formats. Tried to read a Wordperfect 4.2 or Microsoft Word 1.0 doc lately? WordStar? Xywrite? Geoworks for C64? AutoCAD v2.1? Lotus 1-2-3 v2 with macros? Now think about how much more fantastically complex a Word or Excel 2007 document is, complete with pivot tables, lotsa macro-y goodness, and you really think those will be readable in 30 years? Cause it's not like anyone would do anything important in a complex and/or proprietary format, right?

    15. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      what's the odds of your current WinUx or iHoloTablet having a usb connector in 30 years?

      To prevent copyright terrorism, there will be no input or output devices for anything not licensed by the MPAA.
      Please report to room 101 for reeducation now.

    16. Re:Not the media that's the problem by rhizome · · Score: 1

      If USB really started disappearing, you would go get your USB drives and copy the stuff off of them

      Absolutely. The OP doesn't account for time itself being a transfer mechanism.

      --
      When I was a kid, we only had one Darth.
    17. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      Also, I can read a CD printed in 1985. That's 28 years already.

      You might want to unarchive some of those arithmetic operations from the big hard drive called the brain.

    18. Re:Not the media that's the problem by mebrahim · · Score: 1

      What? You mean you haven't kept any power supply suitable for each generation?!

    19. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      It's the internals of NAND flash I'd worry about, not the USB interface. USB is ubiquitous today and doesn't have any moving parts, which makes it extremely likely that there will be adapters from USB to whatever its future successor is... much like today you can find anything-to-USB adapters.

      Flash has to hold a charge, though, so I'd worry about whether the material can physically maintain that ability for decades unplugged. We already know that the material does degrade (because it'll fail after enough re-writes).

    20. Re:Not the media that's the problem by decavolt · · Score: 1

      Simple?

      You have 4 boxes and all of the peripherals needed to run them. You're not just archiving data, you're archiving hardware. Great for some, but pretty impractical for most.

    21. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      If you just keep it in a box, just the archive, no 'active' version of the data, then why keep it to begin with? Are an historian, planning to become one. start a museum, or want to leave a 'pile of old data inheritance'?

      All this new technology has lead to a new type of hoarding, data hoarding.

      Sort out just the good stuff, and keep that in a well preserved state. Display/keep it somewhere where you can enjoy it (ah the memories). The keywords here are 'the good stuf' and the action is 'sort it out'.

      Everything you couldn't be bothered to sort out in the last five years, you won't be interested in five years from now either.

      I guess it takes 'getting older' to realize that some things just slowly keep getting less interesting, for yourself but at least as much your data for others...

      When its just sits getting stale, with bugs getting into it, get rid of it.

      Just like how, when the junk starts spilling out of the piles in their garage, people tend to go out and start paying montly rent on 'storage units' that they (or their heirs) will only 'unstore' towards an auction or the city dump: When keeping your data becomes a hassle, don't just blindly spend more and more effort hang on to it without first sorting out the junk from it.

      You'll end up richer, both emotionally and financially.

    22. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      And the next time you power on that old equipment and it's antique power supply surges and fries the motherboard in it, you will do what?

      Moving your data to the latest media/format continually is the only way to ensure you will have access to it.

    23. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      So I take it you'll definitely need to borrow my truck when it comes time to move out of your mom's basement?

    24. Re:Not the media that's the problem by toddestan · · Score: 1

      I wouldn't worry about USB either, about the only thing that I can see as less likely to go away would be Ethernet. I'd be leary of lesser used connectors like eSata or Firewire, as I can see both of those in the "long gone and mostly forgotten" category in 30 years.

    25. Re:Not the media that's the problem by rantingkitten · · Score: 1

      The disks will be readable but you'll have no mechanical or logical way to read what's on them.

      Come on, it's not really that hard. I can walk into any thrift store around here and buy an 8 track player if I really wanted to play old 8 track tapes. My father still has some reel-to-reel tapes, even. If he really wanted, it wouldn't be that hard for him to get a reel-to-reel player capable of reading the media. I've seen those in thrift stores too, and plenty of electronics nerds or audiophiles have this stuff around for no reason other than they like having it.

      The point is that even today you can easily get your hands on equipment for media that nobody has used in decades. If you couldn't find one in a thrift store, a quick ad on Craigslist would turn up a few people willing to let you use their equipment, especially if you're willing to pay.

      In fact, you can even get ancient computers fairly easily. Not long ago I stumbled across one of these babies for sale, a TI-99 computer. First computer I had, back when I was about five or six. Want to read those old solid-state catridges they used? The computer cost all of seven dollars or so. Computers with 5.25" drives are everywhere as well. It may have been foolish, twenty years ago, to rely on those disks for long-term backup, but certainly not for lack of equipment to read it.

      The longevity of the media is certainly more important than whether or not "computers of the future" will be able to read it. It's just not that hard to find old equipment, and if it's that important to you, you'll shell out the ten bucks to get an old computer. Who cares if the media is "standard" later?

      See, while I'm sure I could find a reel-to-reel player for my father's old tapes, I doubt the tapes themselves are reliable. That's why media is more important here.


      Of course, if I really cared, I'd just back up to a hard drive or two, and repeat the process in another three years. An hour or two every couple of years to keep a fresh copy around isn't that much of a sacrifice. :P

      --
      mirrorshades radio -- darkwave, industrial, futurepop, ebm.
    26. Re:Not the media that's the problem by jwilkins13 · · Score: 1

      Still not getting it. The immediate question was about media. Part of my response was about readers. You don't have any response here about being able to read the SOFTWARE and FILE FORMATS on the media. Reel-to-reel, that's nice. TI-99, hokay. How about something that can read Word 95 files in 30 years? Zywrite? Wordstar? Alchemy databases? AutoCAD 1.0? dBASE III? Etc. It's NOT the longevity, it's the ability to read what's on them. That's hardware AND software.

    27. Re:Not the media that's the problem by arminw · · Score: 1

      ...old hard drives seem to keep their data for a long time....

      I have a mac SE/30 from 1989 and a Mac Color Classic from 1993 and they both still boot fine and all their programs and data files seem to be ok as well. I wonder if modern hard drives, because of their much higher data density also will keep their data readable. What if anything deteriorates in a HD just sitting in a box or an unused computer? Keeping an old computer with one or more hard drives could make those old files still accessible.

      --
      All theory is gray
    28. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      Here's the thing. Flash drives will *probably* last long enough. I wouldn't at all be surprised if they were still readable in 20 or 30 years. But a) what's the odds of your current WinUx or iHoloTablet having a usb connector in 30 years?

      I would say "Fair". Considering that USB 3.0 is coming out soon, backward compatible and same plug and all, I guarantee that it will be on devices 15 years from now.

      Considering that the serial port (RS-232) is from the 1960s, on many computers still like some netbooks (yuck), I think this fear is overblown.

    29. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      Perhaps he should buy a couple of cheap netbooks with ports for whatever he is storing his data on too. I'd bet that in 50 years there will still be something that will be able to connect to them using ethernet or wireless.

    30. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      That sounds good now, but what happens when you need to read something off your Commodore 64 in the future and a rat has eaten all the internal wiring or a flood destroyed the computer?

    31. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      That is strangely very comforting...

      Unless your place catches on fire or gets hit by a natural disaster of best suitability for your geographic region.

      Then for each machine I lost, a decade of depression.

    32. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      That sounds good now, but what happens when you need to read something off your Commodore 64 in the future and a rat has eaten all the internal wiring or a flood destroyed the computer?

      Irrelevant. ANY method you pick is going to be useless if it gets destroyed - that much should be obvious!

    33. Re:Not the media that's the problem by rantingkitten · · Score: 1

      Sure, but you're talking about the format now, not the actual media (disk, tape, CD, whatever) or equipment you'd need (disk drive, tape drive, CD drive, etc).

      My point was really about the people crying foul against relying on something like an external USB drive, or an SD card reader, because "there might not be USB in the future!" or other nonsense. Maybe there won't be USB in the future, but who cares -- finding old hardware is not difficult.

      To tackle the format issue, as long as you're backing up your files, why not back up the software you'd need to read it as well? Or create virtual machines and back those up while you're at it. It's not a perfect solution but it does narrow the problem significantly.

      To deal with your example, if I wanted to read old Word 95 files, it's not hard to find rogue copies of Win95 floating around. I've got a VM for one just for fun. I could either give those files to the VM, or install 95 and Word95 on the ancient 486 I dredged out of storage or grabbed at a thrift store, or whatever. There are always options, and if your stuff is really that important to you, there's no excuse these days for not having a few VMs around, loaded with the appropriate software needed.

      And before the hue and cry of "How will we run the virtual machines!", you've got the option of loading them on old physical hardware (using the installation media you backed up as well...), but that probably won't be necessary. I don't see virtualization going away.

      --
      mirrorshades radio -- darkwave, industrial, futurepop, ebm.
    34. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      im sorry, did you say floppy disk and resilient in the same sentence? back when i actually used floppies i found at least 3/4 of them went bad pretty fast. a few lasted. but by no means a decent amount. also it would be easier to buy a small hdd than to locate a floppy drive and a disc, and a motherboard with a floppy port in 10 years? its hard finding those now

    35. Re:Not the media that's the problem by Anonymous Coward · · Score: 0

      I might point out that while everyone is crowing that USB or SATA or what have you might not be around in 30 years. Those of you running a computer more than a couple of years old ( is it possible?) look on the back: you'll see a host of "legacy" ports - e.g. PS/2, serial, perhaps even parallel. They get included because they're still sometimes useful and the chips and interface are so darn cheap, why not?
      I think USB will be around for a while, even if it's only as a "legacy" port.

  14. Wrong question by the+real+chahn · · Score: 4, Informative

    It's impossible for guarantee 100% storage integrity, just like it's impossible to guarantee 100% uptime. What you want to ask is what risk of data loss you are willing to take.

    This page compares some of the options in terms of Mean Time To Data Loss (MTTDL). For the amount of space you're looking at (~500gb), a three-way mirror is probably sufficient to last for your lifetime.

    But there's always the risk of fat-fingering "rm -rf" or having the building catch fire, so maybe you want to have two synchronized sets of mirrors, stored in different physical locations. Only you can decide if that's too paranoid for you (or not paranoid enough).

  15. Simple solution by johnw · · Score: 1

    Paper tape - accept no substitute.

    Apart from anything else, the standing waves you can get as it goes through the reader are alone enough to justify it.

    1. Re:Simple solution by confused+one · · Score: 1

      paper tape is too easy to damage. It should be stainless steel ribbon, punched to the same standard.

    2. Re:Simple solution by Dark$ide · · Score: 0, Redundant

      I prefer punch cards, but you've got to watch out for those pregnant chads.

      --

      Sigs. We don't need no steenking sigs.

  16. Jesus saves, but Buddha makes incremental backups. by n1hilist · · Score: 2, Informative

    But seriously, I've had the same (but growing) data set in my /home for over 15 years, and going. I find the easiest way is to just keep it on my drive, and have a few frequently updated copies on external media (optical or solid or dirve) and to keep it on another PC too, disk space is so goddamn cheap. I also have a large music collection, and instead of wasting time backing it up onto optical media, I just keep it on both my notebook and PC, its unlikely both will fail at the same time, and incase of a robbery, I can also archive it at work.

    Don't expect any form of media to last forever, it's multiple, frequently updated copies that will ensure your data lasts forever.

    Also, if you have friends and family you can trust, make a copy for them to keep for you, off-site backup is also important.

    Obviously this all depends on how important and/or private the data is.

    my 0.2

  17. Gmail by Opr33Opr33 · · Score: 3, Interesting

    As the tag implies, Gmail is your friend. 7 gigs per account, searchable, accessible from any connected computer, free, and if in the future, google starts to decline, you can transfer to their replacement.

    1. Re:Gmail by Seraphim_72 · · Score: 1

      And indexed and searched by people that are not your friends. Sorry, I honestly cant believe that anyone uses Google mail for anything serious.

      --
      Slashdot, where armchair scientists get shouted down and armchair theologians get modded up.
    2. Re:Gmail by RJFerret · · Score: 1

      As the tag implies, Gmail is your friend. 7 gigs per account...

      Unless you lose your gmail account, I know someone who did, now he backs up his google calendar weekly among other things.

  18. Hardware solution by VinylPusher · · Score: 0

    I'm currently evaluating the cost-effectiveness of a DroBo (http://www.drobo.com/) for archival VS ease-of-access storage. It's not an inexpensive item, exacerbated by its storage method which gives you 2.7TB actual storage when fed with 4x 1TB drives.

    However, the unit is far more resilient than any conventional RAID solution.

    Connectivity is via Firewire or USB, with ethernet via an add-on. I'm more interested in this than any other long-term archival method.

    I will feed it with some 640GB drives, with a set of spares taken from different vendors. Job done.

    1. Re:Hardware solution by Shawn+Parr · · Score: 1

      I've just put a Drobo into use as well. My usage is two tiered:

      1. Time Machine Backup for all the machines in our house
      2. Archival of documents/media/stuff that we don't need immediately but may someday want

      In order to make #1 to happen I connected Drobo to my existing Airport Base Station. Time Machine is happy, and functioning on all machines.

      The cool factor with Drobo is that it gives you a lot of convenience of a RAID system without the administration issues. It allows for drive failure, it also supposedly allows for bit rot. It has a nice LED meter on the front so I can tell capacity at a glance, and it has nice status LEDs for each drive in the system. It also excels over typical RAID systems as I can add/replace drives ad-hoc, and I don't have to worry about whether my drives are matched capacity or not. Sure with some mis-matches in capacity some space is wasted, but when I add/replace a drive that space auto-magically could come back to being useful.

      Of course it also doesn't have some advantages traditional RAID systems have. It's performance is quite a bit less than even cheap RAID5 systems - on firewire it gives me pretty close to 30MB/s average. I also can't tweak the stripe size and whatnot to squeeze performance for my specific needs.

      Of course all of that doesn't matter for how I'm using it, and for how it sounds like the submitter is. The ease of use factor really wins out. And the concern about drive failure becomes greatly lessened as when a drive fails just replace it with a larger one for the same or less cost. Now you have your full redundancy still and more capacity.

      And in some cases, if you are below a storage threshold, when a drive fails, it can rebuild the RAID across the existing drives so that you could suffer another drive failure while waiting for the replacement drives to arrive. That is pretty slick. Of course someone hitting the Drobo with a lot of data probably would be above that threshold.

    2. Re:Hardware solution by ColdWetDog · · Score: 1

      Please hand in your Geek card now. REAL nerds don't Drobo. That's for the Rest of Them.

      Seriously, why not a bog standard RAID box that you buy or make? The Drobo's use an unknown, proprietary algorithm. They are a small company that could well go tits up.

      I actually looked at them (just looking, didn't inhale) and aside from the fact you did it their way or no way and the cost, there were too many forum posts with various problems. I ended up with a Netgear ReadyNas because I was too lazy to roll my own. With the newer firmware, it's been a bit slow but well behaved. The fact that it's a real NAS rather than something hanging off a firewire port makes it even more useful.

      --
      Faster! Faster! Faster would be better!
    3. Re:Hardware solution by dimension6 · · Score: 1

      Yep, second the ReadyNas. I bought a diskless Duo (available here in Japan), and threw two 1TB WD Greenpower drives (quiet and power-saving; though spin-down isn't supported yet due to some bug apparently attributed to Western Digital). If this one starts to fill up, I'll go for the NV+ in a heartbeat.

      Generally, I keep all my archived files in a folder on the NAS (it's set up with Infrant's RAID-X, which is roughly equivalent to RAID-5 with expandability), which provides me with a decent level of redundancy. My main machine is backed up on a separate external drive. Honestly, my files aren't important enough to pay for some offsite storage, but keeping the archived files bundled with my active files ensures their integrity and allows me to access them whenever.

  19. An archive is not a long-term backup by ewilts · · Score: 5, Insightful

    There is absolutely nothing that you can put away for decades and expect to be useful. Your requirements are not simple - they'll actually very, very hard to meet, even if you want to throw a lot of money at the problem.

    You don't know that a jpeg, for example, will be readable in 30 years. The format may be so deprecated that there might not even be a viewer available. Like my old Microsoft Works 4.0 documents - although I have the data, I have nothing that can read them unless I want to spin up an old Windows image, assuming that I can generate a virtualized environment that can support an old Windows (Windows XP probably won't even boot on any PC being produced 30 years from now). And some of that data is only a few years old, not decades old.

    You should store not only the data, but also the applications that created the data. And the computer you need to run those applications. And backups of those. And then every few years, pull it all back and validate it and update as required.

    You may have only 500GB now, but 10 years from now that will be 5TB. And then you need a way to actually be able to find something you added to your "archive".

    I deal with this at work regularly. An archive is not a backup that you keep for a long time. It's much, much more than that. Once you start thinking about all of the issues that come up, you'll see that the media is the least of your problems.

    --
    .../Ed
    1. Re:An archive is not a long-term backup by Mistah+Blue · · Score: 1

      One way to keep those applications around is to instantiate them on a virtual machine, and archive that VM along with the data and the executable for the VM. At least for the moment - VMware VMs are runnable by later versions of the product. This would of course be something to keep track of going forward.

    2. Re:An archive is not a long-term backup by Crookdotter · · Score: 1

      I don't think this is as big a problem as you are suggesting. Yes, while jpg may not be readable in 30 years by anything of that day, emulation will still be possible. I'd imagine that every system will be part of a grand emulator in 30 years

      While I can't get my hands on a decent 48k speccy easily anymore, I can run any emualted code I like through a PC. XP (and all associated software) is so massive in the world of computers that I think at least some people will still run it for fun in 30 years, if not in reality then on virtual systems.

      It wouldn't be hard to store a disc image of XP with associated software along with your files, and if there's not an NTFS PC emulator in 30 years I'll eat my solar powered holographic hat.

    3. Re:An archive is not a long-term backup by Anonymous Coward · · Score: 0

      When I first heard of Works, I had sense enough to NEVER use it for data. It was obvious when it came out that it would ALWAYS be a dead end. You need to convert them now or just forget them them and move on.

    4. Re:An archive is not a long-term backup by eean · · Score: 4, Insightful

      I think we do know that a JPEG will be readable in 30 years. Formats that have been around for like 10-20 years like JPEG are going to be here for a long while longer; I'd say until the end of civilization at a minimum (and even then, it wouldn't be hard for people to figure out the format). The worse case is that in future generations only a librarian or data archaeologist would have the tool to open it. Given the open source nature of JPEG, more likely you'll just download a JPEG viewer.

      MS Work 4.0 documents is completely different. There was always only one implementation, it wasn't open source, it wasn't a documented standard, and the life span of the format was small to tiny.

    5. Re:An archive is not a long-term backup by eean · · Score: 1

      The source for JPEG viewers is open. You don't need to run emulated code, there'll be native readers.

    6. Re:An archive is not a long-term backup by Anonymous Coward · · Score: 0

      Not only to keep track of, but to redo periodically.

      Or, at least, re-read. That is, periodically attempt to run those applications (in whatever mode, native or VM or whatever's out there 2-3 years down the road) and see if those applications can read (and, perhaps, recopy onto yet another medium, and then verify the new copy) your data.

      It's not clear if the OP's requirement is for a static 500GB, or as one commenter inferred, that the actual requirement is for an accumulation of 500GB per year. If the latter, then re-archiving (including re-indexing) the data annually or even quarterly if feasible, might be viable. This would include rechecking the usability of the archiving applications, etc.

      Yes, a very interesting problem to consider...

    7. Re:An archive is not a long-term backup by maxume · · Score: 1

      Just build a VM and install the last working version of VMware in it.

      It will only really be complicated after a few generations.

      --
      Nerd rage is the funniest rage.
    8. Re:An archive is not a long-term backup by jimicus · · Score: 1

      The whole point of the GP post is there may not be.

      If, in the next 5 years, some alternate format offering real benefits came to be and was widely implemented in digital cameras and other devices that use JPEG, it's likely that as a proportion of the number of images stored, usage of JPEG drops. Sooner or later it'll drop so low that it's not worth maintaining the code to read them - and even if it does get maintained, who knows if bugs have been introduced that can't easily be tested for because not enough people have a large archive of JPEGs to test against.

      (To be fair, JPEG's a bit of an extreme example, but I'd be happy to bet against Word .doc files being easy to read in 20 years time).

    9. Re:An archive is not a long-term backup by nine-times · · Score: 4, Insightful

      I think you're exaggerating the problem a bit. Formats like GIF, JPEG, and ODF will most likely be readable somehow in 30 years. They may not be the format of choice, but we have open source readers for those things, so for as long as lots of people have data in those formats, someone will be maintaining viewers that allow reading them and probably converting them to newer formats. Besides, it's not clear to me that we're going to come up with much better compression methods for static images, or that we really need to bother coming up with much better compression methods for static images, which means it isn't that unlikely we'll still be using JPEG in 30 years. I'm not saying it's a lock or anything, but it's not *that* unlikely.

      Now, with a format like ODF, if adoption isn't bigger before something new comes along, you might have a hard time reading that just because of the relative obscurity of the format (which is a problem JPEG doesn't have). In that case, it will probably depend entirely whether enough people have enough valuable information in ODF that some developers somewhere think it's worth writing a viewer.

      Yes, ideally emulation would be available for every obsolete platform, and we'd all keep VM images of all our old operating systems. We'd all keep all of our old applications to install on those images, and VM software would always be backwards-compatible meaning that we'd never lose anything. I'd love to know that someone somewhere is working on that, if only for historical preservation. However, for the individual who might have limited resources, it probably won't be necessary. If it ever becomes necessary for that to happen for most people, someone will be able to make a lot of money selling a solution.

      In most cases I'd say the best bet is to stick to open formats, keep copies on multiple different media, and continually migrate to new media. So, for example, back everything up to a hard drive and create checksums for every file, and then burn multiple copies to DVD. In 3 years, pull them all out, check all the checksums for corruption, and copy known-good copies (and checksums)to your brand new 5TB hard drive, and burn a couple BluRay discs. In another 5 years, check the checksums again, get known-good copies, and copy them to your 50 TB SSD and burn a couple copies into your super-ultra-cool whatchamakallit.

    10. Re:An archive is not a long-term backup by sam0737 · · Score: 1

      Text/Publication is fine. We have Plain Text, HTML, PostScript that we used for decade, and it's reason to assume that it could last for at least 10 more years. Or we can rasterize it. Or read it with my eyes, unless you are saying Unicode will die faster than them.

      Graphics is also ok. JPEG, GIF, BMP, PNG, also lasted for decade, it's believable they will make it to the next decade too! (After all, a JPEG reader shouldn't be that hard to code)

      If they could live for 10-15 years...I have no objection to reconvert every 15 years.

      Audio. MP3s and OGG should be fine for a while. I would say it can be assumed that player is still available in next 5-10 years. RAW WAV isn't taking many spaces.

      Last thing is Video. This is holy shoot! I am even having problem to deal with tons of CODECs in a modern system. And there are WMVs, MP4s, MOVs, Flashes, RealMedias, last but not least the Ogg Theora. I am not even confident that these are readable in next year! And in fact, this is preventing me from taking more video clips from my camcoder as I am so frustrated everytime to think how I am going to store it. RAW is bad, I don't have a big harddisk. Especially for High Def, size is an issue even when compressed!

      I just hope Ogg Theora can get more mature, or someone would open source some CODEC so that it will be more popular and a workable instance can keep spining around the world.

    11. Re:An archive is not a long-term backup by ewilts · · Score: 1

      The OP asked for an archive that would span decades. You're referring things like HTML that haven't been around for 2 decades, and let's not even get into the discussion as to how long specific versions have been out - of the 22 elements in the original public specification, only 13 still exist.

      You may think VMs are the answer, but they're not. Hardware platforms change. You may not have an x86 VM emulator in 20 years. There aren't good enough emulators for the hardware that was around 20 years ago.

      Let history be your lesson. Look at what was out 20 years ago that you can do useful stuff with today. Why should the future be so much different?

      JPEGs in 10 years - probably. 30 years? probably not. GIFs, less likely.

      Word processing documents? Most formats will not survive.

      PostScript? Unlikely to survive - how many people really know how to code in PostScript? I've had professional developers wonder why they send HTML directly to a printer and scream bloody murder when I politely tell them that printers don't do HTML (and seriously, printing HTML from Outlook is not the same thing!).

      Pretending a backup is an archive for just data for 5-10 years isn't that hard. Maintaining applications is even harder. Extending it to "decades" is nearly impossible for the average person to accomplish.

      Scrap all attempts at native documents and save exclusively in something like PDF or JPEG or TIFF. Yeah, you might have something useful. But you'll lose a lot of intelligence out of the data set, and frankly, the vast majority of users won't do it. They'll save natively and call it a day.

      --
      .../Ed
    12. Re:An archive is not a long-term backup by synthespian · · Score: 1

      Wait, a jpeg will be readable because we have access to the standard . So it's a matter of having the standard and then writing the software.

      http://en.wikipedia.org/wiki/JPEG

      The problem is all those beloved Microsoft and other vendor proprietary formats, which they only know how what the specs are. Some time ago I met somebody who was responsible for archiving a shitload of financial statements reports (required by law in his business to be kept around for decades) and they were spending serious money trying recover files from another era, such as WordPerfect files.

      --
      Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
    13. Re:An archive is not a long-term backup by synthespian · · Score: 1

      Usage of another format will probably arise and it can skyrocket up to the wazoo - it doesn't matter, because as long as people stick to standards - like jpeg, jpeg2000, png, etc - people will remain able to read those file formats for EONS to come because we know the MATH required to do it.

      --
      Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
    14. Re:An archive is not a long-term backup by maxume · · Score: 1

      When things get to the point that Open Office starts culling formats, it is more likely than not that someone in the data recovery field (or some other concerned individual) will build a Linux VM that supports the old version that can deal with those formats.

      Based on that line of reasoning, it might not be easy or cheap to access those formats( http://wiki.services.openoffice.org/wiki/Documentation/OOo3_User_Guides/Getting_Started/File_formats ), but it will almost certainly be possible.

      On the other hand, staying as close to text as possible and using widespread/open formats has present day advantages. (html and such may die, but the idea that there will be no way to look at the file is just silly; somewhat amusingly, a good strategy probably involves rendering svg to a binary format like png, and dumping binary databases to something like csv (if size allows))

      --
      Nerd rage is the funniest rage.
    15. Re:An archive is not a long-term backup by vedant_lath · · Score: 1

      About file formats not being readable after a few years, if a file format is mainstream, you can be almost certain that there will be converters or tools to migrate files in the old file format to the new file format.

    16. Re:An archive is not a long-term backup by ewilts · · Score: 1

      The whole point of the GP post is there may not be.

      If, in the next 5 years, some alternate format offering real benefits came to be and was widely implemented in digital cameras and other devices that use JPEG, it's likely that as a proportion of the number of images stored, usage of JPEG drops. Sooner or later it'll drop so low that it's not worth maintaining the code to read them - and even if it does get maintained, who knows if bugs have been introduced that can't easily be tested for because not enough people have a large archive of JPEGs to test against.

      (To be fair, JPEG's a bit of an extreme example, but I'd be happy to bet against Word .doc files being easy to read in 20 years time).

      Word isn't easy to read now! :-).

      As for JPEGs, it's not that extreme. HD Photo could take off soon, and likely will. From February, 2008:

      "The Joint Photographic Experts Group decided earlier this week to officially endorse the Microsoft HD Photo file format as the eventual heir apparent to the JPEG standard. In order to be adopted, the format has lost its vendor-specific name and other Microsoft proprietary controls to ensure universal compatibility. The new format will hence be known as JPEG XR whereby XR stands for eXtended Range.

      Microsoft believes that wide-spread use of this file format could begin in about a year."

      full article

      --
      .../Ed
    17. Re:An archive is not a long-term backup by IvyKing · · Score: 1

      When things get to the point that Open Office starts culling formats,

      Compared to StarOffice 6, they already have. SO6 could import the likes of Lotus Manuscript (which is probably still better than the latest version of M$-Weird for handling long documents, Ami-Pro and a bunch of stuff from the 1980's.

    18. Re:An archive is not a long-term backup by maxume · · Score: 1

      I would guess that it is a licensing issue more than it is an explicit decision to stop supporting the code (but I have no idea).

      I guess someone concerned about future format conversions would add Star Office to their VMs.

      --
      Nerd rage is the funniest rage.
    19. Re:An archive is not a long-term backup by jelle · · Score: 1

      iirc, pdf is an open standard http://en.wikipedia.org/wiki/Portable_Document_Format

      And luckily, there exists software such as 'pdfcreator' (on sourceforce.net) that let you print from any windows program into a pdf file.

      While you can, install pdfcreator on a computer that can still read those 'ms works' documents, and convert them to pdf.

      http://sourceforge.net/projects/pdfcreator/

      If you can't be bothered to do that for your old documents, then you most likely don't need to go out of your way to store them in any kind of super-safe way either, burning them to a dvd will do and when the dvd isn't readable anymore the garbage can will do fine...

      --
      --- Hindsight is 20/20, but walking backwards is not the answer.
    20. Re:An archive is not a long-term backup by IvyKing · · Score: 1

      You're probably right about the licensing issues. Having the appropriate version of StarOffice would be very useful to someone in need of conversions. The extra import filters were one of Sun's selling points for StarOffice over OpenOffice.

    21. Re:An archive is not a long-term backup by Reziac · · Score: 1

      And hardware backups. Some of the very old hardware is fine if it's powered up regularly, but tends to die if stored unpowered.

      I've seen that so often with old-fashioned I/O cards (especially of the VLB type) that frankly I wouldn't trust any of 'em to last in storage at all. That means for old IDE HDs, find a motherboard with onboard connectors; don't rely on that ISA or VLB I/O card you've stashed to match the HD. (Some very old HDs are not recognised when hooked to post-486-era IDE channels, so this is a real issue if you need to recover that data.)

      Some old-type hard drives have a similar issue: Conner HDs were fine so long as they were in regular use, but 6+ months in storage and they invariably lost the ability to boot (fixable with FDisk), and some also lost all their files, or at least anything that could be ID'd as a partition table and/or FAT.

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    22. Re:An archive is not a long-term backup by shotgunefx · · Score: 1

      It's a good point, though I think JPEG, or at least code to read it will be around in 30yrs.

      I've recently hit this myself. I've been going through everything since the late 1980s and archiving it. Every floppy, every harddisk, everthing.

      First PC was a tandy 1000 and I had a lot of drawings I did, PNT, files, nothing reads those. I also had a bunch of .art files, from a freeware long dead art program (VGA Art Studio) which supported no other format. Luckily, the latter install had some (partially wrong) info on the format, so I had to in both cases reverse engineer the formats and write some perl scripts to convert them to BMP. Luckily, they weren't too complex and it was an interesting process, but it also got me thinking too, I've got do wrap all this stuff up now because 10 yrs from now might be too late.

      I also notice that USB enclosures do not like old IDE drives. I still have PCs with a PATA interface so that wasn't a big deal.

      I was surprised that most of my floppies (hundreds) were fine (~95%)and they weren't well stored.

      All my HDDs were good except my 386 laptop which crashed on boot.

      So basically I made images of everything. Carbonite for it all, two 16g flashdrives (one offsite) for all the important stuff as well as an external HDD and DVD-R archives every few months. At the moment, I can't see how I can improve on it much.

      I have a feeling the current generation of hardware will not hold up as well as the old iron.

      --

      -William Shatner can be neither created nor destroyed.
    23. Re:An archive is not a long-term backup by Anonymous Coward · · Score: 0

      PNGs are already starting to eat away at JPGs.

      I've been doing web design since browser backgrounds were always grey. I always used to use GIFs and JPGs based on scenario. When PNGs came on the scene I avoided them because IE wasn't compatible then. Now GIFs are sorta kinda deprecated, and PNG is starting to eat away at JPGs. I wouldn't at all be surprised if they fell into complete disuse in a few decades.

    24. Re:An archive is not a long-term backup by Anonymous Coward · · Score: 0

      actually, there's a decent chance that jpeg will go away. jpeg is a compression method used to make images smaller because transfer speeds of the day weren't so fast you didn't need compression.

      now and into the next 5 years or so transfer speeds and processor speeds and everything else will be such that compression simply won't be necessary.

      this alone won't do away with Jpeg but if we also come up with a clearer image storing method with better colors, or perhaps one that doesn't give you large rectangles of 'black' in the middle of the your new baby's mop of hair when you print out a picture. jpeg will start to loose favor.

      in short, we no longer need jpeg compression, so we'll only keep it for as long as it's not enough of a hindrance to bother us.

      but then again there's also the likely hood that the ability to read and write jpegs will still be included in the appropriate applications simply because we have old jpegs and the resources to do so is trivial.

      the case for even a staple like jpeg isn't so clear

    25. Re:An archive is not a long-term backup by ukemike · · Score: 1

      There is absolutely nothing that you can put away for decades and expect to be useful. Your requirements are not simple - they'll actually very, very hard to meet, even if you want to throw a lot of money at the problem.

      Wrong. He should pick out the most important photos, and have them printed to archival paper. He could then print all the the important documents to paper.

      I have tens of thousands of photos on my hard drives, but most of them are near duplicates, bad exposures, or have some other flaw. I have a few hundred really good ones. I've printed them and they are in a drawer with all my pre-digital camera photo prints. Easy. Paper is still the best archival tech we have. It isn't a difficult solution, but it does require a bit of prioritizing.

      --
      -- QED
    26. Re:An archive is not a long-term backup by Anonymous Coward · · Score: 0

      You may think VMs are the answer, but they're not. Hardware platforms change.

      Which is why it's got to be VM + Emulation.

      You may not have an x86 VM emulator in 20 years.

      Why not? We have at least one reasonably portable x86 implementation in C, in Qemu.

      There aren't good enough emulators for the hardware that was around 20 years ago.

      20 years ago would be 1988, right? So we'd probably be using DOS, right?

      We've got DOSBox, and FreeDOS, and that x86 implementation in Qemu.

      PostScript? Unlikely to survive - how many people really know how to code in PostScript?

      Doesn't matter -- virtually every Unix printing implementation uses PostScript at some point. High-end printers, modern and decades-old, implement PostScript, some over the network -- I've had one which I can TFTP a PostScript file to, and it'll print. There may not be great tools for editing it, but it's pretty much guaranteed to at least be visible.

      Open standards will survive.

    27. Re:An archive is not a long-term backup by AndyCanfield · · Score: 1

      He said "Windows XP probably won't even boot on any PC being produced 30 years from now." The last time I installed MS-DOS on a computer was maybe 2006. It runs. I have an image of a CP/M-86 boot disk that still runs on the latest Intel hardware, and even runs on a Qemu virtual machine. Intel is incredibly good at upward compatability, unlike everyone else I could name.

      JPEG's can always be read by archiving a viewer and taking a screen shot of whatever it shows. MP3's can be read with a speaker and a microphone.

      Yes, you'll have to archive the software as well as the data. Sounds like you have to read and write it every few years to keep the storage live. But it's doable, and not too expensive

    28. Re:An archive is not a long-term backup by synthespian · · Score: 1

      Word is a proprietary format.

      You'd be wise to convert all your Word stuff to OpenOffice format.

      --
      Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
    29. Re:An archive is not a long-term backup by CAIMLAS · · Score: 1

      The only possible choice, I think, would be redundant solid state devices. We're getting to the point where they are both cheap enough and will hold enough data to last that long. There is quite a lot of computing equipment out there from 30 years ago floating around out there, still functional and usable. Usually, the storage - platter or tape based - is what causes it to lose its functionality.

      Yes, it'll take a bit of money to store 500Gb, but 100Gb isn't such a far stretch and wouldn't be too cumbersome. You could get two solid-state computing devices of some sort (something like the Eee or EeePC due to the Eee's battery, maybe) and a handful of 32Gb USB flash drives. We'd still have 120V A/C 30 years from now, and if not, there'll be adapters.

      The devices are likely to last - at least one of them - for that long if they're not used. Ethernet is likely to still be around (due to its omnipresence now), and if not it'll be easily adapted to. The same can be said for the various data formats in common use.

      There's also the longevity of the storage. It'll probably still work in that time if it's taken care of. I, and several others I know, have CF USB drives about 10 years old which have undergone all sorts of damage and they still work just fine. Hell, I've got quite a lot of 20 and some 30-year-old equipment which still works fine.

      If I were going the long-term storage route, this is how I'd go. If it looks like the interfacing format will no longer be available, then you've got the option of converting beforehand. I see absolutely no reason why such an approach wouldn't work for at least 10 years at a go with reasonable certainty of data recovery.

      --
      ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
  20. Re:Quality DVDs, archival storage, repeated backup by zappepcs · · Score: 1

    This is really close to what I do. Running a home network, all the data is dumped to a RAID-1 and monthly copied to CD/DVD, which are kept in suitable storage space. This gives 3 levels of recovery:
    1 - local hard drives have the data - manually done
    2 - RAID on the network has a copy - scripted backups
    3 - CD/DVD has a copy - manually initiated scripted backup

    If I was truly worried, I'd make two CD/DVD copies and store one in Iron Mountain or something similar.

    You can substitute USB drives for one of the CD/DVD copies if you like. The only answer to fragility of storage mediums is to make multiple copies and refresh those copies often enough that the inevitable failure is mitigated. I personally choose to use TAR and GZIP for now as I trust these formats will be usable in the months ahead. If they become outdated at some point, I can change that going forward and save a LIVE-CD with those utilities on it with the older data.

    You can encrypt the CD/DVD copies easily enough for security, but long term you might want to make sure you write that password down :-)

  21. Sorry, it's insoluble. by dpbsmith · · Score: 4, Interesting

    I believe this to be a serious problem with no good solution currently. That's the truth. You'll get lots of dismissive posts saying it's no big deal, but it is.

    Forget media integrity. The problem is technology drift. Everyone thinks "ubiquitous" (as in every computer has a USB port) is the same as "eternal," and it isn't. Twenty years from now, your USB thumb drives and CD-R's may have their data physically intact, but only museums will have equipment that can read them.

    It is a fantasy to suppose that you can successfully perform Sisyphus-like task of systematically recopying your data to new media and formats. The proof of this is the innumerable stories of big, well-funded organizations that have neglected to do this. If the NASAs of the world keep finding reels of tape with important data on it that can't be read due to technology skew, what makes you think that you can do much better?

    (What makes me bitter is failure of vendors to give adequate warning when software updates remove the capabilities of reading file formats that were formerly supported. I once verified that my new Mac could read my old MFS diskettes, and did not notice when a software update to the OS removed that capability. Microsoft was less than forthcoming when they removed the built-in ability of Excel to read Multiplan files).

    1. Re:Sorry, it's insoluble. by bhtooefr · · Score: 1

      Scanners will still be around, in some form.

      Paperdisk.

      Then, file formats will be your only problem. Stick with open formats, and it'll be able to be figured out.

    2. Re:Sorry, it's insoluble. by puppyfox · · Score: 1

      Let's see, Paperdisk stores 1MB per page, so he would need to store more than 500,000 pages of paper somewhere. Not all that convenient.

      --
      The cookie told me to.
    3. Re:Sorry, it's insoluble. by Anonymous Coward · · Score: 0

      I doubt we lose USBs anytime soon unless it is an upgrade port that can still use them. My new computer still has both a SERIAL and PARALLEL ports for God's sake.

    4. Re:Sorry, it's insoluble. by Leomania · · Score: 1

      Twenty years from now, your USB thumb drives and CD-R's may have their data physically intact, but only museums will have equipment that can read them.

      Nah... according to my wife, I'll almost certainly still have hardware "archived" that can read those formats. She's of course delighted that I feel the need to save humanity from a potential digital Armageddon... she didn't want to park the car in the garage, anyway.

      --
      You don't use science to show that you're right, you use science to become right.
    5. Re:Sorry, it's insoluble. by TubeSteak · · Score: 1

      Twenty years from now, your USB thumb drives and CD-R's may have their data physically intact, but only museums will have equipment that can read them.

      This type of statement always pops up in discussions about archiving, but I think it just means you haven't really thought it through.

      20 years ago, computers were expensive and scarce.
      Today, computers are cheap and plentiful.

      What I'm trying to say is that the base amount of current tech is so vastly larger that, in 20 years, finding a guy who can pull an old piece of hardware from his basement will be much much easier than someone trying to do the same thing today.

      --
      [Fuck Beta]
      o0t!
    6. Re:Sorry, it's insoluble. by ColdWetDog · · Score: 2, Insightful

      It is a fantasy to suppose that you can successfully perform Sisyphus-like task of systematically recopying your data to new media and formats.

      You raise some good points but perhaps protest too much.

      Of course, it depends on exactly WHAT form your data is in. Text files will likely be readable until Kingdom Come. Microsoft Works files seemingly get deprecated every version. It is quite likely that common graphics files (JPEG especially, TIFF probably) will be readable for quite some time. JPEG especially isn't going anywhere. Both formats are well described and if the OP had any data in these formats that was of interest to a digital archaeologist in 2300 they could probably recreate the format. But that's not what most people want to do with their data. If you want that kind of permanence, create a cult and make giant statues out of rock....

      Copying data from one format to another shouldn't be a Sisyphusian task. Assuming you don't put them on CDs or DVDs and have a metric shitload of them. The trick is to switch to the new media BEFORE the old one is "extinct". I shifted plenty of data from CDs/floppies to hard drives, I've upgraded my hard drive arrays several times. I presume that when holographic storage finally gets real, I will be able to just as the OS to copy the data to the new format.

      Keeping the old programs that read the data is a good idea and essentially a freebie. Since most of my photography is stored in a proprietary format (Nikon NEFs), I have a copy of David Coffins dcraw in various places (A neat, open source program that reads pretty much any RAW camera format out there and the basis for a number of commercial programs.

      My data based stuff that I want to keep (taxes, personal files and such) have at least one copy in .txt format. And unfortunately, I don't think that PDFs are going anywhere (may Abode burn in whatever Hell is reserved for Evil Corporations). Convert the important files to a couple of different formats. Cheap, fast. Let your grandkids sort if out. Which they likely will do by hitting whatever is the current equivalent of the "Delete" key.

      --
      Faster! Faster! Faster would be better!
    7. Re:Sorry, it's insoluble. by fermion · · Score: 1
      This is a generally insoluble. Nearly anything that is kept over a period of time will degrade, often to the point of effective destruction. The only way to stop this process to constantly monitor and minimize or repair the damage. On macroscopic physical objects this may not be so bad. The damage to a painting, the loss of frames in a movie, the fading of text in a printed document, all can be filled in by the human viewer. But what we are dealing with here is coded media, often compressed into a small space and requiring machines to decompress, decode, translate, and represent in a human readable form.

      My basic premise is that most things are not useful for the long term. For instance, I don't have my Fortran code from 1980, my Apple programs from 1985. WHat I do have has been physically shifted from one media to another to insure that I can still have access to them in I need them. For instance, the hundreds of CDs I have now will likely be thrown away, and I will only take the time to transfer a fraction of the information. Some of it may be transfered to paper form, some to whatever electronic storage form is popular in 10 years.

      To summarize the long term archive problem has never been solved completely, and maybe that is good because if it were would be drowning under a flood of old junk. Entropy prevents this. Those thing we want to keep must be maintained, or at least, periodically updated. It is possible that someone has a great novel on punch cards, but getting it to human readable form would be a great issue if the storage format was never updated.

      --
      "She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
    8. Re:Sorry, it's insoluble. by Khopesh · · Score: 1

      There is a solution, it just requires maintenance every few years. Most of us do it in some form already and just don't realize it.

      Buy a system with reliable parts, build a redundant disk array (RAID 5 if pressed for cash, RAID 1 otherwise), and put your data there. If you want to be even more reliable, also burn it to Blu-ray disks (blu-ray burners are already under $250 with disks under $10 each). Heck, if you're archiving 25gb or less, you could put it on a single blu-ray disk and merely burn fresh/updated copies of it every few years.

      Every few years, you'll do the exact same process with your data. Always maintain the system, making sure that it's got security and bugfix updates and that the hardware is still useful. Replace it every few years.

      Bonus points: make it a file server (samba/nfs) and use it for your active data, too.

      --
      Use my userscript to add story images to Slashdot. There's no going back.
    9. Re:Sorry, it's insoluble. by nine-times · · Score: 1

      Not only that, but since lots more people have usb devices today than had tape backups 20 years ago, it's more likely that there will be a market in supporting those devices to some degree. If lots of people have USB drives when they move over to the new connector, someone will figure out that it's worth money to produce a USB-to-[new thing] adapter.

      And that will probably happen in the short term, but there's still some point at which USB drives won't be readable anymore. It may take 50 years or 1,000, but it'll happen sooner or later. One question is, will anyone really care?

    10. Re:Sorry, it's insoluble. by Duckie01 · · Score: 1

      The whole point is moot actually. People are buying new computers every 3-5 years at most but want their data carriers to last for decades?

      I don't think so. Ten years from now you'll just copy your entire USB backup system to whatever you bought for half the money, taking up 1% of the new capacity and running at 100 times the speed, just like I've done for the past decades...

      I bought a 1TB external USB hd for 93 euros just 2 weeks ago... and it's gonna fit in the bag I used for an iomega zip drive over a decade ago just nicely! The zip drive cost more at the time, not even counting the 100 MB disks. The transfer rate of the usb hard drive is about 1000 times as fast. I no longer have a parallel port in my pc, and while every hardware shop in town sold at least a couple of interface cards with a parallel port back in those days.... they're actually pretty hard to come by right now.

      And while I actually do have an internal pata zip drive and my pc does have pata... I don't intend to use it because I copied over all data I want to keep to newer systems.

      Everything changes, everything stays the same.

    11. Re:Sorry, it's insoluble. by ext42fs · · Score: 1

      There is a workable solution.

      The key thing is to keep it on-line with good enough redundancy (e.g. raid1 + off-site backup). Actively keeping it on-line (replacing harddisks by bigger and more modern ones) solves the technology drift problem and adds space faster than can be filled with meaningful data (for most of us). Once the data hits the (raid1) disk it is reasonably safe because of ECC. The off-site backup is insurance against rm/fire et.al. Caveat: software, firmware, hardware all have bugs so _independent_ verification is important: tar/cpio/rsync bugs, SDRAM bit flips once every 20G, I've seen them. That's why I wrote this simple tool: http://www.frankvm.com/fsindex .

      Everything stored off-line is inaccessible and just waiting to become permanently lost.

    12. Re:Sorry, it's insoluble. by dpbsmith · · Score: 1

      "The trick is to switch to the new media BEFORE the old one is 'extinct.'" That's the part that sounds easy but is very difficult. Here are the two things that make it difficult:

      a) you need to copy all of your data to the new media, not just "the important stuff." The distribution of what's important has very "fat tails." You'll never need 99.9% of what's on those 5-1/4" CP/M floppies but when you need that 0.1% you'll really need it.

      b) You can't judge accurately when the old media is about to become extinct. It's too laborious and expensive to do the whole job, say every five years. But if you wait until you notice there's danger, you may wait too long. As noted in my post I've personally been burned twice by formats becoming extinct without my noticing.

      The proof that a) and b) are more difficult than they sound is that large organizations with people hired to take care of such things consistently fail.

    13. Re:Sorry, it's insoluble. by RAMMS+EIN · · Score: 1

      ``(What makes me bitter is failure of vendors to give adequate warning when software updates remove the capabilities of reading file formats that were formerly supported. I once verified that my new Mac could read my old MFS diskettes, and did not notice when a software update to the OS removed that capability. Microsoft was less than forthcoming when they removed the built-in ability of Excel to read Multiplan files).''

      Technology skew is certainly a problem to be reckoned with, but it's not insoluble. Almost everything I have that is important to me is in simple file formats, mostly consisting of ASCII text. The rest is in formats that can be processed by software written in ASCII text. I'm sure ASCII text is going to be around for a long time. I'm sure my software is going to be around for a long time; it's all open-source and there are plenty of copies of it. It's built on Unix APIs, which easily go back 30 years and show no signs of going away. And all this can be run in a hardware emulator if, one day, compatible APIs do disappear.

      All in all, I think technology skew is not the greatest of my worries. So long as I move my data off obsolescent hardware and onto new hardware, I don't think I have much to fear.

      What does cause me to lose data is software bugs, faulty storage media, and my own mistakes. I've lost data through faulty filesystem drivers, I've lost backups because the media went bad, and I've lost data because I accidentally destroyed it before I had it backed up. More than data, though, I have lost interest. There is a lot of data that I could still access if I wanted to, but I don't want to and can't imagine that I ever will. Finally, both of these figures are dwarfed by the amount of data I actually use, from frequently to rarely. As long as I can maintain the balance the way it is now, I feel I'm doing a good job.

      --
      Please correct me if I got my facts wrong.
    14. Re:Sorry, it's insoluble. by Anonymous Coward · · Score: 0

      The CD format has been around for 25 years, yet the number of CD readers in the world (which includes all CD-related, DVD-related, and Blu-Ray-related hardware) is still skyrocketing. CD is on course to be the second-most entrenched format ever, behind, say, paper books.

    15. Re:Sorry, it's insoluble. by triffid_98 · · Score: 1
      Cartman: [...and enters an elevator] Just send a maintenance guy to my room. I want this Nintendo hooked up to my float screen NOW!
      [the doors close]
      [Cartman's room. A maintenance guy is trying to figure out how to connect the Wii to the screen.]
      Cartman: Come on! Come on! Dude, what is taking so long! I wanna play!
      Maintenance Guy: Uhh, what kind of output does this have? This is some ancient Super-VHS output or somethin'. I can't connect it to your float screen.
      Cartman: There's gotta be some way to hook it up! It's the freakin' future!
      Maintenance Guy: It may be the future for you, but I can't hook up anything to a float screen without at least a laser-7 output.


      Forget media integrity. The problem is technology drift. Everyone thinks "ubiquitous" (as in every computer has a USB port) is the same as "eternal," and it isn't. Twenty years from now, your USB thumb drives and CD-R's may have their data physically intact, but only museums will have equipment that can read them.

    16. Re:Sorry, it's insoluble. by Anonymous Coward · · Score: 0

      For individuals, I don't think so. Organizations have far more data than an individual, and nobody who actually knows what and where it all is.

      Re the hardware drift issue, this seems to be much less serious for consumer-commodity grade gear than for commercial-specialty stuff. You can still buy 3 1/4" floppy drives that can read things written 20 years ago on 750K disks, and modern DVD-RW's can still read CDs from the 90s, whereas the funky early eighties word-processor disk formats were all toast within five years. At some point mobos will presumably start to appear that lack the floppy and ide connectors, but there will be adapters (longer for ide than floppy, no doubt).

      As for software, open source seem to handle this *a lot* better than propriety - I have some early nineties win 3.1 programs that I thought were useless due to not working on 2000/xp/vista, but somebody asked me if they worked under Linux, and they turn out to work fine under Wine (win98 bottle).

      So if you're paying attention, the likelihood of completely missing a technology jump seems rather low - even when you can't buy floppy-capable equipment from the shops any more, half of your neighbors will have some in their garages and attics.

    17. Re:Sorry, it's insoluble. by Anonymous Coward · · Score: 0

      I'm sure there will never be a demand in the future to interface old media with new technology. I mean, how likely would that be?

    18. Re:Sorry, it's insoluble. by Anonymous Coward · · Score: 0

      This overstated a bit. You can still buy today floppy drives that can read late 80s 750K floppies (just tested one), and recent motherboards still have the FDD cable socket. The key is probably to use commodity-type hardware, and be alert to migrate off anything more specialized, like Zip disks, that clearly seem to be on the way out (I did my zips to hdd&CD about 5 years ago maybe, tho the zip unit still works).

    19. Re:Sorry, it's insoluble. by Anonymous Coward · · Score: 0

      I believe this to be a serious problem with no good solution currently. That's the truth. You'll get lots of dismissive posts saying it's no big deal, but it is.

      Forget media integrity. The problem is technology drift. Everyone thinks "ubiquitous" (as in every computer has a USB port) is the same as "eternal," and it isn't. Twenty years from now, your USB thumb drives and CD-R's may have their data physically intact, but only museums will have equipment that can read them.

      It is a fantasy to suppose that you can successfully perform Sisyphus-like task of systematically recopying your data to new media and formats. The proof of this is the innumerable stories of big, well-funded organizations that have neglected to do this. If the NASAs of the world keep finding reels of tape with important data on it that can't be read due to technology skew, what makes you think that you can do much better?

      (What makes me bitter is failure of vendors to give adequate warning when software updates remove the capabilities of reading file formats that were formerly supported. I once verified that my new Mac could read my old MFS diskettes, and did not notice when a software update to the OS removed that capability. Microsoft was less than forthcoming when they removed the built-in ability of Excel to read Multiplan files).

      Yeah, and this summer Ubuntu suddenly stopped supporting sound on my 5-year-old laptop. Bye-bye Ogg, YouTube, DVDs, CDs, etc. I finally gave up trying to patch it after 30 hours. Technology drift gets everyone, I guess. My solution: keep the laptop and buy dedicated players for the other stuff. It'll be another 5 years before I come close to filling the hard drive, if it lasts that long. I guess I'm trying to exert a little drag against this whole Moore's Law/White Queen scenario (running ever faster to stay in the same place).

    20. Re:Sorry, it's insoluble. by rtechie · · Score: 1

      It is a fantasy to suppose that you can successfully perform Sisyphus-like task of systematically recopying your data to new media and formats.

      Um, no.

      I transferred data from my Commodore 64 (mostly txt files and images) on 3.5" floppies to my old XT's RLL and the MFM hard drive and from there to a SCSI hard drive on my 386 and so forth right up to the Dual-Core system I have today.

      In recent years I've got a ton of archived software and other crap, but that's not personal data. My personal data is a lot smaller and so could easily transition across these formats. This guy says he's got 500 GB of personal data so I suspect most of that must be home movies because I can't imagine much else would eat up that kid of space.

      And there are a lot bigger problems than hardware compatibility. I've got original Quicken and TurboTax files. Think you can open them in modern versions? Think again. That's WHY I have all that archived software and (increasingly) VMs to run it in.

  22. Re:Jesus saves, but Buddha makes incremental backu by n1hilist · · Score: 1

    Oh and, don't forget to use tools like Gmail drive, you can upload your stuff there, and if I recall you can store up to about 6-7GB on a gmail account nowdays, plus it's free.

  23. flash cards by juenger1701 · · Score: 1

    get flash cards make at least 2 full copies (3 is better) and do full 100% reads from all the cards at least twice a year or buy space with a major hosting company and store a copy there and have at least 1 copy on flash cards that you check regularly it's not the throw it in a drawer and forget for 10 years solution you want but that solution doesn't exist

    juenger1701

    1. Re:flash cards by TheBig1 · · Score: 1

      If you do use flash drives (or CDs / DVD, for that matter), make sure to do a 100% redundancy par2 copy on one of them. This will allow you to retrieve data using either disk, even with massive corruption.

      Cheers

    2. Re:flash cards by Anonymous Coward · · Score: 0

      I'm surprised par2 hasn't been mentioned more. Surely two identical DVDs, both par'd would provide a good few years of reliable archiving?

  24. It doesn't exist. by NReitzel · · Score: 4, Insightful

    You are the analogy of an investor who wants a high-yield, low-risk, completely liquid instrument. The term is TANSTAAFL.

    I maintain two (yes, two) USB external drives. Every couple of years, I migrate to a larger, or otherwise better medium. I use an incremental backup system (for me, cpio) that ends up keeping too much stuff, but at least I have the stuff I want if I need to get to it.

    In a decade - in my case, four decades - one can accumulate a remarkable amount of crap, along with things one truly wants to save. I have a total of about 90 gig of actual data, plus a far larger amount of music and video, which I consider more or less disposable. It is not difficult, nor expensive, to purchase another external drive and copy the data. My oldest backup is on IBM 2314 disk pack, but the data still held on that disk is also present on my current backup, a WD 160G in a USB-1 enclosure. Sometime next year, I'll go to a 500 G drive in a USB-2 enclosure.

    An important consideration is to periodically check to see that the data ostensibly held on a drive (or CD, or DVD) is actually readable. DVD/RW in particular has a tendency to get flakey over long periods of time, expecially if stored under adverse conditions (jammed in back of desk drawer, under sixteen pair of scissors, stapler, a box of pop-tarts, and four old coffee cups. I always keep my last few generations of backups, and if I find an unreadable datum, I make an effort to recover it from the previous backup.

    While it may be stating the obvious, it's a Bad Idea (TM) to wait to back up data until you have a problem. I back up all of my data every week or two, and critical data, daily, without fail. Critical data is cached as a three-generation dataset (IBMese).

    Good luck. There are no real solutions, just ways to cope.

    --

    Don't take life too seriously; it isn't permanent.

  25. Obviously the best storage method ever. by Anonymous Coward · · Score: 5, Funny

    1) Encrypt your data
    2) Tack it onto the end of some anime or porno mpg
    3) upload to kazaa/gnutella/whatever

    It'll still be circulating the net long after our grandchildren are dead.

    1. Re:Obviously the best storage method ever. by Anonymous Coward · · Score: 0

      It'll still be circulating the net long after our grandchildren are dead.

      And will continue to circulate long after the people downloading it have desktop quantum processing and are leafing through your family history.

    2. Re:Obviously the best storage method ever. by Anonymous Coward · · Score: 0

      Real Men don't make backups. They upload it via ftp and let the world mirror it.

    3. Re:Obviously the best storage method ever. by Anonymous Coward · · Score: 0

      Ah, so that's why with all those amateur porn videos I always seem to hit some corruption half way through.

    4. Re:Obviously the best storage method ever. by Anonymous Coward · · Score: 0

      Wait... makes me think of this "junk DNA" thing.

      *are* we porn? is life a big p2p infrastructure?

  26. Stone tablets by DaveLatham · · Score: 5, Funny

    Very durable. Write speed is a bit slow though...

    1. Re:Stone tablets by Anonymous Coward · · Score: 0

      Yep, just a bunch of holes for the 0s and cracks for the 1s.

    2. Re:Stone tablets by Anonymous Coward · · Score: 0

      Very durable -- not so much. You still can get data loss and you need to write it in three formats.

      http://upload.wikimedia.org/wikipedia/commons/2/23/Rosetta_Stone.JPG

    3. Re:Stone tablets by Renraku · · Score: 1

      Using a phased array of lasers, the write speed would be very fast.

      Finding an application that would read .sor (Slab o' Rock) files would be a little harder.

      --
      Job? I don't have time to get a job! Who will sit around and bitch about being broke and unemployed then?
    4. Re:Stone tablets by Lutz · · Score: 1

      I suggest leather instead. A proven concept, and won't make your back hurt that much.

    5. Re:Stone tablets by Anonymous Coward · · Score: 0

      As long as it supports ext3, it should work seamlessly with all your applications.

  27. Re:Quality DVDs, archival storage, repeated backup by b4upoo · · Score: 1

    I can't imagine a guy with 500G of personal files. It must take a month just to read the titles of his various files.
                But people are different. I'm not prone to believing in the media failure reports that we have all seen. For example I have a pile of floppy disks that are still intact after 15 plus years and I stored them like a barbarian. Hard drives also tend to last for me. And I suspect that any quality CD or DVD will last for quite a long time if handled and stored carefully. I would worry more about the PCs being unable to use ancient formats or OSs that can't cope with older stuff. In order to avoid that issue he would have to make certain to keep trying his disks every couple of years.

  28. Again, Quality DVDs by Ormy · · Score: 2, Informative

    If you buy quality DVDs and take good care of them they will quite probably last for decades, perhaps half a century. They are expected to degrade over many years but some of the CDs written back when CDs were first invented are still readable today so nobody really knows how long they might last. There is a similar problem for HDDs, while in constant use MTBFs are well established, but for a HDD that is written to and then left unpowered for many years, well again nobody really knows because we haven't observed it yet. I'd say go for both, obviously HDDs have the massive advantage that you can plug one 500GB and write all your data to it all in one go. To store that much on DVDs will take you days or weeks to write to each disc one or even two at a time. I know, I have over 1.5TB of data backed up on DVDs which number over 500 already.

    1. Re:Again, Quality DVDs by Anonymous Coward · · Score: 0

      Nope. The National Archives regularly sees failure rates of professionally produced CDs and DVDs of 10% over 5 years. They are currently using hard drives, and migrating the data every few years.

    2. Re:Again, Quality DVDs by emandres · · Score: 1

      That may be true about pressed CDs (i.e. CDs made in a factory by a process in which the pits are physically pressed into the media by a custom die), but for user-burnable media the bets are off. When you pop a CD into your drive and burn it, basically the laser causes chemicals to distort the shape of the media, which is a lot less reliable than the commercial process. As for me, I wouldn't trust a burned CD/DVD for more than 3 years in the best of conditions.

      --
      The only way to tell the difference between a hamster and a gerbil is that the hamster has more white meat.
    3. Re:Again, Quality DVDs by Onymous+Coward · · Score: 1

      I saw this recently on some Wikipedia editor's profile:

      The plural of anecdote is confirmation bias.

  29. yes it works-for equipment failure by way2trivial · · Score: 2, Insightful

    right up until you have an enviroment diasaster.
    (enviroment can be as small as a tiny fire in the power supply of that PC)

    Theft of the PC? are you covered?

    FFS it is IN THE SAME CASE!

    --
    every day http://en.wikipedia.org/wiki/Special:Random
    1. Re:yes it works-for equipment failure by Anonymous Coward · · Score: 0

      Wireless transmission into a fireproof safe bolted to the floor.

    2. Re:yes it works-for equipment failure by Anonymous Coward · · Score: 0

      with gold-plated pair of time-sensitive keys to the safe, I am sure.

    3. Re:yes it works-for equipment failure by decavolt · · Score: 1

      How are you going to get power into that fireproof safe without compromising it's integrity and thus fire protection?

  30. stone tablets by Maddog_D97 · · Score: 1

    Write the information on stone tablets. If you can't be bothered to do that, then at least use a medium that doesn't require electricity.

  31. Easy by Ctrl-Z · · Score: 1

    I believe that ridulian crystal is your best choice for long-term storage.

    --
    www.timcoleman.com is a total waste of your time. Never go there.
  32. No pain, no gain... by parrini · · Score: 1

    More than one copy in separate geographic locations is important but media diversity also is. Suppose you buy 3 500Gb HDs from the same manufacturer just to discover they have the same defect. There is no way to have a catastrophe proof backup without putting a great deal of work in it. Disciplin, automation, location diversity, backup media renewal, media diversity, everything is important.

  33. Raid 5 primary, plus Raid 1 backup by klahnako · · Score: 1

    DVDs will fail on you when you least expect it. Service providers will dissapear over the years.

    The only sure method is to maintin the backup, and incure the monthly maintnenence fee (either actual money for backup service, or consume your time maintining the backup server yourself).

    My primary machine is Raid5, so data is not lost. My backup server is Raid1 (because drives will silently fail). I use a backup service just in case the other two fail (but sensitive information can not go there).

    My experience is that a server can last almost a year without hardware failure. Same with hard drives: I have 6 drives, one or two will fail inside the year.

    That reminds me, my backupserver died last week. :(

  34. Re:Quality DVDs, archival storage, repeated backup by PacketMaster · · Score: 0

    Actually Microsoft has a free, rather nice tool that functions mostly like rsync called SyncToy. It's a native Windows application so you don't need a Windows-compiled rsync like many other tools do. It's GUI based to make is easy for the most unfamiliar of users but it is useful even to the advanced for periodic synchronization of data from Windows hosts to other drives or a network location. It has a built-in scheduler to allow it to function as as service.

    (Note: This isn't an MS add. I recently discovered this tool by accident to backup my wife's laptop and was pleasantly surprised it "just works")

    --

    Some people take their .sig way too seriously

  35. Jungle Disk by MarkKnopfler · · Score: 1

    Jungle disk frontend for s3. Easy, pay as you go and redundancy and multiple backups are taken care of by s3

  36. Re:Quality DVDs, archival storage, repeated backup by larry+bagina · · Score: 1

    If Linux, some sort of cron job running a Python script that copies /home to an external hard drive.

    why a python script?

    --
    Do you even lift?

    These aren't the 'roids you're looking for.

  37. Re:Quality DVDs, archival storage, repeated backup by Ma8thew · · Score: 2, Insightful

    Depends on your definition of 'personal' files. Video can take up a lot of space.

  38. Stop asking Slashdot about this... by Animaether · · Score: 2, Informative

    ... honestly, Slashdot - and others - have covered this time and time again. Nothing has changed. There still isn't a cheap digital storage medium that we know for sure -will- retain your data -and- be readable (in terms of media -and- the hardware to read that media) down to the very last bit for your grandchildren.

    IF and when there's a breakthrough, I'm sure Slashdot is one of the first places you'll hear about it.. but it won't be in an answer to an Ask Slashdot.

    http://it.slashdot.org/article.pl?sid=07/04/21/1257249 - Digital Media Archiving Challenges Hollywood
    http://hardware.slashdot.org/article.pl?sid=06/11/20/2036247 - Archiving Digital Data an Unsolved Problem
    http://hardware.slashdot.org/hardware/06/12/11/1714232.shtml - How To Choose Archival CD/DVD Media
    http://hardware.slashdot.org/article.pl?sid=05/06/26/218250&from=rss - Archiving Digital History at the NARA
    http://tech.slashdot.org/article.pl?sid=07/03/31/2141204 - How To Properly Archive Data On Disc Media .. and so forth and so on.

    Yes, I realize that you stated "I'm not looking to start my own national archive; I have less than 500GBs and only save things important to me". However, it doesn't really matter whether you're archiving hollywood movies, NASA records or just your own random crap. If it is important to you - important enough that you want it to be "lasting for decades if possible" - then your concerns are the same as NASA's... and they're struggling with the exact same question.

    The 'best' answer so far is one you will find in each and every single discussion on this - including this thread, so I'll just point you there:
    http://ask.slashdot.org/comments.pl?sid=1061489&cid=26102825

    You mentioned 'cheap', as otherwise all the answers saying "duuuuude, ditch the digital - go analog!" might have some validity.. take a wild guess as to what it would cost to have thousands of photos transferred to negatives/prints, or video transferred to tape/film, etc. Plus you mentioned documents.. some of those may not transfer to e.g. paper (easily) at all depending on the 'documents' in question; e.g. CAD files.

    1. Re:Stop asking Slashdot about this... by Anonymous Coward · · Score: 0

      take a wild guess as to what it would cost to have thousands of photos transferred to negatives/prints...

      Don't have to guess. Photos cost about $0.09 each to print at Wal-Mart. No wild guess: $90/1000 pictures.

  39. Hard drives fail, but rarely at the same time... by wernst · · Score: 1

    Look, I know how you feel. I'm a professional writer and journalist, and the digital files I save are pretty much the only result of my life's work, so I try to save them for the future. I'm in the same position, but I seem to have a system that works.

    Amazingly, (or sadly, depending on your opinion) I also have all of the documents and material I've ever created (starting with the disks from my Apple //e from high school in the mid 80's, to my emails and my first book's Word files from the early 90's, and everything since including about 300GB of photos) still immediately accessible and viable. It comes from the following understanding:

    Hard drives (or basically any storage medium) can and do fail, but they tend to do so in a predictable pattern, but rarely at the same time, so the key is duplication and regular maintenance.

    For example, hard drives tend to have a 5-year lifespan, but two identical hard drives in two different machines aren't going to fail on the same day unless there's a fire or natural disaster that wipes out your room/building/city/whatever.

    So what I do is have a system where your data is regularly backed up or duplicated at least two times to at least two devices, and then regularly check them. These days I use syncing software that copies my files to a second drive (internal or external - doesn't matter) and then to a second drive to another computer on the LAN. If the syncing software can't read the destination disk during the sync process, then I immediately know something's wrong with that disk, but it's no big deal to replace it and resync, because the chances of the original disk and the first duplicated filestore of both going bad in those couple of days is basically nil. The syncing happens every night. (And I know, I'm already up to three drives, but two drives would be fine, and drives are cheap anyways.)

    Every few months or so, I grab a portable external drive from my office a few towns over and make a new copy of my files, and then return the portable drive. The chances of both all my home copies of my files AND the external drive at my office going bad at the same time is practically nil. And the chance of a major natural disaster destroying the disks at both locations is very remote too.

    And then replace any drive that fails as the years pass. The new drives will be much bigger than the originals, allowing for more room for more data as the years pass.

    Frankly, I think the notion of keeping digital data store that remains inert for decades is silly. You're always making new data. You (probably) always want it backed up. So the data store should always be changing. Once you accept that, and then accept a storage medium that allows for regular changing and updating, and then accept the need for duplication, then data storage isn't risky. Add automated software to do the syncing for you, and it isn't even troublesome.

    (And what about those Apple // files, you ask? I have two working Apple //s I still play with to read the duplicated original disks. But those disks were also copied to a Compact Flash card, which I now use on the IIGS as a hard drive. The CF card is backed up to the PC, which an Apple II emulator reads just fine. The backup image of the CF card is always synced between the hard drives. I know, TMI.)

  40. Skydrive by viggity · · Score: 1

    I know M$ isn't that popular here, but skydrive.live.com gives you 25 GB storage for free. And I really doubt MS is going anywhere in the next couple decades. Plus, I'm sure that the 25 GB limit will be increased in the not so distant future.

    1. Re:Skydrive by nachoboy · · Score: 1

      I know M$ isn't that popular here, but skydrive.live.com gives you 25 GB storage for free. And I really doubt MS is going anywhere in the next couple decades. Plus, I'm sure that the 25 GB limit will be increased in the not so distant future.

      This may be perhaps the worst idea in the whole thread. Skydrive is designed for lightweight storage and sharing of documents and pictures, not long-term archival and backup of user data. Some of the reasons why:

      • It's free. By not paying Microsoft, you've established no business relationship with them. This means they have little to no business incentive to reliably keep your data for long periods of time.
      • No API. The only way to upload files is via the browser. There is no file system integration or other API available, which means there are no tools which can assist you to backup or restore your data in an automated fashion.
      • Max file size. Skydrive only accepts files up to 50 MB, which means you'll have to do your own splitting + joining for large files.
      • Content limits. Microsoft lists 18 types of data you may not store on the service, which includes "anything that mischaracterizes content you post or upload" (could be interpreted as no encryption) and anything that "contains the same or similar content to other content you have already posted" (no document revisions).
      • Undisclosed limits. The Skydrive help says "Microsoft may limit the number of files that each user can upload to SkyDrive each month."

      25 GB seems alluring, and the price is right, but don't let that blind you to the other limitations of the service that make it unsuitable for backup purposes.

  41. Paper of course! by Anonymous Coward · · Score: 0
  42. Solid State Drives? by klashn · · Score: 0

    How about Solid State storage drives? They are increasing in capacity. Intel currently has an 80gb model out. It uses NAND flash. They are quite expensive right now though. That 80gb model costs about $600-$700. Of course as the manufacturing process gets better the drives will become cheaper and the sizes, larger.

  43. Re:Jesus saves, but Buddha makes incremental backu by larry+bagina · · Score: 1

    Gmail is only free if you don't care what google (and their CIA/FBI/government overlords) knows about you.

    --
    Do you even lift?

    These aren't the 'roids you're looking for.

  44. Changing formats by DynaSoar · · Score: 3, Insightful

    [From Slashwayback]: Dear Keypunch, I have data I want to keep for decades. Should I invest in a good card reader, or should I transfer my data to these far more efficient but newfangled "floppy disks"?

    It's pretty ridiculous to expect one storage format to be viable for 'decades'. Not because it goes bad (even though it probably does), but because you're not likely to be able to maintain the necessary equipment for that long. If you find a storage solution, you need a retrieval solution to equal it. What equipment will you be able to find decades on that can access your storage, even if it stays good? You have no idea.

    I've been maintaining a collection of Apple IIs and recopying the programs and data regularly (mostly through full HD backup, reformat with error block deletion, reformatting and replacing) to keep it readable. I have machines and data between 20 and 30 years old. I recognized long ago this had become a hobby in its own right, as most of what I had hasn't been of interest to me for many years. The little bits that have been useful have been transferred to newer machines and formats several times. That's decreased as more and more of it can be found easily on the web (previously FTP/gopher/etc.).

    Get used to transferring your data to new formats as they come into widespread use, and recopying as necessary to keep them readable. Or else:

    [From Slashwayforward] Dear Galactic EM Field Computing, I just found about 20 pounds of aluminized plastic disks that used to have data on them, but I can't read them to tell if I still want it. Is there any museum that might want these? Or are there still any operating plastic recycling centers that might give me a few bucks for them?

    --
    "I may be synthetic, but I'm not stupid." -- Bishop 341-B
    1. Re:Changing formats by evilviper · · Score: 1

      It's pretty ridiculous to expect one storage format to be viable for 'decades'.

      Why? Decade and a half old USB devices continue to be compatible with practically all modern computers.

      Data CD-ROMs were popular in the mid 1980s, and Audio CDs were around even before that... Yet nearly every computer today can continue to access them.

      Even though optical drives are falling somewhat in popularity due to massive hard drives and USB Flash devices, I'd still bet very, very good money that CDs, and even DVDs, will be easy to access in a couple decades.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    2. Re:Changing formats by Anonymous Coward · · Score: 0

      floppy disks you choad!!! they are the future, get with the program n0000b

    3. Re:Changing formats by Anonymous Coward · · Score: 0

      20 pounds of aluminized plastic disks

      Still not metric, you say? :(

  45. redundant storage on the web by MarkWatson · · Score: 1

    I would suggest using at least two quality options from:

    1. Apple's MobileMe (used to be called .Mac): you get only 20 gigs of storage with a basic subscription, but storage upgrades are cheap enough.

    2. high quality managed (or partially managed) hosting - if you need this anyway for business use, get extra disk space. Make sure that their backups are regular and secure.

    3. Other paid for storage options.

    4. back up to DVD-Rs, and recopy every 2 or 3 years. I buy different brands, and rotate which I use. Redundant copies, redundant copies, redundant copies...

    Basically, secure long range backup requires laying out some money.

    For video, giving copies to friends and family members is good also, except that DVD-Rs have a short shelf life, unless you pay a lot extra for gold foil DVD-Rs.

  46. Two hard drives + fireproof safe. by urbanriot · · Score: 1

    Take two hard drives, different makes / models, dump all your important data to them and put them in a fire proof safe.

    1. Re:Two hard drives + fireproof safe. by maxume · · Score: 1

      Make sure it is a heat proof safe (I have seen people recommend getting media safes), not just a fire proof safe. Melted circuits are just as bad as charred circuits.

      Off site storage of one backup is probably worth it if a fire safe is worth considering.

      --
      Nerd rage is the funniest rage.
  47. The only fool proof thing I can think of- by Anonymous Coward · · Score: 0

    Hard copies of any important documents. Its been working for hundreds of years and is impervious to technology leaps. It might sound archaic now but in 10 years when your .wpd file or whatever needs a 8 year old program on a 15 year old OS to be read... well... it just makes sense.

    The other option would be to save everything as .txt or .rtf files. Universally readable on any OS.

    As for graphics files, I would try the .jpg or .png format. All programs still read .gif's, and those are 20 years old at least. I would suggest that most programs 20 years from now will read .jpg files or .png. As for media storage... I dunno. Perhaps a dedicated laptop with SSD drive big enough for all your data would be good. Integrated OS and media reader, should be good for 20 years.

  48. Re:Quality DVDs, archival storage, repeated backup by boner · · Score: 1

    Even quality DVD surfaces (on DVDs you can burn yourself) degrade quickly over a period of time (in my experience 2-4 years). Doing a re-backup every 3 years is too risky, it would have to be every two. In my case, with close to 1.6 TB of personal data (video, pictures, the works) it is not even practical, it would mean doing a re-backup of a DVD every two-three days.

  49. Continuous copying to new media by MpVpRb · · Score: 1

    My answer is to periodically copy all of my backup data to a new, larger hard drive. Drives get bigger and cheaper...my data grows...perfect match.

    As interfaces, formats, or software become obsolete, I always keep up with the current technology.

    I am skeptical of any solution that promises tens or hundreds of years of accurate retention.

    1. Re:Continuous copying to new media by Adaptux · · Score: 1

      What's the error rate which results from copying the data?

  50. Think Different by crdotson · · Score: 4, Insightful

    I think the issue is that people are thinking about this incorrectly. You don't really want to 'archive' this data -- keep it with you! Keep it with all of the data that you are using day to day and back it up and move it along with that.

    My home workstation still has files from 15 years ago on it. I've replaced the computer many times, had a few hard drives fail, etc. but I've always restored both current and 'archive' data from backups and kept going.

    1. Re:Think Different by Lally+Singh · · Score: 1

      I must disagree.

      An accidental rm -rf backups/foo/crap-that-washn't-what-I-meant can be pretty bad.

      A backup you keep and manually pull up when needed is safer in this regard.

      --
      Care about electronic freedom? Consider donating to the EFF!
    2. Re:Think Different by Anonymous Coward · · Score: 0

      Agreed. I do the same thing. All of my live data is on my laptop, and I back that up as and when I see fit to an external HD. I'll replace my HD (and other hardware) every few years or so anyway. New hardware -> drag and drop everything -> Bob's your uncle, I'm good for a few more years!

    3. Re:Think Different by crdotson · · Score: 1

      Sorry, I wasn't clear. Absolutely definitely make backups!

      But don't 'archive' data -- keep your 'archive' data together with your 'live' data. When you back up your 'live' data, back up your 'archive' data as well. When you move your 'live' data to a new machine, move your 'archive' data as well.

      Each new generation of storage is usually at least double the capacity of the previous one, so you don't normally have a problem with running out of space.

  51. Re:Quality DVDs, archival storage, repeated backup by gbridge · · Score: 1

    I can't imagine a guy with 500G of personal files. It must take a month just to read the titles of his various files.

    Off-topic but I thought I'd share how easy it is to accumulate 500G of personal data very quickly as I'm well on my way there: Digital photography. If you're a serious amateur and shoot in RAW format with a 14MP or so digital SLR, each photo can be around 15meg.

    I can take between 250 and 400 photos on a night out with friends, still having a blast at the time, and these all add up very quickly. Throw in photos of family, holidays and such-like and you're running into hundreds of gigs of photos very quickly; I've got just over 200G of photos after having my camera for around 10 months. Software like Lightroom, iPhoto or Aperture makes organising and searching them very quick and easy, too.

    The only things I'd be mortified about losing are my photos. It'd be a pain in the ass but code can be rewritten (probably quicker second time round, too). Mail is stored on the server. I can't ever again recreate the moment I pressed the shutter on the tens of thousands of photos I have, and I still haven't found a reliable way to back them all up frequently either.

    G.

  52. whatever you do, do it twice by petes_PoV · · Score: 1
    When you do find a solution - or just get sick of looking for the "perfect" one and therefore settle (which is what we all do in the end), don't just leave it there.

    Assume that something will go wrong. So don't just keep one copy - make sure there are at least two. Keep them in geographically separate locations: maybe with a family member, if you can trust them.

    Personally, I'd go for two different solutions: maybe one magnetic and one optical. However, whatever you decide on, make sure to get it all back and test it. Even better, rewrite it every few years.

    --
    politicians are like babies' nappies: they should both be changed regularly and for the same reasons
  53. industrial-grade compactflash by Anonymous Coward · · Score: 0

    i use industrial-grade compactflash cards.

    Sandisk used to make very reliable stuff, but they don't produce 'em anymore...

    nowadays it seems SiliconSystems SiliconDrive is the only product that's actually designed for extreme circumstances (-40 degC to 85 degC; 8% to 95% non-condensing humidity; vibration, shock and altitude according to MIL-STD-810F) and long-term storage (10 years guaranteed). they also have very detailed datasheets publicly available, so you can easily construct a reader in the future.

    the advantage of compactflash? small, durable, and very easy to interface with an IDE-bus - you just have to gamble on the fact that there will be legacy-IDE busses on the market in the future...

  54. Re:Quality DVDs, archival storage, repeated backup by Anonymous Coward · · Score: 1, Funny

    Because nobody wants to learn perl or bash, and C/C++ is just overkill.

  55. Worst case scenerio over-planning... by Anonymous Coward · · Score: 0

    Wow, I'm surprised Slashdot could be so miss-informed when it comes to this. While I agree that there is no way to put data in a spot for 10 years and expect to be able to use it easily, there are plenty of solutions for this depending on your needs.

    If you take the technology-skew argument to the next level you could say you will never be able to store your data for more then a year, because a asteroid would destroy it. NASA is not comparable to even a large business let alone the home user who posted this question.

    1. Open file formats or least common denominator formats. Convert documents to plain text. Avoid proprietary formats, obviously.

    2. Nothing fancy like SCSI. S-ATA obviously today. I'm sure they will still be offering systems with S-ATA capacity in 2018. Use what ever home consumers are using. 2028? Well since you used a consumer-grade system, you will be able to find supporting systems on eBay. How old is the Commodore 64? Even less popular systems can easily be found.

  56. Simple fix to this non-probem... by dmx11523 · · Score: 0

    Buy 5 Hd's....You'll have 5 copies and if you see technology changing, put it on the "new" tech....you'll be fine. The chance that 5 HD's all fail at once is zero. If that worries you, make copies of the drive on DVD's or Blu-Ray. I have HD"s from 15 years ago that I just copied onto newer drives. I left the info on the old drive too, I have 5+ copies of all my data at least. Then the copy on my PC. It would take a nuke hitting Detroit to destroy my data. If that happened (and I survive) I'd just head to my family in LA. I shipped them a drive a few years back that I check in on once in awhile. They send it to me sometimes to update but overall it's chillin' in LA safe.

  57. Eternal solutions? by BrentRJones · · Score: 1

    Might be better to start weaning yourself of anything that will not pass the death boundary. In a few years there will be so much personal information in digital form that no life form will ever get to find any MEANING in it.

    Philosophical yes, this posting is.

    --
    Help end the use of Sigs. Tomorrow
  58. There are no really good options by SwashbucklingCowboy · · Score: 1

    There just aren't any good long term options today for either consumers or businesses. Yes, CDs and DVDs can fail, but there's another problem - in twenty years are there going to be working devices that can reliably read that media? Maybe, maybe not. Earlier this year, the Storage Networking Industry Association (SNIA) started up a Long Term Data Retention group to address this very problem. Perhaps they'll be able to come up with something.

    Until then the only "solution" is to migrate data from medium to medium every few years.

  59. Parchment Scrolls? by gelfling · · Score: 1

    Clay tablets? Carving into marble?

    1. Re:Parchment Scrolls? by amn108 · · Score: 1

      Data density is just too low for the kind of volume he is dealing with ;-)

    2. Re:Parchment Scrolls? by Anonymous Coward · · Score: 0

      Not really, if he can get a device to etch it using lasers quickly and cheaply. (such at the optical disc mod to etch onto, rather than chemical change)

      Or modify a printer to do a similar thing.

      Or go classic and modify a typewriter with pneumatically powered keys.

  60. Re:Quality DVDs, archival storage, repeated backup by ColdWetDog · · Score: 1

    Or try digital photographs. Photography is my avocation. I have 1.2 TB of pictures after 6 years of shooting. I could probably cull another 500 GB off if I really wanted to... But managing that much data for long periods of time is an issue (which is why it crops up here from time to time).

    It's not just porn and Slackware iso's.

    --
    Faster! Faster! Faster would be better!
  61. Digital Preservation by akp · · Score: 1

    Well, you can start with the Wikipedia article on Digital Preservation. And then follow links around some. Or go ask your local archivist. Or find a librarian forum and ask there.

    Seriously--they're the experts. And there are a lot of digital archives at colleges and such that are on a small enough scale that their solutions would be practical for individuals too.

  62. "For decades if possible?"....... by Slugster · · Score: 1

    How about having it printed on acid-free paper?
    Seriously, if you want something that can interface with a common personal computer, then what matters is not the storage medium so much as the device interface. Once upon a time everything had 5.25" floppies, printer ports and serial ports.

    Just get a couple external USB2 hard drives, (two for redundancy) copy everything onto both of them, and there you go--ten years of good storage.

    (unplug them from the computer and the wall power when you're not actually reading and wrriting on them)

    If you happen to notice that USB ports are disappearing from new PC's before then, then you know it's time to convert to the next interface.

    It ain't that hard.
    At a job I had for a while, I would occasionally have to read stuff off drives that were 10+ years old. And I had that job back around the year 2000, so these drives were from the mid-to-late 1980's.
    ~

  63. Being that all the techology solutions can fail, by PolygamousRanchKid+ · · Score: 1

    Yesterday I set out in search of a way to store my documents, videos, and pictures for a long time without worrying about them.

    Try a different approach, why not simply *stop* worrying about them?

    Let your own brain be your backup. Lost great photos and videos of relatives and friends? At least you will have some fond memories of them.

    And when you die, and thus your backup is gone, then you can *really* stop worrying about them, like totally.

    --
    Schroedinger's Brexit: The UK is both in and out of the EU at the same time!
  64. Re:Quality DVDs, archival storage, repeated backup by Anonymous Coward · · Score: 0

    I have been using the free version of SyncBack (www.2brightsparks.com) for a few years now, and just recently decided to try SyncToy. I have personally used both versions of SyncToy to backup my data using Vista as my OS. Neither version of SyncToy was capable of backing up everything that I asked it to.
    First of all, I set it to Echo, which should make the backup mirror the source (ex. Deletions on source are deleted on backup). But, that didn't happen. At least, not all the time. Sometimes it would delete all the contents of the folder, yet not the folder. Other times, it would add new files to the folder while leaving the old ones as well. Very strange behavior if you ask me.
    So I went back to using SyncBack because I didn't exactly trust the way SyncToy worked. It actually found files and folders that SyncToy never backed up. I think SyncToy has a limit on either the number of characters in a filepath, or the branches in the tree structure.
    I don't know whether it was Vista to blame or not, but I kind of doubt it, seeing as how Microsoft says that SyncToy works with Vista.
    So far SyncBack has been the best free backup solution I've used for Windows. I challenge anyone to find a better (easier/faster) free backup program on Windows, because I would honestly love to try it. I just haven't found anything better at the moment, so I will keep using SyncBack.

    Note: I am not affiliated with www.2brightsparks.com, just a satisfied user.

  65. Friends + rsync over ssh by jlcooke · · Score: 2, Interesting

    I have gigs of photos (wedding, long lost family picnics, etc) and music that I can't bare to lose.

    Find someone who doesn't want to lose anything either and setup rsync over ssh. Synchronize often, rsync is very friendly to bandwidth.

  66. my solution by thermian · · Score: 1

    I have three levels of backup

    I have copies of important files stored on two different drives on my PC, then a current archive external HD, and then I fill external HDs and put them away safely.

    I just change my archive drives as technology improves such that I can fit more on one drive. Next in the purchase queue is a 1Tb mirrored raid external drive.

    If you move your data to whatever the latest technology is, your storage media won't go out of date.

    Works for me anyway.

    --
    A learning experience is one of those things that say, 'You know that thing you just did? Don't do that.' - D. Adams
  67. Mutually Exclusive by Thad+Zurich · · Score: 0

    "Store ... for a long time" and "without worrying" are mutually exclusive, just like "perpetual care" is an oxymoron (ever seen a historic cemetery?) Somebody has to periodically check the data integrity and correct single bit errors as they creep in, because over time it's "will", not "if". The key to data protection is redundancy. The key to data protection is redundancy. The second mechanism is "diversity" as in "bio": using technologies that will "go bad" at different rates to cross-check each other. Good cheap alternatives available now are Blu-Ray burners and external (eSATA) hard drives; however, you have to keep checking and fixing data integrity every year (so plan to check every six months). If you have enough data, a RAID 5 SATA NAS backed up to BD-R may be optimal. It should be possible to build a BD-R RAID, but I don't know of any market products at the moment.

  68. Some ideas by Sandman1971 · · Score: 1

    Nothing will last for decades without some monetary investments. You can get media that will last for decades, but will the computers 30 years from now be able to read them. The only way to get around that is to also pack away a PC with an OS and hopes it boots up 30 years from now. Or every 2-3 years you transfer the data to new media

    If you have a DV tape based camcorder, you can use it as a backup drive and keep the tapes offsite. I'm not sure how much data a DV tape can hold other than its in the multi gigabit range. Being tape media its good for years. I've re-watched videos that I made 5+ years ago on DV tape and the quality was still as good as the day it was filmed.

    For pictures, you can always get the ones you must absolutely keep printed (Walmart, Costco, etc..) and keep them offsite in an album or 2. If you want to re-digitize them after a loss, you can always re-scan them.

    You can buy archival DVDs. Those are a lot more expensive (3-4$ a disc) but are guaranteed for a hundred years (at least mine are.. but I probably won't be around in 100 years to test them). But again, you have the problem of reading that media sometime down the road. DVD readers will one day become obsolete, as will the data format.

    --
    It's better to burn out than to fade away
  69. Combination by The+Second+Horseman · · Score: 1

    I use drives, DVD-R (I don't use dual-layer) and things flash drives. Then, on top of that, I use Mozy's unlimited backup service. One reason? If I get multiple media failure, Mozy's a good extra option. On top of that, if I have a problem with a single file, it's just a hell of a lot easier to grab it back that way. I'm using my own encryption key for Mozy. Backing that up? Well, I've got it on a couple of different storage media, including two in a safe deposit box. And if Mozy goes away? Oh well, I've got my other copies, and I'll just switch to a different online backup service.

    I'm currently scanning in my old film and slides. It's 1 to 2 GB per roll of film, and there are hundreds. So for me, this is a long-term issue. And for any pictures I really care about that were taken with digital cameras, I eventually want to get high-quality prints for each of the kids as well. Paper has a way of lasting. After all, I know how my digital backups are set up, and my wife does, but it's hard to imagine that someone else might in 20 years. It's really easy for a relative cleaning up your house after you're gone to just think "oh, hey, some old hard drive" and ditch it. Besides, there's something about just being able to pick them up and look at it. I notice that on a computer, my kids want to flip between pictures quickly, but an actual print they'll examine in detail. There's just something different about how people look at it.

  70. This questions again. by Spaceball_3000 · · Score: 1

    Seems like ever year we this this same questions on Slashdot. There is no perfect solution that doesn't require some maintenance down the road. Personally I just keep two 500gb HD in software raid-1 mirror, on an free\cheap old pc (from craigslist) and boot once ever year, make sure both drives work, if once fails, swap. Then when future tech changes enough move the data to that equivalent format.

  71. Jump drives should give 10yrs or so by mrflash818 · · Score: 0

    After all, they are basically EEPROMS.

    I would imagine if someone wrote infrequently to one, then only used it to read data 10yrs later, there would be no corruption?

    --
    Uh, Linux geek since 1999.
    1. Re:Jump drives should give 10yrs or so by mrflash818 · · Score: 0

      I posted before reading previous comments. I see others have already had similar thoughts that the USB jump drives are a good candidate.

      --
      Uh, Linux geek since 1999.
  72. can't be done right now by mlwmohawk · · Score: 1

    It is just not possible to create a digital storage system that will be archive quality and guarantee it will be accessible in a time frame measured in decades.

    Burnable CDs and DVDs degrade, the foil comes off and the plastic corrodes. Magnetic tape may be a good idea, but will a computer, 10~20 years from now, support the storage media?

    I have stuff on 360K 5 1/4 inch floppies. If I could FIND a drive that worked, could I get the data? maybe. What about my CP/M floppies?

    A 10 year time-frame may be doable. 20 year time is problematic. 30 years? Not a chance.

    1. Re:can't be done right now by Anonymous Coward · · Score: 0

      > It is just not possible to create a digital storage system that will be archive quality and guarantee it will be accessible in a time frame measured in decades.
      > A 10 year time-frame may be doable. 20 year time is problematic. 30 years? Not a chance.

      Wouldn't a PROM chip last upwards of 50 years?

    2. Re:can't be done right now by mlwmohawk · · Score: 1

      Wouldn't a PROM chip last upwards of 50 years?

      You are assuming a PROM chip of today will be usable in 50 years? When was the last time you SAW a vacuum tube?

    3. Re:can't be done right now by Anonymous Coward · · Score: 0

      > You are assuming a PROM chip of today will be usable in 50 years? When was the last time you SAW a vacuum tube?

      It's a fair assumption. You can still build tube circuits. Some people, like audiophiles and guitar purists, will use nothing else.

    4. Re:can't be done right now by mlwmohawk · · Score: 1

      It's a fair assumption. You can still build tube circuits. Some people, like audiophiles and guitar purists, will use nothing else.

      I don't know how old you are, but my being 45 years old, I think I have some insight that may help you.

      Time was, long ago, you could go to radio shack and buy tubes. There were hundreds of different types. For all purposes that transistors do now. There was even a tube-tester in each radio shack so you could find a "bad" tube and buy a replacement.

      Fast forward today. There may be some audio-phool luddites who only use tubes, but that does not mean it is in any way a viable technology or that a tube manufactured 40 years ago is usable today. The socket type, the voltages, the gain, and the plate configuration could be anything.

      Now, take this analogy to microchip proms. Just 25 years ago most chips were DIP types with a 0.01 pin spacing. These days it is all surface mount or PGA.

      Would you call music on an 8-track tape usable today? How about Betamax? VHS is dying as we speak. Floppies? QIC-02 tapes? DEC-RKO drives? Have an old MFM-hard disk around?

      It just simply can not happen. Space needs grow too fast for any standard to last and be practical for any long period of time. Just look at hard disks. 10 years ago, 15G was a fairly large hard disk. These days disks are 40 times that size.

      Your best strategy is to just keep copying it on to new medium as time goes on.

  73. Caringo is close... by Dri · · Score: 1

    Recently I've been working with a storage cluster technology from Caringo called CAStor. This is ofcourse for larger deployments and similar to Amazon S3. Main difference is that you can buy your own hardware and run in your own basement. Their long-term strategy is to run their proprietary software on top of commodity hardware. As long as they are around you can always upgrade your iron to the latest commodity and upgrade the storage cluster every 3-5 years. Your data is available via HTTP/1.1. The standard "www" protocol of the Intertubes will ofcourse change over time, but CAStor will adapt or you can pull your data out and move it to whatever "eternal" platform you might find in 10-15 years.

    --
    Girls are strange. They don't come with a man page.
    -- Michael Mattsson
  74. Re:Quality DVDs, archival storage, repeated backup by maxume · · Score: 1

    Are you certain of the integrity of every bit on those floppy disks? That's a lot different than not having a problem the last time you pulled a file off of one of them.

    I tend to agree that a lot of media and storage failures are due to hammer-hands, but why risk it when the companies selling the products are almost universally competing on price.

    --
    Nerd rage is the funniest rage.
  75. There is no surefire way to do it, but... by mstroeck · · Score: 1

    ... you can make life for your future self easier doing the following:

    1) Only store information in formats that are ridiculously ubiquitous right now. Use (in that order): TXT, JPEG, PDF, AAC (or MP3, if you must), MPEG-4/AVC (h.264). The sheer amount of information available in (an on) these formats makes it more likely that you will be ably to recover them in the future.

    2) Keep REPLICATING it ALL THE FUCKING TIME, especially when there is a big shift in storage media (like floppies -> CDs or, right now, magnetic hard-drives -> solid state drives) and keep at least two copies in two different places.

    3) In addition, move it into the cloud if you can, as redundancy for your two physical locations.

    1. Re:There is no surefire way to do it, but... by DragonTHC · · Score: 1

      Thanks Steve Jobs!

      We are trying to break free from your turtle-necked tyranny!

      --
      They're using their grammar skills there.
    2. Re:There is no surefire way to do it, but... by mstroeck · · Score: 1

      What are you talking about? I don't chose which formats are are most ubiquitous and/or easy to implement.

    3. Re:There is no surefire way to do it, but... by DragonTHC · · Score: 1

      mp3 and avi are by far the most ubiquitous. any other belief makes you a fanboy.

      --
      They're using their grammar skills there.
    4. Re:There is no surefire way to do it, but... by LarsG · · Score: 1

      Avi is container, not codec.

      --
      If J.K.R wrote Windows: Puteulanus fenestra mortalis!
  76. Mod DVD writer to physically etch by Anonymous Coward · · Score: 0

    Mod a DVD writers laser to physically etch a pattern onto a disc.
    Create a reader to read it.

    Buy really cheap computers to run it from, and store them with it as well.

    Physical etchings still survive 1000s of years later in tombs and so on, so why not?
    True, the lasers probably won't etch very deep, but as long as it is sealed in a room with very little disturbance, its more of a non-issue.

    Also, you probably won't need to use an actual DVD or CD.
    In fact, there is probably better materials out there that could be cut to order for etching.

  77. Solid state drive by Anonymous Coward · · Score: 0

    I know that in the next year, solid state hard drives will be release and if I remember, with comparable storage size as regular hard drives. You can load your data onto one of those and keep it in a shoe Box if you like.

  78. Re:Hard drives fail, but rarely at the same time.. by Anonymous Coward · · Score: 0

    And the chance of a major natural disaster destroying the disks at both locations is very remote too.

    Guess you don't live on the Gulf Coast or any where near New Orleans....

  79. Gmail by Anonymous Coward · · Score: 0

    At least for documents, google docs maybe a good option. Also, gmail can hold small attachments (you email to yourself). My strategy is redundancy, to backup as often as at many different media as possible

  80. Offline RAID with striping like PAR by Anonymous Coward · · Score: 0

    Take the size of the total data to be backed and split it across the size if the medium (usually dvd-r at this time) Then archive the total mass split at the size of the medium. Then use something like par to create restore stripes. Backup the restore stripes in triplicate and the data in duplicate and you should be good to go.

  81. Depends... by Hurricane78 · · Score: 1

    Warning: Most CDs are just as bad for archival purposes. Buy CD-RWs. They have a higher grade material that lasts 10 years instead of 2-3. And buy media with a data layer out of gold. Not the green or blueish ones. They are organic and die rather quickly. Especially under light. Look for disks with a golden look.

    I personally would go with a DVD-RAM for 2x-3x speed. They also use gold, and are specified to survive 30 years. They also have many other advantages like defect-management (like hard disks), sectorization, being rewritable as often as a hard disk, and allowing full UDF usage. If only they were 500GB a piece... :)

    I personally recommend buying a good hard disk where all your data fits on, and then two others that are exactly the same.
    Then put them in hot-swappable bays, and create a mirror setup with a basic well-known file system. Now write the data to it and put the disks at 3 different locations (eg work, home, car [in a sealed bag!]).

    Now put them into the bays every month, and let a tool completely read and write the data on the 3 disks again, thereby letting the error correction do its job. This prevents the motor from sticking and the data from bit-rot.

    And just as important: Keep the system as it is, and keep it around, just for that purpose. Because some day, you may have disks, but no system that can read them.

    I would recommend making everything 3-times redundant (eg using 3 systems, and replace the failing ones, as soon as they fail), but excanging the system by a new one with new technology and new media every 10 years should also do the job if you're not planning for a nuclear war.

    --
    Any sufficiently advanced intelligence is indistinguishable from stupidity.
    1. Re:Depends... by darkpixel2k · · Score: 1

      Now put them into the bays every month, and let a tool completely read and write the data on the 3 disks again, thereby letting the error correction do its job. This prevents the motor from sticking and the data from bit-rot.

      What kind of tool are you using for refreshing the data on the disk? I'm not aware of anything other than SMART in linux that will perform non-destructive read/write tests--but I'm personally know knowledgeable enough to know if SMART is actually reading and writing the actual data and potentially correcting problems...

      --
      There's no place like ::1 (I've completed my transition to IPv6)
    2. Re:Depends... by vyrus128 · · Score: 1

      badblocks has a non-destructive read-write testing mode. You can't run it while mounted, though, for obvious reasons; with SMART you probably can.

    3. Re:Depends... by setagllib · · Score: 1

      You can use badblocks in Linux to exercise each block. If you're storing checksums you should assert those as well. At least you'll know very quickly which files are lost and need to be recovered from backup.

      --
      Sam ty sig.
    4. Re:Depends... by robbak · · Score: 1

      dd -if=/dev/ad0 -of /dev/ad0

      It's actually in the man page. It will read every byte and write it back. The drives error correction will ensure that the data comes off correct, and any bad sectors will be mapped off.

      If you like buying things, spinrite will do a very complete job of it.

      --
      Prediction for end of Universe #42: Fencepost error in Quantum_bogosort.cpp
  82. Re:Quality DVDs, archival storage, repeated backup by Anonymous Coward · · Score: 0

    I have a pile of floppy disks that are still intact after 15 plus years and I stored them like a barbarian.

    How does one store a barbarian?

  83. The answer is simple... by multimediavt · · Score: 1

    To the OP:

    Answer this question: How important is your data? If it's really important, like it is to a Fortune 500 company, the Federal Government or other important data creator then a tape backup system is worth the investment. I don't get your assertion that they (tape drives) have "low storage capacity". LTO and DLT drives have native capacities of 200GB or more these days. Yes, the drive will cost $1000+ and the tapes are about $50 each, but if your data is really that important that you have to hang on to it for decades, then that's your best solution.

    As someone that has lost some "precious" data over the years I will also say this: If it's more than five years old, chances are you don't need it anymore and you're just being a paranoid pack rat. But, if you are dead serious about long-term data storage, then tapes are the way to go.

  84. PRINT the documents and photos by ffflala · · Score: 1

    If properly printed and stored, the photos and documents will outlive you.

    For the documents, I believe that you want laser prints on acid-free archive quality paper.

    For the photos, call a university art library and ask them for recommended paper & ink combinations.

    Still use digital storage for easy access; keep the printed material stored. But until the media is proven, it's probably best to consider all optical and magnetic storage volatile, either due to its nature or due to obsolete hardware. Just be redundant --store thumb drives and a hard drive, for example, and copy the files onto your working machine every time you get a new computer.

    1. Re:PRINT the documents and photos by the+eric+conspiracy · · Score: 1

      I'm not too fond of laser printing for archival storage. The thermoplastic tends to stick to opposing pages. The best I have seen is pigment based ink.

      A technology that I have used in the past is data encoded by punching holes in paper. No ink to fade. I'd think that would work very well so long as the paper is stored sealed.

    2. Re:PRINT the documents and photos by ffflala · · Score: 1

      I've seen thermoplastic sticking before, but never so severe that it degrades the content. My understanding is that sticking occurs in overly humid storage environments.

      OTOH, I've seen severe ink degradation, though that is more a problem with color images than documents. Maybe you're right -- the worst material I've seen has been over a century old, at which point poor paper quality has much more of a detrimental effect than fading ink.

      At the risk of letting a joke whiz by over my head, it seems that that the punch card method would run into the same problems of obsolete hardware one gets with other file-type methods.

    3. Re:PRINT the documents and photos by the+eric+conspiracy · · Score: 1

      With a little practice punch cards are human-readable.

  85. USB Hard Drives, lots of them by grumbel · · Score: 1

    If you need storage for 500GB I would go with harddrives, they certainly won't last forever, but they can last quite a long long while and they also happen to be cheap and in the case of USB drives they are also very easy to use if you want to check stuff and also highly compatible with every OS out there. Duo to their cheapness you can simply use lots of them, copy data from one drive to another every year or so and in not much time you will have plenty of drives with redundant data sitting around. And in a year or two you might be able to switch to SSD drives instead of spinning plattern for more reliability.

    If that isn't enough I would also suggest to go multiple routes if possible, i.e. you might want to backup your emails and other stuff that is small enough to CD or DVD in addition to the harddrive.

    Another important issue to worry about is data integrity, since data that is still readable might still be trashed. So keep MD5 checksums or Parchives around for checking and recovery.

    In the end the most important point simply is to keep multiple copies of the data around and if possible off-site, so that a burned down house doesn't kill it all.

    1. Re:USB Hard Drives, lots of them by petes_PoV · · Score: 1
      The biggest problem isn't the hardware - it's the interfaces. 25 years ago, all the 5 1/4 inch hard drives had either SASI (not SCSI, that came later) or HP-IB (IEE-488) interfaces. I have couple lying around still. Now, the data on these may be OK - they haven't been mistreated, however there are no interface cards for PCI buses, nor the drivers to use them with any modern operating system.

      For that reason, you'll have to refresh the hardware every 5 years or so, just to keep up with the changes to interfaces, O/S's and filesystems.

      --
      politicians are like babies' nappies: they should both be changed regularly and for the same reasons
    2. Re:USB Hard Drives, lots of them by grumbel · · Score: 1

      Thats why I mentioned copying stuff around. Even if you do it only every 5 or 10 years you will have zero problem with the interface. And in case of a harddrive its really trivial since its a single command without any extra manual work, no disk swapping required as with floppies, CD or DVD.

      I also find it a little hard to image that USB will go out of fashion anytime soon, after all almost *everything* external these days uses it, which wasn't the case for 5.25" drive interfaces.

  86. the answer is time tested.... by 3seas · · Score: 1

    ...Paper! as what ever is on it will always be scan-able and software can be written to decode it.
    So how much paper does it take to print out the code of the linux kernel? Depends on how fine the print is.

    There is another, but it has not yet been developed to a consumer level. Perhaps its only an idea in a lab yet.

    We can make diamonds and other synthetic gems as well as many other things like Integrated circuits.
    By permanently growing the data into the crystalline gem structure or Integrated circuit that is then encased.....

    In essence establishing a permanent atomic structure that represents the data.

    Of course with enough heat or hammer it too can be destroyed.

             

  87. Do what the Rosetta Project does by Anonymous Coward · · Score: 0

    The Rosetta project is an organization that seeks to document all of the worlds languages before they die out, and their medium for doing this is to use a laser to burn a copy of some book (I think it's Genesis) deeply into stainless steel plates.

    In 30 years you might have to rig up something to extract the data and process it, but it will still be intact.

    Also, I'd like to point out that analog and physical storage media last a long time if they're cared for, much longer than digital copies. Think about it: record players are still around, laser discs are not. And even if record players vanished, you could buy a used one, or even build your own mechanical phonograph player (you just need a needle, a turntable, and an acoustical horn to magnify the sound). Try building your own DVD player from raw materials! The same thing is especially true for video. Film projectors are not terribly mechanically complex, so if your video is all on archival quality 8mm or 16mm film, you or a handyman could build a projector to play it in 30 years. Those WMV's? Not so much.

    And, of course, use printouts on acid-free paper in a dark, dry storage area, and they'll outlast your stupid Word docs by a factor of ten, or more.

  88. Not the first civilization so challenged... by Saint+Ego · · Score: 0

    Considering the speed with which both data and medium deteriorate with age, perhaps it is not surprising that ancient civilizations continuously relied on stone carvings and other physically substantial means to record information.

    Being able to create something that will last down through the ages has been one of the classic challenges to mankind.

    Even if you were able to come up with a solution for preserving data integrity, there is nothing to prevent purposeful destruction by third parties. Even the library at Alexandria burned.

    Unfortunately, the only information that has ever been successfully preserved involves science, and that is because we appear hard-wired to reverse engineer how the world works. It is rather difficult to forget the laws of physics when they are repeatedly proven on a daily basis.

    --
    Reality is prettier inside my head...
  89. Media Migration by Anonymous Coward · · Score: 0

    I think everyone here has touched on it, but its something we discussed 20 years ago at work, and its still the same "issue".

    The *only* solution really is to migrate your data to whatever new form of media exists every 10 years or so. In the 90's you'd have been burning your data to CD's, but I'm not sure the media itself will last more than maybe 20-25 years. Now we have DVD, and most can still read CD's (plus Audio-CD's will keep the basic "CD" idea around a while). But how long until that goes the way of the 8-track and vinyl records?

    Tape generally is worse, someone mentioned the old reel 1600/6250 tapes - yup, used those.. then 4mm DAT, DLT... quite honestly CD's have been a better long-term option than any tape format I've encountered in my 25+ years in IT. Tape is meant for *backup*, not long-term archival.

    That of course leaves the other side of the equatio, since you are talking about "digital media" - the software side of things comes into play (had this at work as well)... Does Excel still read "visicalc" files? Will it support Quatro Pro format in 10 years? If you have a pile of documents from the early 90's in WordPerfect for DOS, will you be able to read them when you need to? Convert them all to PDF? Will PDF be around in 20 years? If not, will a copy of Acrobat 9.0 still install on whatever hardware/os you are running then? Could you even install a copy of XP on such a system, to then install Acrobat and read your files? Is the data, in 20 years, even still relevant/useful, or are you just piling up data that'll never be needed? How do you decide w/o taking it on a case-by-case basis?

    Lots of questions, and as someone said, even NASA and a lot of other companies haven't figured it out...

  90. Paper! by SnEptUne · · Score: 1

    Unless it is video, paper tends to be more reliable than all those high-tech methods. Books were still readable after thousands of years.

  91. Not Amazon S3 by Anonymous Coward · · Score: 5, Informative

    If your data is not in your possession, how do you know others won't see it or edit it without your permission? For archiving purposes, the best technology is magneto-optical. Despite the fact that makers have been exiting the market due to competition from faster and larger-capacity technologies, MO remains the champ for data storage duration. Remember, it is partly based on a Natural phenomenon that lets geophysicists detect which way the Earth's magnetic field was oriented, hundreds of millions of years ago --data retention just doesn't get much more long-term than that. MO disks are removable from the drive, and every modern drive can read any older same-size disk (they come in the standard 3.5" and 5.25" sizes, but have quite a range of capacities), so if the drive fails, just make sure you have spares. Perhaps, sooner rather than later, the manufacturers will realize that archival storage is a niche market that will demand that they stay in the MO business.

    1. Re:Not Amazon S3 by drsmithy · · Score: 2, Insightful

      If your data is not in your possession, how do you know others won't see it or edit it without your permission?

      Encryption ?

    2. Re:Not Amazon S3 by anagama · · Score: 3, Insightful

      Encryption only works because brute forcing the scheme on current hardware is ridiculously time consuming. Encrypting with today's standards does not protect against future advances in computing power.

      --
      What changed under Obama? Nothing Good
    3. Re:Not Amazon S3 by MikeBabcock · · Score: 3, Insightful

      If someone's willing to brute-force a password to change your data, they're willing to copy your DVD's ISO, edit it, and burn a new copy to replace it with.

      --
      - Michael T. Babcock (Yes, I blog)
    4. Re:Not Amazon S3 by edgez · · Score: 1

      Use 7zip to compress/password/encrypt --encrypt file names. and err don't forget your password ;)

      --
      BiTTNews.com
    5. Re:Not Amazon S3 by osu-neko · · Score: 1

      If your data is not in your possession, how do you know others won't see it or edit it without your permission?

      It's called a lack of clinical paranoia, combined with the fact that I'm not the president, a CIA agent, or anyone else of importance. I could post my data publicly on a website and count on the fact that no one would see it, much less make any attempt to edit it.

      There are people who have to worry about the kinds of issues you raise. But for every one of them, there are hundreds of people who are concerned about those things due to some delusional state, not sure of it's paranoia or delusions of grandeur.

      Likewise, I don't need triple-DES for my encrypted data when no one would even bother with the effort to run it through rot13...

      --
      "Convictions are more dangerous enemies of truth than lies."
    6. Re:Not Amazon S3 by anagama · · Score: 1

      Physical objects not connected to the internet are much simpler to secure if only for the fact that at most 100s of people would even be close to the data in the physical world, but millions are "close" to it in the digital world.

      --
      What changed under Obama? Nothing Good
    7. Re:Not Amazon S3 by paganizer · · Score: 2

      yes..
      But, they have to climb the wall, get past the guard dogs, fool the electronic entry system, climb through the ductwork, fool the motion sensors, crack the safe, then manage to get off the premises alive before the response team gets there, and even THEN they will have problems defeating the ROT13 encryption my backed up savegames are protected with.
      I take PBEM games Very Seriously.

      --
      Why, yes, I AM a Pagan Libertarian.
    8. Re:Not Amazon S3 by Anonymous Coward · · Score: 0

      exactly. i mean, how do you know someone won't sneak some kind of transmission device inside of your magnetic tape cartridges? are you going to handcraft your own backup media and tape drive?

      if you want to be that paranoid then you better get all your money out of the bank and invest in some precious metals.

    9. Re:Not Amazon S3 by paganizer · · Score: 1

      But seriously, folks..
      I've been using CD's for critical information backup for over a decade; when I got my first single speed burner, writable CD's were huge, heavy, and apparently near indestructible; the only protection mine have enjoyed since 1994 is that I keep them in a briefcase.
      When the media started getting thin & flimsy, around '99, I switched to "black" media; once again, no problems at all.
      It's getting harder to find the blacks lately, so I've started doing, I guess you would say "environment stress testing" on DVD rewritable media; they seem to be the toughest thing available.

      BUT. for day to day operations? RAID-5. It's a no-brainer. I've got a client I setup with a software SCSI RAID-5 for their day-to-day storage back in 2000, 3 35gb drives originally, they added one in 2000, 2 more in 2001; I then switched them over to one built on 3 73gb drives; that lasted until early 2007, when they went to a 4x146gb array; in retrospect they probably should have waited a year, but oh well. Please note that the original 175gb array and the subsequent 365gb array are still in use in different departments with lesser needs. when they hit 730gb with the current array, I might suggest that they think about doing a pair of Debian soft SATA 4 disk arrays to save money; you can put together a 3TB 4 disk SATA array tomorrow for $328, while SCSI is apparently, suddenly, inexplicably, dead. can't quite figure that out.

      --
      Why, yes, I AM a Pagan Libertarian.
    10. Re:Not Amazon S3 by DaffyDuck101 · · Score: 1

      I second the MO suggestion. We've used them for storage in my line of work for years before changing to CDs. While the CDs (archive grade charge-you-through-the-nose special coating yadda yadda) bork at an impressive rate, I've never had an MO disc fail on me (1500 discs between 10 and 15 years old)

    11. Re:Not Amazon S3 by Reziac · · Score: 1

      It doesn't matter. If your encrypted file is in someone else's hands, and they want to damage your data, all they have to do is assault the file with a hex editor and overwrite the data with a bunch of random bytes. Your data is now toast, despite the encryption being intact.

      This is essentially the same issue as "my hard disk has bad spots and it's eating my files!" except induced deliberately by a 3rd party rather than accidentally by techno-breakdown.

      On that note, when files are out of your hands, you are now relying on someone else's backups and data integrity -- which may or may not be better than you can achieve on your own!

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    12. Re:Not Amazon S3 by tscheez · · Score: 1

      Paranoid much? S3 is pretty secure. If someone gets access to my S3 account they'll find lots of pictures of my kids, some documents and that's about it.

      --
      Supplies!
    13. Re:Not Amazon S3 by mhall119 · · Score: 1

      Encryption only works because brute forcing the scheme on current hardware is ridiculously time consuming. Encrypting with today's standards does not protect against future advances in computing power.

      So don't give them all of your data. Treat online storage like a RAID, and give give each one a few "stripes" of data. No single entity would have enough to reconstruct your data, and if any single entity suddenly disappeared you'd still have enough in the others to reconstruct your data.

      But at that point, just keep it on DVDs and re-copy them every few years. Seriously, it's cheap, easy and reliable for as long as you're willing to put in a minimal amount of effort. You also won't run into a problem of having an unsupported format in the future, because when your current format is going obsolete your next copy can be in a newer format.

      --
      http://www.mhall119.com
    14. Re:Not Amazon S3 by SanityInAnarchy · · Score: 2, Insightful

      It actually kind of does.

      In particular, modern RSA, even considering future advances, would require fundamentally different technology (quantum computing) to make it feasible.

      By "feasible" I mean that if quantum computing doesn't become practical, a 4096-bit RSA key will survive the heat-death of the universe.

      Crypto is very, very rarely the weak point. It's almost always how that crypto is used.

      And by the way, mods -- blanket statements like the parents' are easy to come up with, and easy to make sound intelligent, but there's no meat to them. It's kind of like the blanket statement of "The only secure computer is one that's not on the Internet." Easy to come up with, sounds reasonable, also entirely wrong.

      --
      Don't thank God, thank a doctor!
    15. Re:Not Amazon S3 by afidel · · Score: 1

      SCSI has been replaced by SAS (Serial Attach SCSI). You won't find huge capacity SAS drives because most applications need IOPS and if you need more storage you add additional spindles which gives you more of both. There's also the problem with 1TB drive rebuild time, heck we're concerned enough about rebuild times on our 300GB fibrechannel drives that we are going with 5% hotspares to reduce the chances of a double fault causing data loss (despite a 1.5% annual failure rate for similar drives in our array).

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    16. Re:Not Amazon S3 by wirelessbuzzers · · Score: 1

      Encryption only works because brute forcing the scheme on current hardware is ridiculously time consuming. Encrypting with today's standards does not protect against future advances in computing power.

      Actually, future increases in computing power are about the one thing modern crypto can protect against. There is not enough energy in the galaxy to brute-force a 256-bit symmetric key:

      ...brute force attacks against 256-bit keys will be infeasible until computers are built from something other than matter and occupy something other than space."

      --Bruce Schneier

      Ah, but what if someone builds a quantum computer? Well, it turns out that in that case, they might be able to brute force your key, and it might only take enough energy to boil the oceans dry, instead of all the energy in the Milky Way. So if you're worried about that, encrypt it 3 times (not just twice... someone able to store a quantum bit of memory on every atom in the earth might be able to boil the oceans to decrypt it...)

      Seriously. While modern crypto cannot protect you against new developments in mathematics or physics, bugs, side channels, malware, torture, telepathy, theology or blind luck, it can protect you against increases in computing power if you choose the right key size.

      Same goes for authentication, by the way... if you MAC all the data, it should be secure against tampering forever, if not longer, with high probability. [Longer than forever, because the MAC may not contain enough information to brute-force the key.]

      --
      I hereby place the above post in the public domain.
    17. Re:Not Amazon S3 by paganizer · · Score: 1

      If the standard SCSI drives were available in the same sizes as are SATA drives, I would probably say go with RAID 6; you don't have to worry about an additional drive failing while you are rebuilding (it happened to me. once. Nightmare.).
      By the way, I haven't done the 3x1TB SATA array, but I have done a 4x500gb one; rebuild time was 6 hours, without load. With load... well, thats the reason I suggested dual 3TB arrays; throw DFS/VFS on them, and when you have to rebuild, remove the one with the problem from the network. a Mirror set of arrays.
      But, honestly, I know squat about SAS; it's obviously something I'm going to have learn about, it sounds interesting as hell.
      Luckily, no one has asked me any questions about it yet...

      --
      Why, yes, I AM a Pagan Libertarian.
    18. Re:Not Amazon S3 by gnupun · · Score: 0

      It's called a lack of clinical paranoia

      One man's paranoia is another's common sense. All these government bodies did get power handed to them on a silver platter. They had to bully, deceive and twist a lotta arms to get it. If the collective sheeple willingly and stupidly hand over power (in the form of their information files), these bodies will happily accept. In this age, information is a valuable asset, not something you want to blindly entrust to someone else, if possible.

    19. Re:Not Amazon S3 by mathew7 · · Score: 1

      You all talk about encrypting, but did you ever think about the key? What if you loose your key? Maybe in the future you will use another key and forget about the one you used for achiving. And when you need the data......where did you put that key?

      I already started using one DVD-R disk and one DVD-RW disk for each archive. While I can check in the future and possibly retrieve damaged data from at least one, I'm thinking what will happen when I'll have 100 archives? Will I allocate 1 year to check 2disks/week?

    20. Re:Not Amazon S3 by pyite · · Score: 1

      Encryption only works because brute forcing the scheme on current hardware is ridiculously time consuming. Encrypting with today's standards does not protect against future advances in computing power.

      Do you know much about encryption? Actually, no, you've already answered that. Today's encryption standards sufficiently protect against any conceivable increase in computing power in your and many generations of your offspring's lifetimes.

      --

      "Nature doesn't care how smart you are. You can still be wrong." - Richard Feynman

    21. Re:Not Amazon S3 by wirelessbuzzers · · Score: 1

      You all talk about encrypting, but did you ever think about the key? What if you loose your key? Maybe in the future you will use another key and forget about the one you used for achiving. And when you need the data......where did you put that key?

      You can write the key down and put it in your wallet or a safe, use a key you will remember, put hints on the server, or whatever. Ultimately, you need to retain some token (either a physical token or information) which will allow you to access the data, and a key is about as small and easy-to-protect a token as you could ask for.

      --
      I hereby place the above post in the public domain.
    22. Re:Not Amazon S3 by mathew7 · · Score: 1

      I really don't think you want to "write" even a 512-bits key. 512 bits means 64 bytes, but they are not human-readable. To make a human-readable key you need to convert it to another format (like base64, uue etc.) which means another algorithm, another skill and human error (I even have trouble with game keys that are 5x5 long).
      If you mean a password then you should know that either the key is created from that password or the real key is encrypted with that password. In any case, you still need the algorithm that created the key from the password, which in 10 years could be scrapped and todays applications (which use it) may not run on future computers (ok...this is a little exageration, but I'm obsessed by "worst-case scenario").

      Achiving is a big compromise between availability and security. The security part can have many points of failiure which could render the archive unreadable.

    23. Re:Not Amazon S3 by Chyeld · · Score: 1

      And by the way, mods -- blanket statements like the parents' are easy to come up with, and easy to make sound intelligent, but there's no meat to them. It's kind of like the blanket statement of "The only secure computer is one that's not on the Internet." Easy to come up with, sounds reasonable, also entirely wrong.

      Because we all know that in addition to being off the next, it should be encased in concrete, then placed in a faraday cage, then encased in epoxy, then buried in the mantle, and never ever turned on.
       
      Duh.

    24. Re:Not Amazon S3 by MikeBabcock · · Score: 1

      SAS is a very cool protocol. One of the coolest features is how they learned from SCSI and even made it forward compatible to SCSI.

      That is to say, you can plug a bunch of SATA drives into an SAS controller card if you like, to save money or have higher capacities with a high-end SCSI controller.

      --
      - Michael T. Babcock (Yes, I blog)
    25. Re:Not Amazon S3 by wirelessbuzzers · · Score: 1

      Point taken: you could put the software on the server, but that's no good if the server is malicious.

      But really, in looking for a technological solution, you're assuming that some baseline of technology will be around in 10 or 20 years, even if not in common use. So you can probably stick the source for GPG on a disk and use that disk to read the data.

      --
      I hereby place the above post in the public domain.
  92. Open source it. by lidong2121 · · Score: 1

    Or let other one pirate it.

  93. Re:Amazon S3 - jungle disk makes it easy to use by blahbooboo · · Score: 1, Redundant

    Its not really that cheap, and not that simple to use for personal backups. Unless you are willing to write your own backup scripts, its going to be a headache.

    Querying S3 for a list of stored files is *very* slow, and you only get 1k results per query. This means you have to index what files you put in S3 in a local db. This allows you to ask the db what files are there (and how to grab them).

    If you only have a few files you can use the S3 browser extension for Firefox (or one of a many file system mounting, ftp simulating, etc tools). Just keep in mind the 1k file limit per query and box things in folders of no more than 1k items. Otherwise you will have a very slow browsing experience.

    I have around 120 GB of family photos and purchased mp3s that I would like to store. To store 120 GB at .15 per gigabyte/month for 1 year would cost me: $216 (at $18 a month).

    We use it where I work, with great success, but it would be much to much work for me for a personal backup system.

    Considering the cost, I would go with a consumer targeted app (there are LOTS of them). A number of them charge a flat flee for "unlimited" storage. Beware of how you interface them. Some support windows only.

    Try Jungle Disk http://www.jungledisk.com/ . It makes S3 easy to use...

  94. the quandary... by Dzimas · · Score: 1

    The vast majority of our "irreplaceable" personal data will be completely meaningless in a couple of decades. Personal conceit leads us to preserve everything "for our kids and grand kids." However, your grand kids probably don't need fuzzy snapshots of you with a skanky ex-girlfriend on your lap, or a hundred photos of some long-forgotten dog. The best approach is to create and document a "best of" collection of videos and photo. Preserve it in multiple locations and recopy it onto new media as it becomes available. Come to think of it, this is much the same comment I had last time this question slithered onto the /. front page...

  95. Analog - the only way by Anonymous Coward · · Score: 0

    You mentioned 'cheap', as otherwise all the answers saying "duuuuude, ditch the digital - go analog!"

    The only way to go is to select the photos that mean a lot to you, pick the documents that you don't want to lose, then print them. Many photofinishers use archival papers today, and a 5x7 is pretty cheap. Store in a cool, dry place away from direct light. Paper can be stored in a file cabinet for decades. Multiple copies can be made and distributed to friends and family.

    The cost can be managed. Just think of who else would want to keep all 500GB of photos and documents. When you review this stuff again, if ever, you can pick out anything else that should be saved.

    1. Re:Analog - the only way by dangitman · · Score: 1

      If you want archival photos, you don't want color prints. You want color separations recorded on silver halide monochrome film. That's not cheap at all. A 5x7" print is not going to store all the data in your original images, anyway.

      --
      ... and then they built the supercollider.
  96. Re:Quality DVDs, archival storage, repeated backup by Zwicky · · Score: 1

    I've never used SyncToy, but for my Windows box I regularly use SyncBack, of which there is a free version available.

    --
    "Three eyes are better than one" -- Lieutenant Columbo
  97. OP here by BeanBagKing · · Score: 1

    There's two reoccurring theme's I keep seeing in the posts here. 1) Is the media I make the backups to going to be readable in 20 years? i.e. will there be compatible hardware and software in 20 years? and 2) Off site storage in case of fire, theft, etc.

    1) As one person put it
    "Psh, its not like he's gonna wake up 30 years later and go "Oh yeah! My data! I need it right now!" And then feverishly try to push the USB slot into the wireless receiver, and then start screaming "OH THE FOLLY OF MAN"

    If USB really started disappearing, you would go get your USB drives and copy the stuff off of them "

    2) Again put perfectly by a poster, what if an asteroid came along and destroyed it! I could keep all my stuff offsite, but what happens if the offsite location burns to the ground? There is no failsafe from natural disasters, we all plan the best we can, but to me this is a separate issue and not one that I care about when I posed the question.

    Several people point out that this shows up on slashdot every few months, and the reason it does is because there has yet to be an answer. Perhaps I was hoping my predicament was different and there was an answer for it, but it doesn't seem so. Maybe I should scribe all my data to papyrus and bury it in a pyramid, that's the only thing proven to last several thousand years.

    It seems odd to me though that, as a whole, as a world that's invented so much technology, we don't have any media that will reliably last beyond 5 years, much less 20. Sure, anything could happen to any media, I could have my data inscribed on steel plates and they would eventually rust. Failure is always an unknown, but I'm talking about reliably. We know HDD's are reliable for at least 2 years, most are warranted for 5. In this technological era, why don't we have anything that's reliable in for 20 years? Who cares if it's readable, sure, a fire could burn it down, but we don't have one digital storage solution that's reliable for that long? I guess this was what I was hoping would be answered, but as has been stated, there is no good answer yet.

    BTW, the best solution I've seen so far is a 500GB external and a live backup living on my computer. Hopefully someone can come up with something better when this question is posed again in 3-6 months.

  98. Suck it up by Anonymous Coward · · Score: 0

    Suck it up and purchase backup hosting from Mozy. That's your best bet. $5 a month for someone else to be responsible for your data. You also have the opportunity to add to your backed up data whenever you want with an unlimited cap. It'll be a lot less expensive in the short term than buying 2 external hard drives.

  99. Comment removed by account_deleted · · Score: 1

    Comment removed based on user account deletion

  100. SSD RAID1 or 5? by travbrad · · Score: 1

    I realize SSD's are still pretty expensive for large capacity storage, but they will come down in price over time. They are already a great solution for smaller capacity needs. SSDs can detect when a cell of the drive is going to fail, and write the data somewhere else automatically (unlike with traditional HDDs these failures tend to occur on writes rather than reads). If you put two or more of these together in RAID, that's a pretty secure backup.

    It doesn't solve the problem of natural/environmental problems of course, but I suppose you could make monthly off-site backups if you really wanted to (to either a standard HDD or another SSD).

  101. Suggest a better approach...keep your data modern" by blahbooboo · · Score: 2, Insightful

    There really is a simple way around this - and it is what I've done - I've got data 25 years old and it's still relatively easily manipulated with a little work. I've found floppy disks are relatively resilient, and old hard drives seem to keep their data for a long time. I've got a computer, display, keyboard, and associated peripherals stored for every generation of data that I kept:
    1.I have a Commodore 64 with floppy drive and cassette drive stored in a box with the floppy disks and cassettes from that generation (late 70s/early 80s).
    2.I have an IBM PC/XT with keyboard, a 5 1/4" floppy, 3 1/2" floppy, internal 20MB hard drive, and CGA monitor stored in a box with a load of 5 1/4" floppies filled with data from that generation (Mid 80s).
    3.I have an IBM RS/6000 with display, keyboard, and mouse and internal 500MB hard drive loaded with all my docs and projects from that generation (early 90s).
    4.I have a Pentium 2/300 PC * 15" monitor with windows 98, CD R/W drive, 3 1/2" floppy drive, and USB ports - and a crapload of CD's and 3 1/2" floppies full of stuff from that generation (Mid/late 90s).

    When the current generation looks like it's going to be moving on, I'll put away a Core 2 Duo system with 1 TB of hard drive full of stuff with the different OS's I used loaded on it with boot manager (Ubuntu, XP, FreeBSD), a crapload of USB keys full of documents, along with burned DVDs etc. That'll take care of the "'00" generation.

    The answer lies in not only archiving your data "of the generation" but the essential equipment needed to access it. I may have a heck of a time moving data off of my Commodore 64 - but I can at least see it and access it - I believe I stored a modem with it - so at worse I could set up a terminal server that it could dial into and dump data to. All the other systems I'm pretty sure I could recover stuff from - even if the PC/XT does have an MFM hard drive, etc.

    I have data 18+ years old . You're approach is admirable, but why not just move your data forward with technology?

    When floppies started dieing, hard drives got large enough so I moved all data off the floppies to hard drives and optical media. When word processing software I used started dieing, I moved all my documents or obtained converters to MS Word format. Also, I don't archive music and movies. I do archive pictures etc.

    I believe the best approach is just keep your data moving forward & current and not in some archaic format. This means I have 3 redundant copies of all my data on hard drives using a current OS.

    Granted I don't archive "silly things" like music which I can re-create; but rather just personal data (i.e. personal documents, pictures, personal videos) so the total quantity of data after 18+ years is only about 13gb.

  102. Dreamhost file forever ... by Anonymous Coward · · Score: 0

    Dreamhost has a service they call file forever. It is $.01/4 MB to store your files forever. You pay once and you can read your file forever.
     
    Now the question is what happens if they disapear...

  103. Redundant copies, frequently maintained by gweihir · · Score: 1

    Either that or archival grade tape. That will cost more than 1000USD/EUR though.

    There used to be a solution that was adequate: MOD. But nobody cared enough and it was not cost effective to develop it further. It is still used to archive digital X-rays, but apart from that it is pretty dead. BTW, the MOD "successor" DVD-RAM is a bad joke.

    As to USB-FLASH, you will still need to check and refresh the contents every 5 years or so.

    My personal solution is to have a archive directory on all my machines that is synchronized. Currently I do this manyally, but I plan to move to rsync-backup or an SVN server.

    --
    Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
  104. BackBlaze? by cflannagan · · Score: 1

    I'd rather not pay yearly rent on a server or backup/storage solution.

    Well, I guess I cannot recommend BackBlaze then. I pay only $5 per PC monthly to have every file backed up, at a remote server.

    Very, very easy to set up. I installed it from their website, and that's it. Simplicity is key.

  105. Twibright Optar by Anonymous Coward · · Score: 0

    Has anyone considered twibright optar...data storage on paper! woohoo!

    http://ronja.twibright.com/optar/

  106. Baked clay tablets by wexsessa · · Score: 1

    The oldest known written story, The Epic of Gilgamesh, was written in cuneiform on clay tablets, about 8000 years ago. They were probably baked (or perhaps just kept dry). Some of these still exist and are as readable as when they were written. Some combination of current technology with modern ceramics might work well. Store them in a deep mountain cave in Utah.

    1. Re:Baked clay tablets by hearnz · · Score: 1

      I also have invented a process for creating a rock inside of a computer, one that all of the people in the world could artificially engrave in a tombstone-style text whatever they wish. If built, this rock would enable all people on Earth to store one paragraph or more worth of information that would be permanently stored on the computer. The information stored would outlive the person whom engraved the rock because the rock would be of a 0.8 micron process with 500,000 transistors in the space of a 486 Central Processing Unit. A 486 Central Processing Unit actually has over 800,000 transistors. My design would be more reliable than a 486. Some people may think that a 0.8 micron process is too slow - this is incorrect if it is a 1024 bit or higher processor, then it could do more in increased volume than a smaller processor. The processor would last many hundreds of years and this is why the space shuttle uses similar technology - where failure is not an option. The information engraved in the rock which is purple and blue and marble-like and is black in some areas where the operating system blocks out information that a person may chose to remove from the rock. The information people place on the rock is permanent. Data is stored in the style of something similar to a Nintendo video game cartridge which is Read Only Memory (ROM) and will almost certainly last many lifetimes before failure. The rock is rectangular and information within it could be searched through or zoomed in and out of viewing range. The rock would cost based on the price of data storage media. For instance: an 80 GigaByte hard disk can hold 80 billion characters of information - this would give every single person on Earth approximately 13 characters of information on the rock for about $50 worth of failure prone storage like a personal computer hard disk. The design intentions are to make the rock outlast 10's of lifetimes before repair, to be redundant in all ways and last for eternity. The rock is for love letters, poems, eulogies and anything at all. This rock is free and will remain free and will never cost monetary values to use the contents of it or place information on it. Light from the fiber optic inter-connects would be magnified and sent to to solar panels and then that energy would be used to power the system. It would be electrically efficient. This idea was invented by Shampoo.

    2. Re:Baked clay tablets by kisrael · · Score: 1

      Some of these still exist and are as readable as when they were written

      Actually one of the saddest things about sitting down to a nice modern translation of Gilgamesh is how many chapters are just plain missing, or otherwise distorted, with the translators having to make up bits as best they can.

      --
      SO YOU'RE GOING TO DIE: The Comic for Dealing with Death
  107. If you do go digital, it will take some work. by khasim · · Score: 1

    Stone tablets last a fairly long time.

    But if you really must go digital, then you're going to have to put in a little bit of work and money every 5 years or so.

    Just store a machine that can read the CURRENT media. Then, 5 years from now, get a controller for the old machine that can handle the NEW media. And transfer your files from the old stuff to the new stuff. And repeat the process every 5 years or so.

    Don't go with whatever the cool tech is. Always go with the mainstream stuff.

    I have a friend who recently copied over all her Zip Disks to USB sticks. Now she's set until that format is replaced.

    Of course, this only addresses the files themselves. Who knows if there will be anything that can read the data from them in 50 years? There are word processors from 20 years ago that created files that it is very difficult to find a reader for now.

    1. Re:If you do go digital, it will take some work. by sjames · · Score: 1

      That's exactly my point, the only solutions are active in nature. You simply have to take data archiving as a continuous process of storage management and data conversion.

    2. Re:If you do go digital, it will take some work. by icebraining · · Score: 1

      So don't use strange binary formats, just plain old ASCII and basic, raw, uncompressed image/audio formats (bmp & wav?). Making a decoder to simple formats like those is very simple, I'm sure most programmers can do that.

    3. Re:If you do go digital, it will take some work. by sjames · · Score: 1

      So don't use strange binary formats, just plain old ASCII and basic, raw, uncompressed image/audio formats (bmp & wav?). Making a decoder to simple formats like those is very simple, I'm sure most programmers can do that.

      That's a very important point. No matter how much the vendor swears otherwise, proprietary formats and the software that reads them goes missing all the time. Even seemingly safe formats can require translation. MAny early home computers filled out the ASCII chart with various different glyphs for the upper half. It's 'ASCII', and mostly readable, but some of it doesn't look right anymore.

  108. the right way by confused+one · · Score: 1

    Several people have tried; but, noone has done it right. So, here goes

    Etch the data into stone tablets. Choose a hard stone unaffected by water immersion; the harder the stone is to work, the better. Include a primer in multiple languages, which explains the content, data format and duplicates the first page or two. Make multiple copies of it all. Distribute in many locations worldwide. Every few years add more copies to distribution.

    That should give you a few thousand years to come up with a better method. If your lucky, a copy will be incorporated into a building, monument, or other structure. This will enhance the data's chances for survival.

  109. Can't stress the file types enough. by jackpot777 · · Score: 1

    Other people have touched on it, so I will too. If you have the material in some proprietary document format, even if that format is the most popular format on the planet right now, still convert it into multiple versions of other file types. Here's an example I had to deal with: ClarisWorks (.cwk) was the most popular computer document format before Microsoft Word introduced the .doc format, and I had a lot of docs in .cwk from my old G3 Mac. Apple stopped supporting it a year ago, so I had to save my older (AppleWorks 5 and before) .cwk documents, and resave them as an RTF file for Pages before transferring them to the new iMac. I now have them saved in PDF (that can be read in Preview, Photoshop, and a multitude of PDF reading programs), OpenDocument format (.odt with NeoOffice, Mac's version os OpenOffice), HTML, Word document, and that Rich Text file. The same could happen to .doc files in a decade or two. MS have decided to push the .docx format in newer versions of Word, so who knows when the older files will require some workaround to be accessed? Do the same with images (RAW, JPEG, PNG-24), sound files (mp3, AIFF, WAV), whatever it takes to ensure your media won't be unreadable.

    --
    Shiny. Let's be bad guys...
  110. several HDs by obarthelemy · · Score: 2, Insightful

    For 500 gigs.

    A couple of hard disks, stored in different locations (cities, not drawers), that you update+check once a month or a quarter.

    Burn DVDS of the really important stuff (pictures, documents) around once a month, and mail them to your parents/family.

    What NOT to do:

    - RAID is NOT a backup solution, it is a high-availability solution. Of all the problems bakcups need to adress (theft, destruction, viruses...), it solves very few.

    - don't keep your backups online and/or in the same spot: viruses, power surges, fires, theft... will destroy them

    - don't have only ONE backup: Murphy's law, if your live data disapears, the backup will turn bad also

    - don't forget to check that your backups are still good

    - don't delude yourself into thinking that any physical media in use today will still be easily readable more than 5 years from now. (except for the consumer type media: CDs, DVDs)

    --
    The Cloud - because you don't care if your apps and data are up in the air.
  111. Re:Quality DVDs, archival storage, repeated backup by genghisjahn · · Score: 1

    "If Windows, I dunno." NTBackup? http://en.wikipedia.org/wiki/NTBackup

    --
    Sorry about the mess.
  112. Re:Jesus saves, but Buddha makes incremental backu by n1hilist · · Score: 1

    True. I only backup the stuff that is very important to me, my /home is about 16GB or so, so that's an easy thing to manage, various other things, (like ISOs and stuff I keep handy) i have on multiple machines anyway, those can be re-created from the source media, so those aren't an issue. Media I don't backup since it's a) too big, b) too much effort to do so and c) I don't really care, because it can easily be acquired again.

    Personally I'm longing for the day when 64GB flash disks are at the price 4GB ones are currently, because having 3 of those (1 off site) would be a huge benefit to me.

  113. Nonsense... by berend+botje · · Score: 2, Insightful

    Sure, you can easily shoot hundreds of pictures a night. But, FFS, sort those suckers! Maybe five of those are good shots, and if you're lucky one of them is a great shot.

    Just toss the rest! Really! Nobody cares for the reams of out-of-focus or incorrect compositions.

    Keep the great shots (one in a hundred, if you're a good photographer) and delete te rest.

    1. Re:Nonsense... by Anonymous Coward · · Score: 0

      Mod disparent up.

      It's easy as hell to fill a drive with dumb 1's or whatever, any other worthless data, that's not the point. The moral is: if you can't fap to it, it's a waste of space.

    2. Re:Nonsense... by dangitman · · Score: 1

      I wouldn't recommend that for anything except totally out-of-focus or blank pictures. And if you're a decent photographer, you don't take many of those - perhaps 2% at most. Historians find even the most mundane poorly composed or grainy photos to be of great value. You never know what will be of value in the future. People will pay dearly love to get outtakes from the early careers of famous photographers. And things which seem poorly composed at the time may seem like genius in the future.

      Culling the worst of my photos would be an insignificant reduction in data storage. It makes a lot more sense to be equipped for large amounts of storage. Especially as file sizes are going up as resolutions and bit-depths increase, and RAW is becoming the preferred storage format.

      --
      ... and then they built the supercollider.
  114. Store images in jpeg2000 format by synthespian · · Score: 1

    Jpeg2000 supports lossless compression. Image won't degrade over time, as with normal jpegs, AFAIK.

    http://www.oreillynet.com/pub/a/javascript/2003/11/14/digphoto_ckbk.html

    http://en.wikipedia.org/wiki/Lossless_JPEG

    --
    Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
    1. Re:Store images in jpeg2000 format by Anonymous Coward · · Score: 0

      > Jpeg2000 supports lossless compression. Image won't degrade over time, as with normal jpegs, AFAIK.

      WTF? "Lossy" doesn't mean it degrades over time. Is this a joke???

    2. Re:Store images in jpeg2000 format by maxume · · Score: 1

      Jpegs don't degrade over time. If you go through repeated open->edit-&gtsave cycles, there is a degradation in the quality of the image, but if it is just stored as data, it acts like any other data (maybe 'over time' was just awkward phrasing...).

      --
      Nerd rage is the funniest rage.
  115. Re:Quality DVDs, archival storage, repeated backup by LVSlushdat · · Score: 1

    I have about 6gb of irreplaceable stuff, that often changes (development source, financial stuff, pix), and I use Karens Replicator http://www.karenware.com/powertools/powertools.aspto sync it nightly to my Windows2K3 server, then once a week it uploads an incremental to Amazon S3 via Jungledisk. The laptop has a copy of the 6gb of irreplaceable stuff, on a Truecrypt container on an 8gb SDcard. Everytime I use the laptop, I sync the SDcard via Truecrypt/Karens Replicator on the desktop machine... Works fantastic for me!!

    --
    THANK YOU, Edward Snowden!! Americans owe you a debt of gratitude (whether they know it or not..)
  116. Repeat Backup by Anonymous Coward · · Score: 0

    I myself just addressed this problem... It's for a different situation, but it works. I keep all my Data (Even old stuff I don't care about like family pictures from the past 5 years and other stuff) on my storage drive. My storage drive is one of the new 1.5TB seagate drives. I have a external enclosure connected via eSATA with another 1.5TB and I use Synctoy 2.0 to backup and echo the changes made on the internal storage drive every few days or week... and if one of them fails I just buy another and copy it over.

  117. Google Docs? by awarrenfells · · Score: 1

    I suppose it depends on the specific type of document or file you want to save, but I have been using Google Docs for years to save my important documents. Its fairly safe storage [I dont see google going down any time soon], and available from any PC.

  118. Worthwhile read from the Long Now Foundation by synthespian · · Score: 1

    Long-Term Digital Dilemma

    http://blog.longnow.org/2007/12/24/long-term-digital-dilemma/

    Apparently, Hollywood is resorting to cellulose.

    Hey, let's have those Super-8s reborn!

    http://en.wikipedia.org/wiki/Super_8_mm_film

    --
    Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
  119. ZFS of FreeBSD by RT+Alec · · Score: 1

    I am doing much the same thing, but with FreeBSD. For long term storage, how about a RAIDz with 2 parity drives, and use the largest SSD thumb drives you can find/afford. They ought to last a while, and up to two of them could fail with no data loss.

    1. Re:ZFS of FreeBSD by boner · · Score: 1

      Not in favour of SSD drives for long term storage. I use RAIDz2 too in my setup. FreeBSD/OpenSolaris/Solaris/MacOS X, all doesn't matter to me, as long as it is ZFS (or something comparable).

      The SSD drives are not immune to data corruption in the long term (older technologies actually fare better than newer high density SSDs), unless you 'respin' them once in a while.

  120. Re:Quality DVDs, archival storage, repeated backup by Anonymous Coward · · Score: 0

    If somebody is skilled enough to just think about writing scripts on linux, and knows python, would have to 'learn' simple bash scripting? GTFO.

  121. Re:Amazon S3 - until? by Ralph+Spoilsport · · Score: 2, Informative
    Amazon S3. dirt cheap, there forever.

    Yeah - sure - until Amazon goes out of business or gets bought and then the new owner dumps the service and you're S.O.L.

    RS

    --
    Shoes for Industry. Shoes for the Dead.
  122. Two hard drives. Why even ask? by whistlingtony · · Score: 1

    Go buy two 1Tb hard drives. ~$300 bucks total. Put your data on both of them.

    Wow, that was hard.

    Seriously, who lets this get on the front page?

    1. Re:Two hard drives. Why even ask? by Anonymous Coward · · Score: 0

      > Go buy two 1Tb hard drives. ~$300 bucks total. Put your data on both of them.
      > Wow, that was hard.
      > Seriously, who lets this get on the front page?

      Except you missed the part about "long-term stability - lasting for decades if possible."

      Good luck reading those 1Tb drives in 30 years, dumbass.

    2. Re:Two hard drives. Why even ask? by synthespian · · Score: 1

      It gets posted on the front page because it's a serious matter.

      You forgot to say he has to run checks periodically to check for drive "health" status.

      --
      Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
  123. Join the RIAA by Simonetta · · Score: 1

    Incorporate yourself as a 'record company', and then call all your videos and documents your product, your intellectual property. Join the RIAA and paste BIG warning labels on the opening screen/page of every file stating a 100-year prison sentence and 100 billion dollar fine for every instance of unauthorized copying of any of these files.

        Naturally the files will get copied and stored by thousands of people. The files will get stored on secret third-world sharers dedicated to illegally preserving and presenting the world's culture.

        Then in a few years from now, when you want a file, just search for it on BitTorrent or Kazaa!.

        The record companies don't realize it, but the internet has changed them from being musical recording distribution companies to being digital storage companies. The whole point of the P2P networks is not to pirate so-called intellectual property but instead to ensure that the digital files (into which this 'intellectual property' has been formatted) don't get destroyed by media damage or obsolescence.

        The P2P networks were a gift from the computer nerds to the recording industry; a little transition present to get them from the 20th to the 21st century. It's sad that they don't have a clue as to what to do with our gift to them.

  124. How to stop worrying and learn to love Delete by wikinerd · · Score: 1

    Just ask yourself what destruction is going to happen to your life if you lose the data. If the answer is "no problem", then hit Delete or just let the data to rot in an old RAID-1 setup until both disks crash and then just stop worrying.

    If, however, you have some data that you think are particularly important, you can rent a safe deposit box in a bank for about 30 EUR per year (for an A4-sized box) and put copies of your data in there, paying attention to choose a bank that is too big to let fail :)

  125. The only proven answer is paper - no joking by HighOrbit · · Score: 1

    For text, nothing beats non-acidic archival quality rag-paper or vellum. Make sure to get the right kind of non-acidic paper and ink. Print your stuff out. Either bind it or file it in an organized and indexed manner. It will truely last hundreds of years, will always be a "universal" read-write format, and will never be technically obsolete.

    This also means you will probably have to review it to seperate the wheat from the chaff. The paper and the space to store it will be limited. So you will have to cull out the junk and figure out what's worth keeping.

    1. Re:The only proven answer is paper - no joking by amclay · · Score: 0

      I usually enslave a bunch of Chinese kids and have them memorize the 0's and 1's. They last 40-70 years.

      --
      It's all fun and games till someone divides by 0. Then it's hilarious.
  126. Density is too low by berend+botje · · Score: 1

    You can't store much information on paper/steel tape. It might be more economical to use stainless steel punchcards.

  127. Re:Jesus saves, but Buddha makes incremental backu by berend+botje · · Score: 1

    It is called hard encryption. Learn it. Use it.

  128. Compare to PCX vs. PNG by tepples · · Score: 1

    You don't know that a jpeg, for example, will be readable in 30 years. The format may be so deprecated that there might not even be a viewer available.

    Not bloody likely. PCX is deprecated in favor of PNG, but PCX is implemented in too many Free programs to ever disappear. Likewise, JFIF, the file format defined by JPEG, won't die until C and C++ die because just about every program written in C or C++ that handles photos uses IJG's libjpeg. And given how long Fortran has been in use, it's not bloody likely that C++ will die any time soon either.

    Like my old Microsoft Works 4.0 documents

    I see where you're going with this: formats that are publicly defined only as "what some piece of proprietary software writes". Convert your documents to a format with a widely recognized public specification, such as HTML or ODF or PDF, and it will be much more likely that you can compile a Free app (even an ancient one) when you need to view the data.

    assuming that I can generate a virtualized environment that can support an old Windows (Windows XP probably won't even boot on any PC being produced 30 years from now).

    That's what an emulator is for. NES games won't boot on any console produced today, but Nintendo still sells them on Wii Shop Channel bundled with the "Virtual Console" emulator. (And there arrr other ways to run ROM dumps on a PC.) So wrap up your environment in a virtual disk image in a well-documented format, test it in a current Free virtual machine such as VirtualBox, and you'll probably be able to emulate it later.

  129. File formats by jbolden · · Score: 1

    For long term storage you not have to worry about media degradation but also file formats. So far being able to read old data has been complicated over periods like 20-30 years. For example I've moved from: Apple/Mac Write, Wordstar, Wordperfect, Amipro,
    Word, open office write over the last 25 years.

  130. This is actually a good idea by Simonetta · · Score: 5, Interesting

    This actually is a good idea. If the porn files were maybe one half sex imagery and one half encrypted private data, and there was no easy way to separate the two halves, then people would download, store, and upload the files in order to view the porn. Anyone who had data in the private section of the file could download it from various P2P sites.
        The cost of filming and creating the porn file would be covered by the people who would be using the file for long-term distributed storage. Say a 1.5Gigabyte file that was an hour of MP4 video entertainment and 500 megabytes of distributed storage. The fees received by the producers for the storage would pay for the video production costs. Since porn is cheap to produce, this may solve the problem of piracy and secure storage at the same time.

    1. Re:This is actually a good idea by jabithew · · Score: 1

      ...an hour of MP4 video entertainment...

      Now that's a euphemism.

      --
      All intents and purposes. Not intensive purposes.
    2. Re:This is actually a good idea by arashi+no+garou · · Score: 1

      As a bonus, your data will fall under Rule 34, which reinforces the meme.

    3. Re:This is actually a good idea by janderson · · Score: 1

      This isn't the type of thing I'd usually reply to, but it's marked as "Interesting" rather than "Funny"... I don't think he's serious: The price of the 0.5 GB of storage that would pay for the "cost of filming and creating the porn file" would have to be astronomical in today's GB/$. Let's say very conservatively that the production cost was $5000 -> who's going to pay $10000 per GB?!? Storage on Amazon S3 costs about $0.10 per GB, and for $10k, you could get yourself a nice RAID array with more spares than you'd go through in 10 years.

    4. Re:This is actually a good idea by Anonymous Coward · · Score: 0

      Until some genious transcodes it to rmvb and everyone decides that thats a much better format because the Prime Minister told them so!

    5. Re:This is actually a good idea by Anonymous Coward · · Score: 0

      what if the 1.5gb movie is compressed for easier p2p sharing? You'd still be able to watch the movie but you'd likely corrupt the imbibed data

    6. Re:This is actually a good idea by Walter+Carver · · Score: 1

      Once the sharers discover about this they will hack the file to remove the 500MB and make it download (and upload) faster. You can't do this in the open. But it would be neat!

  131. Value versus endurance by arachnoid · · Score: 1

    Having read the comments and having given this issue much thought over the years, I have to say that only useful ideas will be preserved, and no one has 500GB of useful ideas. My point is that the fate of one's personal data archive is ultimately in the hands of others, and they decide what's important.

    It could be worse. In the pre-technological era, apart from a small handful of writers, the closest thing to a persistent data archive was a gravestone encryption. On that basis, and with an appropriately skeptical view of the durability of storage media, choose a small handful of critical data items and engrave them onto a stone. This actually works -- there's a beach in Wrangell, Alaska that archives messages from a prehistoric native culture, engraved into stones. The messages make up with persistence what they lack in depth:

    http://www.wrangell.com/visitors/attractions/history/petroglyph/index.html

  132. Doesn't apply to optical media by Burz · · Score: 1

    ...since DVDs etc. can be retrieved with any high-res optical scanner. You just have to fetch the software to read it.

  133. Eggs and Baskets by countach · · Score: 1

    It seems to me if you bought half a dozen 500GB drives, then in 10 years time at least one (probably most) of them will still work. Get different brands, different sizes to guard against particular problems. Maybe even do the 4GB Flash thing as well, and get some good quality DVDs too. I think the key here is not to put all your eggs in one basket. This is an open research question, and we may be interested how your solutions go in 10 years time.

    Oh, if you want to be *really* sure, do the Mormon gold plates thing.

  134. Did anyone mention.... by digibud · · Score: 1

    Anyone wanting something to last for 10 years should include the player. Don't bother putting away a CD, DVD, tape OR hard drive. Put the entire computer into storage. It would be foolish not to take it out once a year and fire it up just to keep the oils and lubricants working. If it's important enough to store for 10 years then it's important enough to check once a year. If you put anything like a CD, DVD, or other item in storage for 10 years without a player you take a huge gamble but if you include the device it runs on you have a chance to view it. You may have a hard time retrieving it because there may be no USB or Firewire connectors (there will most certainly NOT be any such connectors). What that ultimately means is you CAN'T easily store electronic documents that long. You must convert them from device to device, from tape to floppy to zip drive to hard drive to holographic drive to biometric drive (I come from the future). A storage medium from one generation can't be read by the next generation so the question is about storage AND being able to access and convert the data - otherwise storage itself is meaningless.

  135. online rsync hosts by lawpoop · · Score: 1

    Just get a couple of online rsync hosts. Two, in case one goes down. When you get a new system, just sync the data back to it.

    Although, since you do want to store 500 GB, that could be a little expensive at this point. Maybe you can juggle external drives for now, and then do an online host when the price comes down.

    --
    Computers are useless. They can only give you answers.
    -- Pablo Picasso
  136. JPEG2000 by pikine · · Score: 1

    Besides, it's not clear to me that we're going to come up with much better compression methods for static images, or that we really need to bother coming up with much better compression methods for static images, which means it isn't that unlikely we'll still be using JPEG in 30 years.

    We already have. It's called JPEG 2000, using wavelet as opposed to DCT to compress the image. This gets rid of the blocking effect prevalently observed in highly compressed JPEG files. However, I don't see JPEG 2000 making JPEG obsolete for various reasons.

    • There is already a vast amount of image in JPEG format, and format conversion contributes non-trivial loss in image quality, so there is incentive to retrieve images in that format even if it becomes archaic.
    • Given the amount of storage we have nowadays, image size is peanuts, so there is less stress in developing highly efficient image compression algorithms. A 20 megapixel image sensor will produce a raw (uncompressed) file at about 40MB. A BD-R disc at 30GB can store 750 such images.
    • JPEG in high quality mode does not really exhibit blocking problem, so it's not like JPEG 2000 is addressing a critical flaw of the JPEG format.

    However, there is still much research to be done on inter-frame compression used for motion pictures. I think we had hit a limit in intra-frame compression, so the focus for highly efficient video codec has been on synthesizing interpolated frames from neighboring frames. At this point, I think video codecs, even H.264, still stands a high chance to become obsolete within the next decade.

    --
    I once had a signature.
    1. Re:JPEG2000 by nine-times · · Score: 1

      JPEG in high quality mode does not really exhibit blocking problem, so it's not like JPEG 2000 is addressing a critical flaw of the JPEG format.

      This is more or less what I was getting at when I said, "it's not clear to me... that we really need to bother coming up with much better compression methods for static images." If you use the highest possible quality of JPEG on most photographs, the loss in quality to compression isn't that big. If even that small a loss in quality is a deal-breaker for you, then you should use a lossless format. So the only real reason to worry about coming up with better compression (in my mind) is to be able to shrink file size. However, with ever-increasing bandwidth and storage capabilities, it doesn't seem to me that even the highest quality JPEGs are all that unwieldy.

      You're right that there's a greater likelihood that we'll want better video compression, but even there I'm not sure. How much more efficiency can we get (by which I mean, how much smaller can we get the video files for a given video quality)? And at what cost of processing power for encoding/decoding? Given our increasing bandwidth/storage capabilities, I'm not sure how much work it's worth putting into improving that efficiency. Like let's say you can shave another 2% off your filesize, requiring 10% more processing power to decode, and it'll cost a couple million dollars to develop and promote that new encoding. Is that worth it to anyone? Obviously I'm just pulling those numbers out of thin air, but getting people to adopt a new format is very hard when the existing format is sufficient for their needs.

  137. Why by coldtone · · Score: 1

    Why store so much data? Because you can?

    Storing countless photos and hours of video is a nice idea, but really what is the point? Are you going to review it all? If all you do is maintain data from the past what kind of a life is that?

    You should be very careful what you hold on to. It becomes a weight that you have to carry. Filter it down to the very best, and only keep that. You might find it easier to keep a smaller amount of data.

  138. Yes, but can you FIND it? by careysb · · Score: 1

    I have thousands of photographs that I've taken over the last ten years that I back up to multiple off site hard drives. I am fairly good at culling redundant or poor photos, but that's still a lot of photos. I have an immediate need to be able to search for specific photos in all that data. Currently I use Lightroom 2.0 for that. It works pretty well but I consider it "expert friendly". If I die and my daughter inherits all that data she will probably throw it all away if she doesn't have an *easy* way to find the hand full that may be important to her. Organization, cataloging, and search tools are critical.

  139. What about SSD? by StCredZero · · Score: 2, Insightful

    If NAND flash SSD lifetimes are determined by write frequency, then wouldn't this be fantastic for archival storage? Just write the data once, then read it as many times as you like.

  140. Files Forever by Anonymous Coward · · Score: 0

    Dreamhost has a system where you can store your files "forever". I think you need a Dreamhost account, though.

  141. Hard storage by twopoint718 · · Score: 1

    For truly long term storage you're going to have to leave the realm of computer tech and look at archival methods. Microfilm may have a shelf life of up to 500 years if stored correctly (according to Wikipedia). You could convert binary data to a visual representation with something like Optar (http://ronja.twibright.com/optar/), and then microfilm that. Optar has a data density of around 200kB/page and around 2,400 letter-sized pages can be stored on a spool of microfilm, leading to a storage density of roughly 450MB per spool. Not great, but this is data that you *really* don't want to lose.

  142. Teganography by fucket · · Score: 2, Funny

    Which, of course, refers to the process of getting the data (in binary form) tattooed on your body by tattoo artist Tegan Stadnyck of Seabrook, New Hampshire.

    1. Re:Teganography by tomhudson · · Score: 3, Funny

      Actually, its steganography, but I used typoganography to hide the "s" :-)

      That's my story, and I'm sticking to it ...

  143. Re:Suggest a better approach...keep your data mode by C_L_Lk · · Score: 1

    For me, the amount of time required to check back through and move over 10,000 or 20,000 documents "per generation" just isn't reasonable. I move forward email from platform to platform - but that's about it. I figure one device per "8-10"years put away means I shouldn't really have more than 8-10 devices I need to archive throughout my lifespan. I rarely ever fire up or use the devices - I started up the XT about 6 years ago and copied off it 500 or so things I wanted (it had an old 10 base 2 network card in it - I still had an old 10base2 - 10baseT repeater around) - I loaded a packet driver and used an old DOS based FTP to move the things off to my current server. I may fire it up again someday - I may never. It doesn't take up too much room in the office closet.

    I find that format shift isn't too bad - my documents that were on my XT from the mid/late 80s were Word Perfect, GIF's, Lotus 123, and a few other applications, which opened just fine in modern apps. I know everything on my "90s" era computer would open fine on any modern application. Again, "Commodore 64" era documents are likely the only thing that I couldn't easily work with now - of those I really only have maybe 200... and I doubt I'll ever actually *really* need anything 25-30 years old.

  144. Parity Data & DVDisaster by Venner · · Score: 1

    For my valuable data (documents, family pictures, etc) I keep
    (a) the originals on my hard drive
    (b) a second set on another, offsite computer
    (c) a third set on an external RAID-0 array, which I power up only when in use, and
    (d) a bi-annual DVD backup on quality media with lots of error correction

    I used to keep all sorts of parity data for the DVD media (QuickPar, etc), but recently discovered a program called DVDisaster which actually lets you augment the ISO/filesystem of the DVD itself with the redundant data. I typically archive 2Gb of files with 2 to 2.4Gb of error correction data per disk. I figure that should give me a hefty boost in reliability, especially if I follow good DVD storage practices.

    Obviously, also keep a copy of the DVDisaster program handy. It's open-source GPLv2 and multi-platform. I include it in each ISO.

    --
    A preposition is a terrible thing to end a sentence with.
    1. Re:Parity Data & DVDisaster by Venner · · Score: 1

      Incidentally, I used to only do occasional backups of my data until I had a major loss of some (thankfully) non-essential files. I healthy dose of paranoia and a modest investment of time & money should safeguard my data for a long time to come now.

      --
      A preposition is a terrible thing to end a sentence with.
  145. SSD by mebrahim · · Score: 1

    He talks about keeping many 4G Flash disks. Why not use few big SSDs instead? I think they should be as durable as Flash disks. Am I wrong?

  146. Off-site Location by maz2331 · · Score: 1

    You can increase your reliability by renting rack space in a data center and putting up your own "backup box" that essentially mirrors a "backup box" at your home/business location. Use something like DRBD (over a VPN connection) to keep the disks in sync, and secure the everloving hell out of the remote box.

    It's still not perfect, but does get you around the "house burns down" type of data loss, and you can still periodically replace the "local" and "remote" boxes/drives as time goes by.

    The downside is that colocation hosting isn't free, but every "9" you add to the reliability of your backups is going to multiply your cost in a non-linear fashion.

    You can have reliability or low cost - pick one.

  147. You guys remember the 256Gb sheet of paper? by synthespian · · Score: 1

    You guys remember this?

    http://arstechnica.com/news.ars/post/20061126-8288.html

    See also:

    "Can you get 256Gb on an A4 sheet of paper? No way!"

    http://www.techworld.com/storage/news/index.cfm?newsid=7432

    Overall, it was a scam. But the idea of somehow using a durable physical medium seems pretyy good, no?

    PS: OK, this doesn't solve the OPs original question.

    --
    Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
  148. Simple is Best. by Anonymous Coward · · Score: 0

    1) save it on a HD in yer house and do DVD backups.
    2) save it also at another location such as safety deposit box (dvd's and/or thumb drives)
    3) save it on google or wherever.
    4) transfer it to new media as it becomes availalable.

    simple and simple.

  149. Is a perfect backup worth infinity resources? by IsMyNameTaken · · Score: 1

    So far several of the strategies revolve around the premise that you still need to keep your data safe if your computer, in-house server, house and neighborhood all blow up. Now given this situation, one of two events has happened:
    A) World War III
    B) Our Alien Overlords are displeased

    Now assuming one of the above did occur, having backups, even for a super secrete important business file is going to be moot. The only secure way to store an archive or backup is to take it out of phase with reality such that it is no longer subjected to the dangers of time and space or other things out of phase with reality. This being said, there are some ways to increase the survivability of your data.

    First, keep at least one on site copy as far away from danger as possible. Dig a nice big hole in your back yard. Build an air/water tight lead lined box with several hard disks and perhaps a few flash drives and a stripped down computer capable of accessing the media and place the whole thing in said hole. Next run some piping underground from the box to your house or business and through the basement wall. Make sure to have some string already through the pipe to do wire runs or just do the Ethernet and power cables as you go. If the cable is less than the maximum transmission length (300 feet I think) then you are all set. If you really want your data to survive your house being bombed then multiple boxes could be used to extend the distance away from your house along with some form of wireless communication (very low power directed signal to prevent hacking or discovery) to prevent any of the pressure waves resulting from the explosion from destroying your data at the other end of the pipe.

    This plan does have a few drawbacks as you are basically hiding in plain site. First digging the hole and pipe trench can be a major pain as would retrieval every few years to replace parts. Second, there is no fast way for retrieval if that data or power connection is damaged. Finally, heat dissipation could be problematic depending on the design of the case. The ground temperature usually stays fairly constant (if you go down far enough) so testing during spring/summer won't mean things will freeze during winter.

    Off-site backup has already been explored to death with the main conclusion being that it is necessary. Obviously there is only so much you can do to protect your data especially with regards to cost. You really just have to decide how important your data is, how much it would cost you if it was lost and if that amount of money can build a decent backup strategy.

    --
    while(1){sig.get()}
  150. Stupid question by Anonymous Coward · · Score: 0

    How come these stupid questions make the fron page? Slow day? Of course, start your own frickin' server! and learn to care for it! Problem solved!

    PS LTO-3 tapes hold 400 GB uncompressed. Since you are asking stupid questions on slashdot, they will be all you need. If you find them expensive or something, then your data doesn't deserve to survive. Ding! Next stupidity!

  151. For Documents by failedlogic · · Score: 1

    For some of my important documents, I've been writing them up in TeX. The format has been around forever. It tends to format everything properly with new releases*. And since you have the 'source code' not only to the software but to your document, you should be able to scan your source code from paper format should your digital copy go bad and still have close enough to the original in 10 years with maybe some slight changes in formatting.

    * Compare this, at least to multiple document layout and file format changes with MS Office (which 90% of the PC world uses). I tried restoring a Word 1997 document in Word 2003 and the formatting was really buggered up. I didn't wish to repeat that.

  152. Documented binary formats should be OK by IvyKing · · Score: 1

    Open and well documented binary formats should be OK, e.g. there are so frigging many devices that read or write to the JPEG format that devices that read the format should be available for decades. As an example, up to about 1980, most consumer grade record changers could handle 78 RPM disks, even though the format was obsolete in 1950. Similarly, it is still possible to purchase turntables that will handle 7" 45 RPM records - a format that was introduced in the late 1940's and passe about 1980.

  153. Analog by magisterx · · Score: 1

    If you want true long term storage, simple paper is probably the way to go. Invest the time and money to print it, and organize it, and (especially if you use special acid free paper) it can easily last for centuries and you never have to worry about either file format or medium becoming obsolete.

    My other suggestion, is to accept that it will become unreadable after a certain amount of time, and simply make at least two copies, stored separately, and every few years (5 is probably a good number), take the time to move it to whatever medium is sota for archiving at that time. You can't forget for decades at a time, but you can forget it for years at a time and the copying should only take a couple of hours. When you change mediums you can also look at changing file formats as needed.

  154. Social Security has to solve this exact problem... by grikdog · · Score: 1

    Simple. Massive redundant storage under Cheyenne Mountain, plus blue-green lasers and a simple COBOL program which writes 1's and 0's onto the Moon.

    --
    ``Tension, apprehension & dissension have begun!'' - Duffy Wyg&, in Alfred Bester's _The Demolished Man_
  155. Trailing edge technology by Anonymous Coward · · Score: 0

    Archival quality paper. You can keep it in an unheated storage facility, and you won't need to find obsolete equipment to read it when the time comes.

  156. here we go again (again) by Eil · · Score: 1

    Seems like every three months or so, this question pops up on Ask Slashdot:

    "Dear Slashdot, please tell me how I can make redundant archives of all of my important data for the indefinite future. Oh and just one other thing: I don't want to pay anything for it nor can I be arsed to put any of my own precious time into it."

    Yup, sounds like middle management material to me!

    The real answer to the OP's question is that no current consumer-level media is suitable for archival purposes on its own. To really archive something, you need high-quality media that won't degrade easily, a way to read it in the future, and a cool, dark, dry vault to store it in. All of this is (not surprisingly) outside the financial feasibility of most businesses and consumers.

    The cheapest, easiest, and most reliable solution is to throw together a network file server with redundant disks and backups. Monitor it continuously for failure and upgrade it every couple of years. If the data is super important, perform periodic audits on it. The advantages to this method are:

    1) You have immediate access to your data at all times. Need an old-ass file? Just copy it over the network.

    2) You have backups. You can even set up an incremental system if you want, so that you can get that file as it was on Monday that you zapped on Tuesday.

    3) You have redundancy. Build the server such that if it one part fails (especially a disk), the system as a whole does not suffer.

    4) It scales up and down very gracefully. You can get by with one of those cheap two-disk consumer-level NAS boxes or employ a rack of CORAID boxes, depending on the size of your data set, bandwidth requirements, and how "safe" you want to be from hardware failure.

    5) You're actively in control of your data, whereas if you hire it out to a third party, you really have no absolute guarantee that your data is safe.

    6) If the system is properly monitored, you are notified when something's going wrong whereas if you write some data to media and stash it on the shelf, you have zero indication if the disk loses its airtight seal, the flash chip is degrading, or the aluminum is slowly delaminating from the plastic disc of the DVD.

    If this seems like too much work, then your data isn't important enough to protect in the long term.

  157. Peer-to-peer by Anonymous Coward · · Score: 0

    p2p neworks might be a really nice solution considering their resilience, although I don't know any that is oriented towards personal data storage. It would take more than half of the internet to go crashing in flames for you to lose your data, and if that happends, you will probably have bigger worries.

  158. Solid State drives (SSD)?? by valnar · · Score: 1

    What about these new 30,60,120,250GB type solid state drives? Yah, they aren't that cheap yet, but in a year or two it seems like they will be. No moving parts like hard drives. Most of the reviews state their longevity over 10 years, but that accounts for daily use and *WRITING* to them. If it were a read-only archive with minimal writing, I wonder how much longer they would be slated to last?

  159. Old school solution - archival photo book by Biologist · · Score: 1

    This is not a perfect solution, but one I have considered... Photo printing services like QOOP (works with Flickr) now print photo books (up to 600 pages). One could (1) Back up to Flickr (does have an annual fee, but then has unlimited back-up) (2) print all pictures to a photo book (~ $13 for 20 pages, then 40 cents per page, 20 small photos per page). Not cheap, but there's nothing like paper...

  160. Mozy Home by Anonymous Coward · · Score: 0

    Seriously, if you really want your data, get mozy home. It requires you keep a copy on your computer (windows only), but it costs $5 a month for pretty much unlimited storage.

    Compare that to a) buying equipment, and b) maintaining equipment c) worry, etc. $5 a month is pretty cheap!

    Also, you can (optionally) encrypt your data with your own key using their software . Just don't lose your key!

    1. Re:Mozy Home by decavolt · · Score: 1

      Mozy is great... until you get into large amounts of data. Uploading > 500 gig to Mozy will take a freaking month just for one cycle so you can forget about recurring and timely backups.
      Same for Amazon S3 or any other over-the-net remote storage.

    2. Re:Mozy Home by Anonymous Coward · · Score: 0

      When you update your data on Mozy, it only updates things that have changed. With any network based storage solution you will have this problem. So long as you have a good connection, again I say its the best way to go!

    3. Re:Mozy Home by decavolt · · Score: 1

      For most users, this is all well and good. For anyone with large amounts of data it's pointless.

      Only updates files that have changed? OK. In the last 24 hours, the file weight of items I've changed is over 650mb. And it's the weekend, I'm barely working.

      I've tried Mozy and Jungle Disk. Even after waiting for a decade for the 1st load to get onto the server, my nightly backups still aren't finished by the time I start work again in the morning. That's no good, which is why I say that remote over-the-net storage for large amounts of data is not practical.

    4. Re:Mozy Home by Anonymous Coward · · Score: 0

      So this guy is going to be updating his SD cards that he has in safe deposit boxes? You are preaching at the wrong church.

  161. You're going to need a laser printer by Anonymous Coward · · Score: 0

    a crapton of paper, and a couple of fireproof safes

  162. Even on slashdot... by Pinchiukas · · Score: 1

    ...people need to learn to use the fscking search function - http://ask.slashdot.org/article.pl?sid=08/08/27/2119252

  163. data decay by maquah · · Score: 1

    The more 'compact' a data storage format, the more likely that just a little bit of 'aging' will degrade it beyond intelligibility.

    If it's REALLY important, rewrite it so that it's interesting to more than 'just you,' and publish it on archival-quality paper. If there are several thousand copies around, the odds are that a few will survive for a couple of centuries. Decay that would render any digital media format unreadable, just makes archival paper smell a bit musty.

    Beyond the 'clay tablets' mentioned above, the most enduring long-term storage I've found is 'oral tradition' in a cultural context that values both integrity and history. Assuming that whatever you want to save long-term is interesting and important to anyone besides you, tell it in a memorable format to your grandchildren's generation.

    A couple of decades ago, I worked on a history project involving both oral tradition and archival material: the elders' memories of events retold through the generations were, in some instances, MORE accurate than archival documents. But, of course, most of the younger generations in that community are paying more attention to TV and the Internet than to their elders.

    My dad was a maniac about data preservation: made multiple copies of everything, indexed it meticulously, etc., etc. My sister threw it all away after he died.

  164. Only 500GB? by Anonymous Coward · · Score: 0

    I'm sorry but this man should set up mirroring or a hotswappable raid array. I've had raid 5 for many many years and have not been disappointed, the key is to replace drives before they fail and keep the drives cool. I can't see how he can complain given the cost of hard drives at the moment. If data is really that important to you, buying 3 600GB+ drives should do more then the trick for less then $300. If that's "too expensive", I think he's not really serious about backing up his data IMHO.

  165. Constant migration by PatMcGee · · Score: 1

    I keep multiple copies of the same data.

    My data is smaller, around 15-20 GB. I think I have 6 copies right now. I constantly use Unison to synchronize them and to compare hashes every month or so. Three copies are in the computers I use all the time; another two are on hard drives that I keep in a media fire safe except for their weekly update. Every time I get a new computer, I just make another copy. Every time a drive fails, I just throw it away.

    The last time I lost any data was when the firewire controller on my B&W G3 started intermittently introducing errors of about 10 bytes per megabyte of copying. Comparing hashes showed me there was a problem, but it took a while to sort out which versions of the files were OK and which were corrupted.

  166. ZFS storage by Anonymous Coward · · Score: 1, Interesting

    I have a system using FreeBSD servers running ZFS as the file system. The servers are built with around 6 disks each in a ZFS RaidZ2 pool. This allows each system to have upto 2 disks fail without losing any data. Data stored on ZFS is also internally checksumed which means that the files will stay 100% pure and not suffer from bit rot. I have 2 physical machines in different locations which are synced using rsync. Each machine runs smartd to run SMART checks on the hard drives weekly and they also run a zfs "scrub" weekly to verify that all of the data is in place and is correctly checksum verified.

    For critical files (wedding photos etc.) I use par2 software to create a further checksum and generate additional redundancy. The par2 software allows corruption to be detected and corrected. This was more of an issue before I was running ZFS however it's still better to be safe then sorry.

    Daily snapshots are taken of the data to protect against accidental deletes. Critical storage is moved to read-only zfs volumes.

    The system has a few flaws, the first is that it doesn't protect against malicious damage. It would be possible for someone with admin access or an axe to "zfs destroy" my files. I have some critical fil2es dumped to DVD however I should probably have a third redundant "offline" site where all of the files are backed upto manually. Perhaps just a collection of removable drives.

    I am also concerned about data file compatability. I currently store my digital camera files as RAW camera files. Critical images I also store as Adobe DNG files and as JPG's. One of these formats should be hopefully supported by Photoshop CS428 in the year 2100.

    The system can periodically have the hard drives, pc hardware and software upgraded or replaced without disturbing the data so I can see no reason why the system can't keep the data valid for many many years.

    One concern is that after I die and stop maintaining the system it may stop working. I guess it won't be my problem.

  167. Re:Social Security has to solve this exact problem by mfnickster · · Score: 1

    > ...plus blue-green lasers and a simple COBOL program which writes 1's and 0's onto the Moon.

    Looks like all they've managed to write so far is: 010000110100100001000001

    --
    "Slow down, Cowboy! It has been 3 years, 7 months and 26 days since you last successfully posted a comment."
  168. "How about having it printed on acid-free paper?" by Onymous+Coward · · Score: 1

    Hm.

    Printed at a modest 750 DPI, with a half-inch border... ((7.5*10)in * (750 DPI)) =~ 4.22 M dots.

    5.27 Mebibytes / sheet, conservatively single-sided

    2.52 Gibibytes per ream (500 sheets)

    If you move up to 1200 DPI, and if your toner or ink won't cling between sheets you can use both sides: 12.9 Gpr.

    Printer drum will last 20,000 normal pages, but let's say 1,000 50%-full (i.e., data) pages. Drums (or printers themselves) are around $100. Thus $50 / ream.

    Cartridge will last 9,000 normal pages, and say 450 data pages. Cartridges perhaps around $75. Thus $80 / ream.

    Acid-free paper 500 sheets (20 lbs.) ... USD$5.

    So...

    About ten bucks a gig for simple storage that'll last 500 to 1,000 years.

    Good idea.

  169. this might sound daft but by Anonymous Coward · · Score: 1, Insightful

    why not write an application to encode your data as coloured dots and print it on archival grade paper with archival grade ink? then use another application in the future to read back the scanned page and convert the data back (I guess scanners or something like will still be around in 2250), you should be able to get 4mb to a page with some trickery (2048x2048 pixels), thats 2 gig per ream, 100 reams = 200 gig, humanity has developed the art of storing paper information to a fine art, it should last far longer than a lifetime with care

  170. Re:Quality DVDs, archival storage, repeated backup by decavolt · · Score: 1

    Granted, it's not all personal per se, but I have well over 500 gig of data that needs to be backed up and archived. I'm a freelance graphic designer, my client files end up getting massive. 300 to 600 dpi images, multi-layer PSD's in multiple stages of revision... It's massive. I only retain client data on a "live" disk for a year, then it goes into storage. After 3 years, some if not most of that data gets purged. 500 gig really isn't that much when you consider files types other than text data.

  171. Redundancy by stevied · · Score: 1

    Redundancy (both of media and of types of media), checksums, regular inspections and regular transfer of data.

    Nothing's going to last for ever, and even if it does, as other people have pointed out, the file formats may not be easily readable. Spread your stuff out over, say, a few hard drives, flash drives, archival quality DVDs. Store some stuff off-site, in case of house fires. Make sure you have redundancy. Go back every year or so and check everything is still readable, both the media and the actual file formats. Burn new DVDs, check the SMART data on the hard drives, etc., etc.

    As time goes on, increasing storage densities should allow you to condense things, assuming you're not adding data at a faster rate than the technology is progressing. As file formats become out of date, convert stuff. You may even find that as time progresses, you realise you don't need to keep all that stuff after all :-)

  172. My Tomb by Anonymous Coward · · Score: 0

    This is unacceptable! I want my glory to shine in the far future, when my elaborate tomb is found by future scientists. I want my email and pr0n to be fully accessible! They will obviously remmeber my greatness, but they must know the full details. For instance I save the html of every first post I've ever made!

    My data must survive the eons, despite flood, fire, nuclear war, the sun going nova. Nothing else is acceptable, after all these great historians who have devoted their lives to my greatness undoubtabily travelled far at great expense. Hyperdrives cost money, hell they probably had to re-evolve back into physical beings!

  173. what I do by Mike_ya · · Score: 1

    I purchased 2 small form factor USB external hard drives. I encrypted both with truecrypt. Since I am a Windows user I use MS's SyncToy to 'backup' the stuff I want to them. Synctoy makes it easy for me. One hard drive sits at work plugged into my PC, one at a bank in a safe deposit box. If my home computer dies and I need access to my data, I VPN into work and mount the encrypted drive or go to the bank and pick the other one up. I have a routine where on a set schedule I take the one from work to the bank and swap it out with the one in the safe deposit box. I take that one, synch up my data and then take it to work.

    If one of the hard drive dies I still have the data at home plus on the other external hard drive. If one of the drives dies after the 5 year warranty period ends then I will buy another drive. If USB starts to become obsolete I would save some money up and buy 2 more drives that uses whatever technology we will be using then.

    You don't know what technology we will be using to access data 10 or 20 years from now. Use what you think will be around and change later if that turns out not to be the case.

  174. 2 external FW drives - one local, one remote by decavolt · · Score: 1

    Two 1TB hard drives thrown into Firewire enclosures. One stays in a fireproof disk safe at my office, the second goes into a safety deposit box across town. I flip-flop the drives every month, both of which are a mirror of a "live" disk running on my primary workstation.

    I'm a freelance graphic designer, my client files end up getting massive. 300 to 600 to 1200 dpi images, multi-layer PSD's in multiple stages of revision, complete bundles of print-ready pieces, digital art with file sized upwards of 300mb each...
    I only retain client data on a "live" disk for a year, then it gets archived. After 3 years, some if not most of that data gets purged. 500 gig really isn't that much when you consider files types other than text data.

    For any large amount of data, S3 or Mozy or Gmail will NOT work. It's not practical to spend a month to upload just one interation of your data to these services.

    Storing critical or even mildly important data only in your house or office is just stupid. Fire, theft, flood, earthquake... any calamity will eat your data and there's nothing you can do. Planning to grab your hard drive on the way out of the house in the event of a fire is pretty short-sighted and unrealistic. If you're fighting to breathe or save your kids from burning to death, you are NOT going to waste time yanknig hard drives out of your computer. Further, if you're not home (which is more likely) your brilliant plan is useless.

    High capacity hard drives are cheap. External enclosures are cheap. Don't rely on loading one up and forgetting about it for 5 years because it's likely to fail. Someone above said to keep your data "hot", and you should. Get into a routine and cycle your drives monthly, semi-annually or annually.

  175. Photographs.. by the_rajah · · Score: 1

    I have family photographs that are over 150 years old and quite legible. They're daguerreotypes of one of my 3rd great grandfathers and his grandfather, who was born in 1771. The photographs were taken in 1853 give or take a year. That's pretty good archivability.

    --


    "Do the Right Thing. It will gratify some people and astound the rest." - Mark Twain
    1. Re:Photographs.. by Raffaello · · Score: 1

      I recommend soot black, resin, and beeswax ink on papyrus for truly archival purposes. Should get you several millenia quite easily if stored in a cool, dry location.

    2. Re:Photographs.. by synthespian · · Score: 1

      No kiddin' !

      --
      Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
  176. Nobody cares... by mok000 · · Score: 1

    Chances are, that after you are dead, nobody will care about your gigabytes of data, your holiday photos or your emails to other unknown citizens.

    Unless you are exceptionally famous, forget about it. Nobody will care.

    Forget about your backups. Get a life.

  177. Offsite backup by Anonymous Coward · · Score: 0

    If it isn't off site, it isn't truly protected, flood, theft, or fire can put an end to your precious data if it is kept locally. The method I employed is to take a second hard drive and use backup/encryption/ftp software to transfer it to the second drive for my initial backup. Then I gave that drive to my brother to put in his computer and he did the same and gave me his.

    The software performs incremental backups over the wire in a completely automated fashion.

    I've tried some of the online systems such as carbonite, etc and the features and volume just aren't there yet to keep up with my video files. 400gb data. (The kicker for me is that most of these files are static so and doing the initial backup locally made this method so much better).

    One very important note is to keep your encryption key backed up somewhere remotely otherwise you won't be able to recover your data when your system dies. I sent mine to myself in an email.

  178. Use a laptop with SSD and OSS by guggs · · Score: 1

    I've been think about this problem as a 30 year archival system and a laptop with dual SSDs running open source software is probably the best way to go. If a drive or laptop fails, the data can still be recovered. If all existing formats goes the way of the dodo, the information can still be retrieved from the laptop. It's not enough to store the bits, you have to make sense of them in the future. -guggs

  179. Media, format, migration by Lally+Singh · · Score: 1

    Some thoughts on this issue:

    1. I wouldn't trust technologies that haven't been used for a decade or two yet, so people know what makes them last & fail. See here:
    http://findarticles.com/p/articles/mi_m0BRZ/is_8_23/ai_109665179/pg_1?tag=artBody;col1

    2. File formats are a more dangerous issue than the physical media. Text-based formats (including XML) is something someone else could write a reader for later. That's a lot easier than trying to find both the software and the machine it ran on. Virtualization helps -- it may be worthwhile to keep a VMWare image with your stuff so that you have a copy of the software required. Emulators exist for most any obsolete system I can think of, and I'm pretty sure you're going to find x86 emulators for span of your grandkid's lifetimes. If paranoid, use an open-source emulation platform instead of vmware, and include the source of the emulator.

    3. Archival formats are another problem. I love zfs dumps for their current usability, but I know I can't trust something that system-specific to last terribly long.

    So, here's a procedure for you. Note that this is for important data. Don't interpret this as a method for keeping your Hentai collection. Of course, each has their own priorities in this area...

    1. Choose something you know will be around "for a while." Easily available, good track record, easy to use. Don't worry about extreme-long term. Step 3 addresses that. More important are the short-term technological concerns: Is it simple & reliable enough that you _will_ back up reliably with it?

    2. Keep a Master Image of what you want to back up. For example, a root directory with a per-year, per-project, or per-customer set of subdirectories. When you add stuff to your backup, you're appending to the master image. As storage (capacity & demand) is lightly exponential, you can usually afford to store all your old backups with the new (long term), as the old stuff is relatively small.

        Note that I'm *not* saying copy this stuff every time you back up. The master image is append-only. You'll probably want more than 1 copy of the whole thing, but that can be two stacks of DVDs, two boxes of tapes, two RAID arrays, whatever.

    3. Plan to migrate this growing master image to new formats. The entire master image moves to the new format. When you migrate, all your backup data is in the new format, and your old-format backups can be stored as a hedge against your new media having surprising failures later.

    4. Treat this stuff *seriously*. Don't skimp or cheap-out. Spend a month or two every couple of years setting this stuff up correctly, and make it something you can rely on. A little organization and automation once makes worthwhile. Cron et al for Unix, and a good commercial backup app for everyone else.

    5. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES. TEST YOUR RESTORES.

    --
    Care about electronic freedom? Consider donating to the EFF!
  180. Stone. by feanor981 · · Score: 1

    Definitely stone.
    The only media demonstrated surviving more than 2k years... and counting.

    I admit it's not very portable, but hey, here we're talking about durability.

  181. Re:Quality DVDs, archival storage, repeated backup by Lally+Singh · · Score: 1

    You're going to have to tier your data into different (cost of archive, cost of loss) groups, and set up procedures for that.

    I generally have two tiers, one for professional/academic concerns, another for large media (video, not music).

    The latter's on a spare USB drive, the former's on a few drives and an off-site backup service. Not expensive (~$30/mon) for my data needs.

    --
    Care about electronic freedom? Consider donating to the EFF!
  182. Re:Hard drives fail, but rarely at the same time.. by Reziac · · Score: 1

    As a hedge against physical disasters, I'd also do incremental backups on some far-distant system -- get yourself some cheap hosting (I use 1&1, 120GB for $5/mo.) and dump copies there. It won't necessarily be any more reliable than your own system, but it does provide a remote backup not subject to local disasters.

    As the tagline goes.. The four California seasons: Fire, Flood, Riot, and Earthquake. :)

    --
    ~REZ~ #43301. Who'd fake being me anyway?
  183. It's the data stupid by democrates · · Score: 1

    Dude, storage tech is in constant flux so your mission is to keep copying your data onto current storage media using redundancy to protect against corruption.

    You're constantly adding ever greater volumes to your collection anyway, so that works out cheaper and more convenient in the long term.

    Even if some storage media last decades, the devices that read them will be a pain to maintain in years to come and may not be accessible on future computers, I'm keeping an old pc just to use my 10yo scanner for pete's sake, even Nasa can't access some of their own Apollo mission data.

    Now, in 2038 will you be able to open a Microsoft Publisher file with any program? Page and text format for example is information, lossless retrieval must represent the original, open formats are your only hope.

  184. mulitple copies on multiple harddrives....... by Anonymous Coward · · Score: 0

    Copy it to several different folders on the same harddrive.

    Copy that harddrive to another.

    Put them in separate physical locations.

    At least one of the drives will work 5 years from now. At least one of the directories will be readable.

    Copy to latest drive technology at that time.

    Repeat until some "Permanent" storage comes along. Even then use two of them. :)

  185. Re:Quality DVDs, archival storage, repeated backup by Anonymous Coward · · Score: 0

    Why on earth use python to copy files?

  186. synctoy 2.0 by Anonymous Coward · · Score: 0

    Well I use a combination of things

    Firstly - I use synctoy to sync all my important data between the three live computers with big enought Hard Drives in the house.
    One is a linux server running raid 5 and 2 are windows xp machines. When ever my dad buys a new PC - I chip enough for an extra drive and I use an external drive to update that info every 6 months.
    for music I keep copies backed up on friends machines and for photos I keep backed up google with paid storage. for documents I keep backed up on my EEE PC which travels with me and on USB key. not perfect means I wont lose much besides video.
    BTW - does anyone know a Linux equivalent to synctoy (i mean in terms of user friendliness ?)

    Ta
    Phill

  187. The old standby... by Anonymous Coward · · Score: 0

    tar -zcf brittany-sex-tape.mpg /home, then upload on P2P.

  188. Ask Ted Stevens: Long-Term Personal Data Storage by dangitman · · Score: 1

    Dear Sen. Stevens,

    I was recently sent an internet. I'd like to keep this internet safe for a decade or more. Being a technology expert, I thought you'd be the best person to ask. What storage solution do I need; a dump-truck or a semi-trailer?

    Warm regards,

    S. Palin

    --
    ... and then they built the supercollider.
  189. 500GB - easy to fill with home video... by patniemeyer · · Score: 1

    Every hour of video I take on my Sony HD camera consumes 8GB. You can easily eat up a few hundred gigs a year if you have a kid :)

    Fortunately two things are working in our favor - HD video codecs are pushing the file size *down* as the storage price drop exponentially. e.g. a raw DV stream used to be 13GB / hour... worse than the 8GB for an HD stream today.

    You can also now get a 1TB disk for $100.

    What I'm really waiting for is for 1TB of SSD in a raid-1 configuration for a few hundred dollars... I really don't want to lose all of our photos and home video due to a lame raid.

    Oh, and beware of most cheap raid solutions out there right now. If they don't do disk scrubbing (parity checks or checksums like ZFS) then you really don't know if your data is safe.

    Pat

    1. Re:500GB - easy to fill with home video... by paganizer · · Score: 1

      I've never had any problems (or seen any real problems) with software RAID 5, all the way back to NT; I've read & seen load of problems with hardware raid.
      Kinda makes me wonder why anyone bothers. sure, it's a little more CPU intensive. so?

      --
      Why, yes, I AM a Pagan Libertarian.
    2. Re:500GB - easy to fill with home video... by patniemeyer · · Score: 1

      I've had good luck for years with my RAID 5 setup, but it's just too complicated and too proprietary... It has too many moving parts (literally). Obviously if cost allows we'd rather have a RAID-1 (simply mirror) ZFS pool with a couple of SSDs... And if they are really cheap you could rotate one out as an off-site backup. I think this setup would also allow you to easily upgrade storage size by dropping in larger drives.

      Of course you can do the above with regular disks... just stating my ideal situation.

      Of course I don't know anything about the long term storage of flash /SSD tech, but presumably you'd just keep this chain going continually and maybe throw the old disks in a box somewhere for extra protection.

  190. Re:codec lifespans... by patniemeyer · · Score: 1

    Imagine what it'll be like in 20 years-- anything other than NTSC, PAL, or SECAM will be effectively extinct.

    Well, there aren't *that* many standards that are really in consumers hands. I suspect that in 20 years we'll still have VLC or its successor and that developers will dutifully create or recreate codecs for virtually every video format that has ever existed.

    Look at people digging through ROMs of old video games and creating emulators... There is a bigger market for recovering old video, I suspect.

    Pat

  191. 500Gb that's a lot of "personal" stuff by Anonymous Coward · · Score: 0

    Wow, you have 500Gb of porn?

  192. longest term storage by dragunkat · · Score: 1

    Your best hope for storage that's going to be more than 20-30 years is optical storage, such as DVD or Blu-ray. most flash drives are only rated to last 10 years.

  193. Duplicate by plnix0 · · Score: 1
    Use RAID. Burn multiple copies to DVDs. Keep duplicate copies of everything, preferably in different types of storage media.

    You are fortunate in not losing data when your flash drive went through the washer/dryer -- mine died the first time.

  194. re: something new by Anonymous Coward · · Score: 0

    perhaps the best solution to this:
    dude... life is short, nothing is that important. stop worrying about nostalgia, go do something new...

  195. Short answer by carvalhao · · Score: 1

    If you have some time to throw at it... Create a virtual machine with the operating system and all the required software to read/edit your data AND your data. Convert this VM to newer versions of the VM software as it arises. Keep the VM duplicated in two geologically different sites.

    Just my 2 cents (of an â) :)

  196. miniDV by lothlorien · · Score: 1

    After a few years of trying to find the solution to this problem, I looked at my miniDV camera... I was happy to discover that people thought of it before (http://tech.slashdot.org/article.pl?sid=03/03/03/0246245&mode=nested&tid=126&ti%20d=137&tid=106). Same as tape backup, but no need to purchase more (expensive) hardware, if you happen to use a miniDV camcorder. Even if you don't, those are much cheaper than "real" tape drives.

    --
    /wrld
  197. Re:codec lifespans... by Anonymous Coward · · Score: 0

    > Well, there aren't *that* many standards that are really in consumers hands.

    True, but it's not just the consumer formats; the biggest problem right now is proprietary solutions that are deployed over the Web. The number of codecs out there keeps multiplying. People download these videos and players today, and then tomorrow it changes.

    > I suspect that in 20 years we'll still have VLC or its successor and that developers will dutifully create or recreate codecs for virtually every video format that has ever existed.

    I think we'll still have something like VLC, but it's absurd to expect it to support every codec ever devised. There are already dozens in existence, and many of them are patented and/or proprietary. I expect most of them to just vanish into the mists of time.

  198. Venti: archival storage with off-site backups by anothy · · Score: 1

    I use venti, an archival storage system originally develloped for Plan 9 from Bell Labs and now available on a host of other systems via Plan 9 from User Space. Venti is strictly archival: it stores blocks permanently. This storage is organized into "arenas", or pools of a fixed size. When one arena fills up, it is sealed, never written to again, and the system starts dumping bits into the next one.

    Primary storage for my venti system is a pair of mirrored SATA disks. Yes, magnetic disks can fail, but with mirroring they're still cheap enough to almost certainly be your first line of defense. When an arena fills up, I burn it to CD (by default, they're 512MB each) and mail that to a friend three time zones away. If my house burns down, I can recreate everything up through the last arena by basically dd'ing the contents of those CDs to a new disk.

    Using a real archival system has other neat benefits, too. You don't have to worry about whether you saved the right version of something, or how to organize different versions over time; it's all automatic. I've used this for "work stuff" for a long time with very good results; after my last laptop hard drive crash, I've started using it for personal stuff (although I haven't made that quite as automatic yet). I can now "cat /n/dump/2008/0712/usr/a/src/cmd/ngcscatgen.c" to see the version of that file as I was working on it over the summer. Pretty nice.

    --

    i speak for myself and those who like what i say.
  199. Copy forever by OverflowingBitBucket · · Score: 1

    Set up a single dedicated machine with four large (500GB) drives. Script it to replicate the data across the drives. In case of errors or differences, it should default to the most reliable copy (compare the copies across drives). Make it notify on drive failure or file loss.

    Once a year, boot it up, run the script. Leave it for the day or two it takes to run. Remove any bad drives, and replace them. Once the process is complete, shut it down again. Repeat next year. Upgrade the hardware from time to time, and replace any drives that get "too" old, even if they haven't failed.

    If you want to do it on the cheap, always use the last-gen drives and hardware on it. Better than throwing it out.

  200. Old photographs - data that really did survive! by ags · · Score: 1

    All this talk about making sure data lasts for the future. Well, I'm currently mining old data that was laid down well before my birth date, that's old family photographs back to the 1860's.

    For the first time I'm seeing images of my great, great grandfather and his family candidly posed in an Aberdeen studio. My great grandfather and his brothers on an old tintype photo plate.

    Some of the images are in a fairly poor state, but half an hour with the Gimp has got them back into serviceable condition.

    When my relatives submitted to the new craze of photography some 150 years ago, did they ever believe that one of their descendants would lovingly resurrect the data? On a machine that would have been a complete fantasy to them.

    Oh, and how do I know who they all are? Simple; they wrote their names on the back of the photos for posterity.

  201. 500G? by Seraphim_72 · · Score: 1

    I have seen the back ups of the College I work for. Excluding Student Data for true data we clock in at under 100G. *Including* student data we barely break 400G And that is historical data going back 30+ years. There is no way on this earth that you have 500G of actual data. I am sure that you think you do, and I am sure that someone will respond to this post with 'proof' that they do have that kind of data. However if you make a "must survive me/must survive a fire/anyone other than would give a damn" list I cannot see anything over 5G.

    --
    Slashdot, where armchair scientists get shouted down and armchair theologians get modded up.
    1. Re:500G? by Anonymous Coward · · Score: 0

      "Digital" photos ARE data. What would you say they are? Garbage? Just because you don't own a 10 megapixel camera and take thousands of photos doesn't mean nobody does. I usually have 2 gigabytes (or more) of photos from one photo shoot alone. And don't get me started about the files from scanning old photos for restoration!

  202. Too easy by holophrastic · · Score: 1

    Really hope I'm not the first to suggest this.

    Build a really simple desktop machine, with ordinary, inexpensive, large hard drives, in a RAID 1 mirror. When one fails, or every few years, swap it out, let the mirror rebuild in a few hours. I've had good drives last for over ten years, but you don't have to trust them if you simply add another drive every year.

    Your cost becomes one drive per year -- a whopping $50 - $150 -- and the one-time cost of the cheapest server you can find. Or you could use any number of N.A.S. solutions that support multiple physical drives in a RAID 1 configuration.

    You're guaranteed compatible access to your data for as long as hard drives are used -- that'll cover the looming SSD era as well. Basically, for as long as SATA is used.

    Hard drives are very stable and reliable over at least 5 years. SSDs, when they become cheaper, will be even more reliable, and you'll be able to simply swap them gradually into your mirror configuration. And, at any future point thirty years from now, you'll easily be able to convert your array from SATA to the new hotness, and you'll have an overlap of at least five years during which it'll be convenient to do on even the cheapest system.

    Standard storage options, standard configurations, inexpensive everythings, unlimited capacity, unlimited redundancy.

    But you should still worry about fires and floods and theft. Maybe take one of the drives away to some other locations once in a while. Or nightly FTP to your office closet -- incremental to save on bandwidth.

    Oh yeah, and you can use any number of encryption methods, just to be on the safe side.

  203. Film-based backup media? by Kadin2048 · · Score: 1

    I was looking at some old 35mm black-and-white film a few days ago and thinking that it would actually make a pretty nice data storage medium, if you rigged up a machine to write to it. Sort of a "poor man's microfiche."

    You'd need to document the hell out of it, but I think it'd be possible to make a data-storage medium that used film, and would last 100 years easily, and be readable with only a very basic system of sensors and light sources.

    I was imagining having eight parallel tracks down the film, although you could probably easily fit many more. Depending on the film grain you could probably get hundreds of tracks, and each "bit" would only have to be a few thousandths of an inch wide, depending on the grain of the film stock. You'd just move the film over the write head, and expose it with short pulses; the result would be like a continuous 2-dimensional bar code. (Maybe you'd want to waste one track with timing information? Probably.) Develop it in some nice high-contrast developer (since you only want two tones, black or clear) and then store it as you would any other film. Modern film stock is very stable; in a cool, dry place in a metal tin, it lasts practically forever. (Old celluloid isn't nearly as good, but chemically modern films have very little in common.)

    Reading it wouldn't be that different from decoding a big barcode (and it's very similar to the way digital soundtracks on theatre films are written). But even in the absence of any existing hardware, it's the sort of thing I'd bet any undergrad EE with access to a machine shop and electronics lab could come up with in a couple of days. As long as there's no fancy encoding or compression done to the data, you just read it off as quickly or slowly as your equipment allows.

    Anyway, it's totally impractical for a personal backup system, of course, but it might not be a completely ridiculous idea for organizations looking for very long-term archival storage. The advantage I think it has, versus other candidates (e.g. magnetic backup tape) is that reading it would be relatively easy and intuitive. Anyone looking at a piece of film with a magnifying glass or microscope would be able to see that there's something there, plus it's a lot harder to wipe accidentally.

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
  204. For now. by Koatdus · · Score: 2, Interesting

    There really isn't a good solution right now.

    After thinking about it a while I realized that:

    1) Most of the stuff on my computers could be replaced.
    2) The one thing I would really, really hate to loose are family photo's.
    3) Hard drives WILL fail sooner or later.
    4) Tapes are reliable for a while but even in a climate controlled vault I have had tapes at work end up bad after a couple of years. (not to mention the pain in the neck it is to find a working legacy tape drive after 10 years)
    5) DVD's will probably have the same issues.

    My solution for now is redundancy.

    Digital photos get offloaded to my Linux pc. I use a program called Digicam.
    I have a bash script that syncs the new photos to a Windows share on my wifes pc.
    My wife has one of those .5T USB external disk drives with the "one button backup" program that is set to run nightly.
    When I have a couple of new directories of photos I run another script that compresses the whole directory and splits the output into a bunch of 45 megabyte rar archives.
    I then upload them to Microsoft's free "Skydrive". Microsoft just upped their free disk storage to 25GB.
    I also have some documents saved on the free AOL Xdrive.

    I figure in a couple of years there will be a better long term storage option. It will probably be something like a solid state drive that lasts for two hundred years. At that point I will save everything to that and store it in my safety deposit box at my bank.

    --
    Every wrong attempt discarded is a step forward - T. Edison
  205. Here's a variant of the probem by mbstone · · Score: 1

    Same problem, but do not assume I am able to access the media between now and X years from now. Assume also I am flat broke and have no lawful access to privately owned real estate or safe deposit boxes, and have no reliable / trustworthy / willing friends or family to assume physical custody of the media. In other words, I am most likely talking about burying or otherwise concealing physical media in a public place in case I am out of the country or otherwise unavailable for a period of years. Other than encrypting and appending the data to a pron file:

    1. What is the longest lasting / hardiest machine-readable media excluding paper tape or other overly bulky media. Flash drives?

    2. Assume I have a small waterproof canister containing the media. What redundant waterproofing or other physical media protection measures should I take? Extra layers of waterproofing? Silica gel?

    3. What is the most viable and secure long-term hiding place for the canister? Burial in a public park? Near a freeway on-ramp?

  206. Buddy backup by XNormal · · Score: 1

    I think a "buddy backup" system makes a lot of sense. Keep multiple copies of your data on the hard disks of friends and family through the net (and let them keep their data on yours, of course). Encrypt it if you like. Each participant only needs to invest in a big hard drive every few of years (1.5GB is as low as $133 these days). No need for RAID - geographically distributed redundancy is better.

    There is a commercial application to do this called CrashPlan. I'd really love to see an open source project with similar functionality. It should be easy to use for non-technical people and run on multiple platforms.

    --
    Stop worrying about the risks of nuclear power and start worrying about the risks of not using nuclear power.
  207. Use the little known DVD-RAM by Anonymous Coward · · Score: 0

    It is designed for archival purposes and has a specified minimum lifetime of about 30 years. It can be written to like a regular hard disk (though at much lower speeds) without any burning software. The drives perform data verification in hardware when writing and have a hardware based defect management. While not as cheap as DVD-R and all the other less secure DVD formats the drives and discs are affordable (about 150$ for the drive and 1,50$ per disc).
    I have not once seen a read or write error on these discs and use them to store my digital photo archive. One copy is stored to a hard disk for easy retrieval without searching the disc and another copy gets stored to DVD-RAM. Store the DVD-RAM in a different location.
    DVD-RAM is used in libraries around the world so you are sure to find an operating drive in 30 years even if you missed to keep a working one yourself.

  208. Light of experience by gbloon · · Score: 1

    I have used computers daily since 1981. From every one of the media on which my data has been stored, I have suffered loss of that data at some time, with the sole exception of the hard disk. Floppy disks, tapes, CD's and DVD's have all become unreadable after a while, sometimes after 3-5 years, and some in as little as 6 months.

    I am not saying that I have never had a hard drive failure; however, when it happened there was plenty of warning in the form of frantic clicking, slow reads and program errors. If I did not already have a backup of everything on that disk, I knew it was time to make one. I had no idea that the CD's on which I had stored a large number of movies over a period of 3 years would by the end of that period be about 10% totally unreadable and 50% recoverable with some effort. Those CD's were in proper storage cases , unused and kept in a cool dark place. And yet they rotted away unseen. I could actually see holes through the reflective film in some of them.

    I believe that a hard drive, kept unused most of the time, but run occasionally, is very unlikely to fail. The probability of two such drives failing simultaneously is negligibly small. Therefore my solution is to make backups on two separate external hard drives. As the typical cost of HD storage has now fallen to about 10 cents per GB this is also a very inexpensive method, and of course it is also relatively simple and fast.

  209. long term storage by Anonymous Coward · · Score: 0

    The big mistake is to assume you can "fire and forget" - there's no such luck. All backup systems need maintenance and migration as technologies age and get replaced. To assume anything else is pure fantasy.

  210. Framework III by Anonymous Coward · · Score: 0

    That makes me think about a bunch of "Ashon Tate Framework III" spreadsheet dating back to the 80's, that someone asked me to read and reuse in a modern Excel.

    Long term storage in our times and certainly in ever-changing America is utopia. Given a media is found, that won't be supported anymore soon. Given a bunch of stored formats, those won't be supported or even understood anymore soon. Today new inventions replace so fast the older that there's not even time to write history: no one will remember the iPhone or what DRM was in hundred years, for what we are concerned in a hundred years spreadsheet might appear directly into our brain, without any visual aspect anymore...

    Long term personal data storage is like trying to save legacy in an attic, where both the things and the attic keep changing shape and use. Impossible.

    There is anyways an interesting and ambitious point in the wish to save history. I would even think of a public agency with the only job to save legacy in a retrievable way, after having found that way, because history is foundation in human life.

  211. When in doubt, use brute force... by Genda · · Score: 1

    If you're looking for simple archival storage, get an ink jet printer, load with a high quality metal based ink with an insanely long shelf life and chemical stability out the wazoo. Pick a mylar film, something again that is chemically stable and has a probable shelf life in the hundreds or thousands of years. Use the first dozen of so sheets to print the digital key so that the rest of the data can be decoded by anyone in the foreseeable future, then crank through all your data until it's on mylar sheets. So now your data has been converted into a simple binary code on film. You pick whether ink dots are '1's or '0's.

    Store the sheets in a hermetically sealed light tight metal box with a pure nitrogen atmosphere, and super low humidity. Keep the box in a safe place. Check the data every 5 years for integrity. Replace bad sheets if and when they appear. I'm guessing your great, great, great, great, great grandkids will be enjoying your data.

  212. Longest serving data storage systems by Rockin'Robert · · Score: 0

    1. Carved in stone. 2. Oil paints on canvas.

  213. where to store stuff by Anonymous Coward · · Score: 0

    store it on the internet

  214. Re:Quality DVDs, archival storage, repeated backup by Anonymous Coward · · Score: 0

    Don't use time machine, it's utterly useless. Freezes while backing up and when it came to restore time when I had a problem the backup was incomplete and everything went the rectum output shape.

    Use SuperDuper! instead, it works which makes it good.

  215. Re:Social Security has to solve this exact problem by grikdog · · Score: 1

    001110110010110100101001

    --
    ``Tension, apprehension & dissension have begun!'' - Duffy Wyg&, in Alfred Bester's _The Demolished Man_
  216. proper solution probably expensive by jollyreaper · · Score: 1

    They say that paper is still a pretty good long-term storage medium. There's been talk about using high-quality printers and high-quality paperstock to print out tiny, complex patterns that look like UPS shipping data (those weird patterns replacing barcodes) that's actually a representation of encoded data. The encoded version is supposed to be able to fit hundreds of pages worth of data on one page.

    OF course, paper is still capable of deterioration. CD's and DVD's sound like a really good idea with a laser burning the data as areas of light and dark on a special substrate but even that substrate can decay.

    When we get right down to it, pretty much all of our storage media come down to moving something past a read/write head, be it magnetic tape, magnetic disk, or optical disk. Flash RAM is really the only one doing it differently. What we need is a media that won't degrade. That makes me think it won't be bendy so any tape-like solution is out. I'm guessing the answer we want is going to come down to a material with very high stability that is unlikely to decay regardless of heat, moisture, etc.

    My guess is the true long-term solution will be some sort of glass that can be fed into a drive that burns patterns in with a laser. There's no substrate, the pattern is on the glass itself, and burned deeply enough to account for a little erosion. Now the next question is whether the glass will be cylinders, disks, square tablets, etc.

    Now since I'm able to think of this, I'm sure other people have as well, smart people. Since we aren't already hearing about this solution on the market, there's probably bigger pitfalls than I'm anticipating that's holding it up. I'm guessing it'd probably be really, really frickin' expensive.

    --
    Kwisatz Haderach
    Sell the spice to CHOAM
    This Mahdi took Shaddam's Throne
  217. Re:Amazon S3 - too slow by durdur · · Score: 1

    I have a consumer grade DSL line and it's unacceptably slow. I tried to use it for backup of a large set of files (audio, family pictures and some video) and it was running for days, and finally timing out or failing (don't remember how large the batch I tried, but it was at least 20GB in many small files). Even if the cost was fine that's not ok. (And of course if you lose your home disk then you have to do this in reverse to restore; if you can't restore it's no good).

    My solution so far is to put it on a portable hard drive and store it offsite, encrypting anything that's sensitive.

  218. Good Reminder by Anonymous Coward · · Score: 0

    Every year when this question comes up on Slashdot I know it's time to do my long-term backups.

  219. Make it DIYable by Thor+Ablestar · · Score: 1

    Since you cannot know what will be used then, you should find the data media with the interface you can recreate from scratch with your almost bare hands. There are only 2 kinds of media with such an interface: 16-bit parallel CF (IDE is excluded due to lubrication aging) and 1(4) bit serial SD/MMS (You cannot easily recreate USB, SATA or any kind of rotating media). Then include the clay tables with description of the interface and the data format. Any future computers will have the ability of some parallel i/o, it's enough.

  220. One scheme by joe_frisch · · Score: 1

    I have 2 computers, cross-backup every few days. Then every couple of weeks copy everything (copy, not update) to an external HD. The external HD is normally kept off site (at my work). If I accidentally trash a file, I can recover from the other computer. If I somehow wipe out everything on both computers, or if my house burns down, I can recover an at most 2 week old backup from off site. If storage technology changes, I will change the backup media I use. At the moment I use conventional hard disks. I can go to flash, or hyperspace quantum drives or whatever as technology changes. I need 3X the disk space required for a single copy of my data, but disks are cheap. Since my media are regularly used, I know if things are going bad.

  221. For that price... by Artifex · · Score: 1

    To store 500 GB of data would be about $75 a month, plus the $50 to put it on the server in the first place.

    For that price you could buy a new hard drive monthly and copy to it. Have one fail? You still have all your other drives. Remember, in this example you're talking about a steady 500GB or less of data, not something being revised monthly.

    --
    Get off my launchpad!
  222. Re:Social Security has to solve this exact problem by Anonymous Coward · · Score: 0

    01000001 01011001 01000010 01000001 01000010 00110010 01010101!

  223. Re:Quality DVDs, archival storage, repeated backup by ja4444 · · Score: 1

    ...If Linux, some sort of cron job running a Python script that copies /home to an external hard drive. If Windows, I dunno.

    I use "rsync -aH /home/me /media/usbdisk/homebackup" to preserve file permissions, dates, links, etc.. I suppose you could put this in a cron job to automate it.

  224. Make it Easy.... by scottm52 · · Score: 1

    I keep an online backup and then mount my backup volume as a network drive. That way there's two copies (local and remote) that I can access anytime. I picked up a huge SATA for about 100 bucks and have OPENRSM CloudBackup take care of the offsite part. It's easy, cheap, and if I replace the home computer I've already got a place to restore from. You can do it with S3 if your techy at all too. The big thing is the way I do it is easy, cheap, and gets the job done without reinventing the wheel.

  225. Encrypted Online Backup by janozaurus · · Score: 0

    Amazon S3 has been mentioned before and it's cheaper then all comparable offers. (For typical personal use (<< 100GB) S3 is pretty affordable.) With JungleDisk, a commercial (US$20) client, encrypted backups are trivial.

    If you want to beat Amazon's pricing you probably need a good friend working at some large data center.

  226. My best suggestion for now... by Anonymous Coward · · Score: 0

    Buy two different brands of high quality archival rated DVDs and make two backup of everything. One to each type of DVD. Better yet, 4 copies. 2 on each type. For the cost of blanks, this is cheap peace of mind. I'd also burn at less than your max drive speed. Say 4x.

    Store them somewhere safe (and dark) and repeat the process every 1-5 years.

    As for this question being asked over and over. Well until someone solves this problem, don't complain about the question being asked. It's a basic consumer problem that is ignored by industry. There are at least one of these in every industry. That one piece of the puzzle that everyone wants solved or improved, but no one ever does anything about.
    For example, I can buy a car that can e-mail me when it needs an oil change, or parallel park itself (in the event that I suddenly turn retarded), but they all still have the same useless damn windshield wiper technology that's been around for a hundred years.
    Asking for simple, reliable, obsolescence resistant backup systems is like asking for windshield wipers that work for more than 3 months, that don't freeze to your window, and don't leave streaks after the first pollution filled rain leaves it's oily filth on your window.

    As long as the problem exists, people will want a solution.

    Now if I could only invent super wipers that store 5GB of data for 20 years, I'd be a millionaire.

    The Real Tachyon

  227. Benefits of RAID? by Adaptux · · Score: 1

    The other alternative is doing it yourself with RAID arrays at different locations. Sync your 'must be preserved folder' nightly to both locations using rdiff-backup or similar. Regularly test recovery and make sure all the disks in both arrays are working without error.

    If there is fully redundancy by storing the data in two locations anyway, and this is a back-up solution only anyway, is it really so beneficial to use RAID? I mean, sure, using RAID in one or both locations increases redundancy, but also increases complexity (increasing the number of ways in whcih something can possibly go wrong) and cost.

    1. Re:Benefits of RAID? by Fastball · · Score: 1

      (increasing the number of ways in whcih something can possibly go wrong) and cost.

      However, it reduces the severity of each failure by an even greater extent. Sure, one disk means only one point of failure. But that failure is absolute. With a RAID array, a failure in one disk (assuming a truly redundant RAID level, no RAID-0 stripes) affords you the ability to recover from the failure.

      There's a small learning curve to RAID setups, but once you overcome it and it saves your bacon, you'll wonder why you when to war without it.

  228. disk scans by Adaptux · · Score: 1

    I do it myself, with cheap Linux software RAID boxes in separate locations, piggybacking on existing Internet services to do rsync mirroring between the sites. The machines are kept alive, running disk scans.

    Just fsck scans, or full disk scans in which you read every sector that actually contains data? If the latter, I'd be very interested in any statistics that you may have accumulated about disk blocks going bad. For example, is there a correlation between bad blocks, with blocks close to each other on the physical medium having a greater probability of going bad at roughly the same time than blocks with are further away from each other?

  229. SD cards - rate of bad blocks? by Adaptux · · Score: 1

    If less than 50gb, I'd suggest a few SD cards.

    What's the rate of bad blocks for SD cards, over time?

  230. one word, DROBO by Anonymous Coward · · Score: 0

    get yourself a DROBO, problem gone.

  231. Pressed optical media by riclewis · · Score: 1
    I don't know what your cost constraints are, but if you want optical media that won't degrade over time, go to a duplicator and get your data pressed onto the ubiquitous optical media of your time. A professionally pressed DVD will not degrade like a burned DVD because it actually makes pits in the media, as opposed to just manipulating a dye.

    Repeat this process every decade or two and you should be in okay shape. This is of course assuming that optical media continues to be relevant. If we end up back at magnetic (or FSM forbid cloud) storage, it gets trickier.

    FWIW, it'd be interesting to know more about this data. Any particular reason it can't be printed out into a book or onto microfiche? The only long-term important media to me is stuff that lives on after I do, and that's really means diaries/logs and photos.

    1. Re:Pressed optical media by hazydave · · Score: 1

      Actually, glass mastered DVDs fail, but for other reasons: the layers separate, the aluminium oxidizes, etc.

      You're better off with archival-quality DVD-Rs stored in the dark... no to mention avoiding a pretty hefty pre-disc charge for the glass master.

      --
      -Dave Haynie
  232. Magneto Optical Drives by Andypcguy · · Score: 1

    Get a Magneto optical drive. These are designed for archiving and I know the government uses them for data storage. I've seen drives on ebay for $20 -$50 and media for a few bucks ea.

  233. is this really so hard? by Anonymous Coward · · Score: 0

    Cmon? really? this question still has multiple answers? I'm sure that most of the people here on slashdot have at least a desktop and either another computer or a laptop. set up a script to copy files between them automatically. video files? how long can it take? using vista's normal SMB i can get 55-60MB/s, copying a 1.5gb 720p 45min video takes a total of under 30 seconds. I run a script on my laptop that whenever it is plugged in using a cable to my network it syncs photos, videos, university documents, saved files, downloads, application installers, drivers etc between my main desktop, laptop and a HTPC. using windows its very easy, open the scheduled task, set one to start when a network device is active, and write a small batch file using robocopy (u can get it for xp with the power tools pack and it comes with vista) if u dont like using windows programs, using rsync through ssh and copy it that way, both can do copying resuming so u could do it over wireless too (which i do with small updates). if you have 2-3 computers and all their drives fail on you at once... well you werent supposed to have those files anymore, were you?

  234. Re:Quality DVDs, archival storage, repeated backup by Anonymous Coward · · Score: 0

    Depends on your definition of 'personal' files. Video can take up a lot of space.

    I presume the number of space for the video files varies between the duration of being single and the number of ex's one had.

  235. On the other hand by techdojo · · Score: 1

    I really don't think it's in your best interest to offload the data to a passive storage media. As quickly as storage is increasing, it's not unreasonable to literally keep every picture/movie you've ever taken, every email you've either sent/received, or any other piece of data you would vomit over if you lost it with you via either a network share or a local resource.

    Personally, I keep a RAID-1 mirror established with some non-system internal SATA drives and have an external USB drive that I periodically back up to. When we leave for a vacation, I just lock the drive up at my office. I wish I could say I was disciplined enough to have two external drives and keep swapping them between my office and home, but I'm not there yet.

    Bare-minimum, if I get hit with a nasty piece of malicious code or do something just plain stupid, I've only lost the delta between then and the last backup. If it's a simple drive failure (which has happened more than once,) I'm covered.

  236. Kodak 24k gold DVD by Anonymous Coward · · Score: 0

    Anyone tried the Kodak 24k gold cd-r / dvd-r? They advertise them for 100 years I think...

  237. Parent = Backup by MasterOfGoingFaster · · Score: 1

    So that suggests the YOU are the ideal backup. Could you kindly provide longevity statistics of your relatives so we can compute MTBF?

    --
    Place nail here >+
  238. The solution is a routine, not a medium by hazydave · · Score: 1

    Basically, you have two fears -- media rot and format rot. Using reliable media, common interfaces, multiple copies, and regular attention to the issue, you avoid any single point of failure.

    To deal with media rot, you ought to deal with the idea that no single piece of media is guaranteed, so backup often and in multiple places. Archival quality DVDs are a good bet. I shoot lots of digital photos, and I do a new backup when I have roughly 1/3 new material on my HDD. Thus, I have a pretty good chance of dealing with single point failures on all my archival backups.

    I also keep pretty much everything online in a RAID (actually a Drobo... same idea, more flexibility) -- that gives me access to the work at any time. This changes over time, of course... and with technological improvements every few years, the storage medium never goes out of style, and never runs out of space (I'm currently on a 2TB Drobo for this stuff... all my digital photos, all my Dad's digital photos, many scanned transparencies, etc. all in one place, as well as living on DVD backups). In fact, in the case of the Drobo, getting close to full every now and then means swapping out a drive or two for more room, thus automatically cycling in new drives for old, and extending the life. This unit can deal with the total failure of one out of four drives, so it's a fairly painless way to get past the single-point-of-failure issue, even without backups (but still, do the backups, at least of data that's important going forward).

    USB drives are cheap, but not ideal... check the expected storage live of your particular unit. Some multi-level flash drives are expected to last only around 10 years without refresh.

    Moving to formats, think "consumer product". CD, DVDs, and now BDs are consumer products... they have a pretty long life in the marketplace, and it's currently fashionable to support compatible upgrades. USB is similar... you're far more likely to find USB 3.0 ports on a computer 10-20 years from now than a SATA connector. But as well, these things don't change overnight... you can react to the changes as they happen. That's part of the routine, just like backing things up. I used to do backups on CD, now it's all DVD, and it may eventually be Blu-Ray, or something else that shows signs of a long market life in the consumer world.

    Media is also critical... a gold archival DVD is designed to last much longer than the $0.20 cheapies. You're also going to actually see that long life keeping the discs in the dark, under the proper humidity and temperature conditions, etc.

    --
    -Dave Haynie
  239. Watch your file formats, too by hazydave · · Score: 1

    Another form of data rot is the data file. While you probably notice that every audio, video, photo, wordprocessor, etc. format accepts a plethora of file formats, there are some that are virtually certain to be around in 30 years, and others not-so-much.

    First of all... any format with an open source reader can be potentially usable in the far, far future... but to be really pedantic, including the source code in your archives every do often.

    Next, go to consumer or archival formats. JPEG is a pretty easy win for photographs -- it's a web standard, it's been used in virtually every digital camera since the dawn of digital cameras, etc. On the other hand, you may get in trouble with RAW formats, which are camera-specific... I know a number of RAW readers I have don't support the RAW format from my old Canon Pro90IS. You're better off converting to 16-bit TIFF or perhaps Adobe's DNG format (no reason not to ALSO store your original RAW files if you want, other than storage efficiency).

    For video, similar things: consumer formats like DV, MPEG-2, AVC, etc. are likely to be around for a long time, simply because they have such a high level of support. Something Open Source like XViD is another one you can probably count on being usable in some way or another in the distant future... and if it vanishes, you can hack up a new version yourself (well, I know I could) if you keep that source code handy.

    For documents, you're probably safe with PDF, particularly now that it's been certified as a standard... but to date, older versions have always been supported in Adobe's reader, and there are many open source readers. ODF is another one that's likely to be safe simply because it's open source, so someone's likely to be able to deal with it in the future. Microsoft formats tend to change every week or two (ok, I guess they're slowing down a bit nowadays)... they're common, which means they're LIKELY to be supported, but at some point, they become uncommon and may well get dropped by current software, with no recourse on such closed formats.

    For audio, open sourced formats like OGG or huge standards like the MPEG formats (Layer 2, Layer 3, AAC) are likely to be around pretty much forever. WAV and other very simple computer formats too, well described and ultra common. Odd formats ought to be avoided, particularly if they're proprietary.

    And ultimately, you need to be smart about it. Check into your backups every so often, read the release notes on your new software. If the new version of Photoshop or The GIMP drops support for Amiga IFF/ILBM images, I might want to think about updating my archive files to something more modern, while that's still a pretty easy thing to do...

    --
    -Dave Haynie
  240. DROBO by Apollo123 · · Score: 1

    Get a DROBO. Data redundancy, and flexibility to grow the array if required.

  241. It's my favorite scripting language. by wiredog · · Score: 1

    Much more readable than bash, infinitely more readable than perl.

  242. A Tv's media player! by alexandre_ganso · · Score: 1

    Hey,the only things I care for are pictures. And I've accumulated about 80gb of them in the last 20 years.

    Here's what I do:

    1) set up a simple website with all of them. This site is basically an html generated from whatever those "html album" proggies. Don't depend on php nor anything else, just plain html. The pictures are stored into folders separated by date.

    The advantages of this:
    - these folders are the same in my machine (where the pictures are in my /home). So I have two copies of them anyway
    - as I don't backup manually very often, I've created some space in my tv's media player (a tvix) with time machine. So every time I connect my computer there to upload some movie I've downloaded, I have a full backup of them.

    The media player thing + time machine solves the issues of (a) doing backups regularly, as you will connect your computer to it often (b) you will always want more space for hd movies and music. So you WILL replace the hard drive every years or so.

    And the website, besides of being an extra copy (with some searchable information being the album covers with some text), helps you by being found by archive.org and google cache.

    And don't forget to copy all your data to gmail. Simple, searchable and free.

    And let life keep going on. 500 giga is not too much to have on a backup, and in two years is not too much to have even in your notebook's hd.

  243. 2 NAS units synched by gregconquest · · Score: 1

    You buy/make 2 NAS with RAID 5 or 6. Your friend somewhere far away does the same. One of your NAS's becomes his, and vice versa. You set them up to mirror each other. If your house burns down or the contents get stolen, you buy/make two new NAS and get the data back from your friend. Upgrade in 5 years and continue indefinitely.

  244. embrace the pain by Anonymous Coward · · Score: 0

    Embrace the pain of data loss, taking comfort in the fact that losing something that you really didn't *have* to have in the first place is actually freeing ;)

    1. Re:embrace the pain by A+famous+reader · · Score: 1

      Absolutely! This is the only way to peace of mind. When watching footage of families being evacuated from bushfires, tornadoes etc they will usually carry only their pets, and a couple of family albums. I have code from 30 years ago, and when I look at it now, I reckon it can be left behind in the hope that nobody discovers it and wonders what this guy was on. Get the best of your photos onto archival paper and into an album, forget everything else.

  245. How about a meta-physical backup? by owndao · · Score: 1

    As far as maintaining your data's integrity such that a retrieval program can have a shot at recovering it you have a few options. You can backup to a physical medium that is indestructible (within the timeframe and conditions you must decide yourself) or the medium can be meta-physically indestructible within the same conditions as above. This would appear to you as a black box since you do not need to concern yourself with the 'how' aspects of its implementation just the means of interfacing with it (physical and communication protocol) which would be done through any number of interfaces which as time progresses might change just as the definition and implementation of a "computer" may change.

    Now we know that when we submit something to this black box it will be stored and as long as computers and computing don't change too drastically we be able to retrieve it at some future time. The black box portion of this could be in the form of a company that handles all of the details of looking to you like a black box as described for a reasonable fee but here you run into a problem. The company becomes a single point of failure that all of your data at risk. Perhaps governments could provide the black box duties but I, personally, have several problems with that option.

    An alternative way of providing the black box functionality and eliminating the single-point-of-failure problem is to somehow, see that the open source community produces a multi-platform (easily migrated code is what we need here), distributed, well-designed and documented, open source, peer-to-peer, backup program. I know that the topic has been discussed here before but don't know the current status of any of the projects.

    This would be a great project for the open source community as it would solve a problem that has been faced by anyone that deals with large amounts of data and doesn't have a fortune to spend keeping it retrievable let alone easily accessed and navigated. There are many products, both software and hardware, that provide or, at least, approach meeting the requirements of the overall solution.

    --
    Be as you would have the world become.
  246. SkyDrive? by pascalv · · Score: 1

    http://www.labnol.org/internet/best-online-storage-live-skydrive/5771/ has a positive appreciation of Windows SkyDrive "The Best Online Storage Service"

    Extract from the above blog post: "5. SkyDrive is Microsoft service and an integral part of their larger Windows Live strategy so you really don't have to worry about its future existence."