Slashdot Mirror


Ask Slashdot: Best Offline Storage Method For Large Archives?

An anonymous reader writes "I have a collection of large projects (Indesign files with associated images), which are typically 40GB to 60GB each. In this current climate, what is the 'best' method of archiving these? Spinny magnets? Solid state drives? USB? Tape? Blu-ray? All have pros and cons and price considerations. If I remove the price issue (my data is important to me), does this change the choice?"

397 comments

  1. Go with tried and true by Pikoro · · Score: 0

    Put it all on 5 1/4" floppies :)

    --
    "Freedom in the USA is not the ability to do what you want. It is the ability to stop others from doing what THEY want"
    1. Re:Go with tried and true by kimvette · · Score: 0

      Single sided, single density, right?

      --
      The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
    2. Re:Go with tried and true by Frosty+Piss · · Score: 0

      Put it all on 5 1/4" floppies :)

      I don't get it. Sure, I realize that it would take a HUGE number of floppies to accomplish this, but I'm a bit lost as to why that's "funny" as in the "smiley face" emoticon. Can you explain?

      --
      If you want news from today, you have to come back tomorrow.
    3. Re:Go with tried and true by davester666 · · Score: 1

      Papertape is the way to go. Not susceptible to magnetism at all. And it'll be easy to reinvent a way of reading it when the 4th Reich comes to power, as there will always be some way to tell a computer "yes, I can see a light, or no, I cannot see a light".

      --
      Sleep your way to a whiter smile...date a dentist!
    4. Re:Go with tried and true by cjcela · · Score: 0

      Nah. You should actually print it, at least with a font size of 12 points. And then, laminate each page, for extra protection!

    5. Re:Go with tried and true by Anonymous Coward · · Score: 0

      Put it all on 5 1/4" floppies :)

      What compression program do you use that can make 40 GB fit on 5 floppies?

      Personally, I'd store it on /dev/null. There's plenty of space there, it's way faster, and you can easily backup it to a usb stick and carry it with you.

    6. Re:Go with tried and true by cvtan · · Score: 1

      You are kidding right? Use 8-in floppies that can store 180k per disk.

      --
      Sorry, but gray text on gray background is making my eyes bleed.
    7. Re:Go with tried and true by ls671 · · Score: 1

      Right, 1.2 MB used to be less reliable for me and the guy said that his data was important to him.

      --
      Everything I write is lies, read between the lines.
    8. Re:Go with tried and true by KieranC · · Score: 1

      Print it

      --
      Like food, this sig will also pass
    9. Re:Go with tried and true by Pikoro · · Score: 1

      Can you imagine how long it would take to toss even a single gig of data onto 360k floppy disk? The funny part would be how much money you would waste by sitting in front of a computer switching disks every 60 seconds or so and then writing labels for them all, and sticking them on (straight of course or you have to carefully peel it off and put on a new one).

      Lesse.. wolfram alpha says just over 4 weeks (assuming an 8 hour work day) http://www.wolframalpha.com/input/?i=60gib+%2F+360kib+*+60+seconds

      For a total of 174,763 floppies. That would be a stack of floppies (http://www.wolframalpha.com/input/?i=174763+*+2mm) ~350 meters tall, or just taller (1.2x) than the Eiffel Tower.

      Might be easier to just buy a bunch of TB hard disks and put them in a Raid 5 configuration (with hot spares of course) and be done with it.

      --
      "Freedom in the USA is not the ability to do what you want. It is the ability to stop others from doing what THEY want"
    10. Re:Go with tried and true by Anonymous Coward · · Score: 0

      That might work, if you mean use the 5.25" floppies as coasters to stop the hard disks from getting scratched up on the desk...

    11. Re:Go with tried and true by bennomatic · · Score: 1

      What is this "MB" you speak of? My 1541 drive stores 170KB per floppy. That should surely be enough for anyone.

      --
      The CB App. What's your 20?
    12. Re:Go with tried and true by cheeks5965 · · Score: 1

      wow, those would be five really tiny floppies.

      --
      -- Flame me and I will happily flame you back. Bring it!
    13. Re:Go with tried and true by UNIX_Meister · · Score: 1

      Don't forget to punch the floppy so you can use the back side!

    14. Re:Go with tried and true by toastar · · Score: 1

      No Cost consideration? 2 Servers with Raid 7 Arrays/Servers using enterprise drives of course. LTO 5 Stacker With Weekly Iron Mountain Pickups. This is the same setup I would suggest for my clients for Seismic Data. I'd be happy to set it up for you for $125k. Depending on your opinions and general level paranoia, we can discuss online backups, the pricey part always seems to be fat pipe.

    15. Re:Go with tried and true by Pikoro · · Score: 1

      Uh oh. MichaelKristopeit is on my side? I don't know how I should feel about that.

      --
      "Freedom in the USA is not the ability to do what you want. It is the ability to stop others from doing what THEY want"
    16. Re:Go with tried and true by Senes · · Score: 1

      As both a printer and a graphic designer, I have to say ---> THIS
      There's no such thing as one perfect solution. If you REALLY care, you'll apply multiple solutions. Put it on DVD, then put it on at least one hard drive, then put it into print. If both DVDs and hard drive are ruined then at least you can reproduce the paper documents.

    17. Re:Go with tried and true by Anonymous Coward · · Score: 0

      load "$" ,8 ,1

      SEARCHING FOR $
      LOADING.
      READY.
      LIST

      0 "PWNED ASSHOLE" 88
      0 BLOCKS FREE
      READY.

      Nooooooooooooo!

    18. Re:Go with tried and true by Mister+Pedant · · Score: 0

      I would be careful with paper in that 4th Reich, the previous Reich promoted the burning or 'Säuberung' of paper based documents.

    19. Re:Go with tried and true by Anonymous Coward · · Score: 1

      She says you're a little on the small side.

    20. Re:Go with tried and true by Anonymous Coward · · Score: 0

      You win sir.

    21. Re:Go with tried and true by datapharmer · · Score: 1

      This is right on. I actually do backups for an enterprise company (and numerous smaller ones). My enterprise customer uses raid for live data, raided disk arrays for current backups, intranet transfer to an alternate facility on another continent for mirroring and current backups, daily and weekly to tape and tapes are rotated out to Iron Mountain weekly for a month of offsite data. It is expensive, but so is losing the information your business relies on. For this person though, let's be realistic - get a couple external hard drives, flash storage media, blueray, or tapes - pick any two and keep them in two different locations. test them on a regular basis to make sure they are readable and refresh the media as specified by the manufacturer (replace optical at least every year, replace tape based on wear or every 3-5 years and replace hard drives and flash media based on wear or when they are out of warranty.

      --
      Get a web developer
    22. Re:Go with tried and true by Rob+the+Bold · · Score: 1

      Put it all on 5 1/4" floppies :)

      I don't get it. Sure, I realize that it would take a HUGE number of floppies to accomplish this, but I'm a bit lost as to why that's "funny" as in the "smiley face" emoticon. Can you explain?

      More than 20 years ago, I'd back up a 30 Megabyte disk on 1.2MB 5 1/4" floppies. The process took two or three 10-pack boxes of disks and several hours of babysitting the computer, swapping out floppies, labeling them in order, hoping they were all good and finally labeling and storing away all the boxes now full of my most recent backup. Now imagine how tedious (expensive, error-prone, ridiculous, utterly impractical) this would be for tens or hundreds of gigabytes .

      If that doesn't do it, then imagine filling a swimming pool with a teaspoon. Mowing the lawn with scissors. Dialing every possible phone number because you forgot the last four digits.

      If none of these things sounds funny, I guess this just isn't your style of humor. I didn't LOL, but I did chuckle.

      --
      I am not a crackpot.
    23. Re:Go with tried and true by Anonymous Coward · · Score: 0

      She says you're a little on the small side.

      FTMFW

    24. Re:Go with tried and true by Mad-Bassist · · Score: 1

      Heh, that brings back memories of 88KB on my Atari's floppies, which I doubled by cutting a notch on the other side and flipping it over, even if they were sold as Single-Sided Single Density.

      As for the OP, so far I'm keeping stuff on external hard drives, but I know it's not the best solution for long-term. Where are those multi-layer FMD discs I kept hearing about?

      --
      "The only legitimate use of a computer is to play games." - Eugene Jarvis
    25. Re:Go with tried and true by Mad-Bassist · · Score: 1

      There's always PaperDisk, apparently able to store 1 meg per sheet. They quote 4MB, but that's after compression and BS to people like us. :-)

      --
      "The only legitimate use of a computer is to play games." - Eugene Jarvis
    26. Re:Go with tried and true by ficuscr · · Score: 0

      rsync

    27. Re:Go with tried and true by Frosty+Piss · · Score: 1

      ... Mowing the lawn with scissors. ...

      Did that in USAF Basic Training.

      --
      If you want news from today, you have to come back tomorrow.
  2. Hard drives by serkit · · Score: 0

    Definitely hard drives

    1. Re:Hard drives by JMJimmy · · Score: 2

      Put it in the cloud! *waves arms like it's something mystical*

      Seriously though, there is no great solution. Burned discs separate over time, there's not enough data on SSDs yet but it's not looking promising, platter drives are susceptible to radiation, tape to magnetic fields and degradation. HDD in triplicate, replace every 7-10 years is the "best" method right now. So despite being modded down, serkit is right. Hard drives.

    2. Re:Hard drives by gmack · · Score: 1

      Definitely hard drives

      I don't know why that got modded down. Thanks to SATA having a standard plug positioning and hot swap connectors, there are some nice solutions that allow you to just slot in a SATA drive and pull it when you are done with it. You can get them direct SATA that plug into the front of the machine or external USB/Firewire models.

  3. Multiple copies by Anonymous+Cowar · · Score: 0

    If you're worried about long term reliability, try a raid-1 array of a spinning drive mirrored against an SSD. Make monthly backups to optical. That way, if your SSD fails, you still have two other options. This would probably be the most affordable method us mere mortals could have to hope to store long term data with a pretty good reliability. Unless you can get your hands on a second hand tape drive and some 500g tapes.

    1. Re:Multiple copies by Anonymous Coward · · Score: 0

      Unless you buy extremely good archival grade discs, optical media is the worst suggestion. Have you ever tried doing an md5 check on a 5yr old dvd-r? Next, with 40-60GB files, thats what right now...$100 for every two files if you mirror them to ssd? Why not just buy 20 3TB hdd's, from different batches of course, and run an uber raid-1 mirrored across all 20. That'd be cheaper than buying the amount of ssd's required to back up just one single 3TB hdd.

    2. Re:Multiple copies by Z00L00K · · Score: 1

      Right now it seems like the best solution is to have servers that replicates the data between them and use classic hard disks.

      Two or three servers with a RAID setup on each. Hotswap disks are preferred since disks do fail now and then. Synchronize between them using rsync, and have the servers at different geographical locations. (at least have one hosted at a friend - or at work.)

      --
      If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
    3. Re:Multiple copies by Joce640k · · Score: 2

      Unless you buy extremely good archival grade discs, optical media is the worst suggestion.

      Even with archival-grade disks it's still the worst suggestion.

      Apart from tape - yeah, let's put all our data on something that can't be read without specialized hardware! (where will you get a tape drive from in an emergency?)

      Hard disks can be connected to any PC, they're cheap, they're fast. The only problems I've ever had with USB disks is failure of the cheap-ass wall-warts they supply them with. Luckily all USB drives use either 5V/12V so it's easy to wire them up to a spare PC power supply. I have one under the desk and any USB disk which is switched on all day gets connected to that. The wall-wart goes in a drawer for emergencies.

      All other considerations aside though, the only thing that's going to garantee long-term success is:
      a) Use something that can be read on any machine with no special hardware or drivers.
      b) Make multiple copies of the data and store it in different locations.
      c) Use some widely used, non-proprietary format for combining/compressing the files (eg. zip).

      Base whatever you do on this philosophy and you should be OK.

      --
      No sig today...
    4. Re:Multiple copies by jones_supa · · Score: 1

      Hard disks can be connected to any PC, they're cheap, they're fast. The only problems I've ever had with USB disks is failure of the cheap-ass wall-warts they supply them with. Luckily all USB drives use either 5V/12V so it's easy to wire them up to a spare PC power supply. I have one under the desk and any USB disk which is switched on all day gets connected to that. The wall-wart goes in a drawer for emergencies.

      Or, to make things even simpler, use laptop disks so you can pull the power through USB.

    5. Re:Multiple copies by Joce640k · · Score: 1

      Laptop disks tend to be either much lower capacity or much more expensive than equivalently priced 'desktop' drives.

      Plus ... you soon run out of USB ports if you want half a dozen of them (a lot of the bigger ones need two USB cables to get enough current). This means buying powered USB hubs ... and ... the hubs are usually powered by cheap-ass wall-warts!

      --
      No sig today...
    6. Re:Multiple copies by datapharmer · · Score: 1

      If you think tape is a bad idea you clearly haven't done any serious work in archiving or backup. Where would I get a tape drive in an emergency? I would probably use the one mounted in the rack attached to the backup server like I did yesterday, or if that failed I would use the spare we keep in case of failures, and if it all burned down I would drive over to the other facility and use one of the two drives located there.

      --
      Get a web developer
    7. Re:Multiple copies by datapharmer · · Score: 1

      bad idea. Those usb powered drives tend to fail more often because many draw more power than the usb spec allows (which is why some ship with two usb connectors), sure they might spin up, but the low amperage is pretty hard on them long term.

      --
      Get a web developer
    8. Re:Multiple copies by pnutjam · · Score: 1

      At over $1k each, tape drives are out of the budget for most home users. Tape is also much slower then most people realize, much slower...

    9. Re:Multiple copies by loners · · Score: 1

      Don't forget to use checksums for bit rot detection.

    10. Re:Multiple copies by DarkOx · · Score: 1

      where will you get a tape drive from in an emergency

      That is certainly something you should have address in your disaster recovery plan, but I would not characterize it as a hard problem to solve.

      --
      Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html
  4. Rotational media by BWJones · · Score: 4, Interesting

    For this project, we have multiple multi-terabyte (5-18 terabyte) datasets that need backup. We have online and offline strategies and the offline strategy is simply multiple, redundant copies on hard drives stored in static proof containers onsite and off site.

    Hard drives are *very* cheap all things considered, are easy to store, take up very little physical space and if things go badly, restoring from them is faster than just about any other method. For datasets in the GB range, its a no-brainer to go with hard disks.

    --
    Visit Jonesblog and say hello.
    1. Re:Rotational media by Anonymous Coward · · Score: 0

      Yes, with highly fault tolerant, such raid 6. and a _good_ storage controller such as a xyratex.

    2. Re:Rotational media by Mad+Merlin · · Score: 5, Insightful

      I concur on this point, online storage really makes the most sense. Cheap, high performance (for sequential read/write) and easily expandable. You can get a single machine with dozens of SATA drives in it (including the drives) for way under 5 figures. When drives fail, they're simple to replace, and every couple years, migrate the whole thing to newer (faster, bigger) drives. Mirror your data unless you don't care about it. RAID 1/10 for really small datasets (2-4 drives), RAID 6 for moderate size datasets (5-10 drives) and RAID 60 for anything bigger.

      A very important note to keep in mind... stay away from hardware RAID! When your controller dies, so does all your data, unless you have an identical spare controller card (buy it up front, they won't exist in a couple years). The same goes for fake RAID (ie, software RAID driven by the BIOS), but s/controller card/motherboard/g;. Pure software RAID (ie, using mdadm) is a safe bet.

    3. Re:Rotational media by juventasone · · Score: 2

      Even on a small scale this makes sense. The easiest is 2.5" external drives ($100 for 1TB). This avoids the mess of power adapters. If you need significantly more storage, you may want to consider a dock ($50) and internal desktop drives ($80 for 2TB). Consider this: you can buy from anywhere a USB adapter that will plug into a 20+ year old drive and any OS will mount it. Wish I could say the same about all my removable media...

      Traditionally the way to do this is with tape. As you replace the drive (and you will), your tape capacity increases, but it will be read-compatible with your old tapes. The investment is huge, but it makes it very easy to replicate, take off-site, archive, etc.

    4. Re:Rotational media by Local+ID10T · · Score: 2

      A very important note to keep in mind... stay away from hardware RAID! When your controller dies, so does all your data, unless you have an identical spare controller card (buy it up front, they won't exist in a couple years). .

      I have to disagree... Adaptec raid card -they stand the test of time. Hell i can still buy a 2940 controller card if I want one...new!

      --
      "You want to know how to help your kids? Leave them the fuck alone." -George Carlin
    5. Re:Rotational media by lucm · · Score: 3, Informative

      > and a _good_ storage controller such as a xyratex.

      I would rather run Windows Home Server on a RAID-0 of IBM DeathStars installed in a HP Pavilion than deal with a Xyratex.

      --
      lucm, indeed.
    6. Re:Rotational media by lucm · · Score: 1

      I agree with that, unless the budget is insanely low hard disks are a no-brainer. They are also easier to encrypt than tapes and offer a much higher rate of reuse.

      --
      lucm, indeed.
    7. Re:Rotational media by lucm · · Score: 1

      > stay away from hardware RAID! When your controller dies, so does all your data, unless you have an identical spare controller card

      I totally agree. On some systems even a broken RAID-1 cannot be recovered using a different controller.

      It's a good thing that more and more filesystems are going beyond RAID (such as zfs or btrfs) and even storage unit are moving to block replication.

      --
      lucm, indeed.
    8. Re:Rotational media by mlts · · Score: 1

      Just make sure your encryption program is available, perhaps as freeware, or use a utility that is widespread and easily gotten. I've seen fancy encryption programs for drives that would be useless if there ever was a restore needed just because the license keys likely wouldn't work.

      For encryption, I'd go with LUKS, or TrueCrypt on a disk level. For file encryption, gpg is solid, perhaps tar, bzip2, and gpg, although you might need a utility for error detection/correction to repair any data lost due to bad sectors.

      The hardware is one piece of the puzzle. Having the right software to pull data off is the other. This is what is nice about hard disks -- assuming a common filesystem, a hard disk that has USB access today likely will be readable by most machines 10 or 20 years from now either directly, or via an adapter. Tape can be universal, but oftentimes a tape archive may not be readable because it was created in some wierd format (like the DOS backup programs that backed up to QIC drives in the '90s, all incompatible with each other.)

    9. Re:Rotational media by Spazmania · · Score: 1

      I've used a dozen different brands and models of hardware raid cards. The only ones that haven't lost my data are the defunct Mylex DAC960 and HP's Smartarray controllers.

      Had a problem with the mail server recently where a malfunctioning drive kept killing the scsi bus, causing the Smartarray controller to see every drive except that one as faulty. Once I figured out the bad drive, recovered without losing the data. Had similar problems using Adaptec and LSI cards a few years ago. Data gone. Maybe the current generation of Adaptec and LSI cards is better, but the ones I trust are the HP Smararray.

      Linux software raid is also superb for reliability if you can tolerate the performance of software raid. Which for backups you generally can.

      And for the record, an Adaptec 2940 is not a raid controller. It's a plain jane SCSI card -- something which unlike RAID controllers Adaptec did very well.

      For the OP - go with hard disks. Use different brands and models and always use more than one. Some production runs are defective; I had a 10 disk raid 5 once where 6 went bad, all from the same production run. Killed me because they went bad faster than I replaced them in the raid 5. Putting copies of the file on at least two different brands and models will save you from that. Use different sizes too... if you put one copy on a 2TB seagate drive, put the other copy on a 1TB WD drive.

      Keep some at home, some at work, some in the bank safe deposit box.

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    10. Re:Rotational media by JMJimmy · · Score: 1, Funny

      Online storage makes no sense. Give data you obviously care about to a company who's going to implement the cheapest "backup" system they can get away with and won't give a crap if they lose your data because they're protected by their terms of service. That and uploading 60gigs of data would take forever let alone multiple sets of 60gigs.

    11. Re:Rotational media by baitisj2 · · Score: 1

      Here's a spiffy way to do backups.

      Create five-way ZFS mirror. Remove the front two drives of the mirror. When backup time comes around:
      * zpool scrub
      * perform system snapshot
      * insert pair of drives from bank deposit box into disk array
      * zpool replace the two disks, wait for resilver to complete - you now have a five-disk mirror in your machine

      now, you can offline / store your data elsewhere.

      * remove the front two disks. system operates in "degraded" state with three-way mirror.

      When backup time occurs again

      * zpool replace the offline volumes
      * remove the next two disks

      This allows you to rotate out disks, ensuring that they are spun up from time to time. zfs checks for checksum errors, helping you identify a bad drive before anything serious happens (also check SMART data).

      I think this is best done with drives of the same size from different manufacturers to help reduce the chances of multiple drive failure.

      I really wouldn't trust any other filesystem in a RAID. There are too many really good reasons to go with ZFS, and I've had some awful crashes with Linux software RAID.

    12. Re:Rotational media by Keruo · · Score: 4, Informative

      > Online storage makes no sense.
      In storage, online means the data is connected and instantly available(harddrive etc) vs offline(dvd,tape etc)

      --
      There are no atheists when recovering from tape backup.
    13. Re:Rotational media by profplump · · Score: 1

      If you put tar archives on tape in 1976 they'd still read with modern GNU tar on a standard linux distro today, assuming you had a working tape drive. There is *no* random-access filesystem for which the same thing can be said. Certainly not all tape formats survived the test of time, but if you choose wisely tape a safer bet for longevity than any random-access file system, if for no other reason than the format is physically easier to read and the filesystem is easier re-implement.

      That being said, for data you don't want to spent $20k per tape/disk/etc. to recover, the only real option is to keep rotating it forward every time storage technology changes. If you keep a copy on two different kinds of "modern" storage (including file formats) and rotate to a newer version every ~5 years you'll always have your data. Anything short of that will almost certainly lead to eventual data loss for any data not in active use.

    14. Re:Rotational media by Anonymous Coward · · Score: 0

      And I've had some really terrible ZFS crashes and exactly 0 linux-md crashes. Neither of our cool stories are particularly good evidence that one approach is superior to the other.

      I also I honestly don't understand how anyone can recommend ZFS until/unless recovery tools become available. At least with md the recovery is block-level and you *can* reconstruct a standard bitstream without only a little bit of scripting/understanding of the on-disk layout. In ZFS the on-disk layout is quite complicated and there's no way you could script up `dd` and `bash` to re-built a partly failed filesystem/array. Once there are reasonable recovery tools available for ZFS it might be a reasonable option for many applications, but until then you're putting all your eggs in one brand-new, lightly-test, no-chance-of-recovery-on-error basket.

    15. Re:Rotational media by DMUTPeregrine · · Score: 1

      Wrong "online" here it means actively powered/powerable. Make your own NAS style.

      --
      Not a sentence!
    16. Re:Rotational media by Anonymous Coward · · Score: 0

      NEVER build a RAID array from disks from the same batch.
      Or at least if you care about the data.

    17. Re:Rotational media by drsmithy · · Score: 1

      Linux software raid is also superb for reliability if you can tolerate the performance of software raid.

      In most cases. with modern hardware, software RAID has superior performance to hardware RAID.

    18. Re:Rotational media by Z00L00K · · Score: 1

      The downside with Adaptec is that they don't have the best performance - even with raw access to the disks through the controller.

      --
      If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
    19. Re:Rotational media by adolf · · Score: 1

      Media doesn't age as fast as you think.

      I've got a USB 3.5" floppy drive that I bought about six years ago. I've only used it a few times, but it's a glorious little thing when it's needed.

      I've also got a couple of 5.25" drives (both low and high density, which is important due to the difference in head sizes), but those are available (and cheap!) too.

      I've got a box full of QIC-80 and DDS tapes, both of which (AFAICT) are abandoned standards. I don't have a drive for either of them anymore, but that's not really a problem: If I ever wanted to try to read one, Ebay has a working drive.

      Now, sure: If I buy used hardware, there's a chance that it doesn't work. So what? Buy several pieces of it. At about $25-50 each, it's not a big deal. (And if your old data isn't worth $100-200, it's probably not worth reading anyway...)

      So, I guess my point is that my removable media from 20 years ago should be very readable indeed, if bit-rot hasn't gotten to it first. And then, after performing the restores and the format-shift to something modern, re-sell the drive(s). (You'll recoup at least part of your investment, and help someone else out. Ebay can be an excellent rental program.)

      Probably the most important thing, I think, when choosing a medium for long-term storage is this: Buy whatever it is that is both good, and that is compatible with whatever everyone else is buying, and avoid obvious cheese. This ensures that the hardware will still be available in a decade or two, and avoids cheesy things like LS-120, MD Data, and Floptical that never were very good and would be a bear to deal with in the present.

    20. Re:Rotational media by Local+ID10T · · Score: 1

      And for the record, an Adaptec 2940 is not a raid controller. It's a plain jane SCSI card -- something which unlike RAID controllers Adaptec did very well.

      My point was that they still make them. Not all hardware is gone in a few years.

      --
      "You want to know how to help your kids? Leave them the fuck alone." -George Carlin
    21. Re:Rotational media by Anonymous Coward · · Score: 0

      Would that configuration even boot completely before failing? ;-)

    22. Re:Rotational media by adolf · · Score: 1

      Why?

      I try to avoid hardware RAID at all costs (the Linux md driver is just fine for my purposes, is free of hardware dependencies, and can probably also have its volumes read with *BSD with a bit of massaging if needed. Besides, CPU and IO time is cheap for what I do, so there's no performance hit for doing it in software.).

      So. If I wanted a hardware RAID solution for some reason (perhaps for improved OS independence and/or less CPU and IO overhead), why would I not want one from Xyratex?

    23. Re:Rotational media by fellip_nectar · · Score: 1

      stay away from hardware RAID! When your controller dies, so does all your data, unless you have an identical spare controller card

      dmraid can read most hardware raid layouts these days...

      --
      Worst. Signature. Ever.
    24. Re:Rotational media by Anonymous Coward · · Score: 0

      "Different brands" isn't always a safe bet these days, with more and more "brand" just being re-badged drives from another manufacturer.

    25. Re:Rotational media by evilviper · · Score: 1

      you can buy from anywhere a USB adapter that will plug into a 20+ year old drive and any OS will mount it. Wish I could say the same about all my removable media...

      Maybe if you used FAT. Any other file-system at all, and you're struggling.

      Ext2 is fine on Linux, but I haven't found any other OS with a halfway decent implementation, though. Sure, there are plenty of half-working implementations, but fsck on ext2 outside of linux? Forget it. UFS/FFS might have worked, but interoperability between implementations is pretty damn poor.

      And besides, how would you have known, 20 years ago, that FAT and UFS would still be in-force? Maybe you thought some other OS was going to survive in the long-haul. Okay, well it didn't, nothing else knows the filesystem, and even if it ran on x86, your ancient OS doesn't have USB support, and is quirky enough it probably won't run on a modern PC... Hell, even tried to load Windows 95 on an K7 or better? Sure, there are patches, but only because Windows 95 survived.

      Personally, my concerns are more fundamental. I don't want my important data to be ONLY online. If I need it in 20 years, I want several off-line hard copies. How about subtle bit-errors in your primary system being slowly replicated to your online backup? Maybe it's defective hardware, or maybe viruses and the like... Sure, it's a great model for simple home usage, where the data isn't all that valuable, and not important for too many years. And it's great for short to medium-term archiving for businesses, greatly reducing the number of hard-copies you have to make and store, but it's not something to be entirely trusted, without those hard-copies available in the worst-case scenarios.

      Traditionally the way to do this is with tape. As you replace the drive (and you will), your tape capacity increases, but it will be read-compatible with your old tapes. The investment is huge, but it makes it very easy to replicate, take off-site, archive, etc.

      The problem is making sure you bet on the right technology... The right technology being the ones big companies are using, so there will be demand for the drives in a few decades. Use some cheap junk "personal" tape drive, and you won't find drives that can read it in a few years. Show me a SaS or SATA Iomega Ditto tape drive... Sure, fortunately the old IDE and SCSI-1 interfaces are still around, for the moment, but they won't be, for too much longer.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    26. Re:Rotational media by slashmais · · Score: 1

      1. Encrypt the files and make them available on a web-page.
      2. Wait until the Internet Archive dudes have done their thing.
      3. ... (backed-up for ever!) ...
      4. profit!

      --
      time time everywhere and not a second to spare
    27. Re:Rotational media by orange47 · · Score: 1

      um, whats wrong with HP Pavilion?

    28. Re:Rotational media by Spazmania · · Score: 1

      Not really.

      If you sprung for hardware raid without the battery/capacitor needed for the write back cache then you're a fool.

      With the write back cache, the instant return for data committed to the cache makes the hardware raid insanely faster than software raid for most disk loads.

      With a write-through cache (no battery) a software raid 1 is usually faster. You have to have a pretty crappy card before a raid 5 or raid 6 is faster on the main CPU than on the dedicated card. The extra computing on the main CPU and the extra I/O overhead to multiple devices instead of one takes its toll.

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    29. Re:Rotational media by drinkypoo · · Score: 1

      The extra computing on the main CPU and the extra I/O overhead to multiple devices instead of one takes its toll.

      The first part yes, the second part no. The same amount of data is going across the bus either way.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    30. Re:Rotational media by m50d · · Score: 1

      The same goes for fake RAID (ie, software RAID driven by the BIOS), but s/controller card/motherboard/g;

      Not really; linux's dmraid can read all the common fakeraid formats from any controller. You may not be able to boot from your disks in another machine, but you'll be able to read them.

      Mdadm has lost me large amounts of data in the past; after a while you will have isolated bad sectors on all your drives, but linux believes the correct way to deal with a single URE is to boot that drive out of the array. Of course this doesn't happen until you try to read it. So to my mind the only option is ZFS's raid-z, which handles this case correctly, and has a way to verify your data before it breaks.

      --
      I am trolling
    31. Re:Rotational media by Anonymous Coward · · Score: 0

      Why not use an online backup vendor, who charges on GB/TB basis? I use YON Systems(www.yonsystems.com) for my laptop archival. They also do custom plans, since in this case there is no continuous data transfer and it is mostly an one time archival and when-ever needed you may want to restore it. Not sure if they are available in your country, but worth a try. For me they sent a person with their USB HDD to collect the initial data and then onwards the backup is incremental, so I save on my initial upload via internet.

    32. Re:Rotational media by NJRoadfan · · Score: 1

      They don't make official 64-bit drivers for them under Windows however.

    33. Re:Rotational media by unencode200x · · Score: 1

      Some private online backup companies (such as MSP providers) use very high quality stuff (such as EMC's Avamar), backup to multiple data centers, and provide managed backup services with SLAs and an annual (or more frequent) DR test/verification. The company I work at provides this service. It's expensive, but guarantees backups are safe.

      --

      Chance favors the prepared mind.
      Perfect is the enemy of good.
    34. Re:Rotational media by maxume · · Score: 1

      For the cheap IDE adapters, the cutoff point is somewhere around 1993.

      I know this because my cheap adapter would not talk to a ~1993 drive that worked just fine in my 1997 Dell. So I think the adapter probably only goes as far back as ATA-1.

      --
      Nerd rage is the funniest rage.
    35. Re:Rotational media by vegiVamp · · Score: 3, Insightful

      Fully concur, and let me sum it up: The best type of long-term storage is "redundant".

      --
      What a depressingly stupid machine.
    36. Re:Rotational media by petermgreen · · Score: 2

      For offline archival I would avoid raid, it's just another thing to potentially go wrong when you try and hook the drives back up and retrieve the data.

      To protect against corruption and drive failures either just keep multiple copies and checksums so you can tell which copy is good (simpler but less efficient in terms of the protection you get relative to the storage space you use) or use something like parchive. Maybe combine the two with two sets of drives in different locations holding the data and then parchive in case the same drive becomes failed/corrupt in both sets.

      Whatever you do with offline hard drives make sure you do tests once in a while. so you can see when drives are failed/failing and replace them before the situation becomes critical. You should also keep an eye out for drives using obsolete interfaces and migrate the data to more modern drives.

      KISS should be your guiding principle. If you can't retrive your data with nothing more than the raw drives, a standard PC, a linux livecd and common FOSS tools then you are doing it wrong.

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
    37. Re:Rotational media by Anonymous Coward · · Score: 0

      I concur on this point, online storage really makes the most sense.

      As long as you're OK giving someone else your data.
      Also, the whole part about "...best offline storage..." seems to indicate they are anticipating a scenario where the internet is unavailable when they need it.

    38. Re:Rotational media by petermgreen · · Score: 1

      And besides, how would you have known, 20 years ago, that FAT and UFS would still be in-force? Maybe you thought some other OS was going to survive in the long-haul. Okay, well it didn't, nothing else knows the filesystem, and even if it ran on x86, your ancient OS doesn't have USB support

      As long as you can get it to run in some sort of emulator the lack of USB support probablly doesn't matter much since your emulator can take the USB drive (or an image of it) and present it to the guest OS as an IDE or SCSI device.

      But in general you are right, while hard drives have fared better than most removable media there is still a need to keep on top of your backups if you want to keep them viable over a period of decades. Keeping on top of them involves both checking for failed copies to maintain redundancy and checking if migrations are needed due to obsolete hardware or software.

      --
      note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
    39. Re:Rotational media by flonker · · Score: 1

      I have to agree regarding staying away from hardware raid. We even saw the same model number, same part number have completely different firmware, hardware and driver.

    40. Re:Rotational media by Daniel_Staal · · Score: 1

      For large storage sizes and reliability, ZFS would be well worth looking at. Set up a multiple-mirrored array, with dedup and compression, and scrub it regularly to detect hardware failures and disk degeneration. It's not really 'offline', but I'd be hesitant to use HDDs as purely offline storage. They aren't really designed for that use-case.

      I don't know if that's the best option for the submitter, but it's worth looking at. Easy to set up under FreeBSD as well.

      --
      'Sensible' is a curse word.
    41. Re:Rotational media by Anonymous Coward · · Score: 0

      Until you run out of working computers which have a PCI slot.

    42. Re:Rotational media by Zemplar · · Score: 1

      Ditto the ZFS suggestion; consider Solaris and commercial support from Oracle if this data is valuable. Due to the importance of the data and support options, I'd not suggest OpenSolaris or FreeBSD unless you consider yourself very competent with either. One day that support option just might pay for itself.

    43. Re:Rotational media by the_B0fh · · Score: 2

      *WHAT* century are you living in? In the 1990s, Adaptec was a good brand. Not any more. Not just not good, but actively sucky.

      http://marc.info/?l=openbsd-misc&m=125783114503531&w=2
      http://marc.info/?l=openbsd-misc&m=126775051500581&w=2
      http://marc.info/?l=openbsd-misc&m=128779369427908&w=2

      (Read 'em in order)

      Adaptec: another way of saying "my data is not important"
      Adaptec: unsafe on any platform.

    44. Re:Rotational media by LordLimecat · · Score: 2

      I cant believe my ears. Someone is asking for advice on archival media, have suggested that tape is an option and price is not a big factor, and the response is "yea, you should buy less reliable, more expensive rotational media".

      Why on EARTH wouldnt he pick up a cheap LTO3 drive (which can be had for $200 these days), grab a couple $20 tapes (holding ~300GB), and call it a day? Theres your multiple copies, LTO supports WORM tech, and youre basically guarenteed that you will be able to read the tapes for the next 10 years or so (read-ability is guarenteed 2 generations back in LTO, so LTO5 drives can handle LTO3 media).

      Hard drives just are not generally convenient to have multiple media sets (unless you want to find a hot-swap bay and a zillion drive cages), theyre prone to disaster, SATA connectors REALLY arent meant for rotation (theyre rated for 50 insertions per connector...), they have motors and bearings that can wear out, etc, and at the end of the day are more expensive.

      Think of it this way-- once youve bit the cost of the tape drive, cost wise you can afford 3 tapes for every single drive. So if your data is really valuable, take into consideration the number of backups you can feasibly afford on tape vs spinning platter.

    45. Re:Rotational media by LordLimecat · · Score: 1

      Doesnt change the humor of the suggestions flying around. For archival purposes, online is brain-dead; the whole point of offline archiving is that it doesnt spontaneously wear out or die or get hit by a surge while its holding your valuable data.

      The OPs post was practically begging for someone to say "use tape".

    46. Re:Rotational media by LordLimecat · · Score: 1

      They are also easier to encrypt than tapes and offer a much higher rate of reuse.

      LTO4 has native encryption support; failing that you can simply encrypt your data on the way to the tape drive (backupexec supports it, and i imagine it wouldnt be too hard to gzip w/encryption the data on the way to the drive). As for reuse, LTO tapes are rated for 5000 loads/unloads; SATA connectors are rated for 50 connects/disconnects.

      Not sure about USB durability, but when you go to external usb drives you can run into issues with drives being bumped during operation.

    47. Re:Rotational media by whitelabrat · · Score: 1

      A very important note to keep in mind... stay away from hardware RAID! When your controller dies, so does all your data, unless you have an identical spare controller card (buy it up front, they won't exist in a couple years).

      That's just plain wrong. Using RAID 1, your data is fine, not matter what controller you use. It may be a pain to extract, but your statement is misleading. Recovery does become more difficult if using any other RAID other than a plain mirror.

      If recovery is a primary concern then mirror your disk with hardware, and then stitch them together with JBOD if your hardware allows it or with software.

    48. Re:Rotational media by Hatta · · Score: 1

      a cheap LTO3 drive (which can be had for $200 these days)

      Where?

      --
      Give me Classic Slashdot or give me death!
    49. Re:Rotational media by Anonymous Coward · · Score: 0

      I concur on this point, online storage really makes the most sense. Cheap, high performance (for sequential read/write) and easily expandable. You can get a single machine with dozens of SATA drives in it (including the drives) for way under 5 figures.

      I think keeping the last few weeks/months worth of data online makes sense, but anything more than (say) a year probably has diminishing returns. You also have to worry about backups (multiple drives to go south, not to mention flooding, etc.).

      So a tape drive may be useful in these longer-term instances. An LTO-4 drive is under $2K these days, and tapes are under $100 (holds 800 GB native). LTO-5 is still kind of pricey (though often under $3K; 1.5 TB per tape), but once it comes down, you can replace your LTO-4, and still read your old media.

      If you're running Linux/Unix-y OS, you can combine the software RAID and add Amanda or BackupPC to the mix to copy/move files to tape.

    50. Re:Rotational media by Anonymous Coward · · Score: 0

      The extra computing on the main CPU and the extra I/O overhead to multiple devices instead of one takes its toll.

      The first part yes, the second part no. The same amount of data is going across the bus either way.

      Depends on which bus, RAID level, and operation you're talking about. If you're talking storage "bus" (e.g. SAS, SATA), then the transfers are going to be the same. It's also the same for a read from a fully redundant array (or degraded RAID1). Once you need to reconstruct the missing blocks or do writes, differences emerge. If you're talking about the memory/system PCIe, the hardware RAID usage is significantly lower. You need to move only the data you're writing for hardware RAID. Software RAID1 means you have to move that data twice. For software RAID5, you need to read the old data blocks you're replacing and the corresponding parity, and then you need to write the new data and new parity. 2 reads + 2 writes >> 1 write. Add another read and write if you are doing RAID6, plus having to do a Galois transformation without hardware acceleration (or a lot more reads to do orthogonal parity). The hardware RAID needs to do the extra reads and writes too, but they stay on the card's cache and local buses and don't compete for system resources.

    51. Re:Rotational media by Anonymous Coward · · Score: 0

      Ideally you want at least 2 different geographic regions for your backups. If you have one at your office, and one at the bank next door...a natural disaster that takes out one will likely take out the other as well. Encrypt the second copy and send it to someone you know on the other side of the continent/world.

    52. Re:Rotational media by oneiros27 · · Score: 1

      And there's also 'nearline'. (when it's a jukebox or similar tape robot so you can request stuff to be mounted without going and manually loading it).

      --
      Build it, and they will come^Hplain.
    53. Re:Rotational media by Anonymous Coward · · Score: 0

      p>A very important note to keep in mind... stay away from hardware RAID! When your controller dies, so does all your data, unless you have an identical spare controller card (buy it up front, they won't exist in a couple years). The same goes for fake RAID (ie, software RAID driven by the BIOS), but s/controller card/motherboard/g;. Pure software RAID (ie, using mdadm) is a safe bet.

      Many Linux distros will automatically reconstruct and allow you to mount raid arrays so the data can be transferred off. I had a situation in the past where my Striped Array failed due to a mobo death. Ubuntu detected and mounted the array in seconds.

    54. Re:Rotational media by WuphonsReach · · Score: 1

      Ext2 is fine on Linux, but I haven't found any other OS with a halfway decent implementation, though. Sure, there are plenty of half-working implementations, but fsck on ext2 outside of linux? Forget it. UFS/FFS might have worked, but interoperability between implementations is pretty damn poor.

      And besides, how would you have known, 20 years ago, that FAT and UFS would still be in-force? Maybe you thought some other OS was going to survive in the long-haul. Okay, well it didn't, nothing else knows the filesystem, and even if it ran on x86, your ancient OS doesn't have USB support, and is quirky enough it probably won't run on a modern PC... Hell, even tried to load Windows 95 on an K7 or better? Sure, there are patches, but only because Windows 95 survived.


      Twenty years ago most file systems were proprietary and the implementations were all closed-source or wrapped up in NDAs.

      Less of an issue these days, especially with the open-source file systems. Ext2/Ext3 are probably the safest bets at this point, maybe FAT/FAT32 (because it's such a simple file system and easily reverse engineered well enough that you can read the data).

      And frankly, for long-term storage - you do not just put the data in a box and assume that it will be readable in 20 years. Every 3-5 years (tops) you need to be pulling those long-term archives, validating them, and possibly moving them to newer technology.

      --
      Wolde you bothe eate your cake, and have your cake?
    55. Re:Rotational media by pnutjam · · Score: 1

      Agreed, RAID is a waste of time for most people. Never use it on a desktop, consider it for a server, but not for an archival server. RAID will just propagate any errors to both drives and SATA drives are fast enough that your network will be the bottleneck.

    56. Re:Rotational media by pnutjam · · Score: 1

      Stay away from RAID, mirror the data with Rsync or something similar so that you have a chance to catch corruption. If this is archival data it should not be changing that often. You only need to refresh your mirror when you change your data.

    57. Re:Rotational media by pnutjam · · Score: 1

      Offline HD has replaced tape for almost every application. It is so much cheaper and more reliable. The only place tape makes sense is extremely large data sets that use robotic tape changers, and something I want to vault for 30 years. No other technology can sit for 30 years and still have a very good chance of being readable.

    58. Re:Rotational media by pnutjam · · Score: 1

      Maybe we care about recovery time?
      Tape backup is substantially slower then hard drive backup. Recovery time is a nightmare. Even recovering one file take forever on tape compared to HD.

    59. Re:Rotational media by pnutjam · · Score: 1

      That's why archive need to be maintained. I could have moved the same data, verified it, and maintained it for 20 years and still spent less then someone invested in one tape drive.

    60. Re:Rotational media by pnutjam · · Score: 1

      well said!

    61. Re:Rotational media by pnutjam · · Score: 1

      His solution would be better if he recommended putting in the drives and using rsync or a similar command to mirror the data. ZFS is too new for archives and RAID introduces too much complexity.

    62. Re:Rotational media by greed · · Score: 1

      Just because it's a hard drive, doesn't mean you have to put a filing system on it. (Other commenters have covered the future-life of filesystems.)

      Heck, just because it's optical doesn't mean you have to use ISO9660 or UDF.

      I still have a script that hooks into the user-exits in GNU TAR to bridge a TAR across multiple optical discs. You can read it back on any OS which lets you get block access to the drive, which is at least Solaris, AIX, Linux, BSD and Mac OS X. (I've read back the burns on all of them. Burning means you need cdrecord, which, hey, works on all of those as well.)

      You can do the same thing with HDDs. Sure, filesystems are convenient, but they're not necessary--neither is partitioning. A hard drive is a random-access block storage device; an optical drive is a sequential-access block storage drive with fast seeks; and a tape drive is a sequential-access block storage device with slow seeks.

      Which means, if it works on tape, it will work on optical. If it will work on optical, it will work on hard disk.

      Pick the combination you think suits your future needs and STORE THE DOCUMENTATION WITH YOUR ARCHIVES.

    63. Re:Rotational media by jp10558 · · Score: 1

      Naw, encrypt it and post it to Usenet with appropriate PAR2 sets. Now you've got free geographical redundancies... Re-post the dataset every 700 days or so to keep it on spool...

      --
      Opera, Proxomitron-Grypen,GPG 0x0A1C6EE3
    64. Re:Rotational media by godefroi · · Score: 1

      Repeat after me: "RAID IS NOT A BACKUP STRATEGY" Do not use RAID on a backup device. Do mirror your data onto multiple media sets, but do NOT acquire a dependency on any specific hardware (RAID controller) or software (software RAID implementation) to do so.

      --
      Karma: Poor (Mostly affected by lame karma-joke sigs)
    65. Re:Rotational media by SuperTechnoNerd · · Score: 1

      Not to mention proprietary hardware raid drivers. I agree, I use pure software raid, for my needs the overhead is negligible.

    66. Re:Rotational media by Cytotoxic · · Score: 1

      Seconded. And since cost is not the main consideration, I'd toss in a commercial online backup/storage solution as an added offsite protection. You might not want to rely on commercial online storage for your sole backup as the vendor might go out of business, but as a part of a tiered backup solution, it is a pretty nice option.

    67. Re:Rotational media by Spazmania · · Score: 1

      A 6-disk raid-5 has at least 12 times the activity on the memory bus and at least 6 times the activity on the card bus as the same 6-disk hardware raid 5: one pass computing the parity bits and a second pass pushing the block out to all the disks. That's assuming you're reading sectors linearly. If you peck a sector here and a sector there your software raid 5 pulls the entire block into main memory and change it.

      So no, it isn't the same I/O in and out of main memory.

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    68. Re:Rotational media by LordLimecat · · Score: 1

      Look on ebay, get one used. Even brand new theyre only about $900, and you can do far far cheaper for the older tech.

      And I meant to suggest LTO2, since that is even cheaper, and still adequately supports OPs requirements.

    69. Re:Rotational media by LordLimecat · · Score: 1

      2 things.

      One, LTO3 read speeds are around 80MB/sec, which is not awful.

      Second, when youre picking an archival method, "time to recover" should be really really low on the priority list unless youre talking "multi-day". For 60GB of data, even LTO1 (which is 1/4 the speed of LTO3) could perform a full backup or restore in 60,000 / 20MBps = 3,000s, or 50 minutes.

      Seems to me that far more important than whether it takes an extra 5 minutes to restore, is whether your data is actually restorable.

    70. Re:Rotational media by LordLimecat · · Score: 1

      Agreed, cloud backup is a really nice secondary backup in a lot of ways, particularly because it often gives you fine-grained versioning, and satisfies the "backups should be offsite" requirement.

      Go with www.crashplan.com (using their "PROe"), and you can be your own cloud, and host "online backups" at another location.

    71. Re:Rotational media by LordLimecat · · Score: 1

      Offline HD has replaced tape for almost every application. It is so much cheaper and more reliable.

      Lets do a quick comparison.

      Price:
      SATA 2TB drive: $80, or $0.04/MB
      SATA 1TB drive: $54, or $0.054/MB
      LTO5 3TB tape: $68, or $0.023/MB
      LTO4 1.6TB tape: $25, or $0.015/MB
      Even if you drop the "compressed vs native", LTO4 is still $0.03/MB, about 75% of the cost of sata at its best.

      Durability:
      SATA connector: rated for 50 connections
      LTO-5 tape: rated for 5000 loads/unloads

      Complexity:
      LTO5 tape: spool of magnetic media
      SATA drive: Spinning platters, motor, controller, read head. Very sensitive to bumps during operation.

      You go with your disk-backup; Im sticking with tape. Enjoy your added costs of removable drive chassis.

    72. Re:Rotational media by pnutjam · · Score: 1

      Yes, but I can backup most small businesses completely, with redundancy, for less then or equal to the cost of 1 tape drive. If I ever need to restore a 1TB server I will be done in afew hours, you will still be running your tape restore the next day (and probably the day after that too).

    73. Re:Rotational media by pnutjam · · Score: 1

      If you work with tape every day and test your backups, you can probably be assured that your tapes will restore fine. However, most people do not. They either unknowingly, or knowingly do not test their tapes. Testing tapes requires you to read from them, did I mention how slow that is? Hard Drives are a tried and true solution for most people under the 50TB mark. I don't argue that tape has a place, however, it's place is not small business or home users.

    74. Re:Rotational media by Fjandr · · Score: 1

      Well, yeah. RAID isn't a backup solution, it's an availability solution. Unless you're using something relatively unknown, it's not hard to get a replacement controller. Even if you can't, you can recover your data from your actual backup solution when you replace the failed RAID system.

      Anyway, if you really need high availability, there should be mirrored RAID controllers so a single controller failure doesn't halt your operation.

    75. Re:Rotational media by Deadplant · · Score: 1

      that doesn't look right.
      you used MB where I think you meant GB (but that doesn't matter for comparison purposes)
      It looks like you're using 1000 GB = 1 TB for your calc so I'll continue with that.

      the LOT05 cartridge has a capacity of 1.5TB not 3
      So the cost is $0.045 per GB

      the LTO4 has a capacity of 800GB
      So the cost is $0.031 per GB

      The large tapes are more expensive than the sata drive but very close and certainly cheaper than I expected.
      Of course they externalize the cost of the tape drives but the cost of that would depend on the size of your operation.
      For a huge organization with thousands of tapes it is a small cost.
      For a small org or an individual with say 30 tapes it is huge. To back up a 45TB collection (very large for an individual or small business) a $1600 (from newegg) tape drive would add $53 to every tape. So the real cost would be more like $0.081 per GB for the tapes.

    76. Re:Rotational media by Anonymous Coward · · Score: 0

      I wouldn't say so much that you are disagreeing, just that you have an exception to a generally rule of safety.
          Which is: if you don't know, use a software raid for anything but mirroring.

    77. Re:Rotational media by mattack2 · · Score: 1

      What does "scrub it regularly" mean? (no soap jokes)

      Sounds like you mean some sort of regular (tedious) maintenance. Is that sort of maintenance needed with RAID too? I'm not sure, that's why I'm asking. (I thought, at least with the 'right' type of RAID, it tells you a drive died, you plug a new drive in, and it fixes itself..)

    78. Re:Rotational media by LordLimecat · · Score: 1

      It looks like you're using 1000 GB = 1 TB for your calc so I'll continue with that.

      That is correct. I didnt feel like it was necessary to complicate things when it would have no effect on the comparison.

      the LOT05 cartridge has a capacity of 1.5TB not 3

      That is technically correct, and I addressed that below the price comparison.
      However, in practical usage it DOES approach 2:1 compression on data (I've used LTO3 and 4 drives for quite some time; they almost always hit at least 80% of their "compressed" rating).
      And you are wrong that tape is more expensive than sata-- LTO4 as you noted is 0.031 per MB in native capacity, so even if we ignore LTO hardware compression (which is nothing to sneeze at), you have 75% of the cost of your cheapest, lowest-common-denominator SATA drives-- assuming you pony up for 2TB drives, rather than the even more expensive 1TB drives.

      So the real cost would be more like $0.081 per GB for the tapes.

      Thats not quite fair, because you havent calculated the enormous cost of trying to back 45TB up to hard drive, and keep any kind of decent rotation going, not to mention parity and the rest. I might as well start talking about the cost of a controller capable of handling 45TB of storage (a 16-slot LTO5 autoloader can be had for around $4k-- thats ~48TB, while a 16-channel controller costs around $1200 by itself).

      Even for smaller customers, I recommend tape-- LTO2 @ 200/400GB capacity is around $200-500 depending on where you get it, and the tapes are very cheap to replace. And nothing really beats having multiple copies.

    79. Re:Rotational media by Daniel_Staal · · Score: 1

      It's a ZFS command, not all that tedious. Just run it from a cron job. ;)

      It goes through every block on the filesystem and re-computes it's checksum, and compares it to the stored checksum. If they differ, it initiates repair. (Pulling the block from a mirror, if possible.) ZFS will also do that on read, but if you are doing long-term backup you may not be reading things all that often. Scrub will do it for everything. Note that in ZFS directory mapping blocks and so on are stored in the same way as other blocks, so this also makes sure that the directory structure is good.

      How long it takes depends on how much space has been used (not the total size of the array; it only needs to work on actual used blocks) and how fast your processor is. It typically gets done as a background process. It'll eat a bit of performance while it's running, but that's about it.

      This catches minor errors, and bad blocks. If your drive dies hard the scrub will report it, but it's not the only way to notice. Either way, you can swap the drive if you need to: ZFS can in modes equivalent to RAID 0,1,5,6,0+1, and a couple of others that don't have direct RAID equivalents. (RAID 5 with 3 parity, multiple mirrors in level 1, etc. Nothing majorly different there.)

      --
      'Sensible' is a curse word.
    80. Re:Rotational media by LordLimecat · · Score: 1

      Testing daily is less important as you have more copies of your data. It seems far more silly to have a single copy of your data on HDD, and test it daily, then to have 20 copies but test once a year.

      And there is a reason tape is considered archival media, while HDDs are not (even on their best day).

    81. Re:Rotational media by LordLimecat · · Score: 1

      If I ever need to restore a 1TB server I will be done in afew hours, you will still be running your tape restore the next day (and probably the day after that too).

      You very clearly have NO experience with tape.

      1TB tapes (LTO4, LTO5) have speeds around 150MB/s+, and are usually on either a SAS or SCSI adapter which means their overhead is far less than your sata drives. So for 1TB of data, it will take about 2.5-3 hours to do a restore.

      As for redundancy, presumably youre not using desktop-grade drives (what with their tendency to cause raid controllers to mark them offline on a bad block), and presumably youre using a raid controller, so for 1TB data we're already at $500 between the cost of 2x RE4 drives and a suitable controller; plus if you want multiple copies, you now need hotswap chassis (anyone saying "sleeveless hotswap" should be ignored, as sata connectors are not rated for many insertions) and backplane. Cost is rising, and you have not addressed potential issues like "changing drive letter (or identifier)" depending on OS.

      For instance, Is the backup drive D:? E:? Is it /dev/sda, or /dev/sdc? What happens when someone plugs a flash card in, and the OS assigns it the identifier you were intending to use for backup? Cant use UUID, since we are (presumably) backing up to multiple different media for redundancy.

      And you also havent solved the whole "disks are inherently more complex, have more moving parts, and are more breakage prone".

    82. Re:Rotational media by toddestan · · Score: 1

      I haven't found any of those adapters that allow you to input the head/cylinder/landing zone/etc information. So basically they only work with drives that auto-configure (properly), which rules out most drives made before about 1995 or so. Anything older and you pretty much have to hook them up to a real IDE port and use the BIOS to manually configure the settings.

    83. Re:Rotational media by maxume · · Score: 1

      My recollection is that the dell auto-configured the drive, but it was a while ago and I was more interested in dumping the data than I was in the process.

      --
      Nerd rage is the funniest rage.
    84. Re:Rotational media by Anonymous Coward · · Score: 0

      Recording engineers and producers have this problem. Record companies have made billions from re-releasing old recordings, so there's a major incentive to do it right. The Audio Engineering Society recommends (I believe in aes22-1997), storing multiple copies on at least two types of media. I mainly see bare hard drives and tape being used.

    85. Re:Rotational media by JMJimmy · · Score: 1

      Tape is great as a backup medium there's no denying that, backups are actively refreshed. The question was talking about archiving which is a completely different set of issues. The biggest issue is how long can the object sit on a shelf unused before it starts to corrupt the data. HDD has an active life of 5 years but a shelf life of 10 years, tape has a shelf life of 5 to 10 years. Tape also has the disadvantage of sequential access. It's the preferred method for professional archives because of this fact. It's very easy to transfer all the data sequentially from one tape to another, the data transfer rates are faster, and compression makes a big difference. For home archives though those aren't really considerations, cost, ease of access, and reliability long term are the more important issues.

    86. Re:Rotational media by evilviper · · Score: 1

      Less of an issue these days, especially with the open-source file systems. Ext2/Ext3 are probably the safest bets at this point, maybe FAT/FAT32 (because it's such a simple file system and easily reverse engineered well enough that you can read the data).

      As I said, Ext2 SHOULD be compatible, but even today it is not. 3rd party implementations are horrible and flaky. Trying to use an ext2-formatted USB drive on both Linux and FreeBSD quickly turned into a nightmare. You obviously haven't tried it, and ignorance is bliss. The world doesn't work the way you think it should...

      The fact that the reference is GPL'd means it's not spreading as much as something with a freer license would (eg. UFS/FFS). This has been seen time and time again. Sadly, I wouldn't bet on UFS today, either, as even Linux has a complete crap UFS implementation that I've never seen work.

      Hell, if you think open source is going to guarantee comatibility, why aren't you recommending MINIX-FS? It's open source, and has been for years, so it should be readable everywhere by now.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    87. Re:Rotational media by evilviper · · Score: 1

      Just because it's a hard drive, doesn't mean you have to put a filing system on it.

      I'm very well-aware of that. I've used tar directly on floppies innumerable times, and plenty of other tricks like writing files directly onto block devices. In fact I just recently stopped recording ~820MBs of PAR2 data to CDs in Mode2/XA as my corrupt media recovery method (RAID-5 on CD-R, really).

      However, when people talk about using hard drives for backup, they are talking about a file system, and for good reason. Hard drives don't have any advantage over tapes in cost/GB, speed, shelf-life, etc. What they do have is their quick and easy recovery, verification, reuse, and updating (eg. with something like rsync). Once you start using TAR rather than a file system, you lose just about all of the advantages of HDDs.

      And besides that, you're not talking about TAR, anyhow. You're talking about proprietary GTAR features that aren't found just about anywhere else. So, instead of betting that EXT2 will be around in a couple decades, you're betting that GTAR will be around in a couple decades. That may be a somewhat safer bet, but not by much.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    88. Re:Rotational media by cthulhu11 · · Score: 1

      HP assumes that *everything* is a desktop machine plugged into a tube and keyboard, even rackmount systems. They nickle/dime everything they possibly can, including functionality on their HBA's and service processors. Yes, one has to give them ~$150 to unlock features on a FRIGGIN HBA. The third strike: their tech support is offshored, with all the worst features that brings: techs who call during their working hours (not yours), who you can't understand anyway. Techs who don't read the ticket you enter and give non-sensical, non-sequitor answers that are clearly cut/pasted from a list.

    89. Re:Rotational media by cthulhu11 · · Score: 2

      We had a single JBOD made by Nstor, who were bought by Xyratex. Xyratex in theory serviced it, but we had to rip the interface board out and send it away for several weeks, no advance or on-site replacement, and when it came back, it was still broken. This array actually had two disks per tray, which meant that for online servicing one had to know that in advance (not always obvious when one is remote) and lay out volumes very carefully. Mind you this wasn't a Xyratex product per se, but the way they handled it didn't impress me. The Linux md driver is meh, but at least it doesn't require compiling metadevice layout into the kernel. It's there and it works, but like SVM/SDS/ODS from 15 years ago it's dated and limited, and mdadmin et al are clumsy at best. "Hardware RAID" means different things to different people. Two main divisions: o HBA RAID. Claimed advantage in that mirror/parity writes don't clog up the host channel. Another is that on systems with anachronistic BIOS stupidity, booting from a mirror when the primary fails can be difficult or often impossible to set up. Big disadvantages: - No model that I've yet seen does 3-way mirrors. None. Disks fail, and more than once I've had a mirror fail while replacing the other side. It can take time to get bad disks replaced, and if remote local hands yank the good side of a mirror pair instead of the bad, you're screwed. - SPoF. Volumes can't span HBA's, and yes, HBA's do go bad. - Crummy monitoring. Some vendors supply CLI's for some OS's, but ongoing support is a uncertain, and the interfaces are always downright horrid. raidctl, for example, on my systems behaves in at least three distinct ways. - Want to use one for the boot device? Be prepared to hit a multi-key sequence during a split-second at the BIOS level after POST. These often assume that you're sitting in front of a desktop, and can enter Alt-keys or function keys to invoke. Using a remote serial console? Sorry, you're screwed. o Chassis RAID: SPoF unless 2-3 identical arrays are mirrored in software. No monitoring to speak of. If you get a serial console, it may need a proprietary adapter cable, null-modem, *and* DB9-RJ45 adapter, as many mistake a 1/8" round headphone-style jack for a serial connector. Linux storage is miserable, still stuck in the mindset of a desktop user with 1-2 drives. The sda/sdb/sdc/etc. naming convention is meant to echo legacy MS-DOS drive letters, and obscures vital information about which drives are in what slot of what array. cXtXdX FTW. When adding an HBA or disks, existing disks can even suffer name changes. ZFS / btrfs is *desperately* needed, but there is no indication of viability anytime soon.

    90. Re:Rotational media by cthulhu11 · · Score: 1

      Hardware RAID indeed requires a software mirroring layer on top of multiple HBA's/arrays to be really trustworthy.

    91. Re:Rotational media by AmiMoJo · · Score: 1

      The OP doesn't say what OS he is using, but if it is Windows then tapes might not be a good options. Tape support in Windows was always poor and was eventually dropped with Vista and Server 2008. A few years back I had to get tape backup working with Server 2008 and ended up having a headless Linux box that 2008 wrote backups to over the network and then a cron script spooled them out to tape.

      Anyone tried tape with MacOS X?

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    92. Re:Rotational media by LordLimecat · · Score: 1

      And for reliability, which are you more confident of, a spool of tape being mechaincally able to unspool, or a harddrive's motor spinning up, its platters rotating, being dust free, the controller correctly operating, and the readhead moving freely?

    93. Re:Rotational media by pnutjam · · Score: 1

      Nobody is disputing tape has a place, however, I wouldn't touch it or recommend it for most people for the reasons I outlined above. Even if I was archiving to tape, I would stage to HD.

      Obviously you know how to use tape, I am a little rusty. However you must not have worked on any non-enterprise equipment in a long time. I do use off the shelf hard drives, I don't use RAID. RAID is stupid and useless for a backup system until you get into the hundreds of TB realm.

    94. Re:Rotational media by LordLimecat · · Score: 1

      Having a single copy of your data on a single external medium with no RAID is asking for one of the following scenarios:

      A) power surge across your SATA power lines. All drives knocked off line, backup media is also offline, need to go to data recovery lab to repair controllers.
      B) Data corruption or deletion occurs prior to backup, goes unnoticed for 1 week (or more). Backups now consist of the "bad" data that you need to roll back from.
      C) Your backup drive dies, followed by a failure of your main drive (Not uncommon, if both are installed in the same time frame and are from the same vendor). You now have no data.

      THIS is why tape is king IMO-- not praying that your one backup medium is OK.

    95. Re:Rotational media by pnutjam · · Score: 1

      Your scenario would not happen in any disk based backup I configured. You should have at least 3 to 4 copies of your data:
      1. live data
      2. online backup
      3. offline backup 1
      4. offline backup 2

      Due to rotation, you need two sets of offline disk. All my disks use rsync with hard links. This has several benefits. First of all it saves space, so if I am backup up 1 TB, I only need about 1.5TB to maintain 30 backups. Second, Each of those backups is a full backup, there is no load the full backup, then run 3 days of incrementals. Each backup is "full", however it only takes up the space necessary for any file changes from the last backup (try doing that with tape). My online backup will generally maintain a 30 day history. My Offline backups will probably contain a weekly, monthly, and annual. Since I am reusing space, I can probably go back 5 years without storing 5 times my online data.

    96. Re:Rotational media by Divebus · · Score: 1

      I have to agree with LordLimecat. LTO tape is still cheaper per GB than hard disks, plus:

      1) Tape has an average shelf life of 17 years or 30 years depending on who you ask.
      2) I've encountered drives which won't start after a long idle time (six years) unless you thump them on the top cover. Whew. They usually start.
      3) Carpet your machine room area with thick padding because dropping the drive out of the cradle may kill the data. I've seen it happen too often. I can throw an LTO tape across the room and recover the data.

      I'm going with tape.

      --

      Most of the stuff on /. won't survive first contact with facts.
    97. Re:Rotational media by JMJimmy · · Score: 1

      lol linking to company's claims does not make it true. The data presented at the IEEE storage conference found the actual life of tape is 5-10 years depending on the quality.

    98. Re:Rotational media by Divebus · · Score: 1

      I'll tell you what's true; I can pull any 20 year old 8mm Exabyte or DDS-1 tape off my shelf and run it. You can link to my claims because I actually do it at this organization that uses a gold vertical rectangular outline for a logo. Do you actually do it or did you just Google some papers? I can Google papers too, you know, and they agree with what I'm actually doing.

      It's all about the storage environment. In 1995, they said this: "Estimates for tape life expectancy may vary from less than ten years to several decades, depending on temperature and RH levels. As a general rule, lower temperatures and drier conditions lead to longer life span." (https://www.imagepermanenceinstitute.org/webfm_send/303)

      Fast forward to 2005 and they said this: : ...at moderately controlled temperature and humidities the projected lifetimes for these tapes is sufficient for archival storage and lifetimes of 50 to 100 years can be anticipated for the more stable ones." (http://storageconference.org/2003/papers/15_Judge-Media.pdf)

      So, don't buy anything from the low bidder. It probably isn't any good. I've taken hundreds of Exabyte tapes and rolled the data to several LTO-3-4-5 tapes over the years. Anyway, I've seen hundreds of drive failures compared to one or two tape failures, usually because of the transport getting hung on eject but you can fix that with a splice back to the leader.

      The bigger problem in the very long term is keeping the technology around to recover any of this media, tape or disk - a working transport, a computer with the right HBA, the right OS, and the right version of archive software, the ability to handle the file system etc. You have to run your own museum of technology. I dug out an old AS-6000 to try mounting a RAID pair of disks a few years ago. Had to pull some AIX expert out of retirement to understand the system. It almost worked except we couldn't resolve the sym links in the data which spanned across the disks.

      --

      Most of the stuff on /. won't survive first contact with facts.
    99. Re:Rotational media by JMJimmy · · Score: 1

      Exactly that, high quality tape storage designed for archiving is not cheap. It's in fact quite expensive in comparison. The typical consumer grade tape is what we're talking about here and in an uncontrolled home environment. In that use-case drives typically last about 50% longer than tape. And I completely agree, I've seen more drives fail than tapes which is why I also recommended backing up to drives in triplicate in case of such a failure. This does several things:

      1) Protects against mechanical failures
      2) Protects against human error (drops/improper handling)
      3) In case of mechanical failure across all 3 drives, if the data is important enough, they could be taken to a professional for repair; even if that meant salvaging parts from one drive to fix the other

      In a professional archiving environment I 100% agree with you, tape is the best choice for now. In a home environment the quality and control is just not there to make tape as reliable as HDDs.

    100. Re:Rotational media by Divebus · · Score: 1

      That's the difference really - determining the long term asset value for your data and considering how much there is to archive. My company expects to pull something off the shelf in 100 years and use it.

      However, every piece of math I've done about storage says LTO tape is less expensive and safer in the long run. That gap may be narrowing, but the first 100TB of storage is still cheaper on LTO. The second 100TB is dirt cheap compared to hard drives unless you use consumer grade drives instead of enterprise grade - and don't make a redundant copy.

      I can store 200TB of data on 134 LTO-5 tapes for about $8,700. Add the $2,500 single external tape drive and you're just north of $10k. That would be fine for a small video facility if they value their data but I can see where a consumer would grimace. I can't tell you how often I've helped a small video facility recover data (if possible) from too many G-Tech and LaCie portable drives. Entire ad campaigns and years of media are GONE and the clients want these guys dead. Since I installed an LTO drive in that small facility, their data loss has ceased - as long as they remember to archive.

      For the consumer, the short term storage offered by a pair of 2TB external drives may suffice. I've got about 25TB I'd like to archive right now, so I'm within striking distance of using LTO instead of drives. My RAID system losing two drives in a week was frightening enough. I nearly lost 10TB in one shot.

      For a professional building a 200TB spinning disk array, you're talking about a Fiber SAN and a bunch of RAID boxes which would cost a minimum of $80k, or more like $180k if you're going to do it right. Not to mention the kilowatts of power to run it, cool it, time to monitor it and pop spare drives in when half have failed in four years (that's about the average for our plant).

      If you're going to do it on single drives (not a SAN), you can put the drives on a shelf like a tape but you had better do at least two hard drives if you want insurance. In the days of consumer equipment doing nothing but data out of HD video cameras, 18Mpixel still cameras, music, movies etc, there will be a capacity threshold where they should have used data tape. Figuring out how much you're going to generate plays into what you're going to use.

      Cheers.

      --

      Most of the stuff on /. won't survive first contact with facts.
    101. Re:Rotational media by JMJimmy · · Score: 1

      Agreed, there is definitely a data threshold for the cost/benefit calculations. I peg it around 30TB rather than 100 but I also air on the side of caution and an extra layer of redundancy that most would consider excessive.

      For a spinning disk setup I'd beg and plead before going to a RAID array. They're wonderful things in the short term but long term there are just too many things that can (and do) go wrong with RAID. I can't argue that they do solve some storage issues that would just not be practical or financially feasible otherwise.

      If I may ask, what's brand of HDD that you've seen fail most often at your plant?

    102. Re:Rotational media by JMJimmy · · Score: 1

      wow... can't believe I just typed that. "err on the side of caution".

    103. Re:Rotational media by Divebus · · Score: 1

      Hitachi. I had a 14 disk array of Hitachi SATA drives where all but three drives failed within a year of turning three years old. I have another array of 50 Hitachi drives that runs fine that's about four years old. The rest are HP labeled 10k 2.5" SAS drives which I believe are made by Hitachi. Out of 144 of those drives, I was replacing about one a week after they turned three years old. I've built about 400TB of RAID storage cabinets using Seagate 7200.11 drives (1.5TB) and have had four failures in three years.

      Whatever the case, the advertised 300,000 hours MTBF on hard drives is supposed to mean that half your drives fail after 34 years. That's an overstatement by an order of magnitude.

      --

      Most of the stuff on /. won't survive first contact with facts.
    104. Re:Rotational media by JMJimmy · · Score: 1

      Interesting, I can't say that I've ever touched Hitachi and it sounds like I was right to be wary of them. Maxtor has always been the trouble maker for me with about an 80% failure rate after 4 years. Western Digital is the primary brand I use and some Seagate. I've only had 1 WD fail in less than 5 years, though I'm not operating on nearly the same scale. Only a paltry 57 computers to backup in a non-intensive environment.

  5. Tape by sirsnork · · Score: 2

    You probably need to define "best". How long do you really want to keep them for, and in what sort of environment.

    Traditionally the answer is tape, and probably will be in your case too for files of that size. Optical isn't proven enough (at least for the sizes your're talking about) to be trusted, and HDD's need to be run up fairly regularly to keep working.

    --

    Normal people worry me!
    1. Re:Tape by uvajed_ekil · · Score: 1

      Why not punch cards? Huh huh... so clever!

      --
      This is a hacked account, for which the owner can not be held responsible.
    2. Re:Tape by afidel · · Score: 1

      For that small of files the most reliable inexpensive method is probably a small LTO2 library with each backup going to multiple tapes. $20 per tape with each capable of holding quite a few projects (200GB native). I just checked ebay and there are a ton of 2x LTO2 libraries for ~$500.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    3. Re:Tape by Anonymous Coward · · Score: 1


      and HDD's need to be run up fairly regularly to keep working.

      Citation needed.

    4. Re:Tape by Local+ID10T · · Score: 1

      Stage it to a HD for ease of access, back up the HD to tape for long term storage.

      Dont forget to test your backups! Just because your software "confirms" that it wrote the data does not mean you can restore the data.

      --
      "You want to know how to help your kids? Leave them the fuck alone." -George Carlin
    5. Re:Tape by Anonymous Coward · · Score: 0

      Tape isn't obsolete. Not by a very long shot. I earn a very good living by managing backups that are stored to tape; if disk really were cheaper/better, companies would have abandoned tape a long time ago. (Hint: they haven't.)

      If you're talking large quantities (hundreds of terabytes) of data that needs to be accessed relatively infrequently (in the sense that you need to work with a particular file, or set of files, for a period of time, and then move on to the next file set, leaving the one you just finished with languished, untouched, for a few months or years), you cannot beat tape. Hard disks have to be kept powered on, and there are issues with spinning them up and down on that, if I remember rightly, reduce the lifespan. With tape, you only need a couple of tape drives powered up; the raw storage sits there quietly, needing nothing more than physical space.

      To the OP: you really need to define your usage patterns. How many of these files are we talking about? (ie: how much total data?) Presumably, by "archive" you mean "move it off my working space, onto something where it won't be touched for an extended period of time" - in which case, I'd buy an LTO drive (LTO2 and LTO3 are reasonable for consumers; LTO5 if you can afford it, and can front it with some very fast drives so the data spools down at streaming speeds) or two, and possibly a tape library to go with it. I'd only bother with external hard drives if money was very tight, or the quantities of data small enough that it would fit on a relatively low (5-10) number of tapes.

    6. Re:Tape by Pooua · · Score: 1

      Yeah, tape is the traditional answer, and still gets a lot of support in the industry. Unfortunately, I have never seen a successful restore from tape. I understand that my experience is not unique. Many companies have made regular tape backups for years, but never tried to do a restore, until that day came when they needed it. Then, they found out that it didn't work.

      I've been backing up data to hard drives for 20 years. I've rarely had a problem getting my data off a hard drive, and redundancy has been a big help. I maintain about 4.5 TB of data on about 7 TB of external HDDs.

      --
      Taking stuff apart since 1969 (TM)
    7. Re:Tape by Z00L00K · · Score: 1

      You can probably print it on archive-persistent paper instead in binary form with ECC coding. Then scan it back into the system whenever necessary.

      The disadvantage would be the huge amount of paper and storage facilities needed.

      Personally I would go for a solution involving 2-3 independent nodes with a RAID array on each and then replication between them. At least one of these nodes shall be on a remote location to cover for any cases where a fire or theft occurs.

      If you don't feel secure enough then you shall go for a "cloud storage" as a secondary backup.

      --
      If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
    8. Re:Tape by Joce640k · · Score: 1

      you cannot beat tape.

      I've seen plenty of tape failures.

      I've also been in the situation where I had tapes but no working tape drive. Buying a new tape drive can be very expensive, especially if it's an 'obsolete' model (and in the tape world the hardware changes frequently). If the drive fails, it's after four in the afternoon and you need the data NOW then you're totally screwed.

      OTOH I never had a problem connecting a hard drive to the nearest PC. Hard drives are so cheap now that making a few redundant copies isn't a problem, redundant copies can be stored in different locations in case of fire.

      Tape most definitely can be beaten.

      --
      No sig today...
    9. Re:Tape by Joce640k · · Score: 1

      I have never seen a successful restore from tape. I understand that my experience is not unique. Many companies have made regular tape backups for years, but never tried to do a restore, until that day came when they needed it. Then, they found out that it didn't work.

      I've been backing up data to hard drives for 20 years. I've rarely had a problem getting my data off a hard drive, and redundancy has been a big help. I maintain about 4.5 TB of data on about 7 TB of external HDDs.

      I'm 100% with you on all that. Tape looks/sounds good on paper but never seems to work when you need it most.

      Hard disks have never failed me so far.

      PS: Always test your backup system to see if it can restore.

      --
      No sig today...
    10. Re:Tape by Xacid · · Score: 1

      I'm actually a little surprised at all the comments voting for tape instead of huge disk arrays or other fancy such configs, but it makes sense.

      I'm curious - have any of you dealt with these guys? http://cache-a.com/index.php It looks like they're geared for video work, can be networked, and runs on tape. Think it could be handy for what the OP is after (despite being graphics as opposed to video)?

    11. Re:Tape by NJRoadfan · · Score: 1

      I got lucky and managed to recover data off of a DAT tape I made 10 years ago. The key is that you have to have the backup software that created the archive to begin with! Also, tape was NEVER intended to be a long term backup medium. Moisture collection over time is tape's biggest enemy and slowly destroys the binder and lubricant in the tape. Tape also requires contact with a read/write head which adds wear and tear, plus the drives have a lot of moving parts (high rate of failure).

    12. Re:Tape by jabuzz · · Score: 1

      Not if you are using enterprise grade tape. So these days that is primarily LTO. Sure even LTO tapes do fail, but having only only copy of something is dumb regardless of what you store it on.

      However as another person said tape is incredibly cheap if you have a lot of data to store. We pay something like 16GBP for a labeled LTO4 tape, that is 800GB raw and when it is sitting in the library consumes zero power. You don't even need air conditioning for a tape library it consumes so little power. Admittedly at work we have a couple of IBM TS3500 libraries with several expansion bays in two physical locations several miles apart.

    13. Re:Tape by Anonymous Coward · · Score: 0

      I agree with this one.

      My own experience is that a hard drive put away for a year will not be usable when you want it. Does anyone have any hard data on this ?

    14. Re:Tape by WuphonsReach · · Score: 1

      Tape only makes sense once you cross the line where the cost of the media outstrips the cost of buying / servicing / maintaining multiple tape drives. Which is somewhere around the point where you purchase that 400th tape. Plus you need people who are technical enough to monitor the operations, deal with broken tapes, remember to clean the tape drives, etc. Makes a lot of sense in the big operations with multi-million IT budgets and a large technical staff.

      For small/medium sized businesses, it's a heavy burden and they're far better suited with removable hard drives or attached external drives. Instead of being a big upfront cost, you can roll it out more slowly, buying a drive or two here, a drive or two there.

      --
      Wolde you bothe eate your cake, and have your cake?
    15. Re:Tape by WuphonsReach · · Score: 1

      Tape backups require care and feeding. Cleaning the tape drives, monitoring for bad tapes, running verify operations on the backups, rotating out old media on a regular basis, constantly bringing new media into the pool.

      Only the bigger firms with the big IT budgets and people who can monitor the drives and backup process for at least 1-2 hours a day can pull that off.

      Everyone else tries to do it on a half-hour a day with someone that is being paid minimum wage because they think you only need to swap tapes as requested and it will magically work. They don't do verify cycles, don't do test restores once a month, and nobody ever looks at the backup report to see if it ran.

      --
      Wolde you bothe eate your cake, and have your cake?
    16. Re:Tape by linear+a · · Score: 1

      I pretty often will put a hard drive away for 1-2 years and use it without problem again. Put some away for 5-7 years, some worked, some woke up dead.

    17. Re:Tape by linear+a · · Score: 1

      You may have a statistical survivor's bias (actually, a non-survivor bias) going on here. Like myself, you probably recall the tape restore failures more clearly than all the routine successful tape restores. I clearly recall the various times when our tape restores have failed. That being said, I'd estimate our tape restore successes to be in the 90-95% range. This is for a mix of regular backup restores and archive restores. Which is why I'm reading this thread to see better ways to archive data.

    18. Re:Tape by MikeB0Lton · · Score: 1

      I was thinking along the same line. Always have two copies of your data at any given point. Most likely, one would be a disk array and the other tape. You might get away with a simple 4 drive NAS, or require a 12 disk chassis with SAS/iSCSI. It all depends on your storage needs. If you can't afford to store it, you can try compression, but you might have to consider whether or not it is worth keeping around.

    19. Re:Tape by HiThere · · Score: 1

      No citation, but I've heard this too. From multiple sources. The claim is that the oil in the bearings becomes too sticky for the disk motor to start.

      True? False? What's "fairly regularly"?

      I don't know, so I wouldn't want to depend on it being wrong.

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    20. Re:Tape by Anonymous Coward · · Score: 0

      Well - let's see...

      1 punch card = 80 bytes
      smallest dataset = 40GB

      40GB/80B = 500million cards

      A box of cards is about 18" x 8" x4" and holds 2000 cards
      500M cards / 2000 per box = 250,000 boxes of binary cards

      Can you come up with an equally ridiculous scenario for tape?

  6. Large removable disk on the low end, tape highend by mlts · · Score: 4, Informative

    BD-R disks are an idea, and relatively inexpensive, but your best bang per buck would be large removable disks in the 2-3 TB range. The reason I state "disks" plural is for obvious reasons.

    I would also use a program like WinRAR with a recovery record, or one of the PAR utilities used for USENET to store your files in. This way, you can tell if there was file corruption, and have a good chance of recovering from it.

    For serious stuff where money is less of an issue, I'd consider a LTO-5 tape drive and multiple tapes. Tapes tend to last longer than HDDs because they have very few moving parts.

    Don't forget to see about copying your archives to new media every couple years. It isn't uncommon to be able to pop a 10+ year old tape or HDD in and pull off the contents... but it isn't uncommon either to find the HDD clicking, or the tape full of hard errors.

  7. Plain old SATA drives by Anonymous Coward · · Score: 0

    Use SATA drives, possibly in hot-pluggable trays. Treat them right, store them right, spin them up periodically, and use a filesystem (like ZFS) that can do data integrity checking. And as others have said, if it's important, mirror it (RAID 1, etc).

    1. Re:Plain old SATA drives by GrantRobertson · · Score: 1

      This is exactly what I was going to recommend.

      A lot of people assume that if one is going to store data on a hard drive then that drive must be powered up all the time. All reliability figures are based upon the assumption that the drives will be powered up and in use for most of a working day. However, if you only power up a drive when you need to store or retrieve data - data that is written only for archival purposes - then the drive could last a life-time.

      In my system I use an external, dual-drive, eSATA connected, setup. (I like this one.) I only turn on the drive when I need to transfer files to it. When I don't need a drive in the dock, I put it in an anti-static bag with a desiccant packet (just as they came from the manufacturer), squeeze the whole thing into a slightly modified old VHS case (I cut out the things that go into the reel holes in the tape), and put it on the shelf - labeled, of course.

      I prefer the dual dock so I can simply do a full-drive copy to make backups of my archive disks. At full eSATA speeds it doesn't take nearly as long or take up nearly as much real-world space as tape, and it is less expensive as well.

    2. Re:Plain old SATA drives by anagama · · Score: 1

      All reliability figures are based upon the assumption that the drives will be powered up and in use for most of a working day. However, if you only power up a drive when you need to store or retrieve data - data that is written only for archival purposes - then the drive could last a life-time.

      Has anyone done any studies of the effects of turning on and off repeatedly? Turning things on is a not infrequent cause of failure.

      --
      What changed under Obama? Nothing Good
    3. Re:Plain old SATA drives by GrantRobertson · · Score: 1

      Has anyone done any studies of the effects of turning on and off repeatedly? Turning things on is a not infrequent cause of failure.

      Even if they have, it is irrelevant to this situation. Remember, one does not turn on a hard drive to archive a major project multiple times per day. Perhaps not even once per day. Only when the project is complete and ready to be archived. Certainly not "repeatedly." Therefore, this is fewer power cycles than a normal hard drive endures. I can only presume that when a reliability test says the drive is on a certain number of hours per day, that they had to turn the thing off during the other hours of the day. Thus, at least one power cycle per day is incorporated into the reliability tests. If I am doing fewer power cycles, then my drives will last longer. Heck, I only turn one drive in my collection on about once a week or so, and only leave it on for a few minutes at a time. So, again, I figure my drives will last a lifetime.

    4. Re:Plain old SATA drives by pmontra · · Score: 1

      This is my anecdoctical experience: I've been using a 160 GB 3.5" hd for daily backups since about 2005. I switch it on in the morning, run the backup script, which also pulls some data from some servers and switch it off at the end of the script. It usually stays on for less than an hour. The only thing that failed was the power switch. Luckily I could replace it.

    5. Re:Plain old SATA drives by linear+a · · Score: 1

      For archival disks (assuming that I turn them off when not actively being used) I'd be more concerned with them going dead on the shelf while not in use. I see comments on that but don't have hard data on those failure rates. Also, that's the sort of thing that seems it would be very dependent on the manufacturer/model.

    6. Re:Plain old SATA drives by GrantRobertson · · Score: 1

      It's not as if they are fruit or anything.

      I have some crazy old IDE drives sitting around that I use for things like storing all the installation files for almost every version of every program I have ever installed. In fact, I have one fired up as I write this. They still work just fine. Sure, they may go bad sooner than other drives - especially if I start running and accessing them all day - because they are older. But I have no worries that any of them will go bad on me just sitting there in the ancient IDE-USB drive enclosure it is sitting in.

      I have been dinking around with computers since the days when we had to store our programs on cassette tape. I have continuously finagled all kinds of drive setups to make use of older / cheaper drives. I have never heard of any modern drive going bad just sitting on the shelf. Sure, I guess it is possible for the magnetic field to dissipate over time.This used to happen on some of the very first PC hard drives (does anyone remember RLL and connecting two data cables to each drive?), back in the eighties, but I haven't heard of it happening since. So, you can just run SpinRite on them once a year or so to refresh the magnetization and you are good to go.

  8. VHS by commo1 · · Score: 0
    1. Re:VHS by Anonymous Coward · · Score: 0

      You're an idiot.

      At 200 K/s an arvid system will take 27 hours to write a 20GB file, and take 10 180-minute tapes. Why not just convert to ASCII and print out hardcopy?

    2. Re:VHS by ZorinLynx · · Score: 2

      Whoosh!

    3. Re:VHS by lucm · · Score: 1

      I'll wait until they have it on Blu-Ray or at least Tivo

      --
      lucm, indeed.
    4. Re:VHS by NJRoadfan · · Score: 1

      Unless its D-VHS. Those store 25GB per DF-240 tape and only takes 2 hours per tape.

  9. HDD + Tape by Anonymous Coward · · Score: 0

    Get a bunch of 3tb HDD's and put them in a raid 6 then you are protected against multiple drive failures and so monthly/ bi-monthly tape backups

  10. Spinning HD by Anonymous Coward · · Score: 0

    Multiple spinning drives regularly upgraded. I have 1.9 TB of my photography that I try to keep archived. I maintain HDs of different brands at work, another set at home and another set at the parents 100 miles away. Copy and duplicate these regularly. Right now integrating 300GB of photos into my system and its a PITA, but worth it. Anything short of the Ruskies frying every HD in America with an EMP burst has me sleeping soundly.

    And though I would love to, the thought of burning some 500 archival DVDs or ~100 Blu-Rays has me staying away from optical media (this is what I used to use, and I currently have close to a 1000 CDs and DVDs of photos and projects sitting in my storage unit)

    same goes for the half TB of music and movies, but my photos are more precious, therefore receive the extra attention.

    1. Re:Spinning HD by linear+a · · Score: 1

      Ruskies could also take a personal interest in just getting rid of YOUR data.

  11. NAS storage by Anonymous Coward · · Score: 0

    Hard disk NAS storage would work best. Spinning disk has been around long enough to make it reliable and cheap. for 250$ you can get a really good NAS setup with 2 to 3 TB.

    1. Re:NAS storage by perpenso · · Score: 1

      Hard disk NAS storage would work best. Spinning disk has been around long enough to make it reliable and cheap. for 250$ you can get a really good NAS setup with 2 to 3 TB.

      And some NAS devices have multiple drives and can be configured for RAID.

  12. No such thing by Orgasmatron · · Score: 0

    There is no such thing as offline storage any more, except as a transient backup of spinning media.

    RAID it and spool a copy out to tape.

    --
    See that "Preview" button?
  13. Bare Drives via Hot Pluggable Trayless SATA by bynick · · Score: 3, Informative

    Screw tape... you pay $2,000 USD for the drive, $50+ per tape for a couple of hundred gigs. Go with bare drive external: Install a trayless SATA bay for 3.5" hard drives... this will run you $12. Buy some bare SATA drives.. these run $50 for 1TB and are available up to 3TB. I buy bare drive hard cases for about $3 each. My Intel ICH10R on-board RAID controller supports hot-swap -- so in effect it's a big 3.5" floppy.. that's right. If your tape drive breaks, you're out another two grand. This is far less expensive, faster, higher density, and random access. In addition, you can boot from it. Want RAID0? Install two trayless SATA bays for a total of $24 and back up in pairs.

    1. Re:Bare Drives via Hot Pluggable Trayless SATA by Anonymous Coward · · Score: 0

      I really want to do this, really bad, but you're way smarter than me.

      If you could provide newegg links to the right items to buy for this, you'd be my hero.

    2. Re:Bare Drives via Hot Pluggable Trayless SATA by Anonymous Coward · · Score: 0

      a slashdot reader that can't handle dealing with hot swap hard drives? wtf?

    3. Re:Bare Drives via Hot Pluggable Trayless SATA by lucm · · Score: 1

      > Screw tape... you pay $2,000 USD for the drive, $50+ per tape for a couple of hundred gigs

      What brand of hardware are your buying? Rolex? A LTO-5 cartridge is about 60$ and unless you format it with a very, very big cluster size it will store 1.5TB, not 200GB.

      > I buy bare drive hard cases for about $3 each

      Ok, I see. For tapes you shop at Rolex but for your "neat setup" you browse eBay until you find one of those Hong-Kong power sellers that ship on the Pacific Princess Express Line. Makes sense.

      --
      lucm, indeed.
    4. Re:Bare Drives via Hot Pluggable Trayless SATA by bynick · · Score: 1

      Rolex - tehe.

    5. Re:Bare Drives via Hot Pluggable Trayless SATA by bynick · · Score: 1

      Coward.

    6. Re:Bare Drives via Hot Pluggable Trayless SATA by LordLimecat · · Score: 1

      If price/capacity is your aim with LTO, you would go with LTO4-- its much cheaper on the drive front, and about 1/3 the price on the tape end for 1/2 the capacity. For $25 on LTO4, you can store ~1.4-1.6 TB. Good luck beating that with hard drives.

    7. Re:Bare Drives via Hot Pluggable Trayless SATA by MikeB0Lton · · Score: 1

      LTO-5 may also be way too fast for the OP, when LTO-3/4 would work fine.

    8. Re:Bare Drives via Hot Pluggable Trayless SATA by Anonymous Coward · · Score: 0

      According to newegg, only one rackmount tape drive available for about $750, with most costing about $2000, some far above that.

      http://www.newegg.com/Product/ProductList.aspx?Submit=ENE&N=100007545%20600026079&IsNodeId=1&bop=And&SpeTabStoreType=1&Order=PRICE&PageSize=20

  14. Drobo... by Anonymous Coward · · Score: 0

    http://www.drobo.com/

    1. Re:Drobo... by sco08y · · Score: 1

      Only makes sense if you're doing small scale stuff, and (because you don't understand depreciation) will want to hang on to old drives.

      Drobo is very flexible, but horribly slow.

  15. External SSD hard drive by Billly+Gates · · Score: 1

    Make sure it is hot pluggable with USB (if one exists yet) as both IDE and Scsi have changed many times with incompatible adapters and cables with different plugs. Odds are they will change again and be unreadable in a couple of years.

    Dvd's have rot in which the metallic thin sheet peels off. They say it is based on UV light damage but I found a Gentoo cd under a dark bed in a blinded room from 2005 that is rotting away as we speak. So BluRay discs are out of the question.

    Another slashdotter mentioned an external hard drive but magnetic interference from the Earth would erase it like my old audio tapes within a decade or two.

    Whatever you choose make sure it is external as USB and Firewire like to remain backwards compatible and this makes it easy to share between machines. Something solid state is the best way to make sure the data remains secure. Or find an internet provider where you can upload it too if you do not mind paying per month or year.

    1. Re:External SSD hard drive by Anonymous Coward · · Score: 0

      I have old hdd's from 93-96 that worked flawlessly when I spun them up for shits n giggles a month back. I agree that optical media is the worst solution. Almost every disc, which I carefully stored after burning at 4x and md5sum'ing, now fails that same md5 less than 5yr later.

    2. Re:External SSD hard drive by Spazmania · · Score: 1

      Burnable CDs and DVDs depend on your brand. I have collected thousands of Verbatims over the last decade and have lost data on very few. The discs in my software kit from 1998 all still work. On the other hand, I bought some Windatas because I wanted a few 9gb discs and was too cheap to cough up the cash. About a 80% failure rate on those after 1 year... not total but portions of the disc are irretrievable.

      Nevertheless, hard disks are the way to go. Much lower hassle factor and the price per gigabyte is approaching parity with burnable DVDs.

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    3. Re:External SSD hard drive by BillX · · Score: 1

      Good points overall, but I have to nitpick about the idea of the Earth's magnetic field wiping hard drives over time. Have a look at coercive field strength for typical ferromagnetic materials used in hard drives (or audiotape for that matter)... weak (below the coercive threshold) magnetic fields like that of the Earth won't demagnetize the media. Heck, the hard drive itself has extremely strong neodymium magnets to move the heads; these park less than 1cm from the platters harmlessly :-)

      As for your audiotapes, extended proximity to some AC-powered device (linear amp transformer in a home stereo?) is the more likely cause of failure.

      --
      Caveat Emptor is not a business model.
  16. We really need..... by pcjunky · · Score: 4, Informative

    Eternally Yours, The case for the development of a reliable repository for the preservation of personal digital objects.

    http://explorer.cyberstreet.com/CET4970H-Peterson-Thesis.pdf

  17. The practices are the same... by fuzzyfuzzyfungus · · Score: 2

    Depends on price: HDDs are crazy cheap, for the capacity; but untrustworthy. However, thanks to the cheapness, redundancy, preferably in multiple locations, periodic testing/copying to newer disks/etc. is fairly affordable. Make sure that you have(either manually, at the utility level, or at the FS level, hashes/checksums) and hope for the best. LTOs are rather more durable, having fewer moving parts in the storage media; but the cost of entry is substantially higher. All the same principles apply, though.

    There are no truly reliable storage mechanisms for large quantities of digital data, only storage mechanisms cheap enough that you can duplicate your way to reliability.

    1. Re:The practices are the same... by hedwards · · Score: 1

      One possibility would be keeping an offline copy on something along the lines of a DROBO and then storing a copy with Crash plan. It's expensive to send disks, but for images that size it's just about the only way to ensure that the data is being properly backed up off site.

      For 125 the OP could back up up to 1tb of data by mail. Doing that weekly is probably a good deal and between those chosen days a copy could be taken home.

      Ultimately with data in that kind of volume there is no good way of dealing with it. But it's going to be a lot more cost effective to deal with disks and a service like Crash plan than it is to just keep a copy on site.

    2. Re:The practices are the same... by LordLimecat · · Score: 1

      Seconding the crashplan suggestion. It is essentially a "roll your own cloud backup" solution that supports versioning, encryption, and automatic verification. Ive used it at several clients, and it has been the least problematic backup system Ive ever used-- with virtually no maintenance it has quickly resolved several dataloss scenarios.

    3. Re:The practices are the same... by fuzzyfuzzyfungus · · Score: 1

      Might want to avoid the Drobo, though. Those things are convenient in that you can switch up the number and capacity of disks without the rigidity of a normal RAID controller; but they are expensive, not very fast, and have some Serious Quirks. For a relatively large body of data, and a setup you really care about, some normal RAID flavor will be cheaper, faster, and way less likely to do something horrible on you.

    4. Re:The practices are the same... by LordLimecat · · Score: 1

      I was not intending to endorse drobo, and if it came across that way I retract it. Aside from having no experience with them, devices with "magical new methods" of doing things with little documentation make me nervous. Nervous for one because "if it dies, can I load up a linux distro and read the data off of the disks?"

    5. Re:The practices are the same... by Anonymous Coward · · Score: 0

      fuck the drobo. they are slow, and opaque how they store your data.

  18. SATA drives by kimvette · · Score: 0

    Buy three SATA drives and make multiple copies, and store them separately. Hard drives are dirt cheap now.

    --
    The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
  19. Tape. by Anonymous Coward · · Score: 0

    You're talking about archiving, which I assume means permanent to long term (10 years or more) storage. I wouldn't trust flash, writable optical media, or rotational media for that time frame. This is what tapes were built for. Say LTO3 will let you put 10 projects on a tape, and new tapes are ~$30 each. The initial outlay isn't super-cheap ($600-1k for the drive), but you'll make up for it in the long run.

  20. Online Storage by esten · · Score: 1

    Why not go with an online storage solution such as Amazon S3 and let them be your backup and not worry about doing it yourself. I know that you can ship them a hard drive so you don't need to spend time uploading data.

    1. Re:Online Storage by lucm · · Score: 2

      The title choosen by the author of the original post: "Best Offline Storage Method For Large Archives?"
      Your answer: "Why not go with an online storage solution such as Amazon S3"

      I suspect that one of you is off-topic, but I also wanted to say that S3 is really a great service and quite cheap.

      --
      lucm, indeed.
  21. Laptop drives by Fordiman · · Score: 0

    Oh, no. I've been in this argument before.

    Ok, so Seagate sells a USB 3.0 SATA drive where the adaptor can be removed and used for other drives. Get one of those, and when if fills up, just start buying 1TB laptop SATA drives. They're small, cheap-ish, and more impact-resistant than their desktop counterparts.

    --
    110100 1101000 1101000 1100110 0 1101111 1101000 1100011 1
  22. 3 hdds by after.fallout.34t98e · · Score: 1

    1: Current online storage
    2: online backups (live, hourly, daily, whatever...): a backup drive ready to take the place of the online storage at any time
    3: offline backups

    every month:
    2 becomes 3
    1 becomes 2
    either 3 becomes 1 or new drive becomes 1

  23. Tape/climate control by triffid_98 · · Score: 3, Informative

    You can't argue with Tape. It's been proven to last since the 1960's if kept in a climate controlled space (dry/cool). Just make sure to keep a spare tape drive handy (just ask NASA), because spare parts for 40 year old tape drives are surprisingly difficult to locate.

    Optical isn't even close, assuming you're talking burned discs. Taiyo Yuden claims a 70 year shelf life, but they have only been around for what, 8 years tops?

    Hard drives are an option if you've built a redundant array, but even with that you're still going to be out of luck if you burn up your raid controller.

    1. Re:Tape/climate control by hitmark · · Score: 1

      Just wish that tape drives where a bit more available in prices ranges outside of the business stuff. Then again, i guess that is where optical come in. Burn two copies, and store them cool and dry. In the end, any media that allow the separation of rw hardware and the actual storage object without requiring a clean room is more reliable imo. This because rw hardware failure can then be recovered from.

      --
      comment first, facts later. http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm
    2. Re:Tape/climate control by timeOday · · Score: 1

      I would just copy each project to two separate external 3TB drives and store one offsite. Each drive is then large enough to hold about 50 projects of the specified size. This gives you redundancy, protection against accidental modification/deletion (unlike RAID), and no single point of failure, and no reliance on obscure or vendor-specific technology. Oh, and it's cheap. So what's wrong with it? Too easy and inexpensive?

    3. Re:Tape/climate control by Anonymous Coward · · Score: 0

      Store archives on many hard drives with much parity data.
      Don't even use a filesystem if you're worried about a bad block in the wrong place corrupting the tree.

    4. Re:Tape/climate control by Lando · · Score: 1

      I don't do enough tape to know what the market is. I was burned so badly on colorado tape drives and qebit? that I've never seriously considered it again.

      --
      /* TODO: Spawn child process, interest child in technology, have child write a new sig */
    5. Re:Tape/climate control by Anonymous Coward · · Score: 0

      Optical not even close? Around for 8 years tops? crack pipe down pls.

      For the smaller projects Blue Ray would be appropriate for backup and in the future would enable better transportability in the next few years as the technology becomes main stream.

      With blueray just pop in the disc(s) and load the project you need. Pack of 25x50GB will cost you $219

    6. Re:Tape/climate control by NJRoadfan · · Score: 1

      Funny, everyone says the 15 year old CD-Rs I have shouldn't work anymore. They all read and verify just fine despite being first generation dyes. DVD-Rs are better then CD-Rs simply because of the added top lacquer coat they all come with. The dyes are also mature and stable and drives universally available, something hasn't been proven with BD-R yet.

    7. Re:Tape/climate control by Anonymous Coward · · Score: 0

      Better than that, encode the data with error recovery software like parchive. Break up the data into little pieces, which comprise up to 76% of five DVDs/blurays. Then create 24% redundant par2 files. You will ensure 100% accurate storage and recovery, even if you lose one DVD/bluray. That backup will be good for way longer than the measly observed 2-4 year MTBF of consumer burnable DVD/bluray media.

    8. Re:Tape/climate control by hitmark · · Score: 1

      Sounds complicated, but probably worth it. Now to set up some kind of easy to use program that can do it all simply by aiming it at the right set of files (and swapping out blank discs as requested).

      --
      comment first, facts later. http://chem.tufts.edu/AnswersInScience/RelativityofWrong.htm
    9. Re:Tape/climate control by gad_zuki! · · Score: 1

      Sure you can. For the price of a 400/800 DLT tape I can get a 1 or even a 2TB drive. I can even put them in a RAID array and monitor them. Or spin them occasionally. No surprises. No loss.

      In the case of a failure, I can send it off to be fixed with a decent chance of some kind of recovery. Tape failure? Just toss it out, no one can service it.

    10. Re:Tape/climate control by greed · · Score: 1

      There were certainly early DVD-Rs that had bad dyes.

      Fortunately, those are easy to spot: the ones I had all became visibly "blotchy" on the data side of the disc. That, and all the ROM drives I had started saying, "No medium present" when I loaded them. That was back when $3/disc was too good to be true... and it was.

      Hmmm, I had CD-RWs go visibly bad like that, too: it looked like the dye "leaked" out of the grooves and made puddles inside the polycarbonate.

      Heck, I've still got 3.5" floppies from the 80s that people say shouldn't work any more, and even some C64-era 5.25" stuff (prerecorded floppies from the early 80s) that still works.

    11. Re:Tape/climate control by NJRoadfan · · Score: 1

      Sounds like you had a spindle or two of those infamous white top Princo discs from 2003-04.

    12. Re:Tape/climate control by Tetsujin · · Score: 1

      You can't argue with Tape.

      You can, actually, it's just not very good at arguing back.

      --
      Bow-ties are cool.
    13. Re:Tape/climate control by Fjandr · · Score: 1

      Hard drives are an option if you've built a redundant array, but even with that you're still going to be out of luck if you burn up your raid controller.

      A RAID array is not a backup solution; it's meant to keep the server up while you replace the failed hardware. Also, an important array should be using mirrored controllers or you should have spares available. Anything else is madness.

    14. Re:Tape/climate control by Caetel · · Score: 1

      Taiyo Yuden claims a 70 year shelf life, but they have only been around for what, 8 years tops?

      The company itself has been around for 60 years. Granted, they haven't been optical discs for all that time, however their oldest discs are coming up for 30 years now, so there should be some indicator of quality by now.

  24. Why settle for one type of media? by perpenso · · Score: 0

    In this current climate, what is the 'best' method of archiving these? Spinny magnets? Solid state drives? USB? Tape? Blu-ray?

    If the data is important why settle for one type of media? At least external HD and tape, maybe external SSD too. Move to newer media periodically.

    My QIC-80 tapes from the 90s are probably unreadable by now. However their contents were moved from tape to CD to DVD over the years. My backups just sort of accumulate and grow over the years. Its practical for me to do this since I religiously keep things in /src and /doc hierarchies and only back these up. I don't bother with operating systems and applications, they can simply be reinstalled.

    1. Re:Why settle for one type of media? by triffid_98 · · Score: 1

      You'd be surprised. Just for grins I restored my (circa 1989) QIC-80 tapes a year ago. No problems at all.

      Optical, I've had both DVD and CD bitrot, even on the old Kodak 'gold' discs.

      ...That said, your point is totally valid. Multiple archive copies is the safe way to go. If you want to be even more secure, go with PAR. PAR or RAR recovery records will tell you when chunks are corrupt and can allow you to recover an uncorrupted copy even if both archives are damaged.

    2. Re:Why settle for one type of media? by NJRoadfan · · Score: 1

      Optical, I've had both DVD and CD bitrot, even on the old Kodak 'gold' discs.

      Use good media, and don't fall for the "gold archival" hype, those MAM-A discs (also sold under the Delkin brand) are/were subpar and overpriced. I'm willing to bet most of those bitrot discs were cheapo CMC Magnetics, Princo, or Ritek made discs. Problem is its getting harder to find known good blank media.

    3. Re:Why settle for one type of media? by perpenso · · Score: 1

      There used to be a guy, maybe there still is, working at a disc duplication facility who would periodically evaluate various manufacturer's media. I haven't checked in years and manufacturing may have moved so I won't mention the old results. Maybe some googling can (re)discover such an effort?

    4. Re:Why settle for one type of media? by perpenso · · Score: 1
  25. Punch cards by JulianDraak · · Score: 1

    Punch cards made out of steel. As long as they don't deteriorate, you're good to go.

    1. Re:Punch cards by Onlyodin · · Score: 1

      How about Punch cards made out of titanium?

    2. Re:Punch cards by linear+a · · Score: 1

      How about carve it onto the moon with big lasers?

  26. Magneto-optical or, cautiously, tape by mentil · · Score: 1

    Last I looked into this, the best format in terms of reliability was magneto-optical. It heats up the disc with a laser before the magnetic bits are able to be manipulated, so it's unlikely to be corrupted by only magnetic interference or only light/heat. You can get a 9GB rewritable disc or 30GB write-once for ~$50.

    There's also tape, which has massive capacities, but every anecdote about tape I've heard ends in "but the tape backup was partially/completely corrupted". Make two tape backups from the source (not copied from one tape to the other) using different technologies from different manufacturers if you absolutely must use tape. Keep one copy on-site for a few days if you'd normally ship backups to a storage center, so that you may not be required to recall the truck if the server explodes and the on-hand copy works.

    --
    Corruption is convincing someone that the selfless ideal is the same as their selfish ideal.
    1. Re:Magneto-optical or, cautiously, tape by afidel · · Score: 1

      Dude, I've put 6400+ LTO 1-4 tapes through 4 different libraries over the last 5 years and had exactly 3 failures to read, two of those cartridges were dropped and so the failure was not unexpected, the third had a manufacturing flaw where an extra piece of tape somehow got into the cartridge. We verify every tape after writing using a different drive in the library and do restores almost daily so it's not like I'm blindly trusting the backup reports. I've restored data from old DLT IV carts that were almost 15 years old without issue. People having problems with tape reliability were almost always using Traven, DAT/DSS, or QIC tape systems, enterprise grade DLT or LTO tapes (or IBM or StorageTek's proprietary formats) have always been reliable.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    2. Re:Magneto-optical or, cautiously, tape by dbIII · · Score: 1

      I'll second that, only with nine track reels from the 1980s that really should have been stored a bit better (hot, humid storage shed for 30 years) but have still been recoverable with care. Out of around four hundred there were two tapes with problems. What should have have been done with those tapes over the years is a copy every few years onto newer media - but in the case of all those tapes they are theoretically a third copy made for transport and the data belongs to the client. It's a bit too much of an expense moving 6000+ reels onto newer media when a tape will only ever be required if the client throws out their copies and then finds out later that they want the data again.
      The cheap 4mm tapes were apparently crap but there have been reliable formats at larger sizes all the way in time from reels to LTO5.
      Tape does mean you've got to get a drive, controller, and a big pile of tapes because a single copy can always get lost even if nothing else happens to it. For short term storage and transport USB drives are good, but once again you can't trust a single one, they are a lot more fragile to transport and they won't last anywhere near as long (a lot more modes of failure and some are more dependant on time, like loss of lubricant or heads sticking from diffusion).

    3. Re:Magneto-optical or, cautiously, tape by Anonymous Coward · · Score: 0

      Magneto-optical drives are completely unreliable, they are very sensitive to dust and temperature variations. While I haven't had any problem with the media itself, I've mantained several drives on clients that failed within a year.

    4. Re:Magneto-optical or, cautiously, tape by inKubus · · Score: 1

      I second LTO tapes if you really want an "archive". Also, storage is a concern, so you'll want to look at iron mountain or access co for offsite storage in a tape vault with Halon. Usually around $25 a month plus tapes and transport.. That being said, you can get a highly probable recoverability by having lots of copies of the data. So lots of drives in lots of locations work. Rsync.net has a really good online service (and it's cheap). I wouldn't recommend any other place since I've worked with rsync and know them to be a good company. But still, unless you have a physical copy on tape somewhere, there's no guarantee. It's all about making it improbable that you'll lose the data. After that, there's insurance ;)

      --
      Cool! Amazing Toys.
    5. Re:Magneto-optical or, cautiously, tape by HiThere · · Score: 1

      So if you throw enough money and person hours at the problem, you can mainly solve it. Check.

      At work we used to do tape backup with 4 current tapes. Last year, last month, last week, and yesterday. Every week was stored until the end of month. Every month was stored off-site until the end of year. Every year was stored off-site.

      Well, it worked. We did have a real problem verifying that the backups were good, and occasionally they failed. (At one point the read head engaged the write head, so that after reading the tape, it was blank. Yuck!)

      But that's a solution that's expensive, slow, and requires a lot of personnel involvement. Not at all suitable for an individual. Also, eventually we got to the point were we were still backing up yesterday's data when people started logging in. Uh-oh!

      I don't know what the current solution is. I expect that individual computers are no longer being backed up, but I don't really know.

      A better answer is needed. Better has dimensions of both cheap and easy. Removable hard disks have obvious attractions, but it's still too slow. My current (personal) system has two hard disks that are synced occasionally with rsync. Not perfect, by any means. I also have the really important data occasionally copied to a usb key. (So it's got to be *really* limited in size.) And a larger chunk is occasionally copied to a DVD. Not a good answer, but better than nothing, and not so onerous that I won't do it whenever I start getting nervous. But this is a SMALL dataset.

      P.S.: I consider relying on current cloud offerings to be ridiculous. Companies frequently go out of business without warning, and do you own the data that's on their computer? If the company goes bankrupt, what assurance do you have that it won't either just be discarded, or sold to the highest bidder at the bankruptcy sale? You sure won't be able to sue them! And I'd bet that their EULA permits them to just quit offering the service without any warning and without any reason, so even bankruptcy isn't necessary, just a change of their business plan. When we relied on off-site storage, we owned the tapes. I.e., the physical media. So it would be illegal for a bankruptcy to sell them off. (Doesn't mean it wouldn't happen, as bankruptcy courts seem to frequently ignore the law, but it's much less likely than if they owned the tapes.)

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    6. Re:Magneto-optical or, cautiously, tape by DocSavage64109 · · Score: 1

      How many times was each tape reused? I find that for smaller budgets, tapes tend to get reused fairly often and they can't really last many write cycles.

    7. Re:Magneto-optical or, cautiously, tape by afidel · · Score: 1

      Our email tapes have been reused ~30 times each, due to the way we were doing legal retention most of the others were only used once. If your data isn't worth $.10 a GB (or less with newer than LTO 2 tape systems) then I have to ask why you are even bothering to generate it.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  27. No Price Ceiling? by krotscheck · · Score: 1

    Carve the files onto titanium plates and store them in an underground bunker somewhere with little seismic activity.

    --
    This signature can save you $400 on your car insurance!
  28. Multiple backups by uvajed_ekil · · Score: 1

    If you have more than a few of these projects, SSDs are not yet a good choice for backups. You say price is no issue, but I doubt you'd want to buy 50 SSDs at their current prices. I'd suggest a few "spinny magnets," perhaps in an array if you need more apce than a couple of terabytes. Pros: low cost per terabyte, reasonable transfer speed, decent reliability, easy to implement as a tried and true technology. And of course for added safety, mirror the drive/drives to a second set. USB (if you are refering to removable thumb drives) would not be your best choice, though tape might be worth considering, especially as a secondary backup. I know nothing about Blu-ray, so I won't comment, though the capacity of the disks is a little low, isn't it?. Personally, I'd go with redundant spinny magnets to prevent having your collection on multiple removable drives/discs/whaetevers that can be lost.

    Hook up a redundant raid array, or two arrays, put them in a safe place, forget it. Tapes or a portable HD array to be taken off-site to guard destruction against fires, tsunamis, tornados, hurricanes, Godzilla, and bombs. How much data you have, and how frequently you will add to the collection, are factors that need to be considered but aren't mentioned here. My suggestion assumes that you will backup frequently and have a lot of your 40-60GB projects. Less data or less need to back it up might steer you towards something else.

    --
    This is a hacked account, for which the owner can not be held responsible.
  29. unlimited money by Anonymous Coward · · Score: 0

    if money is not an object, dump it all on standard HDD and burn discs for backup. i dont know how long you plan to keep it but relevant quote
    "How long will a Blu-ray Disc last?
    It is expected to last 30 years or more when stored at room temperature. The optimum temperature is 68F, and the optimum relative humidity is 40%."
    from relevant source http://www.tapestockonline.com/sotdfubldibd.html

  30. discs and lots of them by Anonymous Coward · · Score: 0

    It doesn't really matter what you choose to backup to. Just make sure you have multiple copies (and are stored in multiple locations) and also look at having some kind of corruption method built in that you can use should you be unable to get the data back (see http://en.wikipedia.org/wiki/Parchive). Also if your going the cd/dvd/blueray method make sure you verify the burns in a separate drive than you did the burn in (different manufacture also helps, md5sums of files is fine)? And make sure you store the media is a cool dry spot that doesn't see the light of day (http://en.wikipedia.org/wiki/Humidor ??) It will take a little extra time but it's well worth it if your data is indeed important.. using a offsite backup service is the easy way if you can schedule how often you will want data offsite

  31. Proven for over 100 years: by Anonymous Coward · · Score: 0

    Put your important work on paper.

    It's been proven to last long and can contain quite a lot of information.

  32. It Doesn't Matter by rally2xs · · Score: 1

    what you do, it'll all be lost in the next giant solar flare that gets shot at earth, or the next EMP attack. Nothing will survive in the way of computer equipment to be able to read it.

  33. Drawn patterns by Anonymous Coward · · Score: 0

    Forgot what it is called, but it is where multi-colored lines and patterns are drawn out on a sheet of paper/fabric/other material. This kind of storage as I recall can do up to several terabytes (maybe up to 50TB?).

  34. The answer is the same as the last time by BitZtream · · Score: 1

    The real answer is ... hire someone that knows what they are doing, as by asking the question you clearly don't.

    Yes, thats a shitty answer that you're not going to like, but its the right answer.

    The longer answer is ... you back it up int he same place you back all your important data up. Which could be done any number of ways.

    Spinning platters is fine if you maintain them, as is every other method of data access under the sun.

    Stop trying to stick the data in some sort of long term storage and just keep all your data active, as YOU MOVE to new storage mediums, you move ALL your data with you at the same time. So you are always using current technology and worrying about pulling those bits off something that is hard to find in 10 years won't be an issue because you'll not be using something hard to find in 10 years, you'll be using whatever is popular in 10 years.

    This is really easy to accomplish.

    You have server A and server B. You work on server A, its close to you, has redundant storage and fast access ... and automatically syncs to server B, which is also full of redundant storage and several thousand miles away from you for disaster recovery purposes.

    --
    Persistent Volume manager for Kubernetes - https://github.com/dwimsey/openshift-pvmanager
    1. Re:The answer is the same as the last time by HiThere · · Score: 1

      That's a lousy answer. It's a good answer for neither a person nor a company.

      Off-site storage is mandatory if you care about data retention, because accidents happen. Fire, flood, tornado, etc. If you really care, then off-site storage at multiple locations. (This is why the joke about backing things up by putting them on an ftp server.)

      I don't have a good answer, but yours is terrible. Active media are more subject to damage than stored media. Many kinds are subject to electrical power surges. You can generally recover from that, but it's quite expensive.

      I can see arguments in favor of tape, of hard disk, even of DVDs (though only for small data sets). I can't see ANY in favor of keeping everything on-line. Even the cloud is preferable (and the cloud, itself, is a terrible answer unless it's your own custom cloud, but then you need to back-up the cloud).

      OTOH, your argument about media not staying constant over a decade is valid. It doesn't however, justify your proposal. It merely points up weaknesses in other approaches. And they ALL have multiple weaknesses. (I.e., I don't know of a "solution". Merely of approaches that reduce the risk to an amount proportional to the amount of effort and cash that you are willing to spend. But EVERY serious approach relies on off-line storage.)

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
  35. You neglect the most important question... by stox · · Score: 2

    How long?

    What is good for a decade may not be good for a century, and vice versa.

    For millenium+ archives, nothing beats punch cards.

    --
    "To those who are overly cautious, everything is impossible. "
    1. Re:You neglect the most important question... by artor3 · · Score: 2

      For millenium+ archives, you better just make it a book. Computers have been around for how many centuries? What's that you say, approximately zero? Well, then we probably shouldn't assume that they'll look the same in ten. Heck, even your writing will probably look like "Hwaet! We Gardena in geardagum, theodcyninga, thrym gefrunon, hu dha aethelingas ellen fremedon," to people in 3011, but there will probably be historians who can manage it. Maybe there'll be historians who specialize in all the different file formats we use today, but I wouldn't hold my breath.

    2. Re:You neglect the most important question... by Anonymous Coward · · Score: 0

      1837 to 2011 is 174 years, so 1.74 centuries. 1.74 != ~0. For long term storage (1k+ years), try papyrus stored in a desert, clay or stone tablets, or copper or gold metal stored in a dry environment. What do i win?

    3. Re:You neglect the most important question... by Muchsake · · Score: 1

      Wrong! Paper tape is more compact and just as durable.

  36. Redundancy by method by Onlyodin · · Score: 1

    The key to data protection is risk mitigation. Depending on how important your data is, you should probably consider employing multiple methods of protection, such as a Disk or SSD based copy with a Tape or Optical based copy.

    Personally, I'd keep a near-online copy by means of an External Drive or NAS device which can be powered down if necessary, but if you want to go further you could lock that in a fire-resistant safe/filing cabinet, but you should definitely have another copy offsite somewhere.

    You could even use an online storage provider? Let them worry about maintaining the hardware? But you still need a second (offline, offsite) copy, imho.

  37. Obvious answer by Anonymous Coward · · Score: 0

    Multiple copies

    Multiple formats

    Multiple locations

    If your data is truly worth something to you, this is the only and best approach. LTO-5, BD-R, and RAID are not bad ideas.

  38. Parts List for This Method by bynick · · Score: 1

    Trayless SATA - http://www.newegg.com/Product/Product.aspx?Item=N82E16817998041 - This isn't the exact brand I used, but this is the style. Do some comparison shopping. The case I use for each drive is the ADIDT HS-1 for 3.5" HD. I bought them off ebay for about half of newegg's price. I couldn't find them listed at the moment on ebay, but there are plenty of hits on the web.. hit google and you'll see the pics.. assorted colors. They're stackable too and have spaces for labels. This is a strong case -- it takes two fingers for me to open the snap. I also print numeric tags with a label maker to stick on the drive for identification in the corner. 1 TB hard drives - http://www.newegg.com/Product/ProductList.aspx?Submit=ENE&N=100007603%20600003269&IsNodeId=1&name=1TB%20and%20higher - my pick is the Western Digital Green drives.. read up on their soft seek technology which made them the quietest drive at the time I researched them. They come in consumer and RAID versions. The consumer version works well for both applications and costs less. For the cost savings using this method, you can double up in drives which is a given for storing any data -- always have at minimum two copies. Because they're just plain drives, you won't need special hardware to read them if your PC is destroyed by natural disaster or stolen. Store one set off-site... safe-deposit box works good. Encryption is a plus http://www.truecrypt.org/.

  39. Data storage vs transmission by Anonymous Coward · · Score: 0

    Spinny magnets? Solid state drives? USB? Tape? Blu-ray?

    One of these things is not like the others. USB is not a way to store your data: it's a way to transfer it.

    On the original topic, though: since you said that price isn't an object, whatever you go with, get enough of it to store multiple copies. Your best storage-certainty per dollar is probably from hard drives for small volumes, and tape for large volumes (since you need to pay for the tape drive).

  40. Torvalds quote by bmo · · Score: 3, Interesting

    "Only wimps use tape backup: _real_ men just upload their important stuff on ftp, and let the rest of the world mirror it" - Linus Torvalds

    --
    BMO

    1. Re:Torvalds quote by Anonymous Coward · · Score: 0

      "Only wimps use tape backup: _real_ men just upload their important stuff on ftp, and let the rest of the world mirror it" - Linus Torvalds

      --
      BMO

      And to improve reliability a hundredfold, send DMCA takedowns every once in a while to every mirror you can find.

    2. Re:Torvalds quote by Anonymous Coward · · Score: 0

      "Only wimps use tape backup: _real_ men just upload their important stuff on ftp, and let the rest of the world mirror it" - Linus Torvalds

      --
      BMO

      So that's why Linux version 0.01 doesn't exist anymore!

  41. 2.5" (Laptop) Drives are Durable by bynick · · Score: 1

    I'll second the 2.5" (laptop) hard drives -- I have seen them take a fall to the floor while powered up and survived. They are extremely durable. My pick is the WD Passport (run the toolkit to remove the virtual driver / backup disk and its as close as you can get to a 'plain drive' these days without the need for drivers or other junk / bells / whistles). It's good in USB 2.0 or 3.0. 3.0 is backwards compatible with 2.0. The 2.0 is $10 less expensive and the cable is a bit lighter. A USB 3.0 cable is comparable to Ethernet in feel.

  42. Dedup or Tape by lucm · · Score: 3, Interesting

    If price is not an issue, a great solution is to go with a data-deduplication device (such as EMC DataDomain or IBM Protectier). If you were to host one unit in your basement and the other in coloc environment far from your home, you could setup replication and have a very reliable archive. Coloc of a 1U device can be quite cheap, I have one of them for which I pay less than 100$ a month.

    If you have a smaller budget, then the best cost-benefit is still found on tape, and it can even work in case of network disruption. Like Andrew Tannenbaum said: "Never underestimate the bandwidth of a station wagon full of tapes". A single LTO-5 tape is very cheap (50-60$) and can store 1.5TB (can easily double that with dedup).

    There are other interesting technologies out there, such as MAID, which you can use as a VTL with a good backup software to maintain a reliable archive, however cheap disks are cheap and in a MAID configuration they might not last as long as typical disks because of the on/off behavior.

    --
    lucm, indeed.
    1. Re:Dedup or Tape by Anonymous Coward · · Score: 0

      If you do automatic replication corruption (file-level) of the primary is automatically replicated on the secundary...
      Specially on critical backups I would never use compression or deduplication, IF an error occurs fixing it becomes a lot harder.

    2. Re:Dedup or Tape by lucm · · Score: 1

      backup != archive

      a lot of people don't make the distinction.

      --
      lucm, indeed.
    3. Re:Dedup or Tape by Anonymous Coward · · Score: 0

      I would love to see someone familiar with this topic compile a price/TB chart comparing tape and HD. I keep seeing the $2000 cost of the tape drive as a big hurdle. By example if I need to backup 2TB then I buy one HD for $70. OK, maybe I need 3 copies but that is still only $210. If it is 100TB, or a 1000TB then the price of a tape drive will become less of an issue.

      Last time I checked those tapes were not cheap either.

    4. Re:Dedup or Tape by linear+a · · Score: 1

      Corollary: Unreliable backup != archive, too.

    5. Re:Dedup or Tape by Anonymous Coward · · Score: 0

      an expensive way to duplicate your virus damaged files. remember the Iloveyou worm

    6. Re:Dedup or Tape by lucm · · Score: 1

      Right now in Orlando there is the 2011 IBM System Storage Technical University. Every year at this event they release market trends and various numbers, and every year tape is still the cheapest solution (per TB).

      However these figures do not take into factors such as labor or flexibility.

      --
      lucm, indeed.
    7. Re:Dedup or Tape by lucm · · Score: 1

      This is a common misconception that backups and archives are the same thing. What is described here is an archiving solution. This does not address backup.

      This being said, if one was to use a deduplication unit, he would also use the proper software and not perform replication blindly. There is a lot to consider when you replicate information; as an example you must take into account the processes that could be using the data. Good replication software is designed to handle this kind of stuff.

      --
      lucm, indeed.
  43. Proper printouts by Anonymous Coward · · Score: 0

    You can always rebuild it if you have to from proper printouts. Get them professionally published to archive quality. Then to save your actual digital stuff, rotational disk archiving. No digital storage medium will be 100% effective so use a Raid array and plan to upgrade it every 3-4 years to a new array of better/bigger drives. Even then though, your absolute best best is that first part, then put them into archival storage in multiple locations.

  44. NAS+2 online stores by keith_nt4 · · Score: 1

    I guess this isn't a very popular suggestion. And you seemed to imply you wanted a local archive for your data, something you do yourself.

    I would just a large iSCSI NAS. 2TB Drives are really cheap these days. FreeNAS even lets you flag a drive as a hotspare so you don't have to as much about failures.

    Then back this NAS up to at least two online storage services. Make sure they're not both the same thing on the back end (like amazon's S3). Actually Carbonite personal can't distinguish iSCSI from a local drive and is unlimited storage for personal use. I'm sure that violates some terms some where but technically it's possible. Pick another high capacity online service for redundancy.

    Also, encrypt the data locally *before* it's uploaded (it's just a good idea).

    You didn't say how much total data you have to archive or how fast if at all it is growing nor how often you would need to access it. I have seen amateurs making 40TB storage servers from component parts. Honestly I can't think of a reason to go with anything other large capacity drives. I assume 2TB drives don't have any where to go but down in price.

    --
    "UNIX is very simple, it just needs a genius to understand its simplicity." -Dennis Ritchie
    1. Re:NAS+2 online stores by Anonymous Coward · · Score: 0

      so you don't have to as much about failures.

      I think you a word.

      edit: CAPTCHA = urinate

    2. Re:NAS+2 online stores by keith_nt4 · · Score: 1

      Good thing I've got you.

      --
      "UNIX is very simple, it just needs a genius to understand its simplicity." -Dennis Ritchie
  45. Why bash the OP? by bynick · · Score: 1

    Why are the OPs in-thread questions hidden? And what's up with bashing him for asking questions? I took the time to answer him/her twice. To the person that did that you make me sick. I hope he/she can find the reply since it met the same fate otherwise this site is a waste of bandwidth.

  46. BitTorrent hash check by Anonymous Coward · · Score: 1

    I would also use a program like WinRAR with a recovery record, or one of the PAR utilities used for USENET to store your files in. This way, you can tell if there was file corruption, and have a good chance of recovering from it.

    If all you want is a by-the-megabyte file corruption check, go with BitTorrent. Create a .torrent of each project directory. You can fill the tracker field with some bogus server name, say http://127.0.0.1./ The beauty of a BitTorrent hash file is that you can pinpoint exactly where a file error occurs in a file, give and take a megabyte or whatever the file chunk you set for your BitTorrent file. This is unlike a ordinary md5 or sha1 check sum where all you know is whether a file is corrupted or not

    1. Re:BitTorrent hash check by QRDeNameland · · Score: 3, Insightful

      Yeah, that's a great solution if all you want to do is detect corruption, but note the GP's point about havng "a good chance of recovering from it". The only way to recover with BitTorrent is to have another copy available to replace any bad blocks. PAR2, on the other hand, is able to recover any random missing X% of data from a dataset as long as X% of PAR2 data was generated.

      --
      Momentarily, the need for the construction of new light will no longer exist.
  47. This is not an advertisement by symbolset · · Score: 1

    It does look like one though, I admit.

    Some time ago I read about the BackBlaze box here on slashdot. Essentially it's a 4U server chassis design that holds 45 LFF SATA drives and a server motherboard, plus the requisite connector bits and power and so on. BackBlaze is a storage provider that offers some online storage service and they designed the chassis to do high-density storage and hired a company, Protocase, to build it. BackBlaze doesn't sell servers, or server designs. They designed it because they needed it and shared it in the hope others would give back design improvements.

    BackBlaze open-sourced the design and authorized Protocase to sell it. I learned about this when I followed up on the story with Protocase because I'm in the server trade and the storage density was intriguing. We went back and forth but I never bought the thing.

    Purely by coincidence I got an email from Protocase just today. They're selling the thing now as a fully built server with everything you need (motherboard, processor, PSU, expanders, drive controllers, etc) -- except drives now for $5395.00 (1-4 units) and $4995.00 (5-9 units). Their website won't sell it, you have to contact lpodgursky@protocase.com via email for how to buy this because they're not geeks like us - they bend sheet metal for a living. At the time of the slashdot story this would store 67TB, but nowadays it's twice that. 3TB drives now cost $120, which would be $10,800 roughly for 135TB raw or probably 110TB usable - which puts it at $100 per served terabyte. Some folks would consider that a bargain. You'll want the 10Gbps links as that much volume will be link constrained for volume migrations. For storage density that's 1.35PB (raw) per rack, which is about as good as it gets right now. Bring cash or AmEx because Protocase is a tiny company and can't offer terms for new customers.

    Of course for stuff that's commercially valuable that much data would cost a lot to recreate. I would probably want two of these at least, and store multiple copies on each one. Advances in HDD density should take care of expansion needs and migration needs if your data is currently less than 50TB. For software look into OpenFiler, which is free to use and has commercial support available.

    This is not an advertisement. I don't work for any of these people. I don't care if you buy this thing. But if it was my money and my data and it was worth $50K or more... I'd buy several of these and find some geographically diverse locations to put them and devise a strategy for replicating and migrating my data as the hardware grew stale.

    So as long as I'm posting this... to totally sexy this up with automatically tiered storage for performance I'd add a couple Fusion-IO IODrive Octals per unit with Fusion-IO's directCache software to front this storage with 10TB of SSD cache per 135TB of slow SATA disk. That should get you up to over 1M 512B iops per node if you've stepped up to Infiniband QDR to handle the bandwidth. And I don't work with them either. This last bit will cost several times all of the rest of it. Probably layer lustre file system on top of that for large volume needs. If you need less volume, look into drobo.

    I've already gone overlength for this post, so I may as well go completely nuts. So here's some of Lewis Carroll's "Alice in Wonderland":

    * "Just the place for a Snark!" the Bellman cried,
    As he landed his crew with care;
    Supporting each man on the top of the tide
    By a finger entwined in his hair.
    o Fit the First : The Landing

    * "Just the place for a Snark! I have said it twice:
    That alone should encourage the crew.
    Just the place for a Snark! I have said it thrice:
    What I tell you three times is true."

    "Three times" is a good rule for data. If you put data in three disparate places it's less likely to be lost. Alice in Wonderland is a great reference manual for just about everything. The Reverend Dodgson was a wise man.

    --
    Help stamp out iliturcy.
  48. Individual drives by robi2106 · · Score: 1

    I know it is a PITA, but duplicating across pairs of hard drives is cheap per GB, and allows you to move the data to an offsite location / *SNL Al Gore Voice* lockbox. I have ~2TB of video project data that I store using this method.

    1. Re:Individual drives by Anonymous Coward · · Score: 0

      Concur with the "individual drives" solution. RAID and similar infrastructures bring little benefit in this scenario, as the data is not constantly changing. Individual drives are cheap and flexible. These discussions could go on forever though, as the OP has not specified the total data size, retention time, churn if any, and so on.

    2. Re:Individual drives by Jason+Levine · · Score: 1

      This is how I back up our files. We have about 40GB of photos (not to mention music, documents, etc). The grand total of data is probably in the 100-200GB range. I purchased two 1TB external hard drives. All of our computers are backed up to one hard drive. This drive is then, on a monthly basis, backed up to the second drive. This second drive is stored offsite. If something were to happen to one of the drives, I still have the data in the other one. If something were to happen to my house (fire, burglary, etc), I'd still have the data from my offsite backup.

      I can backup all of our computers and replicate the backup in a night or two (depending on how many hours I dedicate to it). Best of all, it's very inexpensive. NewEgg sells 1TB external hard drives for about $60 each shipped. You can have an effective backup solution for about $120.

      --
      My sci-fi novel, Ghost Thief, is now available from Amazon.com.
  49. Magnetic Tape is still best by Anonymous Coward · · Score: 0

    DLT technology is proven to work for 30+ years, LTO should be comparable but is not proven (ie. it has not existed for 30+ years).
    Most modern writable optical discs (CD, DVD, BlueRay) are usually not sealed and do not use non corrosive material (gold) as data layer, they may or may not keep your data for more than 5 years. Harddisks will keep your data but may have some issues with bearings.

    Paper have proven to be effective long time storage when kept properly but data density is too low.

  50. It takes more than one method by Chewbacon · · Score: 1

    I got an old Dell poweredge tower ($150) and put ubuntu and samba on it (free). Bought a cheap adaptec sata raid card ($65, took some searching) and setup a terabyte raid array (3 500gb drives at, then, $85/pop). I use it to host ghost backups of my desktop, and my and my girlfriends laptops. A raid5 array (and other configuration) means I get an email when there's a failed drive and I can simply replace it (now a drive is about $50). Remember, raid is not backup in itself. So I took it a step further and used a portable drive and ghosts offsite backup feature. So I hook the drive up to my desktop on my first day off work, Ghost backs up existing backups to it and then I keep it in my bag I take to work. So should my house burn down, I have some selected data (software projects, pictures, video, and music) with minimal losses. No single backup method should be considered best. It's a multi-faceted solution that takes some regular, dedicated work on your part.

    --
    Chewbacon
    The Bible is like Wikipedia: written by a bunch of people and verifiable by questionable sources.
  51. Obvious answer by phizi0n · · Score: 2

    OP: "If I remove the price issue (my data is important to me), does this change the choice?"

    ME: If price isn't an issue then you don't choose one, you choose them all.

  52. Quartz Crystal by Anonymous Coward · · Score: 0

    It can hold all of your data, and it will withstand the test of time.

  53. Offline and Online by dr00p · · Score: 1

    Depending on the period you want too keep it.
    Backup to multiple destinations:
    - external HDD/disks/tapes - initial cost, plus some cost to refresh it from time to time
    - Online storage (Crashplan, SpiderOak, Amazon S3....) - will incur a monthly/yearly cost but it' usually very reliable.

  54. Best option by Anonymous Coward · · Score: 0

    I, for one, welcome our data cloud overlords.

  55. FreeNAS is the safer/cheaper option... by Anonymous Coward · · Score: 0

    http://www.freenas.org/ You can use several inexpensive (and not reliable) disks to be used by the ZFS (software-raid) filesystem. Easy to upgrade/replace/manage. Online storage seems the second best alternative for now , IMHO...

  56. you need to answer two questions by Anonymous Coward · · Score: 0

    The first question you need to ask is who you're saving these projects for. The second question is you need to ask is whether or not you need every bit of data in them.

    First question: If you're saving these files for a client, just increase your price to cover the cost of a RAID NAS (or two) and cloud backup--have the storage fee as a line item on your invoice. If they're for friends, family, or people who can't pay, put a dollar value on your own attachment to the projects, then spend that much. And no more.

    Second question: Do you need every bit of data? Can you flatten some of those PSDs? Reduce some of those image sizes? Save some of those tiffs as jpgs? Heck, for some projects I'll bet you can get away with just exporting a couple PDFs (high quality print-ready, high quality web-ready, low quality web ready) and trashing all your IDDs.

    My personal approach would be three fold:

    Short term: save the projects, as is, in the same place as all my vital data--redundancy and everything.

    Midterm: export PDFs, save the PDFs with all my vital data, and move the projects to a less critical set of hard drives.

    Long term: Move the PDFs to a less critical set of hard drives and let the original project files die from bit rot on whatever antiquated hardware you have running in your closet.

    This is my approach for everything--including the extremely nice family history photo albums I made in indesign. No one except me is ever going to want the original IDDs or PSDs. All anyone else will want are PDFs, text, and jpgs.

  57. The best is... by Anonymous Coward · · Score: 0

    ALL OF THEM.

    But yeah, tapes is normally what people are fans of. Ideally you have 2-3 backups no matter what though.

    There is no such thing as too many backups.

  58. Paperback by qIroS · · Score: 1

    Paperback, a printer and some paper: http://www.ollydbg.de/Paperbak/index.html#1

  59. How much total data? Makes a big difference! by billstewart · · Score: 1

    Ok, your projects are about 50GB each, so you can fit about 20 of them per Terabyte. How many of them do you want to keep? If they're something that you generate 100 of them per day, all year, you're looking at a much different solution than if a project takes you a month to tweak all the pixels lovingly by hand.

    For a few Terabytes, just use 3.5" hard disk, make backups, keep one copy offsite. If you want to keep 10 or so TB handy and online, maybe you'll want to do a RAID thing, or maybe you just want JBOD, and 128-256GB of SSD for the project you're working on right now (but you're still copying it to hard disk once it's baked.)

    If you want to keep much more than that online, you'll need to think about fancier storage architectures, and more money. You can go with NAS (Network Attached Storage, a bit pricier, not much faster, high-density disks), or you can go with SAN (Storage Area Networks, much more expensive, blazingly fast, very large.) If you're the bureaucrats who run our IT department, you understand how to support SAN at $8000/TB, and don't have a clue how to support NAS at more like $100/TB, which is ok if you want a big blazingly fast database system, and way out of line if you want to keep a lot of log files that will be Write Once, Read Never (well, Hardly Ever.)

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  60. Backup is not trivial & is more than Media by BoRegardless · · Score: 1

    Lots has been written on the subject of archiving, and with lots of valuable and eventually irreplaceable data, it would pay BIG dividends to read a few books and look at some of the companies that manage data for their living.

    Others here have noted the variables with respect to media, hardware & software and the fact that over time they all change and eventually become obsolete. Then comes the factors of where you store it and how many places do you store duplicates in to prevent fire, flood or whatever war from wiping the cache of data out.

  61. Laser cut titanium punchcard by Anonymous Coward · · Score: 0

    That is the "best" method if you have unlimited money. Laser cut titanium punch cards will last for thousand and thousand and thousands of years. Go for it!

  62. microfilm by Anonymous Coward · · Score: 0

    Print many copies of the data to microfilm (micro fiche) and properly store the copies in various safe offsite facilities.

  63. the sarcasm by reiisi · · Score: 1

    in the group is palpable tonight. Full moon?

    --
    Computer memory is just fancy paper, CPUs just fancy pens with fancy erasers; the 'net is just a fancy backyard fence.
  64. Re:Large removable disk on the low end, tape highe by AmiMoJo · · Score: 2

    I think the bottom line is that no medium is bulletproof. If you really care about the data and money is no object then a combination of at least two different mediums is the way to go.

    Aside from the usual suspects like tape and HDD I'd suggest looking at flash memory. Expensive per GB but also not prone to mechanical problems. Most flash memory states data retention for 10 years, but it is a little bit more complicated than that. Every time you write data to a flash memory device it "refreshes" and the 10 year counter for that data starts again. To be safe you should probably be imaging and re-writing the flash every year or two.

    --
    const int one = 65536; (Silvermoon, Texture.cs)
    SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
  65. images by reiisi · · Score: 1

    I can just see some poor schmuck holding paper tape up to the light, squinting, and reading into a mike attached to a computer:

    one
    oh
    oh
    one, no oh
    oh, darn it, one
    one
    oh
    oh
    one, uh oh

    (Calculating in my head, my memory is about 2 mm/byte, so a terabyte (base 10 tera) would be, erm, about 2 gigameters long.)

    --
    Computer memory is just fancy paper, CPUs just fancy pens with fancy erasers; the 'net is just a fancy backyard fence.
  66. Entropy always wins by grikdog · · Score: 1

    Hardware gets cheaper and better by the minute. That means your choices are degrading your chances to recover archived information as soon as it enters storage. It's a paradox. I have no answer to that; I've been bitten by storage on Apple ][+ floppies which can't been read on any current hardware I've got. Hopefully, there's an aftermarket for people who can afford to read the obsolete hardware of the past and transfer it to the nonexistent hardware of the future. Maybe there's a standard that won't be intentionally subverted by market forces (emphasis on force), but I dunno what it is. Pray that all that expensive data remains decryptable, if its encrypted. Your best bet may be to pay for redundancy at every weak point in your system.

    --
    ``Tension, apprehension & dissension have begun!'' - Duffy Wyg&, in Alfred Bester's _The Demolished Man_
  67. if price is no limitation... by Mister+Pedant · · Score: 0

    freeze your whole system in carbonite, should be good for a few hundred thou, obviously having copied to a new system 1st, don't forget that bit!

  68. Data Domain Archiver by bassiek · · Score: 1

    I was going to say, data domain might be a nice option since it is cheaper then EMC..... but appearntly EMC bought it allready ;) It's basicly the same, just the cheap(er) solution then the big EMC SAS solutions. Do note, you will need DD's own disks, since they run there own little firmware wich is needed. (I smell $$)

  69. Laser-etched stone tablets by Anonymous Coward · · Score: 0

    I would suggest laser-etched stone tablets buried in clay sediment with a religiously depicted stone monolith above them that would be likely to become a historical monument. If you're talking seriously long times, I mean.

  70. Lots of goofy options by KahabutDieDrake · · Score: 2

    I'm seeing a lot of really goofy suggestions in here. I'm going to make my own. First, let me say that my last job was to create massive image archives sourced from disparate media, and store them, permanently. Massive, as in 30tb a year. (maybe not that massive, but we were a tiny company, with a matching budget).

    First, let me tell you what won't work. Optical media. Just DON'T. It's unreliable, slow and generally a pain in the ass. I worked at a place that burned 150 CDs a day for distribution, we had consistent failure rates within 20 days of 50%. Granted, that's using the cheapest possible media, but that's still awful. Further our "archive" had thousands of discs in it, was stored well, and as a whole, had a 41% failure rate over 10 years. Optical media is crap for long term storage.

    Something else that won't work, TAPE. I know, heresy. But listen for a minute... do you know anything about tape? Ever used it? No? Then don't touch it, unless you plan to hire someone that is an expert to build out the system and keep it running. Were you planning to hire a full time systems manager? I didn't think so. Alternately, if you happen to have experience with tape, hell, use it. You can't beat the density or reliability.

    Now, a suggestion that does work. Build your own NAS (or buy one if you don't have the chops to build it). You ought to be able to build/buy a 5tb array for under 3k, give or take. It will quietly hum along in the closet doing it's thing for pretty much the next few years. After 3 years, start a swap program to replace each and every hard drive. Doing this all at once allows you to store the old raid in cold storage (box it up and stick it in the corner). Doing this at the rate of one drive per month allows you to absorb the costs a little easier. Continue forever.

    Now, if you are really nuts, and you actually think your data is valuable (you know, like you can trade it for money at some point), then you build out the NAS, order three of them, and keep one at your mom's house (or wherever), then you buy co-lo rack space and put the third unit (did I mention you need 3?) in there and sync all three as often as you can afford the bandwidth. This is, for all intents and purposes, how google backs up data. 3 systems, in 3 locations, each with a complete copy of the data. It's not exactly CHEAP, but neither is redoing all that work.

    I'm going to leave out suggestions like using a kodak image writer to burn the images to microfilm that is digitally indexed. Why, because you don't know the first thing about a system like that, and because you want "backups" not permanent archives. Also, you can't afford this method. I'll also skip the really wacky shit, like using BD discs, or SSD arrays (in the terrabyte range? Fuck off$$$), or anything that involves the clouds.

    Storing relatively large groups of data has been dirt cheap and easy for the last 5 or so years. Even before that it wasn't that hard. Don't invent a difficult system, or buy into enterprise gear. You don't need difficult, and you don't need a NAS that performs 100,000 IO ops a second with a fiber channel back haul. You need a couple of raided drives in a box in the corner, powered up pretty much all the time.

    Oh yeah, and do you know the single greatest cause of HDD failure? Cold storage. TURN THE FUCKING THINGS ON, and leave them that way. They last MUCH, much longer. God it was hard to teach people that concept at my last company. No, putting the drives in a box in the storage locker does not make them last longer, in fact, they started failing the minute you unplugged them. (yes, I know, physical shock is probably actually higher up on the list, as is manufacturing defects, a little hyperbole never hurt anyone)

    1. Re:Lots of goofy options by Courageous · · Score: 1

      Your suggestion is incomplete.

      One should absolute _not_ use a NAS system as an archival system without some kind of point-in-time restore capability (e.g., "snapshots"). The most common cause of data loss is not a failed hard drive. It is an "oops" by a systems admin or other human operator.

      Good news is, you can just grab Solaris Express and set all this up on ZFS. Too bad ext4/LVM doesn't have a suitable snapshot system yet. That really hurts me, but so it is.

      You assertion that leaving drives off leaves them to failing sooner isn't true. SGI/COPAN (the company that started the trend of turning off HD's in enterprise storage systems for tier 3 data) have reams of empirical data showing the opposite to be true: if they aren't spinning, they aren't wearing out.

      C//

    2. Re:Lots of goofy options by Savantissimo · · Score: 1

      This is the right answer. But do NOT use any kind of RAID except RAID 1. The controller will go bad, you won't be able to find another one, and nothing else will be able to read your data. Cold storage really is not that big a risk so long as the drives get spun up every month or two. No way should it cost anything like 3k for a NAS box. A good alternative is to get 3 or more USB 2.0 RAID 1, dual drive boxes for $50 each plus the cost of drives and keep 1 on-line and 2 off site, swapping them preferably at least weekly. This avoids the connector life issue for the drives themselves, provides ~6x redundancy, and keeps the drives spun up regularly. Write a script to verify each backup cycle. Replace the whole thing about every 3 years or at the first failure.

      --
      "Is life so dear, or peace so sweet, as to be purchased at the price of chains and slavery?" - Patrick Henry
    3. Re:Lots of goofy options by Savantissimo · · Score: 1

      Backblaze, which runs 16PB of disk storage has tested drives, here is their recommendation:

      We are constantly looking at new hard drives, evaluating them for reliability and power consumption. The Hitachi 3TB drive (Hitachi Deskstar 5K3000 HDS5C3030ALA630) is our current favorite for both its low power demand and astounding reliability. The Western Digital and Seagate equivalents we tested saw much higher rates of popping out of RAID arrays and drive failure. Even the Western Digital Enterprise Hard Drives had the same high failure rates. The Hitachi drives, on the other hand, perform wonderfully.

      Newegg has them for $130 http://www.newegg.com/Product/Product.aspx?Item=N82E16822145490R

      --
      "Is life so dear, or peace so sweet, as to be purchased at the price of chains and slavery?" - Patrick Henry
  71. Real solution by jcoy42 · · Score: 5, Funny

    There are only 2 real solutions if you want real long term storage. The first is you become Linus and just dump it on a server and let the rest of the world back it up, and the second is you make your data a religious text somehow. Because those guys with translate it for centuries to come, even if it means sitting 50 dudes in a room for 3 years with nothing but a feather, ink, and parchment.

    come to think of it, same thing.

    --
    Never trust an atom. They make up everything.
  72. Archival Quality BD-R by Hungus · · Score: 0

    Delkin offers Archival Quality Disks tested via ISO 18927-2002 standards with an estimated life expectancy of 200 years. You can buy them at most any mid to high end photography supply store for around $10 each for 24GB of storage. I use them for all of my research projects.

    --
    Bad Panda! No Bamboo for you! In matters of importance ACs will not be responded to. Want to say something critical,OK
    1. Re:Archival Quality BD-R by Overzeetop · · Score: 1

      That's nice, but it means spanning multiple discs for a even a non-parity protected version. Three discs per project would be required. Not only time consuming, but kind of a pain in the ass.

      --
      Is it just my observation, or are there way too many stupid people in the world?
    2. Re:Archival Quality BD-R by pnutjam · · Score: 1

      expectancy doesn't do much for me...

    3. Re:Archival Quality BD-R by Anonymous Coward · · Score: 0

      Let me guess you were an english major.

    4. Re:Archival Quality BD-R by Anonymous Coward · · Score: 0

      Umm why is this post off topic? Question specifically asks about BR disks and archiving.

      "I have a collection of large projects (Indesign files with associated images), which are typically 40GB to 60GB each. In this current climate, what is the 'best' method of archiving these? Spinny magnets? Solid state drives? USB? Tape? Blu-ray? All have pros and cons and price considerations. If I remove the price issue (my data is important to me), does this change the choice?"

  73. Take in mind the bit rot by Anonymous Coward · · Score: 0

    when going down the hard drive path. I have an old fc-storage unit with 16 4year+ old sata 400GB disks on which I use ZFS (zfs-fuse) on a zpool with 2 8-disk raidz3 vdevs. Gives me a total of about ~4TB on dirt-cheap hardware and I should be able to loose about 6 disks (3 per vdev) and have the ability to fix silent data corruption (or so, it's advertised). Safe enough (ignoring the possibility of non-drive hardware failures on the fc-storage unit) for my data (which is replicated to other places as well anyway) and for an acceptable budget. But if budget wasn't an issue, I'd swap it for tape any day.

  74. Why magnetic storage by Anonymous Coward · · Score: 0

    ...and not optical one?

    I can't claim to have any professional knowledge of storage technologies but I would assume that somebody wanting to archive large amounts of data would usually prefer an optical over a magnetic storage medium.

    Wouldn't laser disks, CD/DVD/BD-ROM (not -R!), ... pass the test of time much better than harddisks, tapes, ... ?

    Magnetic storage is nice because it can be overwritten very easily - but when you want to archive data the reusability of the storage medium doesn't matter and this benefit does rather look like a curse.

  75. Data Tresor Disc by Anonymous Coward · · Score: 0

    Immune to light, UV radiation, humidity, high temperatures or magnetism
    Backwards compatible with all common burners and readers including Blu-ray writers.

    http://www.datatresordisc.eu/introduction-page-dtd.html

  76. Cloud storage by slim · · Score: 1

    Sorry. It's not going to be popular with the Slashdot crowd, but dumping it onto a cloud storage service seems to make the most sense.

    I keep seeing ads for a service that gives you 2TB backups for £5/mo. For that you get full redundancy, and let them worry about replacing broken hardware etc. Cheaper than buying the hardware yourself, over the first year, and bound to get cheaper.

    If you're genuinely worried about your cloud provider having a catastrophe of some kind that their own fault-tolerance approach doesn't cover, then dupe your archive across two cloud storage suppliers.

    1. Re:Cloud storage by Taylor123456789 · · Score: 0

      Link?

    2. Re:Cloud storage by slim · · Score: 1

      I didn't want to single out a provider; the answer was meant to be more general.

      But since you ask: http://www.livedrive.com/ -- I have not tried them; this is not a recommendation.

      Actually it appears the £5/mo backup service has no storage limit. The 2TB applies to their more dynamic (and expensive) "Briefcase" service.

  77. BR-D or HDD? by GiMP · · Score: 1

    Single-layer BD-R disks and 2TB SATA disks are currently matched at $0.04/GB. I will assume that the OP's data, which contains images, is already compressed sufficiently.

    The BD-R disks have an unknown livespan and the OP's dataset would have to span 2-3 disks per project. The 2TB disks would hold multiple projects. There is an argument to be had that it is less expensive and more reliable to use the BD-R disks from the perspective of adding a single parity disk. The loss of any disk set would lose that project, not multiple projects. The data would be immediately offlined. As optical media tends not to fail by-disk, but by block, a filesystem like ZFS may be safest.

    Contrast to the 2TB solution where you could use RAID-5, fill the array, & then offline for archival. For as long as the drives are online, there is an increased risk of failure. The loss of the array would lose multiple projects (~66 projects). Your individual drives are arguably more reliable, but you have fewer disks at a greater capacity, so the impact of a disk failure is much greater than with the more distributed BD-R model.

    The benefits of hard disk storage here are ease-of-use and a better known MTBF. With fewer disks, it is easier & faster to online & verify your archives every so often. Even with ZFS-on-BDR, I'm not sure how well BDR disks will last over 10 years in a humidor, let alone on a random shelf.

  78. The last part is the reall important part by Sycraft-fu · · Score: 1

    If you want true longevity of archives, it isn't about finding a format that will not ever die, because they all can. It is about making copies. The brilliance of digital storage is perfect copies for an unlimited number of generations. So you take advantage of that. Have more than one backup, and test the backups. If one fails, make a new copy from the good data. Also, check the expected life of the backup medium, and replace it with new copies when it starts to age.

    Along those lines, to keep it useful, make sure to convert it to a new format, when appropriate. This mostly means new backup media format, like if you are using LTO-5 now you'd probably move to a newer LTO, 7 or 8 or something in a decade, but also the data itself. Like say your data was images stored in TIFF format. Ok fine, but maybe convert them to PNG, since TIFF has less support these days and is becoming a relic in some ways. Some time in the future maybe you'd again convert it to a new image format.

    The reason to do those things is otherwise in the far future, maybe you run in to a problem. Let's say it is 2060 and you need the data. It is all on LTO-5 tapes, however, and the world moved to a holographic storage medium 20 years earlier and a working LTO-5 drive is nearly impossible to find. Then you do get it off and the format is something no software reads anymore, so you have to break out an emulator to convert to a newer format, and then again, until you finally get to something you can use. If you can't do all that, then the data might as well be lost since you can't access it.

    Keep plenty of copies and keep them up to date (and tested) and you are good. The only other thing is to protect them from damage. That means storing them some place that is secure against various things. A good fire safe would be a good idea, if it is really important maybe a vault some place else.

    The thing to do is to ask yourself at what point do you stop caring about your data? That point does exist. Then design something that can withstand more or less anything below that.

    As an example I helped my parents get backups set up for their business. They care about them only so long as the business survives. If the building burns down, or floods, nothing on them matters. So the backups are in a good safe, but on the premises. There are plenty of things that could result in data loss, but only the things that would also result in the business being lost and them not caring. On the other hand at work we have data that needs to survive pretty much everything short of a nuclear war. If our building goes down, if we all die, it still needs to be intact. So we take copies of it to another building, in to an underground vault. It would take a pretty catastrophic event to get it all, and that would be large enough that then it wouldn't matter.

    1. Re:The last part is the reall important part by rev0lt · · Score: 1

      Not relevant for the point you're making, but PNG is not an alternative to TIFF, specially in professional graphics. >8 bit channels and multiple compression/encoding schemes aren't available on PNG format. Also, PNG is a RGB-only format, and pretty much useless in to-print graphic designs,

  79. Never use RAID 10 for backups! by RichiH · · Score: 1

    I have had two disks in a RAID 10 fail me directly after each other, once. Guess which two? Yay!

    Especially for backups where write speed is not much of an issue, you want RAID 6 or above. Never RAID 10.

    1. Re:Never use RAID 10 for backups! by Vrtigo1 · · Score: 1

      Parent title should be "never use RAID as a backup", unless the RAID in question IS a backup copy of data living elsewhere. In his example, the two failing drives in the RAID 10 set wouldn't matter, unless the primary array has also failed. Sure, you'd lose the RAID set, but RAID was never intended to be used as a backup mechanism in and of itself. By maintaining two separate arrays, you would have to have complete failure of two arrays simultaneously for your data to be lost. Very unlikely.

    2. Re:Never use RAID 10 for backups! by RichiH · · Score: 1

      Obviously, the RAID set is only a part of the puzzle. Still, downtime sucks and restoring from backups takes time.

      And yes, you always need several copies on distinct machines in separate locations, potentially enhanced by one or more offline copies.

    3. Re:Never use RAID 10 for backups! by Vrtigo1 · · Score: 1

      The situation I've had to deal with is video storage for a small FCP production team. They have approx 30 TB of disk storage and it's all set up as RAID 5 to maximize the storage capacity of the disks. I know we really ought to be using RAID 6, but unfortunately that was viewed as costing too much and consuming too much of the available storage for redundancy. When we buy storage, we buy it in doubles, that is to say if we buy an array to add 10 TB, we actually buy two. One is production, the other is backup. We use a simple utility called Chronosync to push any incremental changes from the production storage to the backup storage nightly.

      We don't have any offline storage at all. We know that if the building burns down or a tornado rips the roof off, we're SOL. It's not a matter of not wanting to do it, it's just that there's no practical and financially feasible way for us to keep that much data backed up on offline, offsite storage. We'd essentially have to hire someone to handle that extra workload, and then we're talking about 50-60k/yr in salary, plus the storage medium (LTO tape, etc).

      With that being said, this is the most cost effective solution I've found, and it works well for what it's designed to do, which is protect against disk failure.

    4. Re:Never use RAID 10 for backups! by RichiH · · Score: 1

      If you need offsite storage on the cheap, let the person who is lowest in the system but still 100% trustworthy take the tapes home. Keep several data sets. Verify, weekly, that said person has the tapes at home.

  80. Stones by Anonymous Coward · · Score: 0

    Best for long-time archives is still stone - although kB/kg ratio makes this a rather unmovable storage ;)

  81. Redundancy by Anonymous Coward · · Score: 0

    Offsite is great, but nothing beats one secure backups - like two secure backups.

  82. BBC by BigBadBus · · Score: 1

    You could adopt the British Broadcasting Corporations approach to valuable archival holdings c.1972: thrown everything into a furnace.

  83. Doing it cheap with tape by Anonymous Coward · · Score: 0

    1. Buy a single drive LTO-2 or 3 off Ebay. I see an 8 slot loader for sale for $270, but you really only need a single drive at 200-400Gb/tape. Let's say $150 for the drive and $100 for 5 tapes, another hundred bucks for your controller. Write all your projects to LTO-2 TWICE. Send a copy to a relatives house for safekeeping.

    2. In about 4 years, check Ebay again and buy yourself an LTO4 drive for around the same dollars. Read in all your LTO2 and write it back to an LTO4 tape.

    3. Rinse and repeat. You could probably get similar value out of a single 2nd hand LTO 3 drive now and wait for LTO5 to be cheap in 4-5 years.

    You media has a shelf life of 25+ years if kept in a cool dry place. Just try and use really common software to cut the backup (like Linux Tar). The LTO standard says that the next generation has to be able to read and write the current generation tapes and must be able to read the previous generation tapes. New LTO standards come out every couple of years.

  84. use two different methods by flappinbooger · · Score: 1

    If it's just you, and just one computer, why not carbonite (or another reputable online storage service) AND ALSO 1 or two external usb hard drives, keeping one off site and periodically rotating them. Like, weekly.

    If carbonite implodes, you have the hard drives. If you lose one hard drive, you have the other. If you lose both hard drives you have carbonite.

    If you never lose the working data, then you aren't out TOO much, as carbonite is not TOO expensive and external usb hard drives are also reasonable.

    No, I don't work for carbonite, just using them as a ubiquitous brand name like kleenex. I could have just as easily said dropbox. Oh, wait, no, not dropbox. Nevermind.

    --
    Flappinbooger isn't my real name
  85. CrashPlan by Anonymous Coward · · Score: 0

    Yeah, it's not offline, but it's cheap online backup with unlimited data and support for seeding data by getting a hard drive sent via mail. Same for restoring large amounts of data. If you don't trust a single company, also add BlackBlaze.

  86. Archiving - the best way by Auldclootie · · Score: 1

    Forgive my jaded perspective - respondents to this query are almost without exception fan boys of particular techie solutions. The real solution is far more commonsensical. I have every file I ever created from my 486 SX25 (circa 1990) onwards through a wealth of "blindingly fast' iterations of Pentium machines - my data, insofar as I ever wanted to keep it - is complete and has survived hard drive crashes, laptop and desktop thefts, floods, fire, misguided backup solutions involving CD and DVD, and the most malignant viruses the world felt able to bless me with. I have never had a raid array, a tape backup system - and I hasten to add - I spit in the general direction of your cloud solutions. Clouds are soft, vaporous and wholly subject to evaporation into nothingness. And I have never lost a file I wanted... The painfully obvious answer is - backup your hard drives - keep two copies (at least) of everything (preferably in different locations - I use family member backup and it has never failed) currently I have about 6TB of personal data - all backed up locally plus in at least one external location - this can be done with a handful of drives for an outlay of just a few hundred dollars - add a hot-swappable 3.5 inch drive dock or two and all your data is independent of all your computers. Just remember the rules: 1) The data on your computer is all temporary storage - never rely on it in the longer term - you should be able to reformat at the drop of a hat if you are doing it right 2) One copy is your interim (I don't care if I lose it) position 3) A 'cloud' copy is your 'this is convenient - but lets not pretend this is long term' solution for when you are traveling or using multiple computers in different locations 4) Two copies on site (on separate external drives) is your provisionally safe position (better still - keep one at the office) 4) Three copies with at least one in a remote location means you actually own your data - it is going nowhere without your say so and you will be able to bequeath your digital estate to those who are deserving (they in turn will be able to retain it - but only if they follow the rules above...) There! That's not so hard is it?

  87. How much total data? by sirwired · · Score: 1

    The OP failed to mention how many of these things needed archiving. A couple hundred? Redundant disks (don't even bother with RAID) spun up once a year or so. Ten thousand? Tape. No question. It has proven and well-known long-term reliability. But you must meet the media's storage requirements to achieve the media life specs. (If you can't do that, there are any number of off-site tape storage places that can.)

  88. Re:Large removable disk on the low end, tape highe by Anonymous Coward · · Score: 0

    I would also use a program like ... one of the PAR utilities used for USENET

    Thank you! That's a darn good idea!

    I'm in the process of setting up an off-site backup system, and I've been a little paranoid about my backup getting corrupted. That would give me a little peace of mind.

  89. Plenty of Information here by lililalancia · · Score: 1

    http://www.snseurope.com/ We do large genome sequencing runs and processing of the raw data with data sets not unlike yours. As other people have said how long do you need to keep the data or be able to retrieve the original and re-do the data analysis?

  90. I use LTO by slaker · · Score: 1

    My home media collection is around 20TB. For the longest time, I was dealing with redundancy by just maintaining a second, synchronized set of file servers. Each server has either a 16 port SAS controller or two 8 port controllers and a total of 16 storage drives in RAID6 including the hot spares. Each machine probably cost me $2500 to set up and I have four of them. And that's with getting the RAID cards and rack chassis from Ebay.

    The truth is, the chance of having a non-recoverable error while doing a RAID rebuild is really, disgustingly high. Hopefully I wouldn't run into a scenario in which both servers in a synchronized pair had issues at the same time, but that wasn't giving me warm fuzzies the more I read about the reliability of RAID5/6 for large volumes.

    So when it came time for me to upgrade my storage setup, I chose to go a different direction. I bought a lightly used LTO4 changer. Every tape holds 800GB and costs about $20. The tapes can be taken off site (shipped to my parents) and can grow to deal with whatever expansion of storage I make in the future. In the near term I will probably purchase a second LTO drive to store with my tapes, but I expect that I'll be in a much better place for dealing with my needs for at least the next several years.

    It's not a solution for everyone, but it was the right move for me.

    --
    -- I wanna decide who lives and who dies - Crow T. Robot, MST3K
  91. Tape Tape Tape by Anonymous Coward · · Score: 0

    Spinning disks consume power. Archival storage on HDD is unproven as HDDs are intended to be in use. The correct choice is tape - it has the lowest energy consumption, the environmental impact of manufacturing is far lower than for HDDs and they consume no power while stored.

  92. Re:Large removable disk on the low end, tape highe by gnugnugnu · · Score: 1

    WinRAR? PAR? Seriously. It would also be tediously slow and be a micromanaging solution that only covers the files but fails to consider the need to also keep a working backup image of the system and a properly licensed version of InDesign to ensure he can still reopen the files.

    Tapes make sense especially since they can easily be couriered for offsite backup. A well thought out disaster recovery plan must include offsite backup.

    The submitter mentions 60GB InDesign files and sounds like a small profession or high end amatuer, so the extra cost of a RAID setup sounds like it would be a sensible investment.

    Copying to new media sounds sensible but is not exactly the right answer, the correct solution is not only to make backups but to also to check that you are able to restore from backup.

  93. I'm confused...you're saying 60 GB is large by Anonymous Coward · · Score: 0

    On the one hand you say your project is large, but later in the sentence you say it's 60 GB. So which is it? Is it large or is it 60 GB?

    My HD video projects are regularly 400 - 500 GB.

    Personally I double backup on hard drives. My HP desktop has a built-in hard drive dock that makes it relatively painless to backup.

  94. Re:Large removable disk on the low end, tape highe by mlts · · Score: 1

    I agree -- tape drives are perfect for backups. Like someone mentioned, tar volumes from the 1970s are readable on tapes today.

    Tapes are an ideal backup medium, provided you use more than one tape for archiving, and periodically go through and recopy files to new media every couple years or so. Newer tape drives offer WORM capability, so data can only be destroyed, not tampered with.

    However, why I mention tapes secondary is that they are so expensive for meaningful capacity. Yes, you can buy tapes with less capacity cheaper, but there is a point where you are better off with multiple hard disks than trying to copy an archive onto 50-100 tapes. Same with optical media.

    Take a LTO-5 drive, which is par for the course, and has enough storage capacity to be useful. It costs about $2500.00. However, it needs a SAS card, and it also needs I/O. Similar to old CD-Rs, a tape that doesn't get enough data streaming to it starts shoe-shining, which jumps the chance of errors and adds considerable wear to the heads and the tape. So, the machine that tape drives need to be attached to either has to be fairly high end, or a dedicated machine just for moving stuff to tape with no other functions.

    If you can afford tape, it can be argued as the best backup media out there. However, most people can't, so external HDDs (laptop drives are better as they do not require power supplies) are the second best choice. They are nowhere near perfect, but for those who can't afford a new tape drive, are pretty much the only game in town for large files.

  95. Really not that big of a deal by salesgeek · · Score: 1

    I have a few friends who own print design agencies. Here's how they do it (I asked them last year when I was setting up project storage for my company):

    - A few do the old-school library checkout system and get the drive from storage and use it with their desktop/laptop. Most often they use USB drives.
    - The more sophisticated ones have a multi drive ESATA box and request that a particular archive be put online. An admin gets the drive and mounts it as needed. Live projects are stored online. Backups are done to another hard drive.
    - The most sophisticated have a big old NAS or file server and just leave everything online, and back up to HD.

    The second option is really the most popular.

    --
    -- $G
  96. Varried but start wtih tape by lastrogue · · Score: 1

    It's probably already in the comments here, but a lot of it was tl;dr. I do like slaker's LTO approach but would also recommend keeping a dedicated server backup system attached to a NAS device or something similar. Right now I don't have that much data, only about 1TB so I keep two external HDD's and any small documents that I'd definitely want to keep I use dropbox to store away. I may not be the best source for personal storage, but have been working in an enterprise environment for a few years now and a tape + NAS backup system has suited us quite well.

  97. online and offline by pnutjam · · Score: 1

    Hard drives are the cheapest storage. Put some Terabyte drives in a computer and add a removable drive slot. Put your data on the drives, RAID is a waste of time. SATA is fast enough for most people.

    If this is archival data that does not change, create a schedule where you back it up once a month, once a quarter, or annually depending on value. The backup will be to a drive you put in and then remove for offline storage. You can so some fancy Hard-linking of backups to maintain versioning, or you can encrypt your backups. The most important part is that they are in two locations, live on your server, and on an offline hard drive. You may consider hashing data to check for corruption.

    This setup is fast, it's easy to access, it's not expensive, and it provides robust data security. You should reevaluate in 5 years, purge unnecessary data, or see if better options have appeared.

    Check out my site, I specialize in this sort of setup.

  98. Data Glyphs are your 'best' alternative by randomErr · · Score: 1

    If you want something that extremely stable and will last for a hundred years then you want data glyphs. You use a printer, paper, and a flatbed scanner or hi res camera and you have a viable backup solution. The most common data glyph is the bar code.

    As with everything there are downsides:
      - You need to use a quality paper and ink if your backing up for the long hall
      - Printing takes time especially if you're using
      - Storing a ream of paper for each back up

    Below is list of some glyph formats. There use to be a site for a full Xerox solution but I think they licensed it out to another company.
    http://www.adams1.com/stack.html

    --
    You say things that offend me and I can deal with it. Can you?
  99. NAS by ramyphotography · · Score: 1

    If you have a NAS with multiple HDD then that'll work

  100. Lots by Anonymous Coward · · Score: 1

    Make lots of copies.
    Hard drives are cheap, any important data gets copied onto a few (2-3) drives stored in different locations (I like bank safe deposit boxes, parents basement) are a good one.
    I also keep my data on my main server, so it's included in every new backup, but I only have a few hundred gigs.

    If cost is really no object, I would have added tape and filed them away too, but tape is much more expensive, and my data isn't _that_ important.

  101. Need more data.... by Big+Boss · · Score: 1

    Offline, I would suggest a pair of HDD docks/hot-swap bays. SATA, with normal HDDs. Put them in a ZFS mirror and keep track of which ones go together. ZFS detects bit-rot, the mirror allows you to correct for it. Add in another backup layer as desired, for that size range, you're probably into LTO tape or multi-layer BD... Just make sure to store a checksum so you know if your data is good when you try to read it later. MD5 would probably work fine.

    If you're talking ONLINE storage, I would still use HDDs and ZFS, but include a cron job that runs "zpool scrub" on the array to keep it checked for bit-rot. Check the logs and "zpool status" for errors, replace bad drives as they show up. Make sure to use mirrors or RAIDZ. For large HDDs (>1TB?) I'd go with RAIDZ2 or 3-way mirrors. The reconstruction time can fail a second disk...

  102. LTO Tapes by bayankaran · · Score: 1

    Many commentators will mention tape drives for high volume/capacity data storage. My experience tells me not to trust tapes or tape drives.
    Unless you have humidity controlled and air conditioned storage facility tapes are a bad idea. Still tapes fail. Specifications change. LTO 4 becomes LTO 5 and so on. They are notoriously prone to slightest jerks or shocks.
    The best, cheapest option I found is the RAID 2 external SATA hard drives of 2TB from Western Digital. Keep two of these units...you have 4 hard drives with your data. That's how I store DPX files (each file is around 9MB, 24 files per second) from my feature film projects.

    --
    Tat Tvam Asi
  103. Re:G.one by BrokenHalo · · Score: 1

    True. Current tape formats are nice, provided that you have a reliable Grandfather/Father/Son overwrite policy, with appropriate verification. You can't expect too much of it, though. However, I have to admit that my point of view is coloured by repeated experience (in the '80s and '90s) of multi-volume backup sets proving to be worthless as a result of bad I/O in the middle of the set. However, with modern equipment YMMV.

    RAID is a great system for reliability (so long as it isn't RAID-0) but can't be relied on as a backup at all, since it is constantly plugged in.

  104. Did you consider ~*CLOUD*~ storage? by Wrath0fb0b · · Score: 1

    Amazon, BackBlaze, etc... have pretty decent services that would allow you to backup to their service and not have to worry about the details. You can TrueCrypt and parity-protect your files before uploading to protect against their service either snooping or corrupting your data. Heck, if you are really paranoid, you could upload to 2 or 3 such services for the same price as rolling your own or even have periodic hashing for consistency.

    The only downside I can see here is that you need to have sufficient upload bandwidth (and a compliant ISP) for the volume of data that needs to be backed up. The upsides are manifest: probably cheaper, better tech, not having to worry about implementation details, wasting less of your time managing your creations and more time creating.

    This is one of those (rare) cases where offloading to the cloud makes a ton of sense.

  105. my hard won experience by Anonymous Coward · · Score: 1

    1. use multiple technologies - one backup on hard disks, one on optical, one on tape, one "somebody else's problem".
    2. use standard formats - FAT32 and ZIP, exfs3 and tar.gz - avoid proprietary hardware & software, you want stuff which you can plug into Linux and read.
    3. give yourself the ability to recover from both partial failure and complete failure of a single disk/tape.
    4. redo your backups before you lose the ability to recover them.

  106. How long do you need them to be kept? by davidwr · · Score: 1

    Under 2 years? Multiple backups on hard disk, no question.

    2-10 years? Hard disks with integrity checks of every drive every 6 months so you know you always have 2 good copies at all times.

    10+ years? Same as above but migrate data to new media as it becomes available, OR plan on migrating the data to a proven archival storage in the future.

    Proven archival storage available today:
    Archival-quality optical media.
    Archival-quality tape.
    Archival paper - bits can be printed as graphics.
    Archival microfilm or other "microscopic paper"-equivalents.

    Proven archival storage available in 2021: ???

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  107. LTO-4 by fishbowl · · Score: 1

    The gold standard for archiving (distinct from backup) is LTO-4 tape. I know this is expensive. I know that a lot of people go "ewww, tape, STFU grampa", but you should have a serious look at the efficiency of SAS tape drives, the simplicity of the solution for really large amounts of archival data, and the reliability of the medium.

    --
    -fb Everything not expressly forbidden is now mandatory.
  108. RAID by DarthVain · · Score: 1

    They don't call it a REDUNDANT Array of INEXPENSIVE Disks for nothing.

    You can buy 2TB for 80$ now. Buy 8 of them. You now have 16TB. Set up a mirrored RAID. You now have 8TB. Run it off a cheap low end PC (that has 4 SATA RAID of course). Cost you under 1000$ bucks.

    1. Re:RAID by gweihir · · Score: 1

      RAID is not backup and certainly not archive. Anybody thinking that can just throw the data away, it is cheaper and has the same result.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    2. Re:RAID by DarthVain · · Score: 1

      And the alternatives are? Physical media such as blu-ray perhaps? Too small and don't last.

      tape drive? have fun with that.

    3. Re:RAID by gweihir · · Score: 1

      Have fun losing your data. You obviously do not get it.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    4. Re:RAID by clarkc3 · · Score: 1

      can always pick up an older tape drive - we picked an LTO-2 drive dirt cheap and new tapes were only around $30 each. The tapes can be read by LTO-2/3/4 drives so gives you a little bit of easier time to recover if your drive ever gets hosed

    5. Re:RAID by DarthVain · · Score: 1

      what sort of capacity do those have. I was under the impression that they haven't kept up with HD sizes...

    6. Re:RAID by lililalancia · · Score: 1

      Which is fine until someone leaves the tap on in the Laboratory upstairs and it rains on your parade! Belt and Braces approach with multiple locations after that..

    7. Re:RAID by DarthVain · · Score: 1

      You don't seem to know either then. I haven't lost any data yet. You don't seem to want to share your "extensive mind blowing knowlege" with the rest of the world.

      For a personal grade solution I don't see what alternatives you are talking about.

      Next think you are going to say is he should backup to the cloud right?

      You suggesting he buy 5-10k worth of drives and tapes with less capacity?

    8. Re:RAID by DarthVain · · Score: 1

      Offsite backup is always good, but hard to do with large capacities.

      There are online services, however they are expensive and some are a bit sketchy... also cloud solutions same.

      You could just take a cut on one extenal HD and throw them in a safety deposit box...

      Even large companies struggle with the concept of offsite backup.

    9. Re:RAID by toddestan · · Score: 1

      Then buy two of them. Treat the other as a large external hard drive with built-in redundancy. It's not like the hardware is expensive.

    10. Re:RAID by obarthelemy · · Score: 1

      Here we go again... Backups are:
      - Off-line, which RAID isn't
      - Off-site, which RAID isn't
      - several, which RAID isn't
      - tested, which RAID can be

      RAID is NOT backups. It's a high-availability, sometimes high-performance solution, that gets destroyed by anything that destroys your hardware (water, fire, power surge, theft...) or your data (viruses, user error, hacking, sabotage...), and introduces its own set of problems (faulty software, controller, enclosure, PSU...)

      RAID is NOT backups. I've had a handful of customers ruined because they thought so, including 2 that went under when their RAID setup burned same as the rest of their data.

      --
      The Cloud - because you don't care if your apps and data are up in the air.
    11. Re:RAID by DarthVain · · Score: 1

      So rather than calling me wrong (which I won't dispute), how would you then properly back up that amount of storage capacity then. I am assuming this is a person and not a corporation that will not be spending megabucks no matter what on whatever solution.

      I am aware of better solutions, but most are, lets say "cost prohibitive" to the normal user.

      In any case, it is likely this would be stored in buddies house, so if the whole shabang burns to the ground, he's got bigger problems than some lost design data.

    12. Re:RAID by obarthelemy · · Score: 1

      OP doesn't even state what the total amount on data is.
      If it's under 3TB, single 3TB HD, twice for redundancy, cost $300.
      If it's under 6GB, dual HDD enclosure + 2 HDDs, twice, costs $800

      After that things get really expensive: 3TB drives @130, 4-HD enclosure @300

      --
      The Cloud - because you don't care if your apps and data are up in the air.
  109. Price not an issue? Then use Archival-Grade Tape! by gweihir · · Score: 1

    Except for MOD (where the capacities did not keep up), archival tape is the only long-lived option, all other options either require regular maintenance or are unreliable.

    You will find that this quality of tape is more expensive than HDDs per GB and that the drives run you a couple thousand dollars.

    Also, keep in mind that you need to archive the software used to access the data in a form that you can run later, e.g. as VM images.

    --
    Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
  110. Best Idea: Print it out! by Anonymous Coward · · Score: 0

    Here is *the* solution...

    Print it out!

  111. NAS by DarthVain · · Score: 1

    Buy a NAS that is large enough for you. Plug it in your network.

  112. You didn't say how much data you're storing by Overzeetop · · Score: 1

    You said you had 40-60GB datasets, but not how many datasets you're backing up.

    Tape may very well be the "best" solution at the moment, though hard drives actually make a lot of sense. I would consider a part/parity system with enough parity to rebuild the entire file (yes, 2x the storage space), with the resulting set copied to two separate hard drives. 2TB hard drives are fairly commonplace, and can be had for $80 each. So you've got 100GB (avg) backup sets, and you're making two copies. You should be able to write to the drives simultaneously using two eSATA docking stations (about $40 each) and get proper cases for the drives (~$5 each). You shouldn't be out more than $200 for 20 files backed up, or $10 per project. Might as well store them in two separate physical locations.

    Finally, plan a migration path on a two-four year cycle. The migration should involve purchasing new media (presumably with 2x+ the initial storage density) and copying all of your files over. That will act as your bit-check, though in theory you could do the bit check without migration. Even with migration, the long term media cost should be less than $20-$25 per project, exclusive of the manpower to do the transfers.

    --
    Is it just my observation, or are there way too many stupid people in the world?
  113. "traditionally" meaning "in 1972" by Anonymous Coward · · Score: 0

    Tape was the way to go in 1972. Today, we use RAID arrays of hard drives to provide reliable off site managed backups at 10 cents per GB.

  114. For how long? by AtomicJake · · Score: 1

    The description looks like that you want to store your projects 5-10 years (after all: would it still make sense to open them afterwards?). If this is true, DVDs or even Blueray make sense - or even cloud (but then use at least 2 independent providers and check often).

    However, if you want to store them for more than 5-10 years, ask yourself first the question: How do you go to archive the programs that you need to open your projects? Open formats for the content, and open source for the programs would be a huge help. The you can think about the archive media for it.

  115. Re:Large removable disk on the low end, tape highe by Lawrence_Bird · · Score: 1

    I agree -- tape drives are perfect for backups. Like someone mentioned, tar volumes from the 1970s are readable on tapes today.

    I just read a VHS archive made in 1987 and it worked fine. Funny that most of the commercials were for food or clothes. Only one for a car and one for aspirin.

  116. Large?? by sunderland56 · · Score: 1

    Since when is a 40 GB project "large"??? If it fits on a single USB thumb drive it ain't large.

    Less than 2 TB = fits on a single hard drive = small.

    Less than 20 TB = fits on a single RAID = medium.

    Once you're up into the hundreds of terabytes (e.g. backing up all source material for a feature film), then yes, you have issues.

  117. Remember, If you buy June 6 onwards, by Aphonia · · Score: 1

    There is a free download (that isn't exactly working) for those who purchased after June 6, 2011.
    http://www.apple.com/macosx/uptodate/

  118. woa by hesaigo999ca · · Score: 1

    all in all, it depends what your limitations and needs are....i can zip a file that is 60 gb to a small 10 gb, then break it off in chunks, and upload it using an automated upload to hotmail account software using imap, needing a few accounts, but doable....

  119. Re:LTO4 by DocSavage64109 · · Score: 1

    A quick google search returns this: "LTO 4 is rated for 11200 end to end passes and 200 full read/write cycles". Doesn't sound very durable unless he buys several tapes and uses them sparingly. Hard drives are rated for magnitudes more use.

  120. Thanks by Anonymous Coward · · Score: 0

    Thanks all for your suggestions. HDs at $100/TB is certainly hard to beat! My main machine is raid-1, with nightly backups to a raid-5 server, The annual copy to new discs, and an offsite backup is my current regime. As I'm not in the TB+ archive category this still seems to be most sensible way to proceed. As a side question, what would you pay /TB to know for sure, you'd never have to check your backups for integrity? $1E3/TB? $1E4/TB?

  121. Buy a server by randyleepublic · · Score: 1

    The Supermicro SC936E26-R1200B Storage Chassis holds 16 3.5" drives. (This same chassis comes in cheaper models, but the savings are not worth the compromises - learned that one the hard way.) Then get an appropriate motherboard, memory, processor, and an LSI MegaRAID SAS 9280-24i4e raid card. Set up your Seagate Barracuda XT drives in Raid 60 drive groups, say 8 drives to a raid 6 stripe You'll get up to 2 stripes, so with 6 data drives per stripe that's up to 12 data drives in the chassis. At 2 terabytes per drive that's 24 terabytes, or with the new 3TB drives, that's 36 terabytes of reasonable performance, cost effective, dead reliable storage. That's not enough? Add up to 6 additional chassis to plug into your LSI card, (no motherboard needed), for a total of nearly a quarter exabyte. If you want a different cost/performance ratio substitute 5200 rpm drives, or Hitachis or whatever, and/or make smaller or larger Raid 6 stripes.

    Don't tell *anyone* where the thing is!

    --
    Social Credit would solve everything...
  122. Don't use DVD media by Jon+Harms · · Score: 2

    I'm a PhD student studying magnetism, and one thing I can say for sure is that DVD/BR media is not the way to go. Professionally printed media (the silver bottom) uses a stamp to make a mechanical impression, not unlike vinyl records. Once sealed, it last forever. Writable media uses a die, and unless you store your media at 0K, finite temperatures will cause the die to diffuse and the media become useless. This takes much less time than people think. Good disks will last 10 years, cheap ones only a few years. The problem is that it's impossible to tell anymore who is making the good disks, since all of the production lines get shared by many brands.

    Alternatively, magnetic storage isn't that great either (tapes or HDDs). For both a HDD and tape, thermal fluctuations cause random data to be lost, but hard drives are designed to recover this data and correct it. If you pull your hard drive off line for several years, it doesn't have the opportunity to constantly scan itself and check for these errors, so never expect an unpowered hard drive to store data for long periods of time - they just are not designed to do this.

    As previous users have pointed out, software raid is the only way to go. Hardware raid provides a single point of failure, and is really only suitable for high performance and short term reliability, not long term reliability.

    Tape drives also have the same thermal fluctuations issue, but because the magnetic grains can be much larger (tapes have 1000's of times more surface area to store the same amount of data) they can go much longer. I would still "refresh" my tapes every year or two though.

    Based on your requirements, I would suggest tape first, then a large software raid of HDDs. Anything else is just not safe!

  123. Build your own Petabyte server by kyutumsako · · Score: 1

    Build 3 Petabyte servers using Backblaze's instructions, place 1 in your house, and find 2 offsite locations to place the remaining pods.

  124. long trem storge by saqqa · · Score: 1

    from my experience you should 4 external storage that separate from your PC, There are special DVD for such purpose as weel thx

  125. Already Posted by Anonymous Coward · · Score: 0

    http://blog.backblaze.com/2011/07/20/petabytes-on-a-budget-v2-0revealing-more-secrets/

    Read

  126. 40 to 60 GB times how many ? by obarthelemy · · Score: 1

    it's a bit hard to recommend backups for an unknown backup size ....

    --
    The Cloud - because you don't care if your apps and data are up in the air.
  127. DVD burner that makes forever DVD's by Anonymous Coward · · Score: 0

    http://millenniata.com/ makes a DVD burner that will burn DVD's that last forever. I'm using it for my long term storage (stuff like family pictures, genealogy, journal scans, etc.) I did not see it listed here.