Slashdot Mirror


How Do You Backup 20TB of Data?

Sean0michael writes "Recently I had a friend lose their entire electronic collection of music and movies by erasing a RAID array on their home server. He had 20TB of data on his rack at home that had survived a dozen hard drive failures over the years. But he didn't have a good way to backup that much data, so he never took one. Now he wishes he had.

Asking around among our tech-savvy friends though, no one has a good answer to the question, 'how would you backup 20TB of data?'. It's not like you could just plug in an external drive, and using any cloud service would be terribly expensive. Blu-Ray discs can hold a lot of data, but that's a lot of time (and money) spent burning discs that you likely will never need. Tape drives are another possibility, but are they right for this kind of problem? I don' t know. There might be something else out there, but I still have no feasible solution.

So I ask fellow slashdotters: for a home user, how do you backup 20TB of Data?"
Even Amazon Glacier is pretty pricey for that much data.

108 of 983 comments (clear)

  1. Hmmm... by Anonymous Coward · · Score: 5, Funny

    I would say use floppies, but I'm kind of old and out of touch now.

    1. Re:Hmmm... by Anonymous Coward · · Score: 2, Funny

      I would say use floppies, but I'm kind of old and out of touch now.

      5 1/4" or 3 1/2"?

    2. Re:Hmmm... by DickBreath · · Score: 4, Funny

      Punched paper tape has better longevity than either floppies or optical media. You just need a really big roll and a lot of time.

      --

      I'll see your senator, and I'll raise you two judges.
    3. Re:Hmmm... by Anonymous Coward · · Score: 5, Funny

      5 1/4" or 3 1/2"?

      8". How young are you?

    4. Re:Hmmm... by Grisstle · · Score: 2, Funny

      Neither, 8" floppies would be the way to go.

    5. Re:Hmmm... by Primate+Pete · · Score: 4, Funny

      You can do the job with a mere 128 million single sided, single density 160K disks, like the one on the original IBM PC. When in doubt, go with proven technologies. Assuming a stack of 4 disks is 1cm thick, you should get away with around 1000 m^3 storage space.

    6. Re:Hmmm... by Waffle+Iron · · Score: 3, Funny

      Neither, 8" floppies would be the way to go.

      Hard sectored or soft sectored?

      It would be best to decide up front before putting in the order for 80 million disks.

    7. Re:Hmmm... by Cassini2 · · Score: 4, Informative

      At 10 characters per second, the backup would take 63,419 years(*) and require 659 TJ or 0.2 TWh of power to complete. I have a customer that still uses paper tape. It lasts and lasts, and I have only replaced the reader once. The punch needs a new power supply every 20 years or so.

      However, 63,419 years is a long time to wait for a backup to complete.

      (*) this assumes that 1 TB = 1,000,000,000,000 bytes. It takes almost 70,000 years if you add the extra 10%.

    8. Re:Hmmm... by Thanshin · · Score: 3, Informative

      20TB = 1.33LoC

    9. Re: Hmmm... by MightyYar · · Score: 4, Insightful

      Well, it's local storage to a computer somewhere. If that computer can run Backblaze, then super.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    10. Re:Hmmm... by K.+S.+Kyosuke · · Score: 3, Funny

      Yes, but if he's a Microsoft SW user, he *has* to use 5 1/4" floppies. You see, 8" floppies are insufficiently micro and 3 1/2" floppies are insufficiently soft.

      --
      Ezekiel 23:20
    11. Re:Hmmm... by sdo1 · · Score: 4, Insightful

      Which, if it takes about 1 minute to load each one, it will take you a mere 243 years to do the backup.

      --
      --- What parts of "shall make no law", "shall not be infringed", and "shall not be violated" don't you understand?
    12. Re:Hmmm... by Lumpy · · Score: 4, Informative

      Punch the hole and you can flip them over to double your capacity.

      --
      Do not look at laser with remaining good eye.
    13. Re:Hmmm... by K.+S.+Kyosuke · · Score: 4, Funny

      Punched paper tape has better longevity than either floppies or optical media.

      If you're going for *actual* longevity, you can't beat fired clay tablets. (Yeah, I know they weren't fired originally, but you have to decide how much you value your MP3 files. I'd certainly take the extra time!)

      --
      Ezekiel 23:20
    14. Re:Hmmm... by Chrisq · · Score: 3, Informative

      According to this site 2560 Meters = 1 Meg. So that would be 20,000,000 times 2560 = 51,200,000,000 metres = about a third of the way to the sun

    15. Re:Hmmm... by NatasRevol · · Score: 2

      This was the greatest trick ever.

      --
      There are two types of people in the world: Those who crave closure
    16. Re:Hmmm... by Rhaban · · Score: 5, Funny

      Seed a torrent of it as an encrypted file named "porn.zip" or similar. You'll have it backed up on the cloud for free in no time and available for all of eternity.

    17. Re:Hmmm... by Mordok-DestroyerOfWo · · Score: 4, Funny

      Once again a slashdotter is advocating to take away a musician's right to make a prophet.

      --
      "Never let your sense of morals prevent you from doing what is right" - Salvor Hardin
    18. Re:Hmmm... by Anne+Thwacks · · Score: 4, Insightful
      LTO6 tapes hold 6.25 TB, so you probably only need 4. Hardly fills a station wagon - the basket of a push-bike would do fine.

      Of course, you might want to make more than one copy of your data. https://www.amazon.co.uk/dp/B006K1FSKA

      --
      Sent from my ASR33 using ASCII
    19. Re:Hmmm... by Old97 · · Score: 3, Informative

      The first 8" I used held 128k. The last one had a capacity of around 1.2 mb. They were twice as fast (transfer rate) as those new fangled 5 1/4 inch floppies. Kids, what do they know?

      --
      Very often, people confuse simple with simplistic. The nuance is lost on most. - Clement Mok
    20. Re:Hmmm... by idontgno · · Score: 4, Informative

      At the very end of the 5 1/4" floppy era, the "High-Density" floppy used the same data rate, tracking, and recording density as the 8" 1.2M floppies. They were, in fact, 1.2M 5 1/4" floppies. Which is why their formatted capacity was different from 3.5" "high-density" equivalent, 1.44M.

      Other than electrical needs (as 8" floppies often had their spindle motors directly powered by 120VAC line current), the high-density 5 1/4"s were used as a drop-in replacement for 8" floppies in the hobbyist retrocomputing community. (Not collectors, though; they'd want to keep the gear as cherry as possible.)

      --
      Welcome to the Panopticon. Used to be a prison, now it's your home.
    21. Re:Hmmm... by ewhac · · Score: 3, Interesting
      I had only hard of LTO tapes quite recently, and I did a very tiny bit of poking around. The latest generation is LTO-6, whose tapes can hold 2.5TB each (uncompressed). The tapes themselves are quite modestly priced -- an LTO-4 tape cartridge (800GB uncompressed) costs about $30 each.

      The drives, however, are not cheap. New drives appear to start at around $1200. Used drives are all over the place -- I've seen some on eBay with an opening bid as low as $350. Also, all LTO drives appear to have either an LVD SCSI or a SAS interface, which means you'll also need a controller card. There appears to be no such thing as a SATA LTO drive.

      Plus you get to re-live all the joys of selecting tape vendors, and placing bets on whose tapes are going to last for 20 years.

    22. Re:Hmmm... by lgw · · Score: 3, Informative

      Tape drives need the full SCSI command set, not the trimmed version that made it in to SATA (I'm not sure there's even a "(01) REWIND" supported in SATA).

      LTO tapes stored reasonably ("keep in in a cool dry place", as the song goes) should last 15 years from any vendor, as that's in the spec, and there aren't really bottom-feeder vendors for LTO.

      --
      Socialism: a lie told by totalitarians and believed by fools.
  2. reduce the amount by JeffSh · · Score: 3, Interesting

    At home, I didn't feel like paying for 2 large arrays to store my data, so if I rip any media, I always rip it to DIVX. 800 MB for a DVD or even bluray rip is a great economy, saves me money on primary storage and also enables me to back it up. I accept the loss of quality as I can always reference the original media if I want.

    Another option in the future may be subscription services which have HD content, thus eliminating my need to roll my own. We'll see what happens there.

    1. Re:reduce the amount by buchner.johannes · · Score: 3, Informative

      20TB is not out of the world. With a RAID of 4TB disks you can cover that at home, and it doesn't need to be on all the time. Maybe you can reduce the amount of disk usage by reducing duplicate content using bup or an appropriate FS.

      --
      NB: The message above might reflect my opinion right now, but not necessarily tomorrow or next year.
    2. Re:reduce the amount by cdrudge · · Score: 5, Funny

      I always rip it to DIVX. 800 MB for a DVD or even bluray rip is a great economy,

      I do the exact same thing with high res pictures. I immediately will take the full resolution raw image and convert it down to a 320px gif. Or maybe a 10% quality jpeg. You get great economy that way too. Who wants to keep a 30+MB image around when you can have almost the same thing in 10kB instead!

    3. Re:reduce the amount by edxwelch · · Score: 3, Insightful

      > I always rip it to DIVX. 800 MB for a DVD or even bluray rip is a great economy
      I do that as well, but I found out to my horror that all my DVD's had become unreadable over time. So, probably good idea to test your backups from time to time

    4. Re:reduce the amount by FreonTrip · · Score: 3, Informative

      I'd switch to x264 in an MKV container - you can get the same quality in about 3/4ths the file size without even being clever.

    5. Re:reduce the amount by cdrudge · · Score: 5, Insightful

      I wasn't looking for it to go anywhere really other than pointing out the absurdity of saying that taking a bluray rip down to a 800MB divx rip results in just an acceptable loss of quality.

      I'm by no means a audio/videophile snob, but you either have a blind and/or deaf if you can't see a MAJOR quality deficiency with a 800 bluray rip. What's the point of having a bluray movie if the first thing you normally do is make it look like crap?

    6. Re:reduce the amount by NatasRevol · · Score: 2

      Oh yeah, 20TB over VPN is *very* useful. /snicker,snicker

      --
      There are two types of people in the world: Those who crave closure
    7. Re:reduce the amount by Artraze · · Score: 3, Insightful

      Agreed.

      Regardless of whether or not 20TB is hording / excessive / inefficient, what it almost certainly is is replaceable. Let's face it, you aren't CERN, most of you data is probably media that you can reacquire with relative ease. It's not being stored because it's irreplaceable it's being stored because it's convenient. A RAID isn't too bad, but add in managing backups and where has that convenience gone? If it costs $10+/month to backup your ripped/downloaded movies, why not just sign up for Netflix?

      Just make a list of all the replaceable data (e.g. videos you have the original disc for) you have and then buy an external hard disk / Blurays to back up the rest. If you lose your RAID, well, it'll be annoying to rebuild, but you built it once... (Besides, I doubt you could restore 20TB over residential internet less time!)

    8. Re:reduce the amount by jedidiah · · Score: 2, Insightful

      If you've already got one large array then you are already by definition half way there. If you then decide not to go the rest of the way then you are at the same time being both extravagant and a cheap bastard. It's a wonderfully stupid paradox.

      If you've got one then you should get the 2nd one or not bother with the first one to begin with.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    9. Re:reduce the amount by Mordok-DestroyerOfWo · · Score: 2

      Agreed.

      Regardless of whether or not 20TB is hording / excessive / inefficient, what it almost certainly is is replaceable. Let's face it, you aren't CERN, most of you data is probably media that you can reacquire with relative ease. It's not being stored because it's irreplaceable it's being stored because it's convenient. A RAID isn't too bad, but add in managing backups and where has that convenience gone? If it costs $10+/month to backup your ripped/downloaded movies, why not just sign up for Netflix?

      Just make a list of all the replaceable data (e.g. videos you have the original disc for) you have and then buy an external hard disk / Blurays to back up the rest. If you lose your RAID, well, it'll be annoying to rebuild, but you built it once... (Besides, I doubt you could restore 20TB over residential internet less time!)

      Some people have different use cases. A few years back I was visiting a friend in the boonies in Egypt and brought a TB of American movies and music along explicitly for her (she was putting me up for free while I was on a research project). With my 50Mbps connection and 250GB monthly cap, I could recreate the entire shebang in 4-5 months, but with her iffy ISP, she couldn't hope to download everything from me in her government's lifetime.

      --
      "Never let your sense of morals prevent you from doing what is right" - Salvor Hardin
    10. Re:reduce the amount by Pigeon451 · · Score: 2

      x264 is great, but why mkv? I've had terrible compatibility with mkv files in portable players or tablets, and proprietary software. Many won't even recognize them. I rarely have issues with mp4. Sometimes a user has no choice on software/hardware.

    11. Re:reduce the amount by Grishnakh · · Score: 2

      To be fair, 800MB is still better than your standard DVD rip, which is probably 200-400MB. But yeah, 800MB is still lame; 2GB would probably be a much better copy. It'd also help if he dumped his crappy DivX;-) codec and moved to something more modern.

    12. Re:reduce the amount by WuphonsReach · · Score: 2

      I do that as well, but I found out to my horror that all my DVD's had become unreadable over time. So, probably good idea to test your backups from time to time

      That's why you should reserve 10-15% of the disk for parity data. While DVD-R format has built in error-recovery at the sector level, by the time you figure out that the disk is going bad it is too late. By adding even more recovery data at the file level, you can treat the disk errors as an early-warning system, then use the recovery data files to get your data off the disc.

      The old standard was QuickPAR (PAR3), the new version is called MultiPAR.

      With PAR3, you can even write the disc image to an ISO file, make a copy of that ISO file and rename with the PAR3 extension, then let QuickPAR or MultiPAR use the PAR3 filename to search the ISO filename. Which lets you retrieve data from a disc, even if the file system has become corrupted.

      (About the only thing you can't easily recover from is a "track zero" error where the TOC of the disc has been broken.)

      --
      Wolde you bothe eate your cake, and have your cake?
    13. Re:reduce the amount by nabsltd · · Score: 2

      20TB is not out of the world. With a RAID of 4TB disks you can cover that at home, and it doesn't need to be on all the time.

      Sure, it's easy to have 20TB of usable disk space (I've got forty 2TB drives spread among 5 servers at my house), but 20TB of "must be backed up because that's the only copy" is a little unbelievable for a home user.

      For example, I have 700 Blu-Ray movies that have been ripped and re-encoded to take about 2TB of disk space. If I had 30-40TB available, I might store the raw Blu-Ray images, but then I don't need backup, as the data is easy to re-create. So, I'm a little skeptical that the "friend" in TFS had 20TB data that he can't re-create by going back to original sources.

    14. Re:reduce the amount by ncc74656 · · Score: 3, Interesting

      RAID-5 uses up 1 disk worth for striping, so net space in an 8-drive array is 7-drives worth (about 27TB using 4TB drives). The problem with RAID-5 is that you are 2 disks away from failure and rebuilds often kill the disks.

      RAID-6 uses 2 disks worth for striping, so net space in an 8-drive array is 6-drives worth (about 23TB using 4TB drives). Is able to survive a double-disk failure before data loss. Still has some of the same issues as RAID-5.

      I use Greyhole for media and document storage. It handles disks of unequal size (currently running one 3TB and two 1.5TB drives), and you can choose the level of redundancy you need. In my case, movies, TV shows, etc. get a single copy (one file exists on one drive), while documents and photos get two copies (one file exists on two drives). If a drive goes bad, you only lose the files on that drive...and only for the files for which you selected no redundancy. With redundancy, extra file copies are recreated on the remaining drives from the surviving copies; this process is most likely less stressful on the disk set than a RAID rebuild.

      My movies, TV shows, and music are backed up to BD-R, stored in a binder at work. They hold ~20GB each, as I'm using dvdisaster to guard against media errors. When a 2TB drive failed, I brought the backup (currently about 190 discs) home and restored the files that had gone missing. Backup and restore are managed by scripts, with information about what files are on what discs held in a MySQL database that gets periodically backed up off-site as well. The initial backup took several months (on and off) to finish, and the last time I needed to restore, it took about a week, but now I just burn a disc when I have about enough new data to fill one. Burning and verifying takes a few hours, but it's something you can start and walk away.

      --
      20 January 2017: the End of an Error.
    15. Re:reduce the amount by wagnerrp · · Score: 4, Informative

      The problem with RAID-5 is that you are 2 disks away from failure and rebuilds often kill the disks.

      No. The problem with RAID-5 is that during a rebuild, there is a reasonably possible chance you could have a UBE, and lose one bit, making perfect recovery of the array impossible. Only a stupid controller would consider a UBE to be a failed drive and trash the entire array. On RAID-6, you still have the same possibility of a UBE, but the chances that two separate drives would experience one on the same exact block during a rebuild are so astronomically slim as to be irrelevant.

    16. Re:reduce the amount by DigiShaman · · Score: 2

      I've never ran into a UBE in a RAID5. But then again, that's because I've always used Dell or HP machines that have Patrol Reading on the RAID card. It's proactive checking/correcting when the volumes is at a minimal use. It's frequency of checking is also a user changeable option is most cases.

      No. The problem with RAID5 is that should one drive fail, and second drive could fail while in the rebuild process. That's because rebuilding stresses all the remaining drives. If one of them in the array is already on the cusp of failing, a rebuild could push it over the edge. Thus losing the entire volume. RAID6 is a waste in that it's N-2 drives of capacity lost. But when you have to be sure (and damned sure) of reliability and availability, it's worth looking into. Exceedingly so for a SAN.

      --
      Life is not for the lazy.
    17. Re:reduce the amount by nemesisrocks · · Score: 2

      Regardless of whether or not 20TB is hording / excessive / inefficient, what it almost certainly is is replaceable. Let's face it, you aren't CERN, most of you data is probably media that you can reacquire with relative ease. It's not being stored because it's irreplaceable it's being stored because it's convenient.

      There's a large, flawed, assumption running through this thread that it's "easy" to reacquire all media.

      Between torrents and usenet, trying to find good rips of content that's over 6-12 months old is often impossible. Good luck trying to find any older show that's not sci-fi or super popular. Often studios don't sell DVDs or Blurays of these shows anymore, if they even did to start with.

    18. Re:reduce the amount by redlemming · · Score: 2

      Regardless of whether or not 20TB is hording / excessive / inefficient, what it almost certainly is is replaceable.

      There are huge amounts of material that is very hard to replace, and often irreplaceable.

      For example, many instructional and educational DVDs produced in the last few decades for various subjects are completely out of print. Typically these are home-brewed DVDs that weren't professionally pressed. These have a fairly high rate of failure, which means every one has to be backed up. In some cases, various political and legal issues may prevent re-issue, even of professionally pressed disks.

      Similarly, many VHS tapes are completely out of print, and either impossible to acquire, or only available at very high prices. For example, I would love to get a copy of "Out of the Fiery Furnace", even at VHS quality, but good luck finding that. Even when the tapes are available, there's always a question of quality: there's a big difference between converting a pristine VHS tape to digital form, and one that has been played too many times.

      Then there's the issue of time spent organizing and editing video material to make it useful.

      For example, effective use of instructional material requires random access to it. The simplest and most reliable way to do this is by extracting clips in full (as an exercise of fair use rights), as bookmark mechanisms tend to be quite primitive and incompatible between programs. It's often necessary to use multiple video processing programs when working with video, as each of them has it's quirks and limitations, so this is pretty important.

      Another consideration: many videos edited by amateurs, and even some done by professionals, have serious problems (such as background noise) that need to be edited out (again, as an exercise of fair use rights) to make the video usable. Removing noise is not a linear process, and can require many hours of manual experimentation. Since techniques are likely to improve over time, it's important to keep the raw (unprocessed material) around.

      Further, high compression video algorithms don't work well when one needs to be able to play things frame by frame or at reduced frame rates, without loss of detail, and keeping the audio comprehensible, in order to understand what is being demonstrated. This means high quality must be maintained in these clips. This effectively doubles the space needed to store this kind of material, as both the raw and processed data must be stored using relatively poor compression techniques.

      Loss of the data, of course, means all the editing time is lost, and that's a lot of hours of human time.

      Far more sensible than to take the risk of losing all this time is to keep both the raw and edited copies, but then you have to deal with backup issues for huge amounts of data. The best approach at present is to have at least two sets of disks, each with copies of everything. Having a third set off-site would be even better.

  3. Crashplan by rossjudson · · Score: 5, Informative

    Crashplan has unlimited storage. I use their home plan; it's unlimited for up to 10 machines. I think I am backing up about 6TB there now.

    1. Re:Crashplan by genghisjahn · · Score: 2

      You can backup(encrypted) to a friends computer and not use CP stuff at all. Check out the Local and Offsite plan. https://www.code42.com/store/

      --
      Sorry about the mess.
    2. Re:Crashplan by phalangion · · Score: 2

      They allow you to create your own 448-bit key. When you use your own key, Crashplan is unable to assist with the recovery, which implies that they do not have access to this key.

    3. Re: Crashplan by Anonymous Coward · · Score: 5, Interesting

      Crashplan offers unlimited storage, yes, but they limit it indirectly by slowing down uploads.

      I recently paid for a crashplan account to back up ~6TB of media, and at the speeds I'm seeing the initial backup is going to take more than a year. I have 100Mbit/s fiber at my home and can max it easily with other services.

      So for 20TB, it's going to take many years to back up. I don't think that's a practical backup solution. There's a decent chance you're going to lose your data before the initial backup completes. And if crashplan goes under, you have to start all over again with the next "unlimited except for rate" provider, and have no backup in the meantime.

    4. Re:Crashplan by Anonymous Coward · · Score: 2, Insightful

      I would agree with CrashPlan, although I have never ever ever seen it upload faster than 3Mb/s. Uploads are slower than hell, but it does indeed work.

    5. Re: Crashplan by MightyYar · · Score: 2

      While I've never had the throttling issue that you have, I do want to point out that they accept seed drives if your initial backup is taking a long time.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    6. Re:Crashplan by K.+S.+Kyosuke · · Score: 2

      Wait, before I lose count...6+1+1...this means you're at 8TB, right? Or that you're at 12 machines?

      --
      Ezekiel 23:20
    7. Re:Crashplan by Skreems · · Score: 2

      Watch out for Crashplan in certain cases. I'm currently trying to restore a 50GB file, and it keeps restarting the download halfway through. Their support is useless, basically leaving it at "sorry dude, nothing we can do". For other files it's been good, but their testing and support of edge cases doesn't seem especially solid.

      --
      Slashdot needs a "-1, Wrong" moderation option.
      The Urban Hippie
    8. Re:Crashplan by MrNaz · · Score: 2

      It implies nothing of the sort. I implies that they *pretend* to not have the key.

      I'm not saying anything about Crashplan, I'm just pointing out the logical fallacy in saying "They don't help you, therefore they can't help you."

      --
      I hate printers.
    9. Re:Crashplan by suutar · · Score: 2

      And even if you don't trust them to not grab that key anyway, nothing says the files you feed it have to be cleartext.

  4. Hard drives + Robocopy by DogDude · · Score: 5, Informative

    I have a 16 TB media collection at home that I just back up on more hard drives.

    External hard drives in USB cases + Robocopy works great for me.

    --
    I don't respond to AC's.
    1. Re:Hard drives + Robocopy by Lennie · · Score: 2

      Well, you might not need to have twice as many. It just depend how much you are willing to pay.

      He is using a RAID-array as his primary storage. You will only need the other copy as a disaster recovery. In case you accidentally delete or format or the hardware RAID controller fails and you can't replace it anymore. Or something else stupid like that.

      So if you have a RAID5 of your originals and a copy on non-RAID disks and do regular checksums of your data to compare if reading the data from the RAID and non-RAID disks is still the same, you should be OK in most cases. The checksums checks could be rsync -c or ZFS/BTRFS or whatever.

      If you want more assurances, you're bigger issue is: you should have your copy off-site.

      If you've done all that, only then you might want to make sure the copy has RAID.

      Next step up from that is to keep a history of your changes of course, to have a real backup. Not just a copy.

      --
      New things are always on the horizon
  5. If you want to hoard bits... by polymeris · · Score: 3, Insightful

    > It's not like you could just plug in an external drive [...]
    Why not? Maybe not one, but 10 or 20 of them.

    1. Re:If you want to hoard bits... by fuzzyfuzzyfungus · · Score: 2

      Tape actually works pretty well, it's just that if you are running a small enough system that "Well, obviously, just get another 20TB array and store it at one of your other sites, idiot... On second though, get two extras and put each one at a different offsite location" isn't the answer that springs to mind almost reflexively, you probably can't afford the cost of entry.

      An actually contemporary tape drive(and a machine capable of keeping it fed when it is running full bore) is Not Cheap; but the fleabay shit that is cheap tends to offer painfully mediocre capacity and unknown reliability. Disks, by contrast, have a cost of entry that basically starts at zero and scales more or less linearly with the number of disks, unless you absolutely must have them all online at the same time(and even here, you hardly need screamin' hardware RAID for your backup volume, and bulk SATA ports of undistinguished performance are cheap).

    2. Re:If you want to hoard bits... by ewilts · · Score: 4, Informative

      The dataset isn't that huge. Tape can write at speed at least as fast as disk - LTO-5 writes at up to 280MB/sec - far faster than you can read the source at which isn't likely to be fast disk. The seek for a single-file restore will be slower than disk but after the initial seek, the read will be as fast as from a typical archive disk (no, you're not archiving 20TB to SSD, nor are you storing the source data on SSD either)

      However, the change rate for this application is likely to be low. That makes it very feasible to do random testing from the new backups where a minute to do the tape mount/seek is not a problem. You won't be writing more than a single tape in any single run (LTO-5 is ~1.5 TB of uncompressed data).

      For $2K, you'll have the LTO-5 drive. Add $500 for 20 tapes and you can back up the entire set (once) plus a bunch of incrementals. I haven't done the math with LTO-6 which is faster and holds more data. If you want multiple generations, tape is a lot cheaper per TB than disk. The initial drive cost hurts but after that, the price is good at $15/TB or so.

      --
      .../Ed
    3. Re:If you want to hoard bits... by nabsltd · · Score: 2

      An actually contemporary tape drive(and a machine capable of keeping it fed when it is running full bore) is Not Cheap; but the fleabay shit that is cheap tends to offer painfully mediocre capacity and unknown reliability. Disks, by contrast, have a cost of entry that basically starts at zero and scales more or less linearly with the number of disks

      This is absolutely the best statement of this ever. Everyone who claims that tape is the One True Backup doesn't factor in the startup cost of $2-4K for a tape drive that can handle reasonably large capacity tapes, the hardware to connect it to a computer (many of these tape drives have fiber-channel as the only option), and then the cost of some kind of changer if you want any amount of automation.

      For home use, hard drives are by far the cheapest and most convenient method, as long as you are in the less than 50TB world. If you aren't satisfied with hard drive reliability, back up your data twice, to two different drives. With 4TB drives selling for around $170, and backing up twice as I suggested, you don't reach the break-even point with tape (single backup) until you need to back up 15TB. With single backup to disk, the break-even is around 45TB.

  6. BackBlaze by connor4312 · · Score: 2

    BackBlaze offers unlimited backup storage for home users for around $5/mo - encrypted with asymmetric keys. I've got about 750 GB on there myself, works great. Although they may not *like* you backing up 20 TB of stuff, they should accept it. And, if they don't, you're about back five bucks. Probably worth a try.

  7. Do something about your hoarding problem by Anonymous Coward · · Score: 4, Insightful

    "My friend (read I) lost 20TB of pirated content! What should my friend have done different?"

    How about, ask yourself, how much of that content were you intending to ever consume again. Yeah, you can most likely delete 95% of it, that's 1TB of content that you might use again.

    Hoarders! *lol*

    1. Re:Do something about your hoarding problem by ustolemyname · · Score: 3, Insightful

      Not all of us have access to the time machine required to know *which* 1TB that is.

      Are you willing to share yours?

    2. Re:Do something about your hoarding problem by flanders123 · · Score: 2

      How is this insightful. This is the typical useless - "Why would you want to do that, you idiot???" - non-constructive Slashdot answer that drives me insane. In short: Ask Slashdot means "Answer the fucking question", not "Attack the question". /Rant

    3. Re:Do something about your hoarding problem by LordLimecat · · Score: 4, Insightful

      This problem isnt unique; most people have trouble curating their data. That doesnt change the fact that the problem is mostly self-created, and the best solution isnt to find another place to stuff the 20TB. Its to take the time to cull it down to a reasonable size and then back it up.

    4. Re:Do something about your hoarding problem by Wrath0fb0b · · Score: 2

      You don't have to cull it down, you just need to organize it into logically distinct groups and assign them priorities. Hoarding isn't the problem, the problem is assigning too high a priority to the hoarded pr0n as compared to the really important stuff.

      • Group #0:

      Contents: Documents, source control repositories, user preferences, email archives
      Maximum Size: 10GB
      Protection: 3-way Mirror + Snapshots + Offsite
      Total Space Required (way upper bound): 150GB
      Total Cost: $3 a month for Crashplan

      • Group #1:

      Contents: Personal photos, music collection, other people's #0 backups, /home
      Maximum Size: 1TB
      Protection: 2-way Mirror + Offsite
      Total Space Required: 2TB
      Total cost $3 a month for Crashplan

      • Group #2

      Contents: Everything else, media, pr0n,
      Maximum Size: âz
      Protection: Diskpool, maybe integrity if you like zfs/btrfs

    5. Re:Do something about your hoarding problem by LordLimecat · · Score: 2

      It most definately IS worth curating your inbox. One of the toughest issues ive seen WRT backups and email performance is absurdly large PST files. Guess what: that PST file is marked as "modified" every time you open outlook, and most online backup systems will mark it as needing to be re-backed up each time leading to some absurd bandwidth usage. Theres also all of the issues with potential for a single file to get corrupted, difficulty in repairing it (ever tried to scanpst a 25GB pst?), performance indexing it, etc... all because people cant be bothered to keep their mailbox under control. And of course when you finally decide to move to provider like Outlook.com / exchange / gmail, you discover they have a 25GB limit.

      Source: Been there way too many times with clients. Hoarding is never the answer, and is often the problem.

    6. Re:Do something about your hoarding problem by Reziac · · Score: 2

      This is why I archive my email into separate bins every 60mb or so (which means about twice a year) -- anything larger becomes unwieldy. It's a nuisance, but not nearly the nuisance of trying to search the file with an external tool because it got munged and won't open in the mail client.

      [This is with SeaMonkey, which still keeps mail as plaintext, thank ghod, so any good text editor can root about in the file at need.]

      --
      ~REZ~ #43301. Who'd fake being me anyway?
  8. Good luck. by TheCarp · · Score: 3, Informative

    A quick check at one service which lists such large amounts, you would be looking at almost $20k/year to keep a single offsite copy of that. That is the posted price however, I imagine that is enough that you could shop around and find a deal, but, a deal is still going to be prohibitive for most people.

    At 20 TB I would start thinking about one of two things: Tape, and/or git-annex.

    Unless prices have changed since I last looked and the scales tipped, tape has the advantage of being cheap. Of course, you will need to test your tapes occasionally and likely want 2 copies just in case, but, at that point you are invested in tape, may as well.

    The other possibility is git-annex and lots of drives, but you can mix types. That way you can keep a catalog of your library and information on where it all is, and how many copies of each thing you have.

    Of course, any way you slice it, each physical piece of media is something that can fail so you have to occasionally test to ensure redundancy.

    --
    "I opened my eyes, and everything went dark again"
  9. Don't hoard by rainer_d · · Score: 4, Insightful
    Were those 20T of original movies and music or just stuff he downloaded via bittorent?

    He could have always bought a sufficiently large tape-library from ebay - but I guess the data wasn't worth that much.
    That's always the first pair of questions to ask: how much is it worth and how much would it cost to recreate?
    If the answer is somewhere between "I don't know" and "Well, it's not that much", then he just should stop hoarding that much stuff.

    He could have built a filer with ZFS and sent daily snapshots to a 2nd filer - but that wouldn't have helped him if the house burnt down...

    --
    Windows 2000 - from the guys who brought us edlin
  10. Build another server by EmagGeek · · Score: 2, Interesting

    If you want to back up 20TB of data, you have to pay for it.

    Build another server and rsync hourly.

  11. My solution by StripedCow · · Score: 5, Funny

    Figure out the theory of everything.
    Then you can always recompute your data from scratch.

    --
    If Pandora's box is destined to be opened, *I* want to be the one to open it.
  12. 2nd Array or Tape by Grave · · Score: 2

    With a second array, or tape backup. The second array is going to be the easiest solution, but tape backup provides you the option of storing the tapes off-site, which is important for any real backup plan. After all, your friend could just as easily wipe out the 2nd array by mistake, or a disaster could wipe out the physical location. LTO-6 tapes are cheap and can hold 2.5-6.5TB of data depending on compression. Tape drives are perfect for backup, so why even ask if it's right?

  13. Rsync and Bluray and maybe dedup by Supp0rtLinux · · Score: 2

    As you noted, Bluray holds a lot of data, but would take some time. Since its audio/video media, odds are most of it is pretty stagnant. I'd do an initial rsync job to write out to Bluray... then once a month or so repeat the job but now rsync will only get what's changed. Depending on the media type and age, you could also look at dedup'ing it and if the dedup'd copy is significantly smaller than the source you might be able to put that onto say one or two 3-4Tb drives.

  14. Why back it up at all... ask the NSA for a restore by Supp0rtLinux · · Score: 4, Funny

    You could always just call up the NSA and ask them to restore the data. Odds are good they have a copy of it...

  15. one bit at a time by Cmdr-Absurd · · Score: 2

    Same as always.

  16. Plan for backup before you buy by Ktistec+Machine · · Score: 5, Insightful

    Whenever you buy storage, you should buy the necessary backup capacity at the same time. You should never buy storage without buying backup capacity. Budget for it right from the start. If you can't afford the backup, you can't afford the storage. This may mean getting half as much storage as you'd like, but that's just the way it has to be. You probably wouldn't buy a car without an engine. It wouldn't do its job. So don't buy storage without backup. If you do, you have a storage system that can't do its job.

  17. I agree but... by matteo.defelice · · Score: 5, Informative

    I agree, I've been using Crashplan for three years and the unlimited space it's really great BUT... ...I'm not sure about the bandwidth they provide: how long it will take to upload 20 TB? Anyway, I don't see what's the problem in using external drives for backup. Here in my lab I've realized that the best way to backup X Terabytes is to have another storage with X Terabytes...

  18. Hilarious by jeffmeden · · Score: 5, Insightful

    It's not like you could just plug in an external drive, and using any cloud service would be terribly expensive. Blu-Ray discs can hold a lot of data, but that's a lot of time (and money) spent burning discs that you likely will never need. Tape drives are another possibility, but are they right for this kind of problem? I don' t know. There might be something else out there, but I still have no feasible solution.

    Lets start from the top: You *can* plug in an external drive, it's called a complete hardware duplicate of your array (or perhaps for space/cost consideration, a single disk based copy held offline and synced regularly). Not hard and not terribly expensive (i would go with this solution personally). Cloud? Yep the bandwidth and storage even on something like Amazon Glacier would be prohibitive to all but the most financially independent geeks. Bluray doesnt hold enough (even at 50gb/disc you need 400 of them, groan). So, tapes? You bet your ass tapes are designed to do exactly this task, why do you think they are still in use? You can get individual tapes at 1/1.5TB, but for a one man operation they are probably going to cost you more than the first solution (offline spinning disks) and they are a pain to manage properly.

    Now what is this doing on ask slashdot? A pencil, some scratch paper, and 15 minutes between amazon.com and newegg.com would tell you the prices of every solution. Oh, right, they need a chance to tee up some targeted ads for Carbonite, Mozy, Crashplan, etc.

  19. Backup only the best one by jones_supa · · Score: 2

    How about backing up only the crown jewels of the collection?

    Make a directory like /entertainment/premium and put the best stuff there, with a 4 TB limit. Rotate two external 4 TB HDDs and copy the stuff over periodically. Put a little sticker or some other mark on the newest, so you remember which one it is. If your main RAID array fails, build a new one, and restore the premium stuff from the most recent one of the two external disks.

    1. Re:Backup only the best one by geekoid · · Score: 2

      Why would you only back up the nut shots from his porn collection?

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  20. Ah, "unlimited"... right. (*cough*) by stoploss · · Score: 5, Informative

    These "unlimited" claims always turn out to be lies. When will we learn?

    My friend paid for an "unlimited" account from JustCloud for backup. He stored 1.8 TB on it and then they "fair use"'d his ass and canceled his account. They didn't even give him a refund for the rest of the money he prepaid.

    1. Re:Ah, "unlimited"... right. (*cough*) by dcollins · · Score: 2

      Fraudulent Capitalist Practice Spun into Argument Against Communism. News at 11.

      --
      We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
  21. Amazon Glacier by uiucgrad · · Score: 3, Interesting

    I use Glacier and its great. 20 TB is about $200 a month which to me does not seem like all that much money for backing up that much data. The biggest problem from a home users perspective is getting all of that data to Amazon. Hopefully he lives somewhere where fiber is available to his house.

    1. Re:Amazon Glacier by koan · · Score: 2

      They charge you to download it too.

      --
      "If any question why we died, Tell them because our fathers lied."
  22. Re:no by jones_supa · · Score: 2

    md prnt dwn

  23. Connect a raspberry pi and by Coeurderoy · · Score: 5, Funny

    Connect a raspberry pi and configure it as a backup server and let it copy all to /dev/null...
    Then put aside the money you would have invested in a "better" solution, put it in a safe bank (under your mattress)
    and wait until you need to restore something..
    Most probably you'll enjoy the money more ...

  24. Make a list by PsyMan · · Score: 2

    Catalogue the contents and when you lose it all you can spend 10 minutes searching for the 2% of the content you really want to download again and feel good that you now have 98% of your storage space back to start filling with more crap :D

  25. Re:Another RAID? by jones_supa · · Score: 3, Funny

    That's why it is supposed to be used with caution, as no 'rm' supports it. ;)

  26. Re:Glacier at $20/mn expensive? by jeffmeden · · Score: 3, Informative

    Glacier at $20 per month for 20TB is rediculously cheap by today's standards. And at those sizes, you'd want to ship those drives to Amazon instead of uploading. We do this all the time and it's not that hard.

    The price of TBs of storage of course will come down without question. But by today's standards $20/month for a medium that won't "bit rot" on you is an amazing deal.

    You missed a 0, he has 20,000GB and the cost for glacier is $.01/gb/mo (not including upload charges). So, Glacier would cost him $200 a month or $2400 a year. Not hugely expensive but if you are OK with a quasi-local copy (offline and stored in a fire safe, perhaps) you could do it cheaper for less, after you hit the 1 year mark.

  27. I keep it simple with BackBlaze by BenTristem · · Score: 2

    I was going down the route of buying an expensive RAID NAS / DAS, but then I remembered when I got broken into in the Canary Islands and the thieves took both of my backup drives from two separate rooms. I'm now settled on a simple external drive, with the whole lot backed up offsite. I was looking for... + Unlimited backup, so I don't need to think +The ability to backup attached drives (NAS, DAS, USB, etc) + To feel that my data is safe with a 2nd layer of encryption You can try it free here: http://bit.ly/1bRNax1 My blog post about this: http://www.bentristem.com/1/po... Enjoy!

    --
    Ben Tristem I'd love to know more about you in this short survey... http://bit.ly/1oM7Fvl
  28. a used LTO autochanger is what I employ by Anonymous Coward · · Score: 2, Informative

    No one will ever see this anonymous post but a cheap robot changer (used) on ebay can be had from between a few hundred to a few thousand dollars. Most of us are geeks and love technology. I use two such devices, couldn't imagine life without them. LTO4 is still the sweet spot in storage cost (media) and capacity. The tapes hold 800GB and can be purchased for around $22 dollars each.

     

  29. No 2 ways about it by obarthelemy · · Score: 2

    1- if you need to backup 20 TB today, you need to budget for 40TB in the medium term.
    2- a backup is off-line, off-site, tested, and multiple. The "multiple" part is pricey, and the other 3 you can get cheapest with a PC filled with HDs. Or two (I'm making do with one). $200 for the BC, $150 per 4TB HD x 5 = $950. Hide that backup in a place safe from theft, floods, fire...

    --
    The Cloud - because you don't care if your apps and data are up in the air.
  30. TAPE by Lawrence_Bird · · Score: 2

    There are many > 1TB tape back up systems, many with very high speeds, assuming you can feed it data fast enough.

    I have to wonder though.. 20TB for a single person? I'm not gonna do the math but that sounds like so much stuff to be impossible to listen/watch all of it.

    But at least he has proven once again, RAID is not a backup. RAID will merrily do what ever you wish, including copying drive corruption.

  31. License? by scsirob · · Score: 4, Funny

    Just assuming that your friend had a fully legal collection, I would think that all he needs to do is ask the media companies for a new copy. Because the media industry tells us that we do not buy music, we buy licenses, right?? So even if we lose the bits-and-bytes which are easy to replace, then we still hold a license and the media companies should facilitate that your friend can exercise his licensed rights..

    [/sarcasm]

    --
    To Terminate, or not to Terminate, that's the question - SCSIROB
  32. Boxcars / Gigabyte by jabberw0k · · Score: 4, Funny

    If IBM punch cards were used, 1 GB equals approximately 47 cubic yards (assuming 80 bytes per 187x86x0.18mm per card) and about 70,000 lbs (at 2.42 g per card), so one standard railroad boxcar (limited by both cubic capacity and weight) could hold about 3 GB. 20 TB would need over 6000 boxcars of punch cards; at 60 feet per boxcar, that's a freight train about 70 miles long.

    1. Re:Boxcars / Gigabyte by joshuac · · Score: 3, Funny

      Well sure, latency is a bitch but imagine the throughput once it got moving!

  33. Stop being cheap or collecting so much crap data! by cyn1c77 · · Score: 2

    What does it mean that he didn't have "a good way to backup that much data, so he never took one"?

    The concepts behind backing up data have not changed. You need to manage the size of your data to redundantly fit into the storage of your system. So either pony up the cash and time to properly store your files, stop collecting TBs of crap, or stop complaining about losing it when your system crashes.

    It's frustrating to see people continuously complaining about how they have too much data to back up cheaply and conveniently. It's even more frustrating to see them complaining about losing all of their data because they didn't back it up properly.

    I think that the main issue is that most people do not realistically or conservatively plan their actual storage capability. For example, it seems like 90% computer users believe that having 4 TB of hard drive space means that they can safely store 4 TB of data.

    After a conversation about scratch space, redundant drives, and timestamped backups, they then will grudgingly agree to allocate 25% of their available storage to RAID/Backup space, which obviously does not get the job done! Very few are willing to accept using 66% of their available hard drive space for RAID and Backups, which is really the minimum metric for any sort of storage longevity.

    20 TB is an awkward amount of data for a non-corporate individual to be storing. It's more data than most people actually need for their media and it is getting into a very expensive price range to backup for basic music/movie content. (By expensive, I mean that it would be cheaper to just re-purchase the media rather than back it up.)

  34. Please post Tape backup ref by advid.net · · Score: 4, Insightful

    To /.ers saying that 1TB+ tapes would be a good idea to do this backup, please:

    Add some references and price of such hardware and media that would suit best home usage.

    1. Re:Please post Tape backup ref by Anonymous Coward · · Score: 2, Insightful

      Stop ! Stop ! Stop ! Stop .... Seriously STOP !

      There are a few comments here that are really trying to help, and most are wasting time. This is a real problem, it's a real problem we all have - even if we don't think too hard about it. I bet that lots of you have a usb stick with a 100% backup of some critical files... perhaps even multiple copies ( 200%, 400% coverage )

      To solve this problem - you have to understand what your requirements are ( surprising eh - but nobody is talking about it ). It's clear that most of the comment have no idea about managing large data volumes, and they clearly don't understand backup solutions.

      Here are some questions - the set of answers to these questions decide which backup solution is optimal for THIS use case ...

      What 'backup' problem are we trying to solve ?
      - A single spare copy ? Multiple copies ?
      - Is all the data in the backup 'equal' ?
      - How often does the data change - or is it mostly static ?
      - How much data loss from the primary source are you willing to accept ? ( In otherwords how closely does the backup have to match the live data set )
      - If you lose the primary source ; how quickly do you need the data set back ? ( Can you backfill the dataset while continuing to work on the live system, or are you hard down until a full restore has completed ).
      - How long does a backup need to live ? ( If you do a fresh full backup every week, and you write it to a biodegradable wafer, and it dissolves in 6 months ; but you have copies for every intervening week - do you care that the 6 month old version is nolonger readable ) ... OR do you need to restore from a year ago, 5 years ago, 10 years ago (think financial records))

      Figure out the answers to those questions - and you might be a ways towards deciding what solution you actually need.
      There are commercial software systems (and opensource ones like amanda/zmanda) that understand the data management lifecycle - and are setup to support solutions like 'intermittent full backups ; with daily/hourly incremental snapshots'. . but then there are the filesystems that support snapshotting. Look at the features that the full service solutions provide - they might prompt the conversation to help you understand what you actually require.

      So - if you have 15TB of mostly static data (movies etc. ) - then you can archive that to long term media.
      If you have 5TB of changing data ( music library, playlists, new mp3s, new movies ) - then you can snapshot that using the filesystem hourly , and back it up to the cloud weekly (with incrementals) - and then dump it to long term archival every 6 months..

      Where the (15TB, 5TB, 6 months) values in the above statement are completely made up - and are not the ones you should use unless I guessed right.

      None of the 'use tape, use cdr, use punched paper, use lasers to etch it on the moon' solutions will work if you don't first understand the problem you are trying to solve.

      This isn't hard - but if you don't know what you want, then every solution is the wrong one and too expensive

  35. Re:Similar Situation by afidel · · Score: 2

    You can get an LTO4 SAS drive for ~$50 on ebay, they do 800GB native per tape, so typically ~1.2TB per tape for mixed content (obviously if it's all compressed media it will be much closer to native). 10-20 tapes doesn't seem that bad (we send that many offsite daily). The tapes will cost you ~$20 each unless you're willing to go used (ewww).

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  36. The Anti-Solution by mr.dreadful · · Score: 2

    Do you really need to back-up that much data?

    I'm just speaking generally here, there are certainly cases where someone would need to back up this much data, but for your home media library? If we're talking movies, 20 TB is roughly 20,000 movies (for sake of argument, I'm not considering music). At what point is this just digital hoarding? I used to keep a large collection of movies, mostly pirated, and eventually realized that:

    a) I was spending more time and money managing the collection then I wanted to. b) That I rarely watched many of the items in my library. c) That I was placing myself in legal jeopardy by storing so many illegal copies. d) Anything I did want to re-watch I could get from Netflix, the public library, or download.

    Music would be slightly different, as I could see where music is in some kind of constant rotation, but again, how much of it are you actively using? I'm just playing devil's advocate here, but I think this kind of collecting/hoarding is a byproduct of pre-internet scarcity.

  37. Re:$825 by mlts · · Score: 2

    Even with bandwidth, there are caps and fees here in the US. Try moving 1TB of data via LTE, and the telco will likely hand the person a five digit bill next month. Do it on some cable company plans, and you will be greeted by a $300 bill. So, large data via the Internet isn't going to happen.

    There are a number of solutions for this problem:

    1: One of the better ones is a server with decent backup software and a LTO tape drive. Then eight tapes will save the 20 TB. Expensive, but the job is done right.

    2: One can always have a 20TB RAID, then plug in removable HDDs and use them with WinRAR or another utility as volumes. I'd buy 5-6 4TB removable drives, use WinRAR to make segments, and have at least one recovery volume so that data can be recovered if a HDD fails.

    3: One can always buy a an external RAID enclosure, add drives, and use that as a large volume. Then use multiple enclosures that were swapped around as volumes (with an offsite rotation), so a failure or loss of everything at the site wouldn't mean everything is gone.

    4: Buy a RDX drive and media, and use 2TB disks. The drive costs $600, the 2TB disk cartridges around $360. However, this just needs a USB connection, no fast box, no SAS interface required.

    If I wanted to do the job "right", I'd buy a dedicated server with its own RAID array, use a decent backup utility, and dump the data to multiple sets of tapes, one set being stored offsite. If the server and where it sits gets destroyed, it can be re-bought. LTO-4 and newer have built in AES-256 tape encryption so just set a long passphrase that you can remember and call it done.

  38. Re:Go on the internet and find a DLT drive by Anne+Thwacks · · Score: 2
    LTO is 6.25TB per tape. Tapes are about $100. Drives are NOT cheap, but if you have 20TB now, I am guessing you will soon have 40TB

    The benefits of tape are:

    Data will probably last 30 years (I have read 30 YO tapes myself) HD interfaces go out of fashion every few years.

    You can have a pool of tapes, and recycle them when you no longer need the data.

    Tapes will survive serious abuse. (A lot more than HDs anyway) definitely included the back of a station wagon (except in tropical climates).

    You can use Amanda, Bacula or tar for free. (I recommend tar if you want to keep the data for 30 years).

    --
    Sent from my ASR33 using ASCII
  39. Re:Go on the internet and find a DLT drive by ewilts · · Score: 3, Informative

    You're dating yourself. LTO-5 is 1.5TB native, 3TB compressed at $25 per tape. LTO-6 is 2.5TB native and 6.25TB compressed. Both of those compressed numbers are using the built-in compression in the drive.

    A 10-pack of LTO-5 tapes is about $250.

    You can easily encrypt the tapes and tape them offsite. You can keep a copy onsite and offsite. You're simply not doing that with disk.

    Your speed is also off - an LTO-5 can write at 280MB/sec. The limiting factor is not the write time on the media but the read time from disk.

    Restore times are typically limited by the write rate on the destination raidset, not the read rate from tape.

    --
    .../Ed
  40. Re:Go on the internet and find a DLT drive by grumbel · · Score: 3, Informative

    LTO-6 can hold 2.5TB per tape, a tape cost ~$70, the drives cost $2000. That's still more expensive then just more HDDs for 20TB, but at >50TB it might be worth it.

  41. Re:math majors by Redmancometh · · Score: 4, Funny

    Now this joke has really come full circle.

  42. Plenty of entertaining and not practical solutions by anewsome · · Score: 2

    I really did get a kick out of some of these responses. I sell data protection products for a living and 20TB is what I would consider an average small/medium customer. Every business these days has tens of terabytes of data. Of course they all need to backup their data, so there is nothing novel here. We have plenty of customers backing up hundreds of petabytes of data. Every dataset just needs a plan for backup, pretty simple.

    The way I see it, this guy has a few options. One option is to just get more disk and make redundant a redundant copy. This would have have saved him in this case of the mistakenly erased raid, depending on how smart his sync script is. But a redundant copy is not a valid genuine backup plan. So many types of failures will show the holes of the dumb redundant copy.

    The other option for a home user who's not looking to spend a bunch of money, is LTO6. They hold a sufficiently large amount of data, so only a handful of tapes will be needed. LTO6 drives are cheap enough, they won't break the bank. Since the data is on tape, you can shuttle the tapes to an off site location. Seems pretty simple.

  43. Jacquard loom punch cards by drkim · · Score: 3

    I'm backing up my 40TB music library on Jacquard loom punch cards.

    Added bonus: You can use the punched cards to make fabric.
    Right now I'm wearing Justin Bieber's "Love Me" ...as a sweater!

    https://web.duke.edu/isis/gess...

  44. FlexRaid or UnRaid by cbeaudry · · Score: 2

    http://www.flexraid.com/
    http://lime-technology.com/ (UnRaid)

    Best solution for big media collections.

    All data is stored seperatly on each drive, and 1 separate parity drive can protect up to 21 drives (as long as its as big or bigger than any 1 of those 21 drives).