Slashdot Mirror


How Do You Backup 20TB of Data?

Sean0michael writes "Recently I had a friend lose their entire electronic collection of music and movies by erasing a RAID array on their home server. He had 20TB of data on his rack at home that had survived a dozen hard drive failures over the years. But he didn't have a good way to backup that much data, so he never took one. Now he wishes he had.

Asking around among our tech-savvy friends though, no one has a good answer to the question, 'how would you backup 20TB of data?'. It's not like you could just plug in an external drive, and using any cloud service would be terribly expensive. Blu-Ray discs can hold a lot of data, but that's a lot of time (and money) spent burning discs that you likely will never need. Tape drives are another possibility, but are they right for this kind of problem? I don' t know. There might be something else out there, but I still have no feasible solution.

So I ask fellow slashdotters: for a home user, how do you backup 20TB of Data?"
Even Amazon Glacier is pretty pricey for that much data.

983 comments

  1. Hmmm... by Anonymous Coward · · Score: 5, Funny

    I would say use floppies, but I'm kind of old and out of touch now.

    1. Re:Hmmm... by Anonymous Coward · · Score: 2, Funny

      I would say use floppies, but I'm kind of old and out of touch now.

      5 1/4" or 3 1/2"?

    2. Re:Hmmm... by DickBreath · · Score: 4, Funny

      Punched paper tape has better longevity than either floppies or optical media. You just need a really big roll and a lot of time.

      --

      I'll see your senator, and I'll raise you two judges.
    3. Re:Hmmm... by Anonymous Coward · · Score: 5, Funny

      5 1/4" or 3 1/2"?

      8". How young are you?

    4. Re:Hmmm... by Grisstle · · Score: 2, Funny

      Neither, 8" floppies would be the way to go.

    5. Re:Hmmm... by Anonymous Coward · · Score: 0

      8"

    6. Re:Hmmm... by Assmasher · · Score: 1

      I wonder how much energy taping out 20TB of data would take...

      --
      Loading...
    7. Re:Hmmm... by Primate+Pete · · Score: 4, Funny

      You can do the job with a mere 128 million single sided, single density 160K disks, like the one on the original IBM PC. When in doubt, go with proven technologies. Assuming a stack of 4 disks is 1cm thick, you should get away with around 1000 m^3 storage space.

    8. Re:Hmmm... by Anonymous Coward · · Score: 0

      8" ss-sd

    9. Re:Hmmm... by Anonymous Coward · · Score: 0

      please use standard units. How much is that in LoC?

    10. Re: Hmmm... by Anonymous Coward · · Score: 1

      Backblaze has unlimited storage. It would take some time for the initial backup but future incrementals would be quick.

    11. Re:Hmmm... by Bing+Tsher+E · · Score: 1

      Make sure you use the mylar tape, not that failure prone paper tape.

    12. Re:Hmmm... by Waffle+Iron · · Score: 3, Funny

      Neither, 8" floppies would be the way to go.

      Hard sectored or soft sectored?

      It would be best to decide up front before putting in the order for 80 million disks.

    13. Re:Hmmm... by Primate+Pete · · Score: 1

      Several?

    14. Re: Hmmm... by daninaustin · · Score: 0

      You'll need local storage. It won't backup your network drives. Other than that, Backblaze is great (and cheap.)

    15. Re:Hmmm... by Anonymous Coward · · Score: 0

      ...and be prepared to be shunned as a nut job.

    16. Re:Hmmm... by Thanshin · · Score: 1

      A station wagon full of tapes.

      Better storage AND better bandwidth!

    17. Re:Hmmm... by Cassini2 · · Score: 4, Informative

      At 10 characters per second, the backup would take 63,419 years(*) and require 659 TJ or 0.2 TWh of power to complete. I have a customer that still uses paper tape. It lasts and lasts, and I have only replaced the reader once. The punch needs a new power supply every 20 years or so.

      However, 63,419 years is a long time to wait for a backup to complete.

      (*) this assumes that 1 TB = 1,000,000,000,000 bytes. It takes almost 70,000 years if you add the extra 10%.

    18. Re:Hmmm... by Grisstle · · Score: 1

      Tough call, I'd go soft sectored as long as the floppy drive itself is efficient enough to cram as much data on the disk as hard sectored. All these thoughts of floppies and holes...just remember grower or shower you can't go wrong with an 8" floppy.

    19. Re:Hmmm... by Thanshin · · Score: 3, Informative

      20TB = 1.33LoC

    20. Re: Hmmm... by MightyYar · · Score: 4, Insightful

      Well, it's local storage to a computer somewhere. If that computer can run Backblaze, then super.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    21. Re:Hmmm... by K.+S.+Kyosuke · · Score: 3, Funny

      Yes, but if he's a Microsoft SW user, he *has* to use 5 1/4" floppies. You see, 8" floppies are insufficiently micro and 3 1/2" floppies are insufficiently soft.

      --
      Ezekiel 23:20
    22. Re:Hmmm... by sdo1 · · Score: 4, Insightful

      Which, if it takes about 1 minute to load each one, it will take you a mere 243 years to do the backup.

      --
      --- What parts of "shall make no law", "shall not be infringed", and "shall not be violated" don't you understand?
    23. Re: Hmmm... by fuzzyfuzzyfungus · · Score: 1

      I haven't checked; but while Backblaze definitely doesn't back up UNC-pathed SMB shares or (ugh, mapped drive letters), is it smart enough to check whether a disk is 'real' or just an iSCSI initiator pointed at a suitable target? Even the cheap-ass versions of Windows come with a bundled initiator, though the target is server only...

    24. Re:Hmmm... by Lumpy · · Score: 4, Informative

      Punch the hole and you can flip them over to double your capacity.

      --
      Do not look at laser with remaining good eye.
    25. Re:Hmmm... by azav · · Score: 1

      Real men use cassette tapes on their TRS-80.

      --
      - Zav - Imagine a Beowulf cluster of insensitive clods...
    26. Re:Hmmm... by K.+S.+Kyosuke · · Score: 4, Funny

      Punched paper tape has better longevity than either floppies or optical media.

      If you're going for *actual* longevity, you can't beat fired clay tablets. (Yeah, I know they weren't fired originally, but you have to decide how much you value your MP3 files. I'd certainly take the extra time!)

      --
      Ezekiel 23:20
    27. Re:Hmmm... by Chrisq · · Score: 3, Informative

      According to this site 2560 Meters = 1 Meg. So that would be 20,000,000 times 2560 = 51,200,000,000 metres = about a third of the way to the sun

    28. Re:Hmmm... by NatasRevol · · Score: 2

      This was the greatest trick ever.

      --
      There are two types of people in the world: Those who crave closure
    29. Re: Hmmm... by Anonymous Coward · · Score: 0

      8-inch, of course.

    30. Re:Hmmm... by Vitriol+Angst · · Score: 1

      So this means he'll have to do a RAID array of punch tape.

      Also, he's going to need a warehouse to store it. So it's going to cost a few million at the least to make really, really sure a hard drive doesn't go down.

      --
      >>"ad space available -- low rates!!!"
    31. Re:Hmmm... by emag · · Score: 1

      You haven't seen how long my backups take at work...

      --
      "The urge to save humanity is almost always a false front for the urge to rule." --H.L. Mencken
    32. Re:Hmmm... by NeverVotedBush · · Score: 1

      It has the added ability of being readable by eye if your opto-mechanical reader ever fails.

      But I suggest punch cards as they seem more durable.

    33. Re:Hmmm... by jfdavis668 · · Score: 1

      Do they make 5 1/4 or 8" external USB drives? I don't think either one would fit in a modern home computer.

    34. Re: Hmmm... by Immerman · · Score: 1

      > It would take some time
      Indeed. At 1Gbps, neglecting overheard, that's only ~45 hours. But for me on my poor 2mbps connection it'd be 2.5 years.

      Also their security seems slightly iffy, if that's a concern. When backing up they claim to do all encryption on your machine, then send the encrypted data over SSL to the servers (Why SSL if their encryption is secure? I suppose another layer of encryption never hurts.) but for restoring they unencrypt on their servers then send you a .zip of the desired files over SSL. You do have the option of adding another level of encryption keyed to a passphrase they promise they won't store - though to get your data you have to send the passphrase to them so they can again do server-side decryption.

      Not a huge issue I suppose - if security is important after all the rule is generally do your own encryption before you let any backup service anywhere near your data. Still it would be nice if a service like this would offer the option of a layer of fully client-side encryption/decryption - presumably on a per-file or per-folder basis (whatever the smallest restorable chunk is), even if you'd still have to trust them not to discretely send themselves your passphrase.

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
    35. Re: Hmmm... by Anonymous Coward · · Score: 0

      Real men use tape cassette on their ZX-81.

    36. Re:Hmmm... by mykie242 · · Score: 0

      Paper degrades over time, nickel disc is the only way to go for long-term longevity.

    37. Re:Hmmm... by Rhaban · · Score: 5, Funny

      Seed a torrent of it as an encrypted file named "porn.zip" or similar. You'll have it backed up on the cloud for free in no time and available for all of eternity.

    38. Re:Hmmm... by SpzToid · · Score: 1

      Um, ok, but can I still use zfs (like the FreeNAS), or perhaps ext4, or can I even use linux for this archival application using such ancient hardware?

      Any suggestions for the parity errors I'm likely to endure sooner or later? Of course knowing how to deal with those in-advance would be a God-send right about now.

      --
      You can't be ahead of the curve, if you're stuck in a loop.
    39. Re:Hmmm... by Jason+Levine · · Score: 1

      Bah. I say you engrave the ones and zeros directly on stone tablets. If you engrave each one/zero at around 12 point size, then each one/zero would be about 4.5 millimeters tall. Add another millimeter and a half on the bottom for spacing between lines, so 6 millimeters per line. At about 80 characters per line, we would need 9 trillion lines. So we just need a stone tablet 54 million kilometers tall which is coincidentally about as close as the Earth and Mars are at their closest.

      --
      My sci-fi novel, Ghost Thief, is now available from Amazon.com.
    40. Re:Hmmm... by Anonymous Coward · · Score: 0

      Why punching.

      Write the 0s and 1s with a pencil.

    41. Re:Hmmm... by Anonymous Coward · · Score: 0

      Fun Fact: This many floppies would be a brick 50X50X25 feet in size.

    42. Re:Hmmm... by Anonymous Coward · · Score: 0

      Actually --- Stone tablets are even better.

    43. Re: Hmmm... by Applehu+Akbar · · Score: 1

      Crashplan does back up NAS shares, but given its internal overhead it only sips data at the rate of about three months per terabyte. That initial backup of your RAID could be accomplished in a scant five years, so long as your provider doesn't impose a usage cap.

    44. Re:Hmmm... by Bob+the+Super+Hamste · · Score: 1

      And since I recently found an old 5 1/4" floppy drive of mine I now want to put it in my new computer, too bad the computer doesn't have a floppy drive connector on the board since the drive probably would still work.

      --
      Time to offend someone
    45. Re: Hmmm... by rthille · · Score: 1

      SSL to their servers prevents tampering in-route. Now they could have used their own wire protocol to ensure the data wasn't altered on the way, but why reinvent the wheel?

      --
      Awesome furniture, accessories and cabinetry in Santa Rosa, CA: http://humanity-home.com/
    46. Re:Hmmm... by Mordok-DestroyerOfWo · · Score: 4, Funny

      Once again a slashdotter is advocating to take away a musician's right to make a prophet.

      --
      "Never let your sense of morals prevent you from doing what is right" - Salvor Hardin
    47. Re:Hmmm... by Anne+Thwacks · · Score: 4, Insightful
      LTO6 tapes hold 6.25 TB, so you probably only need 4. Hardly fills a station wagon - the basket of a push-bike would do fine.

      Of course, you might want to make more than one copy of your data. https://www.amazon.co.uk/dp/B006K1FSKA

      --
      Sent from my ASR33 using ASCII
    48. Re:Hmmm... by sensei+moreh · · Score: 1

      As I recall, those 8" floppies really didn't hold much data (I'm too old to recall how much they did hold)

      --
      Geology - it's not rocket science; it's rock science
    49. Re:Hmmm... by Mashiki · · Score: 1

      Pft. Real men use their vic-20's, without the ram expander cartridge.

      --
      Om, nomnomnom...
    50. Re:Hmmm... by flyingfsck · · Score: 1

      Obviously hard sectored, single sided.

      --
      Excuse me, but please get off my Pennisetum Clandestinum, eh!
    51. Re:Hmmm... by Anonymous Coward · · Score: 0

      That book sucks.

    52. Re:Hmmm... by lanswitch · · Score: 1

      A couple of thousand Dropbox accounts.

    53. Re:Hmmm... by Old97 · · Score: 3, Informative

      The first 8" I used held 128k. The last one had a capacity of around 1.2 mb. They were twice as fast (transfer rate) as those new fangled 5 1/4 inch floppies. Kids, what do they know?

      --
      Very often, people confuse simple with simplistic. The nuance is lost on most. - Clement Mok
    54. Re:Hmmm... by distilate · · Score: 1

      What is the average $/LoC of data stolen from a media organisation?

    55. Re:Hmmm... by dr.Flake · · Score: 1

      I would say, bit by bit.......

      --
      Why are other peoples sig's always more witty ???
    56. Re:Hmmm... by idontgno · · Score: 4, Informative

      At the very end of the 5 1/4" floppy era, the "High-Density" floppy used the same data rate, tracking, and recording density as the 8" 1.2M floppies. They were, in fact, 1.2M 5 1/4" floppies. Which is why their formatted capacity was different from 3.5" "high-density" equivalent, 1.44M.

      Other than electrical needs (as 8" floppies often had their spindle motors directly powered by 120VAC line current), the high-density 5 1/4"s were used as a drop-in replacement for 8" floppies in the hobbyist retrocomputing community. (Not collectors, though; they'd want to keep the gear as cherry as possible.)

      --
      Welcome to the Panopticon. Used to be a prison, now it's your home.
    57. Re:Hmmm... by Anonymous Coward · · Score: 0

      Yes! I remember being in absolute awe that that would work.

    58. Re:Hmmm... by Chrutil · · Score: 1

      Bet you most people here have no idea what you meant.

    59. Re:Hmmm... by Anonymous Coward · · Score: 0

      Oh I've heard this one before... One "byte" at a time.

    60. Re:Hmmm... by operagost · · Score: 1

      READY.
      SAVE "20TB

      PRESS RECORD AND PLAY ON TAPE
      SAVING 20TB

      [wait until heat death of universe]
      READY.

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    61. Re:Hmmm... by SuricouRaven · · Score: 1

      High-power communications laser directed out into space.

      Catching up with your data might be more of a problem.

    62. Re:Hmmm... by confused+one · · Score: 1

      If you're going for longevity and survivability, you need to use stainless steel foil.

    63. Re:Hmmm... by rjune · · Score: 1

      I would recommend 8" floppies. I have a couple of drives and disks available for a very reasonable price.

    64. Re:Hmmm... by rjune · · Score: 1

      The DS DD's held a whopping 1.2 MB

    65. Re:Hmmm... by Ol+Olsoc · · Score: 1

      At 10 characters per second, the backup would take 63,419 years(*)

      Gee, he'd have to become a vegan to live that long also.

      --
      The shepherds did so well protecting the flock that the sheep no longer believed that wolves existed.
    66. Re:Hmmm... by GTRacer · · Score: 1

      I'm getting the feeling the sequel of Asimov's "The Last Question" could be wrung out of this scenario...

      "Let there be data!"

      --
      Defending IP by destroying access to it? That makes sense, RIAA/MPAA. Go to the corner until you can play nice!
    67. Re:Hmmm... by Defenestrar · · Score: 1

      Meh - doubling capacity is so last century. Look at optical media - just by punching a hole in the middle you go from zero useful storage to a lot!

      P.S. Ok, so I punched a hole in my Bards Tale character disk so my sister could have her own side (and not screw up my stuff).

    68. Re:Hmmm... by Anonymous Coward · · Score: 0

      LTO6 is 2.5 TB - please don't be fooled by 6.25 TB "compressed" numbers - YMMV. Sure, you'll get lots of compression of text and other non-compressed stuff, but anything which is already compressed won't compress well in the drive's compressor, including tar.gz, or bz2, or whatever file compression you choose. There are even situations where the "compressed" size is slightly bigger than an already compressed original.

      I'm only a moderate tape user, so I'm sure others will have other experiences, but I've had occasional difficulty restoring compressed tapes. Often the compression of one model of drive doesn't de-compress well (or at all) on another drive. So if you use drive's compression, be sure to have 2 spares of the exact same drive model.

      IMHO, better not to use drive compression at all. I normally use tar.gz - seems to be most reliable and in my mind, tape backups are for data integrity, not winning a numbers game.

    69. Re:Hmmm... by Anonymous Coward · · Score: 0

      Yeah, rotate all the dirt caught in the protective sleeve back out. Seriously, double-sided floppy drives were not that expensive. I paid something like €400 for each of mine. Put two in a nice case, whip up a +12V/+5V power supply for them. Standard stuff.

    70. Re:Hmmm... by ewhac · · Score: 3, Interesting
      I had only hard of LTO tapes quite recently, and I did a very tiny bit of poking around. The latest generation is LTO-6, whose tapes can hold 2.5TB each (uncompressed). The tapes themselves are quite modestly priced -- an LTO-4 tape cartridge (800GB uncompressed) costs about $30 each.

      The drives, however, are not cheap. New drives appear to start at around $1200. Used drives are all over the place -- I've seen some on eBay with an opening bid as low as $350. Also, all LTO drives appear to have either an LVD SCSI or a SAS interface, which means you'll also need a controller card. There appears to be no such thing as a SATA LTO drive.

      Plus you get to re-live all the joys of selecting tape vendors, and placing bets on whose tapes are going to last for 20 years.

    71. Re:Hmmm... by Anonymous Coward · · Score: 0

      You can always go bofh and forward it to /dev/null
      Makes miracles for speed ;)

    72. Re:Hmmm... by Anonymous Coward · · Score: 0

      I have bought a 5.25" DSDD (40 tracks) drive for about half a month's net salary. It held 360K. That was a much better buy than the SSSD (35 tracks)
      drives that Tandy offered at 30% more.

    73. Re:Hmmm... by Anonymous Coward · · Score: 0

      Prophet?
      Deer Jesus!
      http://www.smbc-comics.com/index.php?db=comics&id=3281

    74. Re:Hmmm... by Anonymous Coward · · Score: 1

      Heck, several millennia ago Moses backed up his downloads on stone and from what I can tell his torrents are still around.

    75. Re:Hmmm... by crakbone · · Score: 1

      Maybe this would work? http://www.kryoflux.com/

    76. Re:Hmmm... by jabuzz · · Score: 1

      I would imagine the cheaper drives on eBay will be the ones with the Fibre Channel interface, because most people will run a mile on seeing that. So I see a 48 slot library with two LTO4 drives going for 2000USD.

      The trick to realize is that even dual 4Gbs PCIe FC card can be had for buttons, and the drive an library will work perfectly well in an arbitrated loop, no requirement for a switch and OM2 LC-LC cable is also cheap.

      New LTO4 tapes are ~30USD labelled, I reckon on 1:1.25 compression ratio so that is around 50TB of backup capacity, not bad for under 4000USD investment.

      The next trick is to realize that in a few years you can swap out the LTO4 drives for LTO6 and keep using the library...

    77. Re:Hmmm... by Iniamyen · · Score: 1

      Brilliant.

    78. Re:Hmmm... by lgw · · Score: 1

      That player in the modern home computer for CD-ROMs and DVDs? That's the 5 1/4 (half height) form factor. Laptops don't have them of course, but most modern 'box PCs" have at least 1 5.25 half-height slot, often still 3.

      1.75" remains an important industrial design measurement, from being 1 "U" in a rack, to a variety of internal form factors being multiples of it.

      --
      Socialism: a lie told by totalitarians and believed by fools.
    79. Re:Hmmm... by lgw · · Score: 1

      If you're going for *actual* longevity, you can't beat fired clay tablets. (Yeah, I know they weren't fired originally, but you have to decide how much you value your MP3 files. I'd certainly take the extra time!)

      The military did research on how to preserve critical "rebuild civilization" data in a form that would last for centuries and have the best density that was still possible for a human to read. Presumably a DARPA project. They ended up with (non-ferrous) metal punch tape. Seems right to me.

      --
      Socialism: a lie told by totalitarians and believed by fools.
    80. Re:Hmmm... by Steveftoth · · Score: 1

      It also didn't work unless you had high quality floppies. Many ones I tried it with were write once, read never cause the data degraded on the other side since the disks we not rated for double sided.

    81. Re:Hmmm... by Talderas · · Score: 1

      I'd provide about 14 donations to the local sperm bank... that's 20 TB of my genetic material backed up.

      --
      "Lack of speed can be overcome. In the worst case by patience." --Znork
    82. Re:Hmmm... by lgw · · Score: 3, Informative

      Tape drives need the full SCSI command set, not the trimmed version that made it in to SATA (I'm not sure there's even a "(01) REWIND" supported in SATA).

      LTO tapes stored reasonably ("keep in in a cool dry place", as the song goes) should last 15 years from any vendor, as that's in the spec, and there aren't really bottom-feeder vendors for LTO.

      --
      Socialism: a lie told by totalitarians and believed by fools.
    83. Re:Hmmm... by OhSoLaMeow · · Score: 1

      Once again a slashdotter is advocating to take away a musician's right to make a prophet.

      Well done. I'd +1 but you're already at 5.

      --
      They can take my LifeAlert pendant when they pry it from my cold dead fingers.
    84. Re:Hmmm... by flyingfsck · · Score: 1

      You can use par2 or rsbep to protect your floppies against bit rot.

      --
      Excuse me, but please get off my Pennisetum Clandestinum, eh!
    85. Re:Hmmm... by drummerboybac · · Score: 1

      I think you understimate this crowd somewhat. I'm 32 and I remember that, I'd guess that the majority of people here are at least 32.

    86. Re:Hmmm... by davester666 · · Score: 1

      Why not punch tape? then you are only limited by the size of the boxes you spool it into.

      --
      Sleep your way to a whiter smile...date a dentist!
    87. Re:Hmmm... by davester666 · · Score: 1

      Oh no, he/she can still bang whomever they fancy.

      --
      Sleep your way to a whiter smile...date a dentist!
    88. Re:Hmmm... by CityZen · · Score: 1

      The 6.25TB is "compressed" capacity, while the "native" capacity is 2.5TB.
      That tape cartridge will cost you about $70, plus you need a drive for it (about $2000).
      A 3TB hard drive goes for about $100.

      To handle 20TB:
      7 x 3TB HD's = $700.
      8 x 2.5TB tapes + 1 tape drive = $2560.

      You'd need quite a few copies of the data before the tape drives make more sense economically.

    89. Re:Hmmm... by Anonymous Coward · · Score: 0

      This was the greatest trick ever.

      That's what your mother said.

    90. Re:Hmmm... by Anonymous Coward · · Score: 0

      Just make sure your source drive is fast enough to feed it SSD or a raid array otherwise you'll wear out the drive pretty fast, unless they have them fixed now where they will speed match.

    91. Re:Hmmm... by cogeek · · Score: 1

      He did say station wagon, maybe he was referring to 8-track tapes....

    92. Re:Hmmm... by AlphaWolf_HK · · Score: 1

      Football fields IMO.

      --
      Careful with names containing L slashdot.org/~AiphaWolf_HK slashdot.org/~AlphaWoif_HK slashdot.org/~AiphaWoif_HK
    93. Re:Hmmm... by Anonymous Coward · · Score: 0

      If he has 20TB of HDD's then price is likely not an issue.

    94. Re:Hmmm... by mbone · · Score: 1

      Punched paper tape does not age well - in a few years it starts to crack on characters that are mostly 1's, and then you have a mess.

      (No joke - when I was in grad school they hired an undergrad for the summer to recover punched paper tape data. The tape, about 5 years old, had broken into ~4 foot long stretches, and he was paid to feed each stretch through a reader, figure out and type the character punched at each break, and do that full time for 3 months.)

      Punched cards are much more robust (as long as you don't get them wet) and you can put 120 kB in a box not much larger than a shoe box.
      20 Terabytes would only require 463,910 m^3 of cards! That's only 320 kilotons of cardboard.

      Or the guy could just buy 20 TB of hard drive, and be done with it.

    95. Re:Hmmm... by AlphaBravoCharlie · · Score: 1

      I'd recommend paper tape, but most people likely don't know what it is.

    96. Re:Hmmm... by Anonymous Coward · · Score: 0

      And a long time to wait for a restore, without verify

    97. Re: Hmmm... by Delarth799 · · Score: 1

      If I remember right you can have CrashPlan ship you some drives and backup to them locally and ship them back. Of course that costs money but if you need to backup a lot of data and quickly, it is an option to consider.

    98. Re: Hmmm... by Applehu+Akbar · · Score: 1

      This capability is called Seed Disk, and you can do it once for $100 one-time extra. But you have to use Crashplan's own drive, which is 1T max. In the OP's case, that will shave about four months off his five year backup time.

    99. Re:Hmmm... by Anonymous Coward · · Score: 0

      Which, if it takes about 1 minute to load each one, it will take you a mere 243 years to do the backup.

      and I wonder how long it would take to watch/look at all the media that's being stored. probably ain't possible in a lifetime - probably don't need to back it all up...

    100. Re: Hmmm... by Anonymous Coward · · Score: 0

      No BBC

    101. Re:Hmmm... by Lisias · · Score: 1

      Don't be a cheap bastard. Buy a DAT tape driver!

      --
      Lisias@Earth.SolarSystem.OrionArm.MilkyWay.Local.Virgo.Universe.org
    102. Re: Hmmm... by Anonymous Coward · · Score: 0

      Bitrott will be your problem

    103. Re: Hmmm... by Anonymous Coward · · Score: 0

      I'm pretty sure you didn't consider leap years. It would take about 1/1506 times less that you calculated. 63,419 - 42 = 63,377.

    104. Re:Hmmm... by dbIII · · Score: 1

      SAS is pretty cheap now if you only need to hang one thing off it.

    105. Re:Hmmm... by dbIII · · Score: 1

      On the topic I've got a few dozen boxes of reels of tape from the 1980s that were not quite stored reasonably (kept in a hot humid place for a few years) and a few dozen have been read in with no problems over that last decade. Of course that's with extreme care, bits would probably flake off everywhere if they were just spooled up with no preparation.

    106. Re:Hmmm... by Neil+Boekend · · Score: 1

      about a third of the way to the sun

      DON'T let go of the end!

      --
      Well, I might have a way, but it only works on a semi spherical planet in a vacuum.
    107. Re:Hmmm... by Neil+Boekend · · Score: 1

      Be sure to use Hastelloy or a similar quality stainless steel. Normal SS 316 (for example) still corrodes in some circumstances (mainly with high humidity and salinity in the air).

      --
      Well, I might have a way, but it only works on a semi spherical planet in a vacuum.
    108. Re:Hmmm... by deroby · · Score: 1

      I would propose to measure *away* from the sun as you risk setting it on fire coming too close, paper tape and all...

      Then again, about a third away from the sun doesn't sound right.
      (yes, I really should go back to work)

      --
      If there is one thing to be learned on slashdot, it has to be sarcasm.
    109. Re:Hmmm... by jhumkey · · Score: 1

      Tried that on the old IBM Series/1 EDX boxes we had . . . the "other side" worked . . . but, LOL, the read/write heads were so powerful . . . it would erase side 1 when you used side 2.
      So . . . it let us get "another life" out of the disk when side 1 went bad . . . but we could never use two sides at once.
      (Of course, these were the same computers that had belt driven Hard Drives, that had to have speeds set with a tachometer.)
      Ahhh, the good ole days . . .

      --
      No, I don't remember your name. But the memory mapped screen on a TRS80 from 1977 is from 15360 to 16383 if that helps.
    110. Re: Hmmm... by Anonymous Coward · · Score: 0

      I just did, works like a charm! Do you have a one liner to test recovering some files at random?

    111. Re:Hmmm... by Reziac · · Score: 1

      The problem with the whole backup tape thing is that the total solution tends to exceed the cost of just buying more hard disks, and is often so much slower. :( And in 20 years, will you still have an interface that can handle the hardware??

      Dunno about now, but back in the days of QIC-80, the only tapes that were worth a shit were Sony. Number of bad blocks when first formatted (don't forget to format your tapes!) and likelihood that the tape would be bad later on correlated pretty well, in my experience. Sony consistently had no more than four bad blocks per tape (the only one I ever saw with more, I returned as defective), vs 20 to 300 for other brands.

      And all I could find for my Travan drive were TDK, tho they seemed to be okay.

      Great, now I've got kids on my lawn again...

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    112. Re:Hmmm... by Anonymous Coward · · Score: 0

      Use 5 disks of 4 TB

    113. Re:Hmmm... by Anonymous Coward · · Score: 0

      I use a 9-Track tape system ... :) :) :) .. and I am OLD!

    114. Re: Hmmm... by Anonymous Coward · · Score: 0

      8" I go way back

    115. Re:Hmmm... by vandamme · · Score: 1

      I still have one of those punches. Doesn't get a lot of use any more. Kids nowadays don't appreciate ....what was I talking about?

    116. Re: Hmmm... by Anonymous Coward · · Score: 0

      That would only give you 3.6379788071TB of storage space. That's not incluing partition tables using up space as well.

    117. Re:Hmmm... by Anonymous Coward · · Score: 0

      I have used 64k 8", anyone can beat!

    118. Re:Hmmm... by Gallomimia · · Score: 1

      Don't worry. The solar wind will push it that way anyways.

      --
      Sadly, a Libertarian cannot force his views on another, and freedom cannot spread as does the cancer known as religion.
    119. Re:Hmmm... by Anonymous Coward · · Score: 0

      Seed a torrent of it as an encrypted file named "porn.zip" or similar. You'll have it backed up on the cloud for free in no time and available for all of eternity.

      That is so hilarious. Is a 5/5 for funny the highest we can give it?

    120. Re:Hmmm... by Anonymous Coward · · Score: 0

      Don't be silly - punchcards rule...

    121. Re: Hmmm... by nickrao · · Score: 1

      Luv it. How many 80 col cards on that diskette? Think carefully.

    122. Re:Hmmm... by Wolfrider · · Score: 1

      --I agree with you that clay or stone tablets stand the test of time. The problem becomes: how much data can you reliably store on such a thing, how thick should the stone be, how small should the bits be, where do you store it, and how conveniently can it be restored?

      --M-discs are promising. They are basically stone-based Blu-ray DVD discs. Last I heard, they will have a 25GB version shipping as of March 20, 2014. I plan to get a set of 3 and burn the stuff that I've filed under NEVERLOSE - which really isn't all that much, it fits under 25GB. (I have multiple TB of JBODs at home but it's mostly VMs and a few movies.)

      --The thing to do that most people don't think about, is PARE THE DATA DOWN. Does one average human being REALLY need 20TB of music, or pictures, or anything? Doubtless there are quite a bit of things in that pile that aren't in the top 25% of what you really enjoy the most.

      --How do you even keep track of all that in your head? If you put all that music on random play, it would most likely go for more than a week or two. And I guarantee there would be songs in there that you wouldn't enjoy listening to that much - most albums are not 100% great.

      --Separate out the stuff you enjoy less than ~65-75% of the time. Put it on secondary storage. Now you have less to back up.

      --The only thing with current Blu-ray drives is that they're slow reading the data back. Linux is actually faster than Win7 when it comes to reading UDF discs on the same equipment, in my experience. Play around a bit, and get the data separated out so you can restore what you need the MOST in a certain period of time. YMMV.

      --
      .
      == WolfriderV6 == I'm willing to admit that *I just might* be wrong... Are you??
    123. Re:Hmmm... by BlackHawk-666 · · Score: 1

      And for the brave, formatting their SSSD disks as DSDD.

      --
      All those moments will be lost in time, like tears in rain.
    124. Re:Hmmm... by Zeromous · · Score: 1

      I definitely recommend LTO. A great format and reasonable speed (you should be able to restore this over the course of a few days via SCSI). If this guy has 20TB of data in racks, maybe it's not really too much of an investment. (It's also cheap to expand).

      --
      ---Up Up Down Down Left Right Left Right B A START
  2. reduce the amount by JeffSh · · Score: 3, Interesting

    At home, I didn't feel like paying for 2 large arrays to store my data, so if I rip any media, I always rip it to DIVX. 800 MB for a DVD or even bluray rip is a great economy, saves me money on primary storage and also enables me to back it up. I accept the loss of quality as I can always reference the original media if I want.

    Another option in the future may be subscription services which have HD content, thus eliminating my need to roll my own. We'll see what happens there.

    1. Re:reduce the amount by Anonymous Coward · · Score: 0

      I always rip it to DIVX

      2005 called, they want their crap back.

      Another option in the future may be subscription services which have HD content

      Assuming they have the content you want. Which they usually don't.
      And assuming you live in the US, which 95% of the world population don't. There simply aren't useful services outside the US, if any.

    2. Re:reduce the amount by buchner.johannes · · Score: 3, Informative

      20TB is not out of the world. With a RAID of 4TB disks you can cover that at home, and it doesn't need to be on all the time. Maybe you can reduce the amount of disk usage by reducing duplicate content using bup or an appropriate FS.

      --
      NB: The message above might reflect my opinion right now, but not necessarily tomorrow or next year.
    3. Re:reduce the amount by cdrudge · · Score: 5, Funny

      I always rip it to DIVX. 800 MB for a DVD or even bluray rip is a great economy,

      I do the exact same thing with high res pictures. I immediately will take the full resolution raw image and convert it down to a 320px gif. Or maybe a 10% quality jpeg. You get great economy that way too. Who wants to keep a 30+MB image around when you can have almost the same thing in 10kB instead!

    4. Re:reduce the amount by Anonymous Coward · · Score: 0

      well, a vpn is a useful service outside of the us

    5. Re:reduce the amount by edxwelch · · Score: 3, Insightful

      > I always rip it to DIVX. 800 MB for a DVD or even bluray rip is a great economy
      I do that as well, but I found out to my horror that all my DVD's had become unreadable over time. So, probably good idea to test your backups from time to time

    6. Re:reduce the amount by JeffSh · · Score: 0, Troll

      disingenuous sarcasm never got anyone anywhere, so good luck with that.

    7. Re:reduce the amount by SJHillman · · Score: 1

      CD-R/DVD-R has a shelf life of about 5 to 10 years. Beyond that, your data will probably still be readable but it's going to become hit-or-miss. Factory-stamped DVDs do have a much longer shelf life, but that's not really relevant here. Hard drives and magnetic tape should retain data reliably for 10 to 20 years, making them a better option.

    8. Re:reduce the amount by maeka · · Score: 1

      disingenuous sarcasm never got anyone anywhere, so good luck with that.

      As if suggesting DivX wasn't sarcasm...

    9. Re:reduce the amount by FreonTrip · · Score: 3, Informative

      I'd switch to x264 in an MKV container - you can get the same quality in about 3/4ths the file size without even being clever.

    10. Re:reduce the amount by cdrudge · · Score: 5, Insightful

      I wasn't looking for it to go anywhere really other than pointing out the absurdity of saying that taking a bluray rip down to a 800MB divx rip results in just an acceptable loss of quality.

      I'm by no means a audio/videophile snob, but you either have a blind and/or deaf if you can't see a MAJOR quality deficiency with a 800 bluray rip. What's the point of having a bluray movie if the first thing you normally do is make it look like crap?

    11. Re:reduce the amount by NatasRevol · · Score: 2

      Oh yeah, 20TB over VPN is *very* useful. /snicker,snicker

      --
      There are two types of people in the world: Those who crave closure
    12. Re:reduce the amount by ZahrGnosis · · Score: 1

      I like the compression idea. Blu Ray media is cheap per GB, and it's a nice alternative to HDDs if you want to keep two media types (i.e. to have one that resists magnetic damage). Given that media libraries don't change frequently (other than adding new content), this might make some sense.

      Still, copying to BR is slow and loathesome, and at 50GB each (or 25 if you go the cheaper route) for home burning it will take a while to off load. Of course, once you have it, you have it.

      External hard drives in an offsite vault may be easier; you may be quick to dismiss them. Archive by date or by content and a handful of cheap drives will get it done in such a way that you can easily find what you're looking for if you have to go searching for individual files, and it's quick to recover. You could probably get a backup for less than a grand and it will be pretty fast and easy to store and move.

      Paid cloud storage is fine for this, but honestly the bandwidth has never worked for me for that capacity; "never underestimate the bandwidth of a station wagon filled with," well, hard drives. I'd archive the most recent stuff to the cloud (because that's the stuff at the most risk if you haven't off-sited it yet),

    13. Re:reduce the amount by hairyfeet · · Score: 1

      I have DVD backups nearly a decade old but I follow two VERY important rules...1.- Take a few discs from each batch at random and test them every 6 months or so, if a disc has any read errors? that batch needs replacing. 2.- the most important of all...AVOID CRAP DVDs!!!! Now just because its cheap does NOT mean they are crap, likewise expensive doesn't always mean good. I've had no problems with the memorex and amazon basics for example. The brands I actively avoid are Ilo,Best Buy/Staples store brands, and while Rosewill are fine for DVDs you hand to people I don't think I'd want to trust them long term.

      So follow those simple rules, along with the common sense keeping them in a dark cool place and your DVD backups will easily outlast your desire to keep the old junk backed up. As for TFA? Like for like is really the only way to go with that much data. Buy a pile of HDDs (most will say 4Tb but you can buy 2 2Tb for the price of a single 4Tb so that would be the smart money) and slap 'em in a full size case with a cheap board to be the controller/server and Bob's your uncle. For a board I'd probably go with this. Its cheap, plenty of SATA and more importantly 3 PCIe 1x slots for adding more SATA. For a backup/file server it'd be just perfect.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    14. Re:reduce the amount by Artraze · · Score: 3, Insightful

      Agreed.

      Regardless of whether or not 20TB is hording / excessive / inefficient, what it almost certainly is is replaceable. Let's face it, you aren't CERN, most of you data is probably media that you can reacquire with relative ease. It's not being stored because it's irreplaceable it's being stored because it's convenient. A RAID isn't too bad, but add in managing backups and where has that convenience gone? If it costs $10+/month to backup your ripped/downloaded movies, why not just sign up for Netflix?

      Just make a list of all the replaceable data (e.g. videos you have the original disc for) you have and then buy an external hard disk / Blurays to back up the rest. If you lose your RAID, well, it'll be annoying to rebuild, but you built it once... (Besides, I doubt you could restore 20TB over residential internet less time!)

    15. Re:reduce the amount by emag · · Score: 1

      Indeed, this weekend I just brought a 8x 4TB RAID6 array online at home. I'm wondering what approach I'll be taking myself.

      --
      "The urge to save humanity is almost always a false front for the urge to rule." --H.L. Mencken
    16. Re:reduce the amount by Anonymous Coward · · Score: 0

      Sadly you won't find much dupe data in music or movies. I too have this same issue but with about 50% more data. I simply don't back it up and am smart enough not to format or delete it!

    17. Re:reduce the amount by jedidiah · · Score: 2, Insightful

      If you've already got one large array then you are already by definition half way there. If you then decide not to go the rest of the way then you are at the same time being both extravagant and a cheap bastard. It's a wonderfully stupid paradox.

      If you've got one then you should get the 2nd one or not bother with the first one to begin with.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    18. Re:reduce the amount by squeeze69 · · Score: 1

      Funny, btw, for jpeg and mp3 backup, there is at least one lossy compression solution: http://www.elektronik.htw-aale... For up to date code take a look at the Matthias http://www.matthiasstirner.com... There are pre-built (command line) windows exe It's LGPL (and GPL) code fairly portable, alas, the compression isn't streamable (so the inclusion in some packers is quite hard).

    19. Re:reduce the amount by Anonymous Coward · · Score: 0

      Buy another. Not being smarmy there. It's either that or get a similar RAID0 with slightly less capacity (i'll guess RAID striping eats some space) and use that for backups. Turn it off when the backup's done.

    20. Re:reduce the amount by Mordok-DestroyerOfWo · · Score: 2

      Agreed.

      Regardless of whether or not 20TB is hording / excessive / inefficient, what it almost certainly is is replaceable. Let's face it, you aren't CERN, most of you data is probably media that you can reacquire with relative ease. It's not being stored because it's irreplaceable it's being stored because it's convenient. A RAID isn't too bad, but add in managing backups and where has that convenience gone? If it costs $10+/month to backup your ripped/downloaded movies, why not just sign up for Netflix?

      Just make a list of all the replaceable data (e.g. videos you have the original disc for) you have and then buy an external hard disk / Blurays to back up the rest. If you lose your RAID, well, it'll be annoying to rebuild, but you built it once... (Besides, I doubt you could restore 20TB over residential internet less time!)

      Some people have different use cases. A few years back I was visiting a friend in the boonies in Egypt and brought a TB of American movies and music along explicitly for her (she was putting me up for free while I was on a research project). With my 50Mbps connection and 250GB monthly cap, I could recreate the entire shebang in 4-5 months, but with her iffy ISP, she couldn't hope to download everything from me in her government's lifetime.

      --
      "Never let your sense of morals prevent you from doing what is right" - Salvor Hardin
    21. Re:reduce the amount by Anonymous Coward · · Score: 0

      Holy Jesus, drop DivX for x264 at least, for all that is holy. The difference is night and day, either in size or quality, or even both.

    22. Re:reduce the amount by Pigeon451 · · Score: 2

      x264 is great, but why mkv? I've had terrible compatibility with mkv files in portable players or tablets, and proprietary software. Many won't even recognize them. I rarely have issues with mp4. Sometimes a user has no choice on software/hardware.

    23. Re:reduce the amount by jbaltz · · Score: 1

      Indeed - just to move the 20TB somewhere is going to take quite a bit of time; put those data in the cloud and you'll wait a LOONG time for a full restoral. If you want single files, ok, but all 20TB?

      Larger 'professional' services like BUMI (I'm not a spokesman, but a user) will overnight you an actual, physical disk if you're restoring a huge amount of data and need it _tout de suite_.

      --
      I am the Lorvax, I speak for the machines.
    24. Re:reduce the amount by Anonymous Coward · · Score: 0

      disingenuous sarcasm never got anyone anywhere

      Mr. Jonathon Swift disagrees with your assertion.

    25. Re:reduce the amount by Grishnakh · · Score: 2

      To be fair, 800MB is still better than your standard DVD rip, which is probably 200-400MB. But yeah, 800MB is still lame; 2GB would probably be a much better copy. It'd also help if he dumped his crappy DivX;-) codec and moved to something more modern.

    26. Re:reduce the amount by WuphonsReach · · Score: 2

      I do that as well, but I found out to my horror that all my DVD's had become unreadable over time. So, probably good idea to test your backups from time to time

      That's why you should reserve 10-15% of the disk for parity data. While DVD-R format has built in error-recovery at the sector level, by the time you figure out that the disk is going bad it is too late. By adding even more recovery data at the file level, you can treat the disk errors as an early-warning system, then use the recovery data files to get your data off the disc.

      The old standard was QuickPAR (PAR3), the new version is called MultiPAR.

      With PAR3, you can even write the disc image to an ISO file, make a copy of that ISO file and rename with the PAR3 extension, then let QuickPAR or MultiPAR use the PAR3 filename to search the ISO filename. Which lets you retrieve data from a disc, even if the file system has become corrupted.

      (About the only thing you can't easily recover from is a "track zero" error where the TOC of the disc has been broken.)

      --
      Wolde you bothe eate your cake, and have your cake?
    27. Re:reduce the amount by Anonymous Coward · · Score: 0

      It really depends on the media manufacturer. I have some burned Ricoh CDs from 1997 that are still going strong, yet some media I burned last week has failed. I just make a lot of copies.

    28. Re:reduce the amount by Anonymous Coward · · Score: 0

      At least you aren't stuck in 2005.

    29. Re:reduce the amount by MrNaz · · Score: 1

      The solution I'd use is a large RAIDZ array, with snapshots. That way you get the redundancy of RAID, with the ability to rollback to past points in the filesystem, if you accidentally delete something. When you run out of space, simply expunge the snapshots until the last known good state, However given how cheap HDDs are these days, I'd plan to have enough space space to keep snapshots going back at least a few weeks. Get one of those old 24 bay 2RU IBM servers from eBay, 6x 4TB drives and set the whole thing up with FreeNAS+RAIDZ. Add drives to the bays as you need extra space, as RAIDZ can be dynamically expanded.

      Problem solved.

      --
      I hate printers.
    30. Re:reduce the amount by PhotoJim · · Score: 1

      It depends on the quality of media that you use, and how well your burner works with that media. You can certainly do a lot better than that, but your numbers are pretty typical for low-cost media burned at maximum burn speeds.

    31. Re:reduce the amount by TangoMargarine · · Score: 1

      Because an 800 MB DVD compression and a 10% quality JPEG are so analogous, yeah.

      --
      Unity? Screw that: XFCE. Slashdot Beta? Screw that: SoylentNews. Australis? Screw that: Pale Moon. UX developers DIAF
    32. Re:reduce the amount by WuphonsReach · · Score: 1

      i'll guess RAID striping eats some space

      RAID-5 uses up 1 disk worth for striping, so net space in an 8-drive array is 7-drives worth (about 27TB using 4TB drives). The problem with RAID-5 is that you are 2 disks away from failure and rebuilds often kill the disks.

      RAID-6 uses 2 disks worth for striping, so net space in an 8-drive array is 6-drives worth (about 23TB using 4TB drives). Is able to survive a double-disk failure before data loss. Still has some of the same issues as RAID-5.

      --
      Wolde you bothe eate your cake, and have your cake?
    33. Re:reduce the amount by Anonymous Coward · · Score: 0

      Wow one of those audiophiles ... I have a 400$ cable to sell . Is gold plated. Vibrancy guaranteed.
      Please don't get me wrong, but nothing will be good enough but the pure raw data for the movie.
      Good luck

    34. Re:reduce the amount by Anonymous Coward · · Score: 0

      Because an MKV container support A LOT more track types than MP4 does. MP4 doesn't even have standard support for subtitles and chapters (although some devices support them as extensions). MKV can hold pretty much anything.

    35. Re:reduce the amount by Anonymous Coward · · Score: 0

      I actually prefer a 640x480 px gif, that way it matches my screen resolution. Next year I plan to upgrade to a 14" 800x600 monitor though, so I'm not sure if maybe I should rethink my strategy,

    36. Re:reduce the amount by Bacon+Bits · · Score: 1

      With a trade-off of about 3-5 times the processing power required to decode. I learned that the hard way when trying to play movies on my old netbook with an Atom N270.

      --
      The road to tyranny has always been paved with claims of necessity.
    37. Re:reduce the amount by bryansj · · Score: 1

      Actually it is pretty close. A DVD is about 8,000 MB and 10% of that is 800 MB.

    38. Re:reduce the amount by Kjella · · Score: 1

      While it eventually only had one unreadable file on one unreadable disc, what I found was that with old discs the drive would spin up and down and finally read it very, very slowly like 30-60 minutes to read a single CD/DVD. So in practice what should have been a one night's job turned into a couple weeks. Never again, now I'm backing up to HDDs and if it spins up at least it'll probably finish in reasonable time. Oh, and you don't have to swap TB drives that often...

      --
      Live today, because you never know what tomorrow brings
    39. Re:reduce the amount by nabsltd · · Score: 2

      20TB is not out of the world. With a RAID of 4TB disks you can cover that at home, and it doesn't need to be on all the time.

      Sure, it's easy to have 20TB of usable disk space (I've got forty 2TB drives spread among 5 servers at my house), but 20TB of "must be backed up because that's the only copy" is a little unbelievable for a home user.

      For example, I have 700 Blu-Ray movies that have been ripped and re-encoded to take about 2TB of disk space. If I had 30-40TB available, I might store the raw Blu-Ray images, but then I don't need backup, as the data is easy to re-create. So, I'm a little skeptical that the "friend" in TFS had 20TB data that he can't re-create by going back to original sources.

    40. Re:reduce the amount by TangoMargarine · · Score: 1

      Actually, a single-sided DVD is ~4800 MiB.

      Comparison by volume, maybe, but quality? (Or maybe that's just my "don't need better quality than DVD" craziness popping up again.)

      --
      Unity? Screw that: XFCE. Slashdot Beta? Screw that: SoylentNews. Australis? Screw that: Pale Moon. UX developers DIAF
    41. Re:reduce the amount by omnichad · · Score: 1

      I think you mean lossless. Or it wouldn't be a very good backup plan.

    42. Re:reduce the amount by neorush · · Score: 1

      Subtitle and multiple audio track support seems to always work....also xbmc really loves these files.

      --
      neorush
    43. Re:reduce the amount by keltor · · Score: 1

      Let's say that the rebuiilds CAN cause additional failures, not that it *often* does. I've done maybe 500-600 RAID5 rebuilds and only had two times that there were additional failures, both with consumer SATA/EIDE drives. Most of the RAID failures I've seen have been due to no monitoring and having failed drives for months or even years.

    44. Re:reduce the amount by keltor · · Score: 1

      I have 30 TB of Movies (Ripped via x264 @ 2.5GB/hr - 1080p) - I have the data stored on my NAS using zfs and I am seeing about a 35% savings. Now that's a LOT of savings to go down to 20TB. My NAS has plenty of space though since I have 45x3TB of disks.

    45. Re:reduce the amount by ncc74656 · · Score: 3, Interesting

      RAID-5 uses up 1 disk worth for striping, so net space in an 8-drive array is 7-drives worth (about 27TB using 4TB drives). The problem with RAID-5 is that you are 2 disks away from failure and rebuilds often kill the disks.

      RAID-6 uses 2 disks worth for striping, so net space in an 8-drive array is 6-drives worth (about 23TB using 4TB drives). Is able to survive a double-disk failure before data loss. Still has some of the same issues as RAID-5.

      I use Greyhole for media and document storage. It handles disks of unequal size (currently running one 3TB and two 1.5TB drives), and you can choose the level of redundancy you need. In my case, movies, TV shows, etc. get a single copy (one file exists on one drive), while documents and photos get two copies (one file exists on two drives). If a drive goes bad, you only lose the files on that drive...and only for the files for which you selected no redundancy. With redundancy, extra file copies are recreated on the remaining drives from the surviving copies; this process is most likely less stressful on the disk set than a RAID rebuild.

      My movies, TV shows, and music are backed up to BD-R, stored in a binder at work. They hold ~20GB each, as I'm using dvdisaster to guard against media errors. When a 2TB drive failed, I brought the backup (currently about 190 discs) home and restored the files that had gone missing. Backup and restore are managed by scripts, with information about what files are on what discs held in a MySQL database that gets periodically backed up off-site as well. The initial backup took several months (on and off) to finish, and the last time I needed to restore, it took about a week, but now I just burn a disc when I have about enough new data to fill one. Burning and verifying takes a few hours, but it's something you can start and walk away.

      --
      20 January 2017: the End of an Error.
    46. Re:reduce the amount by Stormthirst · · Score: 1

      Seriously? How much porn does a need?

    47. Re:reduce the amount by ncc74656 · · Score: 1

      With a trade-off of about 3-5 times the processing power required to decode. I learned that the hard way when trying to play movies on my old netbook with an Atom N270.

      That's why you offload video decoding to the GPU. A weedy Atom 230 (or even the ARM-compatible core in a Raspberry Pi) is more than up to the task of shoveling 1080p H.264 into the GPU, which handles decoding, scaling, etc. Even the lowest-end nVidia or AMD GPUs are more than up to the task. I have OpenELEC running on an Acer Aspire Revo and a Raspberry Pi, and neither have any issues with anything I've thrown at them. (A third box runs on a Core 2 Duo E8400, which would be sufficient for software decoding of just about anything, but a GeForce 210 uses less power (fanless heatsink!) and adds HDMI output.)

      What's that? Your netbook doesn't have a proper GPU? Well, isn't that special? :-P

      --
      20 January 2017: the End of an Error.
    48. Re:reduce the amount by wagnerrp · · Score: 4, Informative

      The problem with RAID-5 is that you are 2 disks away from failure and rebuilds often kill the disks.

      No. The problem with RAID-5 is that during a rebuild, there is a reasonably possible chance you could have a UBE, and lose one bit, making perfect recovery of the array impossible. Only a stupid controller would consider a UBE to be a failed drive and trash the entire array. On RAID-6, you still have the same possibility of a UBE, but the chances that two separate drives would experience one on the same exact block during a rebuild are so astronomically slim as to be irrelevant.

    49. Re:reduce the amount by Anonymous Coward · · Score: 1

      If you're seeing 35% compression on x264 output you need to reduce your bitrate -- you're wasting space. Efficiently encoded video output should be more or less incompressible.

    50. Re:reduce the amount by CastrTroy · · Score: 1

      He's probably one of those guys who hordes pirated movies. No way to recreate from source because he never had the original movies to begin with.

      Really, it's not a hard problem to solve anyway. If you have a 20 GB raid array, all you have to do is set up a second 20 GB raid array, and copy the files over every night. Once in a while (possibly run manually) delete any files that don't exist on the primary location. At most you'll lose a days' worth of data. If both raid arrays fail at the same time, you might be out of luck, but that's a pretty small chance.

      --

      Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
    51. Re:reduce the amount by Anonymous Coward · · Score: 0

      I got a friend with 20TB on one computer. I don't know why or how he decided to do it, but finding, and redownloading it all would take forever. It might have something to do with the fact he's in a group where highend to just plain silly computer are the norm. There are reasons to have that much storage, most of them are just hard to encounter other than as a close to unique case.

    52. Re:reduce the amount by Wdomburg · · Score: 1

      Your definition if "easy to recreate" is different than mine.

      (And really, what is the point of buying Blu-Ray if you're going to transcode it to half (or less) the bitrate of a DVD?)

    53. Re: reduce the amount by guruevi · · Score: 1

      RaidZ/raidz2 is your friend! No hiding failures, throughput is better (over enough disks) than an individual raid controller and all data is checksummed, encrypted, snapshotted etc

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    54. Re:reduce the amount by Anonymous Coward · · Score: 0

      It would just take time to get all of those moves from Netflix again.

    55. Re:reduce the amount by rsborg · · Score: 1

      I wasn't looking for it to go anywhere really other than pointing out the absurdity of saying that taking a bluray rip down to a 800MB divx rip results in just an acceptable loss of quality.

      I'm by no means a audio/videophile snob, but you either have a blind and/or deaf if you can't see a MAJOR quality deficiency with a 800 bluray rip. What's the point of having a bluray movie if the first thing you normally do is make it look like crap?

      No your analogy was horribly flawed and hyperbolic. Movies are never re-encoded if you're just interested in watching them and all your devices support the format (like JeffSH, I also encode, but to an apple friendly h.264 of a decent bitrate). If you really want the quality for a given movie, make an exception and upload the blue ray image.

      Photos however, often require post processing, and are often resized to fit smaller form factors (i.e., avatar pics, MMS sends, uploaded to get printed). A larger "reference" does make sense (again, 30MB is ridiculous - not everyone has a digital medium-format camera).

      --
      Make sure everyone's vote counts: Verified Voting
    56. Re:reduce the amount by mythosaz · · Score: 1

      The OP made clear he can reference his original media if he needs higher quality.

      DivX might not be cutting edge, but there's a huge number of devices that'll play DivX files and won't think about touching a "current" format - most notably DVD players and "last gen" gaming systems.

    57. Re:reduce the amount by rnswebx · · Score: 1

      you must be new here.

    58. Re:reduce the amount by Anonymous Coward · · Score: 0

      For my DVDs I just rip the native files. Takes up more space, but they're only stored on a single HDD. I still have the DVDs as backup. Any DVDs that subsequently don't work I can either replace or not. It's easy enough to transcode the ripped files to another format if, for example, I want to watch any of them on my Nexus 7 in bed. When VLC on Android can handle DVD files more smoothly, I'll just use the native files.

    59. Re:reduce the amount by Anonymous Coward · · Score: 0

      Depends on what kind of screen you have. I still use a circa-2005 848x480 plasma EDTV for my primary TV because, despite having wired ethernet to my PS3 and services like YouTube hosting HD content, it doesn't always actually look HD on a 720p or 1080p screen. You got a fancy 4K tv? Everything's gonna look like shit.

      Meanwhile, sitting 8 feet back, even the most dismal YouTube video actually looks super good. 720p content scaled down to 480p looks awesome. 800mb divx rips look awesome. DVDs look fantastic. I'm really dreading having to buy something current when this thing dies.

    60. Re:reduce the amount by carnivore302 · · Score: 1

      the friend can surely find 90% of his lost stuff, but it will take him a lot of time to download it again. Also, if he's like me, he can't remember half of what he had.

      Then there's his wife who's screaming at him about the loss of the "home videos" (no honey, they're really gone, they're not at somebody else's computer now)

      Yeah, I guess the price of an extra 20TB raid array doesn't seem so high now anymore...

      --
      Please login to access my lawn
    61. Re:reduce the amount by dinfinity · · Score: 1

      800MB is still better than your standard DVD rip, which is probably 200-400MB

      Are you trying to fool intelligence agencies by proving that you know absolutely nothing about this subject?

      I don't think I've ever seen a DVDrip that isn't about CD-sized (or multiples thereof).

    62. Re:reduce the amount by Anonymous Coward · · Score: 0

      Let's face it, you aren't CERN

      He might not be CERN, but he could be GEHRIN or it's successor organization NERV. Backing up the MAGI is tuff stuff mang!

    63. Re:reduce the amount by Anonymous Coward · · Score: 0

      It is fine on a computer monitor but if you have large TV 800 MB won't do it.

    64. Re:reduce the amount by Anonymous Coward · · Score: 0

      Also, if he's like me, he can't remember half of what he had.

      If you can't remember you had it, doesn't that imply that it isn't worth worrying about?

    65. Re:reduce the amount by edxwelch · · Score: 1

      CD-Rs last for a *lot* longer than DVDs, but they are impractical because of capacity

    66. Re:reduce the amount by psymastr · · Score: 1

      Maybe he doesn't care about the quality loss; many people don't. Who are you to tell them what they should like?

      --
      Improve at backgammon rapidly through addictive quickfire position quizzes: www.bgtrain.com
    67. Re:reduce the amount by edxwelch · · Score: 1

      I suspect that that won't work, because the entire disk become unreadable by the OS. I mean you can't even see the list of files

    68. Re:reduce the amount by Anonymous Coward · · Score: 0

      I'm a little skeptical that the "friend" in TFS had 20TB data that he can't re-create by going back to original sources.

      I'm not, because it isn't a matter of being unable to go back to the source. I've only got a couple hundred gigs of MP3s, but I'd still be kicking myself if I had to re-rip from the source CDs. Twice was enough for me.

    69. Re:reduce the amount by SJHillman · · Score: 1

      Source? Everything I've seen lumps DVD-Rs and CD-Rs together in terms of longevity.

    70. Re:reduce the amount by Kjella · · Score: 1

      A few years back I was visiting a friend in the boonies in Egypt (...) she couldn't hope to download everything from me in her government's lifetime.

      I think that says more about their politics than their broadband...

      --
      Live today, because you never know what tomorrow brings
    71. Re:reduce the amount by chris_mahan · · Score: 1

      Maybe it's cheaper to buy the blu-rays than to archive them...

      --

      "Piter, too, is dead."

    72. Re:reduce the amount by michelcolman · · Score: 1

      CD-R/DVD-R has a shelf life of about 5 to 10 years. Beyond that, your data will probably still be readable but it's going to become hit-or-miss. Factory-stamped DVDs do have a much longer shelf life, but that's not really relevant here.

      Just have a factory stamp out your backups then.

    73. Re:reduce the amount by edxwelch · · Score: 1

      by my own observation and I've heard it mentioned a lot on forums

    74. Re:reduce the amount by DigiShaman · · Score: 2

      I've never ran into a UBE in a RAID5. But then again, that's because I've always used Dell or HP machines that have Patrol Reading on the RAID card. It's proactive checking/correcting when the volumes is at a minimal use. It's frequency of checking is also a user changeable option is most cases.

      No. The problem with RAID5 is that should one drive fail, and second drive could fail while in the rebuild process. That's because rebuilding stresses all the remaining drives. If one of them in the array is already on the cusp of failing, a rebuild could push it over the edge. Thus losing the entire volume. RAID6 is a waste in that it's N-2 drives of capacity lost. But when you have to be sure (and damned sure) of reliability and availability, it's worth looking into. Exceedingly so for a SAN.

      --
      Life is not for the lazy.
    75. Re:reduce the amount by Skynyrd · · Score: 1

      I think that's the whole point; he doesn't have the original media.

      Do you think it was "lost in a fire" or "washed away in a flood"?

    76. Re:reduce the amount by maeka · · Score: 1

      There is an increasingly large number of devices (just about every lower power device of the last 5-7 years) which will play h.264 and not divx. Since the era of the iPod 5th gen ("iPod Video") the trend has been for h.264 hardware decoders paired to relatively weak CPUs.

    77. Re:reduce the amount by mythosaz · · Score: 1

      I've got about 2TB of movies - mostly from a vast physical collection - and that's at a tiny ~1GB per movie.

      At 4-5GB/movie, I'd have the same backup problem the GP has.

    78. Re:reduce the amount by wagnerrp · · Score: 1

      That's why you buy drives in different batches from different retailers, and cycle them in at different times, to reduce your odds that any two drives will be at a similar wear level and fail simultaneously.

      The FUD that was raised a few years ago about "the death of RAID5" was because most consumer drives were rated at a mean time between bit errors of more than 10^14 bits, or 12.5TB. When re-silvering a multi-TB array, it was becoming statistically likely that you might run into one of these, which on RAID5 means a loss of data. Loss of any data is a bad thing, but the FUD from these articles was that it would result in a loss of all data. The "RED" drives you see on the market are no different from standard hard drives, except their firmware will give up trying to recover such a read error after just a few seconds, before low end RAID systems have a chance to kick the drive out due to unresponsiveness.

    79. Re:reduce the amount by nemesisrocks · · Score: 2

      Regardless of whether or not 20TB is hording / excessive / inefficient, what it almost certainly is is replaceable. Let's face it, you aren't CERN, most of you data is probably media that you can reacquire with relative ease. It's not being stored because it's irreplaceable it's being stored because it's convenient.

      There's a large, flawed, assumption running through this thread that it's "easy" to reacquire all media.

      Between torrents and usenet, trying to find good rips of content that's over 6-12 months old is often impossible. Good luck trying to find any older show that's not sci-fi or super popular. Often studios don't sell DVDs or Blurays of these shows anymore, if they even did to start with.

    80. Re:reduce the amount by DigiShaman · · Score: 1

      The "RED" drives you see on the market are no different from standard hard drives, except their firmware will give up trying to recover such a read error after just a few seconds, before low end RAID systems have a chance to kick the drive out due to unresponsiveness.

      It's called TLER. And fuck WD for segmenting this market. On some of their older drives (raptors), there used to be firmware setting to enable/disable TLER. I'm pretty sure Hiatchi (IBM) drives still makes this user selectable. Yes, WD is on my personal shit-list of drives not to buy.

      --
      Life is not for the lazy.
    81. Re:reduce the amount by WuphonsReach · · Score: 1

      That's the dreaded TOC error. There are (far more) expensive drives that will ignore TOC errors and let you dump the disk to an ISO file. But for us mere mortals a TOC error means you are well and truly hosed.

      But for any situation where you are dealing with the far more common errors that the sector-level ECC can't deal with, the recovery data files give you the option to recover everything.

      --
      Wolde you bothe eate your cake, and have your cake?
    82. Re:reduce the amount by Anonymous Coward · · Score: 0

      The submitter is clearly an idiot. 20TB could be backed up on five 4TB drives. You certainly could just plug in some externals and make back ups.

    83. Re:reduce the amount by dbIII · · Score: 1

      After various scary hassles with RAID6 just being good enough in a failure, with about a week of slow performance to be sure, I ditched the hardware RAID idea and went for ZFS. Some hardware RAID is a bit of a pain to deal with when things go wrong.
      If the disks are not completely full then recovery is a lot faster than hardware RAID - however the natural state of disks is almost full :)

    84. Re:reduce the amount by dbIII · · Score: 1

      It's just a container. The compatibility problems are with what is inside.

    85. Re:reduce the amount by thegarbz · · Score: 1

      Erm no a container is just another part of the chain that needs to be supported. If you don't know how to read the container how is that in any way the fault of the encoded media?

    86. Re:reduce the amount by squeeze69 · · Score: 1

      Uooopsssss....... HUGE LAPSUS ALERT... Sorry... yes, I meant lossless... :-) If you like, give also packARC a try (p.s. the author also provides linux building scripts).

    87. Re:reduce the amount by Neil+Boekend · · Score: 1

      Don't compress a DVD by volume. It won't fit in a normal player anymore.

      --
      Well, I might have a way, but it only works on a semi spherical planet in a vacuum.
    88. Re:reduce the amount by Anonymous Coward · · Score: 0

      I'm not sure how you think replacing videos of kids growing up is easily replaceable

    89. Re:reduce the amount by Anonymous Coward · · Score: 0

      You didn't read the article. His friend had a 20TB RAID. Even if you used another 20TB RAID, it would be difficult to offsite it (moving parts aren't 99.999999999% reliable).

    90. Re:reduce the amount by Anonymous Coward · · Score: 0

      CD-R/DVD-R has a much lower shelf life than 10 years; its more like 3 years. If you get the DVD's pressed at a specialized factory, yes, then you get 10+ years of life. Consumer DVD-Rs are etched on dye using heat. I make it a point to burn more DVDs and use PAR2 error correction when archiving data that way.

    91. Re:reduce the amount by Radak · · Score: 1

      Aww, you think the original poster's "friend" actually paid for all of his 20 TB of data and therefore has original media? How adorable.

    92. Re:reduce the amount by dbIII · · Score: 1

      Yes but it looks like the above poster can get SOME stuff out of the container, so your point is not relevant.

    93. Re:reduce the amount by Anonymous Coward · · Score: 0

      Ummm, RAID-10? (and I do mean 1+0, not 0+1)

      If you _really_ don't want to lose your data you won't mind wasting half your capacity for fault tolerance. Rebuilding is simply copying stripes from the other side, so rebuilding is faster and doesn't stress your controller. You can even keep on working at full speed when you lose one drive, and still have most of your performance when the controller is chewing on the new drive.

      My main server runs on 8 SAS drives in RAID-10: I can lose up to 4 drives if I'm extremely lucky, which is a lot better than being screwed automatically if I lose more than one. Full backup to a RAID NAS nearby, and critical data replication (with encryption) to a NAS at my friend's place. Likewise, I keep an encrypted backup of his stuff so we're both happy and safe.

      That being said, I don't count my movie and music archive as critical data... Photos, documents, digital memories, geek projects, password store, e-mail archive and other unique things is more like what I really wouldn't want to lose.

    94. Re:reduce the amount by redlemming · · Score: 2

      Regardless of whether or not 20TB is hording / excessive / inefficient, what it almost certainly is is replaceable.

      There are huge amounts of material that is very hard to replace, and often irreplaceable.

      For example, many instructional and educational DVDs produced in the last few decades for various subjects are completely out of print. Typically these are home-brewed DVDs that weren't professionally pressed. These have a fairly high rate of failure, which means every one has to be backed up. In some cases, various political and legal issues may prevent re-issue, even of professionally pressed disks.

      Similarly, many VHS tapes are completely out of print, and either impossible to acquire, or only available at very high prices. For example, I would love to get a copy of "Out of the Fiery Furnace", even at VHS quality, but good luck finding that. Even when the tapes are available, there's always a question of quality: there's a big difference between converting a pristine VHS tape to digital form, and one that has been played too many times.

      Then there's the issue of time spent organizing and editing video material to make it useful.

      For example, effective use of instructional material requires random access to it. The simplest and most reliable way to do this is by extracting clips in full (as an exercise of fair use rights), as bookmark mechanisms tend to be quite primitive and incompatible between programs. It's often necessary to use multiple video processing programs when working with video, as each of them has it's quirks and limitations, so this is pretty important.

      Another consideration: many videos edited by amateurs, and even some done by professionals, have serious problems (such as background noise) that need to be edited out (again, as an exercise of fair use rights) to make the video usable. Removing noise is not a linear process, and can require many hours of manual experimentation. Since techniques are likely to improve over time, it's important to keep the raw (unprocessed material) around.

      Further, high compression video algorithms don't work well when one needs to be able to play things frame by frame or at reduced frame rates, without loss of detail, and keeping the audio comprehensible, in order to understand what is being demonstrated. This means high quality must be maintained in these clips. This effectively doubles the space needed to store this kind of material, as both the raw and processed data must be stored using relatively poor compression techniques.

      Loss of the data, of course, means all the editing time is lost, and that's a lot of hours of human time.

      Far more sensible than to take the risk of losing all this time is to keep both the raw and edited copies, but then you have to deal with backup issues for huge amounts of data. The best approach at present is to have at least two sets of disks, each with copies of everything. Having a third set off-site would be even better.

    95. Re:reduce the amount by Wolfrider · · Score: 1

      > RAIDZ can be dynamically expanded

      --To the best of my knowledge, not so much. You can create a pool of mirrors and expand that, but expanding a RAIDZ (ideally) should be done with the same number (and capacity) of drives that was in the original RAIDZ - for balancing purposes. Otherwise you get weird errors and possible performance impact. Building asymmetrical pools is fine in a VM, but on bare-metal you kinda have to start to question what to do if there's data loss.

      http://serverfault.com/questio...

      --You can expand the underlying disks in a pool, but it's a PITA and requires repetitive resilvers.

      http://jsosic.wordpress.com/20...

      http://www.itsacon.net/compute...

      --Honestly, adding a 4-port SATA card to an existing system and using all-new drives is probably the best bet for expanding existing storage. You can buy 2-4TB drives depending on budget, copy the data over, and repurpose the existing pool of old/smaller drives until the HW starts failing.

      PROTIP: With newer drives (4k sectors) you're better off setting the ASHIFT to 12 on your ZFS pool right off the bat. Will save you trouble later -- I speak from experience.

      https://www.icts.uiowa.edu/con...

      / Btrfs has some promising features, but practically I would give it another ~2 years to get to production-ready as of this writing. Just my $2.02

      --
      .
      == WolfriderV6 == I'm willing to admit that *I just might* be wrong... Are you??
    96. Re:reduce the amount by dissy · · Score: 1

      What's the point of having a bluray movie if the first thing you normally do is make it look like crap?

      I've been asking that same question, but I think Viacom blocked my email addresses by now :{

    97. Re:reduce the amount by Anonymous Coward · · Score: 0

      Seriously? How much porn does a need?

      Seriously? How many nouns does a need?

    98. Re:reduce the amount by nabsltd · · Score: 1

      (And really, what is the point of buying Blu-Ray if you're going to transcode it to half (or less) the bitrate of a DVD?)

      Take a look at the quality of output from:

      x264 --preset slower --tune film --crf 20

      You'd be amazed at how many of the bits on the average Blu-Ray are wasted. What's especially noticeable is that most Blu-Ray encodes are essentially fixed bitrate...it's not unusual to see credits running at 20Mbps. OTOH, there are movies like Lord of the Rings or The Rock, where 4GB/hour is required to keep up the quality due to the amount of action. Or, there are grainy movies, like Fast Times at Ridgemont High or 300. All of these automatically get the bitrate they need because of the "crf" option. These are balanced by movies like The Sixth Sense, with minutes long scenes where nothing moves but the actor's mouth.

  3. Crashplan by rossjudson · · Score: 5, Informative

    Crashplan has unlimited storage. I use their home plan; it's unlimited for up to 10 machines. I think I am backing up about 6TB there now.

    1. Re:Crashplan by Mr.+Competence · · Score: 0

      +1

      --
      Those who open their minds too far often let their brains fall out.
    2. Re:Crashplan by funwithBSD · · Score: 0

      +1

      --
      Never answer an anonymous letter. - Yogi Berra
    3. Re:Crashplan by angularbanjo · · Score: 1

      Anything other than an NSA-approved offsite backup?

    4. Re:Crashplan by genghisjahn · · Score: 2

      You can backup(encrypted) to a friends computer and not use CP stuff at all. Check out the Local and Offsite plan. https://www.code42.com/store/

      --
      Sorry about the mess.
    5. Re:Crashplan by Pichu0102 · · Score: 1

      I agree with this. In addition, you can also backup to local folders, and have different backup sets so the really big stuff will be backed up online, but the smaller, more important things can be backed up both to a folder and online. That, and they let you control frequency of backups, and never delete anything unless you set it to remove deleted files after whatever period of time you say. Lord knows how many TB I have backed up there that is just deleted files and their daily versions.

    6. Re:Crashplan by tippe · · Score: 1

      Interesting. I just checked them out (http://www.code42.com/crashplan/) and the home service appears to be free. What's the catch? I noticed that the home plan uses weaker encryption (128-bit vs 448-bit), but is that all?

    7. Re:Crashplan by phalangion · · Score: 2

      They allow you to create your own 448-bit key. When you use your own key, Crashplan is unable to assist with the recovery, which implies that they do not have access to this key.

    8. Re:Crashplan by Anonymous Coward · · Score: 0

      The free service only lets you back up to a friend's computer, or to an external hard disk.

      You have to pay if you want to use their cloud storage, but it is unlimited.

    9. Re: Crashplan by Anonymous Coward · · Score: 5, Interesting

      Crashplan offers unlimited storage, yes, but they limit it indirectly by slowing down uploads.

      I recently paid for a crashplan account to back up ~6TB of media, and at the speeds I'm seeing the initial backup is going to take more than a year. I have 100Mbit/s fiber at my home and can max it easily with other services.

      So for 20TB, it's going to take many years to back up. I don't think that's a practical backup solution. There's a decent chance you're going to lose your data before the initial backup completes. And if crashplan goes under, you have to start all over again with the next "unlimited except for rate" provider, and have no backup in the meantime.

    10. Re:Crashplan by Anonymous Coward · · Score: 2, Insightful

      I would agree with CrashPlan, although I have never ever ever seen it upload faster than 3Mb/s. Uploads are slower than hell, but it does indeed work.

    11. Re: Crashplan by MightyYar · · Score: 2

      While I've never had the throttling issue that you have, I do want to point out that they accept seed drives if your initial backup is taking a long time.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    12. Re:Crashplan by Anonymous Coward · · Score: 0

      The client is free, and you can use it to back up to a local drive or anyone else running crashplan that you exchange friend codes with. You have to pay if you want to use crashplan's cloud storage.

    13. Re:Crashplan by K.+S.+Kyosuke · · Score: 2

      Wait, before I lose count...6+1+1...this means you're at 8TB, right? Or that you're at 12 machines?

      --
      Ezekiel 23:20
    14. Re:Crashplan by blueg3 · · Score: 1

      If you're backing up a collection of movies and music, what does it matter?

      You want cheap backup and information security? This is one case where encryption is actually magic fairy dust that will solve your problem.

    15. Re:Crashplan by Anubis+IV · · Score: 1

      No catch that I'm aware of, and the data is being stored by your friend, so you should have less worry about some antagonist trying to brute force it.

      Also, if you sign up for one of their plans and cancel, they actually are good enough that they refund a prorated amount, so the cost for trying out one of their for-pay offerings and then deciding you don't like it is very minimal. The company I'm with has been using them at work without complaint, and I started using them at home a few years back, also without complaint. And they come in way cheaper than what I was using for offsite backup previously, despite the fact that I'm backing up not only my internal drives, but my external ones as well.

    16. Re:Crashplan by Anonymous Coward · · Score: 0

      It looks like the free version doesn't back up to the cloud, just to "other trusted computers." For the online service, it's $60/year and only backs up one "computer."

      Not sure what a "computer" is defined as to them. I typically use a single computer, but it has 3 hard drives and have about 10 more running in my house that I access via network shares. Perhaps the program doesn't allow backing up of non-local data?

      Personally the cloud storage doesn't sound like it's for me. I'd rather be in charge of encrypting the data and selectively backing it up, not having every piece of data that I ever have shipped off to someone else, which it sounds like this does.

    17. Re:Crashplan by Anonymous Coward · · Score: 0

      Their software is free for you to use to backup from one place to another that is within your control (or a friend/family member's). But that just means your stuff is backed up on a friend's machine and if you lose something, you can dig it up there. It doesn't include any backups to their datacenter, which is what you pay the subscription for.

      I use Crashplan for about 5TB of data on my server and another TB or so from a few other machines. It took several weeks to get the first backup done, but now it does realtime backup of individual files as they change, so the weekly reports they give me indicate a low overall throughput as most of my repository doesn't change. I can't speak highly enough of this service. The fact that it has clients for Windows, Mac and Linux as well as supports external drives is just amazing.

    18. Re:Crashplan by greywire · · Score: 1

      The home plan backs up between computers or other local storage.

      To use cloud storage you need to pay.

      Still, this is interesting...

      --
      -- Senior Software Engineer, Attorney appearance services, locallawyerapp.com.
    19. Re: Crashplan by kenh · · Score: 1

      It arguably took him years to create test 20 TB collection, if he updated his cloud service backup as he added Giles, it would likely never be more than a day or so behind his main library...

      --
      Ken
    20. Re:Crashplan by Anonymous Coward · · Score: 0

      The main problem I have with storing your backup data in the cloud is that it is available to someone else, especially the government. If you remember ripping a DVD or even a Blu-ray is illegal, even for archival purposes. Some may say fair use but you are still breaking encryption to do it.

      Now I have lots of DVD's ripped but I say less than 1Tb. I use a pogoplug to store these images for easy access to media players around my house. Even though my pogoplug is in my house on my own personal network, the drive that is plugged into the device is still accessible by pogoplug the company. I worry about it even though I still have the plastic shiny disk in a box in the corner of the basement.

      20Tb of ISO images of DVD's and Blu-ray is a $#!T ton of ISO's. It seems to me if someone wanted to start poking around looking for illegal activity, that would be an easy place to start. It is kind of like submitting a tax return to the IRS that has a ton of red flags. You are the first one to be audited, even though you might be completely legit.

      I would never store that much data outside of my personal domain. But that is just me.

    21. Re:Crashplan by Anonymous Coward · · Score: 0

      I havn't used it my self but you should be able to backup to a friend, so if you can put a same size storage at a second location you should be able to do that.
      I figure you should start by attaching have the two mashines connected to the same network to make the initial backup. Crashplan is not that fast, my 250GB took about 2weeks to backup the first time.

    22. Re:Crashplan by operagost · · Score: 1

      20TB? Probably need a lot of friends... and we're talking geeks here.

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    23. Re:Crashplan by Anonymous Coward · · Score: 0

      The catch is the free service doesn't give you any storage - it's local only!

    24. Re:Crashplan by genghisjahn · · Score: 1

      So you want free-multiTB-offsite-encrypted backup that completes in an hour? Backup is hard to do. I'm not sure what you're expecting.

      --
      Sorry about the mess.
    25. Re:Crashplan by Skreems · · Score: 2

      Watch out for Crashplan in certain cases. I'm currently trying to restore a 50GB file, and it keeps restarting the download halfway through. Their support is useless, basically leaving it at "sorry dude, nothing we can do". For other files it's been good, but their testing and support of edge cases doesn't seem especially solid.

      --
      Slashdot needs a "-1, Wrong" moderation option.
      The Urban Hippie
    26. Re: Crashplan by Applehu+Akbar · · Score: 1

      This is exactly what happened to me when I signed up for it: my 1T disk crashed at the 52% point in the initial big backup.

    27. Re:Crashplan by Anonymous Coward · · Score: 0

      The free plan is a program that lets you backup to other computers you or your friends contol, and to external harddrives. The paid plans give you online backup.

    28. Re:Crashplan by Ravaldy · · Score: 1

      I use this solution but who pays for the 20TB of storage. As far as I know 4TB @ $150 X 5 = $600.00

      And watch out for the bandwidth throttling and overage if applicable. I know my connection gets throttled after I hit the 150GB mark per month.

    29. Re:Crashplan by MrNaz · · Score: 2

      It implies nothing of the sort. I implies that they *pretend* to not have the key.

      I'm not saying anything about Crashplan, I'm just pointing out the logical fallacy in saying "They don't help you, therefore they can't help you."

      --
      I hate printers.
    30. Re:Crashplan by operagost · · Score: 1

      It was a joke...

      The whole idea of an easy and cheap backup for an edge case like 20 TB of home date is a joke.

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
    31. Re:Crashplan by CKW · · Score: 1

      I don't know what page you are looking at, but to use their cloud storage it's $4 to $9 a month (individual/family). Their "free" offering clearly says "backup to other computers and external drives, free" -- maybe they give you a demo period of cloud storage, but clearly the free is not cloud based.

    32. Re:Crashplan by Anonymous Coward · · Score: 0

      2014: Using "the cloud" for any reason
      You may as well just print out copies of all your data and glue it to the walls in public places for everyone to see.
      Also, backing up terabytes of data over the internet.

      Crashplan shill, please leave.

    33. Re:Crashplan by Anonymous Coward · · Score: 0

      The "Free" version lets you backup to another device (external drive) or to a friends computer (e.g. across town), not to their cloud.

      If you have a friend/relative you trust who has lots of storage space and no bandwidth cap (or a very large one), then you can backup to one another for free.

    34. Re:Crashplan by genghisjahn · · Score: 1

      Got ya. (Puts down flame thrower).

      --
      Sorry about the mess.
    35. Re:Crashplan by Anonymous Coward · · Score: 0

      What ISP do you use that will even allow you to upload 6TB of data without blowing their caps?

      Hell, my 20MB down, slow-as-shit-up, would literally take 2 years to upload that much. Slower than I'm adding new space.

    36. Re:Crashplan by Anonymous Coward · · Score: 0

      It's free if you host the backup. If you want cloud backup that they host then you pay.
      So it's pretty good as you can use the software to have a local backup and a remote one.

      I have 2 gripes with crashplan at this time:
      1) the backup service seems to leak memory and reaches 500MB on my machine.
      2) if you kill the backup service I don't see any automated or manual way to restart it.

      Still, I would say I like crashplan better than carbonite as the initial backup seemed faster 350GB in about 2 weeks (including times when the machine was off by mistake from the user.)

    37. Re: Crashplan by amaiman · · Score: 1

      The CrashPlan seed drive is limited to a single 1TB drive and you can't add a seed to an existing backup without erasing what's already stored with them. After that, you're subject to the capped upload. (Technically not "capped" according to them, but they divide their total available bandwidth by the number of users and limit you to only your share, so it's effectively capped.)

    38. Re:Crashplan by Anonymous Coward · · Score: 0

      You can NEVER trust an external backup business to:
      a - not look at or give away your data
      b - keep your data intact and available in the first place

      Your best home backup solution is to:
      a - put all your data on two sets of storage array hardware in (b)
      b - run freebsd + geli + zpool raidzN + zfs sha256 on it
      c - run zfs snapshot and zfs dump between them, zpool scrub once a month
      d - unplug the secondary, take it to your friends house and run backups remotely over the net, now you're nature and almost bombproof
      e - take your friends secondary to your house, now you both have double the pirated content :)

    39. Re:Crashplan by suutar · · Score: 1

      I also use crashplan. Backblaze is also a flat-rate-for-whatever option.

    40. Re:Crashplan by kehren77 · · Score: 1

      Agreed. I'm backing up almost 1TB (most of which is my music/movie library). The only downside was the amount of time that first backup took. Almost 2 months on my crappy Charter Cable connection got my to 80%, switched to a fiber connection and it finished in 2 days.

    41. Re: Crashplan by ILongForDarkness · · Score: 1

      Not to mention not everywhere in the world has unlimited transfer limits on their internet. Example most ISPs in Canada have a 20-250GB/month cap counting both downloads and uploads. Afterwards you pay about another $100 a month before the penalties stop. Even if they don't directly penalize you you likely would be paying for a higher internet package/getting traffic shaped because of your high usage. So your free/cheap backup will likely cost you several hundred in penalties to backup and several hundred more should you need to restore making it almost as cost effective to just have a second 20TB array.

      I get people like to do this but the easier solution for 99% of people is just to get some counseling for their hording behaviour. How likely are you t watch that S3E04 Gilmore Girls episode again? I went to university with a guy back in 2000 when storage was relatively expensive that had 1000's of anime movies. He hadn't watched most of them and didn't really intend too. He downloaded them and then if anyone recommended one of them to him he would watch it. We was downloading: just in case one day he might have a reason to be interested.

    42. Re:Crashplan by suutar · · Score: 1

      This page has their three plans. One is free, but it assumes your backup locations are not Crashplan's systems (friends, family, other computers you control). That's probably the catch you're looking for; you have to come up with the backup location yourself. Being able to back up one system to CP's servers (unlimited quantity) is on the order of 5 a month; being able to back up 2-10 systems is on the order of 15.

      You may also consider the encryption options to be a catch; as you note, the free plan has less encryption. On the other hand, the data is being stored somewhere you presumably trust. The plans that save to CP's systems have a couple of key management options, and in one they don't get the key (barring sneakiness, but if you're that worried about it you're likely going to rig it so what they're backing up is encrypted already).

      Personally, I back up one system (my file server) to Crashplan, and most of my other ones to the file server. NOTE! This does not mean my other systems are getting backed up to Crashplan indirectly! If I want it backed up to Crashplan I have to copy it to the fileserver as a regular file. It just means that stuff on my laptop that I don't think is important enough to go offsite still gets automatically backed up to a different disk.

    43. Re:Crashplan by suutar · · Score: 2

      And even if you don't trust them to not grab that key anyway, nothing says the files you feed it have to be cleartext.

    44. Re: Crashplan by Anonymous Coward · · Score: 0

      Crashplan does not actually slow your uploads based on how much you upload intentionally. Uploads slow down as they look to eliminate files already uploaded. You can actually disable this option somewhere in the settings, but I don't remember where.

    45. Re: Crashplan by psydeshow · · Score: 1

      I'm really surprised CrashPlan hasn't added a premium feature (like Amazon S3 has) where you can ship them a hard drive for import or export into their storage cloud.

      I mean, never mind the upload speed. If you have 1TB of data in CrashPlan and your home or office burns down, it's going to take you several days to get all of it downloaded again. They should just be able to FexEx you a drive for a $50-$100 fee.

    46. Re:Crashplan by Stormthirst · · Score: 1

      I don't have an ISP that has a bandwidth cap.

    47. Re:Crashplan by afidel · · Score: 1

      You can do the initial seed as a local backup and then take that offsite to the friends location, that's what I did with my brothers, we traded HDD's around to seed the backups to each other.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    48. Re:Crashplan by nblender · · Score: 1

      I use crashplan as well... Free plan; backing up to my own server... I've had occasion to use their tech-support on about 3 different issues and their tech-support is Top Notch; especially since I'm not even paying them money. I get a response to my issues within 24 hours and back/forth cycles are generally faster... I'm pleased enough that I'm considering paying them actual money.

      The only concern I have with backing up to my own servers using crashplan software is if they go out of business, I can not access my backups because the software requires a connection to their service, even if the disks are in your basement...

    49. Re: Crashplan by Anonymous Coward · · Score: 0

      we experienced the same throttling issues as well...

    50. Re: Crashplan by Brian+Puccio · · Score: 1

      Crashplan offers unlimited storage, yes, but they limit it indirectly by slowing down uploads.

      I recently paid for a crashplan account to back up ~6TB of media, and at the speeds I'm seeing the initial backup is going to take more than a year. I have 100Mbit/s fiber at my home and can max it easily with other services.

      So for 20TB, it's going to take many years to back up. I don't think that's a practical backup solution. There's a decent chance you're going to lose your data before the initial backup completes. And if crashplan goes under, you have to start all over again with the next "unlimited except for rate" provider, and have no backup in the meantime.

      Did your upload start fast and then slow down? That's not network throttling that you're running into. I ran into as well as did "network rockstar": http://networkrockstar.ca/2013... I made the change described there and my upload speeds went back to normal, as "network rockstar" mentions here: http://networkrockstar.ca/2013... Have you tried those changes yourself?

    51. Re: Crashplan by Anonymous Coward · · Score: 0

      Amazon Glaicer is another cheap cloud storage option. (http://aws.amazon.com/glacier/pricing/)

    52. Re:Crashplan by servant · · Score: 1
      I use and love Crashplan. But I would probably use Backblaze instead if they had a Linux client. Yep, all their machines run Linux but they don't backup their own machines on their own network I guess. Or at least they don't want to make it public.

      My bandwidth is so limited, that network based backups are problematic except for my most 'critical' things. I have 3T of ripped material, I just back it up to another 3T drive, and plan on replacing/adding drives to the rotation every couple of years, eventually retiring some drives (hopefully before they die of old age).

      --
      ... "When you pry the source from my cold dead hands."
    53. Re:Crashplan by Reziac · · Score: 1

      Their server doesn't support resuming?? or is it specific to their client software, if any??

      Regardless, that sucks. :(

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    54. Re:Crashplan by Skreems · · Score: 1

      I restored something like 350GB in smaller files, and that all went smoothly. It seems that it generally DOES support resuming. But for some reason, the 50GB file just keeps restarting in the middle. My guess is actually something weird about the FS interaction or the output buffers they're using, but who knows for sure. It's taken about 2 weeks of back and forth on a support ticket to get them to even _mention_ that they might eventually need to hand this off to Tier 2 support. Everything else about the service has been pretty good, but this is a pretty big downer.

      --
      Slashdot needs a "-1, Wrong" moderation option.
      The Urban Hippie
    55. Re: Crashplan by tbuskey · · Score: 1

      You realize that you can have a local crashplan storage too. Have Crashplan go to a 2nd set of 20 TB drives in addition to the cloud.

      The local crashplan is for "oops!". The cloud is for the house burned down...

    56. Re:Crashplan by Reziac · · Score: 1

      Do you have to use their client to download the file, or can you use something like Getright that can restart wherever it left off??

      Kinda sounds like outsourced support, tho (or emulating it very well), which is never a good thing. Escalate? whuzzat??

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    57. Re:Crashplan by Skreems · · Score: 1

      Has to be their client :-(

      --
      Slashdot needs a "-1, Wrong" moderation option.
      The Urban Hippie
    58. Re:Crashplan by Reziac · · Score: 1

      Bah humbug. Well, no crashplan for me.

      Probably best off to just hurl disk images at my web hosting**, since I have unlimited space and bandwidth. Well, some year when *I* actually have upstream speed worth noticing. :(

      Shameless affiliate link:
      http://www.1and1.com/?k_id=676...
      10 years now and still happy as hell with 'em.

      --
      ~REZ~ #43301. Who'd fake being me anyway?
  4. Another RAID? by Anonymous Coward · · Score: 0

    Why buy one array when you can have two at twice the price?

    In related news: rm -ra * should be used with caution. ;)

    1. Re:Another RAID? by devman · · Score: 1

      In my brief search I wasn't able to find a version of 'rm' that accepted a '-a' option.

    2. Re:Another RAID? by jones_supa · · Score: 3, Funny

      That's why it is supposed to be used with caution, as no 'rm' supports it. ;)

    3. Re:Another RAID? by cheesybagel · · Score: 1

      I did 'rm -rf /' once years back. It did not delete the entire filesystem. It stopped when it erased '/lib/libc.so'.

    4. Re:Another RAID? by amorsen · · Score: 1

      It stopped when it erased '/lib/libc.so'.

      Why? I can use rm to remove /lib/libc.so. The problems do not start until after rm is finished...

      --
      Finally! A year of moderation! Ready for 2019?
    5. Re:Another RAID? by Anonymous Coward · · Score: 0

      Are you sure about that? Because that is not supposed to happen on any *nix system that I know of. Files that are in use can be deleted from the directory, but the file descriptors that already have the file open will keep having access to the file. Only when the last file descriptor is closed will the space be marked as free.

      Now, if you were using some kind of "shred" program which overwrites the files before deleting them, Linux at some point had this feature where a shared object (so-file) was mmap()'ed into the process without any form of copy-on-write, so overwriting the file on disk would also overwrite it in memory. That one did bite me once, when manually replacing some things in /lib with a version that only differed in minor version number, and thus had the same filename. That ended up crashing every daemon using those libraries.

      Had I just deleted the libraries first, the daemons would keep using the still open file, and it would not be overwritten by the new version. Of course after deleting the old version, I would then have been unable to use tar or mv to put the new libraries into /lib.

    6. Re:Another RAID? by Anrego · · Score: 1

      Why buy one array when you can have two at twice the price?/quote

      This is actually what I do, except not at twice the price.

      I have my high-ish performance (for a home server anyway) file server made from decent parts, and then a server of equal storage capacity made from an old desktop with a couple sata cards, software raid, and those ultra-cheap "green" drives.

      Reliability is less important in a backup server, and so is performance. As long as it doesn't die at the same time as your main server dies, or during recovery, it doesn't really matter, so you can really cheap out on it.

    7. Re:Another RAID? by Anrego · · Score: 1

      .. the heck

    8. Re:Another RAID? by kenaaker · · Score: 1
      Here's a more subtle way of doing a file system in (from personal, bitter experience).

      As root of course.

      Start in any directory

      rm -rf .*

      On the system that I discovered this on, the first file it removed was the system kernel. That's when the panic started. I was just trying to get rid of some hidden directories in a home directory.

    9. Re:Another RAID? by Anonymous Coward · · Score: 0

      I watched a sys admin do this on a Unisys Unix box back in 1989 or so, and the same thing happened (although I don't recall if the exact file needed was libc.so or something else).

      The fun part was, after he trashed the system, he discovered that the tape containing the replacement OS was unreadable. :-)

    10. Re:Another RAID? by Anonymous Coward · · Score: 0

      rm -rf /
      will happily remove the entire file system. I know I've done it :) I love virtual machines .

  5. ZFS pool by Anonymous Coward · · Score: 1

    Use a ZFS pool using a combination of a mirror, a raidz3 & spares. Add new disks as hot spares when money can be allocated. Easy, some what affordable & allows for failure.

    1. Re:ZFS pool by Menkhaf · · Score: 1

      What the parent said.

      Besides that, do weekly scrubs (to catch hardware errors and ensure consistency) and nightly snapshots (to foil any accidental deletion, CryptoLocker etc.), depending on how IO the file system sees. I also regularly do a `zfs send | ssh server2 sudo zfs receive` to do an offsite backup.

      --
      A proud member of the Onion-in-Hand alliance
    2. Re:ZFS pool by Immerman · · Score: 1

      As the entire point of the question highlights though, RAID (or ZFS) only adds fault-tolerance, it is not at all the same thing as a backup. Lightning strike, flood, or act of dog can all easily wipe out your entire quintuple-redundancy array in moments. Backups, preferably off-site, specifically secure against such things.

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
    3. Re:ZFS pool by MrNaz · · Score: 1

      +1 on this.
      ZFS/RAIDZ is THE solution to this problem.

      --
      I hate printers.
  6. Why you back up your porn? by DrMcNasty · · Score: 0

    You don't, at least not for cheap.
    The only way you would be able to do something like this effectively would be to run another raid along side of your working raid and backup off to that
    Still, holding onto 20 TB of data is overkill.

    I would recommend bringing that down to a more manageable size for a home user and only backing up the items that are not easily replaceable, like personal photos, movies and documents. Backing up your movie collection or porn collection is just a waste.

    --
    "Voices In My Head" The Unauthorized Biography
    1. Re:Why you back up your porn? by Anonymous Coward · · Score: 1

      While I agree that the likely use case here was pirated movies or porn, there are several very legitimate use cases such as a small / home business doing video production (someone I know has a contract to do several local school sports games as an example).

    2. Re:Why you back up your porn? by Anrego · · Score: 1

      For a long time that's what I did (not the trimming down storage requirements, but the backing up the irreplaceable stuff).

      In my 20 TB array (12x2TB drives in raid6) the "irreplaceable" stuff comfortably fits on an external 2TB drive. I have two of them.. one I leave plugged in, one I keep elsewhere, synced daily, swapped out periodically).

      However I recently did start backing up the whole 20TB (within the last year) via a completely separate "backup" server made from cheap parts. It sounds extreme, but when you consider that performance and reliability don't really matter, and storage is cheap, you can throw together a backup server pretty quick (in my case I used an old desktop with some sata cards jammed in, and those ultra-cheap "green" drives).

      Most of that 20 TB is replaceable (media rips for which I still have the media) or stuff that I probably wouldn't miss, but it would still be an epic hassle if it irrecoverably died, so having a complete mirror as a safety net is nice.

  7. Don't bother. by Anonymous Coward · · Score: 1

    I really doubt anyone actually uses 20TB of movies & music. It just sits there.

    1. Re: Don't bother. by binarylarry · · Score: 1, Insightful

      Exactly, do you really need to hoard all that content?

      --
      Mod me down, my New Earth Global Warmingist friends!
    2. Re: Don't bother. by SJHillman · · Score: 1

      Choice can be a good thing. Besides, you never know when he might have a lady friend over who's just dying to see the Director's Cut of Midgets Take Manhattan or Dwarfs Do Debbie Doing Dallas.

    3. Re: Don't bother. by Anonymous Coward · · Score: 0

      Do you also have efukt as your home page?

    4. Re: Don't bother. by MightyYar · · Score: 1

      While I agree, who are we to judge? It's a technical question, not one of lifestyle. For what it's worth, 20TB is only about 400 ripped BluRays. A large, but not inconceivable, video collection. If he's doing home movies, these can eat up serious drive space. The nicer consumer models capture at around 24Mbps... over 10GB per hour. Anyone with a first baby who likes to take videos probably has many hours of video.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    5. Re: Don't bother. by dcw3 · · Score: 1

      That's only his pr0n collection, barely!

      --
      Just another day in Paradise
    6. Re:Don't bother. by Anonymous Coward · · Score: 0

      Yeah, but his question wasn't whether or not he uses it. It raises the question as to how it got lost, though (maybe that filesystem should have been mounted readonly?).

    7. Re: Don't bother. by kenh · · Score: 1

      If he created backups incrementally as he grew his collection, the effort to back it up would be trivial, and while the restore would be boring, at least it would be possible.

      Someday people will learn that RAID is not a suitable replacement for backups.

      Seems to me that 5x four terabyte hard drives would have been a wise investment - heck, toss in a couple extra drives and a RAID option like RAID 5 or 6 and you should be sitting pretty, just copy updated files as they are added to the main array... Of course, it may take a while to rebuild a list 4 TB drive...

      --
      Ken
    8. Re: Don't bother. by jedidiah · · Score: 1

      Yes. No one needs silly things like property rights anyways.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    9. Re: Don't bother. by jedidiah · · Score: 1

      You laugh but I had an opportunity of this sort just last night.

      It's nice to have all the things you like or are likely to want at your fingertips. It's also nice not needing to worry about some snafu with Amazon, or Netflix, or Comcast, or TWC buggering the whole thing.

      Trailer trash can try to save money by streaming things.I would rather own them and user them under my own terms.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    10. Re:Don't bother. by Wdomburg · · Score: 1

      Depends entirely on what you are storing. A typical blu-ray movie probably runs an average of 25GB or so. You would burn through 20TB in about 800 movies at that rate. Even less if you preserve special features and alternate cuts ( Hardly an unreasonable collection.

      Television series can eat us space even faster. Look at something like like Lost, which comes on 36 blu-ray discs. Then add Star Trek: Next Generation (+35), Doctor Who (+29), Battlestar Galactica (+20), Farscape (+20), Stargate: Atlantis (+20), Star Trek (+20), Heroes (+18)... We're up to 198 discs so far. Even at single layer capacity, that is around five terabytes of data with only a handful of shows. Add in Firefly, the later Star Treks, Babylon 5, Lexx, X-Files, Carnivale, Prisoner, Alien Nation, Red Dwarf, Torchwood, Blake's 7, Fringe, Twilight Zone, Outer Limits, Angel, Buffy, Dollhouse, Terminator: The Sarah Conner Chronicles and you could easily most of the space with just sci-fi / fantasy content.

    11. Re: Don't bother. by ihtoit · · Score: 1

      20TB for me is at maximum shy of 60 hours of "decent"* quality video.

      I shoot lots of uncompressed *QHD (720p) and 625 PAL footage. That takes up a LOT of space. 370GB/hour and 93GB/hour respectively. Yes, I am poor and can't afford a 1080p DSLR much less a Red 4k, yet I never delete a single frame. All that raw footage is copied twice, one goes straight into storage, one is the working copy, and the original stays on the camera until the production is finished.

      I will burn a 2TB drive on a day's shoot without even thinking about it.

      I'm not even going to try and estimate how much stock footage I have (several TB), never mind how many drives I have it on (a walk in wardrobe with a custom pop-rack and several dozen drives ranging from 200GB to 2TB). All I know is, I have a stack for personal use and a stack for work, and when a job comes in the first thing I do is buy a new drive for running backup. Pretty much guaranteed it'll get filled.

      --
      Political debates have me rolling my eyes so much I think I got optical whiplash. I should sue. - Foamy The Squirrel
    12. Re: Don't bother. by Anonymous Coward · · Score: 0

      Absolutely. Having that many books should be illegal.

      Ohh wait...it is illegal to have that many books.

    13. Re:Don't bother. by subminiature · · Score: 1

      You have just listed part of my media library. and I guess I should look further into the two I haven't come across.

    14. Re: Don't bother. by Anonymous Coward · · Score: 0

      I bet it's his porn collection. Every last bit of that 20 TB was precious memories for him. Oh the grief of so many intimate memories lost!

  8. Hard drives + Robocopy by DogDude · · Score: 5, Informative

    I have a 16 TB media collection at home that I just back up on more hard drives.

    External hard drives in USB cases + Robocopy works great for me.

    --
    I don't respond to AC's.
    1. Re:Hard drives + Robocopy by Zocalo · · Score: 1

      It's really either this or tape. Just be sure to verify your backups, make sure that you can actually restore, and keep a copy off the data off site at a suitably remote family member/friend's place. Personally, I use external USB3 disks as they are cheap and can be left copying data overnight, especially so if you structure the data so that it's easy to segment data on the RAID across multiple external drives, and replace them long before any MTBF kicks in. Also unlike some tape systems, don't require particular software to backup/restore - any directory copy/sync tool of choice will do, and for restoring specific files you can even use a file manager or command line copy.

      --
      UNIX? They're not even circumcised! Savages!
    2. Re:Hard drives + Robocopy by ckedge · · Score: 1

      Yup, same here. It's annoying, having twice as many hard drives as one needs including one entire set on the shelf, but it's the way to go.

      I don't actually have a raid array for the live data, I have just a collection of disks mounted individually and so the files are already forced into "appropriate sized sets" suitable for a simple full disk robocopy.

      I'm not sure what I'd do if I had a massive raid array of that size. Probably just grin and bear it and have a single 1-3 TB "new/incoming" that can be regularly backed up, and when it was full then make a final backup for the shelf and move it's contents into the long term raid array storage area, and I'd (try) and never make changes to the main long term raid files.

    3. Re:Hard drives + Robocopy by MightyYar · · Score: 1

      I'm not on the scale of you guys, but I just approach it as if the hard drives come in sets. I never buy a single hard drive unless I'm replacing a failed drive. If I can't afford enough hard drives to expand my capacity along with backup, I just make do with what I have. A good culling can make a remarkable amount of space available. Tools like "Grand Perspective" on Mac and "SpaceMonger" on Windows make this chore a bit less tedious. Once your kids are over a certain age, do you really need those ripped seasons of Dora the Explorer? Are you ever going to watch Minority Report? Are you ever going to catch up on those old episodes of Daily Show?

      That said, I don't generally bother backing up movies. If the RAID barfs, oh well. There are exceptions, and all of my music is backed up... who wants to rip all of those CDs again? And music only takes up a few hundred GB.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    4. Re:Hard drives + Robocopy by Lennie · · Score: 2

      Well, you might not need to have twice as many. It just depend how much you are willing to pay.

      He is using a RAID-array as his primary storage. You will only need the other copy as a disaster recovery. In case you accidentally delete or format or the hardware RAID controller fails and you can't replace it anymore. Or something else stupid like that.

      So if you have a RAID5 of your originals and a copy on non-RAID disks and do regular checksums of your data to compare if reading the data from the RAID and non-RAID disks is still the same, you should be OK in most cases. The checksums checks could be rsync -c or ZFS/BTRFS or whatever.

      If you want more assurances, you're bigger issue is: you should have your copy off-site.

      If you've done all that, only then you might want to make sure the copy has RAID.

      Next step up from that is to keep a history of your changes of course, to have a real backup. Not just a copy.

      --
      New things are always on the horizon
    5. Re:Hard drives + Robocopy by DogDude · · Score: 1

      That's a lot of complexity and room for error just to save a few hundred bucks.

      --
      I don't respond to AC's.
    6. Re:Hard drives + Robocopy by Anonymous Coward · · Score: 0

      Exactly. It's not difficult to accumulate a considerable amount of data at home, so a well-thought-out backup & restore is essential.

      We have 16TB of disks (8TB effective, in RAID) on our Synology DS414 file server at home, with automatic incremental backup every day via USB3 onto a pair of 4TB external disks. There's another 12TB of disks (6TB effective, in RAID) on an older Synology as our media server, with automatic incremental backup via USB2 onto two 3TB external drives. The external disks are cycled with another set every few weeks. I really wish USB3 had been invented a few years earlier, as the restore time is appalling on the media server (yes, I tested it).

    7. Re:Hard drives + Robocopy by Anonymous Coward · · Score: 0

      We have just over 8 TB of data on our home media server, but gave up on regular backups a couple of years ago. Built a duplicate server for the shop next door and have them mirrored. Unless both both buildings blow up or burn down we should be somewhat covered, right?

    8. Re:Hard drives + Robocopy by mjwx · · Score: 1

      I have a 16 TB media collection at home that I just back up on more hard drives.

      External hard drives in USB cases + Robocopy works great for me.

      Yep, two storage devices. Once you've got the data onto your backup device, you only have to keep it synced up some how. Personally I'd use a NAS but 4TB would be plenty for me.

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
  9. Title correct? by Anonymous Coward · · Score: 0

    Shouldn't this be titled 'Ask Slashdot: How Do You Backup 20TB of Data?'?

  10. If it's that important, pay for tapes by Anonymous Coward · · Score: 0

    Not really a hard question to answer.

    1. Re:If it's that important, pay for tapes by DickBreath · · Score: 1

      But punched paper tape is slow and makes a lot of noise.

      --

      I'll see your senator, and I'll raise you two judges.
    2. Re:If it's that important, pay for tapes by Waffle+Iron · · Score: 1

      On the bright side, my math indicates that 8e13 paper chads would take up about 20 thousand cubic meters of space. That would probably be enough free fuel to heat your home for a lifetime.

    3. Re:If it's that important, pay for tapes by Bing+Tsher+E · · Score: 1

      The chaff would get to be a problem too, after the first several dumpsters full.

    4. Re:If it's that important, pay for tapes by Immerman · · Score: 1

      Try using karate-chopped tape instead. As long as you disable the dramatic shouts it's considerably quieter.

      --
      --- Most topics have many sides worth arguing, allow me to take one opposite you.
  11. You can just plug in an external drive, by CoolCash · · Score: 1

    but you need real backup software. As you fill up drives you replace it and continue the backup until you have a full backup. This way you can take them off site. Like any other backup solution, make sure you test the drives every few months to make sure your data is not corrupt and have a failed drive.

    1. Re:You can just plug in an external drive, by Anonymous Coward · · Score: 0

      Yes, if the software can handle a disk filling up, pause and wait for a new disk to be mounted, something like the Thermaltake BlacX would be ideal. It's an SATA dock, you can just drop a bare drive straight into it, mount it and away you go. With eSATA you get plenty of speed. You still need seven 3TB drives, but future expansion is easy - you just buy another disk.

      I think this is better than a mirror RAID array. If you were to make another 20TB RAID, but your data grows, you're going to need physical space in the array to keep adding disks or you have to start swapping out drives from the array and replacing them with bigger ones.

  12. Old School by Anonymous Coward · · Score: 0

    Tape.

    1. Re:Old School by bobbied · · Score: 1

      If you took all the punch tape ever produced, I'm not sure you'd have 20Tb worth of storage... I wonder how many times THAT would go around the earth?

      --
      "File to fit, pound to insert, paint to match" - Aircraft Maintenance 101
    2. Re:Old School by tgv · · Score: 1

      The earliest tape had 128B/inch, so that would be 3.9e6km, or 100 times the earth's circumference.

      Wikipedia however mentions: "As of 2010, the record for highest data on magnetic tape was 29.5GB per square inch." So assuming tape one inch wide, that would be just 17 meters, or 4.3e-7 times the earth's circumference.

    3. Re:Old School by Anonymous Coward · · Score: 0

      Challenge accepted.

      This is a big assumption, as I've never used punch tape and I cannot find an accurate description of size.

      Diameter of Earth: 7,926.41 miles (12,756.32 kilometers).
      punched tape: 120 char per second, "up to a few dozen kilobytes", lets assume 36kb. Also, magnetic tape came in lengths somewhere around 2000ft, using that length means each character would take 2000/(36*1024) feet, or roughly 1 character per 0.05 feet, or 0.65 inches. Lets do half, 1000ft means 1 character per every ~1/3 inch.
      (20TB in 1000ft increments/ 36kb unit ) / diameter of earth
      ((20 * 1024^4) / 36) / (7926.41 * 5280 * 12)
      610839793208.88888888888888888889 / 502217337.6
      = ~1216.285 times

      Anecdotally, it would take 89,194,037.79 years to write all that data at 120 char per second.

  13. If you want to hoard bits... by polymeris · · Score: 3, Insightful

    > It's not like you could just plug in an external drive [...]
    Why not? Maybe not one, but 10 or 20 of them.

    1. Re:If you want to hoard bits... by Shortguy881 · · Score: 1

      Thats what I dont get. If he has 20TB of hard drive space as it is, why not double it and use that as a back up?

      --
      Brilliance without wisdom, power without conscience. Ours is a world of nuclear giants and ethical infants.
    2. Re:If you want to hoard bits... by minstrelmike · · Score: 1

      Yes. The _only_ way to have a usable backup of that data is to have a second RAID array.
      If you just want a backup in order to check a box on a list (like most companies), then tape might work.

    3. Re:If you want to hoard bits... by Lumpy · · Score: 1

      SDLT tape, sitting under 3 feet of water is still readable and recoverable. Try that with your second raid array.

      Second raid is NOT a backup it's high availability.

      --
      Do not look at laser with remaining good eye.
    4. Re:If you want to hoard bits... by dingleberrie · · Score: 1

      At home I have four 4TB seagate USB 3.0 backup plus drives for 16 TB of backup. I have too much backup storage so that I can allow for not fully using the drives.
      I run Cobian backup (as an incremental directory duplicator) and just pick different directories (and disks) to back up. BTW, I bought a USB 3.0 card to give me those ports, so that's not an excuse either.

      This does not solve the problem of bitrot, (looking at building a ZFS for that), but it is a simple way to have a scalable backup system for most usages.

      I don't get why you can't plug in an external drive.

    5. Re:If you want to hoard bits... by fuzzyfuzzyfungus · · Score: 2

      Tape actually works pretty well, it's just that if you are running a small enough system that "Well, obviously, just get another 20TB array and store it at one of your other sites, idiot... On second though, get two extras and put each one at a different offsite location" isn't the answer that springs to mind almost reflexively, you probably can't afford the cost of entry.

      An actually contemporary tape drive(and a machine capable of keeping it fed when it is running full bore) is Not Cheap; but the fleabay shit that is cheap tends to offer painfully mediocre capacity and unknown reliability. Disks, by contrast, have a cost of entry that basically starts at zero and scales more or less linearly with the number of disks, unless you absolutely must have them all online at the same time(and even here, you hardly need screamin' hardware RAID for your backup volume, and bulk SATA ports of undistinguished performance are cheap).

    6. Re:If you want to hoard bits... by ewilts · · Score: 1

      That's a lousy answer. Tape *does* work. It can be slower than disk, but disk is not the *only* way to have a usable backup. Tape is not dead.

      Tape works. Disk works. Offsite replication works. Do the math for how much data you have and how much bandwidth you have or can afford, and do the calculations on how much data you really have to back up. In many cases, recreating the data can be significantly cheaper than backup it up and restoring it. If you have your original CDs and DVDs, put them in an offsite location. If you have a disaster, you can have them re-ripped for a LOT cheaper than backing them up.

      I've help run a multi-petabyte data center with backups to tape and they worked. Everything written to tape was restorable. I currently run a multi-petabyte data center and replicate everything to disk in an offsite location. It also works. Neither is cheap.

      Figure out what part of the data is important to you and how long you can wait to get it back. If a fire burns down your house but you need the data back in minutes or hours, then tape is obviously not the answer but then neither is DVDs or any cloud provider.

      There is NO single answer that's good for everybody. It's a cost/benefit/risk analysis that every first year comp sci student better become familiar with.

      --
      .../Ed
    7. Re:If you want to hoard bits... by jon3k · · Score: 1
    8. Re:If you want to hoard bits... by jon3k · · Score: 1

      You'd need 25 tapes (800GB) and a full backup or restore would take 3.8 days (peak theoretical 60MB/s). Depending on the rate of change this may not be viable.

    9. Re:If you want to hoard bits... by Attila+the+Bun · · Score: 1

      That's what I thought. I have a backup server in the basement, built out of two Raspberry Pis attached to four USB disk enclosures. Goodsync takes about 15 minutes to scan my 4TB of data, depending on how many new files there are. It would be faster over a wired network. Scaling that up to 20TB across five 4TB drives doesn't sound expensive or difficult.

    10. Re:If you want to hoard bits... by Anonymous Coward · · Score: 0

      I see uneducated moderators are out. Lumpy is right, RAID is not a backup.

    11. Re:If you want to hoard bits... by bill_mcgonigle · · Score: 1

      Second raid is NOT a backup it's high availability.

      That's entirely a matter of configuration. If I were doing this with drives, I'd grab an 8x SATA card, 8 4TB drives, an external hot-swap shelf, and setup RAIDZ-2 with compression on it. I'd put the drives in, rsync the data to them, snapshot it, pop them out, and bring them to a safe deposit box for safe keeping. Cost is about $1800. That's totally a backup. A second set of backups is another $1300.

      Can somebody compare the cost for tape?

      Also, stop hoarding stuff, it's expensive. My (legit) backups of my extensive DVD(2.4Mb/s h.264) and CD(FLAC) collection fits in less than 2TB.

      --
      My God, it's Full of Source!
      OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
    12. Re:If you want to hoard bits... by flyingfsck · · Score: 1

      Par2 or rsbep to the rescue.

      --
      Excuse me, but please get off my Pennisetum Clandestinum, eh!
    13. Re:If you want to hoard bits... by im_thatoneguy · · Score: 1

      It's not even that extreme these days, you can get 4TB drives now. So we're down to 5 drives stacked in a closet.

    14. Re:If you want to hoard bits... by minstrelmike · · Score: 1

      "usable" backup means the ability to restore.
      in my experience, it takes a lot of time to restore from tape so it isn't done so therefore your backup remains untested.
      To test your backup, you should restore one or two random files from it every time you make one.
      Tape backup of huge sets doesn't give you time to do that.

    15. Re:If you want to hoard bits... by minstrelmike · · Score: 1

      _anything_ that is a second copy of your data is a backup, whether that is RAID or Tape or papyrus scrolls.
      Usable backup means you can actually retrieve it, and that's always iffy regardless of medium.

    16. Re:If you want to hoard bits... by ewilts · · Score: 4, Informative

      The dataset isn't that huge. Tape can write at speed at least as fast as disk - LTO-5 writes at up to 280MB/sec - far faster than you can read the source at which isn't likely to be fast disk. The seek for a single-file restore will be slower than disk but after the initial seek, the read will be as fast as from a typical archive disk (no, you're not archiving 20TB to SSD, nor are you storing the source data on SSD either)

      However, the change rate for this application is likely to be low. That makes it very feasible to do random testing from the new backups where a minute to do the tape mount/seek is not a problem. You won't be writing more than a single tape in any single run (LTO-5 is ~1.5 TB of uncompressed data).

      For $2K, you'll have the LTO-5 drive. Add $500 for 20 tapes and you can back up the entire set (once) plus a bunch of incrementals. I haven't done the math with LTO-6 which is faster and holds more data. If you want multiple generations, tape is a lot cheaper per TB than disk. The initial drive cost hurts but after that, the price is good at $15/TB or so.

      --
      .../Ed
    17. Re:If you want to hoard bits... by nabsltd · · Score: 2

      An actually contemporary tape drive(and a machine capable of keeping it fed when it is running full bore) is Not Cheap; but the fleabay shit that is cheap tends to offer painfully mediocre capacity and unknown reliability. Disks, by contrast, have a cost of entry that basically starts at zero and scales more or less linearly with the number of disks

      This is absolutely the best statement of this ever. Everyone who claims that tape is the One True Backup doesn't factor in the startup cost of $2-4K for a tape drive that can handle reasonably large capacity tapes, the hardware to connect it to a computer (many of these tape drives have fiber-channel as the only option), and then the cost of some kind of changer if you want any amount of automation.

      For home use, hard drives are by far the cheapest and most convenient method, as long as you are in the less than 50TB world. If you aren't satisfied with hard drive reliability, back up your data twice, to two different drives. With 4TB drives selling for around $170, and backing up twice as I suggested, you don't reach the break-even point with tape (single backup) until you need to back up 15TB. With single backup to disk, the break-even is around 45TB.

    18. Re:If you want to hoard bits... by houghi · · Score: 1

      I was thinking along the same lines. If you have 20TB you should already have a NAS. So buy a second NAS and mirror it. Cost of a QNAP os about 1000 EUR and 8x4TB=1200EUR. So for 2200EUR you have 32TB available. Those including Belgian taxes and I have not even searched for prices. If you 'only' want 20TB and no way to grow, you are ready with 5x4TB + 5tray for 1550EUR. (all prices via alternate.be if you want to verify. Cheaper IS possible.)

      --
      Don't fight for your country, if your country does not fight for you.
    19. Re:If you want to hoard bits... by LVSlushdat · · Score: 1

      I do a multi-level backup.. I have about 110GB of stuff thats nice to have, but if the house burned down and took the machine the data was on, the house server, AND the USB NAS drive that 110GB is backed up on, I'd not lose too much sleep, THEN I have a smaller subset of data, about 10GB that I would cry my eyes out if it were lost. You can imagine the data is.. Tax returns, pictures, scanned documents, etc.. That subset is backed up to the NAS and house server along with the 110GB of stuff, PLUS its backed up to AWS S3. I really couldn't see wasting $$ to backup the whole 120GB...

      --
      THANK YOU, Edward Snowden!! Americans owe you a debt of gratitude (whether they know it or not..)
    20. Re:If you want to hoard bits... by afidel · · Score: 1

      ReFS on a parity storage space also solves bitrot and it's built into Window 8.1. Plus you can use an SSD as a cache.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    21. Re:If you want to hoard bits... by emj · · Score: 1

      If I were doing this with drives, I'd grab an 8x SATA card, 8 4TB drives, an external hot-swap shelf, and setup RAIDZ-2 with compression on it. .... Cost is about $1800. That's totally a backup. A second set of backups is another $1300.

      Can somebody compare the cost for tape?

      You pay about ~$1800-$2500 for tapes, a tape driver, and SAS card, depending on vendor, an extra 27 tapes to store 20TB again would be about $700. The big win is that the tape setup take less space, changing tapes every 800GB is not so hot though, but you get used to it. I've used 4 tape drives in parallel to be able to backup all data generated durng the day, it's ok, you learn to handle that with the morning coffee.

    22. Re:If you want to hoard bits... by Anonymous Coward · · Score: 0

      to backup 20TB, I could use 12 4TB HDD in a RAID6. That can do a lot faster than 280MB/s.

      I can get 12x4TB for $165 each (lowest price on first page of Newegg), $1980 + $513 for a 16xSAS300 RAID controller (http://www.newegg.com/Product/Product.aspx?Item=N82E16816103213R).

      But this really sits on the boundary economics between LTO and RAID replica. The advantage to RAID replication is speed. The advantage to LTO is weight, cost of incrementals and expandability.

      If the storage set was double the size, it would be a no-brainer: go with LTO. If it was half the size, it would be a no-brainer: go with RAID. But it also depends on the nature of the data. If it's a bitpile (warez, etc), then there aren't going to be lots of incrementals. If it's a log-append database, it's also just going to grow, you're going to have to keep adding disks to the original database, but there's going to be continuous incrementals: go with LTO.

      I wish there was a tape archive solution for me. DDS and co are dead, and my home dataset is around 8TB of mixed databases and warez. LTO is too big and expensive, and there are no cheap tape systems for smaller ~200GB a week backups. I could get an older gen LTO, but the older gen tapes are about as expensive as current LTO tapes, and far less $/GB than HDDs. Only LTO-5/6 has competitive $/GB against HDD but then the machine is too expensive.

    23. Re:If you want to hoard bits... by fuzzyfuzzyfungus · · Score: 1

      There's also the matter of your expected retrieval case: tapes almost certainly beat HDDs in the archival timeframe, so suck it up and pay up; and they (if reasonably modern) can be alarmingly fast at the linear reads and writes associated with doing full restores or fast backups of data that have been suitably lined up to be shoved onto tape.

      On the other hand, they are almost perversely non-random-access(doubly so if you are talking about a multi-tape set with library swapping, more than doubly so if you are talking about a multi-tape set larger than your library can handle with junior-admin swapping), so the 'somebody fucked up, they need File X to be like it was three months ago' scenario sucks.

      Nearline or consumer SATA on undistinguished controllers aren't quite as zippy; but they might as well be a RAMdisk compared to tape if you are doing small-scale restores, though probably not as fast if de-icing a much larger dataset for wholesale restoration of multiple systems.

    24. Re:If you want to hoard bits... by servant · · Score: 1

      Go check out the backblaze blog. They go over physically making their pod's. A couple of pods, would do, and keep them in sync with rsync or whatever.

      --
      ... "When you pry the source from my cold dead hands."
    25. Re:If you want to hoard bits... by david_thornley · · Score: 1

      Also, if I have a home setup and it's in a fire (one of the use cases for backups), I need another $2-4K tape drive to restore.

      --
      "When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
    26. Re:If you want to hoard bits... by Anonymous Coward · · Score: 0

      Four words. Linux, ZFS, Replication, Snapshots

    27. Re:If you want to hoard bits... by Anonymous Coward · · Score: 0

      Uh huh..... And how about those nightly backups swapping out Ten tapes?

      Unless you're forking out money for a robotic library, and a backup management system to deduplicate (No GFFS crap!), you're spending a Hell of a lot of time managing this crap.

      If you go the tape route, that is....
      Those who know, don't use tape.

    28. Re:If you want to hoard bits... by Anonymous Coward · · Score: 0

      Incorrect on a couple of points. The worst being LTO-5. Try ~140MB real world uncompressed. Unless everything you have is a giant ASCII file there is no way to get 280MB on lto5, 6 can barely push 150/160MB/s.

  14. Two by Anonymous Coward · · Score: 0

    Most businesses would have two. Then just sync them. If its not worth it to you, the data must have been worthless.

  15. tape by Anonymous Coward · · Score: 0

    Use tape.

    1. Re:Tape by jellomizer · · Score: 1

      If I only had mod points today.

      My thought was a second RAID Device, however my out dated experience with tapes had a 2gb limit.

      --
      If something is so important that you feel the need to post it on the internet... It probably isn't that important.
    2. Re:TAPE by Anonymous Coward · · Score: 0

      I use way over 20TB but I am a data connoisseur. I ripped my DVD collection and upscaled it to bluray resolution, and store it uncompressed for the absolute highest quality possible.

    3. Re:TAPE by Anonymous Coward · · Score: 0

      You can download 30GB 2 hour movies. Some people are really into their entertainment centers.

    4. Re:TAPE by Lawrence_Bird · · Score: 1

      So thats like..almost 700 movies? I dont think I've watched even half that many in my entire life.. Not telling people what to do by any means but I do think some people are becoming digital pack rats / digital hoarders. Perhaps that is a new category needing intervention by social services!

  16. No Need for Backup by Anonymous Coward · · Score: 0

    No need to back it up because he owned all the original CDs.

    Oh . . . wait.

    1. Re:No Need for Backup by Cramer · · Score: 1

      That's what I was thinking. Legal "hard" copies of all that stuff would be far cheaper than any archival backup technology (tape, blu-ray, etc.)

      It would take a lot of blu-ray's, but as the content is entirely static, it wouldn't be that much work to backup. (actually, "archive" is correct term.)

    2. Re:No Need for Backup by Medievalist · · Score: 1

      I know you were kidding, but that's what my backup is.

      I have five large crates of CDs sitting in a cellar - and that's my backup of my music collection. Of course, my music collection is only about 250 GB of FLAC files, not 2 TB... if the guy's not generating his own video, it's hard to imagine where he'd get 2 TB of legally owned content without any sort of initial distribution media.

  17. Backblaze by Anonymous Coward · · Score: 0

    Unlimited backups, $5 a month...
    you need a BIG connection though...

    1. Re:Backblaze by ibneko · · Score: 1

      +1 Backblaze. I've been using it for 3+ years now and periodically had to restore data, which has worked flawlessly. Their online interface is a bit slow, especially if you're trying to find and restore one file deep inside inside a very large directory, but otherwise, no real complaints.

    2. Re:BackBlaze by Anonymous Coward · · Score: 0

      I currently store 13TB, and they haven't ever said anything to me.

  18. Go on the internet and find a DLT drive by Anonymous Coward · · Score: 0

    some tapes and done..... should not even be that expensive..... everything else will cost more!
    What will sadden me is that you could not thought of this yourself......

    1. Re:Go on the internet and find a DLT drive by MightyYar · · Score: 1

      At 300GB per tape for the best, most expensive tapes that would require 69 tapes (and 69 hours) for a full backup. At around $120 per tape, that is over $8000, just in tape. A second RAID with full redundancy would be far cheaper - the 10 4TB hard drives coming in at around $1700. Doing it with 2-drive redundancy would only require 7 drives.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    2. Re:Go on the internet and find a DLT drive by seeker160 · · Score: 1

      Tape has come a long way since that. With lto5 (last gen current is lto6) you can do some where between 1.5tb to 3tb compressed at about 23 dollars a tape. For the cost of between 2 to 3 of the (cheapest) 4tb harddrives you could have a (questionable ebay) writer and the tapes need for 20tb of data.

    3. Re:Go on the internet and find a DLT drive by MightyYar · · Score: 1

      I was addressing the comment to use a DLT drive. Those only hold 600GB compressed.

      On niggle with your response, though... you won't get any substantial compression with music or video. You will likely need double the number of tapes that you think you will.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    4. Re:Go on the internet and find a DLT drive by Anne+Thwacks · · Score: 2
      LTO is 6.25TB per tape. Tapes are about $100. Drives are NOT cheap, but if you have 20TB now, I am guessing you will soon have 40TB

      The benefits of tape are:

      Data will probably last 30 years (I have read 30 YO tapes myself) HD interfaces go out of fashion every few years.

      You can have a pool of tapes, and recycle them when you no longer need the data.

      Tapes will survive serious abuse. (A lot more than HDs anyway) definitely included the back of a station wagon (except in tropical climates).

      You can use Amanda, Bacula or tar for free. (I recommend tar if you want to keep the data for 30 years).

      --
      Sent from my ASR33 using ASCII
    5. Re:Go on the internet and find a DLT drive by ewilts · · Score: 3, Informative

      You're dating yourself. LTO-5 is 1.5TB native, 3TB compressed at $25 per tape. LTO-6 is 2.5TB native and 6.25TB compressed. Both of those compressed numbers are using the built-in compression in the drive.

      A 10-pack of LTO-5 tapes is about $250.

      You can easily encrypt the tapes and tape them offsite. You can keep a copy onsite and offsite. You're simply not doing that with disk.

      Your speed is also off - an LTO-5 can write at 280MB/sec. The limiting factor is not the write time on the media but the read time from disk.

      Restore times are typically limited by the write rate on the destination raidset, not the read rate from tape.

      --
      .../Ed
    6. Re:Go on the internet and find a DLT drive by grumbel · · Score: 3, Informative

      LTO-6 can hold 2.5TB per tape, a tape cost ~$70, the drives cost $2000. That's still more expensive then just more HDDs for 20TB, but at >50TB it might be worth it.

    7. Re:Go on the internet and find a DLT drive by MightyYar · · Score: 1

      Yes, I was responding to the DLT suggestion. I should have been more explicit.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    8. Re:Go on the internet and find a DLT drive by MightyYar · · Score: 1

      I'm definitely getting older, but it was not me who brought up DLT. I think the AC was recommending that the submitter find an old DLT drive on eBay or some such place and use that. While that would make the drive itself cost-effective, I wanted to show that there is a reason those old things are available for a low cost.

      If you do modern tape, compression is worthless on video and music, which is presumably already compressed - so you will need the full tape capacity. The tape drives themselves cost more than buying 40TB worth of hard drives. Tape is great for many use cases, but backing up your home media center probably isn't one of them.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    9. Re:Go on the internet and find a DLT drive by Anonymous Coward · · Score: 0

      not the read rate from tape.

      What breaks the restores is usually the seek rate. That's the only drawback with tapes. You can also buy a 24tape lib with one lto5 drive for 800€. At least you could 3 years ago.

    10. Re:Go on the internet and find a DLT drive by Cramer · · Score: 1

      DLT-S4, the last generation, holds 800G native. But it's deadend technology now... it hasn't been manufactured in years, and finding actually new tapes is next to impossible. (noone on ebay is selling "new" S4 tapes. I don't give a shit what they claim. The eMAM proves them liars -- any tape without a SN has been bulk erased, RUN away from those.) Also, when you do find "new" (as in never used) tapes, they're old and freakin' expensive.

    11. Re:Go on the internet and find a DLT drive by Cramer · · Score: 1

      Tape vs. hard drive is a wash for the first 20TB. (minus a controller, HD is a bit cheaper) HOWEVER, with tape, capacity scales very cheaply... the next 20TB costs less than a single HD.

    12. Re:Go on the internet and find a DLT drive by MightyYar · · Score: 1

      Yeah, it seemed like a weird thing to suggest. I see my numbers were off, but it's absurd even with 1/3 the number of tapes.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    13. Re:Go on the internet and find a DLT drive by Cramer · · Score: 1

      It's doable with S4, but it's going to be insanely expensive and increasingly harder to find the tapes. LTO 4, 5, or 6 would be a better choice.

      *I* use S4 for the high volume systems, but I'm only doing a full dump once per quarter (if that.) The majority of that data never changes. But I need to be able to rebuild any of those systems if they fail. (which is a growing likelihood -- those drives are getting really old.)

      (But for archiving stuff that doesn't change, blu-ray is a perfect choice. It's not like he'd be storing 20TB every month.)

  19. use /dev/null by Anonymous Coward · · Score: 0

    redirect all the backups to /dev/null :P

  20. Similar Situation by sanosuke001 · · Score: 1

    I have a similar situation; 18.6 TB RAID-Z at home (8 3TB drives) using FreeNAS and with the new update it shows it was initially set up using a non-native block size (I was a bit naive regarding the settings when I first set it up) and I'd like to rebuild it but I have no way to backup 14+ TB. Also, I would like to have a backup in case more than one drive dies (1 parity works well but I could still suffer a catastrophic failure). I've looked into tape backup but anything that seems like it'd have enough storage to be practical (1+ TB per tape) seems excessively expensive and the 100GB tapes seems like it'd be unmanageable.

    --
    -SaNo
    1. Re:Similar Situation by LordLimecat · · Score: 1

      LTO5 tape drives will run "just" $1500 or so, and the tapes are $25/each.

      When you hit this quantity of data theres not much thats really competitive. OP may want to get an autoloader, which ARE a bit pricey at $3k/each, but once youve spent the initial money the actual storage is cheap; you could populate it with ~25TB of storage for under $200.

      Or he could spend some time determining how much of that data is truly relevant, and whether he isnt creating more problems than he needs to avoid sorting through it (as is almost always the case).

    2. Re:Similar Situation by afidel · · Score: 2

      You can get an LTO4 SAS drive for ~$50 on ebay, they do 800GB native per tape, so typically ~1.2TB per tape for mixed content (obviously if it's all compressed media it will be much closer to native). 10-20 tapes doesn't seem that bad (we send that many offsite daily). The tapes will cost you ~$20 each unless you're willing to go used (ewww).

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    3. Re:Similar Situation by WuphonsReach · · Score: 1

      When you hit this quantity of data theres not much thats really competitive. OP may want to get an autoloader, which ARE a bit pricey at $3k/each, but once youve spent the initial money the actual storage is cheap; you could populate it with ~25TB of storage for under $200.

      That's the big problem with tape, the cost of the drives. And most of the cheaper autoloaders are not without issue, resulting in jammed/broken tapes. Especially in a home environment where there is more foot traffic around to bump into things. The more enclosed and robust solutions are closer to $5k.

      Then there's the problem that LTO-6 tapes are $65 each (and they hold 2.5TB each, maybe 3-4 with compression). So to backup 20TB you are going to need around 8 tapes. That's closer to $500 instead of $200 because most bulk media does not compress well.

      And finally, you get into the problem of wear and tear on the tapes. Best-case, you can read/write 2.5TB from the tape about 200 times before it is worn out. So figure on replacing all your tapes every year or two.

      What starts to be tempting is the 8x external RAID enclosures. Good ones can be found as low as $300-$500 for backup purposes, plus 8x 4TB drives for $175 each. That gives you around 22-23TB of backup space using RAID-6 for around $2000.

      --
      Wolde you bothe eate your cake, and have your cake?
    4. Re:Similar Situation by SydShamino · · Score: 1

      Just cross your fingers and wait a few more years?
      http://tech.slashdot.org/story...

      --
      It doesn't hurt to be nice.
    5. Re:Similar Situation by Reziac · · Score: 1

      [goes to look] I'm not seeing any complete, working drives at that kind of price. Those start at around $300. :(

      Otherwise.... tell me more, and what to look for in such a drive used?

      --
      ~REZ~ #43301. Who'd fake being me anyway?
  21. Backblaze by rjshirts · · Score: 1

    If your ISP doesn't have data caps, look at Backblaze ( http://www.backblaze.com/ ). $5 / month for unlimited storage for one computer. Only available for Mac and Windows, but I'm sure a virtual instance of Windows if you're using a Linux box would work... These are the folks that opensourced their hardware design for their storage "pods." http://blog.backblaze.com/2011...

  22. Raid != backup by Anonymous Coward · · Score: 0

    I ran into this problem with software raid back when i was younger. My fix was mirroring to multiple drives and of course backing up off-site (parents house).

    1. Re:Raid != backup by jlv · · Score: 1

      mod +1

      And "onsite backup" != "reliable backup".

      This is an important point that many people miss when the think of backups.

      A had a friend who had a RAID and additionally used an external drive for offline backups. He kept the external drive in a safe when not doing a backup. Unfortunately, his house burned down, destroying all his computers. The safe wasn't safe enough as the external drive didn't work afterwards (I think the heat got to it).

  23. I delete it by Anonymous Coward · · Score: 0

    And just download new porn.

  24. github by beowulfcluster · · Score: 1

    Why not store it all in 20000 github repositories?

  25. Surely... by Anonymous Coward · · Score: 0

    If his data is legitimate, legally aquired media, he has hard copies anyway, and only uses the digital copies for his home media? As to your question; a second server seems the only feasable option to me with that amount of data, although it all depends on the monetary value you place on your data, my motto being if it's freely aquired and easy to replace, why bother, regard it as temporary, whereas if it's important or work related, back it up.

    1. Re:Surely... by Albanach · · Score: 1

      If his data is legitimate, legally aquired media, he has hard copies anyway

      You do realize that some people create their own data?

    2. Re:Surely... by mtthwbrnd · · Score: 1

      No. Not at all. Licenses allow reasonable copying of the material in case you lose the disks or they break. What you pay for is the digital information content, not the disk.

  26. BackBlaze by connor4312 · · Score: 2

    BackBlaze offers unlimited backup storage for home users for around $5/mo - encrypted with asymmetric keys. I've got about 750 GB on there myself, works great. Although they may not *like* you backing up 20 TB of stuff, they should accept it. And, if they don't, you're about back five bucks. Probably worth a try.

  27. Do something about your hoarding problem by Anonymous Coward · · Score: 4, Insightful

    "My friend (read I) lost 20TB of pirated content! What should my friend have done different?"

    How about, ask yourself, how much of that content were you intending to ever consume again. Yeah, you can most likely delete 95% of it, that's 1TB of content that you might use again.

    Hoarders! *lol*

    1. Re:Do something about your hoarding problem by ustolemyname · · Score: 3, Insightful

      Not all of us have access to the time machine required to know *which* 1TB that is.

      Are you willing to share yours?

    2. Re:Do something about your hoarding problem by Bing+Tsher+E · · Score: 1

      I use Netflix, the local Public Library, and the Movie Rental place to 'back up' most of my popular-consumption media. It's just crappy pop culture! Why store it all locally? The things I really care about that aren't widely available I have on DVD-R media, but that's probably a TB at most.

    3. Re:Do something about your hoarding problem by Anonymous Coward · · Score: 0

      But you had the time to download it? Fuck off. You're making the same excuses as someone keeping his 1979 National Geographics on the stairs. You have a problem.

    4. Re:Do something about your hoarding problem by PIBM · · Score: 1

      You downloaded it once. Download it again. Perhaps you`ll even have a faster internet connection by then, so you will be able to download a larger sized version in higher quality in the same small amount of time.

    5. Re:Do something about your hoarding problem by flanders123 · · Score: 2

      How is this insightful. This is the typical useless - "Why would you want to do that, you idiot???" - non-constructive Slashdot answer that drives me insane. In short: Ask Slashdot means "Answer the fucking question", not "Attack the question". /Rant

    6. Re:Do something about your hoarding problem by Anonymous Coward · · Score: 0

      Actually, none of the 25TB worth of stored movies I've got were downloaded. All of it was ripped from media and compressed. I've got a few bins worth of media stored away so it's replaceable but not without great effort. Your solution sux.

    7. Re:Do something about your hoarding problem by LordLimecat · · Score: 4, Insightful

      This problem isnt unique; most people have trouble curating their data. That doesnt change the fact that the problem is mostly self-created, and the best solution isnt to find another place to stuff the 20TB. Its to take the time to cull it down to a reasonable size and then back it up.

    8. Re:Do something about your hoarding problem by Anonymous Coward · · Score: 0

      Just apply some heuristic. Mine is basically this: is it obscure, was it hard to get, or is it likely to become hard to obtain, or am I likely to want to watch it over and over again, or is personal data, then back it up; otherwise delete.
      And yes, you'll sometimes regret deleting something when your heuristic failed. Man up and either get yourself some more 4 Tb drives, or just accept the small losses in life. You cannot watch everything there is as it is, and eventually you'll die and it'll all be irrelevant anyway.

    9. Re:Do something about your hoarding problem by jedidiah · · Score: 1

      > Why store it all locally?

      On the one hand you've mentioned Netflix which is cool in some ways but has it's own limitations. On the other hand, you've have mentioned things other than Netflix that are all clearly inferior and aren't even worth mentioning in the same counter argument.

      The first one completely negates the others.

      On the other hand Netflix has limitations on responsiveness and selection as well as quality. At best, the Netflix version of something will be somewhat inferior. That's assuming it's even available. Then there's the whole network problem (which also impacts quality).

      --
      A Pirate and a Puritan look the same on a balance sheet.
    10. Re:Do something about your hoarding problem by spacepimp · · Score: 1

      If by trouble then you mean: Cannot be bothered to organize and maintain. then Yes I agree most people are too lazy to pay attention to their data and storage habits.

    11. Re:Do something about your hoarding problem by Anonymous Coward · · Score: 0

      Put the files you've watched in a queue for deletion. As the space fills up, the files are removed in reverse order in the queue.

      Digital hoarding usually isn't nearly as a problem as physical hoarders, as you can just buy another drive -- but its the same principle. It's hard to make your house bigger to hoard physical things. It's always "I'll keep it in case I need it", and just gets worse.

      (been there, done that)

    12. Re:Do something about your hoarding problem by Wrath0fb0b · · Score: 2

      You don't have to cull it down, you just need to organize it into logically distinct groups and assign them priorities. Hoarding isn't the problem, the problem is assigning too high a priority to the hoarded pr0n as compared to the really important stuff.

      • Group #0:

      Contents: Documents, source control repositories, user preferences, email archives
      Maximum Size: 10GB
      Protection: 3-way Mirror + Snapshots + Offsite
      Total Space Required (way upper bound): 150GB
      Total Cost: $3 a month for Crashplan

      • Group #1:

      Contents: Personal photos, music collection, other people's #0 backups, /home
      Maximum Size: 1TB
      Protection: 2-way Mirror + Offsite
      Total Space Required: 2TB
      Total cost $3 a month for Crashplan

      • Group #2

      Contents: Everything else, media, pr0n,
      Maximum Size: âz
      Protection: Diskpool, maybe integrity if you like zfs/btrfs

    13. Re:Do something about your hoarding problem by jedidiah · · Score: 1

      Yes. Re-ripping my media hoard would suck. This applies equally well to my music as it would to my video. It's simply not something I want to have to do again. The overhead of a 2nd array is worth the potential bother.

      In a couple years it won't even be an issue and it will be laughable that anyone ever thought it was.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    14. Re:Do something about your hoarding problem by TangoMargarine · · Score: 1

      Some things are sufficiently difficult to find that that may not be feasible. Old and not-popular stuff that you could only find in 1 or 2 obscure places gets ripped down by the *IAA, or the site just dies sometime within the next 10 years, and then what?

      All those moments will be lost in time...like tears in the rain...

      --
      Unity? Screw that: XFCE. Slashdot Beta? Screw that: SoylentNews. Australis? Screw that: Pale Moon. UX developers DIAF
    15. Re:Do something about your hoarding problem by hodet · · Score: 1

      Pretty much, I have about 500GB of must have backup, family pictures and videos I backup to two 2TB USB drives and keep one offsite (locked cabinet at work). Sensitive documents get backed up to a Truecrypt container on a local USB and to an offsite VPS I rent monthly. The rest can be backed up on strangers systems, it's called bittorrent and Netflix

    16. Re:Do something about your hoarding problem by nabsltd · · Score: 1

      Actually, none of the 25TB worth of stored movies I've got were downloaded. All of it was ripped from media and compressed.

      Unless you aren't compressing very much from the original source, this means you have about 6,000 movies (2GB/hour, with an average movie being 2 hours).

      I've got a few bins worth of media stored away

      With 6,000 movies, your "few bins" would be a cube about 8 feet on each side, assuming standard DVD cases. And, it would also mean that you would spend over $3,000/year on movies (figuring $10/movie, and assuming you've been collecting since DVDs first came out). If you have been collecting for less time, or buy more expensive movies (Blu-Ray, special editions, at first release, etc.), then it would be closer to $5,000/year. Do you seriously spend $400/month on purchasing media?

    17. Re:Do something about your hoarding problem by Anonymous Coward · · Score: 0

      Maybe that's why it's gone. It's all been consumed!

    18. Re:Do something about your hoarding problem by Wdomburg · · Score: 1

      Are you bad at math, or have you been in a coma long enough that you'r'e unfamiliar with high definition video?

    19. Re:Do something about your hoarding problem by Anonymous Coward · · Score: 0

      Here's my backup list:

      Personal photos/videos
      Financial/legal documents

      That's it. Everything else is easily replaceable or not worth the trouble.

    20. Re:Do something about your hoarding problem by LordLimecat · · Score: 1

      A 2 way mirror is not a backup. Treat it like one and youll learn some pretty valuable lessons eventually.

    21. Re:Do something about your hoarding problem by Anonymous Coward · · Score: 0

      Yes... a time machine is the only possible solution to the question "Gee, I wonder which music I want to keep?"

    22. Re:Do something about your hoarding problem by Anonymous Coward · · Score: 0

      I am not sure I am a 'hoarder'. However I had a different issue. 2500+ DVDs. I *do* watch my movies. I got rid of cable and just watch my movies. I can buy 10 dvds a month for what I was paying the cable company. Netflix is a decent workaround however there are *many* dvds on 'very long wait'. I just go on amazon 3-5 bucks later and 1-2 days later I have a movie.

      I am in the realm of about 30TB of data. As XBMC is the awesome.

      I have a 1TB drive full of junk that yes I could blow away. The movies on the other hand are not negotiable. They are part of my media center. The wife would be massively annoyed at going back to flipping dvds.

      So do you have any other ideas than 'format c the bitch'?

    23. Re:Do something about your hoarding problem by dwpro · · Score: 1

      The same argument would have applied to email inboxes a little more than a decade ago. When storage/price isn't an issue, it's not worth the bother of curating so long as the data is easily searched. A decade from now 20 TB will probably be the average thumb drive size.

      --
      Millions long for immortality who do not know what to do with themselves on a rainy Sunday afternoon. -- Susan Ertz
    24. Re:Do something about your hoarding problem by LordLimecat · · Score: 2

      It most definately IS worth curating your inbox. One of the toughest issues ive seen WRT backups and email performance is absurdly large PST files. Guess what: that PST file is marked as "modified" every time you open outlook, and most online backup systems will mark it as needing to be re-backed up each time leading to some absurd bandwidth usage. Theres also all of the issues with potential for a single file to get corrupted, difficulty in repairing it (ever tried to scanpst a 25GB pst?), performance indexing it, etc... all because people cant be bothered to keep their mailbox under control. And of course when you finally decide to move to provider like Outlook.com / exchange / gmail, you discover they have a 25GB limit.

      Source: Been there way too many times with clients. Hoarding is never the answer, and is often the problem.

    25. Re:Do something about your hoarding problem by Anonymous Coward · · Score: 0

      Meh, because he doesn't have 20TB of RAW pictures of his kids at the playground, he has 10TB of movies, 5TB of TV, and 5TB of pr0n. I think the 1TB of actual data would be being generous. Besides, this clearly is a stupid question, which boils down to "how do I backup 20TB for free", even though the volatile storage costs thousands to set up he wants a useful backup of that data cheaply. Cheap, fast, good, pick any two.

    26. Re:Do something about your hoarding problem by prezkennedy.org · · Score: 1

      On yet another hand, exactly how much money do you want to spend archiving and holding onto a high-quality rip of a movie that you're never going to watch again?

      --
      It started back in Team Fortress Classic
    27. Re:Do something about your hoarding problem by Rich0 · · Score: 1

      This is the approach I use, actually, with some variation.

      Group 1 is critical data (documents, etc). It is only a few GB, and I use duplicity to do daily incrementals to the cloud.
      Group 2 is everything (including group 1). It is about 2TB, and I do daily rsnapshots with a few days retained to a local hard drive.

      Regarding multimedia like photos, etc. I put that in group 1 to start. However, once I accumulate a few GB of it I do a one-time backup of it and then move those files to group 2. That keeps my critical files collection manageable so that my full backups of that set don't keep growing over time. If I take a picture it doesn't keep changing over time, so a one-time strategy makes more sense, and there are many ways to do that (burn to DVD, offsite hard drive, cheap cloud service, etc).

      The main benefits of the cloud-based solution to me are that it is offsite, and it doesn't require any media manipulation so I can stick it in my crontab and just occassionally test that it is still working. I don't want to rely on a backup solution that I can potentially forget to run.

    28. Re:Do something about your hoarding problem by Reziac · · Score: 2

      This is why I archive my email into separate bins every 60mb or so (which means about twice a year) -- anything larger becomes unwieldy. It's a nuisance, but not nearly the nuisance of trying to search the file with an external tool because it got munged and won't open in the mail client.

      [This is with SeaMonkey, which still keeps mail as plaintext, thank ghod, so any good text editor can root about in the file at need.]

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    29. Re:Do something about your hoarding problem by psithurism · · Score: 1

      best solution

      Well that's one solution, but the optimal solution varies by person. Assuming laziness is the problem: how much is your time curating your data worth vs. cost to back it all up? For me, it's only worth it I when it start nearing the size of the typical external hard drive, until then, cu-ration is an unnecessary hassle.

      20TB is actually pretty easy to generate. It's not super common for an individual to do so, but a hobbyist video producer could do it easily. I also know a few semi-professionals that generate TBs of data and really have to start calculating future_value/storage_cost and deciding to delete the data is often more punishing to get wrong.

    30. Re:Do something about your hoarding problem by nabsltd · · Score: 1

      Are you bad at math, or have you been in a coma long enough that you'r'e unfamiliar with high definition video?

      OP said "ripped from media and compressed", which is exactly what I do with my Blu-Ray disks. I'm pretty picky about video quality, and I leave sound intact, and I average around 2GB/hour on my 600 movies. I don't expect most people would use a lot more bits per second than I do.

      So, for 25000GB at 2GB/hour with an average movie being 2 hours long, you get around 6000 movies. Pretty easy math.

      You can argue about my assumptions, but even at 4GB/hour (which is insane overkill if you use a good encoder like x264), that's about 3000 movies.

  28. tape by Anonymous Coward · · Score: 0

    Tape, still cheapest per byte when random access is not needed, including backups. Much of the speed disadvantaged can be handed by putting a VTS in front.

  29. Re-rip by Anonymous Coward · · Score: 0

    Re-rip from the original media.

  30. Not 20TB, but.... by Anonymous Coward · · Score: 1

    I have about 7TB. I built 2 RAID devices, and back one up to the other.

  31. Too much data by Anonymous Coward · · Score: 0

    It really depends on how often you plan on backing up, whether automation is required, and what your budget is.

    The most expensive (but most automated and up-to-date) is a second RAID array that's made of cheaper disks and mirrored. Doesn't have to be fast, just made up of cheat 2TB/3TB disks.

    Tape backup is also going to be an expensive solution, but also one that is much more automated.

    The cost of Blu-Ray disks can't really be much compared to the other two options - if it's a media/file server, just do monthly incremental backups and save to Blu-Ray, the majority of the backups won't be substantial in size.

    The best solution might be to tone down the digital packrattery - 20TB is a LOT of space for media that will probably never be played again.

  32. ZFS? by Anonymous Coward · · Score: 1

    Totally not a "backup" solution but raidz2 to protect the data from many types of failures, and hourly snapshots to protect the data from the operator....
    Now if your box catches fire, floods, etc you are in trouble but i agree the problem is not easy to fix.

    You either spend a ton of time (and money) writing blurays, expensive tape soloutions, etc.

    At the end of the day you might find it is cheaper to just have two boxes with seperate raidz2 pools and sync them.

    Heck, you can even use the snapshots to support offline replication where you power up the second box and dump the snapshots across and power it down again.

    1. Re:ZFS? by Anonymous Coward · · Score: 0

      raidz2 does not protect against the idiot at the keyboard deleting the array.

    2. Re:ZFS? by bobbied · · Score: 1

      NOTHING protects the data from the short between the monitor and the keyboard... All you can do is make it take more commands and ask "Are you sure?" a lot.

      --
      "File to fit, pound to insert, paint to match" - Aircraft Maintenance 101
    3. Re:ZFS? by Anonymous Coward · · Score: 0

      You mean something like recovering deleted pools via "zpool import -D"?

    4. Re:ZFS? by Anonymous Coward · · Score: 0

      https://launchpad.net/safe-rm

      I install this when I set up machines for people. 'Some protection' at least.

    5. Re:ZFS? by Wdomburg · · Score: 1

      One word - snapshots.

    6. Re:ZFS? by bobbied · · Score: 1

      The short between the monitor and the keyboard can take care of that too...

      --
      "File to fit, pound to insert, paint to match" - Aircraft Maintenance 101
    7. Re:ZFS? by Wdomburg · · Score: 1

      True, but you need at least a double failure to lose data instead of a single failure. More if you keep multiple generations around.

      No solution will be perfect, of course. Even a theoretically ideal setup with on-line redundancy, regular snapshots and off-site backups can still fail with a sufficiently ingenious snafu. But that doesn't reduce the utility of relatively simple steps to guard against more mundane errors.

    8. Re:ZFS? by bobbied · · Score: 1

      Which was exactly what I posted..

      NOTHING protects the data from the short between the monitor and the keyboard... All you can do is make it take more commands and ask "Are you sure?" a lot.

      What you are doing is putting more steps between the user and total data loss.

      --
      "File to fit, pound to insert, paint to match" - Aircraft Maintenance 101
  33. Crash Plan by Anonymous Coward · · Score: 0

    I've been using Crash Plan (http://www.code42.com/crashplan/) and it's pretty good and relatively cheap ($4-$5/month depending on contract length).

    For one price you get "unlimited" storage. I only have 60GB or so of data that I back up, though. I wonder if they would have an issue with 20TB.

    The big problem would be the initial upload. If you have a bandwidth cap issue, you can rate-limit your upload. You can also send them a seed drive, but that may be pretty tricky with 20TB of data.

    Just a thought.

  34. Good luck. by TheCarp · · Score: 3, Informative

    A quick check at one service which lists such large amounts, you would be looking at almost $20k/year to keep a single offsite copy of that. That is the posted price however, I imagine that is enough that you could shop around and find a deal, but, a deal is still going to be prohibitive for most people.

    At 20 TB I would start thinking about one of two things: Tape, and/or git-annex.

    Unless prices have changed since I last looked and the scales tipped, tape has the advantage of being cheap. Of course, you will need to test your tapes occasionally and likely want 2 copies just in case, but, at that point you are invested in tape, may as well.

    The other possibility is git-annex and lots of drives, but you can mix types. That way you can keep a catalog of your library and information on where it all is, and how many copies of each thing you have.

    Of course, any way you slice it, each physical piece of media is something that can fail so you have to occasionally test to ensure redundancy.

    --
    "I opened my eyes, and everything went dark again"
    1. Re:Good luck. by damn_registrars · · Score: 1

      A quick check at one service which lists such large amounts, you would be looking at almost $20k/year to keep a single offsite copy of that.

      Hopefully that will cause the poster to re-evaluate how much of what he has really is data, and how much really needs to be backed up. Does he really need to back up the entie series of Friends when he has it sitting on DVD already? What about the Rolling Stones box set, which is also sitting on the shelf? Sure, he spent a lot of time ripping all of those discs but does he have $20k worth of stuff to back up every year? You can buy a lot of movies and music for that much money.

      I am willing to wager that on that 20TB array there was not more than 5GB worth of actual data. That can be backed up for almost nothing in comparison.

      --
      Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
    2. Re:Good luck. by hab136 · · Score: 1

      >A quick check at one service which lists such large amounts, you would be looking at almost $20k/year to keep a single offsite copy of tha

      Amazon Glacier would be about $205/year to store 20 TB. A full restore would be like $2,000 though, unless you want to restore 1 GB/month. Still, that's a significant difference from $20k/year.

    3. Re:Good luck. by ebh · · Score: 1

      It also matters why he's backing it up. If it's primarily for disaster recovery, you would use a different strategy from what you'd use to deal with accidental deletion. Snapshots a almost always sufficient for the latter.

    4. Re:Good luck. by Anubis+IV · · Score: 1

      A quick check at one service which lists such large amounts, you would be looking at almost $20k/year to keep a single offsite copy of that.

      Crashplan comes in at $3.96/mo. for 20TB. Restores from your online data are free. You can supply your own key for encryption if you don't want to give them the ability to access your data.

    5. Re:Good luck. by ihtoit · · Score: 1

      my legal file is more than 5GB. In fact, that would just about cover the preliminary court bundle from 2008. Hell, the audio recordings run half a Terabyte.

      --
      Political debates have me rolling my eyes so much I think I got optical whiplash. I should sue. - Foamy The Squirrel
    6. Re:Good luck. by Anonymous Coward · · Score: 0

      tape, definitely tape...
      i've been running tape backups at home for 10+ years.
      Your just need to make a backup strategy.
      backup your generated files / email etc in full each week, daily do incremental
      all the other stuff, do special backup jobs and yes use 2 duplicate tapes
      and depending on the OS you are using, make an image of that at lease once a year.

      the other benefit is you can throw a few tapes into your car, in case your place burns down
      i have 1.2 TB on an array, 20 TB? what is your electric bill like... and the noise of all those drives
      anyone have experience with ZFS?
       

    7. Re:Good luck. by n7ytd · · Score: 1

      >A quick check at one service which lists such large amounts, you would be looking at almost $20k/year to keep a single offsite copy of tha

      Amazon Glacier would be about $205/year to store 20 TB. A full restore would be like $2,000 though, unless you want to restore 1 GB/month. Still, that's a significant difference from $20k/year.

      Storage on Amazon Glacier is $0.01/GB per month, not per year. That's $205/month.

      The problem here sounds like the OP is trying to do this on the cheap. I'm surprised that this question even made it to the front page today. "How do I back up data?" is a solved problem. My local Sam's Club has 20TB worth of storage on the shelf that I could have running by lunchtime. Amazon could have twice that amount on my doorstep tomorrow.

      Reliable backup of this amount of data will cost more than a $5/month "unlimited" Backblaze or Carbonite account.

      Pushing it to a cloud provider is attractive since they would handle storage, disaster planning, electricity, and hardware failures. If they had a 100MB/s connection that they could keep saturated, it would take 3-4 days to push or retrieve the data. This level of service would cost money.

      The OP should tell his friend to stop being a cheapskate. If this data is worth saving, it will cost time, money, and planning to protect. Other posters have pointed out the economics of tape vs. hard drives. If the friend thinks this is too expensive, then the data isn't worth saving.

    8. Re:Good luck. by Anonymous Coward · · Score: 0

      Amazon Glacier charges 1 cent/GB, so it would only cost $200/mo ($2,400/yr) for such a backup. Amazon Glacier also allows multi-threaded uploads and high bandwidth so you can max out your connection. I can backup 369 GB/day on my 75/35 FiOS connection. In contrast, you can ship them a bunch of disks and they'll upload the data from the disk ($80/drive + $2.5/hr loading time).

      However, a restore of the full data set will cost a ton of money as Amazon charges for restores + for the download bandwidth used.

  35. Glacier at $20/mn expensive? by kervin · · Score: 1

    Glacier at $20 per month for 20TB is rediculously cheap by today's standards. And at those sizes, you'd want to ship those drives to Amazon instead of uploading. We do this all the time and it's not that hard.

    The price of TBs of storage of course will come down without question. But by today's standards $20/month for a medium that won't "bit rot" on you is an amazing deal.

    1. Re:Glacier at $20/mn expensive? by Anonymous Coward · · Score: 1

      $200 per months for 20TB.

      Slight but significant difference!

      Plus $120 per TByte if you want to actually restore the data.

    2. Re:Glacier at $20/mn expensive? by devman · · Score: 1

      It would be $200/mn for 20TB, at least at the current advertised price I see of $0.01/GB. Also if you ever actually had to completely restore from backup the data transfer costs from glacier for 20TB of data is pretty savage. I could be missing something, all in all seems like a good deal if your running a business off your data but not too accessible to the home user yet.

    3. Re:Glacier at $20/mn expensive? by Anonymous Coward · · Score: 0

      at $0.01/GB, I calculate $200/month.

      http://aws.amazon.com/glacier/pricing/

      Is there a higher tier price I missed?

    4. Re:Glacier at $20/mn expensive? by jeffmeden · · Score: 3, Informative

      Glacier at $20 per month for 20TB is rediculously cheap by today's standards. And at those sizes, you'd want to ship those drives to Amazon instead of uploading. We do this all the time and it's not that hard.

      The price of TBs of storage of course will come down without question. But by today's standards $20/month for a medium that won't "bit rot" on you is an amazing deal.

      You missed a 0, he has 20,000GB and the cost for glacier is $.01/gb/mo (not including upload charges). So, Glacier would cost him $200 a month or $2400 a year. Not hugely expensive but if you are OK with a quasi-local copy (offline and stored in a fire safe, perhaps) you could do it cheaper for less, after you hit the 1 year mark.

    5. Re:Glacier at $20/mn expensive? by afidel · · Score: 1

      You can easily do it cheaper with tape, $50 for an LTO4 drive, $400 for 20 tapes, and $5/month for a safe deposit box. You can double the number of tapes if you always want to have a copy offsite (vs retrieving the tapes, updating the contents and moving them offsite again) and still come out significantly cheaper.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    6. Re:Glacier at $20/mn expensive? by Anonymous Coward · · Score: 0

      It depends on what failure mode you want to cope with. If the scenario is"my local store has just borked, i need to get 20TB down from glacier by tomorrow" then the data access costs will make it very expensive.

    7. Re:Glacier at $20/mn expensive? by jeffmeden · · Score: 1

      LTO4 tapes do seem to hit a good price/capacity point, but where can you find even a heavily used LTO4 drive for $50? Also, LTO4 (or just about any other drive type) isnt going to just plug in to eSATA or USB3; you need a SAS/SCSI adapter plus a PC capable of hosting it, which balloons the cost.

    8. Re:Glacier at $20/mn expensive? by afidel · · Score: 1, Informative

      You can find a bunch of SAS LTO4 drives on ebay for ~$50-75, and adding a SAS PCIe HBA doesn't cost much more (if you have 20TB I assume you already have a tower).

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    9. Re:Glacier at $20/mn expensive? by amaiman · · Score: 1

      A rough estimate of the cost to completely restore 20TB from Glacier is about $2,300. (Mostly outbound bandwidth charges, but also the restoration fee for restoring more than 5% of data in a month.) You might be able to bring the cost down significantly by using AWS Import/Export, but you'd have to go through S3 first (and deal with shipping drives, etc.)

    10. Re:Glacier at $20/mn expensive? by Anonymous Coward · · Score: 0

      4TB consumer HDDs cost under $200. Build a box with an OS that supports ZFS and you'll have a robust backup system for less than 6 months of Amazon Glacier.

    11. Re:Glacier at $20/mn expensive? by Anonymous Coward · · Score: 0

      Plus the restore costs for Glacier are insanely high, or by careful arrangement you can avoid this cost if you live long enough to trickle feed it back down .

    12. Re:Glacier at $20/mn expensive? by Anonymous Coward · · Score: 0

      You can get 4x4TB + 6x2TB = 28TB into a HP microserver, so not necessarily.

    13. Re:Glacier at $20/mn expensive? by Rich0 · · Score: 1

      Plus $120 per TByte if you want to actually restore the data.

      The cost to restore data from Glacier is actually fairly complex to calculate. It is about $105/TB for data transfer which you can't avoid at all and that is constant regardless of how fast you download it (it would be cheaper to have them mail you hard drives - $80 each plus a bit more for hourly loading fees). Then there is the retrieval cost, which varies SIGNIFICANTLY based on the speed of retrieval. If you restored that 20TB evenly over the span of 4 months it would cost you about $50 in retrieval costs. If you restored it over the span of 2 days it would cost you $3000 in retrieval costs (on top of the $2k for transfer).

      I use Glacier for backups, but only in addition to local backups. My Glacier use case is that the house has burned down, and having to pay an extra $100 to restore my backup really isn't a big deal in that case. Chances are it would take days to have a new PC up and running so I could probably cut down the costs considerably by spreading out my restore requests.

    14. Re:Glacier at $20/mn expensive? by jeffmeden · · Score: 1

      You can find a bunch of SAS LTO4 drives on ebay for ~$50-75, and adding a SAS PCIe HBA doesn't cost much more (if you have 20TB I assume you already have a tower).

      You wont find any auctions that have ended in the $50-75 range, the best i can locate in completed auctions that look remotely legit is $200. Still not a huge cost considering whats involved, but not trivial either. For the 20TB point, the LTO4 option ($500 out the door) seems to be about half as expensive as the cheapest raid cage+spinners option ($1000 out the door) and you still have to juggle 20 backup tapes vs just plug it in, let it sync, and then return it to the vault.

  36. How much is really data? by damn_registrars · · Score: 0

    It sounds like your friend had 20TB of movies and music that he might - or might not - have had legitimate copies of elsewhere. I'm not defending the RIAA or anything here but it sounds to me that all your friend is out is time. If he had any legitimate data - of his own creation - he should have already been backing it up somewhere else.

    --
    Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
    1. Re:How much is really data? by sandytaru · · Score: 1

      This was a hard lesson for me to learn, but a worthwhile one. Things that are unique - stuff I've created, programs, stories, resume, art I've scanned, pictures - those things needs to be backed up. (I use DropBox for my text based stuff and have shifted pictures through media over the years Floppies to a ZIP disk to a CD-R to a DVD-R -- next stop will be a Blue-Ray-R one of these years, most likely.)

      13 GB from a pirated copy of a TV series does not need to be backed up. Odds are you can either watch it again on-demand from a streaming site, or purchase a legit copy of the series on DVD for $20 if Netflix or Hulu have failed you.

      --
      Occasionally living proof of the Ballmer peak.
    2. Re:How much is really data? by Archangel+Michael · · Score: 1

      Or you can download another copy from Torrent (since you're already a thief) whenever you want to watch it.

      --
      Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
  37. Replacate to a NAS by Anonymous Coward · · Score: 0

    Any good 8 bay NAS with 4TB drives
    Raid 6 on a separate GIGABIT sub-net in a separate location in the house with battery backup.
    Qnap, Synology or Thecus all make models that should do.
    I personally would go with a Qnap ts-870-pro and HGST drives.

  38. Don't hoard by rainer_d · · Score: 4, Insightful
    Were those 20T of original movies and music or just stuff he downloaded via bittorent?

    He could have always bought a sufficiently large tape-library from ebay - but I guess the data wasn't worth that much.
    That's always the first pair of questions to ask: how much is it worth and how much would it cost to recreate?
    If the answer is somewhere between "I don't know" and "Well, it's not that much", then he just should stop hoarding that much stuff.

    He could have built a filer with ZFS and sent daily snapshots to a 2nd filer - but that wouldn't have helped him if the house burnt down...

    --
    Windows 2000 - from the guys who brought us edlin
    1. Re: Don't hoard by Anonymous Coward · · Score: 0

      I don't think the poster was asking for recommendations on which files to keep. The question was limited to which backup solution would be appropriate.

      Regarding ZFS, that would not save him from a ZFS bug that takes down the whole array either. Been there, done that, scrapped ZFS when I saw suns answer at the time was "Well, there's no fsck because ZFS is so SUPER DUPER RELIABLE IT NEVER FAILS. But if it does, which it totally never does, you should restore from your tape backup."

    2. Re: Don't hoard by SirMasterboy · · Score: 1

      So what super duper awesome filesystem do you use now that can never have a bug where you lose your data? Or what do you do about theft? fire? PSU taking out your disks?

      You need a backup of any data you intend to keep whether you use ZFS or any other data storage system anyways.

    3. Re:Don't hoard by Anonymous Coward · · Score: 0

      I have a similar issue, and I can see what may have happened. I have a few hundred movies/TVseries on DVD/bluray. I made the decision to rip them all and go with a Plex solution. All the data is recoverable (I've got the discs, obviously)... but I sure as fuck don't want to take the time to re-rip everything. :/

  39. I would say get a life by Anonymous Coward · · Score: 0

    Who the hell even listens or watches to that much freely downloaded content anyways? Seriously, you're nothing more than a digital hoarder who gets emotionally attached to data.

  40. Build another server by EmagGeek · · Score: 2, Interesting

    If you want to back up 20TB of data, you have to pay for it.

    Build another server and rsync hourly.

    1. Re:Build another server by hodet · · Score: 1

      That sounds more like high availability and less like backup. Unless he throws the server in a station wagon every once and while and brings it to his parents place.

      But I agree at 20TB there is no "cheap" option. I think he needs to suck it up and buy a tape backup system and store a copy offsite, it all depends how much his content is worth to him. Not much of issue since he doesn't have the 20TB anymore. But like any good horder he will probably be back to that in no time.

      I have a family member who is a digital horder. It's a site to see.

    2. Re:Build another server by CKW · · Score: 1

      > rsync hourly.

      Noooo no no no no. That's a good way of rsync'ing an empty mount point to your target and wiping all your backup data automatically.

      If you want to rsync frequently, make sure you are using --link-dest or something to multiple targets. Thus the day that you have a main array failure and you also discover that last week's rsync was "empty", you've only lost a differential and the prior month's copy is still there.

    3. Re:Build another server by Anonymous Coward · · Score: 0

      Rsync will only delete files from the destination that don't exist on the source if you specify the --delete flag.

      Don't specify the --delete flag, and rsync will not delete files from the destination that have been deleted on the source.

      Rsyncing with -a alone should be sufficient. I use -av --progress just so I can see what's going on at a given time.

    4. Re:Build another server by Anonymous Coward · · Score: 0

      Great if you accidently delete the files, your f***up gets propogated automaticly. Implementing an incremental backup system with hardlinks is a better option.

  41. Backup = More Disks by Anonymous Coward · · Score: 0

    If the data cost more than $1200-1800 to replace it, it's worth paying to have a backup.

    Amazon Glacier is $0.01 per GB, or $205/month for 20TB. Clearly that becomes uneconomical fast.

    A 20TB backup can be had in six 4TB drives via RAID 5 (don't want a backup drive failing, do we?).
    A Drobo B800FS* is small enough (14x12x5) to be easily stored offsite and is a self-contained network appliance.
    Since it has 8 bays total, there's also room for expansion to 28TB.

    Total cost: 6*150 + 850 = $1750.

    *You can shave off about $500 by using an off-brand RAID enclosure, but I think the software and support for the Drobo is worth it.

    1. Re:Backup = More Disks by blackjackshellac · · Score: 1

      And then his house burns down.

      --
      Salut,

      Jacques

  42. My solution by StripedCow · · Score: 5, Funny

    Figure out the theory of everything.
    Then you can always recompute your data from scratch.

    --
    If Pandora's box is destined to be opened, *I* want to be the one to open it.
  43. 2nd Array or Tape by Grave · · Score: 2

    With a second array, or tape backup. The second array is going to be the easiest solution, but tape backup provides you the option of storing the tapes off-site, which is important for any real backup plan. After all, your friend could just as easily wipe out the 2nd array by mistake, or a disaster could wipe out the physical location. LTO-6 tapes are cheap and can hold 2.5-6.5TB of data depending on compression. Tape drives are perfect for backup, so why even ask if it's right?

    1. Re:2nd Array or Tape by jeffmeden · · Score: 1

      After all, your friend could just as easily wipe out the 2nd array by mistake

      This is exactly the core issue to protecting his data... I would propose what he really needs is a nuclear launch key style interlock that only lets him access his backup array when both keys are present, and give it to a smart friend (along with a shovel) tasked with looking over his shoulder when the key is in use, ready to hit him over the head if he keys in a command involving dd, fdisk, rm -rf, etc.

    2. Re:2nd Array or Tape by gbjbaanb · · Score: 1

      exactly, tape is the solution for backup problems. Unless you're dealing with mickey mouse storage problems, tape is what you want.

      I mean, get a SL8500 - only 2.1 Petabytes of tape backup space!

      Of course its one of those boring-but-works solutions so nobody cares about it, and none of the slashdot kiddies can think of using it, not when there's unlimited cloud and additional raid array buzzwords to be spouted!

      And remember, a backup is not a backup unless its stored somewhere away from the fiery, drippy, bangy or collapsey disaster that destroyed your main storage.

    3. Re:2nd Array or Tape by DriveDog · · Score: 1

      $2000 + $60/tape isn't what I'd call cheap for home use. Or is that stuff available for a LOT less? Neither is the $20/month from an earlier post. Now $5/month for Backblaze is sounding ballpark-ish...

      And just when did floppies get down to 8"? 14" is the way to go. Either that or boxes of cards. As long as they're protected from moisture, you can use them to insulate your walls. If things get really desperate, you'll have fuel for the fire. Not just 80 but 132 characters per card!

    4. Re:2nd Array or Tape by meustrus · · Score: 1

      You know it probably costs more than building another 20TB RAID when the web site says "Contact Sales Assistance on +1-888-672-2534 to quote or purchase this product".

      --
      I sometimes ask revealing, often ignorant-seeming questions. Maybe they're harder to answer than you think.
    5. Re:2nd Array or Tape by Jeremy+Erwin · · Score: 1
    6. Re:2nd Array or Tape by nabsltd · · Score: 1

      I mean, get a SL8500 - only 2.1 Petabytes of tape backup space!

      You misread a comma as a decimal. With filled-out slots, the library can hold 2,100PB, or 2.1 exabyte. Or, the equivalent of over half a million 4TB hard drives.

      >We have two of these at work, and with just 3000 slots each and not the biggest tape drives, they can each store around 15PB.

    7. Re:2nd Array or Tape by gbjbaanb · · Score: 1

      Its by Oracle... you didn't need to go further than the URL to know its gonna be expensive!

    8. Re:2nd Array or Tape by Man+Eating+Duck · · Score: 1

      $2000 + $60/tape isn't what I'd call cheap for home use. Or is that stuff available for a LOT less?

      Well, it is available used for somewhat less, but 20TB isn't exactly your average home storage system either. I would ask him *why* he needed to keep all that data online, including backing it up. If the RAID itself contains a copy of his physical media library (or the bay), well, there's your backup already :)

      He should balance the need to keep it all instantly available against the cost of doing so. The reason why there are no cheap solutions to backing up 20TB in a home scenario is that very few people do it. If it were me I'd just get a bunch of cheap drives and a hot-swap bay, and just create a script that catalogued the content of each drive. If it was irreplaceable content I created/shot myself I would invest in a tape solution, in that case it isn't *that* expensive. Crashplan works well for me to keep onsite/offsite backups of videos and pictures of my kids, but if I needed so much space that recovery time became a concern I would shell out for some disks or a tape library (it won't, however).

      He could be something like a freelance video producer, in which case it might be a legitimate need, but then he would be insane to not already have a backup solution in place. Also, that would remove him from the home user category. For instance, rip-on-demand is likely the cheapest strategy storage-wise for using your library on your media center, and you probably won't end up with 20TB on a RAID :)

      --
      Are you a grammar Nazi? I'm trying to improve my English; please correct my errors! :)
  44. 20? by Anonymous Coward · · Score: 0

    That's not very much data I have at least 75TB worth of hard drive space and it just keeps getting cheaper and cheaper. To be fair though I run a repair shop lol so I get a lot of extras people don't want or want to trade ect.

    http://www.amazon.com/s/ref=sr_nr_p_n_feature_five_bro_0?rh=n%3A172282%2Cn%3A!493964%2Cn%3A541966%2Cn%3A1292110011%2Cn%3A595048%2Cp_n_feature_two_browse-bin%3A5446812011|5446813011|7817230011%2Cp_n_feature_five_browse-bin%3A2419644011&bbn=595048&sort=price-asc-rank&ie=UTF8&qid=1394631960&rnid=2419643011

    See what I mean.

  45. Rsync and Bluray and maybe dedup by Supp0rtLinux · · Score: 2

    As you noted, Bluray holds a lot of data, but would take some time. Since its audio/video media, odds are most of it is pretty stagnant. I'd do an initial rsync job to write out to Bluray... then once a month or so repeat the job but now rsync will only get what's changed. Depending on the media type and age, you could also look at dedup'ing it and if the dedup'd copy is significantly smaller than the source you might be able to put that onto say one or two 3-4Tb drives.

    1. Re:Rsync and Bluray and maybe dedup by Supp0rtLinux · · Score: 1

      Oh, and for what its worth... yes, you can do rsync on Windows. I'm not assuming he has Mac, Linux, or UNIX. I've been doing rsync on the PC for years with cygwin... these days there are many more options and ports of rsync for Windows... even 64bit versions, etc. But my scripts are written around cygwin and expected variables and such, so I still do it the *old school* way. :) Just wanted to point this out though as inevitably someone will cry about rsync and Windows...

    2. Re:Rsync and Bluray and maybe dedup by Reziac · · Score: 1

      Last I heard, BluRay-R discs have a shelf life of about 6 months before they start losing data. Has this problem been solved??

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    3. Re:Rsync and Bluray and maybe dedup by Supp0rtLinux · · Score: 1

      I've never had this problem. And since data is data is bits is bits... if this was an issue, you would think BR movies would have the same issue? I've been backing up to BR for about 3 years now and have had to go back to older media for restores and never had an issue. Like all media, your environment can affect them. I wouldn't let them get too hot or too cold, no direct sunlight, etc...

    4. Re:Rsync and Bluray and maybe dedup by Reziac · · Score: 1

      Are commercial BRs made like commercial DVDs, or are they more like burned disks? Cuz they're not the same.

      And longevity probably varies by media brand, too, much as it does CDRs. (Which brand are you using?)

      Storage conditions, of course, are a definite problem for any archival medium, other than maybe flash drives. (Which are how I backed up a lot of my stuff last time around, cuz they're far more durable in the face of environmental misadventure.)

      --
      ~REZ~ #43301. Who'd fake being me anyway?
  46. Only back up important stuff. by Anonymous Coward · · Score: 0

    Music is ephemeral. Use streamtuner. There is no need to save copies of muzak.

  47. Unlimited cheap by Anonymous Coward · · Score: 0

    Bitcasa

  48. NAS4Free by Anonymous Coward · · Score: 1

    I use ZFS on NAS4Free at home and have two 48TB arrays, the second array is at a neighbors house, I am using mikrotik SXT PTP links in trade for him keeping my secondary server at his house, he gets internet and access to the movie storage/backup array. With ZFS I am not worried about a RAID failure as I just had a controller card fail and kill two drives on each of my pool. I didn't have any problems rebuilding the array and had I, I would have just pulled from the backup server. Also, with ZFS you get RaidZ-2 along with snapshots, which has been very handy at client locations to be able to save deleted documents(more than once from a disgruntled employee) also all of our machines backup to a backup area the kids have more than once gotten malware and restoring from a snapshot is easy!

    Cheers!

  49. LaCie 5Big for the Thunderbolt by Anonymous Coward · · Score: 0

    I have a LaCie 5Big for my Mac. Using RAID0, it gives me a full 20TB of storage. I'm using it with two drives in RAID1 configuration, and three drives in RAID0, giving me a total of 16TB mounted as two volumes. Cost was just under $2k US.

  50. Simple by Anonymous Coward · · Score: 0

    Delete the porn you don't actually watch.

  51. Here's an idea for preventing his accident by Anonymous Coward · · Score: 0

    Don't RAID the storage. Use (and connect) drives individually and keep a sane index in order to quickly search and access the proper drive. Yes, it's a bit more hassle, but so is losing 20TB in a RAID accident. I personally don't even see why one wouldn't consider the problems and dangers with RAID of this scale instead of just being a bit more careful, but then again I'm the pragmatic type, not the over-engineering nerd type.

    1. Re:Here's an idea for preventing his accident by DanZee · · Score: 0

      I think this is the best answer. I keep a Google spreadsheet with the directory names in columns. If I need an "essential program" I do a search and it tells me which drive the essential program directory is on. I attach the drive to an external USB case and I have what I'm loosing for.

  52. failsafe**2 by MissNoItAll · · Score: 1

    Ironic since from your description it would appear the RAID architecture served sufficiently well here (as it should have). It would appear you are seeking a solution to operator error, not equipment failure or other acts of God. Good luck.

  53. Why back it up at all... ask the NSA for a restore by Supp0rtLinux · · Score: 4, Funny

    You could always just call up the NSA and ask them to restore the data. Odds are good they have a copy of it...

  54. one bit at a time by Cmdr-Absurd · · Score: 2

    Same as always.

    1. Re:one bit at a time by Anonymous Coward · · Score: 0

      That would take forever!

  55. Just like you'd eat a whale... by kyldere · · Score: 1

    One byte at a time.

  56. Plan for backup before you buy by Ktistec+Machine · · Score: 5, Insightful

    Whenever you buy storage, you should buy the necessary backup capacity at the same time. You should never buy storage without buying backup capacity. Budget for it right from the start. If you can't afford the backup, you can't afford the storage. This may mean getting half as much storage as you'd like, but that's just the way it has to be. You probably wouldn't buy a car without an engine. It wouldn't do its job. So don't buy storage without backup. If you do, you have a storage system that can't do its job.

    1. Re:Plan for backup before you buy by matt-fu · · Score: 1

      As cheap as disk is nowadays, there is no excuse to not do this. Especially since RAID5 starts to become problematic with this amount of data.

    2. Re:Plan for backup before you buy by Anonymous Coward · · Score: 0

      Depends on the job. If the car is just going down hill, it'd be fine. If the storage is just for stuff you're prepared to lose, it'd also be fine. So, while I know we're talking about a specific 20TB situation where the user didn't want to lose the data, to suggest that it's always bad to buy storage without a backup isn't the case.

    3. Re:Plan for backup before you buy by Zerohm · · Score: 1

      Good advice, but this person already had the drives protected with RAID. They accidentally deleted all of their files. I believe that FreeNAS and ZFS could handle 20TB fairly easily and provide snapshots to recover if you were to accidentally delete files.

    4. Re:Plan for backup before you buy by Anonymous Coward · · Score: 0

      You probably wouldn't buy a car without an engine.

      Not quite. It's more like "you wouldn't buy a car without a garage to put it in" which is, of course, not true. The garage offers security and a reliable place to store your car; without it you're subject to risk of vandals and theft, but in reality most people get along without a garage just fine. There's stories of people having their car's radio (or whatever) stolen - you know that it could happen to you too, but it's unlikely so there's no real push for you to get a garage. And then it happens to you, and you're surprised and a bit upset and you start thinking "damn I wish I had parked my car in a garage".

      Except the submitter has, like, three cars to deal with.

    5. Re:Plan for backup before you buy by Anonymous Coward · · Score: 0

      You probably wouldn't buy a car without an engine. It wouldn't do its job. So don't buy storage without backup. If you do, you have a storage system that can't do its job.

      That's gotta be the worst car analogy I've ever heard.

      The analogy you're trying to make -- which makes no sense -- is that you wouldn't buy a car without buying another one at the same time as a backup. So in the same way, you should always buy backup hard drives.

      The problem is that cars solve a very different problem than what hard drives solve. Perhaps using an analogy in the first place was just a bad idea.

    6. Re:Plan for backup before you buy by darrylo · · Score: 1

      This.

      Also, given the long rebuild times with 2TB drives and larger, one should be using raid6, raidz2 (raidz3?), or mirroring. With the large disk sizes, another disk error can be fairly likely during a rebuild.

    7. Re:Plan for backup before you buy by Anonymous Coward · · Score: 0

      Your analogy fails because it does do it's job right up until it fails.

      And there are many examples of storage systems which don't need backups. A web cache, web server or anything operating with cacheable or replica data doesn't need a backup. If I build a cluster of 500 web servers, with 250GB disks in each, I'm not going to plan for 500x250GB of backups, because each web server is going to have nothing on it that can't be recreated from the master.

      A DVR is another example, sure I could put a backup system on my DVR, but the added cost is not worth the value. Let's say my DVR HDD is expected to fail once every 3 years (HDDs are in practice more reliable than this), that means I might miss a few days TV once every 3 years, I'm not going to double the cost of my DVR to have a backup for that.

      Then there's all sorts of intermediate storage that can't be backed up due to it's nature. I don't have a backup of the SD card in my camera. If I lose my camera or the SD card fails, I lose all the photos I took since the last time I synced it. I can never get those back, but there's no practical backup solution for the gap between when I take the photos and when I sync them to my file server.

      I have a bunch of OpenBSD servers running as routers. I keep backups of the server config and have deployment scripts for building new ones. The routers have various size HDDs in them up to 250GB, but I don't have backup capacity for them, Why? Because I can regenerate the system images with my deployment scripts and the config files are only a few kb, and replicated in enough places (and backed up to tape).

      Your argument is plain stupid. There are probably more cases where storage is used where a backup is not needed than there are cases where a backup is required.

    8. Re:Plan for backup before you buy by sprint907 · · Score: 0

      When you buy a car, do you buy an additional car, as backup?

  57. Re:How Do You Backup 20TB of Data? by Icarium · · Score: 1

    AC, please point us mere mortals in the direction in which me may find these DVDs with a storage capacity of 4TB...

  58. Raspberrypi as transport to another array? by AidanApWord · · Score: 0

    I would consider: http://www.makeuseof.com/tag/b... ... and then plug a big array into the pi. Then host that in someone else's home (someone I trust of course!).

  59. I agree but... by matteo.defelice · · Score: 5, Informative

    I agree, I've been using Crashplan for three years and the unlimited space it's really great BUT... ...I'm not sure about the bandwidth they provide: how long it will take to upload 20 TB? Anyway, I don't see what's the problem in using external drives for backup. Here in my lab I've realized that the best way to backup X Terabytes is to have another storage with X Terabytes...

    1. Re:I agree but... by Anonymous Coward · · Score: 1

      I came on here to recommend CrashPlan too. You can upload 20TB in the most high bandwidth, high latency way possible - via a truck carrying a boat load of hard discs. The offer what's called a "Round Trip" service, where they send you a drive (or drives, IIRC, in this case) and you locally back up to it to "seed" your backup. Then you send it back, and sync up from there. Worked pretty well for us at work.

      For anyone who doesn't like CrashPlan, there's also BackBlaze, who from my previous research seems to be similar. There's also Carbonite, but I've heard bad things about their throttling over a particular data threshold.

    2. Re:I agree but... by TMYates · · Score: 1

      I use CrashPlan at home for my 18+TB Home Brew SAN. I have 1 VM running as my file server that has about 10TB of the total storage assigned. I switched from Carbonite to CrashPlan because CrashPlan does not limit bandwidth like Carbonite does (2Mbps for first 200GB and then 200Kbps thereafter). It would have taken me more than 11 years to backup 4TB to Carbonite, but with CrashPlan and my 35Mbps Upload, it took me about 1-2 months for the initial backup of 10TB. Now it only syncs the updated files and has no trouble keeping up. I should also add that you can use CrashPlan to backup to a friend or external hard drive from the same app for Free as well. As for the paid subscription, Business is $7 per PC for unlimited storage and home is like $7 per PC and $14 for a family plan of up to 10. Home edition does not limit installs to non-server systems (as I use mine for personal use). * I am not a paid representative of CrashPlan. Statements are based on personal opinion or experience.

    3. Re:I agree but... by menos · · Score: 1

      It looks like you can only seed the first 1TB though.

    4. Re:I agree but... by Anonymous Coward · · Score: 0

      This!

      I have an 8TB raid array and I am currently just backing up the important files on it using BT Sync between my office server and my laptop. This data is around 100GB.

      You could host your own hardware in a data centre (Co Location), and use either BT Sync or just plane ol' RSYNC to transfer the deltas offsite.

      Hosting it yourself, means you can copy the data locally and then shift the server to its final resting place saving on off-site data transfer.

      What platform is the data on? Windows, Mac or Linux?

    5. Re:I agree but... by Anonymous Coward · · Score: 0

      Exactly! You need another storage array, that is capable of taking regular snapshots. You sync your data to it nightly, and configure the snapshots to run at whatever intervals and retentions you choose.

      20TB backup is disk to disk. Whether the other disks are in someone else's datacenter or not doesn't change that fact.

      ****Unless you REALLY want to spend money, and get an LTO-5 (1.5TB native, 3.0TB compressed per tape) Robotic Tape library, and something like BakBone to run it.

  60. Re:How Do You Backup 20TB of Data? by Anonymous Coward · · Score: 0

    That would be more like 5000 DVDs.

  61. what he lost was bandwidth by Anonymous Coward · · Score: 0

    ok, but it's not like he lost anything he owned. I think what he's worried about is the 40TB of bandwidth (assuming 1 up 1 down sharing) required to replenish that.

  62. I second that! by Anonymous Coward · · Score: 0

    I'd like to plug BackBlaze. I've been using it for a while now and it's fantastic. It saved me from going down the whole RAID NAS / DAS route which isn't really backup because of fire / theft.

    Here's my brief blog post on why I chose it...

    http://www.bentristem.com/1/post/2014/03/offsite-backup-thats-easy-to-setup-geek-proof-and-secure.html

  63. Tape storage! Cheap. Effective. by Anonymous Coward · · Score: 0

    You can even make copies and leave them in a safety deposit box! Of course the real question is what data that was lost can be replaced (personal collections of movies can always be "found" again). Perhaps the backup plan for 20TB is unnecessary when smaller storage could do.

  64. Nothing will save you from your own carelessness by mlwmohawk · · Score: 0

    If you set up a RAID6 system and keep tabs on it, replace drives as they come and go, then you'll probably be OK unless you do something stupid, or have a fire, flood, or what ever. The ONLY way to really have backup of important information is geographically separate redundant copies.

  65. My only solution... by Anonymous Coward · · Score: 0

    Google Mail offers 15GB of storage space.

    So hjsplit everything into 25MB chunks and upload it.

    You'll need 1366 different accounts, though.

  66. Thinking ahead. by wlj · · Score: 1

    From my own (painful) experience: if you don't plan for it up front, you are always fighting fires (playing catch-up). Organizing your data can help a LOT! If it is media, arrange it by genre (e.g. video animation or video classical or whatever) to keep a particular grouping small enough to backup easily. If it is data, arrange by some category that works for you (e.g. current financial projects or past analytic projects).

    The most useful guide I have found for resources allocated to backup: how much is it worth to me to re-create this resource? ("Worth" can be money, time, sentiment, or any other measure(s) or combination you chose.)

    My current feelings: disk is the most versatile and cost effective.

  67. In business school, I learned... by DoofusOfDeath · · Score: 1

    Obviously you get 5,000 women pregnant, and ask each one of them to backup just one DVD-R!

  68. Hilarious by jeffmeden · · Score: 5, Insightful

    It's not like you could just plug in an external drive, and using any cloud service would be terribly expensive. Blu-Ray discs can hold a lot of data, but that's a lot of time (and money) spent burning discs that you likely will never need. Tape drives are another possibility, but are they right for this kind of problem? I don' t know. There might be something else out there, but I still have no feasible solution.

    Lets start from the top: You *can* plug in an external drive, it's called a complete hardware duplicate of your array (or perhaps for space/cost consideration, a single disk based copy held offline and synced regularly). Not hard and not terribly expensive (i would go with this solution personally). Cloud? Yep the bandwidth and storage even on something like Amazon Glacier would be prohibitive to all but the most financially independent geeks. Bluray doesnt hold enough (even at 50gb/disc you need 400 of them, groan). So, tapes? You bet your ass tapes are designed to do exactly this task, why do you think they are still in use? You can get individual tapes at 1/1.5TB, but for a one man operation they are probably going to cost you more than the first solution (offline spinning disks) and they are a pain to manage properly.

    Now what is this doing on ask slashdot? A pencil, some scratch paper, and 15 minutes between amazon.com and newegg.com would tell you the prices of every solution. Oh, right, they need a chance to tee up some targeted ads for Carbonite, Mozy, Crashplan, etc.

    1. Re:Hilarious by Anonymous Coward · · Score: 0

      I am not sure why tape is still being used. The reliability of tape has been very low when I used them in the past. Harddisks have a very low price/data compared to tape as well. You can access disks non-sequentially which makes restores less painful. Disks include their own device to read/write to it, compared to the very expensive and unreliable tape drives.

      There are even disk robots, or easier just keep all the disks semi-online. There is this company who designed 3 unit rackmount cases filled to the brim with disks. Most those disks can simple stay idle.

    2. Re:Hilarious by geekoid · · Score: 0

      You are falling into the 'I know it so everyone knows it' fallacy.

      A wide range of age groups come here. from 12 and up, so keep that in mind.

      " Yep the bandwidth and storage even on something like Amazon Glacier would be prohibitive to all but the most financially independent geeks."
      120 a year is prohibitively expensive? Not for AG, natch, but there are solutions for that, and bandwidth is time. You can do it incrementally. Once done the future dif would be a lot easier.

      EVERY article is a chance for targeted ads, it's how they keep running, so get over it already.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    3. Re:Hilarious by Anne+Thwacks · · Score: 1
      The reliability of tape has been very low when I used them in the past.

      You might want to find out why. You probably are not doing it right. Tape may not be perfect, but its better than anything else long term for high capacity (Paper is better for ruggedness and lifetime, but wont work for 20TB).

      --
      Sent from my ASR33 using ASCII
    4. Re:Hilarious by Kjella · · Score: 1

      Lets start from the top: You *can* plug in an external drive, it's called a complete hardware duplicate of your array (or perhaps for space/cost consideration, a single disk based copy held offline and synced regularly). Not hard and not terribly expensive (i would go with this solution personally).

      Exactly. I very much doubt he has 20TB of data that changes very often, plug in a 4TB external drive, fill it to the brim, take note of what you've backed up. This is how I used to do it with CD-R/DVD-Rs back in the day, I had one "incoming" directory and one "archived" directory. When I had enough to burn a full disc I'd burn it and move it. Keep a text file saying what's on the different discs. It's very very low tech but it's simple and it works. Unless he's for some reason keeping a 20TB database at home with random changes everywhere.

      --
      Live today, because you never know what tomorrow brings
    5. Re:Hilarious by Solandri · · Score: 1

      You *can* plug in an external drive, it's called a complete hardware duplicate of your array

      You don't even need to duplicate the array hardware. 20 TB is still within the range of a RAID enclosure. Basically a box where you plug in drives, and it gloms them together into a RAID 0/1/5/10. You then plug it into your computer with eSATA or USB 3, and your computer sees it as a single drive up to 32 TB in size (8x4TB with current drives). Do your backup, then unplug it and turn it off until your next scheduled backup.

      Incidentally, TFS is a perfect example of why RAID is not a backup. Yes RAID is kinda like storing two copies. But the "two copies" are linked both virtually and physically. If you accidentally delete one copy, the other gets deleted simultaneously. If you physically damage the array, you lose both copies.

      RAID is not a backup. RAID is for redundancy. If your business will lose hundreds of dollars per hour or more while the file server is down, you put the file server on a RAID array so it will continue to operate even if there's a single (or double) drive failure. i.e. It only protects against your file server going offline. You still need a separate backup.

    6. Re:Hilarious by Gothmolly · · Score: 1

      Slashdot has ads?

      --
      I want to delete my account but Slashdot doesn't allow it.
  69. Backup only the best one by jones_supa · · Score: 2

    How about backing up only the crown jewels of the collection?

    Make a directory like /entertainment/premium and put the best stuff there, with a 4 TB limit. Rotate two external 4 TB HDDs and copy the stuff over periodically. Put a little sticker or some other mark on the newest, so you remember which one it is. If your main RAID array fails, build a new one, and restore the premium stuff from the most recent one of the two external disks.

    1. Re:Backup only the best one by geekoid · · Score: 2

      Why would you only back up the nut shots from his porn collection?

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    2. Re:Backup only the best one by MightyYar · · Score: 1

      With the right setup, you could leave everything in the master directory and just add symbolic links to a backup directory. That way the backup solution won't get in the way of the setup. You could even write a little utility that automatically backs stuff up that you rank highly with whatever program you are viewing with. Back when CDs were expensive, I would only backup my "3-stars and over" playlist in iTunes.

      --
      W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
    3. Re:Backup only the best one by Anonymous Coward · · Score: 0

      "Make a directory like /entertainment/premium and put the best stuff there..."

      This is also what I was going to say, I spread out my data into very important to could care less. The really important stuff like tax data and family photos is usually sub terabyte and I make sure I have lots of copies of it. The other stuff I back up at my leisure and if I really really lost all of it I could probably recover from a variety of places. Don't forget offsite storage ie. your desk at work. Also, think about m-disc. I'm thinking about testing it and burning a few directories starting with my oldest and putting them into a deposit box at the bank.

    4. Re:Backup only the best one by aaarrrgggh · · Score: 1

      The easiest approach for 20TB is to properly partition it into static and dynamic groups of data that easily fit on one drive. Get two drives for each static group of data, and three for dynamic data, and rotate a current copy.

      Most people have a lot of bulk archival data that they need a backup of, not a business solution that allows for every file to change constantly. Simplify the problem and the solution becomes pretty simple as well.

      Now, if you really have 20TB of dynamic data that cycles on less than a 30-120 day basis, buy a frigging tape system.

    5. Re:Backup only the best one by Anonymous Coward · · Score: 0

      By 'crown jewels', I assume you mean items that are no longer being reliably seeded?

    6. Re:Backup only the best one by Anonymous Coward · · Score: 0

      >Put a little sticker or some other mark on the newest, so you remember which one it is.
      A nice fridge magnet for example!

  70. Duplicity. by durin · · Score: 1

    What, noone mentioned duplicity yet?
    http://duplicity.nongnu.org/

    --
    Why, yes! I AM new here.
    1. Re:Duplicity. by Anonymous Coward · · Score: 0

      I looked into this for my own offsite backup to a friend's computer... chose not to use it. There were a few problems:

      -Incremental backup format was very awkward, and recovery got slower over time. It was recommended to do a full backup every X weeks, but that was infeasible in my case due to connection speeds
      -Required the use of the same tool to make sense of the data

      Instead, I decided to go the route of backing up an encfs source folder (i.e. the encrypted part) just using vanilla rsync. The files are available on the other side without any special commands, rsync makes for fast backups, and there is not a need for full backups on a regular basis. An external drive is all that is required for the initial sync.

      I don't see the need for added complexity and lock-in products (even those which are OSS) when simple tools suffice.

    2. Re:Duplicity. by Mike_K · · Score: 1

      I prefer duplicati:

      http://www.duplicati.com/

    3. Re:Duplicity. by durin · · Score: 1

      Sounds nice. I'll have to try that.

      --
      Why, yes! I AM new here.
    4. Re:Duplicity. by durin · · Score: 1

      I'm sure there must be something I'm missing in your reasoning.
      How do you make sure that the files on the "other side" are not read by anyone? Sounds like you just transfer the files (presumably using rsync over ssh) and leave them plain on the remote server. With duplicity, the encryption and decryption is all handled on the client side.

      --
      Why, yes! I AM new here.
  71. 10TB free cloud storage ^.- by Anonymous Coward · · Score: 0

    You can get 10, yes 10TB of free cloud storage from a Chinese organisation

    http://www.weiyun.com/

    Create two accounts, job done.

    It's quite a complicated signup - but there are guides. You need to create a login on the QQ site and activate it with a smartphone app - something like that.

    You'll have the Chinese authorities looking at your data instead of the NSA!

  72. Re:How Do You Backup 20TB of Data? by Anonymous Coward · · Score: 0
  73. Ah, "unlimited"... right. (*cough*) by stoploss · · Score: 5, Informative

    These "unlimited" claims always turn out to be lies. When will we learn?

    My friend paid for an "unlimited" account from JustCloud for backup. He stored 1.8 TB on it and then they "fair use"'d his ass and canceled his account. They didn't even give him a refund for the rest of the money he prepaid.

    1. Re:Ah, "unlimited"... right. (*cough*) by SirMasterboy · · Score: 1

      I have 13.4TB on my CrashPlan account and counting... I've had more than 10TB with them for over a year now. I have a friend with over 20TB with them too. Yes it really is unlimited.

    2. Re:Ah, "unlimited"... right. (*cough*) by LordLimecat · · Score: 0, Flamebait

      And this, folks, is why an "everyone gets their fair share" system like communism can never work. 1.8TB for a homeuser seems like theyre trying to take advantage of pricing that they know isnt sustainable.

      Theres no storage system in existence that will store 2TB of data for $5, let alone maintain it.

    3. Re:Ah, "unlimited"... right. (*cough*) by Anonymous Coward · · Score: 0

      I've had between 4 and 7TB at CrashPlan for years. No issues here with unlimited.

    4. Re:Ah, "unlimited"... right. (*cough*) by im_thatoneguy · · Score: 1

      I have about 6TB now on backblaze and haven't received a complaint. I don't know who "Just Cloud" is but it sounds like a rather small organization who resells cloud storage. Backblaze prides themselves on being truly unlimited as well as their efficiency. They probably would see the cost of storage less than the cost of the PR nightmare from all their bragging being undone by a policy that only affects a handful of clients.

      Having a 20TB backup would be a marketing win. It might cost them a couple dollars per month but that's cheaper than even a small Google ad campaign.

    5. Re:Ah, "unlimited"... right. (*cough*) by punkr0x · · Score: 1

      Then why do they call it "unlimited?" It's not up to the consumer to know if a company's business plan is sustainable or not. Claiming unlimited usage and then having a cap is just false advertising.

    6. Re:Ah, "unlimited"... right. (*cough*) by dcollins · · Score: 2

      Fraudulent Capitalist Practice Spun into Argument Against Communism. News at 11.

      --
      We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
    7. Re:Ah, "unlimited"... right. (*cough*) by ihtoit · · Score: 1

      your average user will probably never fill the 8GB memory in their phone with all the irreplaceable data they commit to digital their entire lives. That's what these "unlimited" plans rely on; average users having no more than 8GB of data to back up. Sure, there'll be some dick with 8TB, but if you've built for 1000 customers and just one kills your storage then you've got to do something. "Fair using his ass" is only fair!

      (OK, vastly oversimplified things, your average cloud provider will probably have built for a million potential customers - 8PB total storage with multiple failover and power contingencies up the wazoo, but again, bump up the stats - it'll only take a thousand 8TB dicks to kill *that*. I'd like to know how Google are doing it, my byte clock is showing 14.2GB of "free" storage right now that I don't use, I guess the other 800MB is my mailbox, and their platinum plan is $800/mo for 16TB).

      --
      Political debates have me rolling my eyes so much I think I got optical whiplash. I should sue. - Foamy The Squirrel
    8. Re:Ah, "unlimited"... right. (*cough*) by TangoMargarine · · Score: 1

      Then don't advertise the damn thing as unlimited! Say "up to 1TB" or whatever it actually is!

      --
      Unity? Screw that: XFCE. Slashdot Beta? Screw that: SoylentNews. Australis? Screw that: Pale Moon. UX developers DIAF
    9. Re:Ah, "unlimited"... right. (*cough*) by Curmudgeonlyoldbloke · · Score: 1

      Then why do they call it "unlimited?"

      Because they're selling to you.

      It's not up to the consumer to know if a company's business plan is sustainable or not.

      It is, if you want that company and your data to be still there in X months time.

      Claiming unlimited usage and then having a cap is just false advertising.

      If that's illegal where you live, call them on it. If not, move somewhere with decent consumer protection legislation.

    10. Re:Ah, "unlimited"... right. (*cough*) by stoploss · · Score: 1

      That's nice, but again, "unlimited" is always a lie. From your BackBlaze ToS:

      Backblaze has the right at any time to change, modify, add to or discontinue or retire any aspect or feature of the Backblaze Products including, but not limited to, the software, hours of availability, equipment needed for access or use, the maximum disk space that will be allotted on Backblaze servers on your behalf, or the availability of Backblaze Products on any particular device or communications service. Backblaze has no obligation to provide you with notice of any such changes.

      So, just like all these other providers throughout history, BackBlaze will try to make it seem unlimited until the moment they decide to fuck you over.

      Marketing deceit like this makes me want to get an account and set up a "dd if=/dev/urandom" job to create low compressibility garbage to crapflood the storage on their servers until they hit me with their "fair use" policy. Which they would... it's only a question of degree.

      So, where would they draw the line? 20 TB? 100 TB? I don't know, but dd can pull from urandom all day long...

    11. Re:Ah, "unlimited"... right. (*cough*) by stoploss · · Score: 1

      "Fair using his ass" is only fair!

      Not when the product is advertised as unlimited storage capacity. If they hadn't been deceitful and had instead advertised a price for say, 500 GB, then he could have made an informed decision.

      These "unlimited storage!" claims are deceitful and malicious. Companies don't want to give honest specifications in their marketing because they want to tweak that psychological sense of security that comes from knowing you don't have to be concerned about constraints. Selling "unlimited" plans gets them more money, but it's a damn lie.

      I'm as libertarian as they come, but all libertarians agree about enforcing contract law. I argue that a corollary to contract law is that parties are not allowed to redefine plain language terms like "unlimited" to mean "very limited, and completely at our capricious unilateral undefined discretion, which we can redefine at any time without notice". I have no sympathy for a company that sells an 8 GB product but wants potential customers to think it is 800 GB.

      tl;dr: Customers are not responsible for companies's shitty business plans that are built upon marketing deceit. Stop selling "unlimited" plans!

    12. Re:Ah, "unlimited"... right. (*cough*) by Anonymous Coward · · Score: 0

      I have 5 TB on Backblaze, so it seems unlimited enough for me!

    13. Re:Ah, "unlimited"... right. (*cough*) by Anonymous Coward · · Score: 0

      Sue them in small claims. If you lose, they still had the expense of defending themselves. If they don't show you win by default.

    14. Re:Ah, "unlimited"... right. (*cough*) by ihtoit · · Score: 1

      All marketing is deceit. That's the point of it.

      Prime example right there; another one is these ads for DVD/BluRay movie releases: "OWN IT NOW!" No - you don't "own" it, you have merely bought a limited and revocable license to view the content. The movie is still owned by the copyright holder, which is why they're still bitching about format shifting and not only trying to reverse SCOTUS decisions on it, they're making it ever more difficult to pull it off by going for DeCSS developers with all but tactical units on dawn raids and six year old girls on Gramma's broadband connection.

      Oh, not marketing here, this is purely anecdotal: I have a 3G connection on an "all you can eat" pay-as-you-go plan. I regularly pull upwards of 20GB/day, often more, and as far as I can determine I've never been capped or throttled. Not in over five years on the same plan. I wish I could work out how to pool daily packet statistics on Windows 7 (which I've been using since March 2011 - and the warranty on the laptop expired today), but hey, you'll just have to take my word for it :)

      --
      Political debates have me rolling my eyes so much I think I got optical whiplash. I should sue. - Foamy The Squirrel
    15. Re:Ah, "unlimited"... right. (*cough*) by Anonymous Coward · · Score: 0

      As the CTO and one of the founders of Backblaze (posting as Anonymous to preserve mod points) I really must object to being called a liar.

      Backblaze has quite a few customers who store more than 50 TBytes at Backblaze, and we absolutely do not do anything to turn them away.

      EXPLANATION: we do not make it "unlimited" to attract huge customers, we make it "unlimited" because Mom & Pop don't know a MByte from a GByte from a TByte and it stresses them out (creates sales friction) that if they take one more picture it will cost 1 more penny. So Backblaze is a "fixed cost" - by knowing it is $5/month in advance, we get more low end customers that would not have signed up. In the end, we only care about the average amount stored per customers. I know this sounds counter-intuitive, but by being "unlimited" (also known as "fixed price") we ****LOWER**** the average amount of data stored per customer by attracting more low end non-technical customers.

      REALISTICALLY: Backblaze does not limit your backup in any way, however if you refuse to purchase a fast enough internet connection, don't blame us for throttling. We don't throttle, but if you only have a 1 Mbit/sec upload speed on your ISP then you can only upload around 9 GBytes per day, so 20 TBytes will take you 2,000 days (6 years) to upload.

    16. Re:Ah, "unlimited"... right. (*cough*) by LordLimecat · · Score: 1

      Its like an all you can eat buffet. Show up with a wheelbarrow and both the other customers and the restaurant owner are gonna get pissed off at you.

      Theres such a thing as "common sense". Being super pedantic and hyper-technical in real life is a pretty good way to alienate everyone you meet.

    17. Re:Ah, "unlimited"... right. (*cough*) by LordLimecat · · Score: 1

      All-you-can-eat does not mean "unlimited food". Im fairly certain this was tested in court and the ruling was "quit being a jackass, you know what they meant."

    18. Re:Ah, "unlimited"... right. (*cough*) by LordLimecat · · Score: 1

      They probably dont have strict limits, its more of a "we know an abusive customer when we see it." Having a strict policy probably isnt necessary till they see the outliers consuming disproportionate resources, and then have to deal with abusive jackasses. 20TB is so far above the clear intention that its silly anyone would even have to point that out.

    19. Re:Ah, "unlimited"... right. (*cough*) by stoploss · · Score: 1

      I'll bite. If it's unlimited (and not a lie) then why do you have the weasel words in your ToS that allow you to limit available backup storage for users and also unilaterally make such changes without notice?

      Seems like you're leaving yourselves quite an "out". Why should people trust what your marketing "large print" trumpets ("trust us!") when your actual adhesion contract agreement "small print" reads just like every other typical sleazy lying provider's ToS?

    20. Re:Ah, "unlimited"... right. (*cough*) by TangoMargarine · · Score: 1

      Although that obviously falls into the "there will always be a top 10%" problem... :-)

      Cf. telecommunications industry refusing to upgrade the network, instead "selling" bandwidth at 10x (or whatever) actual capacity. "Oh yeah yeah yeah...it's totally unlimited..." Then they yank the rug out from under you.

      --
      Unity? Screw that: XFCE. Slashdot Beta? Screw that: SoylentNews. Australis? Screw that: Pale Moon. UX developers DIAF
    21. Re:Ah, "unlimited"... right. (*cough*) by punkr0x · · Score: 1

      More like an all you can eat buffet where they turn you away if you exceed a certain weight. They claim something like their seats can't accommodate you, but in reality they didn't mean all *you* can eat.

    22. Re:Ah, "unlimited"... right. (*cough*) by stoploss · · Score: 1

      All marketing is deceit. That's the point of it.

      Arguing that lies and deceit are acceptable in one instance simply because lies and deceit are perpetrated in another instance is a logical fallacy. It's wrong to sell something as unlimited when, in fact, it is intentionally limited. It's also wrong to mischaracterize a license as a a transfer of ownership.

      I'm pleased to learn you are presently able to use your 3G connection without restriction. I also guarantee you that your network provider's ToS will allow them to limit your "unlimited" connection whenever they see fit. Nobody would complain if a provider sold a 500 GB plan and then never charged for any overages (i.e. decided not to limit a plan that was advertised as having limits); people become incensed when they see an unambiguous term like "unlimited" and subsequently find out that it was a bait and switch.

      My friend would have never have paid for the JustCloud service if they had sold it honestly as a 500 GB cap plan. That would have been insufficient for his needs. They deceived him to get his business.

    23. Re:Ah, "unlimited"... right. (*cough*) by Anonymous Coward · · Score: 0

      Well I myself am using bitcasa, and they seem to be doing ok on their unlimited claim. I use about 7tb. There are some people on the forums who have 15tb and above. Now, they used to offer unlimited storage at 99 bucks per year, after all of us using up so much space, they had to up the price to 99 bucks a month. I'm just glad i'm grandfathered in on the old plan, cause i could never afford 99 bucks a month (or 999 a year).

      They were hoping that they'd hit that critical mass point, where they'd have enough casual users to offset the cost of us heavy users. They currently use amazon for their server storage needs.

    24. Re:Ah, "unlimited"... right. (*cough*) by Anonymous Coward · · Score: 0

      But yea, i agree, before finding bitcasa i was looking at alot of other services and JustCloud was one of them. Alot of them have terrible reviews and just for this reason. I'm glad i didn't go with any of them.

    25. Re:Ah, "unlimited"... right. (*cough*) by Anonymous Coward · · Score: 0

      All-you-can-eat does not mean "unlimited food". Im fairly certain this was tested in court and the ruling was "quit being a jackass, you know what they meant."

      Ironic you should choose that precise example. So, if a restaurant was advertising a price for "unlimited food" (like these companies are offering "unlimited x"), then people should be well within their rights to show up with a wheelbarrow, right?

      Take a look around at the other comments in this thread. You will see that apparently there is a very wide range of what is considered normal and acceptable usage, both by users and by providers. For example, apparently backblaze is fine with "unlimited" meaning 50+ TB (for now). It's just disingenuous for a provider to sell "unlimited" meaning "whatever I feel is appropriate, and I am allowed to change my mind whenever I like". Just sell a fucking capped plan that everyone can use as an objective reference and be done with it.

    26. Re:Ah, "unlimited"... right. (*cough*) by Anonymous Coward · · Score: 0

      More like an all you can eat buffet where they turn you away if you exceed a certain weight. They claim something like their seats can't accommodate you, but in reality they didn't mean all *you* can eat.

      Shit, if only these providers were as honest as that. It's more like an all-you-can-eat buffet that requires you to sign up for a membership, takes your money, and then after you sit down and start dining they decide you are eating too much and kick you out without a refund. Bonus if they charge you an early termination fee on your membership if you want to go with another restaraunt that is more suitable for your needs.

    27. Re:Ah, "unlimited"... right. (*cough*) by LordLimecat · · Score: 1

      then people should be well within their rights to show up with a wheelbarrow, right?

      And the restaurant will gently (or not so gently) remind you that it is a private establishment with every right to tell you to take your patronage elsewhere, that you are abusive and they dont want you there.

    28. Re:Ah, "unlimited"... right. (*cough*) by Man+Eating+Duck · · Score: 1

      Theres no storage system in existence that will store 2TB of data for $5, let alone maintain it.

      No, but the average customer stores far less, obviously it all works out because they're still in business :)

      I wouldn't find it inappropriate if Crashplan contacted the poster storing 14TB above with a suggestion to get a business plan (if he doesn't already have one), but Crashplan obviously can handle it. I've read articles about people successfully recovering terabytes of data from Crashplan, but that guy had to use the HDD recovery option because it would have taken him weeks or months to download it all.

      --
      Are you a grammar Nazi? I'm trying to improve my English; please correct my errors! :)
    29. Re:Ah, "unlimited"... right. (*cough*) by Anonymous Coward · · Score: 0

      All of the "unlimited" backup solutions offer terrible speeds. I have 14 TB of data I want to backup and I tried the trials of Backblaze, Crashplan and Carbonite. I also tested with Amazon Glacier using the multi-threaded FastGlacier client. I have a 75/35 FiOS connection, and can reliably do 83 Mbit down and 39 Mbit up.

      Of the unlimited services, Backblaze offerred the highest speeds at 2.5 Mbit/sec upload (~26 GB/day), while CrashPlan and Carbonite barely maintained speeds of 1 Mbit (10.5 GB/day). In contrast, the pay-for-what-you-use Amazon Glacier managed to max out my connection at 35+ Mbit/sec (369 GB/day). The key is that the free services do single-threaded uploading via SSL, which is highly latency dependent. With Amazon Glacier, I was running 15 threads and backing up to their Northern VA facility which is within 50 miles of my location. Backblaze is based in CA, and I believe their data center is on a fault line. Even with Backblaze, a full backup would take 1.5 years. I suspect that a restore of that much data would also be incredibly time consuming.

      Ultimately, I decided to backup my most important data (~350 GB) to Amazon Glacier (music, photos, documents). Amazon is targeting Glacier to enterprise customers and offers 11 nines of durability. They store at least 3 copies of your data on multiple servers and data centers, and perform regular integrity checks. In contrast, I've read forum posts about Crashplan randomly deleting user data. One user reported that his entire 2.5 TB backup was lost, and posted an email from Crashplan noting that his data was lost due to a combination of hardware failure + human error on the part of an employee.

      As for the remaining data, I decided to simply add an additional parity drive. This data is mostly movies and TV shows that can be re-ripped/downloaded.

      I actually emailed Backblaze about the poor speeds since they specifically advertise that they don't throttle uploads. Here was their response:

      Greetings,

      Sometimes due to network issues along the path from an ISP to Backblaze's servers, some customers on some networks can not max out their connection speed to us. Backblaze encapsulates data in 10MB chunks, uses https connections, uses a single connection at a time, so latency can play a very large role in overall backup speed.

      If you've ensured you've set the Backblaze throttle is either set to Automatic or the slider is all the way to the right, indicating maximum, you've done everything you can to improve your upload speed.

      We don't throttle backups, so the issue may be on the route between your ISP and our servers, or general latency.

      Cheers,
      Christopher

  74. The Assange Protocol by Theaetetus · · Score: 1

    Simply compress and encrypt your backup data, then post it on a torrent tracker as "New Julian Assange insurance file, decryption key to be released if extradited". Thousands of other people will make backup copies for you.

    1. Re:The Assange Protocol by TeknoHog · · Score: 1
      --
      Escher was the first MC and Giger invented the HR department.
  75. blu ray ? by Anonymous Coward · · Score: 0

    Well, Double layer blu ray can be a way of archiving for at least once without continuous updates.
    It will take about 40 double layer discs. Not sure if it will survive 50 years, but I'm sure it will be enough until a new magnetic or optical technology comes to replace it in, lets say, 5 or 8 years.

  76. Amazon Glacier by uiucgrad · · Score: 3, Interesting

    I use Glacier and its great. 20 TB is about $200 a month which to me does not seem like all that much money for backing up that much data. The biggest problem from a home users perspective is getting all of that data to Amazon. Hopefully he lives somewhere where fiber is available to his house.

    1. Re:Amazon Glacier by koan · · Score: 2

      They charge you to download it too.

      --
      "If any question why we died, Tell them because our fathers lied."
    2. Re:Amazon Glacier by Anonymous Coward · · Score: 0

      10 months of that and you could have bought a Synology NAS + 20GB of disks. Park it at a friend's house and it's offsite.

    3. Re:Amazon Glacier by PRMan · · Score: 1

      Let's see. How many legal movies and songs could you buy digitally for $200 / month. Lets say 20 movies and 100 songs on average. How many nights do you even have to watch things or listen to music? The amount of time and money people spend to not spend money baffles me.

      --
      Peter predicted that you would "deliberately forget" creation 2000 years ago...
  77. Re:no by jones_supa · · Score: 2

    md prnt dwn

  78. Paulo by Anonymous Coward · · Score: 0

    http://www.mdisc.com/what-is-mdisc/

  79. Connect a raspberry pi and by Coeurderoy · · Score: 5, Funny

    Connect a raspberry pi and configure it as a backup server and let it copy all to /dev/null...
    Then put aside the money you would have invested in a "better" solution, put it in a safe bank (under your mattress)
    and wait until you need to restore something..
    Most probably you'll enjoy the money more ...

    1. Re:Connect a raspberry pi and by Anonymous Coward · · Score: 0

      The time taken for a backup is sufficient for a significant interest to accumulate, even with the most trustful of under mattress banks. Money, it keeps on multiplying.

    2. Re:Connect a raspberry pi and by Anonymous Coward · · Score: 0

      I did this. The money helped me restore my liver instead. I'm enjoying it much more than the porn solution.

  80. $420,000 in movies? by Anonymous Coward · · Score: 0

    I can sort of feel for your friend. I once lost about 20GB of music due to not having a backup -- the ex-wife had nearly all of the original CDs, so there was no way to get it back.

    However.

    Let's say the entire collection was movies. At .7GB per movie, that would be about 28,000 movies. At an average cost of about $15 per movie that's $420,000 worth of movies. So, your friend has spent $420,000 on movies in his lifetime? That seems a little far-fetched.

    If the entire collection were music, at 3mb per song, that's about 6,666,666 songs. At $1 per song. So, your friend spent $6 million on music in his lifetime? Again, that seems a little fantastic.

    If my numbers are wrong someone please correct me.

    I know just about everyone has SOME illigitimate media on their systems, but perhaps you friend's real problem isn't a lack of backup capability, but a lack of karma. :-)

    1. Re:$420,000 in movies? by Minwee · · Score: 1

      Or let's just say that the friend is a film maker, or a recording engineer, or an astronomer, or just about any other kind of person for whom storing multiple terabytes of irreplaceable data is just another day at the office.

      And compressing a full length movie into 700 megabytes? What a horrible thing to do to it. While I appreciate your attempt to use MPAA math to compute the value of that hypothetical collection, it would be more realistic to assign a full DVD or Blurry disk to each one instead of squeezing them onto CD-Rs. At 25GB per title, 20TB would be completely filled by only 800 of them, which is a completely different story.

    2. Re:$420,000 in movies? by n7ytd · · Score: 1

      Or let's just say that the friend is a film maker, or a recording engineer, or an astronomer, or just about any other kind of person for whom storing multiple terabytes of irreplaceable data is just another day at the office.

      I think this is the OP's problem: the question wasn't "how do I do this?", but "how do I do this and not spend more than $10?"

      "Irreplaceable" data is worth spending money to keep.

  81. BTSync by SkipF · · Score: 1
    Build two arrays. Sync the data with btsync... before you separate them. Once the original data set is synced then move one of the arrays to a trusted friend's house. Additionally, use find, or similar tool to keep a document of all the files you have and store that on DropBox or similar service. You now have two equally robust copies at separate locations, and a document of what you have in a third location in case you need to rebuild again. Also, reread the previous post about compressing your collection, you have 20tb of stuff, but do you really have 20tb of value?

    -=Skip

  82. Easy.... by LoRdTAW · · Score: 1

    When you have that much data your only real viable option is to have a second storage array just as large. RAID 5/6 isn't backup. It provides fault tolerance which means you can still access (read/write) the array as you normally would if a disk fails.

    Its almost a no-duh solution in lieu of tape or other cumbersome removeable storage options. Even backing up to 50GB or 100GB blu-ray discs would be rather pointless as the cost of a single disc is the cost of buying a movie on blu-ray. Even if you could fit 4+ movies on a 100GB BD-ROM is it worth the hassle and cost?

    Why is another array better? Its quite simple. You dont have to shuffle discs or tapes to make backup sets. You also aren't stuck with a format that could become obsolete. That LTO tape drive might look good but what if it fails? Can you find another for a reasonable cost? Will you periodically test your backup tapes or disks for bit-rot? A tape or BD-ROM rotting in a safe deposit box, safe or shoe box under your bed is useless. If your disk fails you can replace them quite easily as most every disk supports SATA or SAS and controllers are found on every motherboard. A failure of a disk will be reported and you can handle it.

    Here is another question that is kind of burning in my head. If your friend legally purchased all of his movies and music, wouldn't he have the original sources? If not, and i'm not judging I have a collection of both music and movies that I pilfered over the years, you have to be smart and make copies. My collection is just about 2TB, mostly some hard to come by movies and TV shows (entire MST3K library). My home server used to be on 24/7 but it was a waste of power. I almsot lost that array of 5 500GB disks until I made copies to 1TB drives. I now have a few 2TB drives that have copies of the server data on them. One drive is even at my place of work in a USB box. Even if my home burns down I still have a copy somewhere.

  83. Floppies by wisnoskij · · Score: 1

    15 million of them.

    --
    Troll is not a replacement for I disagree.
  84. You don't. You find out what's important first by Anonymous Coward · · Score: 0

    Even though he has 20 TB of data, there's no chance all that data is irreplaceable.
    That's the only thing backup is for, irreplaceable data. When you have large amounts of data, you start evaluating what needs backup and not.

    Recently crashed a 2 TB disk myself. Barely shrugged from it, even with only a few hundred MB worth of backup (private photos, videos and documents).
    Why? Because most of the data was either easily replaceable or not important enough warrant a backup.
    I actually considered the disk crash nothing worse than spring cleaning. New disk, restored what was irreplaceable.

  85. Tape by rabun_bike · · Score: 1

    Tape backups are the cheapest way to go as far as media and surprisingly is making a comeback due to high storage requirements. It can be expensive as far as hardware and software depending on what you buy. We backup about the same amount of data in our production environment for offsite storage. Latest tapes can hold 4 TB per tape.

  86. Make a list by PsyMan · · Score: 2

    Catalogue the contents and when you lose it all you can spend 10 minutes searching for the 2% of the content you really want to download again and feel good that you now have 98% of your storage space back to start filling with more crap :D

  87. Andre the Giant's thumb drive? by Anonymous Coward · · Score: 0

    Might be big enough for that.

  88. Very likley by SuperKendall · · Score: 1

    lot of time (and money) spent burning discs that you likely will never need

    If you have any data, over a long enough period of time you WILL need a backup. Saying "I will likely never need this backup" is a non-sensical statement, because (a) you probably will, and (b) the cost of NOT having the backup is essentially infinite in pain and grief.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
  89. Even if all legal media, not easy to recover by unfortunateson · · Score: 1

    Even if 100% of that 20TB is legally owned content, recovery is a huge process: re-ripping hard media is still awfully slow -- if you can even find where you stashed it (I think a few CD's have walked off the reservation)
    Purchased digital media is no better: you've got many sources to find it from, and it may disappear: preview tracks, live tracks, etc. may disappear when they stop updating their MySpace, or a local distributor goes belly-up. That's also assuming you're still using the same providers: if you had download privs on some of the music servers of the 2000's, you'd have 'ownership' of that media, but you may not be able to get it again if you aren't still paying for the account.

    The most economical and reliable is probably a mirror RAID array. It sounds like this guy accidentally issued a command to erase the content, rather than a RAID failure. Ordinarily, the RAID should be good for most stupidities, but this falls a little outside that. The question is, if you have mirroring software, how frequently does it try to match, and would it clean off the mirror too?

    --
    Design for Use, not Construction!
  90. I use the internets! by dmatos · · Score: 1

    20TB of music and movies? How many of those could be downloaded again tomorrow? My guess is "most." The only thing that's really "mine" on my computers, and not backed up, is my own pictures. I upload those to image sharing sites on the internet. Most docs are done on Google Docs for portability reasons, and other things I've created are already on Dropbox.

    I experienced a catastrophic hard drive failure a year or so ago. After replacing the hard drive, and about one day of downloading and installing the programs I needed, I was up and running again. It took 24 hours to download enough of the series that I was watching to pick up where I left off again. And if I ever get a hankering for watching something I've seen before, well, I can get it from the internet again in a matter of hours or days.

    --

    It may look like I'm doing nothing, but I'm actively waiting for my problems to go away.
    --Scott Adams
  91. 50 DKP minus! by Bengie · · Score: 0

    Very very slowly. Who pulled the whelps?!

  92. I keep it simple with BackBlaze by BenTristem · · Score: 2

    I was going down the route of buying an expensive RAID NAS / DAS, but then I remembered when I got broken into in the Canary Islands and the thieves took both of my backup drives from two separate rooms. I'm now settled on a simple external drive, with the whole lot backed up offsite. I was looking for... + Unlimited backup, so I don't need to think +The ability to backup attached drives (NAS, DAS, USB, etc) + To feel that my data is safe with a 2nd layer of encryption You can try it free here: http://bit.ly/1bRNax1 My blog post about this: http://www.bentristem.com/1/po... Enjoy!

    --
    Ben Tristem I'd love to know more about you in this short survey... http://bit.ly/1oM7Fvl
  93. a used LTO autochanger is what I employ by Anonymous Coward · · Score: 2, Informative

    No one will ever see this anonymous post but a cheap robot changer (used) on ebay can be had from between a few hundred to a few thousand dollars. Most of us are geeks and love technology. I use two such devices, couldn't imagine life without them. LTO4 is still the sweet spot in storage cost (media) and capacity. The tapes hold 800GB and can be purchased for around $22 dollars each.

     

  94. that you likely will never need by Anonymous Coward · · Score: 0

    "Blu-Ray discs can hold a lot of data, but that's a lot of time (and money) spent burning discs that you likely will never need."

    Any form of backup is there incase the very worst case scenario happens, so you have answered the question, if you want to save it - is it worth the effort?

  95. Keep the original CDs and DVDs by Anonymous Coward · · Score: 0

    If it's 20 TB of movies, TV shows and music then maybe he purchased hard copies legitimately. That is your backup.

  96. No 2 ways about it by obarthelemy · · Score: 2

    1- if you need to backup 20 TB today, you need to budget for 40TB in the medium term.
    2- a backup is off-line, off-site, tested, and multiple. The "multiple" part is pricey, and the other 3 you can get cheapest with a PC filled with HDs. Or two (I'm making do with one). $200 for the BC, $150 per 4TB HD x 5 = $950. Hide that backup in a place safe from theft, floods, fire...

    --
    The Cloud - because you don't care if your apps and data are up in the air.
  97. check for backups first by Anonymous Coward · · Score: 0

    Call the NSA and see if they'll do a restore on your data.

  98. He's better off now by essbase_nerd · · Score: 1

    Tell him to stop hording. Streaming is good!

    The only irreplaceable data that I posses is digital home videos and digital photos of the family/kids. I have a lot, but it's super easy to back up to an external HDD that I keep locked up at work. To protect against HDD failure between those monthly off-site backups, I replicate the data partition on our main PC to a second PC in my house using a scheduled Sync Toy job.

    Cheap, very effective.

  99. If you do go tapes/drives, store them offsite by Anonymous Coward · · Score: 0

    Safety deposit box, Iron Mountain, your trustworthy friend who lives far away but not so far you couldn't drive there and back in a day, and so on.

    If there's a twister with your home address written on it, all the onsite backups in the world won't help.

    .

  100. Giving back! by Anonymous Coward · · Score: 0

    The same way they got the music seed it to the internet

  101. problem? by Anonymous Coward · · Score: 0

    Why would it be a problem? if he owns all the music/videos the worst is he has to do is re-rip all of it, thats just time, it sucks but thats life. At 20TB of data, there is no good backup solution, not one that is affordable or reasonable.

  102. Let the NSA do it by Anonymous Coward · · Score: 0

    just label it as terrorist plans and they'll back it all up from you for free!

  103. Ok by koan · · Score: 1

    First "LOL", sorry had to get that out of the way.
    1st choice for that much and to be reliable use tape, 2nd choice multiple hard drives and by that I mean multiple backups not just one, 3rd choice BlueRay disk, after that backingup starts using bandwidth, Amazon Glacier is cheap to upload and store, they get you when you want it back.

    Or you could sign up for Mega at 50 GB a pop.
    20 Mega accounts = 1 TB, yes go ahead and laugh but I have a terabyte of storage online for free. (use their sync app)

    I used a Gmail account to sign up, so lets say your Gmail is "turtle@gmail.com" well did you know you can use "turtle+01@gmail.com and it will go to "turtle@gmail.com"?
    Yeah it will, so my Mega sign up scheme is turtle+01@, turtle+02@, etc, and I can manage it all from "turtle@gmail.com"

    Finally I would like to offer an apology to "turtle@gmail.com", nothing personal it just popped into my head.

    --
    "If any question why we died, Tell them because our fathers lied."
  104. Simple by Lumpy · · Score: 1

    20TB of backup drives.
    Not that hard, although if it was really important I suggest 40TB of backup drives and 2 full backups. I'm a fan of tape for large capacity. SDLT600 would be his best solution with a small 10 tape carousel. do a full backup monthly and then incrementals every week.

    20TB means he has to spend money. If someone freaks out at the cost of a real backup solution, then the data was worth less than the backup.

    SO manually back up to 20 2TB hard drives on a esata interface, or automated Tape system. either way it is not going to be cheap.

    --
    Do not look at laser with remaining good eye.
  105. TAPE by Lawrence_Bird · · Score: 2

    There are many > 1TB tape back up systems, many with very high speeds, assuming you can feed it data fast enough.

    I have to wonder though.. 20TB for a single person? I'm not gonna do the math but that sounds like so much stuff to be impossible to listen/watch all of it.

    But at least he has proven once again, RAID is not a backup. RAID will merrily do what ever you wish, including copying drive corruption.

  106. Simple by zakeria · · Score: 0

    ever hear of punch cards!

  107. Prioritize by jgotts · · Score: 1

    Most data that people have on their hard drives can be readily re-obtained via BitTorrent or in other ways. The simple and probably best strategy is to figure out the 500 GB or less that is actually irreplaceable, and make several copies of that. I have three or more copies of my most important data.

    Or, looking at the problem another way, 4 TB hard drives are selling for $160 right now including shipping. A complete insurance policy would cost $800 plus your time. What I would have done if I just had to save everything would be to simply copy all of the data in 4 TB hunks, and put each hard drive one by one into a fireproof safe, or in a safe-deposit box at the bank. A second RAID would be complete overkill, unless time to recovery is of the essence or the data churn rate is high. More than 90% of my data simply accretes over the years, and I'm sure that is true for most people.

    $800 is a small price to pay for your data. I seem to recall that it cost a company I worked for over $1,000 to recover a 9 GB IBM hard drive that failed about 15 years ago.

    According to this article, Seagate is promising 20 TB hard drives by 2020:

    http://www.computerworld.com/s...

  108. rsync with a friend by Anonymous Coward · · Score: 0

    Solution: Find a friend who has a collection very similar to yours. Make a gentleman's pact and rsync them, serving as an active backup for each other.

    1. Re:rsync with a friend by PsyMan · · Score: 1

      I think the feds have just busted an international ring of people who like to do that sort of thing.

  109. Bandwidth considerations? by ElForesto · · Score: 1

    I don't know about you, but I'm pretty sure my ISP would flip out if I tried to transfer even 1TB in a month. Even if they didn't care about the amount of data being backed up, it would still take me around 231 days to upload that much. Any kind of online backup would be infeasible for the initial dataset, but it's also probably not a great option to ship in a box of hard drives.

    Let's be honest: any large dataset like that is going to cost some serious coin to backup. You can probably "cheat" by incrementally backing stuff up to Crashplan (with its "unlimited" storage), but it'll take so long to seed that initial dataset that you're likely to experience some kind of data loss before it's done.

    --
    There is a difference between "insightful" and "inciteful" other than spelling.
  110. Your process is more important than your media by Anonymous Coward · · Score: 0

    The first thing you need to do is to categorize your data both by type and by common content groups (e.g. audio tracks by artist, photographs by family member, etc.).

    Next, start backing up your data one group at a time with only one group per backup device -- hardware is cheap.

    When you are done you will have a segmented backuup that should be stored off site in order to protect you from a localized catastrophe.

    Every night you should back up any new or altered content to removeable media which you can take with you.

    Periodically bring back all of your media and add the content of the incremental backup to the appropriate media.

    Now, as to media. My own preference would be for external hard drives because they are inexpensive, reliable, and readily available.

    By the way, with 20TB of data it would be a good idea to build a small locator database so that you can easily locate and retrieve specific content.

    Have fun.

    Jerry

  111. Should be "Back Up" by horm · · Score: 1

    As in "How Do You Back Up 20TB of Data?" "Backup" is a noun. Verb conjugations are "back up", "backs up", "backed up", etc.

    1. Re:Should be "Back Up" by geekoid · · Score: 1

      Uh, no.
      His data is on tape and in the back of his station wagon that he needs to backup out of the driveway. :)

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  112. JBOD, mhddfs? by Blaskowicz · · Score: 1

    It would be relatively easy to backup this on "only" four 4TB disks. They could be in one USB3 enclosure each, or in an outdated PC (pentium 4 or something) that is turned on for backups only, whatever.
    A simple mechanism to make them appear as a single ~16TB volume or directory would be nice. Or perhaps optional. Or just use some real backup software.
    Maybe the backup will be so painfully long (days?) that a drive failure may be a concern.

    On another note, I'd like a very easy and nice to use program that simply back ups the file names etc. ; I can afford easylier to lose music/movies if I have a list of what I actually had, so the good stuff easy-to-find can be found back and reconstituted.

    1. Re:JBOD, mhddfs? by SirMasterboy · · Score: 1

      Havent heard of mhddfs, thanks. There is also AUFS too.

      Why not just output a directory listing to a text file every day to keep track of the files you have?

      You can even text diff them and its easy to see what changed.

    2. Re:JBOD, mhddfs? by Anonymous Coward · · Score: 0

      find . -type f -print

    3. Re:JBOD, mhddfs? by Anonymous Coward · · Score: 1

      On another note, I'd like a very easy and nice to use program that simply back ups the file names etc. ; I can afford easylier to lose music/movies if I have a list of what I actually had, so the good stuff easy-to-find can be found back and reconstituted.

      Linux - have cron periodically run:
      find / > filenames.txt

      Windows - have task scheduler periodically run:
      dir /b/s c:\ > filenames.txt

      If you need more than that on linux, you can code a perl script to grab md5 or crc32 hashes and throw that in there too. If you need more than that on windows... ehh, good luck with that.

    4. Re:JBOD, mhddfs? by mjwx · · Score: 1

      JBOD

      I have no idea why people would run RAID at home without a rock solid backup solution. Even using RAID 5 or 6 to protect against an individual disk failure leaves you wide open to a controller failure (especially if it's software RAID), I've seen consumer grade RAID controllers fail a lot. Seeing as you dont need the speed RAID provides, JBOD protects you against a controller failure but leaves you open to losing a disk, but would you rather lose a fraction of your pron, erm.. sorry, home media collection or the lot of it.

      I ran 2 x 1 TB drives in RAID 1 for a while and all it did was conflict with the audio driver (both were inbuilt into my mobo).

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
    5. Re:JBOD, mhddfs? by cboslin · · Score: 1

      With Linux I simply use ls -alFR (as root user) from root (/):

      login as root ( sudo or su - root, depending on how you have it setup)

      cd / get to the root directory of that machine and/or drive

      ls -alFR > /dirname/yymmdd-ls-alF.txt where yy is 14 for 2014, mm is 03 for March and dd ps 13 for today or this

      ls -alFR > /dirname/140313-ls-alF.txt

      I I had multiple drives, I would put one of these commands in a script, one for each drive and put the drive name after the date in the name in the filename to differentiate one text file listing from another.

      The added plus is if a rootkit gets put on your machine and you have taken the time to get to know what files are put where on your machine, you will find that rootkit.

  113. Crashplan may be affordable by LordLimecat · · Score: 1

    Ive used Crashplan for years at clients, friends, and personally, and its generally been good. They have 2 options that may work here.

    The first is their all-you-can eat backup service, but they may well balk when you tell them its 20TB-- they might shove you to a $120/year business plan.

    The other is buying a pack of Crashplan ProE licenses, which let you host your own cloud backup service. You can use any PC as the "server" (just make sure its reliable and on 24/7) and it handles diffs like a champ. It also verifies backups to avoid bit rot.

    1. Re:Crashplan may be affordable by geekoid · · Score: 1

      10 dollars a month for 20 TB+ isn't really that bad. Over all it's going to be cheaper then building a back up array, plus no maintenance.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    2. Re:Crashplan may be affordable by rvw · · Score: 1

      Ive used Crashplan for years at clients, friends, and personally, and its generally been good. They have 2 options that may work here.

      The first is their all-you-can eat backup service, but they may well balk when you tell them its 20TB-- they might shove you to a $120/year business plan.

      The other is buying a pack of Crashplan ProE licenses, which let you host your own cloud backup service. You can use any PC as the "server" (just make sure its reliable and on 24/7) and it handles diffs like a champ. It also verifies backups to avoid bit rot.

      You can use the free version. Setup another computer with sufficient capacity, so probably 6x 4TB external drives in a ZFS system of something like that. Make the first backup locally. Then move the computer to another location.

      Only catch with these local backups - it seems to be that when CP goes offline (bankrupt), even the local backups won't work anymore. Just disconnect the network cable from the backup server, and then try to restore a file from the local backup to test this. I use CP myself, and haven't seen anything that compares in price, but this "feature" is something I really don't like.

    3. Re:Crashplan may be affordable by LordLimecat · · Score: 1

      Only catch with these local backups - it seems to be that when CP goes offline (bankrupt), even the local backups won't work anymore

      Im fairly certain that is correct, but they offer an enterprise version without that issue.

  114. Hard copies? by Anonymous Coward · · Score: 0

    If he possesses 20TB of digital copies of legitimately backed-up media ripped from CDs, DVDs and Blu-ray discs, it shouldn't be that big a deal, should it? Just start ripping again as he needs them.

  115. License? by scsirob · · Score: 4, Funny

    Just assuming that your friend had a fully legal collection, I would think that all he needs to do is ask the media companies for a new copy. Because the media industry tells us that we do not buy music, we buy licenses, right?? So even if we lose the bits-and-bytes which are easy to replace, then we still hold a license and the media companies should facilitate that your friend can exercise his licensed rights..

    [/sarcasm]

    --
    To Terminate, or not to Terminate, that's the question - SCSIROB
    1. Re:License? by gnasher719 · · Score: 1

      Just assuming that your friend had a fully legal collection, I would think that all he needs to do is ask the media companies for a new copy.

      On iTunes, the average song is 8 MB for $0.99. 20 TB would by 2.5 million songs. I actually doubt that the test guys at Apple who are responsible for testing iTunes have a 2.5 million song library.

      (Of course HD videos are a lot bigger; they can easily be 3GB or 4GB, so that would only be let's say 5,000 to 10,000 videos purchased, at say $9.99 on average).

    2. Re:License? by Wdomburg · · Score: 1

      An SD movie can easily be 6-8GB. An HD movie can easily be 30-50GB.

  116. Cheapskate by cerberusss · · Score: 1

    he didn't have a good way to backup that much data

    But he did. Another RAID array of the same size would have sufficed. Oh, now I see what you mean. He didn't want to spend the money on a good way to backup that much data.

    Another issue entirely :-)

    --
    8 of 13 people found this answer helpful. Did you?
  117. The right answer by pla · · Score: 1

    Having a similar setup myself, and having looked into exactly your question, you have exactly one realistic answer:

    You back it up to an identical (or larger) disk array.

    If possible (though not necessary), you'll want to do the initial backup with both arrays directly connected to the same host; but after that, just rsync --link-dest (to make hardlinked differential snapshots) them on a nightly basis.

    For a media server, where the typical use case consists of adding large files slowly over time and little ever changes, your backup shouldn't take up much more room than the primary storage.

  118. Throw away the trash and compress the rest by bobbied · · Score: 1

    Seriously, who needs 20Tb of data at home? This is like a digital version of "Hoarders" or something. Time to clean house and organize.

    First, it's time to TAKE OUT THE TRASH. I'll bet the large majority of this data is stuff you never use, don't know you have or is simply out of date and unnecessary. Toss it.

    Second, De-Duplicate what's left as best you can. No need to have multiple copies of the same pictures at different resolutions, or the same video encoded multiple ways in your backups. Keep the best resolution stuff in your backup, forget the rest. Don't backup anything you can re-rip from the original media (i.e. that DVD collection, Oh, don't have the DVD's anymore? Turn yourself into the MPAA...)

    Third, Compress what's left.

    If you find that 20Tb is what you need to keep, then stop asking Slashdot for advice and go buy yourself a professional tape drive and some brand new tapes and start doing backups like a professional. If this is too expensive, start over at step 1 and really take out the trash this time.

    --
    "File to fit, pound to insert, paint to match" - Aircraft Maintenance 101
    1. Re:Throw away the trash and compress the rest by Anonymous Coward · · Score: 0

      Third, Compress what's left.

      Yeah, that'll be useful. Take media files that have already been compressed with a compression algorithm specifically designed for the contents and compress them again with a general purpose compression algorithm. Why not compress them a third or fourth time? Eventually they'll all compress down to 1 bit each.

  119. Boxcars / Gigabyte by jabberw0k · · Score: 4, Funny

    If IBM punch cards were used, 1 GB equals approximately 47 cubic yards (assuming 80 bytes per 187x86x0.18mm per card) and about 70,000 lbs (at 2.42 g per card), so one standard railroad boxcar (limited by both cubic capacity and weight) could hold about 3 GB. 20 TB would need over 6000 boxcars of punch cards; at 60 feet per boxcar, that's a freight train about 70 miles long.

    1. Re:Boxcars / Gigabyte by joshuac · · Score: 3, Funny

      Well sure, latency is a bitch but imagine the throughput once it got moving!

    2. Re: Boxcars / Gigabyte by Anonymous Coward · · Score: 0

      Best way to make your comment neither understandable by Americans nor the rest of the world.

    3. Re:Boxcars / Gigabyte by Anonymous Coward · · Score: 0

      80 column "IBM" cards have 12 rows, so you could put 120 bytes per card... ---> train would be about 46.67 miles long.

  120. b1200i by Anonymous Coward · · Score: 0

    With a four Drobo B1200i 2x local and 2x offsite silly goose!

  121. If you do this a lot, then try tape. by Vitriol+Angst · · Score: 1
    --
    >>"ad space available -- low rates!!!"
  122. my 2 cents by Anonymous Coward · · Score: 0

    You would be looking at either 20 gigs of distributed storage across multiple hard drives (not recommended but a valid option for us who don't own a raid server yet), or a secondary storage (like a secondary raid array server or drobo box that gets backed up to once a month)

    I'm no where close to 20 gigs, but even without a raid array, if I had a hard drive fail, I all of my data either backed up on unplugged hard drives (in case of a power surge), on there original disks (optical media) or in the cloud (google music / docs / onedrive / etc)

    The bigger issue here is versioning. It sounds like your friend has a decent setup but should set up some sort of redundant copies of files (like what time machine and windows backup kind of offers) so that if you somehow deleted your files, you could just point to a specific backup and press restore (you would obviously want more space) That way if your drive somehow got wiped (other then a complete format of a disk) you should be able to recover since it would be on its own hidden partition.

     

  123. well by geekoid · · Score: 1

    Take the storage media he has, and then duplicate it.

    Really, was that hard?

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  124. NAS by Anonymous Coward · · Score: 0

    NAS

  125. This - ish by Anonymous Coward · · Score: 0

    First, get organized - what can you absolutely not lose, what would be inconvenient to lose, and what do you just not care about...

    Things that can absolutely not be lost (email, accounting and tax stuff, projects documentation source materials and deliverables, photo library, etc) get backed up 2x with one or both backups offsite.

    Things that would be annoying to lose but could be re-built (system software, settings, VMs that are doing something, etc) get backed up only once (twice if you have business continuity needs but I was assuming a home user)

    Things that you don't care about (test VMs, non-keeper photos, downloaded videos, etc) are not backed up.

    Then you can reduce backup times by doing stuff like making sure your email is in MailDir format, not Mbox or PST files (which use one big file for everything - and this file changes every time you look at an email.

  126. Synology by Anonymous Coward · · Score: 0

    1 Synology DS1813+ - $1000
    6 4TB drives (RAID 5) - $1000
    Done for $2000, and you have 2 more drive slots to expand into. If the stuff you're backing up isn't worth $2000, just get 5 4TB external drives and use them. Bonus if after the initial backup you put the Synology at a friend's house & use it remotely.

  127. DVD by Anonymous Coward · · Score: 0

    This is where the "physical media" is obsolete BS hits the wall. I keep all of my video on a disk array for convenience. For emergencies I have the originals on disk. Gasp! Yes I purchased them on actual physical media. Since I know nobody here pirates their video, I'm sure this will work for them too. If you download all of your paid video, then I guess you should budget for a decent backup too. Pretty much eliminates any cost savings though.

  128. Re:Hmmm I throw down the gauntlet! by Vitriol+Angst · · Score: 1

    This is Slashdot, until someone makes a Beowulf cluster of punch-card processing machines, we can't call ourselves geeks!

    And, someone needs to compute how many punchcards it would take to back up Google. Oh yeah, I'll just google that;

    Let's assume Google has a storage capacity of 15 exabytes, or 15,000,000,000,000,000,000 bytes. A punch card can hold about 80 characters, and a box of cards holds 2000 cards. 15 exabytes of punch cards would be enough to cover my home region, New England, to a depth of about 4.5 kilometers. That's three times deeper than the ice sheets that covered the region during the last advance of the glaciers.

    http://gizmodo.com/if-data-was...

    I think it's important to know that Google's data would be three times thicker than the glaciers during the ice age. It's strangely comforting.

    --
    >>"ad space available -- low rates!!!"
  129. An Awful Lot of Data for the 'Net by trydk · · Score: 1

    20 TB is an awful lot of data for backing up over the net.

    What I do is backing up over the net to my brother's NAS. (He lives in another country.) I use rsync and it works like a charm. It is a bit of a bother when I have been taking a lot of pictures but as it works in the background and is traffic shaped with low priority, it is manageable. I've got a fairly slow 1Mbps/6Mbps connection, so it takes some time. 20 TB would take the better part of a year, but since I do it incrementally as I get the data, it has been manageable so far. The Raspberry Pi server at my brother's replicates it to a friend's NAS as they both have 10/10 Mbps lines.

    I keep a local copy on a Raspberry Pi with a couple of USB drives, just for the fun of it.

    Worst case scenario that my house burns down or similar total catastrophe: My brother copies my data to an external disk and sends that by courier to me. Downtime around 24 hours.

    And, obviously it is fairly easy to restore individual files over the net.

  130. FreeNAS by Anonymous Coward · · Score: 0

    Setup a FreeNAS box with enough space to store your data, have it replicate to the FreeNAS box, then take that FreeNAS box off-site.

  131. Stop being cheap or collecting so much crap data! by cyn1c77 · · Score: 2

    What does it mean that he didn't have "a good way to backup that much data, so he never took one"?

    The concepts behind backing up data have not changed. You need to manage the size of your data to redundantly fit into the storage of your system. So either pony up the cash and time to properly store your files, stop collecting TBs of crap, or stop complaining about losing it when your system crashes.

    It's frustrating to see people continuously complaining about how they have too much data to back up cheaply and conveniently. It's even more frustrating to see them complaining about losing all of their data because they didn't back it up properly.

    I think that the main issue is that most people do not realistically or conservatively plan their actual storage capability. For example, it seems like 90% computer users believe that having 4 TB of hard drive space means that they can safely store 4 TB of data.

    After a conversation about scratch space, redundant drives, and timestamped backups, they then will grudgingly agree to allocate 25% of their available storage to RAID/Backup space, which obviously does not get the job done! Very few are willing to accept using 66% of their available hard drive space for RAID and Backups, which is really the minimum metric for any sort of storage longevity.

    20 TB is an awkward amount of data for a non-corporate individual to be storing. It's more data than most people actually need for their media and it is getting into a very expensive price range to backup for basic music/movie content. (By expensive, I mean that it would be cheaper to just re-purchase the media rather than back it up.)

  132. exactly by Anonymous Coward · · Score: 0

    Hey, look how big my collection of movies is that I never watch and did not purchase! You don't need a home backup strategy since your usenet provider or torrent tracker already has it waiting for you in most cases...

  133. Please post Tape backup ref by advid.net · · Score: 4, Insightful

    To /.ers saying that 1TB+ tapes would be a good idea to do this backup, please:

    Add some references and price of such hardware and media that would suit best home usage.

    1. Re:Please post Tape backup ref by Anonymous Coward · · Score: 0

      Your comment seems to suggest that $1500-$2000 tape backup is not home usage.

      I would argue a 20TB raid array isn't home usage.

      So if we have moved past home usage, then yes a $1500 tape backup and about 20 1.6 TB tapes (another $500) is very reasonable considering you have ~$2000 invested in the hardware to run the 20TB raid array not including the cost of the content.

      for the lazy
      http://www.newegg.com/Product/...
      with
      http://www.newegg.com/Product/...

    2. Re:Please post Tape backup ref by OverlordQ · · Score: 1

      LTO-6 Tape Drive: $2200
      LTO-6 Tape: $65 x 4

      --
      Your hair look like poop, Bob! - Wanker.
    3. Re:Please post Tape backup ref by Anonymous Coward · · Score: 2, Insightful

      Stop ! Stop ! Stop ! Stop .... Seriously STOP !

      There are a few comments here that are really trying to help, and most are wasting time. This is a real problem, it's a real problem we all have - even if we don't think too hard about it. I bet that lots of you have a usb stick with a 100% backup of some critical files... perhaps even multiple copies ( 200%, 400% coverage )

      To solve this problem - you have to understand what your requirements are ( surprising eh - but nobody is talking about it ). It's clear that most of the comment have no idea about managing large data volumes, and they clearly don't understand backup solutions.

      Here are some questions - the set of answers to these questions decide which backup solution is optimal for THIS use case ...

      What 'backup' problem are we trying to solve ?
      - A single spare copy ? Multiple copies ?
      - Is all the data in the backup 'equal' ?
      - How often does the data change - or is it mostly static ?
      - How much data loss from the primary source are you willing to accept ? ( In otherwords how closely does the backup have to match the live data set )
      - If you lose the primary source ; how quickly do you need the data set back ? ( Can you backfill the dataset while continuing to work on the live system, or are you hard down until a full restore has completed ).
      - How long does a backup need to live ? ( If you do a fresh full backup every week, and you write it to a biodegradable wafer, and it dissolves in 6 months ; but you have copies for every intervening week - do you care that the 6 month old version is nolonger readable ) ... OR do you need to restore from a year ago, 5 years ago, 10 years ago (think financial records))

      Figure out the answers to those questions - and you might be a ways towards deciding what solution you actually need.
      There are commercial software systems (and opensource ones like amanda/zmanda) that understand the data management lifecycle - and are setup to support solutions like 'intermittent full backups ; with daily/hourly incremental snapshots'. . but then there are the filesystems that support snapshotting. Look at the features that the full service solutions provide - they might prompt the conversation to help you understand what you actually require.

      So - if you have 15TB of mostly static data (movies etc. ) - then you can archive that to long term media.
      If you have 5TB of changing data ( music library, playlists, new mp3s, new movies ) - then you can snapshot that using the filesystem hourly , and back it up to the cloud weekly (with incrementals) - and then dump it to long term archival every 6 months..

      Where the (15TB, 5TB, 6 months) values in the above statement are completely made up - and are not the ones you should use unless I guessed right.

      None of the 'use tape, use cdr, use punched paper, use lasers to etch it on the moon' solutions will work if you don't first understand the problem you are trying to solve.

      This isn't hard - but if you don't know what you want, then every solution is the wrong one and too expensive

    4. Re:Please post Tape backup ref by advid.net · · Score: 1

      Thanks.

      Hum... the LTO-6 tapes are only 2.5TB native ( 6.5TB with 2.5 compression, that won't apply to media files).
      Someone said there's 4TB tapes now, it looks so but news about this are rare...

    5. Re:Please post Tape backup ref by bdam · · Score: 1

      Define 'would suit best home usage'. If you have 20 TB of data on a RAID running on a server in your home then I'm not sure the term 'home usage' really applies.
      LTO-5 which stores 1.5TB natively can be had for under $1,500: http://www.newegg.com/Product/...
      LTO-5 tapes are $30: http://www.newegg.com/Product/...

    6. Re:Please post Tape backup ref by Anonymous Coward · · Score: 0

      Launch the Anti Oracle People....

      At work I use Oracle SL150's. To get one configured to hold 30 LTO5 tapes and a single drive, it runs us about $6k. Now the media adds up pretty fast at about $30-50 per tape, but they hold 1.5TB per tape uncompressed. So it would probably set him back about $8k for the HW and media to make a backup of 20TB (less if it was pretty compressible). But you still need some sort of software to manage that.

      I couldn't imagine using tar to backup and keep track of that...

    7. Re:Please post Tape backup ref by Anonymous Coward · · Score: 0

      Tape is a relic from the stone age continually foisted upon you by dusty SA's and PHB's.
      The ONLY reason you should consider tape 1) you abuse the fuck out of your backup media... vibration, shipping, pot smoking data jockeys 2) you do not have a separate datacenter to put your own spinning media duplicate or 3) you have some requirement to ship tapes around

      1.5 TB of LTO5 is $30 each for cheap brand.
      LTO5 drive will set you back $1500 for cheap brand.

      4TB of hard disk is $165 for cheap.

      drive = $990
      tape = $1900+

      Assuming you have 6 SATA ports free (because you need at least raidz1),
      you're saving $1000 by using drives for 20TB backup solution. Which by
      the way means you can also TRIPLICATE your data for the same cost
      as one tape solution, AND you have realtime access, online access if
      you want it, no cleaning tapes to buy, no manual loading, etc.
      (N extra copies of tape cost $400 to be fair).

      In nearly all small installations that don't also require regulatory multi-year sitting
      in a box on a shelf in a bunker storage, drive is the FAR better solution.

    8. Re:Please post Tape backup ref by lart2150 · · Score: 1

      Unless it's something other then LTO 2.5TB is the best there is right now http://en.wikipedia.org/wiki/L...

    9. Re:Please post Tape backup ref by lart2150 · · Score: 1

      I should have done a bit more looking around before I posted my other comment. If you want to drop a crazy amount of money you can get a Oracle tape drive. The T10000 T2 tapes hold 8.5TB uncompressed however it's unclear if you can buy one drive or how much one costs. http://www.oracle.com/us/produ... http://www.oracle.com/us/produ...

    10. Re:Please post Tape backup ref by Anonymous Coward · · Score: 0

      Something like an HP StoreEver LTO-6 Ultrium 6650 External Tape Drive with interface card/cables and a half dozen pieces of LTO-6 media can be had for under $4000.

    11. Re:Please post Tape backup ref by advid.net · · Score: 1
      Here is what I've found, here (my emphasisi):

      First step in LTO-6 (native 3.2TB, 210MB/s) was made in June 2011 when the three technology providers announced the availability of licenses for this format. But later these specs were reduced to only native 2.5TB (+67% compared to LTO-5) and 160MB/s (a mere +15%). Generally, the capacity of the LTO tape follows current highest capacity of HDD. But it's now 4TB uncompressed and LTO-7 is supposed to be native 6.4TB (and 315MB/s) and will be there probably in at least two years.

      So it looks like LTO-6 would backup 4TB uncompressed, pushing the original standard a little bit (no pun intended).

  134. "electronic" collection? by Anonymous Coward · · Score: 0
    As opposed to what? The live orchestra most of us keep clothed and fed in the living room?

    Just because the player is integrated to the spinning discs doesn't make it more electronic than tape or CDs. Even record players have been electronic for what, 70 years now? How old is subby?

  135. Music and Movies? by roninmagus · · Score: 1

    Why bother? It's rarely-used, practically useless bits anyway. A quote from John Nash: "Facts are available where direct memory fails in many circumstances." In this context, that could mean use spotify and netflix to stream your dumb music and movies, rather than saving them indefinitely.

    1. Re:Music and Movies? by SirMasterboy · · Score: 1

      That would be an option IF Netflix had half the movies I want to watch (which it doesn't). Their library is pathetic and their streaming quality (both video and especially audio) is also poor compared to BluRay.

  136. 20TB of music and movies by Anonymous Coward · · Score: 0

    More media than one could actually consume in one's lifetime. Unless you're some kind of weirdo that actually likes 4GB-sized lossless encoded films.

  137. The Bigger Question: by Vitriol+Angst · · Score: 1

    How do you listen to 20 terabytes of music? You won't repeat a song for at least a year, at a guess.

    --
    >>"ad space available -- low rates!!!"
    1. Re:The Bigger Question: by SirMasterboy · · Score: 1

      It's about selection... How am I supposed to predict my mood and know what songs I want to hear in the future and only keep those?

  138. Depends on the data by panda2005 · · Score: 1

    He should at first assess his needs for backing up files. What kind of data for home use could possibly fill 20TB? Does he need to keep a backup of everything? Either way, you could always get a second (third?) NAS server and HDDs and setup automatic backups. A NAS with 2 bays will set you back 100$ or less. Add 2x 4TB for 400$. Better yet, build a custom file server for under 200$ with a nice case with plenty of room for disks. Be sure to choose a motherboard with 4 or more SATA ports and fill it with HDDs. FreeNAS is a great OS with many options. Five 4TB (=20TB) would cost under 1000$, so for 1200$ you have a nice long term solution with great flexibility.

  139. 20TB? At home? by tgv · · Score: 1

    Seriously, where do you get 20TB from? I mean, if you rip 250 DVDs at home, you've got 1TB. So for 20, you'd have to rip 5000 (or a bit less), without further compression. If you compress them a bit, 20TB would store over 20.000 movies. So, what's in this 20TB and where is it from?

    Anyway, simplest solution: 10 $99 2TB disk units and some time.

    1. Re:20TB? At home? by SirMasterboy · · Score: 1

      The 2000s called, they want their DVDs back...

      It's all about BluRays now and soon 4K video. BluRays are about 30GB-50GB per movie (especially if you want the bonus features for a complete ISO) and 4K will be at least double that too.

      And before you suggest it. No I do not want to compress my video. I got the BluRay in the first place because I want the quality. I watch my content on a 136" projector screen and I can tell when it's been compressed.

      I have a 20TB array at home and have a separate set of disks I use as an offline offsite backup.

    2. Re:20TB? At home? by tgv · · Score: 1

      I never tried to rip a bluray, but isn't backing up to hard disk not a bit expensive for movies? 2TB would hold 40, so backup would be $2.50. Not cheap.

      Or are you suggesting download only versions, and no way to download again?

    3. Re:20TB? At home? by SirMasterboy · · Score: 1

      I never claimed that it's cheap, but this is usually what people with several TB of storage at home are doing.

      $2.50 is cheaper than having to buy another disk if something happens to it.

      It's for the convenience of it. If I have a movie library of hundreds of movies. I can find one in an instant and play it on the big screen on my HTPC. I can be at a friends house or a relatives house and pull out my phone and stream it from my server at home and connect it to their TV with HDMI. I can give out a login to friends and family and they can enjoy streaming the movies too.

      Then of course there is the piracy option too. $2.50 to store a BluRay is far cheaper than buying one haha.

    4. Re:20TB? At home? by tgv · · Score: 1

      Yeah, but then the solution to have more backups is simple: add more hard disks. The price is pretty low in comparison to the investment in all movies, indeed.

    5. Re:20TB? At home? by dogbowl · · Score: 1

      I've got nearly 16TB at home and know a number of people who have nearly the same as well.

      Here's a hint - DVDs are nearly a twenty year old standard. Movies these days are stored with 1080p quality and can easily be 10gigs each.

      Beyond that, I keep all of my home movies in uncompressed format. That and pictures and can quickly eat up a lot of space.

      --

      These pretzels are making me thirsty.
    6. Re:20TB? At home? by tgv · · Score: 1

      Amazing. Someone else replied similar, see above. I think I'd compress it. Our eyes don't process all that detail. But if you don't want to, the backup solution is really easy: either you see the original disks as backup, or you get an extra set of hds...

    7. Re:20TB? At home? by MattGWU · · Score: 1

      You don't hang around http://www.reddit.com/r/dataho... much, do you?

      --
      "These people look deep within my soul and assign me a number based on the order in which I joined" --Homer re:
    8. Re:20TB? At home? by tgv · · Score: 1

      Can't say I do. Can't say I do.

  140. sure by Anonymous Coward · · Score: 0

    There are a few things I thoroghly enjoyed watching and would consider watching again.

    Here is a list of things I would consider having stored.
    The Wire
    Drive
    Oz
    The Matrix

    Here are a few things I wouldn't consider storing
    Sitcoms
    The simpsons etc.
    Uwe Boll movies

    Things I'm on the fence about
    Game of thrones
    Breaking bad

    Now, here is another thing that makes this discussion rather useless. If you like it that much, just bite the bullet and buy a copy, I know it seems wrong buy you are most likely not a starving college student anymore.

  141. Bacula by meta-monkey · · Score: 1

    Part of your choice of solutions will depend on the nature of your data. Is it changing often? At all?

    I use Bacula for my backups. My wife has a photography business and her collection of images is about 6TB and is being added to constantly and occasionally edited. The Archive is about 5TB and is stuff that is unlikely to change. Then there's the Working array, which is 1.5TB (max) and generally clocks in around 700GB. This is current work that hasn't been delivered to the client yet (RAW files from recent weddings/portraits, JPEGs where the client is still picking out what they want, PSD files for current album designs, etc). Both are on RAID5 arrays, and the Working array has a hot spare. At the end of each month, the Working folders are gone over for bodies of work that have been delivered to the client and are unlikely to change. This work is then moved to the Archive, backups are burned to DVD and also copied to an external hard drive.

    For the Working backups, I have JBOD on another server. I think it's 6 1TB disks. These are set aside for different Bacula pools of volumes. There's two full backup pools, two differential backup pools, and two incremental backup pools.

    Every night at 5AM a Bacula job kicks off. On the first sunday of the month, a full backup of the Working array is dumped to the JBOD. Bacuala makes a backup copy of everything (~700 GB). On the other sundays of the month, a differential backup dumps everything that's changed since the previous full back up to another set of volumes on the JBOD (~80GB). On every non-sunday of the month, an incremental backup copies over everything that's changed since the previous incremental backup (or differential or full backup if it's a monday). I have two sets of these pools. On odd months, it uses Full-Pool1, Diff-Pool1, and Inc-Pool1. On even months it uses Full-Pool2, Diff-Pool2, and Inc-Pool2. This way I have two sets of backup copies of everything so I don't have to delete last month's full backup to make this month's full backup.

    It works pretty well, and every morning I get an email telling me that all the backups worked fine and the arrays are stable. I know it's a little anal, but well, I couldn't imagine having to tell a bride "Hey we lost your wedding photos. Hard drive crash. Too bad." With the system I've got, unless the house burns to the ground I'm fine. And if the house burns to the ground, I've got bigger problems. I wouldn't mind an off-site solution, but I don't see how I can transfer the several TB of backup data I have at any given time someplace else, except by carrying hard drives out of the house every day, and I don't think that's something I'd be able to stick with for very long.

    --
    We don't have a state-run media we have a media-run state.
  142. Upload to TPB by Anonymous Coward · · Score: 0

    If you put it in the cloud and distribute it to everyone odds are it will remain available until the end of the world.

  143. Friends don't let friends use RAID! by dcray2000 · · Score: 1

    First of all, don't run RAID at home for data storage. RAID systems are for corporate high availability. They are inherently dangerous the moment you have to touch the config and only worth it if you need a drive system available 24/7 with hot spare. Truly stable RAID systems are also huge power hogs and heat sources. You can build highly redundant file systems for a fraction of the cost, with a small fraction of the power.

    This is easily the 15th time I've heard of someone loosing huge amount of personal data to RAID. The last one I heard was everything for the poor fool, wedding pictures, kids pictures, etc...

    Beyond that, I have about 20TB myself. I use DFSR to keep it highly available, then a one way rsync job with no purge. That way if I mess up one of my replicas, it won't get purged from the rsync target. I take an encrypted version of the rsync target to a friends house regularly so there's no chance of massive loss. I also back up limited encrypted data to the cloud, but only documents, code, and pictures.

    Don't add complication where you don't need it.

    1. Re:Friends don't let friends use RAID! by BitingChaos · · Score: 1

      RAID is fine, even for home users. The only way you could lose data to RAID is if it wasn't backed up. RAID isn't a backup.

  144. math majors by Anonymous Coward · · Score: 0

    overthinking the joke times pi

    1. Re:math majors by Redmancometh · · Score: 4, Funny

      Now this joke has really come full circle.

    2. Re:math majors by zdepthcharge · · Score: 1

      Someone once told me that backups were a flat circle.

  145. Easy to do... by Anonymous Coward · · Score: 0

    Buy a Drobo D800FS for $899 which has 8 drive bays. Drop in 8 drives w/ 4gb @ $186 for a total of 32gb...for $2,400 total. If your data isn't worth $2,400 to you, then forget about it.

  146. Paper tape by kenh · · Score: 1

    Sure the information density is pretty low, but it lasts forever!

    --
    Ken
  147. Sames way you back up any data... by sqorbit · · Score: 1

    Choose Cheap, Quick and Correct, but you can only choose two.

    --
    Sent from my TARDIS
  148. Look at the bright side by kenh · · Score: 1

    It likely didn't cost him a dime to build up that collection...

    --
    Ken
  149. Hoaders, digi-edition. by DeTech · · Score: 1

    I thought I was bad with my media consumption. 5TB of visi media, 3TB of audio, 2TB of um... not porn. I really only back up the audio. Do you really expect to rewatch all of the other stuff? If you did how hard would it be to reacquire?

  150. Tried and true by syntheticmemory · · Score: 1

    Back up all your data to stone tablets.

  151. How do I do it? by Anonymous Coward · · Score: 0

    How Do You Backup 20TB of Data?

    I do it easily every night.

    If you have a friend with a large server in their basement, you can back each other up securely and quickly. You might have to physically transport one of the servers to the other for the initial sync, after that you can easily keep up with each other over a FIOS home Internet connection.

    Of course, my data's stuff I actually own, which makes a pretty big difference. You might not have many friends willing to incriminate themselves by storing your malware-riddled piles of pirated music and pr0n torrents on their property.

  152. Classification, first step by enriquevagu · · Score: 1

    The first step is to classify the data in two groups: what you would not want to lose at any cost, and the redundant data (movies, music, etc) that you could survive without. This is the most important step

    The second step is to backup the important data using an external 1 TB drive, tape or similar.

    Optionally, the third step is to delete the remaining 19 TB.

  153. Buy a second 20TB server and mirror it. by MatthiasF · · Score: 1

    After a certain point, you have to go big or get out.

    A tape drive able to handle 20 TB is going to be $3k+.

    Online backup is out of the question. If it takes two weeks to backup 300 GB to Crashplan or Amazon Glacier, it'll take two and a half years for the 20 TB.

  154. Jottacloud has unlimited storage 6$. And it works. by fluor2 · · Score: 1

    Being a Jottacloud customer for a long time, I really like their backup. Unlimited storage is 6$ per month. You can specify when to back up, and you can exclude subfolders from sync, and you can limit the bandwidth used.

    I guess it's not very well known in the US, but it's been for several years in Europe. All servers are located in Norway.

    Unlimited is limited to one computer.

    Jottacloud.com: Jottacloud.

    (I am in no way affiliated to jottacloud)

  155. Give copies data to friends and family. by Anonymous Coward · · Score: 0

    I solved this problem by simply making copies for all my family members for their media centers. If one of my drives fails then I could simply call one of them up and do an RSYNC to recover the missing data.

  156. Segregate Data by Anaxagoras · · Score: 1

    Do you need to back up all 20 TB? Or is half of it crap you got from usenet/torrents?

    I run a 24 TB usable zfs array that I snapshot regularly so I can restore an event like me being a dumbass and doing an rm -rf /Array/.

    As far as backups I separate my content into 3 major categories.
    original content - this stuff i backup regularly to 2 locations. it contains things like home movies, pictures, documents, etc. I copy to a usb drive and to a cloud backup service (I use crash plan). It's stuff I can not replace and would be devastated if I lost it.
    rare content - stuff that's hard to find. I back this up too, but only 1 location. It's mostly static, it consist of things that took a lot of time and effort to find but are probably still replaceable. I back it up to the cloud only.
    replaceable content - stuff that's backed up already on the bit torrent network. It's mostly media i just hoard that i download off of usenet. If i lose it it's not a big deal.

  157. Just mail them to get your data back? I tried it and it worked like a charm. The next day the latest copy of my document was in my maibox.
    They even had gone through the trouble of correcting a few spelling errors, a misspelled name and a glitch in the layout.

    They did censor the part about privacy though.

  158. Re:Stop being cheap or collecting so much crap dat by SirMasterboy · · Score: 1

    20 TB is an awkward amount of data for a non-corporate individual to be storing.

    4K movies will be out shortly. We will be looking at 50-100GB per movie. Some people want to backup their disks and have them accessible for their HTPC because its significantly more convenient.

    And before you suggest, no I do not want to compress my movies to a lesser quality... That's why I got the BluRay in the first place. Because I want high quality. I watch them on a 136" projection screen. I can tell when it's been compressed...

  159. Ever hear od redundancy by sabs · · Score: 1

    Redundancy is the way you do it.
    You have an identical system as the first one, with 30TB of drive space, and every night you copy the data over to the other system. That will cover you for anything short of a house fire/tornado/earthquake/flood.

  160. $825 by sjwoo · · Score: 1

    4TB drives sell for $165 right now on Newegg. So five of those would cost $825. Your friend could stick them all into a large PC with multiple bays and create an enormous RAID-0 array of 20TB. Then he could use FreeFileSync to copy those files. Or he could set up another NAS with those 5 4TB drives and just do a copy/sync. It'd take days for the initial load, but it would be backed up. The problem with having that much data on CrashPlan (also my cloud backup of choice) is that it would take so long to restore it -- too long, I'd think. You'd blow all sorts of bandwidth limit to do so. And you can't use their restore-to-door plan for backups greater than 3.5TB. Until we have Google Fiber running everywhere, bandwidth just doesn't make it feasible to push all that data to the cloud.

    1. Re:$825 by mlts · · Score: 2

      Even with bandwidth, there are caps and fees here in the US. Try moving 1TB of data via LTE, and the telco will likely hand the person a five digit bill next month. Do it on some cable company plans, and you will be greeted by a $300 bill. So, large data via the Internet isn't going to happen.

      There are a number of solutions for this problem:

      1: One of the better ones is a server with decent backup software and a LTO tape drive. Then eight tapes will save the 20 TB. Expensive, but the job is done right.

      2: One can always have a 20TB RAID, then plug in removable HDDs and use them with WinRAR or another utility as volumes. I'd buy 5-6 4TB removable drives, use WinRAR to make segments, and have at least one recovery volume so that data can be recovered if a HDD fails.

      3: One can always buy a an external RAID enclosure, add drives, and use that as a large volume. Then use multiple enclosures that were swapped around as volumes (with an offsite rotation), so a failure or loss of everything at the site wouldn't mean everything is gone.

      4: Buy a RDX drive and media, and use 2TB disks. The drive costs $600, the 2TB disk cartridges around $360. However, this just needs a USB connection, no fast box, no SAS interface required.

      If I wanted to do the job "right", I'd buy a dedicated server with its own RAID array, use a decent backup utility, and dump the data to multiple sets of tapes, one set being stored offsite. If the server and where it sits gets destroyed, it can be re-bought. LTO-4 and newer have built in AES-256 tape encryption so just set a long passphrase that you can remember and call it done.

  161. 20TB of static content? by Bobberly · · Score: 0

    First question is how much data actually changes on a given day? I have 20TB of data... 50% are videos that will never change. I think I only average 10GB of changed files a day. So, every physical drive I buy also has a USB external drive for backup. I keep the drives online and test the backup weekly for the content that is static. Critical drives are mirrored daily (usually to flash AND usb hdd). Less critical is on a 72 hour mirror (or on demand if i know i made many changes) I've lost about 7 drives over time. I just pull out a spare drive, start an immediate mirror and order another replacement drive. I stopped using real-time mirrors to avoid accidental data deletion.

  162. Re:Hilariously Cheap Hard Drives by BoRegardless · · Score: 1

    The cost of 2 independent sets five 5GB hard drives is NOT enough to worry about compared to the cost of obtaining the 20GB of data.

    You can price the cost of this in a few minutes, period, end of discussion.

    If you are half way clued in, take those backup/clones over to two different physical locations.

  163. I have an idea by slashmydots · · Score: 1

    Don't have that much data. See, it really was that easy.

  164. Backblaze by DiamondGeezer · · Score: 1

    http://www.backblaze.com/

    Unlimited storage $5 a month. You're welcome

    --
    Tubby or not tubby. Fat is the question
  165. The answer ... 4TB drives are cheap enough by DJRikki · · Score: 1

    Multiple 4TB drives. Best you can do.

  166. Hubic. 10TB for 10 Euros. by mverwijs · · Score: 1
  167. off-site BT sync by Anonymous Coward · · Score: 0

    1. purchase a second machine (similar spec, but not necessarily)
    2. find a friend with a similar situation
    3. build initial copy of data onto second machine on local LAN
    4. go to friend's house and maintain a BT sync copy remotely
    5. allow friend to use your bandwidth as you're using theirs (or alternatively pay for another high speed drop at friend's place)

    That's how I'm doing it (only 6TB, but still)

    BT sync, someone above mentioned Crash Plan, rsync ... pick your poison

  168. Count me in by __aaqvdr516 · · Score: 1

    There are plenty of people who do 1:1 backups of movies and music. It's extremely convenient. I don't handle any physical media more than once. It keeps the house tidy and the disks in pristine shape if I ever need to re-rip.

    Around 6 months ago I had a similar problem to the story. My media drive died a sudden death (Seagate drive, never again). I had all of my family pictures, home movies, music, and movies on that drive. I had done backups and stored them remotely and was able to recover most of what I had. A few re-rips of some movies and I was done.

    The time investment necessary to rip a 1:1 copy for a large collection is not insignificant. I probably should setup raid + parity at some point but right now I'm only doing a clone of my stuff. I don't have bandwidth capacity at home to use any sort of cloud storage.

  169. 4-Bay eSATA enclosure, 4x4TB HD, and mhddfs by Anonymous Coward · · Score: 0

    It's only 16TB today, but when 5TB drives arrive in the market (Real Soon Now), it'll be a 20TB solution.

    I use the Mediasonic 4-Bay USB3+eSATA enclosure -- small, portable, easy to just "plug in".
    Then Linux + mhddfs to bond the four drives into a single "meta" filesystem of full capacity.

    Simple.

  170. Self created by Anonymous Coward · · Score: 0

    What if his 20TB is mostly self-shot video files? Or pictures? Or studio recording session audio? Some people do produce their own content.

  171. Quite Simple by TechnoGrl · · Score: 1

    A 4 TB slowish seagate hard disk can be had for about $160ish if you look around. Five of them are $750. An inexpensive bod tower such as a TowerRAID 4 Bay eSATA RAID runs about $150. Get two of them.

    Total cost is around $1100 and the solution is expandable .

    --
    ----- In Your Cubicle No One Can Hear You Scream...
  172. backup disk to other disk by roc97007 · · Score: 1

    A couple of things. First, do you really need to back up all 20 GB of data? How much of that can be recovered by other means? For instance, is it reasonable to back up the OS if you would probably just reinstall anyway? How much of your content did you acquire electronically? Would it be easier to go back to the source?

    Thing two: If you really have to back up all 20 GB, the only really practical, cost-effective way to back up that much data is to another set of hard drives. Build up a second array, replicate, and then turn the backup array off. Leave it off except for periodic backups.

    For incremental backups, dedicate one removable SATA slot. (I use one of those "hard drive toasters" that plug into a USB slot and allow you to hot-plug a SATA drive.) Plug in a drive on a regular schedule, and copy over the files that have changed recently. Mark it with a sharpie and put it in a safe place.

    The idea is to (a) back up only what you couldn't easily recover through other means, (b) back up to the cheapest and fastest per byte, which is currently other hard disks, (c) keep your backup disks turned off when not in use, and (d) Figure out a schedule that suits you. For me, it was replicating the entire array only a couple times a year, supplementing with incremental backups to individual drives every week or so. Yes, you could still lose data, but not nearly as much as if you did nothing. Don't choose a solution so ambitious that you would later tire of it and stop doing it.

    --
    Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
    1. Re:backup disk to other disk by confused+one · · Score: 1

      Dude, he said 20 terabytes not 20 gigabytes, you're off by 3 orders of magnitude.

    2. Re:backup disk to other disk by roc97007 · · Score: 1

      Terabytes, right. The solution still stands.

      --
      Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
  173. Amazon Glacier is NOT expensive for that by cloud.pt · · Score: 1

    So, guy spent around 10 x $100 (2TB drives), maybe more since you mentioned redundancy, for a total of ~$1000. Guy kept drives probably up 24-7, spending a lot in the electricity bill, I would say something in the lines of $150/month. Guy also had to manually maintain the complex disk array, prone to failure. Guy failed at it and lost invaluable amounts of (mostly) unrecoverable data (good luck getting that TV show from the 90's that now has 0 seeds on TPB, your familly event pics and videos, or your college papers).

    Now tell me, how can ~$250/month be expensive for 20TB in Amazon Glacier? They will give you transparent redundancy (if they lose the data you have reasons to sue for MILLIONS, you know, those numbers with 7 figures instead of 3). They will pay the electricity bill. They will buy the hardware. They will maintain the hardware too, so no need to replace drives. Your ISP is shapping traffic to AG? Sue them or change provider. Last time I checked it was a lot easier than doing ANYTHING on your 4TB+ RAID array, especially since it's for home use and will return you absolutely nothing besides self-complacency.

    Just sayin'

    1. Re:Amazon Glacier is NOT expensive for that by prezkennedy.org · · Score: 1
      Sue the ISP or change providers?

      Where do you live that you can change between decent high-speed providers? The best option I ever had was the choice between Comcast and Verizon FiOS. Good luck suing either of them.

      Nowadays I'm on a smaller cable provider. And that's the only option, unless you still consider DSL or satellite "high-speed".

      --
      It started back in Team Fortress Classic
  174. Considerations by agoldenlife · · Score: 1

    In all seriousness most people don't have that much data to backup. But I can see it might be possible, but it isn't going to be necesarrily cheap. Assuming that this data is a positively must KEEP, then using the 3-2-1 rule of backup here is what I would suggest. 1. Need to have a second synced copy. So you are going to have to purchase some kind of NAS or large storage device. You can go your own DIY route (FreeNAS) or BackBlaze storage Pod 3.0, or something like Drobo. Plenty of lower cost options out there. But it will cost some money to do it. 2. Use BackBlaze or CrashPlan for an offsite replication. There are no limits! I use BackBlaze for mine and have about 2 TB backed up there. It took about a week to get it all there because there are upload limitations by your ISP and by them, but it will eventually get it all. For $60 a year, you can't beat it! 3. Writable media (Blue-ray or DVD) is a viable option, as it is cheap but complicates recovery. And it has longevity issues. It should not be thrown out if keeping cost low is a priority. Also if the data is so rarely used, then this would be a better solution than paying for the energy and cost of hard drives. Other considerations: 1. Like any filing system, physical or digital it needs to be checked, purged and arranged on some kind of annual or semi-annual schedule. To get rid of stuff no longer needed, and to make sure you do not have duplicates, and to see if you are going to need more space this year. I simply have an internal 4TB drive that I use to sync data, a second drive for image backups of the computer, then I use backblaze for offsite storage. I know, I have 4 copies, but it makes me feel safe. 2. It seems like priorities haven't been established when it comes to retrieval. At times it appears Cost is your highest priority, then at others convenience. You won't be able to have an extremely convenient cheap solution. You need to decide which is the highest priority, and then the next and then the next.

  175. Really? by BitingChaos · · Score: 1

    Services like CrashPlan cost pennies a day and would have backed it all up. If they could afford 20TB of media and the storage to host it, there is no reason they could not afford to back it up.

  176. It's no that complicated... by clupean · · Score: 1

    I think buying 5 x 4TB hard drives would be the best solution but if you have a decent upload speed, there are online back-up solutions for $4-$5/month (usually you have to pay the whole year in advance). I've also seen people back-up on usenet: create your own alt.binaries. sub and upload everything there. Obviously, don't upload the personal files even encrypted since anyone can download them.

  177. Terribly pricy? by Anonymous Coward · · Score: 0

    If you want to go the cloud route you should look into CrashPlan. For around $4 a month you can store unlimited data on the cloud--and they actually mean unlimited. I know of someone who has 61 TB stored on their cloud. Although, restoring that much data would take a long time and a load of bandwidth. Restoring that much from tape would take loads of time too...

  178. retards by Anonymous Coward · · Score: 0

    Everyone hear seems retarded. the first question everyone should ask was what raid was he using 0,1,5, ect. I'd suggest he use a form of raid 10

  179. Comment removed by account_deleted · · Score: 1

    Comment removed based on user account deletion

  180. The RAID was the backup by Anonymous Coward · · Score: 0

    Wasn't it?

  181. I don't see the problem by Anonymous Coward · · Score: 0

    All my music and movies are stored and backed up on TPB. On my machine at home I have a 100GB HDD that is approximately 10% full, and that includes the OS.

  182. How about "dont erase your array"? by Gothmolly · · Score: 1

    If you have a PEBKAC error torch your array, you can have a PEBKAC error with your backups.

    --
    I want to delete my account but Slashdot doesn't allow it.
  183. Don't forget the time aspect by Anonymous Coward · · Score: 0

    Wherever you backup to, consider the time it takes to create a full / incremental backup. That basically rules out any online backup service, as even with 100MBit/s fibre it would take you one month to create a full backup. If you want a full backup to complete within 24 hours, that means you need something with nearly 2GBit/s of bandwidth. While SATA 2.0 and 3.0 can do that, you will need to write this to a medium. Current hard drives perform at about 1.5GBit/s. So you will need to backup to a RAID level that combines the bandwidth of several disks (RAID-0, 5 or 6).

  184. backblaze 5$/ month by crazyfrenchmen · · Score: 1

    Hi,
            just use backblaze, at 5$/month unlimited storage...

    --
    "Failure is not an option, it come bundled with the software"
  185. Just buy disks and a JBOD by SplawnDarts · · Score: 1

    The cheapest solution where you retain a decent amount of control is basically to replicated what Amazon or whoever would do - create an array of the cheapest high-cap disks you can buy and put the data on it. Your net cost will be about $1000 plus ongoing electricity cost.

    Anyone who's charging you less than that (with the $1000 amortized over 3-5 years) is running at a loss and likely won't be around when you want your data back.

  186. How did this make the front page? by Vrtigo1 · · Score: 1

    Either buy double the storage and periodically do a differential backup or use a cloud service. A Google search for 'unlimited cloud backup' yields tons of results.

    If he has 20TB of music and movies, why even back it up at all? The majority of that content is available on BitTorrent. The idea of backup is that you only backup unique data that can't be replaced.

  187. Cloud storage is not protect against search by Anonymous Coward · · Score: 0

    FWIW - the moment your data leaves your person/computer/home, searching the data no longer requires a search warrant. Just a subpoena, which is simple to get.

    http://www.zdnet.com/u-s-attorney-general-government-should-get-a-warrant-before-email-cloud-storage-snooping-7000015493/

  188. Raid Box by spacepimp · · Score: 1

    So my first thought is that 20TB is excessive. But if he and you are certain that the 20TB is all necessary then it is going to be expensive(ish). Buy a computer with a Perc controller and a used DAS/MD1200 from some supplier. I just bought one with 30TB of storage for 3K, with less than a few months of use on it. Take that and set up Syncback Pro on it to monitor for changes and set it to back up the new files/changes into the DAS backup folders.

    20 TB is no small amount of data to accumulate. If it is precious and valuable and needs backing up then your friend needs to be prepared to accept the costs associated with protecting such a large quantity of data. If he balks at it, then ask him if the roughly 3.5 k would replace the lost files. People who have serious photography/lightroom habits are in a similar position. I spent about 40 hours trying to rescue and restructure the un-maintained mess of someone who couldn't be bothered to understand file folders and naming methods. When their primary drive failed, it was some effort to piece it all together from recoverable portions of their drive and files located across many folders on many different drives. Lightroom confused him more than manual placement would have.

  189. My backup solution by Sandman1971 · · Score: 1

    I have a similar set up. Between music, movies and photos I'm close to the 15TB range. I'm selective as to what I back up however.
    I don't back up commercial movies or music. I have the CDs/DVDs/Blurays that I ripped. If something were to happen to the NASes that's holding that media I can always re-rip. For movies/tv shows, I find myself only watching them once or twice, so if something were to happen I probably wouldn't be re-ripping most of my collection. What would probably need to be re-ripped right away would be the Barney/Dora/Thomas DVDs for the kids. For music it's fairly quick to rip (and even faster to download :P ).

    The only things I back up are home movies and photos. For home movies I backup the uncompressed files, but for photos I don't back up my RAW files, only the jpegs. Those are backed up to external hard drives that I keep either at my desk at work or at my parents' place. If by some weird coincidence I would lose those as well, a great deal of my home movies were uploaded to Youtube (private) and selected important pictures to Flickr.

      With that much data, what it comes down to for me is what I absolutely do not want to lose or can't afford to lose.

    --
    It's better to burn out than to fade away
  190. Here's what I do... by gregerst22 · · Score: 1

    I have close to 10TB of data on my home server and the most cost effective solution I could come up with was to build two servers the 2nd one mirrors the data of the primary using DFS (Microsoft Distributed File System). Neither server uses RAID for redundancy, just Spanned disks. This makes it more cost effective because I'm not wasting a drive on each server for parity. If I lose a drive I can replace it and restore the data from the mirrored copy on the other server. I probably have a more elaborate hardware setup than most because I tend to do a lot of testing using Hyper-V and VMware. The backup server is a small and efficient Mini-itx system in a Chenbro SR30169 compact server case with 4 hot-swap bays. Even with 16GB of RAM, Core i5 4570S CPU, 120 SSD, and 3 SATA drives it only draws 40 watts from the wall. The primary server is more powerful ATX system with an 8 core Xeon CPU, 64GB RAM, SSD, 3 SATA, but still only draws 60 watts at idle using a gold rated power supply helps.

  191. Seriously by Anonymous Coward · · Score: 0

    Are you fucking stupid? Either you care about the data, or you don't. If you care about the data, buy a tape drive. This problem has been solved for decades.

    If tape is too expensive, or it takes too long to back up and is too much effort ... then you don't care about the data. Simple.

    What a fucking waste of time.

  192. Make a torrent by kodomo · · Score: 1

    Easy, make a torrent and name it as porn. Soon enought you will get some hundred of seed, an will be a distributed backup easy to download.

  193. Put 5 4TB INTERNAL Drives in another box on Networ by Anonymous Coward · · Score: 0

    Use an old computer box and operating system like Win XP w/3GB memory, and put 5 or 6ea - 4TB drives in there and back up across the network. Simple and only ONE time up front cost instead of a monthly $$$ hit.

  194. rsync by Anonymous Coward · · Score: 0

    Rsync

  195. Talk to your neighbours/friends/relatives... by Anonymous Coward · · Score: 0

    Offer to keep backups of their data in exchange that they keep backups of your data.
    Each party must supply the other with storage hardware. If it is just next door you could run direct cable/wifi to your backup.

    Ultimately (though I am not aware of any current projects) I could imagine the need for an open-source project that offers a global distributed encrypted storage system. It would work kind of like bit-torrent/tor/bitcoin all rolled into one, you would give some drivespace and bandwidth to others so you can replicate your own data into the system. You would get a storage key (like a bitcoin wallet) for your data. The data would always be replicated across multiple nodes for safety in case of node drops. You could also restrict access to your own node for certain time periods (like when you are sleeping). That would allow one half of the planet to back up the other halves data. IMHO if done right and with enough uptake it would be more reliable than anything any corporation could offer, it would turn our planet into a single backup system.

  196. Seriously? by AzTechGuy · · Score: 1

    All joking aside... Seriously, get 20 tb of additional storage. I don't have 20 tb of data but I do have 3 tb. I did an initial "copy" at home, it took a while. Then I took that "copy" to work. I now use robocopy to copy the differential daily over vpn. Most of the time the job takes a min or less, if there are new items to copy, it takes whatever time is needed to copy over the new data. I am able to copy over 3 gigs in about 20 or so min. If your 20 tb is changing daily, yea, you're screwed. But if you only have a couple or few gigs a day that change, this will work. Granted, 20 tb of drives off site may be an issue. I just have one 3 tb drive and before I had that, I had 3 1 tb drives. Also, my upload speed at home is between 5 and 9 mbps, that's the bottleneck

  197. Just spend the money already by swb · · Score: 1

    The problem with a "solution" here is there's no way to know how the data is organized.

    I'd say any relatively hack-free solution will involve a commercial backup application and a storage array of sufficient size to handle at least one full backup and some chain of incrementals.

    Ideally the backup array would be of sufficient size and disk count that you could gain some small protection by creating independent disk groups each capable of each holding an independent file system for a full plus backup chains. I say this having supported large backup arrays where monolithic file systems were created only to corrupt, causing the entire backup to be useless. It doesn't protect against failures caused by faulty array controllers or enclosure failure, but nothing does but multiple complete arrays.

    Decent commercial backup software will make the job simpler with compression, deduplication, intelligent incremental management, cataloging, etc.

    CDW says $9,000 will get you a Netgear ReadyNAS with 12x4TB disk. In RAID-10, you'd have 24TB to work with. Combined with decent backup software this would result in a fairly painless way to backup that much data and manage it.

    If you had nothing but time on your hands, you could roll your own solution with rsync, de-duped ZFS, etc but the hardware piece is still not cheap and rolling your own is nearly as expensive with a lot more headache.

  198. Data Deduplication by Anonymous Coward · · Score: 0

    ^^

  199. Build a sequential burner by spiritplumber · · Score: 1

    https://www.youtube.com/watch?... It's super easy, I will gladly send you the schematics.

    --
    Liberty - Security - Laziness - Pick any two.
  200. Re:How Do You Backup 20TB of Data? by Anonymous Coward · · Score: 0

    You are just 3 order of magnitude off... You need 5000 DVDs for 20 TB...

  201. One recommendation by Anonymous Coward · · Score: 0

    Blackblaze. $5 a month, unlimited.

  202. Removeable Media Bay.... by David_Hart · · Score: 1

    I bought a Syba 5.25-Inch Dual Bay Mobile Rack for both 2.5-Inch and 3.25-Inch SATA HDD Plus 2 USB 3.0 Ports SY-MRA55006 for my latest desktop build. You could then buy 7 or 8 3TB drives, back things up, then store them someplace. After the first full, you could take incremental backups for a while. You would have to refresh it every so often but my thought is that the backup should be good for at least a year. Just make sure that the drives aren't stored next to the microwave...

    Of course, the enterprise solution would be to buy a SAN or NAS, fill it with storage, and use data duplication software.....

  203. You never need backups until you need backups. by Anonymous Coward · · Score: 0

    "Blu-Ray discs can hold a lot of data, but that's a lot of time (and money) spent burning discs that you likely will never need."

    You have your answer, you're just not willing to put in the effort and despite actually needing them you still manage to avoid the work by saying you don't need them.

    Get a 4TB HDD, back up changes to that, flush changes to BD periodically. Not all changes that make the HDD need to go to BD since you're not keeping a full history there, just a recovery point. Example, you add a file and then delete the file this month, it doesn't make it to BD next month.

  204. Pre-Seed an Offsite Backup by Anonymous Coward · · Score: 0

    Get a friend to buy a similar disk array to yours. Bring it to your house and copy all the data over (pre seed). Setup the array at your friends house and setup a continuous sync between the two arrays. BOOM! Personal cloud.

  205. Save increments since the previous generation by Fencepost · · Score: 1

    There are two decent approaches: backup or mirror your setup offsite OR archive the previous generation intact and do incrementals starting from that point. I'm assuming that a home user isn't going to be picking up a $2000+ LTO-6 tape drive and swapping in 8+ $65 tapes for each full backup.

    The first is to have your own offsite storage that you back up to, where the backup is (at least) as large as the original. Multiple people have recommended Crashplan, and that's certainly a viable option. There are undoubtedly other options that could do similar things depending on how down into the weeds you want to get - rsync, the various rsync-based versioning backup solutions, git-annex as mentioned by someone else though that one's new to me. I'll note that from experience with Crashplan's Enterprise product on some older 32-bit servers, the client software can chew some fairly significant memory when you have a lot of files or data.

    The other and probably simpler option is that when you start to near capacity on the storage system, don't upgrade it - shut it down and store it, preferably not in the same (not-yet-burning) building after building the new system and copying the data over to it. After you shut the old one down, keep backups of anything you've changed since that "checkpoint" system; hopefully your data isn't changing that rapidly - 20 TB seems to me almost guaranteed to be mostly static.

    --
    fencepost
    just a little off
  206. External drives by CCarrot · · Score: 1

    I would definitely say external drives for the irreplaceable data (photos, home video, scanned images, voice clips, documents, etc.). The rest is already *cough torrents cough* backed up for you. Yes, it would take a while to rebuild, but ultimately it's available.

    I would also perhaps back up any older or hard-to-find collections to the hard drive, or any particularly cherished movies (kids movie collection, perhaps). Personally, I back up everything to three 4TB external drives because I have the ports available on my server, but if you don't then back up what's important and don't worry about the rest...

    Your only other option, really, is to get a 6-bay NAS and some hard drives to fill it. This setup would run you around $2,000, but then you'd be able to back up all of teh things...until your data grows beyond 20 TB (assuming you'd put the NAS into Raid 5 at least :)

    --
    "I love animals! Some are cute, others are tasty, what's not to like?" - Betsy Schroeder, Jeopardy contestant
  207. I'm sure this was answer already but I got bored t by EricSon1 · · Score: 1

    I just did this with a friend and we don't have 20tb but we both have about 8 so 16 total. rsync or brfs,zfs send snapshot. If you have the space keeping multiple snapshots should protect you from accidental delete. Do the first back up on site then set a cron job weekly or daily. This is easy to set up on any low power cheap Linux box with a couple USB 3 4-bay enclosures. If using ZFS make sure you have enough memory. You can run encryption on each users space so only you have access to your data. Just a thought.

  208. Cloud Service that Might Be Useful by Anonymous Coward · · Score: 0

    Bitcasa.com might be a good one to look into, set yourself up for a long first backup process, then get happy and keep on keeping on.

  209. Colocation by Lyttek · · Score: 1

    Duplicate your existing hardware storage setup and then send it to a colocation datacenter. Then you'll have one offsite copy at a fixed price. Downside: you only have one copy, but that's better than none.

  210. Another RAID by Anonymous Coward · · Score: 0

    To back up one RAID that large, you really need another RAID, maybe half again as large. Removable media just doesn't cut it.
    Then you use something like an rsync wrapper (fast, deduplicates some) or backshift (deduplicates better and compresses).
    http://stromberg.dnsalias.org/~strombrg/backshift/documentation/comparison/index.html

  211. High-end requirements require high-end solutions by operagost · · Score: 1

    Wait-- you have 20 TB of data, yet are complaining about expense? That's far above the media requirements for home storage. You're in enterprise territory. Fast, reliable, or cheap: pick any two. Since reliability is not negotiable for backups, you have two options.

    Buy an LTO autoloader and tapes. This will cost about $3,500-4,000. You may also need to buy backup software for another few hundred dollars. You'll be able to back it all up within a day, and backup new files in minutes.

    Buy the Crashplan unlimited home service, buy the seed drive service to get the first few hundred GB started, and you'll be set in a few days to a week.

    --

    Gamingmuseum.com: Give your 3D accelerator a rest.
  212. One Word - Synology by Anonymous Coward · · Score: 0

    I use a Synology 1512+ with 4TB drives in RAID5 configuration. The entire system cost under $2000 to build (of course, you may need a slightly bigger system or a different configuration to put all 20TBs in one system).

    I replicate some of my critical data to a friend's Synology for an offsite copy. I return the favor by letting him use part of my array. So far, things have worked great and while I haven't had the need to do a full recovery of my data, I feel fairly confident in this solution. Synology works as an iTunes server and a target for Time Machine backups for Macs.

    1. Re:One Word - Synology by Anonymous Coward · · Score: 0

      I would also not recommend spending your money and time on content that is created by someone else. If 20TB of data is what you have spent effort in creating, then it makes sense to protect it. But if these are just music and videos downloaded/ripped/stolen/whatever, consider a subscription service like iTunes match or something else.

  213. The Anti-Solution by mr.dreadful · · Score: 2

    Do you really need to back-up that much data?

    I'm just speaking generally here, there are certainly cases where someone would need to back up this much data, but for your home media library? If we're talking movies, 20 TB is roughly 20,000 movies (for sake of argument, I'm not considering music). At what point is this just digital hoarding? I used to keep a large collection of movies, mostly pirated, and eventually realized that:

    a) I was spending more time and money managing the collection then I wanted to. b) That I rarely watched many of the items in my library. c) That I was placing myself in legal jeopardy by storing so many illegal copies. d) Anything I did want to re-watch I could get from Netflix, the public library, or download.

    Music would be slightly different, as I could see where music is in some kind of constant rotation, but again, how much of it are you actively using? I'm just playing devil's advocate here, but I think this kind of collecting/hoarding is a byproduct of pre-internet scarcity.

    1. Re:The Anti-Solution by Anonymous Coward · · Score: 0

      I can't believe nobody has mentioned this. The friend has a huge positive side to this event. Dude you got 20 TB of free space. OMG just think of the possibilities! What can you do with 20 TB of free space!

      (reminds me of an old BOFH article :)

      My take on backups is buy additional HDs for backup and swap them out (cheap eSata/USB HD dock for $20-30). I bet that's cheaper than a tape machine since the only prices I've seen on LTO out of this thread is $2000.

      My other thought is culling the collection. Back when netflix was new I'd rip copies of all the DVDs I got. Before I had 100s of DVDRs I figured out that I spent a lot of time on these rips and burns and while that was really cool to have a DVD collection I re-watched 1% of the rips. It was a huge waste of time and money IMO. I think having a local collection on tap is awesome but as it grows the amount of effort to curate and maintain it starts to feel like a real job.

    2. Re:The Anti-Solution by Anonymous Coward · · Score: 0

      Where in the article does it say it's digital media?

      It does not.

      And even if it was, I have plenty of rare movies in my collection that you can't buy on Amazon or find on Netflix. I have even more rare software that you can't buy anywhere. Some people keep private digital media collections because they don't trust our corporate overlords to be diligent in their curation. It's happened enough times that corporate media companies have erased content, eg. Doctor Who, that it is wise if you have rare films, tv shows, music or software that you should diligently curate them.

      Then there's plenty of other ways to end up with 20TB of data without being a media historian. I do audio production for live music, and the multitrack recordings from each show can be 100GB+ in size. If you take a lot of photos with a DSLR in RAW mode, archiving photos could easily reach 20TB in size. Doing GIS, specifically land-use change studies, can result in enormous datasets of aerial photographs. If you do simulations, it is often more effective to keep the output of a simulation run than to re-run it, though perhaps then it is not so necessary to keep backups of the simulator output.

  214. Free homeplan is local only by Anonymous Coward · · Score: 0

    The free version of their home service only backs up locally. To backup to the cloud you need to pay the (completely reasonable) $5 a month.

  215. Re: ZFS snapshots by Anonymous Coward · · Score: 0

    ZFS snapshots work well. I have 12TB on a server in my garage, and another 12TB on a different server at a friend's house. I occasionally "zfs send" incremental snapshots to an external drive from the main server, drive over to the other server, and "zfs recv" the snapshots to keep my two servers in sync. USB was pretty slow, so now I just have an extra SATA data and power cable hanging out of each server, and carry a bare 2TB hard drive in a plastic box. That makes 1TB incrementals almost painless.

  216. Backblaze $5/mo all you can eat. vs tape by Anonymous Coward · · Score: 1

    You really just have 2 options.
    1. Tape; LTO6 which will run you about $3000 plus another $250 for tapes; But the data can be backed up often and recovered very fast.
    2. Offsite; I have seen a lot of suggestions, but i didn't see Backblaze on the list. Backblaze is $5/m all you can eat. The problem is, recovering those 20TB might take you a few months as they cap the speed at which the service operates. I Think there are options to have a drive sent to you for like $250 plus an hourly rate. All that said, still cheaper then tape, just not going to get your data fast.

    I personally go to the tape route. Tapes will archive for 30 years if kept in a cool clean environment, so you don't have to worry about bit rot as much as you do with just keeping stuff on a NAS.

  217. Digital hording by Barlo_Mung_42 · · Score: 1

    Just like having piles of stuff taking up all available space in your home is a problem having a 20 TB media collection could be a sign of a larger problem. I'd recommend going through it and getting rid of the things that will never be watched / listened to again or can just be downloaded again. Then worry about backing up what's left.

  218. Sit Tight by DaveJ45 · · Score: 1

    Just sit tight until these are perfected, and then buy a couple of dozen-

    Next-gen “Archival Disc” will squeeze 1TB of data onto optical discs
    http://arstechnica.com/gadgets...

    Either that, or install a duplicate RAID to back up the first one....

    --
    Differences between how you act when some one is watching, and how you act when no one is watching, define who you are
  219. I post everything to Usenet by sandbagger · · Score: 1

    I figure that's good enough.

    --
    ---- The above post was generated by the Turing Institute. Maybe.
  220. Close by Anonymous Coward · · Score: 0

    Actually the free plan lets you backup to an offsite location. But not to the cloud.

  221. Do you really need 20 TB online ? by eulernet · · Score: 1

    Let's suppose you have 200+ movies. I very much doubt that you need to have instant access to them, since you'll probably watch them only once.

    Why don't you just burn what has not been used since a long time on DVD, and then catalog your DVDs ?
    If you have 4GB DVD, simply subdivide your data in 4GB folders, and burn at least one every day.
    If you fear that your DVDs vanish, burn everything twice and store them at different places.

    Benefits:
    1) you can probably reduce the 20TB to less than 5 TB that you need at any moment. Use the saved space to mirror your data
    2) doing backups frequently is a good habit that'll be useful in the future
    3) doing some cleaning will help you categorize your collection

  222. Navi by Anonymous Coward · · Score: 0

    Tell him to stop storing all those movies and simply stream them using Navi-X for XBMC.

  223. I use nearline RAID by egarland · · Score: 1

    My backup strategy is to keep the old drives from my previous array and put them into a second server, then back up to it weekly. I use a linux software raid 5 setup for backup, with the drives powered off unless the backup is running. I have a script that spins them up, starts up the raid, mounts the filesystem, performs the backup using rsync, then unmounts and powers down the drives. I only can back up about 1/3rd of my main array, so I have to be choosy, but a large amount of what I have stored is replaceable non-original content that I'm content to simply have one raided copy of, so I just exclude the right folders and I'm good.

    The servers are currently in the same room, which makes me uncomfortable, so I've long considered creating a mini-server for a relative and setting it up in their home as an offline backup. Using a commercial service would probably make more sense, but I'm not sure I'm comfortable with that yet.

    Another thing I'm considering for my next setup is using ZFS for the backup filesystem and keeping snapshots as long as I can for a combination backup/version control. I'm interested in how efficient that would be with vm disk images where the file changes every time, but only small parts of it. Would it detect the unchanging portions, even if rsync re-writes parts that didn't change, or would that cause duplicated space usage? Does anyone have experience with this?

    --
    set softtabstop=4 shiftwidth=4 expandtab nocp worlddomination
  224. NSA does it for you. by Anonymous Coward · · Score: 0

    NSA does it for you so you don't have to.

  225. NAS RAID by Anonymous Coward · · Score: 0

    There's plenty of them.
    Best bet, IMO.

  226. Media server with raid 5 by JacobLeclerc · · Score: 0

    Lecie big 5 networked servers could hold that much data. Save doubling your storage by using raid5. Better speed and you can save all your data if a drive fails. Some of these options are affordable but 20tb will probably cost you a bit. You could also use the drives you already have in a barebones setup.

  227. How Do You Backup 20TB of Data? by bibendum59 · · Score: 1
    The same way you eat an elephant....

    ...One Byte at at time!

    Thank you, thank you. I'll be here all week.

  228. optical media by confused+one · · Score: 1

    archival quality optical media in a robotic silo. 100 year guarantee on your data. Storage space only limited by the size of your silo.

    1. Re:optical media by ewilts · · Score: 1

      100 years of your data is absolutely meaningless if you can't read the media.

      Do you have anything around that have the ability to read a tape that was written 20 years ago? 5 years ago?

      Darn, I can't listen to my 8-tracks!

      --
      .../Ed
    2. Re:optical media by confused+one · · Score: 1

      I do have devices that can read tape written 20 years ago. 8-track was from the 60's and 70's by the way, making it a bit more than 20 years.

      First CDs came out in early 1980's. That's 30 years and the drives are still backwards compatible -- I can still put in my old Pink Floyd CD into my Blue-ray drive and the computer will play it. There is no reason to believe that won't continue for another 10 years or so as music is still being released on CD in commercial quantities. Despite the hype optical disks are far from dead.

      Archival quality optical disks have a good chance of continued support because there is a substantial effort being put into them in the background by people like the National Archive, Library of Congress, and others looking for means to maintain long term records storage. It's not guaranteed that they will be around in 100 years. Maintaining any system that long is difficult, even simple paper. It's got as good a shot as any though.

  229. It's a shame... by Guest316 · · Score: 1

    ...that I never got around to implementing my Redundant Array of Free Email Accounts virtual drive idea.

  230. Thumb drives by Anonymous Coward · · Score: 0

    In January of 2013 (one year ago), Kingston announced 1 TB thumb drives. They are pricey ($2000). But 20 of them would back up 20 TB. Cost: $40,000 (sell the Lexus as used and you are good to go). Choice 2: I bought a 2 TB drive last week for $69.- 10 of these will cost you $690.- and will get you there. Choice 3: A BD writer (blu-ray(tm) disk writer will cost you $150.- plus a 50 pack of blank disks will cost you $130. 50 blank disks will give you 2.5TB, so you will need 8 of these packs, so $150+8x$130=$1190.- ...I don't have any good options on hand for tape storage. Personally I use a NAS for backing up software and data on the computer, and archive movies on DVD. Its widely available, cheap, and for me 'good enough'. I recently had a drive die about a week ago, and its *very* annoying, but it seems that I've been through the pain of it often enough that I can pull out data recovery tools and install new disks almost in my sleep. Spinning disks of rust are cheaper than spinning disks of pricey plastic, and that is wildly cheaper than solid state chips.

  231. options... by itzdandy · · Score: 1

    really, there are 3 options
    1) a second array
    2) tape (biggest are 5TB right now...)
    3) online

    1* RAID6 (or raid5+hot spare) would be 7x 4TB drives and could be built for about $1200 using a cheap workstation and external drives w/ freenas
    2* tapes would be expensive and cumbersome IMHO. Also expensive!
    3* I say this is an option but it's not realistic. if you have a typical 4Mbps upload from Cox/Charter/etc then the initial seed would something like 2 year!

  232. sounds like an OCD hoarding issue by peter303 · · Score: 1

    Who is going to revisted 20,000+ hours of viedo and music?

  233. Build another server but... by Anonymous Coward · · Score: 0

    Careful with this.
    If a fumble fingering kills the data, rsync will happily duplicate that fumble and delete your backup.
    Either enable some sort of version control, or setup the rsync so that it won't sync a massive change. (but will email you to do it manually)

    1. Re:Build another server but... by EmagGeek · · Score: 1

      Sorry, I thought the "be smart about it" was implied. I forgot that I was talking to someone who deleted an entire RAID array by accident.

  234. Store it in the Internet. That's what it's for. by D1G1T · · Score: 1

    My dad is a bit of a hoarder. I tell him to "store" his broken toaster collection at Goodwill. They will have one when he needs one.

  235. Mike by Anonymous Coward · · Score: 0

    Unless your lab burns down.

  236. 6-year backup time? by Anonymous Coward · · Score: 0

    20 TB backup to the "cloud"?!?!?! What the hell kinda internet connection do you have? Mine is 1 megabit/second upload. That would take over 6 years to complete the backup.

  237. No free lunch by Larry_Dillon · · Score: 1

    Agreed. If you want backup, you have to pony up. You have to either buy twice the disks, an expensive tape drive (or a cheaper tape drive a lot of tapes) or pay for bandwidth and off-site storage.

    --
    Competition Good, Monopoly Bad.
  238. Blazing fast and astonishing capacity by Anonymous Coward · · Score: 0

    The most practical solution, considering the type of data:
    just copy everything to /dev/null

  239. Hybrid solution.. by Anonymous Coward · · Score: 0

    I split my data into two.
    Important stuff I can't recreate easily. (email, pictures, music, around 200G)
    This data is fired out into the cloud on a fairly regular basis.
    The rest(~11TB) It'd be a pain to get back but it can be gotten back. I threw all my old drives into a JBOD and rsync to that every few months.

  240. This dude I know... by Anonymous Coward · · Score: 0

    No "this happened to a friend" story should be accepted on Slashdot, and everybody submitting one should be banned for life. Especially when the question is needlessly contrived and every reasonable solution has been summarily excluded. I'll even grant videos can take a lot more space than that, but it's perhaps worth mentioning 20 terabytes can hold quarter million albums, or well over 30 YEARS worth of music. Based on that, one might assume this guy is trying an experiment where he's recording every sound around him for his whole life.
    In any case, if he cares enough about Slashdot users' opinion and is likely consider advice from Slashdot, he should spare 5 minutes to ask the question himself. Unless he already did, in which case the only thing we can conclude is the person asking the question is lying about details of the problem.

  241. How about... by Anonymous Coward · · Score: 0

    Don't rm -r your RAID, fool.

  242. Drobo 20TB Array by Anonymous Coward · · Score: 0

    Drobo 5N 5-Bay NAS Gigabit Ethernet Storage Array and 5 x 4TB Hard Drives providing the necessary storage:

    http://www.bhphotovideo.com/c/product/907157-REG/Drobo_Drobo_5N_20TB_5X4Tb.html

  243. Let Go of Attachment by mutoneon · · Score: 1

    Your friend is clearly a hoarder. 20TB is absurd for a private collection of unimportant media. Instead of looking for ways to back up the Library of Congress, your friend could seek counseling to work through his attachment issues.

  244. Drop the OP as an advisor by whitroth · · Score: 1

    "Asking around among our tech-savvy friends though, no one has a good answer to the question, 'how would you backup 20TB of data?'. It's not like you could just plug in an external drive, " tells me that you have NO "tech-savvy friends". None. Zip.

    Right now, I'm on my biweekly offline backup - that's where we rsync from the online backups to offline backups. This is the 10 3TB drive, if you're interested, out of 13.

    Now, if you actually had any "tech-savvy friends", as opposed to people who think they're "power users", they'd have pointed out, first, that what your tech-savvy-friendless friend had was *not* a 20TB file, but many, many files. It's certiainly not any kind of problem to partition them - y'know, divvy up the RAID and have movies and music subdirectories, and break that up by moving all the movies whose title starts with "A" under /movies/A.... and then rsync (or however you prefer) copy enough to close to fill one drive, then swap drives....

    Oh, and why can't you do it in an external drive? Certainly, that's what I'm doing *right* *now* as I type with those 13 3TB drives.

                          mark

  245. Copies. by ledow · · Score: 1

    What did he use to store that 20Tb in the first place? I'm assuming we're talking a large RAID array. I doubt, from the sound of it, that we're talking software RAID. So, at a minimum, it'll cost AT LEAST the same as building that RAID again to have a backup, no matter what the medium.

    Yeah, you're throwing 50% of your money away - on nothing but backups of shit you've probably downloaded or taken off discs you own anyway. So now he probably sees quite what that data was "worth" anyway.

    To be honest, nowadays, for home use, just build another RAID the same size and mirror the data across.

    Oh, and if you're that daft with 20Tb of data that you press the wrong button and wipe out an array that you have recovered several times over, you shouldn't be let near the low-level storage. Use a filesystem, or even just access layer, with some kind of snapshotting / rollback.

    Buy a cheap NAS, or just build yourself a new RAID from scratch and use the "old" array as a redundant copy of the data. Keep it powered off and somewhere else except once a month or whatever when you mirror across.

    Backup speed will be good (the speed at which you can interconnect the two computers, basically, probably Gigabit Ethernet for the cheapest scenario), restore speed will be the same, media will be cheap, no fancy software or hardware required, you can re-use your old setups and just buy a new one when it starts getting full and your backups are literally working copies with no further action required.

    If that turns out to not be good enough for your needs, that's when you can look at tape and other stuff. To be honest, tape is dying. The places I've seen have weaned themselves off them and just replicate to as many places as possible (including an occasional "offline" copy to prevent automated spreading of bad/corrupt data to the backups).

  246. Check the data first by DirtyFly · · Score: 1

    Some times I do hoard some data, usually after some harddisk change of something, in wich case i get a folder named olddisk with the contents of the old disk, this method does lead to having a bunch of such folders, like olddisk1 olddisk2 and so on, I usually take far too many photos on my trips so I end up with GIGs of photos, I can imagine a person who takes vidoes of everything would fill 20TB in a couple of years... anyway, im trying to lead this to a non piracy based all music and torrented video scenario. what I do,bear in mind that I only have 2 TB of stuff, is : run a duplicate finder program,maybe you (ops, I meant your friend) have to much duplicate stuff. if there are many videos, are they in the apropriate quality and sampling rate ? do your VHS tapes really need to be in 1080p ?

    then


    1) use a raid station to backup. I have a synology I use for bck.
    2) keep the sdcards after they are full. I do not erase sd cards from my camera, I just store them remotelly, cheap baackup for one of the most precious and inpossible toreproduce files.
    3) get 10 2TB hard disk to copy your stuff and store it :) I really cant visualise getting 20TB of data for a home user...

    jc

  247. OMNI solution by eric31415927 · · Score: 1

    I remember a puzzle in OMNI some decades ago, where an alien had to transport the knowledge of the Encyclopedia Britannica in its space ship away from Earth without carrying any additional weight.

    The solution was to transform all the data into a single rational number between 0 and 1 and to etch a scratch on the surface of the Alien's space ship, where the size of the scratch would correspond with the single rational number (say in inches or some comparable measuring units). It was apparently possible for aliens to etch and subsequently measure distances at the subatomic scale.

  248. Re:How Do You Backup 20TB of Data? by Anonymous Coward · · Score: 0

    Not to mention that I've been going through CD's and DVD's that I burned back in 2001-2003, had stored in a zipped closed CD/DVD 150-disc binder for 10 years, and am copying them off to a NAS... and am finding that about 1 in every 25 has corruption and can't recover the files. Literal bit-rot due to dye break-down (not from light... but from age and ambient air.) If you advocate storage on optical media, this is a factor you need to contend with.

  249. Redundant Backup in a Self-Healing ZFS Array by Anonymous Coward · · Score: 0

    I myself have a 20TB NAS/iSCSI SAN at home using FreeNAS, 16GB ECC RAM, and 8 4TB HDDs in a Raid-ZFS2 Array connected to my home lab with dual 10GB NICs. It cost me under $2000 to build and when combining ECC RAM with ZFS you have self-healing, double-parity fault-tolerant data storage. I have it filled with VM Images and backups of 4 PCs. 20TB, contrary to what many might say, is not an excessive amount of data. That is about average for someone in IT, Digital Art/Photography, a Musician, or a household of 4+ people.

    (And yes, I can empathize that there is a major time investment in ripping a large CD/DVD/Blu-Ray library to HDD. It took me two years at @ 5-8 hours a day to rip my 1400 CDs to FLAC, 1000 DVDs, and 250 Blu-Rays to disk. I really don't want to ever do that again, thank you very much!)

    Just prior to this I had lost everything digital I had accumulated over the course of 20 years after simultaneously losing both my PC's HDDs (Raid 5) and losing > 2 drives in my Raid 5 backup array (including 10 books I had written but not yet published, 10000+ pages of research, and all the pictures of my child from birth to her teenage years, and decades of correspondence that was irrecoverable), I realized the hard way that one on-site backup is not enough. If your data is critical to you then you simply need redundancy. So, when I built my NAS/iSCSI SAN I ponied up and built a second one to store off-site and use rsync to nightly mirror the one at home, although I could just as easily do snapshots too (I work as a SysAdmin for a Data Center so Co-Lo doesn't cost me a dime, but one could easily do Co-Lo for free by finding a friend who is willing to host your backup NAS in exchange for you hosting theirs. If you build your NAS right it should be a small ITX form factor, silent, and use less than 40W).

    You can also use Crashplan with FreeNAS for a remote tertiary backup of all or just the most critical directories...but personally I wouldn't trust a Third-Party Vendor for your only backup. Companies come and go, or may not hold themselves responsible or liable for the integrity of your data. It's too much of a risk for a primary backup, however for a redundant secondary or tertiary backup it is a wise idea.

  250. What's wrong with rsync? by MMC+Monster · · Score: 1

    Steps:
    1 - Get a RAID similar to your main storage to use as backup.
    2 - Put the second RAID in a relative's house, where you can get access to it.
    3 - Have this backup run an rsync over ssh once a week/month, pointing at your main storage array.

    With proper ssh key exchange set up ahead of time and using an ssh username and port that are non-obvious (with ssh on your main system only allowing known keys and not username/password combinations), you'll do pretty well against everyone except a malignant government entity.

    --
    Help! I'm a slashdot refugee.
  251. AltDrive FTW by djohnsto · · Score: 1

    $50 per year for unlimited data, and you can use your own encryption keys to encrypt prior to upload. Will take a loooong time to back up that much data initially, but incremental updates are pretty quick (depending on how quickly you add new media).

    Note: Not affiliated with altdrive, just a happy customer. altdrive.com

    --
    Dan
  252. Best way to have the backup for free by GiuseppeX · · Score: 1

    Write to some friends that you have 20GB of Al-Quaeda training footage and the NSA will do the backup for you. (PS: Use another set of hard drive to backup and never have the original set up as RAID array)

  253. Similar. by Jason+Pollock · · Score: 1

    I've got a large collection of movies (12TB). My backups are the physical DVD/BluRay/CD media. It does take a bit of time to restore a 4TB drive, rips are typically about 1GB/minute for BluRay or DVD.

    My recommendation: don't store the collection as a single RAID array. That way, when you lose the array (which will happen), you don't lose the entire collection.

    Personally, I'm too cheap to pay for the extra drives to implement mirroring, so I just use JBOD.

  254. Maybe by Anonymous Coward · · Score: 0

    Your friend should consider dating for a while.

  255. Double or nothing by duke_cheetah2003 · · Score: 1

    To me, the only feasible backup strategy for a home user (like myself) for LARGE volumes of data (I have 2TB, not 20, YMMY) is to keep two copies. One being your working copy, that you have in an active server, the other copy you should keep in a safe and rsync to bring it upto date every few months.

    If your volume is really 20TB, which seems extreme to me (do you really need all your DVD's and Bluray's on a media server? With netflix and other online streaming services?) then I guess you're gunna need a tape backup of enterprise-level quality. Expect to pay for it. My personal massed music collection after 20 years of collecting music is like.. 30GB. That's a lot of music too. So I think you should look at what your storing, reducing it to the stuff that's truly irreplaceable.

    But bottom line, to me, mirroring your drive(s) and sticking copies of them in a safe is the best backup strategy for a home user.

  256. What software can handle multiple HDDs well? by grumbel · · Score: 1

    Simply going for multiple USB HDDs seems to be the obvious option (cheap, extendable, can be stored offsite and offline, etc.). However what would be some good Free Software to actually handle the backup? Common solutions such as duplicity, rsync, rdiff-backup, etc. all seem to assume that your backup target directory can hold the whole backup all at once and that the whole backup is online at the same time. While one can probably hack something together with union mounts to accomplish that, it seems like a very cumbersome and fragile solution.

    Is there anything that allows you to just copy the data to a HDD and then plug-in a new one when the old one is full? Preferably in a data-format that is robust enough to handle some backup HDDs dieing without destroying the data on the other drives (i.e. no incremental changes across HDDs).

  257. LTO6 + Autoloader by jdkc4d · · Score: 1

    I would suggest: LTO6 + Autoloader. It's not pretty, but it will get the job done.

  258. Build it. by Anonymous Coward · · Score: 0

    A PC with 5-6 4TB drives, A Tower Case, a Mainboard with 6 SATA header, and a big Power Supply.

    Then I would suggest CentOS and some scripting, or Openfiler if you would like a GUI.

    That is it. No other choices. I think you could pull together a box new for under $1000, or buy a used server online with 6-8 bays for $300-$400 and pick up the drives separate.

  259. Backup suggestion by wolfguru · · Score: 1

    Buy a basic PC chassis and a MB that has multiple SATA ports, with a raid bios. Add 5 3T or 4T drives in a simple raid5 config, and use a dedupe program and some basic backup / sync software to run an incremental backup. It will take a while to initially get it all into the baseline, but a job will pull whatever has changed (at the file level, but that isn't too bad for this) and any decent dedupe application should get the files to under 50% and leave plenty of space for the offline de-dupe to work. Given the deals on drives you could run this pretty reasonably for under $600 or so with a little careful shopping. Set up the machine bios with a wake time and power down time to minimize power demand, or just leave it running. Not free, but compared to the cost of replaying 20T of files, music and pictures, a lot better than a poke in the eye with a sharp stick.

  260. Crashplan by Anonymous Coward · · Score: 0

    I've been playing with Crashplan in San Jose and the ISPs in my neighborhood. I'm getting about 400-500 kbps sustained through Comcast, and was getting a similar number via AT&T Uverse, in both cases with plans that should have in theory been able to outstrip that by a good margin. As near as I can tell, there's a bottleneck eitehr between here and CrashPlan, or within Crashplan itself.

    I'm going to put a couple disk's in a friends rack system and handle it myself, but I would strongly recommend against using CP for anything more than about 1TB.

  261. Who cares? by Anonymous Coward · · Score: 0

    It was about 19.5TB of music more than he ever listened to. People are packrats - instead of figuring out what they use and what they don't use, they save everything. Even those things that are not worth saving.

  262. Backblaze by Anonymous Coward · · Score: 0

    $5/mo

  263. No by ThatsNotPudding · · Score: 1

    You probably wouldn't buy a car without an engine.

    No; you're contending one must buy a second engine.

  264. How about only backing up what's not available in by Anonymous Coward · · Score: 0

    What are you paying to maintain a 20TB RAID array, plus backup for this? Wouldn't be cheaper to subscribe to a $5/mo music and pay $3 per movie when you decide to watch?

    Backup your personal videos, photos, data. I don't see the value in backing up commercial content.

  265. At last! by FuzzNugget · · Score: 1

    A worthy adversary to rival my porn collection!

  266. More storage + rsync by Anonymous Coward · · Score: 0

    Set up another 20TB of storage and rsync daily

  267. Multiple eSATA cabinets and rsync? by rnturn · · Score: 1

    With 2TB and even 3TB spindles being pretty commonplace these days, why not fill up an external drive cabinet, make the entire thing into a RAID5 device and backup using rsync? May be a little pricey but how much time and effort went into creating a 20TB collection of data? I have a friend who did something like that (but using smaller Buffalo devices) for his small business by having several systems shuffle files around using rsync. In the event of one computer's storage failing there'd still be 2-3 others on the network with a copy of the data. And, if memory serves, he had one system that had a couple of arrays that would be rotated in/out and one of them kept offsite just in case.

    I'm still trying to figure out how much time it would take ripping CDs and converting from WAV to wind up with 20TB of MP3 files. Based on what Amarok is telling me about my music collection, a quick calculation tells me that that 20TB would amount to about 30 years worth of continuous music playback. I'd better get that ripping and converting started now if I want to have that much music for my great grandkids to listen to; it's probably already too late to get that done for my kids or even grandkids to enjoy.

    --
    CUR ALLOC 20195.....5804M
  268. It seems to me that anyone who could afford by mark_reh · · Score: 1

    20 TB worth of content in the first place should easily be able to afford a backup system for it. He did come into that 20TB of content by legitimate means, right? You can't legally transfer a digital copy of a Blu-ray disc to HDD, so it must be UltraViolet copies, so he must have the original Blu-ray discs...

    I recall a lawsuit that the RIAA brought against someone several years ago in which the defendant used an interesting argument to defend his having tons of illegally acquired music files on his computer/iPod. I may have some of the details wrong, but the argument was essentially that since songs cost $.99 each on iTunes, and an iPOD (at the time) could hold 8GB (or was it 16GB) equivalent to >$20K worth of music that no one in their right mind would ever pay for music to fill up an iPod. Therefore, Apple was encouraging people to get music illegally by providing a device to keep and play more of it that any sane person would ever buy.

    I don't think the guy won with that argument, but it does make one think about the huge HDD capacities that are available for very low cost. What would people ever have to keep that takes 3TB (a single HDD), if not a bunch of movies, TV, etc., the majority of which has been acquired illegally? I'd bet the number of people who could legitimately fill that sort of space (home movies?), let alone 20TB, is very small.

  269. Plenty of entertaining and not practical solutions by anewsome · · Score: 2

    I really did get a kick out of some of these responses. I sell data protection products for a living and 20TB is what I would consider an average small/medium customer. Every business these days has tens of terabytes of data. Of course they all need to backup their data, so there is nothing novel here. We have plenty of customers backing up hundreds of petabytes of data. Every dataset just needs a plan for backup, pretty simple.

    The way I see it, this guy has a few options. One option is to just get more disk and make redundant a redundant copy. This would have have saved him in this case of the mistakenly erased raid, depending on how smart his sync script is. But a redundant copy is not a valid genuine backup plan. So many types of failures will show the holes of the dumb redundant copy.

    The other option for a home user who's not looking to spend a bunch of money, is LTO6. They hold a sufficiently large amount of data, so only a handful of tapes will be needed. LTO6 drives are cheap enough, they won't break the bank. Since the data is on tape, you can shuttle the tapes to an off site location. Seems pretty simple.

  270. Backblaze by Anonymous Coward · · Score: 0

    I use Backblaze who only offer very reasonably priced unlimited plans. I currently backup about a TB. They will send you disk to speed up the restore process for a price. I’ve been impressed with them so far.
    http://www.backblaze.com

  271. LTO6 Tapes by Anonymous Coward · · Score: 0

    For purely backup/recovery purposes LTO6 internal drive will use about 10 tapes

    http://en.wikipedia.org/wiki/Linear_Tape-Open#Generations

  272. A fairly good solution by Anonymous Coward · · Score: 0

    Here's what you need if you hav ethe time and money to burn:
    2 boxes with quad core processors and 16 GB RAM

    12 x 4 Tbyte hard drives

    2 x 250 Gbyte hard drives

    2 x 256 Gbyte SSD drives

    FreeNAS

    For each box, install FreeNAS on the 250 Gbyte drive (optionally, you can use an SSD or raid 2 drives together) I recommend using a separate drive from your share volumes so you can at least bring up the box if you lose the shares. Format 6 of the 4 Tbyte drives and set them up as a zpool using ZFS. Determine the level of failure you want to maintain using RAID-Z. Set up the 256 Gbyte SSD as the L2ARC.

    The first machine is set up to always stay on. At night, the second machine will boot itself, run the appropriate disk utilities, take a snapshot of the first machine's ZFS filesystem and copy the diffs, and then shut down. If you want to be really safe, host the second box at a colocation, such as your neighbor's basement. This works fine, especially if you can string a network cable or connect to your wireless SSID from that location.

    If you have a drive fail in your first box, you can add a new drive and the filesystem will rebuild parity. If your whole server dies, you can boot the second box and have all the data available from the night before while you work on bringing the first box back to life.

  273. You still own all the original CDs by hduff · · Score: 1

    So I don't see the problem.

    --
    "I believe in Karma. That means I can do bad things to people all day long and I assume they deserve it." : Dogbert
  274. Step One by Megane · · Score: 1

    The first thing to do is to not create a single-volume RAID that spans several drives. Each drive should be able to stand on its own. Especially with not-quite-essential data like ripped DVDs. This way if one drive fails, you only have to re-rip one drive of DVDs. But most importantly, you can't erase them all with one command. I'm not sure how submitter's friend happened to do that, but it's exactly the kind of failure that RAID does not protect against!

    Sure, it's nice to have one big volume and not have to worry about switching over as they fill up, but unless you have some kind of advanced volume management that can deal with drives disappearing and let you easily add or remove drives of arbitrary sizes, it can come back to bite you.

    If you really want redundancy, use mirrored drives, or sync to a mirror volume, or whatever, just don't use RAID 5. Parity RAID seemed like a good idea at the time, but it's just begging for two drives (usually from the same manufacturing lot) to fail at the same time. And the system is loaded way more when you're trying to do recovery, which could cause another drive to fail from the extra stress. Even worse is that the size of modern drives means the sensitive recovery period is going to last longer.

    This advice is specifically toward storing large A/V libraries. The really important stuff (financial data, family photos) is going to be smaller. Keep it separated from the big non-essential A/V files and it should be easy to use multiple backup strategies like removable storage and cloud backup.

    --
    #naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
  275. LVM instead of RAID by Anonymous Coward · · Score: 0

    Of course, he didn't lose anything because he had all the original discs for his music and movie collection, right? Certainly, he wasn't downloading illegal copies of stuff...

    As for the question, the way I handle this exact situation is I run 2 volume groups (I use LVM instead of RAID), one of the volume groups serves as the main archive, and the 2nd volume group serves as the backup. They are identical in size, 4 hard drives each, currently at 9TB each. So long as I don't lose a drive from both volume groups at the same time then I won't lose anything. However, I actually do have every physical VHS/CD/DVD/Blu-ray that I have backed up to the server. I accept the risk over the cost of hosted solutions or redundant servers, etc... The one thing I do invest in and recommend everyone do so as well is use a good UPS to provide clean power and protection against sudden power loss.

    To mediate the risk of drive failures, I replace drives every 2-3 years before they have a chance to fail. In 20 years I have not had a single hard drive fail on me that was in active use. I just swapped out the 2 oldest drives in my server, for example. They were working fine, but I knew they were both more than 3 years old and overdue for replacement. I'm going to replace 2 more in the next month that are just over 2 years old. That's about $1,000 every 2-3 years to replace all 8 drives, if I did that every 2 years it would be less than $50 a month. Far less than any "cloud" service for that much data.

    If I were going to keep an off-line copy of the archive, I would use Blu-Rays and a program that could keep track of what had been stored already so that incremental backups could be done on a monthly basis. I don't feel like putting that much effort into it though, I would rather just re-rip everything in whatever new wiz-bang format the kids are using.

  276. If you can afford to raid out 20TB by s13g3 · · Score: 1

    If you can afford a 20TB RAID *and* have enough data of value to warrant *retaining* 20TB, then you can certainly justify the expense of a tape drive and corresponding tapes to back it all up.

    Tape is not dead, contrary to more than 3 decades of claims otherwise. It is, in fact, perfectly alive and healthy, and well worth using (with a proper backup/rotation scheme) when you have that kind of data volume to store.

    I've worked for Arcus/Iron Mountain and Recall both, and I can't tell you how many times over my years with those companies I've heard someone say "We don't need off-site backups" or "We don't need tape, we just have the IT guy take the hotswap drives home every day", only to have them come crawling back in tears weeks, months or years later when they've lost everything.

    --
    "Inveniemus Viam Aut Faciemus" 'We will find a way... Or we will make one!' --Hannibal of Carthage
  277. NAS by WoodburyMan · · Score: 1

    Easy. Get a 6 or 8 bay NAS and a bunch of 4TB drives to fill it. Set it up in JBOD. Only local onsite backup solution that's feasible. Keep it powered down and unplugged except when you make periodic backup. Offsite backup is more complicated, and unfortunately will have to shell out a lot for, and may not be feasible to backup via a throttled home connection upload speed. Around these parts in US most ISP's have 30mbit down, but only 3mbit or 4mbit upload. I'm being "Upgraded" to 60mbit down / 4mbit up next week. The upload to download proportion is ridiculous.

  278. More disks by DrHyde · · Score: 1

    I'm backing up 8TB at home, by rsyncing to another 8TB of disk space. It's been working reliably for years, starting back when a TB was a lot and adding/replacing disks over time.

    A 4TB hard disk is pretty cheap these days, so he just needs to get six of 'em and make another RAID array. Once you've done the initial rsync, I presume that subsequent changes will be relatively small, so transfer speed doesn't matter much, so he could hang them off a USB port in one of those USB-to-SATA dock things.

  279. The nature of things by Anonymous Coward · · Score: 0

    "Blu-Ray discs can hold a lot of data, but that's a lot of time (and money) spent burning discs that you likely will never need."

    That is what all backups are. Spending money and time making copies you likely will never need.

    You can't avoid spending money and time making copies you will likely never need, unless you choose to simply not backup.

  280. You're approaching small business sized data... by Anonymous Coward · · Score: 0

    When you reach that point, you have to look at solutions that fit. Offsiting is not gonna work unless your bandwidth is amazing (I'm guessing no). Really, it's going to have to be onsite, but you can do some things to reduce the size. It's most likely going to be a disk backup, since even older tape drives (LTO3 or 4) can cost a bundle when it comes to that many cartridges. You're better off buying the disk to meet the growth of the data, and backing up to some sort of deduplicaiton mechanism (zfs from Oracle, etc.). Some are free, some aren't, you need to research what makes the most sense and if you want a backup management app (like BackupExec or zManda), and use those to write to the device and manage the images.
    Regardless, it's the same old problem: good, fast, cheap; pick two...

  281. Build a second one by DrYak · · Score: 1

    You're better off building a second server.
    Then use one server as the live server (the one which access from the network to work).
    and the other as a server.
    - doing rsync and directory rotation [either ZFS/BTRFS/etc. snapshotting, or plain old rsync+hardlinks and directories] should work, specially that (unless you work in the video editing business) chances are that not a big chunk of the 18 TB change a lot. So you could invest into 24 TB of RAID-6 or RAID-Z2 and afford to keep a few daily/few weekly/couple of monthly+yearly snapshots.

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
  282. The Pirate Bay by Coward+Anonymous · · Score: 1

    All your friend's music and movies are already there.

  283. RAID is dead... by Anonymous Coward · · Score: 0

    To put it quite simple its their fault. RAID is dead and cannot support large data sets well. To augment the short comings of RAID you need to store multiple copies or use erasure coding. Storage is cheap so buying a bunch of drives to hold 20TB (40TB raw for 2 copies) of data will not break the bank. There are many solutions out there and quite a few open source projects that one can leverage.

  284. 20TB: history shouldn't be big by DrYak · · Score: 1

    And unless the question's asker is working in the video editing industry, chances are that not much of these 20tb change on a regular basis.

    It should be possible to build a 24Tb or 28Tb RAID-6(*) backup server, that could still quite a few daily/weekly/monthly/yearly backups, provided a space-efficient snapshot rotation system. (Not actually keeping separate copies, but either using a file-systems Copy-on-Write snapshots like BTRFS' or whatever is the ZFS equivalent, or using the old classic RSync+hardlinks).

    The only thing that you don't solve is disaster resilience (you'll need an offsite replicate for *that*).

    (*) At this size, hardware failure are going to be a certainty. RAID-6 (or ZFS's RAID-Z2) are the best solution against bitrot and for resilience against dead drives.

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
  285. How about 6TB of FLAC music? by Anonymous Coward · · Score: 0

    Greetings, dear reader-

            I've been pondering the same thing. Thanks to Usenet, I've managed to amass a collection of over 13000 albums of music. It's in a lossless FLAC format, and I hope to live long enough to listen to it all. I think of my 13-year-old self, having built my first decent stereo system, having the system and the collection I have today placed before him. He'd think he was in heaven, and his current self feels similarly. Anywho, I'm thrilled to have this collection, but.... It's currently on two 3TB drives, in an external box, the drives setup as JBOD for simplicity, and to get the best use of space. Yes, I'm acutely aware that I'd be better off were I to set it up as some level of RAID. The thing is, finances are an issue, I just couldn't afford to do that. I've got 'em set up now in JBOD so it's just one big collection, not broken into sepearte drives, and I prefer it that way. Sadly there's no 6-8TB drives yet.
          Well, so I'm aware that these drives are now over two years old, thus they're approaching the point where they're more likely to fail. I've had great success with drives over the years, some I've had for >ten years now, but what's on those isn't critical, wouldn't kill me to lose 'em. The music collection, OTOH, that would really really suck to find one day that it'd failed.
          So, like the OP, I have a lot of data that I'd like to back up, and to do it as easily and cheaply as possible. Yeah, I know, probly not gonna be easy, cheap or fun, but it's either that or lose the results of all my time and effort amassing this terrific collection of ear candy.
            I'm glad I don't have to find a way to save 20TB of stuff, at least. I keep adding to the collection too, an average of 5 new albums/day, It's gonna completely fill the 6Tb of space I have now, so it'll be time to do something new. So, my house is always filled with happy tunes. I'll be watching this thread. Thanks to the OP for posting the Q.
                      Peace

  286. Buy a NAS by Anonymous Coward · · Score: 0

    Option A: (I wish I had the bucks for a NAS)
    (1) Buy a NAS that holds more than 20GB. For example, there is a 5x5GB NAS being advertised.
    (2) Mount the NAS as a local drive letter.
    (3) Spend a lot of time exploring which file-copy programs that update 20TB "quickly."
    (4) Disconnect the NAS and store it somewhere else.
    (5) Put a P-Touch label on the original giving the last time it was backed up.

    Option B: (This is what I use...with a bunch of 1TB to 3TB USB hard disks)
    (1) Buy a bunch of large USB hard disks.
    (2) Break Your data up into directories. (It is probably already broken up into directories.)
    (3) Copy the directories, one at a time, to the hard disks. Sometimes an Excel Spreadsheet of how much is in each directory helps. LEAVE LOTS OF EXTRA ROOM for future additions.
    (4) Spend a lot of time exploring which file-copy programs that update directories "quickly." I use: http://sourceforge.net/projects/freefilesync/
    (5) Disconnect the first USB hard disk and store it somewhere else.
    (6) Proceed to the next directory and the next USB hard disk.
    (7) Put a P-Touch label on the original giving the last time it was backed up.

  287. Corruption? by Anonymous Coward · · Score: 0

    rsync is not intended for backup.

    Name slips me, however some company recently shutdown because their backup was rsync based. Original corrupted and rsync copied the corruption to the backup.

    Don't use rsync.

  288. Get a backup buddy by Anonymous Coward · · Score: 0

    agree that you backup my shit and I'll backup your shit.

  289. Hard drives in firesafe. by Wild_dog! · · Score: 1

    What about 4 5TB mirrored drives in a fireproof safe?

  290. Contribute to the resolution, stop dismissing the by Anonymous Coward · · Score: 0

    All y'all losers posting things like "it must be all stolen", and "nobody ever needs 20tb" are just filling this thread with noise. It doesn't matter if it's porn, pirated movies, or your own damn sequenced genome, there's nothing wrong with wanting to have 20tb and there's nothing wrong with asking how to manage it. It's none of your damn business if he "should" or shouldn't" have something!

  291. Duplicate systems by subminiature · · Score: 1

    I have 14 x 2Tb of storage. 25Tb of data. My solution is to have a back up system. I do a refresh every week or so. It isn't perfect. I have had JPEG images get corrupted and the back up process copies the fault over. Some videos get damaged but knowing it that is hard as you can only really see when you watch the 90 minute 2 hour film. A way would be to do a CRC check and repeat that monthly - very tie consuming. The back up system saves some damage and the accident deletion but they are in the same room. The next stage is for a third system that is swapped out with the 1st back up system every month and stored somewhere else. If the material is commercial - ebooks, transfers of DVDs and CDs for streaming then it could be built up again. All the family home movies going back 30 years, the photographs from negatives and digital cameras can only be done again if the original tape, negative is still around and the technology to extract it. Some of my early video tapes can be used again but it would be difficult to get a good image from them - the digital copy done 15 years ago is a better copy. I have hard drives still running from 98 but a pile of broken ones from only three years ago. These large 2-4Tb drives need checking often. RAID would help but that would mean 28 drives for each system. Even with costs coming down that the costs soon mount.

  292. New way to store data... by steeleyeball · · Score: 1

    Holograms with images of QR codes. How many holographic images can you put on a small glass cube? With Holographs, making duplicates would be a snap... Get it?

  293. You Need RAIRANASD! by esobofh · · Score: 1

    you need RAIRANASD...

    Redundant Array of Inexpensive Raid Array Network Area Storage Devices

    --

    ----------------------------
    Esobofh - Currently drinking fresh mango juice.
  294. Yet another alternative by AlphaBravoCharlie · · Score: 1

    Is to simply call the NSA or subpoena them since they probably have a copy or two of your data somewhere.

  295. "Erased"?? o_O by klek · · Score: 1

    Wait, he "erased" his 20TB RAID array? What, with a giant electro-magnet or something? Did he Select-All > Delete and then go to bed thinking all was chugging along ok? Run a script that secretly had rm -rf * tucked away in it that he left running overnight? Cripes. Well,.. bum luck to that then.

    Yeah, LTO5 or 6 cassettes are your best option, really, since you can additionally get those off-site, avoiding the catastrophe of a fire or flooding taking our your next 20TB array.

    Better, though, is to PRIORITIZE: Identify which 5-8TB of data is "most critical" and make sure at least *that* is backed up, (onto removable HDs?). You can get to the other 12-15TB as time and expenses allow, or just let it be at risk.

    Of course at this point he has 0TB of data, so he could start small with a cloudy services, and then scale the backup as his hoarding expands again and takes over his life.

    I guess this proves the maxim: "If your data is not in two places, it's already gone."

  296. And nothing irreplaceable was lost. by Anonymous Coward · · Score: 0

    The OP said that this friend had 20TB of music and movies. It's not like it was a bunch of code for a game he was developing, or original documents, etc.
    Unless he created that content himself, then nothing has been "lost" here.
    It's still all available to re-download or re-purchase or re-rip.

    If my media raid ever died... would I really be that dismayed? Not really. I only ever re-watch maybe 20% of it, and it can be re-aquired.

    How about just finding a friend to share your media library with? You both build identical 20TB systems, and sync with each other periodically over a VPN or similar. So if one person loses their crap, 99% of it is still with your friend.

  297. DLT-S4A by Anonymous Coward · · Score: 0

    25 X DLT-S4A (0,8 TB) each.

  298. Since he owned the 20 GB, he could re-rip them by DontScotty · · Score: 1

    Since he owned the 20 GB, he could re-rip them

    It's obvious that at 20 gig, he only was moving digital copies of material that he personally owned - so it should be a simple matter of his re-ripping the material.

    1. Re:Since he owned the 20 GB, he could re-rip them by DontScotty · · Score: 1

      ... BUT ... many other people came up with this exact idea - and I don't see a DELETE button on the above post...

      *sigh*

  299. Second array by mad_psych0 · · Score: 1

    For something that large, and presumably something you may not want certain organizations with 4-letter acronyms that end in 'AA' to be able to subpoena a 3rd party and gain access to without your knowledge, build your own redundancy. It may cost more upfront, but ultimately building a second raid array on separate hardware and using an automated process like DRBD to keep them in sync seems like the most sane approach.

  300. Archival Disc by Anonymous Coward · · Score: 0

    I'm waiting for the "Archival Disc" that is being proposed. At 300Gb?? per disc. Would take quite a few discs.. but I would think storing data on a non mechanical system would be best.

    http://www.ign.com/articles/2014/03/11/sony-and-panasonic-announce-300gb-archival-disc

  301. Compression by Anonymous Coward · · Score: 0

    First off, compress the hell out of the data. Using 7zip or rar or something at its greatest compression ratio could probably cut the size of the data down to 10TB.

    1. Re:Compression by electrongunner · · Score: 1

      Wrong. Movies and music are generally already compressed and 7zip or other similar file compression tools won't do much besides waste your time. That's why you should generally turn off backup compression on video and music backups.

  302. Re:How Do You Backup 20TB of Data? by Anonymous Coward · · Score: 0

    http://www.ebay.com/bhp/1000-blank-dvd

  303. Re:Stop being cheap or collecting so much crap dat by Anonymous Coward · · Score: 0

    in corporations, there is a lot of junk files that users put up there, thats why administrators use this thing called disk quotas.
    under windows 4 and 2000 you didnt have such a thing and we had 12 gig of storage (back when pc's had 250 meg drives)
    90% of it was junk, cute cat pictures etc.
    you should limit yourself also...

  304. Either don't back up disposable data or unRAID it. by electrongunner · · Score: 1

    There's no reason to be backing up 20TB of data if you're not Amazon or Google. Separate out the essential data that you can't live without. Your music collection, your work files and your photos. The rest is disposable. Then go get yourself 2 nice identical hardware RAID cards, set up a 4+ drive RAID5 fileserver using 1.5 or 2TB drives, with 3 active drives and one drive as a hot spare. Buy at least one extra drive for when you need to put your hot spare into action and replace a dead drive. Put all your important data on that raid, put it in a closet somewhere, put in a ventilation fan of some sort, set up email alerts to tell you when there's SMART errors or the raid is degraded, then check your raid status software once a month just to be sure it's all good. Then get another cheap external RAID enclosure with built-in raid5, (something like a StarTech SAT3540U3ER - which is iffy, but works for me) and fill it with 3TB or 4TB drives (plus another spare for when THAT raid fails) and use that to back up your first raid. The backup raid should be large enough to hold at least 2 full backups of the first raid -- choose your drive sizes accordingly. Then back up your first raid onto the 2nd and smile because you've finally achieved relative safety for your important data. Then take a deep breath and say to yourself "I can live without all those videos if something goes wrong. It will suck, but it won't be life-ending, after all that's what bittorrent is for". If you know where to go, you can find almost any movie or tv show and download it in under an hour via bittorrent. Hell, you can download 15 seasons of South Park in h264 format in under an hour. Most HD movies in 1080p format take 30 minutes or less over a decent connection. And if you really care about saving your videos, make an offline library and burn them to DVDs. Not really feasible if you've waited until you filled 20TB of drive space with movies and tv shows, but I have a series of 4 filing cabinets with DVD-sized drawers full of around 2000 CD-Rs and DVD+Rs (and a few BD+Rs for my 170GB collection of BBC Horizon documentaries) I've burned since about the year 2000 when DIVX and XVID format movies started to appear en masse. Every few months I'd spend an evening or two burning my latest batch of movies to DVD and then removing them from my hard drives. But I've found that most of the time these days I don't even touch my archives when I want to watch one of my movies because it's easier just to download a fresh copy in 1080p which is generally better than the archived version I downloaded years earlier. I expect that trend will continue, which is why I've recently stopped burning my movies altogether and now I just add hard another shared hard drive to one of my HTPCs as they fill up, or delete stuff I know I'll never watch again. And last, but not least, I'd be remiss if I didn't mention unRAID. Though I don't use it myself because I've long-since gone down the path of the aforementioned RAID5 setup, unRAID could be the best option if you're starting from scratch. unRAID gives you RAID5-like redundancy, but with arbitrary disks, and with the benefit of only losing the data from specific failed drives in the rare event that it can't be rebuilt from parity data. If you want to know more about unRAID, google it. And forget about backing up 20TB of data for at least another 5 years. No one has that kind of time.

  305. recover from accedential deletion by subminiature · · Score: 1

    I was editing some off air material and deleted the sub-directory on the wrong computer. Some of these where new files and not yet backed up. The source for the edits had already been over written. I have Microsoft Home Server. I took the drives out of the computer and external 4 drive housing and used RecoverMyFiles to look for the files on a desktop PC. The videos had already been spread across three drives but I was able to recover them all. The software was more than the data was worth - a DVD will be release this year. My existing version of the software could not find these files. But at least the latest version does work and will be useful again in the future. I have used RecoverMyFiles to search through a dying drive that wouldn't allow access to the directories. I needed to check what I had lost and not backed up. I could recover files particularly documents and databases but not video. I was able to list what I had lost and to re-do the work. I was able to check that I had not lost anything important.

  306. Use iTunes Match and don't back it up! by Anonymous Coward · · Score: 0

    It's around 30$ a year and you can accessit from every devices with access to the Net.

  307. Jacquard loom punch cards by drkim · · Score: 3

    I'm backing up my 40TB music library on Jacquard loom punch cards.

    Added bonus: You can use the punched cards to make fabric.
    Right now I'm wearing Justin Bieber's "Love Me" ...as a sweater!

    https://web.duke.edu/isis/gess...

  308. Time/Money - Effort Never Needed? by Kookus · · Score: 1

    The reasoning for not using a dvd or blue ray writer is pretty flimsy. They might be more expensive per TB than a hard disk, but they'll last 6 times longer. The majority of the data sounds like it might not even change. Movies, songs... it's all static data. Get a good incremental backup solution in place and it won't be hard to make sure everything is backed up.
    As the the whole probably never need argument... well, your friend just needed it.

    No matter what, you will need a backup. At some point in time you will not be able to buy replacement raid array cards that will work with the volumes you've created. Hardware will be obsoleted, and you'll have to replace it all... that means your backups will need replaced too. If you want it to last 50 years, then that's what it takes!

  309. I'm confused... by ewenix · · Score: 1

    their entire electronic collection of music and movies

    Explain to me how this 20TB digital collection isn't already a backup?

  310. 4TB External Hard drives by Anonymous Coward · · Score: 0

    I'm a bit amazed that no one has mentioned 5 x 4TB external hard drives.

  311. UNRAID by Anonymous Coward · · Score: 0

    Simple solution for backing up digital media is unRAID.
    Overall simple setup, extremely easy to maintain, and expandable.
    Nothing comes cheap, but unRAID is by far the most cost-effective solution you can take when considering the complexities of a typical RAID setup.
    I currently store roughly half, 10TB array and have absolutely no issues and have been running so for the past year and a half.
    Several others in the well supported community have plenty of tenure and experience with the system as well.

  312. Simple solution by Anonymous Coward · · Score: 0

    You can use the grandfather / father / son method. onto tape or DVD's. It would be a lot of them, but not undooable,

  313. Correct answer that should be less than $1K: by Anonymous Coward · · Score: 0

    1. Buy LTO6 drive (for $2k)
    2. Back up to 8 tapes or less using LTFS or Windows' NTBackup ($500)
    3. Sell LTO6 drive on eBay (may lose a couple hundred, but your drive is basically new with a few hours on it)
    4. Repurchase drive at much lower cost if/when future recovery is ever needed (or send to tape restoration company, usually couple hundred per tape)
    5. Back up incremental additions to 4TB USB HD (optional)

  314. SyQuest by Anonymous Coward · · Score: 0

    SyQuest 44. I heard there is a new 88MB cartridge coming soon.

  315. ZeoSync by hudsucker · · Score: 1

    Compress with ZeoSync (100:1 lossless compression of random data). Repeat until compressed file is small enough to backup.

  316. Symform by Anonymous Coward · · Score: 0

    If you need to backup 20tb of data, you can use symform. All you need is another 40tb space to be available for symform backup use.

  317. Removable or external hard drives by Anonymous Coward · · Score: 0

    20TB is five 4TB external hard drives. If you want better backup speed, use drives removable SATA caddies. Just think of hard drives like media -- "removable storage cartridges". They are cheaper than any other alternative for non-datacenter applications.

  318. Don't backup - replicate. Unraid instead of raid by eauze · · Score: 1

    "by erasing a RAID array on their home server" - don't use a RAID array. If it fails or you make a mistake, you lose everything. unraid is ideal for this. It isn't fast but it has disk level parity and a thriving support community to help. If you lose a disk, you can rebuild the array, and if you lose 2 or more disks, you only lose 1 or more disks of data - the rest of the disks are readable so most fo your data will survive.

    'how would you backup 20TB of data?' - wrong question. Think instead how to replicate it off-site. My own setup has 16TB of films/music etc, and I have two 16TB copies at different sites which are loaded from a 1TB portable drive I move around. The bonus is whoever has the other servers gets to share your collection too.

    Cost? - seriously, get over it. 3TB drives are mainstream and getting cheaper. HP's microservers are around £100 each and will take 5 drives.

  319. There's always Usenet... by Anonymous Coward · · Score: 0

    1. Upload (encrypted) files to alt.binaries.backup.
    2. Repeat #1 every 3 years.

    You don't even need to keep your $10/month account if you don't want to.

    Don't forget your parity files...

  320. Another server, ssh, rsync, keychain, bash, cron by doggo · · Score: 1

    You could set up another server with 20TB (or more, for versions) and ssh rsync with a shell script & cron (or whatever) job. (Linux box, obviously). I did this for years, using keychain for authentication between servers.

    Now we've switched to Windows Server, and I've got to find a way to replicate it using Win. Slow going.

  321. Just use external drives by Anonymous Coward · · Score: 0

    It's not like you could just plug in an external drive,...

    Actually, it's just like that. There are plenty of multi-drive external enclosures with USB 3.0 and/or eSATA ports on them. I'm using a Vantec HX4R with 4 x 4TB drives in it plugged in with eSATA, then just back up to it using rsync. If the data's important enough to you it's trivial to buy a second one and just swap them over periodically for off-site backups.

  322. I used to feel that way by Anonymous Coward · · Score: 0

    Then I realized that Blurays, DVDs, CDs and such are covered by homeowners insurance. With the exception of rare copies, you can buy a new version in nearly all cases. Ripping to disk really only needs you to rip the things you want to watch at an acceptable quality.

    And really, I don't bothered to backup that stuff unless all the rest of my things are backed up.

  323. What's it worth to you? by drhamad · · Score: 1

    I have a Mac Mini server set up hosting about 8 TB of primary data on mirrored USB3 drives. I then have it running Time Machine on all of that to a 16 TB RAID5 array on a NAS. Total cost (not including the server itself)? About $1,000... and that's for two sets of backups, one for drive errors (primarily) and one that has an always-available actual backup.

    --
    -Daniel
  324. If you own the original... by Anonymous Coward · · Score: 0

    Then that is your backup. If you are pirating, then serves you right.

  325. Synology NAS + AWS S3 + Amazon Glacier by Anonymous Coward · · Score: 0

    The only cost-effective solution that I know of is to purchase a Synology Network Attached Storage (NAS) unit and back it up to Amazon S3 (backup to AWS S3 is built into the Synology), then set up the S3 bucket to migrate the files to Amazon Glacier after some period of time (maybe 14 days). A quick calculation on AWS costs would be about $250.00/month for 20TB of Glacier storage and there would be a small amount for S3 storage before migration (new files for a few days). A Synology DS-1813+ with 8 x 4TB hard drives (in RAID 5) would result in 28TB of storage and can be expanded to 68TB quite easily. Cost of NAS would be approximately $1750.00. The beauty of this configuration is that it is all managed by Synology's very easy to use DSM web management interface and could be running in a few hours. Now it would take weeks to upload the information the first time, but after that a few hours per night. I have a smaller configuration of this running on Synology DS412+ (4x4TB) and it works like a charm.

  326. You can only afford 1/3 as much data as you think by Anonymous Coward · · Score: 0

    If you think you can afford to build a 30TB NAS then you really can only afford a 10TB NAS plus 2 identical backups with one kept off site and only one of the backups in a writeable state for updating at a time, the remote copy only provides checksums and then is swapped with the local copy.

    Keep data moving and checksum test each replication then it will survive intact. Look at how life manages the data in DNA, not perfect but still usable after billions of years.

  327. Huh by Anonymous Coward · · Score: 0

    I don't really understand the dilemma. If you have paid for 20TB in storage and your data is important spend the cash on another 20TB array or PC with DAS and back it up to disk. It's really not very complicated.

  328. ZFS and second server by Anonymous Coward · · Score: 0

    ZFS would be a great way to hold onto your data in the event you as a user some how deleted your own files. Simply restore from a snapshot.
    Sadly to make a backup of 20TB, it would just make sense to have another server with a mirror image of your data off site. Keeping both in the same location would defeat the purpose. Now partner up with a good buddie and split the costs and your media collection. Win-win situation :)

  329. Many small, or a few big by SDPost · · Score: 1

    One option is to store you files are metada to a huge number of "small" jpegs. You can also use steganography if you like. Then, upload them to sites that allow for unlimited number of pictues (facebook, picasa, etc). Or simply get five 4TB HDs ($165 each).

  330. recovery by Anonymous Coward · · Score: 0

    Good day. If you recreate the RAID array exactly as it was then the data is there and not deleted so you are effectively recovering it. That is if you actually only deleted the raid array definition in the Raid software/Firmware. I have done this with success.

  331. Rsnapshot to an offside box over the Internet by Anonymous Coward · · Score: 0

    I use it to keep 60tb backed up that way. Works without a problem. Tips: you need a decent internet connection and split up into smaller jobs to keep manageable.

  332. Egypt by Anonymous Coward · · Score: 1

    Well, to be fair, that's not a good analogy. An Egyptian gov't might only last 4-5 months anyway... ;-)
    I have a 3x2TB RAID 5 array (4TB total) and I'm constantly bumping up on my limits. It's like 3/4 downloaded movies, some music, some software, and some personal pictures and data. Honestly, the personal stuff, the stuff I couldn't just re-download, might take up a whole 10-15 gigs. That stuff is backed up multiples times over via Dropbox, Google Drive, and scattered Blurays and flash drives.

  333. Glacier by Anonymous Coward · · Score: 0

    They actually charge little to back up, but they charge an arm and a leg to get at the data. Glacier is really only for a rainy day. Also, $200 a month isn't really economical when you consider after half a year you could have purchased a 20TB backup solution...

  334. FlexRaid or UnRaid by cbeaudry · · Score: 2

    http://www.flexraid.com/
    http://lime-technology.com/ (UnRaid)

    Best solution for big media collections.

    All data is stored seperatly on each drive, and 1 separate parity drive can protect up to 21 drives (as long as its as big or bigger than any 1 of those 21 drives).

    1. Re:FlexRaid or UnRaid by Anonymous Coward · · Score: 0

      I've been using FlexRAID for years. I have a 4U case with 24 hotswap bays. I currently have 12 discs in the system (10 data, 2 parity) with a total usable capacity of 21 TB. FlexRAID is nice because you can use drives of any size, and can have as many parity drives as you want. For a large media server, Snapshot RAID makes the most sense (parity calculation is scheduled rather than run real-time) as you don't lose read or write performance. Unlike RAID5/6, even if you lose more discs than your array can tolerate, you only lose the data on those specific discs. In addition, the discs can be removed from the array and read in another system as there is no striping.

      FlexRAID supports both Linux and Windows. I'm running RAID-F on WHS 2011, and also use storage pooling to display the 21 TB array as a single virtual disk, which is accessible from any system on my network. I use XBMC as the frontend, with a shared MySQL database so all of the XBMC clients (living room TV, bedroom TV, desktops) are synchronized.

  335. That's a whole lot of porn by Anonymous Coward · · Score: 0

    I love my cigar too, but I take it out of my mouth once in a while.

  336. 2 methods by Anonymous Coward · · Score: 0

    1. You put it on a filesystem that has versioning/snapshots and you replicate it.
    2. You buy a tape library and back it up to that.

  337. how i backup my array by Anonymous Coward · · Score: 0

    My library is around 12tb and I use a dock and bare drives. I have a series of unix scripts that backup the library in parts. Here are my scripts.

    rsync --verbose -r --delete-before --ignore-existing --include=[A-Ma-m0-9]* --exclude=* /Volumes/Drobo/iTunes/Movies/ /Volumes/Movie\ Backup\ 1/

    rsync --verbose -r --delete-before --ignore-existing --include=[N-Zn-z]* --exclude=* /Volumes/Drobo/iTunes/Movies/ /Volumes/Movie\ Backup\ 2/

    rsync --verbose -r --delete-before --ignore-existing --include=[N-Zn-z]* --exclude=* /Volumes/Drobo/iTunes/Movies/ /Volumes/Movie\ Backup\ 2/

    rsync --verbose -L -r --delete-before --ignore-existing --include=[A-Da-d0-9]* --include=*.mp4 --include=*.m4v --exclude=* /Volumes/Drobo/iTunes/TV\ Shows/ /Volumes/TvBackup1/

    rsync --verbose -L -r --delete-before --ignore-existing --include=[E-Re-r]* --include=*.mp4 --include=*.m4v --exclude=* /Volumes/Drobo/iTunes/TV\ Shows/ /Volumes/TvBackup1a/

    rsync --verbose -L -r --delete-before --ignore-existing --include=[S-Zs-z]* --include=*.mp4 --include=*.m4v --exclude=* /Volumes/Drobo/iTunes/TV\ Shows/ /Volumes/TvBackup2/

    rsync --verbose -r --delete-before --ignore-existing --exclude=‘Mobile Applications/‘ --exclude=‘iPod Games/‘ /Volumes/Macintosh\ HD/Users/dave/Music/iTunes/ /Volumes/Macintosh\ HD/Users/dave/Dropbox/Backup/iTunes/

  338. Second machine and rsync by dbrower · · Score: 1
    Get another machine with equivalent capacity and rsync between the two using gigE. The first copy will take a while (a day), but subsequent updates will be quick, being only the changes.

    Even with windows you can run rsync if you install Cygwin and it's sshd.

    If you can put the second machine in a distant room (garden shed, detached garage) that's unlikely to go up in the fire, that's better.

    --
    "It if was easy to do, we'd find someone cheaper than you to do it."
  339. Seroius Answer: LTO6 by lionchild · · Score: 1

    Considering that you've got to be running something larger than your average desktop PC to hold that much data, I'd consider looking at a tape library like this:

    http://www.tigerdirect.com/app... ($3750)

    8 slots for Ultrium 6 tapes, non-compressed will hold 20TB, 50TB if you can get decent compression...which I'm guessing you might not. I think tapes can be found for just under $65 each, depending on how you shop them.

    I guess it depends on how many tapes you want to back up to after that.

    --
    Awk! Pieces of eight. Pieces of eight. Pieces of seven... ERROR: General Protection Fault. [Paroty Error.]
  340. 20TB of movies and music.... by Anonymous Coward · · Score: 0

    I don't think there's that much enjoyable content out there. Perhaps learn to delete crap media you don't ever really want, and then your backups will be much more manageable.

  341. Paperback! by Anonymous Coward · · Score: 0

    You'll only need about 14000 reams of paper...

    www.ollydbg.de/Paperbak/index.html

    You could laminate them for extra data integrity!

  342. Obviously you're not a photographer by Anonymous Coward · · Score: 0

    If you were a photographer you wouldn't be doing what you are doing.

  343. Re:Hmmm I throw down the gauntlet! by Neil+Boekend · · Score: 1

    That's from XKCD. No need to use an aggregator site.

    --
    Well, I might have a way, but it only works on a semi spherical planet in a vacuum.
  344. Cheap desktop PC with 5 or 6 hard drive slots? by rklrkl · · Score: 1

    Assuming you don't need RAID on the backup device itself, then a cheap desktop PC (usually from a custom white box builder - most OEM PCs don't come with enough SATA connectors/hard drive bays) with 5 or 6 4TB SATA hard drives does the trick. Sure, it'll cost you a fair amount for the hardware (in the UK, probably around 1,000 pounds or so), but it might be the most flexible solution (e.g. could be located offsite if you're paranoid, though you'd need a fast connection to it - at least 100 Mbits/sec I'd have thought - for that amount of data).

    Of course, if you then want to keep multiple archive copies, then you'd have to look at compressing the backups and/or perhaps using backup software that does incrementals (e.g. Amanda on Linux or whatever). Another much pricier alternative is multiple spanning Ultrium 5 tapes in 24-slot autoloader attached to a machine with little local storage (1-2 TB free for holding space), but we're talking 5,000 pounds or so for this solution.

  345. Portable server? by WayneOsteen · · Score: 1

    My storage array is only 8TB, but I doubled down and built another 8TB array in a small Lian Li case. I do a mirror backup to it once a week or so, and carry it to work with me. That way, it's offsite in case something physically happens to my home.

  346. A backup is just another copy by racermd · · Score: 1

    I'm in a similar situation and I actually have planned for a worst-case scenario. However, my storage needs are slightly more modest at about 5TB (give or take).

    My main, active archive exists on my primary desktop and is the location that will get the most changes. That, in turn, is backed up to a dedicated NAS server (currently an 8-bay Synology unit packed with 3TB disks) in my home. THAT, in turn, is backed up, off-site to a friend's NAS units of similar construction and capacity via CrashPlan. The free version offers "backup to a friend's computer" as an option, though the paid subscription offers to store data on CrashPlan's servers, instead. The cost is fairly reasonable for that option if none of your friends has enough storage for you.

    One other last point - it might not make sense to back up EVERYTHING you have. Photos, critical documents, etc. (things you can't easily replace) should absolutely be backed up. Copies of game files, software installations, etc. (things that can be replaced relatively easily from the original media) should probably be left out of the backup set. That limits the amount of remote storage required as well as the time it takes to back up those items in the first place.

    --
    My sources are unreliable, but their information is fascinating. -- Ashleigh Brilliant
  347. Pogoplug by Anonymous Coward · · Score: 0

    Pogoplug Cloud storage is unlimited for $5/month. Or at least they claim it is; backing up this amount of data might be a good way to find out.

  348. crashPlan by Anonymous Coward · · Score: 0

    How about CrashPlan? It is an online backup that offers unlimited space.

  349. Good article on tapes by aurizon · · Score: 1
  350. Be selective by Anonymous Coward · · Score: 0

    I too have that sort of magnitude of digital stuff on a home server.
    I have flagged for backup only those files and folders that I would not be able to restore from elsewhere - so, no music, films, or other external content - either I have the original CD's and DVDs (I rip because most of the films are Zone 2 and I'm not buying the same damned stuff again because I now live and operate in Zone 1), or the material is available elsewhere online: where content is available via a stable url, I keep, flag and backup the URL's; Thus, only material that I have produced and which is "original" content is backed-up.
    Result (surprise, surprise)? Less than 1TB and with an annual growth rate over the last 10 years of no more than 10% year-on-year

  351. I recommend by Anonymous Coward · · Score: 0

    Bornoulli drives.... Like an IOMEGA 20+20. Dual 40MB 12 inch floppies should back that puppy up in no time! There's one on ebay for $64 that's 1% of the cost in 1986!

  352. SATA drives... by Anonymous Coward · · Score: 0

    I did some consulting work for a company that wanted to store that much data, a kind of one-shot archival deal...

    We looked around at options, and there was really only one cost and time/speed-effective one.

    So I bought several SATA drives, backed it up on them, made an index, labeled them, then unplugged them and put them in a safe...

    I know it probably seems dumb, we bought 14 3 TB SATA drives at 135 bucks each, so it ran over 2k including shipping and taxes.

    They don't anticipate needing the data, but if at some point they ever do need it they can relatively easily locate and retrieve it.

    They have a pretty big vault to store stuff like that in long term, so I understand it's not a perfect solution for just anybody. I would have preferred blu-ray, but I didn't want to be the one to burn all that data to discs that might not even be readable in 10 years.

  353. This is what I do by Anonymous Coward · · Score: 0

    I have my media computer and then I robocopy the data to a set of hard drives with the exact same sizes onto my server every night - incremental updates. So much data is worth the few hundred bucks you would have to pay. If I had to re-rip all of those blurays and DVDs it would take me months and costs me thousands in my own personal time.

    Always ask yourself how much your time is worth.

  354. Maybe a hybrid approach by Anonymous Coward · · Score: 0

    I do like the idea of tape in theory due to the ability to backup and store offsite. At home I have different buildings so I am creating a data center in a garage that I will be doing incremental backups to but in thinking about how to do this in a city got me going. The reason I don't go directly to tape is because it is slow, not automatic (you have to physically move the tapes) and more expensive than setting up a duplicate drive system.

    How about making a deal with a neighbor that you can either get a direct network line to or wireless at the least and store backups at each others house? Encrypt them and do incremental backups nightly. That should be sufficient and if your house burns down just pull the drives from your neighbors. That might give you the best of both worlds.
    ennalta

  355. don't backup uncritical data by Anonymous Coward · · Score: 0

    Backup only the data that needs to be backed up; cannot be recreated or redownloaded. Hard to believe all 20tb is critical.

    Or host a replica server at another location after seeding the data. Then replicate the changes to keep the 2nd copy synced. Be sure to use encryption.

  356. One important consideration seems to be missing... by dequire · · Score: 1

    Who cares, really? There's no way this guy legally purchased that much A/V media over the years. Easy come...easy go...

  357. this s how yu can back up 20TB by Anonymous Coward · · Score: 0

    hmmm... 20 TB data.

    buy 20 1TB external hard disks,hit 1TB and write in a notebook serial no of disk and wat u stores in each one.

    pro:ur data backed up n organized,u can pull t respective hard disk and get data.

    cons: mmm 20 hard disks.. may be u might get discount f u walk up t vendor n shock im tellin u wanna buy 20 nos of 1TB external hard disks..

  358. Hey, mindless troll... apk by Anonymous Coward · · Score: 0

    Why do you ALWAYS RUN from these -> http://tech.slashdot.org/comments.pl?sid=4885825&cid=46474817

    APK

    P.S.=> Is it since you're a undereducated WANNABE in the art & science of computing? Yes... apk

  359. They can do this research by Anonymous Coward · · Score: 0

    That shows how STUPID a little troll like you are vs. this -> http://tech.slashdot.org/comments.pl?sid=4885825&cid=46474817

    APK

    P.S.=> Why do you always "RUN" forrest? Is it because you're nothing more than a useless waste of life TROLL? Absolutely, lol... apk

  360. Hey "Forrest" Gump... apk by Anonymous Coward · · Score: 0

    Running away from a fair challenge to you, again? See here http://tech.slashdot.org/comments.pl?sid=4885825&cid=46474817

    APK

    P.S.=> You're nothing but a TROLLING waste of life I am going to ENJOY destroying, once again, on the same thing (you being unable to disprove my points in favor of custom hosts files value in added speed, security, reliability, & even added anonymity online)... apk

  361. The simplest answer tends to be the right one... by Anonymous Coward · · Score: 0

    http://en.wikipedia.org/wiki/Occam's_razor

    Get another raid.

    www.pricewatch.com

    I see there are 4TB drives going for $156. I would maybe shell out another 20 or so to get a better/name brand. So for around 1.5~2k, you have a backup with far less time invested and can be rebuilt as needed. Or, get more drives with less space for better redundancy. The next smallest drive is roughly 30% cheeper.

  362. Use Tape for Backup by lsatenstein · · Score: 1

    It is a bit more expensive, but take would be the way I would go. There are cartridge tapes that can run as fast as the hard disk transfers can take place.

    Many tape backup systems rely on a base backup, and then on incremental backups. Each incremental backup was the one based on the base backup. Once a month we would create a new base backup. (It was a business application, with daily changes).

    I made sticky lables to indicate the backup date and generation number. These labels went onto the tape cartridge's plastic case.

    In a very large shop, tape backupsrun from a feeder machine, with major automation and cataloging to allow reasonable file recovery time.

    Tapes are checked a day before reuse, to insure no lost oxide or fading. That function was part of the tape backup system.
    Weekly tapes were duplicated and moved off-site.

    All it takes is gelt and time.

    --
    Leslie Satenstein Montreal Quebec Canada
  363. Another 6TB Array by pebear · · Score: 1

    The only and best way to back it up is with another 6TB Array. You could setup your array card to write to both arrays simultaneously. You get that much data it becomes a major chore to keep track of the tapes and another array would afford you real time up to the millisecond backup availability. One thing the tapes will do for you is you can keep versions of your backup images from different dates. I would recommend like a small Dell library that holds like 30 tapes and maybe two LTO6 Tape drives. The tapes will allow you to go back in time where the 2nd array will not be able to do that but both would be good for a reliable restorable options in times of disaster.

    --
    Paul E. Bahre
  364. offsite redundant backup by Anonymous Coward · · Score: 0

    I've got about 10TB and use an offsite NAS that I rsync to every week. Most friends will store it pretty happily if they can back their own stuff up to your server. You can split the cost and you both get your data backed up. And copies of all their stuff, too.

  365. Mirroring? by Anonymous Coward · · Score: 0

    I'm not an expert but could he just only buy mirrored drives?

    Thinking out loud- maybe he could buy 7 3TB drives to do it as well.

    I wouldn't use Glacier. The cost is too high and you'd have to worry about bandwidth too.

  366. To hard disks of course by Anonymous Coward · · Score: 0

    The best way is to backup to another set of harddisks of course. Cheap, fast and you dont need new hardware, such as tape station.

  367. 20TB backup for a home user eh.. by Anonymous Coward · · Score: 0

    Are tape drives right for this kind of problem? only if you are interested in backup.

    Also you use the "Towers of Hanoi" method to save tapes, it's the most efficient if a little more complicated.

    As I home user I'd recommend you write out a tape change schedule and put it on the fridge.. yes defiantly leave it somewhere visible so that you don't forget a step and get confused.

  368. How to store a YB by astar · · Score: 1

    http://en.m.wikipedia.org/wiki...

    YBs are an active element of public policy. A YM is sort of the size of the universe and at that size the cosmologist types have multiple definitions of distance. But a TM is about the size of the solar system. Pluto is about .6 TM out and Voyager 1 is about 18 TM out. The next star is about a PM out.

    If i buy high end SD cards and load them up with a YB then they will fit very comfortably in the space shuttle assembly building.

    i notice that people are talking about TB USB sticks.

  369. Re:Either don't back up disposable data or unRAID by cboslin · · Score: 1

    Yours was the best of the bunch (minus formatting html tags), though I enjoyed reading about the trials and tribulations of punch tape vs punch cards vs tape/dat backup systems. The biggest problem I had many years ago was using a dat format system that I could not longer purchase hardware for. So I had tapes, but no way to read them. That taught me a lesson. Never use a media that I might not be able to read from 10 years from today. Thus I only backup on hard disks today.

    I agree that to backup music, videos and other static content that has been downloaded via the internet (and not personally created) is a waste of time and space. As you pointed out, with even a throttled cable connection you can download this fairly quickly. So never waste time backing it up. Totally agree with you.

    Now the one exception to video, pictures and music, are those that you create yourself. For your own personal pictures and personally created video. That needs to be backed up and I would suggest a harddrive (or multiple hard disks) for this purpose.

    If you work in the video / movie industry creating content, obviously this comment does not apply to you...check into creating your own Linux video sever farm for while-you-sleep-rendering and a homemade Linux SANs like this Petabytes on a budget: How to build cheap cloud storage. You will have to learn some Linux to do this, but it would be well worth it, if you have the need. This article should help you, Thoughts about this DIY-Thumper and storage in general

    Just as with industrial and union jobs of yesterday, white collar IT jobs, your movie editing jobs are now being offshored to India and when I was in LA a couple of years ago, a number of studios were relocating to Canada because it was cheaper for them...fyi.

    For home users not in an industry creating massive videos, the next few paragraphs should cover you. Give thoughts to what you really need and why. Don't back up anything you do not have too. Like Software, Operating Systems, only focus on the data you create.

    Plan your locations for different types of data, since you can label (mount point) your directory whatever you want. You could have one for video, one for audio (music), one for non picture images (your digital camera) and one for everything else. If you have the need, perhaps a DB directory as well. This would look as follows:

    /video/ ~ for downloaded video, not home movies, never backed up (this will be your largest directory for most)
    /music/ ~ for downloaded music, not self created, never backed up (you could write this to DVR or copy to a USB thumb drive if you want, the files are NOT that big. A 64GB thumb drive costs less than $30 on sale. Get a Micro USB adapter and only purchase micro SD cards and get very large ones. I use to use 8GB in my Nokia N800, now my zareason ZT2 Tablet has a 32GB micro SD card in it. Since I am using it for books, PHP development and research only, it will take a very long time to fill up.)
    /myvideo/ ~ personally made video, back it up
    /mymusic/ ~ personally created music, back it up
    /images/ ~ digital images from your digital camera, back it up
    /db/ ~ custom database stuff, back it up
    /data/ ~ everything else, back it up

    For the majority of you reading this, from /myvideo/ to /data/ (five different directories) will easily fit on one 500GB drive. If you are smart and compress it when you backup, you can probably fit a months worth of backups on that 500GB drive if not more. Linux comes with built in compression / backup commands and you can use PKZIP (or other compression program) for Windows to compress your data sizes and make your backup space go further. Even mo

  370. Deduplication by Anonymous Coward · · Score: 0

    Deduplication.

  371. LTO by DigiTechGuy · · Score: 1

    Tape would be best, though kind of pricey. Either that or hard drives either cheap slow disks or to be more pricey duplicate your live setup. It's not gonna be cheap for 20 TB of home use data, I'm guessing mostly of the size comes from video and audio, probably could be reacquired if need be. Backup your most prized data (personal documents, pictures, video, etc. that cannot be replaced) and take your chances with RAID 6 on the rest.

  372. THE PAPER IS BACK ! LOL by Anonymous Coward · · Score: 0

    Great minds!

    If you want to definitely keep digital information protected forever, you would have to go back in time, buy 20 ton of cards to be drilled and use that machine that IBM used to make the North American census. With punch cards no electromagnetic storm can destroy my data and I'd be quiet. The problem now would be another, how to store 20 tons of perforated paper? My definition of backup says that the backup must have 3 copies, one is worth, so I need to buy a shed to store 60 ton of paper, I think I will change profession and working with paper! LOL!

  373. Like you eat an elephant by javawocky · · Score: 1

    One byte at a time

  374. If money is no object... by Dhandforth · · Score: 1

    For a one time charge (in the high 4 digits or low 5 digits), guys like this http://pivot3.com/surveillance... have solutions that claim to use something called RAID 6. In my experience, this is a good solution if the data is more write intensive than read intensive. At first glance, good for storing movies and music for personal use, bad for streaming to multiple subscribers.

  375. 20TB solution by syleishere · · Score: 1

    Quite simply your solution is simple: a)money dependant Get another storage server with exact same specs and run an rsync cronjob to backup data once a night to other system. b) this time use only half the storage in a raid 10. I recommend FreeBSD or freenas with ZFS enabling compression as well, to get most out of your space.

  376. What's wrong with singular HD's? by Gallomimia · · Score: 1

    I just can't see why you haven't thought of grabbing 10 or so 2TB or 3TB drives, copying a segment of the data to them and putting them back in their anti-static bags on the shelf somewhere. SATA drive docks are cheap, or plug it into your eSATA port to access the drive. Perform updates once per week on only the lead drive. I think the problem that remains is software to index it all and know which files need to be backed up and which are on drive X of Y.

    --
    Sadly, a Libertarian cannot force his views on another, and freedom cannot spread as does the cancer known as religion.
  377. 40 Terrabite by Anonymous Coward · · Score: 0

    gosgog:

    Hunt for an old Intergraph machine (Photogrammetric Companies somewhere may stil have one... then back it up on the hugh tapes.

  378. Its a cost issue by Anonymous Coward · · Score: 0

    Its not the amount of data (20TB) that's important; its the "worth" of the data.
    If the data is important enough, it is worth the cost of a tape backup device & media.
    If its not, worth it, don't bother.

    If only part of the data is important, match the cost of the backup device and media with the importance of the data.

    I back up the most important documents online (dropbox), the unrepeatable but large (pictures) online in the home network + offline copy at the office,
    the replaceable (music) only online in the home network, and the rest is not backed up.

  379. pretty obvious by Anonymous Coward · · Score: 0

    Set up a torrent, let N people download it, and redownload in event of disaster.

    Sharing: it's not stealing, it's data conservation!

  380. Ask the NSA. by Anonymous Coward · · Score: 0

    They are sure to have a copy, just ask nicely.

  381. Backup 20TB of Data by Anonymous Coward · · Score: 0

    First of all, I don't know why I'm posting as Anonymous Coward, but would happily change that to ArtfulOne, aka Arthur Fuller. If anyone can tell me how to change that, I'd be appreciative.

    I have tracked down the way to do this, but as others have posted, the cost is non-trivial. About the cheapest way I can think of is to score a dozen or so 4TB drives (they're quite cheap now) and chain them; but the really (cost no object) effective way to go is with IBM's fancy drive, more about which may be read by visiting http://www.technologyreview.com/news/425237/ibm-builds-biggest-data-drive-ever/.

    As for me, my demands are far more modest. I recently purchased a Blu-ray burner and backed up everything and it only took 8 discs, nowhere near even a couple of TB, but mind you, I'm just one semi-retired guy and most of the stuff such as DVD movies and CD music has already been burned to disc.

    Arthur

  382. backup to USB disk by Grimwiz · · Score: 1

    Buy a cheap large USB disk.
    Sort the files by creation time, remove the ones in your list of files already backed up (empty at start) and fill your USB disk using the oldest files first.
    Add list of backed up files to your collection of disk indexes, (excluding files that you want the backup to process again).
    Label the disk as "files from xxx to yyy"
    Repeat until you're duplicating current files into a current disk.

    If the data is sensitive then truecrypt the disks before writing the data, being aware that losing your password would also lose your backup and be prepared to wait days building the encrypted container.

    I wrote an awk script to do this, though at the time I was writing 4Gb DVDs rather than multi-terabyte USB disks.

    At some point in the future, when disk sizes have grown you can copy multiple old smaller disks into larger ones to avoid being stuck with old technology. I expect after the submitter has moved to a 100Tb NAS in 2018 they'll be using the 20 as a backup.

    --
    -- Don't believe everything you read, hear or think
  383. How Do You Backup 20TB of Data? by Dabido · · Score: 1

    Bit by bit ... a little bit here, a little bit there.

    --
    Sure enough, the cow costume was hanging up next to the superhero outfit and sailors uniform. (S,Spud)
  384. huge data by lccsuppliers · · Score: 1

    Why do you need that huge data?

    --
    low cost country suppliers
  385. Friend with room by goarilla · · Score: 1

    What I would do is build a second NAS.
    Do an initial backup of it and move it to a friends basement/rack and sync it up every week or so.

  386. I'll do it by Anonymous Coward · · Score: 0

    You can use my wife, I'm pretty sure her ass can hold a few 100.

  387. Who has a movie collection these days! by mtthwbrnd · · Score: 1

    "Recently I had a friend lose their entire electronic collection of music and movies" You can stream almost everything for free and if you want the files back then apparently there is this new thing they invented called bit torrent.

    Most films are so crap that they really can only be watched once anyway, if at all. How many times a year was he watching the same movies? I doubt that in 1 year he would watch 20Tb of movies.

  388. $600 for 5 4Tb externals. by Anonymous Coward · · Score: 0

    Back it up on five 4tb drives which as of 11pm last night when I ordered one cost $150 each.

  389. Snapshot Raid by __aayurq3262 · · Score: 1

    20TB for a home user is likely to be media data. It doesn't change much and it 's usually possible to recover - rerip, red/l, etc. so it's probably OK to live with a higher risk of loss than a business would need for their backup of 20TB of data. With those assumptions, I'd focus on minimizing the risk of loss and opt for snapshot raid. (If he wants true backup, backup to disk is my preferred option, using a decent backup program. If offsite is needed, carry the data that doesn[t change offsite and arrange to send to web the stuff that changes daily.) True raid 5/6 seems like a good way to minimize the risk of loss, but it's not nearly as good as snapshot raid for media data. True raid is too likely to fail during a recovery when the disks and controller are heavily stressed. I've lost raid arrays from both controller failure and multiple disk failure. Plus, a loss beyond the redundancy level loses everything with true raid. Snapshot raid with pooling software is much better for media backup. You only lose the data on the disks that actually die if you have more than the redundancy/parity number of drives that fail. You can add additional parity drives at any time to increase redundancy. For windows or *nix systems, Snapraid is a free snapshot raid option that works great. It comes with pooling capabilities that will make the entire array look like a single drive or, for more advanced pooling needs, there are multiple 3rd part options. Liquesce is free for Windows and there are even more free options for *nix systems that need pooling.

  390. Re:How to store a YB by PingPongBoy · · Score: 1

    Currently, SD cards or memory stick can be obtained slightly less than $1 per GB. Suitable for storing the most precious data... A few years later, this technology hopefully will be more economical than tape.

    --
    Know your pads. One time pad: good for cryptography. Two timing pad: where to take your mistress.
  391. Re: How to store a YB by astar · · Score: 1

    Curiously in the labs there is this highly redundant DNA storage scheme. Sort of refrigerator size. Currently a YB of sd stick retail would be 10*27/10**9 or a million T$. Better called an exa $. Or a quintillian ion bucks.