Slashdot Mirror


Long-Term Storage of Moderately Large Datasets?

hawkeyeMI writes "I have a small scientific services company, and we end up generating fairly large datasets (2-3 TB) for each customer. We don't have to ship all of that, but we do need to keep some compressed archives. The best I can come up with right now is to buy some large hard drives, use software RAID in linux to make a RAID5 set out of them, and store them in a safe deposit box. I feel like there must be a better way for a small business, but despite some research into Blu-ray, I've not been able to find a good, cost-effective alternative. A tape library would be impractical at the present time. What do you recommend?"

58 of 411 comments (clear)

  1. Exactly what you're doing by rwa2 · · Score: 4, Informative

    I don't think you can beat a bunch of conventional hard disks in a RAID5 for both cost-per-TB and backup/restore performance, not to mention medium-term data integrity. Might be able to make hooking up the drives more convenient with an eSATA mult-bay enclosure, but those are kinda expensive. But I bet your backup box already has some sort of hot-swap on it already, like: http://www.amazon.com/Thermaltake-BlacX-eSATA-Docking-Station/dp/B001A4HAFS

    I assume you already compress your data, since scientific datasets tend to compress well. You might consider compressing to squashfs, since it will let you do transparent decompression later on so you can skip the restore step if you just need a handful of files.

    1. Re:Exactly what you're doing by forgottenusername · · Score: 5, Interesting

      I don't think it's a great solution. You're storing relatively fragile hard drives in a raid5 configuration in a lock box? It's not like you can tell if one of the drives goes bad and needs to be replaced when it's sitting in a box. You'd have to regularly pull the data sets out, fire them up and make sure everything is still functional.

      I'd at least want to do 2 complete sets of mirrored drives.

      Tape storage does store better.

      Depending on how important the data is, I might do something like a local mirrored drive set in storage and an online copy at something like rsync.net - stay away from s3, it's not designed to protect data, despite what AWS fans may say.

    2. Re:Exactly what you're doing by hardburn · · Score: 4, Insightful

      That's why you hot-swap them. You treat them just like tapes. In fact, once you start doing that, you realize that RAID mirroring isn't helping you any (striping is another matter).

      The best way to backup a big hard drive these days is with another big hard drive.

      --
      Not a typewriter
    3. Re:Exactly what you're doing by TheMeld · · Score: 3, Informative

      The other thing to do if you want longish term reliability is to add redundancy to whatever you're storing with a tool like par2, http://www.par2.net/ and http://www.quickpar.org.uk/ are your friend.

      Raid5 will help you if you lose a whole drive (e.g. siezes up from sitting still for a long time), the par2 data will both allow you to verify that the data hasn't been corrupted, and if it is (e.g. a couple sectors go bad), it will let you recover the data.

      --
      -Cheetah
    4. Re:Exactly what you're doing by mabhatter654 · · Score: 2, Informative

      LTO4 tapes are reasonably price.. but the DRIVES push $5k. For long term even the drives go obsolete too quickly, then become MORE expensive in 5 - 10 years when you really need them.

      The best thing is probably what Google does, simply keep 3 "live" copies of the data. Then the data is always on current hardware. The data is on "production" hardware along with other stuff so it is properly monitored by the OS and database for integrity, and hardware is maintained with support. Drive arrays are cheap enough in large scale just to keep moving things over to one that's twice the size in half the space every few years. The problem of course is when you are a Small or Medium business keeping even $50k in hardware around you don't really NEED for operational purposes is a big money sink... way to much for "insurance". But unless you keep one copy of the data live and verify/refresh your tapes every year you really don't have "securely" stored data.

    5. Re:Exactly what you're doing by rwa2 · · Score: 4, Informative

      Yeah, keeping those drives in a huge online storage array is probably better. Then they can mirror them across multiple sites.

      Here's a compelling petabyte online RAID system for cheap:

      http://blog.backblaze.com/2009/09/01/petabytes-on-a-budget-how-to-build-cheap-cloud-storage/

    6. Re:Exactly what you're doing by TrippTDF · · Score: 4, Funny

      your sig is incredibly apt for your post...

    7. Re:Exactly what you're doing by HeronBlademaster · · Score: 2, Interesting

      stay away from s3, it's not designed to protect data, despite what AWS fans may say.

      Just curious... S3 stores all of your data at multiple, geographically separate data centers. How exactly does that not protect your data? What else would you want it to do in terms of protection? It even gives you md5 sums of your files if you want to verify them (check the ETag attribute of each object).

      So, honest question: what do you think they're missing to make S3 really protect data?

    8. Re:Exactly what you're doing by lgw · · Score: 4, Informative

      Tape is really best for archiving, to this day. A single LTO drive won't break the bank for a small business, and it will be reliable.

      3 Things to remember about tape backup:

      Encrypt your backups. This is becoming available in the tape drive itself, but many backup applications will also do it for you in software. Limits embarassment if a tape goes missing.

      Occasionally test restores. This is incredibly important - almost every unreadable tape in existance was unreadable when created. Any reasonable backup software will give you the ability to do this automatically (as part of the backup job). If practical, create a job that does a backup of everything, but verifies only some small volume. If you can read anything, chances are high that the whole tape is fine.

      Get those tapes offsite. A safe deposit box works for a tiny company, but someone like Iron Mountain works better and is less hassle. Store a copy of your encryption key in the same facility (but don't transport the tape and key together).

      --
      Socialism: a lie told by totalitarians and believed by fools.
    9. Re:Exactly what you're doing by Again · · Score: 4, Insightful

      (Or btrfs on a Linux distro)

      Are you honestly suggesting using an in-development filesystem for backup purposes?

    10. Re:Exactly what you're doing by Vellmont · · Score: 3, Insightful


      if it was medical records i'd be storing 5 copies in 5 geographically distinct locations, each with their own backup for the backup. i'd be checking the MD5's each day on all the backups to ensure they can be accessed when i need them

      I can about guarantee you that nobody stores medical records in this way. And realistically, why should they? 5 different locations is insane for just about any piece of data.

      Geeks tend to go overboard when it comes to data paranoia and worry too much about technology, but then forget about all the human problems that go on. Most data loss doesn't occur from some geographic catastrophe where a super volcano destroys half a continent. More often someone changes some critical path of the backup scheme and the whole she-bang comes crashing down. Super-redundant geographic co-location can't save you from one idiot that didn't understand changing one critical name silently took down the backup scheme.

      --
      AccountKiller
    11. Re:Exactly what you're doing by temojen · · Score: 2, Insightful

      If disks in the safe deposit box are fast enough to access, running to the store to buy a generic power supply is fast enough recovery.

    12. Re:Exactly what you're doing by jon3k · · Score: 2, Interesting

      Are you doing disk to tape to tape or disk to tape and the disk to tape again? or disk to disk to tape/tape? are you using LTO4? Just curious, because that seems like a lot of data to backup to tape (twice! every night!)

  2. bzip2 by Colin+Smith · · Score: 5, Funny

    And optar:

    http://ronja.twibright.com/optar/

    You know it makes sense.

    --
    Deleted
  3. GMail Drive by sopssa · · Score: 2, Funny

    Unlimited space with several accounts.

    1. Re:GMail Drive by 0100010001010011 · · Score: 4, Interesting

      That's what ZFS is for.

      mount -t gmailfs /disk1 -o username=gmailuser,password=gmailpass
      mount -t gmailfs /disk2 -o username=gmailuser,password=gmailpass
      mount -t gmailfs /disk3 -o username=gmailuser,password=gmailpass
      mount -t gmailfs /disk4 -o username=gmailuser,password=gmailpass
      mount -t gmailfs /disk5 -o username=gmailuser,password=gmailpass

      zpool create gzfs raidz1 disk1 disk2 disk3 disk4 disk5

      Actually.... I think I just found my project for the evening. I mean it's already been done with 12 USB drives

  4. Amazon AWS? by TSHTF · · Score: 4, Interesting

    It might not be the cheapest option, but with Amazon's AWS, you can snail mail them a copy of the drive with the data and they're store it in S3 storage buckets.

  5. Different manufacturers by idiot900 · · Score: 4, Insightful

    Hard drives are ridiculously cheap these days, especially for how much data you are storing. You may wish to consider buying drives from different manufacturers but of the same size to put in a single mirrored set. This way if there is a problem with a particular batch of drives it won't ruin everything.

  6. Tape is your friend by chill · · Score: 5, Informative

    LTO tape, properly stored, will outlast burned optical media and hard drives. Great stuff and designed specifically for what you're talking about.

    http://en.wikipedia.org/wiki/Linear_Tape-Open

    --
    Learning HOW to think is more important than learning WHAT to think.
    1. Re:Tape is your friend by Anonymous Coward · · Score: 2, Funny

      Go Betamax!!!

    2. Re:Tape is your friend by cruff · · Score: 3, Informative

      I agree, when the tapes are stored in proper environmental conditions. You don't need a library, just use some stand alone tape drives. Also look at the claimed media lifetime and recovered bit error rate figures to see if you are choosing the right tape drive/media.

    3. Re:Tape is your friend by Saint+Aardvark · · Score: 5, Informative

      Couldn't agree more. A tape library (as in autochanger) might be out of your budget, but a simple tape drive wouldn't be too much -- say $5000 for an LTO4. Media is $50-$100 or so depending on where you shop. Seriously, you're not going to find a reasonable way of storing that much data anywhere else.

      BTW, if you're not a member of LOPSA, you may want to seriously consider it. Even if you're not a sysadmin, this is definitely a sysadmin-type question, and their mailing lists are second to none. It's an excellent resource.

    4. Re:Tape is your friend by mengel · · Score: 3, Insightful

      There's some code lurking in the amanda backup package I did a while back for "RAIT" (RAID with tape instead of disk) to make a stripe-set of tapes, if you need several tapes worth of data in one set, with redundancy.

      On the other hand, while LT04 tapes are about half the price ($40) of cheap 1TB disk drives ($80), the tape drives are ablout $2k apiece, so depending how many data sets you want to keep, and for how long, the disk drives may really be cheaper...

      --
      - "History shows again and again how nature points out the folly of men" -- Blue Oyster Cult, 'Godzilla'
  7. Amazon S3 by friedo · · Score: 3, Informative

    It can get a little pricey for huge datasets, but Amazon S3 now has an option where you can ship your data on a big set of disks directly to them, they will import everything into S3, and it will live there forever. The nice thing about S3 is unlike physical disks, it can grow essentially forever, and comes with retention and redundancy guarantees. And once your stuff is in S3, you can recycle the same disks to mail them more data.

    1. Re:Amazon S3 by Andraax · · Score: 2, Informative

      I hope so. A 3TB dataset on Amazon S3 would run $450 / month for storage.

  8. Go with Blu-ray by sabreofsd · · Score: 3, Interesting

    With the advent of 2TB drives, you could easily combine 3 of these with software RAID 5 as you suggested. Depending on how long you need to keep the data, recording them to dual-layer blu-ray disks might be a better solution. Ya, it's a lot of disks (you can buy 100GB discs now), but they'll last longer and you don't have to worry so much about mechanical failure or needing a certain OS when you want to restore them.

    --
    Sabre
    1. Re:Go with Blu-ray by eldepeche · · Score: 3, Insightful

      I thought burned optical discs started to degrade after a few years. Have they solved this problem?

  9. Drobo fan and user by Lvdata · · Score: 3, Interesting

    You might look at a www.drobo.com as a set of 4,5 and 8 drive enclosures. 1 TB disks gives you 3 TB usable space with a 2 drive failure tolerance. I have the older 4 bay drobo (2 for myself, and 2 at separate clients offices). It is much simpler to use, and will scale to your 2-3tb use and allow mismatched drives that normal raid will not use. Get a enclosure to start with, and then financing permitting, get a 2nd for Drobo redundancy. Not the fastest or cheapest, but reasonably good by both accounts, and simple to use.

    1. Re:Drobo fan and user by pavera · · Score: 3, Informative

      You misunderstood the post. He needs 2-3TB PER CLIENT, not 2-3TB total.

  10. Re:Exactly. by TooMuchToDo · · Score: 5, Informative

    Because Amazon can be *expensive* compared to doing it yourself ($$$ for data in, $$$ for data out, $$$ for monthly storage). But heh, what do I know. I just manage the storage for one of the LHC detectors (5PB spinning disk, 17PB tape). Amazon is good when you've got VC money or have no IT folks.

  11. Re:Practice your Recovery Method by __aastpl2241 · · Score: 2, Interesting

    Magnetic disk are not a permanent storage, AFAIK not even for medium, they break, when they mustn't. Optical such as blueray kept in a really safe place; hidden from light, humidity and temperature variations can last much longer. Hard Disk failure may always occour, i had a raid 1 at home.. both disk failed at the same time... You may even do something good such as compressing your data and adding some disks for additional safety with some kind of error correcting code spreaded in them.

  12. I'd encrypt the data and... by Rivalz · · Score: 5, Funny

    Label it something like complete american idol blueray collection and upload it on p2p to piratebay. every couple years rename it to some other horrible popular tv series. It will be self sustaining form of storage with infinite number of redundant hosts.

  13. Use RAID6 not RAID5 by jbridges · · Score: 3, Insightful

    I would use RAID6 not RAID5, since 2 drive failures means data loss with RAID5, while it takes 3 drive failures to loose data on RAID6.

    Linux MDADM has supported RAID6 for years, it's stable.

    I would mix and match drives, not buying all the same model from one maker. One Samsung, One WD, One Hitachi, One Seagate.

    That gets you 4TB in 4 drives, and unlike a RAID1, any 2 drives can fail with no dataloss.

    You can further ensure no dataloss by making a second copy using different brand drives for each clone.

    Eight 2TB drives is around $1500. Not bad for a very safe 4TB backup.

    1. Re:Use RAID6 not RAID5 by jbridges · · Score: 2, Insightful

      Just because you had some problems with Samsung means nothing about their general reliability.

      A few specific models have had problems, such as the IBM "Deathstar" models, or the recent Seagate firmware problems, but there is no evidence that whole brands are less reliable.

      Read the Google report on drive brands, there are no clear winners or losers across brand lines in their exhaustive real world tests.

  14. use a tape drive by Lehk228 · · Score: 3, Informative

    you make the assertion that a tape archive would be impractical, but really it is the most practical solution. the drive will set you back a couple thousand, but 800 gig tapes are only around 40 bucks each, and they are engineered for data storage unlike hard drives. this will only cost $160 per 3 gig dataset, or 200 if you use par2 files and an extra tape to make it recoverable in case a tape does fail.

    --
    Snowden and Manning are heroes.
  15. Re:Exactly. by Anonymous Coward · · Score: 5, Insightful

    Ok, yes, we see you know a lot about this.

    So what's your recommendation?

  16. Blu-ray not the answer by trentblase · · Score: 2, Insightful

    Who wants to burn over 100 discs per client? I guess they make automated disc burners, but that's a little nutso. Plus, with that many discs you have a high chance of failure, so you'll need some kind of ecc scheme (parity discs?). I'd also have to vote for hard drives, although I agree with the shortcomings of that solution.

  17. Re:Blu-Ray by snowraver1 · · Score: 2, Insightful

    Cheaper how exactly? Even if you could get BR discs at $2 each, it would cost $80/TB, and I havn't seen BR discs even close to that cheap. That doesn't include the writer which I belive are still $400. For the cost of the writer alone, you could purchase 5TB of HDD.

    --
    Copyright 2010. All rights reserved. This comment may not be copied in any way including, but not limited to caching.
  18. Re:Exactly. by snikulin · · Score: 3, Funny

    Huh, don't you see has has Too Much to Do?

  19. Agree with the tape option..;. by klubar · · Score: 3, Informative

    Tape is probably your best option. You can buy at DAT-5 (or even a DAT-4) tape drive for not very much. The tapes cost about $10 to $30 each (depending on what tape option you choose). Make 3 copies of the data set, store one onsite, store another offsite in a secure/climate controlled facility and send the 3rd to the client. Buy a spare tape drive and use both to make writing across tapes easier. There is a wide variety of software to write to the tape; we use the aging Retrospect.

    The disk options is just way too complex; if anything, skip the RAID option and just store 2 copies. Putting the RAID sets back together and finding the RAID software will be nearly impossible in a couple of years. Use some standard formatting on the drives (FAT, NTFS, etc.) and you'll be good to go for the next 15 years.

    1. Re:Agree with the tape option..;. by iluvcapra · · Score: 2, Interesting

      As someone that did a lot of backing up (and maybe restoring if I was lucky) to DDS DATs in the earlier part of the century, I can assure there's a very good reason the drives are so cheap now :) The reliability was atrocious and at $10 a cart DVD-R is quite competitive.

      --
      Don't blame me, I voted for Baltar.
    2. Re:Agree with the tape option..;. by Bigjeff5 · · Score: 2, Informative

      LTO-4 drives can be had for under $2k if you shop carefully, and a 1.6TB tape can be had for between $50 and $80.

      Cheap, simple, and far more reliable long term than hard drives. They also support Write Once Read Many, which is a regulatory requirement for some industries. Getting WORM on a hard disk would be a really neat trick, particularly the part about guaranteeing that it cannot be written to more than once.

      A better way to look at it, is disk is best for quick backup and quick recovery from failure (which is very important for enterprise environments). Tape is for archiving and as a last protection against catastrophic failure.

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
  20. Re:Tape is crap anyway. by afidel · · Score: 2, Informative

    I didn't see you mention LTO or DLT in your little rant against tape, both are superior to any other method for the storage of bulk offline data.

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  21. Never us DVDs as long term storage. by strangeattraction · · Score: 4, Interesting

    Repeat never use DVDs as long term storage. I have seen them go unreadable anywhere from 2-5 years. I have fired up disk drives 10 years later with no problems. They are cheap reliable and fast. Don't try and get fancy just compress and store data sets over multiple volumes. Don't use RAID.

  22. Re:Exactly. by hawkeyeMI · · Score: 2, Informative

    I already use S3 for some things. Unfortunately it would be about $500/month/customer's data on S3 right now, so that's out. I could buy a whole computer with hard drives every month for that.

    --
    Error 404 - Sig Not Found
  23. Re:Exactly. by Anonymous Coward · · Score: 2, Funny

    I think his/her recommendation was: 17PB Tape.

  24. Re:For what it's worth by dgatwood · · Score: 2, Funny

    Probably the giant rare earth magnets I stored in the next box over.

    You're welcome. :-D

    --

    Check out my sci-fi/humor trilogy at PatriotsBooks.

  25. a petabyte class cluster file system ? by TravisHein · · Score: 2, Informative

    how about a cluster file system like http://www.moosefs.com/ , where it works well on commodity or even end of life by current standards of hardware. Redundancy is achieved through several chunk server nodes in the system. Performance and size can be dynamically scaled over time by adding more nodes or disks to the system.
    Their project is GPL licensed, requires very little effort to set up, and likely much less effort over time to maintain and administer than a RAID or Tape system likely would, with the benefit of being able to choose what data is online or offline (by its presence on chunk servers being connected at the time).

  26. Depends on frequency of access by adosch · · Score: 4, Informative

    I work as a contractor for the USGS and the projects I've been involved with host, archive and provide means for customers to access all our different satellite data products. We've got a Long-term archive method for tons of data products (digitally and tangible) and I can honestly tell you the first thing that always comes up is: how often will the data need to be accessed?

    For the longest time (almost a decade) we used 3 big, STK tape silos for data archive and retrieval for custom orders. The problem behind that type of design is we used a archive in a completely wrong manner in the fact that we tried to use it as a archive and a quasi-online retrieval system into a caching filesystem. We had tape mount counts in the hundreds and thousands, constant mechanical tape issues because of the excessive use, ect. We actually decided to move it all to online storage using enterprise RAID (EMC Clarion) and moved to a small LTO-4 tape unit for almost permanent, maybe-once-in-a-great-while storage and the rest we leave completely on spinning disk and control the access to it via application layer network protocols as needed.

    IMHO, I really think it's going to depend on the access frequency of your data. If that custom needs their data once, and maybe never again in case they lose it, put it on tape. If it's a requirement they can get the data from you any time they want and you've got the hardware and administrative resources, power and bandwidth, put it some RAID.

  27. Just copy everything to you unlimited drive! by mswhippingboy · · Score: 2, Funny

    I've been doing backups by copying everything to my unlimited drive for years now. It's amazing - it never fills up!

    Just type
    copy Edit.* NUL
    at your command prompt (or cp * /dev/null if using Unix).

    One day I'm gonna look to see how much data I have in that damn thing!

    --
    Sometimes the light at the end of the tunnel is the headlight of an oncoming train.
    1. Re:Just copy everything to you unlimited drive! by hawkeyeMI · · Score: 2, Funny

      You have to retrieve it from /dev/urandom

      --
      Error 404 - Sig Not Found
  28. Re:Exactly. by TooMuchToDo · · Score: 4, Insightful
    Either MogileFS, Lustre, or possible Hadoop (depending on the type and size of the data). Any sort of distributed file system where multiple chunks, replicas, etc (3 is a good number, more is better if you have cheap disk and deduping at the filesystem level) are constantly available.

    Feel free to ask more questions.

  29. Re:Exactly. by TooMuchToDo · · Score: 3, Interesting

    Almost forgot to add. Never pay for expensive disk systems. Put the intelligence into your application instead. It'll scale faster and much cheaper. You also aren't locked into a technology (and instead, can enjoy the falling costs of storage, both spinning and SSD).

  30. Not as complicated as you think by dbIII · · Score: 2, Informative

    If the setup is raw files on tape (the stuff I've read in from reels recorded in the 1980s) or tar (stuff from the 1990s) it's not a big deal at all. It's only two-bit closed source single developer efforts on MSDOS that are difficult for anyone to read if they have the hardware.
    With recent backup systems like AMANDA you can just dump a file with dd or similar and the instructions on how to deal with the data are there in the header in ASCII! It couldn't possibly be easier.
    Also the "proper environment" is a lot easier to achieve than with storing hard disks, putting them in boxes in a warehouse without a leaky roof is most of the way there. Those tapes from the 1980s I'm talking about were exposed to 30C temperatures and high humidity every year, which is not ideal. Theoretically they were spare copies that were only made for transport and only had worth after the originals and copies were mistakenly thrown away by the clients.
    The "people" problem is very easy to deal with. You make a lot of people responsible for backups of the department they are in, and make it VERY easy for them to do the backups. Somebody in small department X knows to put a tape in a few times a week, and so on, so you get a lot of people that know for certain that there are backups and how old they are. When the inevitable happens and things get lost or deleted there is no longer the panic of tracking down the IT guy and asking lots of questions whether something is lost or not. Instead of spreading panic you just get mild annoyance that it will take so long to get it back.
    Also when one person leaves there are still many that know the system.
    Of course using rsync to a machine elsewhere to get a copy of everything is useful, but that is not a backup, it's a copy, so it shouldn't be the only option. With things like that you eventually purge old deleted files or write over old versions of files, while the tape from two years ago has what was on there two years ago.

  31. Another Option / Definition issues by bruciferofbrm · · Score: 3, Interesting

    A problem I have here is the definition of 'long term'. To each of us it means something different.

    In my job I have to archive 1.6 terabytes of data per day, and keep it around for 45 days (which, BTW, is not my definition of LONG TERM). For this task I utilize Data Domain storage, which utilizes data deduplication techniques for massive compression.

    What you find is that at the block level your data may in fact be incredibly deduplicatable. In my case it very much the situation. I am currently storing 86 terabytes of rolling archives within 2.5 terabytes of physical disk space.

    The problem with any technology you use for 'long term' storage is the ability to read those archives later. Assuming the media doesn't self degrade inside of the time frame you call 'long term', you must have the tools to read that media again. If you use BluRay, then you must store a compatible drive with it. (Nothing says Sony will not change the standard in two years and make all current drives obsolete, so no one makes them any more). Tape is worse, in that in two major model revisions, drives wont be able to read your media because its density is to low for the new drive head technology. Hardware based disk raid has the issue that the controller the raid was built with needs to stay with that raid. Another controller from the same manufacture, with the same model number, but a different firmware revision may not be able to figure out the raid, and declare the drives empty. Software raid is a little easier to deal with as long as you keep a copy of the OS you used to create it with in the same box. But then, during your defined 'long term' period, will you still have access to a system you can even plug these drives into, or run the OS on?

    What you end up dealing with in reality is that as an archivist, you either ignore these facts, or you invest in a constant media / technology refresh and spend large amounts of time keeping your archives on the latest storage available.

    Of course, all this falls apart if your definition of 'long term' isn't as long as some will project. In my case, my archives roll over every 45 days. I could easily keep that data alive for years on a live piece of hardware with a service contract. If I do not trust that hardware enough, I can buy two and replicate between them. (which, actually I am, for disaster recovery purposes)

    With deduplication my (acknowledged) high initial investment quickly outweighs the cost of single purpose drives holding one copy, and wasting unused space. My purchase cost was less then $60k, but if I had to store all of that data in its raw form, my costs would be in the millions. However, if the data is not deduplicatable, then of course it is a moot point.

    Each answer has it flaws. You decide which risks are acceptable, plan your best to deal with obsolesce, and define your definition of 'long term'. You also have to be ready to change your solution, when the one you choose today, fails to be the right solution for your needs in 5 years.

  32. Re:Exactly. by raddan · · Score: 2, Interesting

    Why distributed FS and not something like live mirroring/shadow copy? I wonder also... what do you consider an "expensive disk system"?

    We played around with DIY JBOD a bit (i.e., moving the complexity up into software) because it seemed a lot cheaper, but we have yet to get the thing to operate as reliably and simply as our fibre channel RAID units. The main problem we're running into is that for SATA to be practical, you need to multiplex several SATA disks onto single SATA ports, but that software support for doing this (at least in Linux and Solaris) is terrible. SAS interconnects are obviously better in this regard, but the hardware doesn't end up being any cheaper than standard fibre channel RAID, so you end up spending the same amount of money but doing more work because the spanning/striping/whatever is not managed for you. Also, under Linux, we've been somewhat dissatisfied with mdadm compared with something like OpenBSD's bioctl (which is easy to use and AWESOME). Sadly, OpenBSD is severely limited in some other areas that make it inappropriate to manage large disks (very large filesystems are not possible... fsck is prohibitively expensive with large volumes... etc...).

    The other thing I've been wondering, and it is unclear to me whether distributed filesystems solve this is: it seems like they're designed to spread data around on many nodes. Are they also good for allowing access FROM many hosts? E.g., our StorNext system allows us to plug any client into a fibre channel switch and simply use the disk without really having to think through what that means. But StorNext limits us because we can't plug, say, an OpenBSD machine into it (proprietary drivers, no client software) and that licensing can get expensive fast. StorNext's documentation also leaves something to be desired, and their salespeople do not inspire confidence (none of them could tell us how to actually install a license on a machine... I had to figure it out myself).

  33. Re:Exactly. by TooMuchToDo · · Score: 2, Interesting

    Why distributed FS and not something like live mirroring/shadow copy? I wonder also... what do you consider an "expensive disk system"?

    Distributed file systems are great because your limitations are (almost) always going to be hardware. Want 1000 boxes serving up your content? Get 1000 commodity boxes with disk. Need 10000? Also, not a problem. A box filled with raw disk is WAY cheaper than an EMC, Nexsan, etc (i.e. expensive disk system).

    Serving over Ethernet should be fine, as you can always bond network connections together to increase throughput from your storage boxes to whatever boxes are processing the data (or even process the data *on* the storage boxes). I've bonded 8 gigabit ports together on some of our storage boxes without problems (Linux Kernel 2.6). I'm not a fan of fiber solutions. They're expensive, and I can get gigabit networking gear cheaper.

    In short, stick to commodity hardware and open source distributed file systems. I know you're a little far along for that, since we had the opportunity to build from the ground up.

  34. Re:Exactly. by Tablizer · · Score: 2, Funny

    I just manage the storage for one of the LHC detectors

    But your storage only has to last until December 2012.