Slashdot Mirror


Hard Drives as Backup Media?

rootus-rootus asks: "A funny thought struck me as I was going over the life expectancy for tape media for backups... Since the size of 3.5" hard disks is surpassing 100GB in a reasonably inexpensive package, has anyone thought of using them as backup media, as in a jukebox or autoloader? The access times and data transfer rate for data stored on them would make backing up databases, etc. MUCH more palatable (200+GB takes a LONG time to dump to tape for a full backup). Any thoughts on the matter?" Bet you've thought about this question before, haven't you? Has anyone done anything like this? If so, how well did it work?

17 of 47 comments

  1. Not good by autocracy · · Score: 2

    It's been thought of, and rejected. The reason for this is that the data storage and mechanical parts are contained in one unit, and failure of either makes the other useless. This means that if your drive stops spinning, but your data is fine you can't get to it. This wouldn't be a problem with removable media because you can change the read/write device.

    --
    SIG: HUP
    1. Re:Not good by Liquor · · Score: 2, Insightful
      It's been thought of, and rejected
      But it was rejected back when the cost of the mechanism was far higher than that of the media alone. This suggestion is that the 'Media' IS the mechanism.

      When the cost of a TB of storage on IDE hard drives is lower than the cost of a TB of tape media (at least when you include the amortized cost of the tape drive - though it's becoming true without that nowadays) then "if your drive stops spinning, but your data is fine" winds up in the same category as any other tape failure.

      And I've yet to see a tape drive mechanism failure that didn't manage to corrupt (or even destroy) a tape.

      And in my experience, bad tapes are even more likely than failed hard drives.
      --

      Liquor
      Sanity is a highly overrated commodity.
  2. Firewire Solution by clark625 · · Score: 2

    I purchased a couple of 80GB firewire drives for my backup needs. They ran about $275 each after shipping, though I'm sure it's cheaper now. Every day I bring one to the office with me and replace it with the one that was plugged in the previous day. This allows me to do full backups every night and data recovery takes almost no time at all.

    On the other hand, this isn't a perfect solution for most companies. First, it would be easy for me to bang the hard drives and have them not spin up. They are also a lot bigger than a tape cartridge. But they do save me lots of time--and that means a lot. I really don't expect these drives to last forever with the "thrashing" that gets done to them every night; but since they aren't terribly expensive (for my company) I don't really care if I have to buy another.

    --
    Long, cute, or funny Sigs are just another form of over compensation, used by geeks, nerdz, etc.
    1. Re:Firewire Solution by Delirium+Tremens · · Score: 2
      Assuming you keep backups for hardware crash and disaster recovery, I think there is a little problem in your plan. Basically, during your day at work, your two firewire drives are at the office with you. What happens then if - say - a fire erupts in the building while you are out for lunch and destroys everything? Bye, bye, backups!

      I suggest you buy a third drive and keep the same backup habits. That way, in the case of a disaster, you would still have a one-day old backup at home.

  3. Certainly has been *thought of*... by adjuster · · Score: 2, Informative

    I've done a reasonable quantity of backup-solution deployments, from the simple "tape drive in a server" to multi-element DLT libraries. I've had customers "invent" a version of this idea on many occasions. Typically, the customer's "invention" takes the form of one of several similar ideas.

    What it comes down to, though, is that having multiple media, stored _away_ from the production copy of the data, is a good thing. Until recently, this has only been really convenient with tape media. With the advent of very convenient hot-swappable hard drive carriages and support for hot swapping of hard disk media in nearly every commonly used operating system, I don't see why hard drives could not be used-- but they would need to be treated with a little more physical care than tapes.

    The "problem" seems to come when the (typically small-business) customer "invents" this idea, buys one of those cruddy "centronics connector on the back" sub-consumer-grade plastic "drive bays", slaps a hard drive in it, and starts doing backups to one hard drive from another. The cycle is something like: (1) insert 2nd hard drive, (2) wipe 2nd hard drive, (3) copy contents of production hard drive(s) to 2nd hard drive, (4) remove 2nd hard drive. They don't think about what would happen if, say, between steps 2 and 3 the production hard drive(s) failed.

    If you're going to use hard disks as "tapes", I don't think there's anything fundamentally wrong-- but buy the same number of hard disks as you'd buy tapes-- and rotate them in the same manner. Treat them as large, mechanical tapes. Keep them away from the production data except when in use.
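
    The wipe-then-copy hazard and the "rotate them like tapes" advice above can be sketched as follows. This is a minimal illustration, not anyone's actual tool: each backup disk is assumed to be mounted as a directory ("slot"), the `BACKUP_DONE` marker file and the `incoming`/`current` directory names are hypothetical, and the slot is assumed to have room for two copies while the new one is staged.

```python
import shutil
from pathlib import Path

MARKER = "BACKUP_DONE"  # hypothetical completion-marker file name


def pick_oldest_slot(slots):
    """Return the slot whose last finished backup is oldest.

    A slot that has never completed a backup counts as oldest of all.
    """
    def finished_at(slot):
        marker = Path(slot) / MARKER
        return marker.stat().st_mtime if marker.exists() else 0.0
    return min(slots, key=finished_at)


def backup_to_slot(source, slot):
    """Copy source into slot without wiping the old copy first."""
    slot = Path(slot)
    staging, current = slot / "incoming", slot / "current"
    if staging.exists():
        shutil.rmtree(staging)        # leftover from an interrupted run
    shutil.copytree(source, staging)  # the old "current" is still intact here
    if current.exists():
        shutil.rmtree(current)        # only removed once the new copy is complete
    staging.rename(current)
    (slot / MARKER).touch()           # record when this slot last finished
```

    Always writing to the oldest slot means the newer backups on the other disks are untouched if the production drive dies mid-copy.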

    --
    The Attitude Adjuster, I hate me, you can too.
    1. Re:Certainly has been *thought of*... by ansible · · Score: 2

      ...but buy the same number of hard disks as you'd buy tapes...

      Yup. The exact type of the medium shouldn't be changing your backup strategy.

      Tape prices vary wildly, but figure on a buck or two per GB. Hard Drives aren't quite there yet, somewhere around four or five bucks per GB if you include a decent enclosure.
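
      A rough worked version of that comparison, as a sketch only. The per-GB figures are the midpoints of the ranges quoted above and the 200 GB size comes from the original question; the five-copy rotation and the $3,000 tape drive price are purely illustrative assumptions.

```python
def media_set_cost(backup_gb, copies, dollars_per_gb, drive_cost=0.0):
    """Total cost of `copies` full backups of `backup_gb` GB,
    plus a one-time cost for the drive that reads the media."""
    return backup_gb * copies * dollars_per_gb + drive_cost


def break_even_gb(drive_cost, tape_per_gb, disk_per_gb):
    """Total GB of media at which tape (with its drive) costs the same as disks."""
    return drive_cost / (disk_per_gb - tape_per_gb)


TAPE_DRIVE_COST = 3000.0  # assumed, not from the thread

tape_total = media_set_cost(200, 5, 1.50, drive_cost=TAPE_DRIVE_COST)
disk_total = media_set_cost(200, 5, 4.50)
crossover = break_even_gb(TAPE_DRIVE_COST, 1.50, 4.50)
```

      Below the crossover size the disks come out cheaper because no tape drive has to be amortized; above it, the lower per-GB media price wins.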

      I guess I'm old-school, but I still prefer tapes. They are also more robust to physical/temperature abuse, which I like.

  4. Robotics, Linux IDE hotswap, other factors by Adam+J.+Richter · · Score: 2

    A few notes on your idea:

    1. There is no need to build a mechanical autoloader. IDE controllers and removable drive bays are cheap, less than $25 per drive, making them much cheaper than a robotic loader, with greater reliability and response time to boot. IDE drives can be spun down when they've been idle for a while, so electricity consumption should be similar.

    2. I believe that Linux IDE does not currently support hot swapping of drives, although the PCMCIA drivers do support removal of an entire IDE controller, which is what happens when you remove a CompactFlash card.

    3. My understanding is that hard drives are not hermetically sealed but rather have air filters made of something similar to the material in a cigarette filter. I believe that when hard drives are not in use, they can accumulate dust internally and are more likely to have problems. You may also have problems with their greater sensitivity to being dropped and to static electricity. So, you may want to store them in sealed conductive bags.

    4. In my humble opinion, I think you have a good idea. I believe that disk-based backups are much more valuable to an organization because they're easy enough to use that people will save time by doing minor recovery tasks. In comparison, with tape backups, the effort of doing a restore can be so much that people will often opt to spend an hour regenerating their previous work from scratch instead.

  5. Re:Here's how we do it: by cloudmaster · · Score: 2

    I'm just "rsync / backup@remote:/backup/$HOSTNAME"ing every night to a box offsite that rotates the drive mounted on /backup every day when a backup's not running. It runs overnight when the network's not real busy, and works fairly well. I back up the really important/dynamic stuff on site on a daily basis with a 7-disk DVD-RAM rotation. It's the right balance of price/simplicity vs. data safety for my organization, and is pretty idiot-proof.

    The drives in the remote backup server (which could easily be co-located at your nearest ISP) aren't "removable", but they're certainly not permanent either. :)
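
    The nightly rsync described above could be wrapped in a small script along these lines. This is a sketch under assumptions: the `backup@remote` target and `/backup` path are taken from the one-liner, while the `-a` (archive) flag is an addition, since the command as quoted shows no flags and rsync needs one to recurse into directories.

```python
import socket
import subprocess


def build_rsync_command(hostname, source="/", remote="backup@remote",
                        dest_root="/backup"):
    """Build the argument list for the nightly rsync run.

    -a recurses and preserves permissions and timestamps (an assumed flag).
    """
    return ["rsync", "-a", source, f"{remote}:{dest_root}/{hostname}"]


def run_nightly_backup():
    """Run rsync for this host; raises CalledProcessError if rsync fails."""
    cmd = build_rsync_command(socket.gethostname())
    return subprocess.run(cmd, check=True)
```

    Calling `run_nightly_backup()` from a cron job would reproduce the overnight schedule; the function is not invoked here.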

  6. One reason why tapes are better than drives.. by cmowire · · Score: 2

    Ever drop a tape while taking it out of the bay and stuffing it into the tape store? I have. The tape was fine.

    Even ruggedized drives, when dropped from arm's length, are not going to hold up too well. Cheap drives will definitely not hold up.

    1. Re:One reason why tapes are better than drives.. by ansible · · Score: 2

      It obviously wasn't a DLT tape. I've damaged those by dropping them. Sometimes you can "fix" them by doing a retension, but sometimes not...

      I've never had a problem with DAT or VXA tapes, and those have had a fair amount of abuse from me.

  7. Speaking of mobile drive racks by unitron · · Score: 2

    Okay, this is sort of an off-topic rant, but can anybody tell me what's up with ATX tower cases with four 5.25 inch drive bays where only the upper two are useable for anything as long as a CD or 1.2MB floppy drive, because the standard ATX motherboard is in the way? In other words, the case is high enough and wide enough, but not deep enough. Anybody else fighting this particular frustration factory?

    --

    I see even classic Slashdot is now pretty much unusable on dial up anymore.

  8. Re:Here's how we do it: by penguinboy · · Score: 2

    You're saying which one is overpriced?

    400GB added to a PC - $1,200
    460GB RaidZone - $10,000

  9. Re:The fastest backup, the fastest restore by NevDull · · Score: 2

    Many people use this for backing up to tape. You break the mirror logically, then stream it to tape, then add it back into the mirror and resync. It speeds backups because you back up from a disk which isn't otherwise busy with head seeks to other parts of the disk, and if you're doing it with software RAID, likely off a completely different SCSI controller.

  10. Let me rephrase things by The+Mayor · · Score: 3, Informative

    I'm reading some of the replies and thinking to myself that the /. readers don't understand what a backup system is.

    A backup system is not simply redundancy (i.e. RAID). A backup system for files typically can recreate any version of a file requested by the user (as backed up according to the backup regimen). Thus, if you have nightly backups, you might keep every night for the past month, every month end, and every year end for a given document. RAID won't give you this.
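
    The retention regimen in that example (every night for the past month, every month end, every year end) can be sketched as a simple filter over backup dates. The cutoffs are assumptions: nightlies are kept for 30 days, month-end backups for 365 days, and year-end backups indefinitely.

```python
from datetime import date, timedelta


def is_month_end(d):
    """True if d is the last day of its month."""
    return (d + timedelta(days=1)).month != d.month


def is_year_end(d):
    return d.month == 12 and d.day == 31


def backups_to_keep(backup_dates, today, nightly_days=30, monthly_days=365):
    """Return the backup dates the regimen retains, oldest first."""
    keep = []
    for d in sorted(backup_dates):
        age = (today - d).days
        if age < 0:
            continue  # ignore dates in the future
        if (age <= nightly_days
                or (is_month_end(d) and age <= monthly_days)
                or is_year_end(d)):
            keep.append(d)
    return keep
```

    Anything the function does not return is a candidate for overwriting when media are rotated.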

    I'm familiar with some expensive IBM products that do this. However, they're expensive. Basically, ADSM (ADSTAR Distributed Storage Manager, or something) is a product that allows regular backups of products, and access to every incremental version of the documents. On the backend, it can be hooked up to a huge disk cache and a robotic tape library. The end result is terabytes of near-online access data, with automatic versioning. Pretty nice. And if your disk cache was large enough, it would never hit the tapes. It seems to me that this could be modified to remove the tapes and present what the user requires.

    I'm not aware of anything open source or free (as in beer) that does this. It would be really nice, though.

    Hell, I've always dreamed about an automatic versioning filesystem. Documents would be automatically versioned. You could use CVS to handle this. Perhaps you could do something as simple as have some code executed upon every file close for files that are opened with write access. When these files are closed, they are added as new versions of the document within CVS.

    When the disk reaches some capacity watermark, a disk cleanup agent would be invoked. Its goal would be to remove redundant versions of old binary files from CVS. Rules could be attached to the agent to perform tasks such as retaining specific versions of binary files (i.e. retaining the first version, the latest version, and all versions from the last named version).

    Users could tag specific versions of files. These versions would always be retained.
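
    One possible reading of those cleanup rules, as a sketch rather than a design: keep the first version, the latest version, every tagged version, and everything from the most recent tagged version onward. The version identifiers are hypothetical and no actual CVS calls are made.

```python
def versions_to_keep(versions, tagged):
    """Pick which versions of one file survive a cleanup pass.

    versions: version ids in oldest-to-newest order.
    tagged:   set of ids that users have tagged.
    """
    if not versions:
        return []
    keep = {versions[0], versions[-1]}               # first and latest
    keep.update(v for v in versions if v in tagged)  # tagged versions always stay
    tagged_positions = [i for i, v in enumerate(versions) if v in tagged]
    if tagged_positions:
        # everything since the last named version is kept as well
        keep.update(versions[tagged_positions[-1]:])
    return [v for v in versions if v in keep]
```

    The cleanup agent would delete whatever this function leaves out, and only once the disk passes its capacity watermark.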

    I know this would incur a significant performance hit for disk access. Perhaps I could limit such disk access to specific directories or mount points. In this manner, I could have a mount point for documents, all of which would be automatically versioned.

    Plugins for Explorer could be built to allow users to tag versions of documents and retrieve specific old versions of files. I'm thinking something like TortoiseCVS, a beautiful piece of software. In fact, for prototyping, TortoiseCVS would be enough.

    Now, is anything like that available? No? Perhaps I should do something about that.

    Cheers.

    --
    --Be human.
    1. Re:Let me rephrase things by Baconator · · Score: 2, Informative
      ADSM is now known as TSM, the "Tivoli Storage Manager". While it's nice in principle, in practice it has many flaws.
      • It's almost useless for a complete backup, since the client can't handle restoration of system-critical files (at least on platforms I've used)
      • Platform support isn't very good, and it's getting worse. You can only get the clients in binary, and the list isn't very long. Want Linux/PPC? Tough.
      • Authentication/xfer is entirely clear-text. This makes the system pretty much useless for backing up sensitive files.
      • Reliability seems dubious, and getting worse. I've seen both backups and restores mysteriously time-out or fail for some other reason, and it's NOT the network. Maybe it's just our setup here, but I sure don't trust it.
      So, for certain limited uses I'm sure it's swell, but it's hardly a panacea. It's also absurdly expensive, but I suppose that's par for the course.
    2. Re:Let me rephrase things by Big+Jason · · Score: 2, Informative

      I'll second that opinion. We use TSM 3.7 on a Sun E4500 with a STK L700 and an HDS 9960. We have an ELA with IBM so we get it for "free"; problem is, IBM support sucks rocks. IBM, like most "big" companies, is really made up of a bunch of subgroups, i.e. Tivoli, Lotus, Storage, Mainframe, AIX, Netfinity, etc. So when you have a problem, they all point at each other and the net result is the customer is screwed. I have an RS/6000 with ADSM/HSM and an IBM 3995-C64 optical library, and it is barely usable because it is so *damn* slow and unstable. The only platform where TSM works well is the mainframe, I hear.

  11. Heh, I've seen it done by randombit · · Score: 2

    Some grad student from China came here with a 6 gig IDE disk with all his data on it. I thought it was kind of weird myself, but I guess it worked out OK...

    Though drives will often die if left to their own devices [i.e., off] (we say they get lonely and kill themselves). Which would really suck if that was your backup, wouldn't it?