Slashdot Mirror


Hard Drives Instead of Tapes?

An anonymous reader writes "Tom's Hardware News weekly news letter has a very interesting article about Dr. Koch of Computertechnik AG who won the contract to build a RAID backup system for the University of Tübingen. Dr. Koch took several standard entry-level servers, such as the dual-Athlon MP, and add modern components and three large-caliber IDE-RAID controllers per computer, and a total of 576 x 160GB Drives."

73 of 465 comments (clear)

  1. Far more practical by Random+BedHead+Ed · · Score: 5, Interesting

    This is a much better solution than tape, really. It's predictable that the industry will probably move in this direction, now that the hardware is cheap enough and of high enough capacity to serve this function.

    Imagine: instant recovery. Your backup could be a usable image of your live server.

    1. Re:Far more practical by diverman · · Score: 4, Informative

      Yeah, I've seen this trend for a while now. Our backup system is also a large HDD raid setup. And for things that need long term storage, those eventually get spooled to tape. I'm sure long term storage will probably start going the way of DVD optical media or something similar (better capacity more likely).

      Yeah, the full usable image would be nice, but would probably require a shutdown for data consistency. The backup strategy would likely be similar to that of an Oracle system cold backup. :)

      -Alex

    2. Re:Far more practical by Servo · · Score: 4, Interesting

      I don't really see it as being all that predictable.

      The benefits of having backup to disk is of course speed. But what happens when you have a disaster? Your SOL, because your backup-on-disk system just got toasted too.

      The benefits of having backups on tape is that you can send those tapes anywhere. It might not be as quick as sending a file electronically, but when you are talking hundreds of gigabytes of data, it just isn't economical to do anything but tape.

      Tape will never die. Hardware may be cheap and high capacity, but transmission costs keep it from being feasible.

      You also need to take a look at space utilization. You can put a tape silo into a footprint that gives you much much more capacity per square foot than disk.

      --
      A slip of the foot you may soon recover, but a slip of the tongue you may never get over. -Benjamin Franklin
    3. Re:Far more practical by Random+Frequency · · Score: 5, Insightful

      cdr/dvdr uses a chemical substration process to have data written to it, and is nowhere near as stable as magnetic tape.

    4. Re:Far more practical by jhoffoss · · Score: 4, Interesting

      This concept raises some issues though. My employer owns two +1TB SANS and has them in separate locations, constantly mirroring from the production to the backup. But if you delete a file off of one, it is also deleted off of the other. So then how do you decide when to actually delete something from the backup, if you want it to serve as a tape backup? Other anomolies can occur as well. A drive died in our backup SAN which brought both SANS (and in turn, all of our servers, which run off of the SAN using fiber cards) to their knees immediately, because they were trying to write bits to that drive (big coincidence, but took us down for over two hours, and it could've been much longer than that...) Still, an interesting concept. But we still take tape backups daily, incrementally through the week and a full backup over weekends. Never put all your eggs in one basket, remember.

      --
      Linux: The world's best text-adventure game.
    5. Re:Far more practical by mfrank · · Score: 3, Interesting

      Why can't you ship a 200 GB HD somewhere?

    6. Re:Far more practical by shotfeel · · Score: 3, Informative

      The reported shelf life for CD-R is anywhere from 10-100 years depending on the type of dye and who you want to believe.

      My understanding was that for tape it is only 5-10 years, but that could very well be out-dated. What is the current shelf life for magnetic tape?

    7. Re:Far more practical by ninti · · Score: 3, Insightful

      Yes, I do have some 5 years old cd-rs lying around, they work fine. I was an early adopter because at the time I was tired of my ONE year old floppy disks and Syquest cartridges dying. Stop trolling.

    8. Re:Far more practical by winse · · Score: 2, Interesting

      one time a couple of guys I work with wanted a large live storage device to backup some dvds. They used a hardware raid card and a bunch of scsi drives until one day... the hardware raid card went out. Turns out that they couldn't recover the information off of the disks without a duplicate replacement. They couldn't find one ( of course this little project wasn't mission critical or they could have looked a little harder and purchased one ) but that seems to be a critical single point of failure that is overlooked sometimes

      --
      this sig is deprecated
    9. Re:Far more practical by ebh · · Score: 2, Insightful

      Well, eight years, but who's counting? I've seen tapes younger than that lose their oxide. I've seen others physically degrade and get eaten when you try to read them, sometimes taking the drive with them.

      OTOH, my father has 40-year-old punch cards that read just fine. (Course, that doesn't scale to terabytes.)

      The upshot is that for long-term (>10 years) backup, have a refresh plan in place, where the data is periodically verified, and if necessary, extracted and copied to fresh media. (I have some 15-year-old files I did this to, moving them from QIC tape to CD-R. Nothing was wrong with the tape, but I only have one QIC drive, which could fail at any time.)

      For my ultra critical data, I keep a backup of the backup locked in a bank vault.

    10. Re:Far more practical by diverman · · Score: 2, Interesting

      Yeah... I've heard of similar scenarios with RAID setups. That's why, if it is mission critical you not only run your RAID controllers with a hot spare, but you hav a cold spare ready to jump in, in case one of the hot controllers go bad. That way you can immediately get back to a hot-failover state, meaning no risk of down time. If you're really paranoid, you have 3 hot controllers, so if even two die you still have a 3rd.

      And if you're really paranoid, each one has it's own power source, etc, etc, etc. :)

      I can't believe I got a Score of 5 on that post though. heh.

      -Alex

    11. Re:Far more practical by Admiral1973 · · Score: 2, Interesting

      I disagree. Tape is still more reliable and easier to use than disk-based backups. We used to have an optical drive jukebox for backups, running on a Sun workstation. Last year we replaced it with a tape library running on Windows NT. The major reason for the switch was that the backup software for the optical system was no longer supported by the manufacturer and there was no replacement that would work for all of our servers (Windows, Novell, HP-UX). I liked the speed of the optical system but the disks only held 5 GB each, and the jukebox only held 32 disks or some other small number. The library holds 120 tapes, and the sizes are 50 or 100 GB each. We dupe the tapes every day and send the copies offsite. Data restore speeds are slower than the optical disks, but not by much. And we have a smaller library at our DR site in case of emergency. We couldn't do the dupe process with the optical disks. And while we stored some disks offsite, the management of what was where was way too difficult. The tape library has been a big winner for us.

      --
      Lousy minor setbacks! This world sucks! -- Homer Simpson
    12. Re:Far more practical by diverman · · Score: 4, Interesting

      Tape can be more reliable, largely because of its time to evolve. It's a more durable solution.

      Granted, most current tape systems support more storage than current optical systems. But optical backup systems are gaining popularity. I expect that as they become more popular, they will become cheaper, more reliable, faster, easier, and more spacious... just like every other form of technology that gained popular view.

      Tapes have been trust-worthy for some time now, but they are slow and bulky. As the amount of data to backup continues to increase, tape won't be able to keep up. Optical media has progressed in speed considerably over the last few years. I know that some media (CR-R's for example) have hit media thresholds... but they're still MUCH faster than tape at current speeds.

      I expect that optical media will continue to improve. In my original post I did state "or something similar", to allow for progress over current optical limitations. My main point really being that tape is dying out. An optical media is becoming a focus for long term storage media, and investment will go into improving upon it.

      -Alex

    13. Re:Far more practical by amorsen · · Score: 2, Interesting
      Mirroring is not backup. If you want to do backup with disks, treat them like tape. Tar up the files, put them on the disks. Reuse disks over time, just like tapes. For backup, the only difference between a disk and a tape is that with a disk it can be really fast to skip to a specific file or archive.

      The perfect solution to backups would be notebook SATA disks, which should hopefully appear soon. Hotpluggable, no bulkier than an LTO or DLT tape, screaming fast compared to LTO and DLT, and very hard to damage when powered down. Capacities are about the same. Unlike tapes, I can rescue data off of a disk without needing an expensive and fragile drive.

      --
      Finally! A year of moderation! Ready for 2019?
    14. Re:Far more practical by mt_nixnut · · Score: 2, Interesting
      I have been using HD backup for 2+ years now and would never willingly go back to tape. Long term gets burned to plastic (also cheap). All current data gets stored on numerous HDs that are on a machine in another building to protect against fire etc.

      Drag & drop backups that are fast and brainless. And the hand rolled system I built was cheaper than tape gear big enough and fast enough to do the same job.

      It also does not need exotic software, at least in my case since I just use cron and a set of rsync scripts.

      Easy, fast and getting cheaper every day. So I consider this to be non-news and sort of obvious and I am suprised more people are not doing it, other than they may have too much already invested in tape solutions to ever turn back.

      It is not the be all end all. For example if you want fast disaster recovery you may want something else. ( But I'm not sure that keeping a synced copy of your OS partitions on a spare HD is a terrible idea.) For the places that this makes sense I can tell you its great.

      FWIW

    15. Re:Far more practical by Tim+Doran · · Score: 3, Funny

      Shyeah, cause Zip drives are so much MORE reliable...

      "What's that clicking sound?"

  2. Compliance by sk3tch · · Score: 2, Interesting

    So a BIG RAID is somehow safer than many small RAIDS? Backups aren't just for the heck of it...some of them are required for compliance, i.e. the financial industry.

    1. Re:Compliance by Havokmon · · Score: 4, Insightful
      Backups aren't just for the heck of it...some of them are required for compliance, i.e. the financial industry.

      Oh PLEASE! I worked for, what was at the time, the 17th largest CC processor in the nation. Not so big, but lots of merchants. They bought a front-end (where your credit card terminals dial into), and built a backend settlement (so they didn't need FDR - who recently ROYALLY hosed everyone with a software update, including CHASE themselves. No, this software update was completely seperate from the SQL Slammer worm that took them down when it appeared.).

      Complaince, usually done by the OTS (Office of Thrift Supervision), is NOT ISO 9000 type stuff. Financial companies are CHEAP. Never forget that. Whatever is the cheapest solution, is the one that is used.

      As for tape backups - as an example: It took quite a bit of convincing to upgrade from the 4 drives that took two days to backup the whole network to a single Sont DLT drive. (Because $70/tape is a LOT of money)

      There were no 'compliance' worries at all.

      --
      "I can't give you a brain, so I'll give you a diploma" - The Great Oz (blatently stolen sig)
    2. Re:Compliance by stilwebm · · Score: 2, Insightful

      Those aren't just backups, but also archives for auditing purposes. The analyist scandals of the last couple of years really helped drive home the need for these archives.

      They of course are also important for business continuity, as Sept. 11, 2001 showed us when several large finacial firms had their data centers destroyed.

    3. Re:Compliance by sql*kitten · · Score: 2

      Oh PLEASE! I worked for, what was at the time, the 17th largest CC processor in the nation.

      That's not the financial services industry in any meaningful sense of the word, just a teeny tiny corner of it. If you're trading real instruments - think Wall Street, or the Square Mile - you need to keep everything around for 7+ years, and if anyone you've traded with in that time gets audited, you might be asked for your counterparty records. Not to mention the fact that you can trade instruments with a maturity date of 30 years. If in 29 years you want to know what positions you need to close out - that data has got to be there, preferably online, at worse in an easily accessible archive. Think commercial banking, pension funds and government treasuries.

      Financial companies are CHEAP.

      Goldman's alone spends $1B/year on IT. And they're not even the biggest investment bank. And there are mutual funds, hedge funds, commercial banks, etc, spending comparable amounts. Maybe small companies are cheap, but the big players will spend whatever it takes to get the job done - because it's worth it.

  3. Sound fine, but... by Quasar1999 · · Score: 2, Insightful

    What about being able to transport and store the information offsite?

    I mean, sure tape isn't great, but it's a lot more transportable than harddrives.

    --

    ---
    Programming is like sex... Make one mistake and support it the rest of your life.
    1. Re:Sound fine, but... by ashitaka · · Score: 4, Informative

      I don't know about the mega RAID systems descibed in this article but we're doing this with a couple of high-capacity IDE drives in a removable drive cage. The relevant system states and data are backed up to these drives daily. The time to get our databases and files up to running state in a disaster scenario is under three hours.

      --
      If you don't want to repeat the past, stop living in it.
    2. Re:Sound fine, but... by peterjhill2002 · · Score: 3, Interesting

      With the huge size of some databases, it would make more sense to connect to your offsite storage via fiber and store it there. There is no reason the backup disks need to be in the same room or building or state as the primary disks. Then you also solve the problem of reliably getting the data offsite in the first place. This is of course more expensive than renting a storage locker and driving a dat tape over to it every night, but I don't think Citibank is driving too many tapes around town. (just a guess)

    3. Re:Sound fine, but... by afidel · · Score: 4, Informative

      Well as the article states this implementation isn't really for offsite There's one aspect in which Dr. Koch's backup system can't keep up with tape solutions: storing the backup medium in another location after the backup has been completed. but it could be done pretty easily. Non-operating shock capacity on the D540X is 300G's for 2ms which is pretty darn good (plastic tape housings might shatter under a similar load). I also like the ultra low failure rate .5% (hmm, this and the data from storage review shows that the D540X and D740X line seem to be some of the most reliable out there...) I know our DAT failure rate was in the same ballpark.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    4. Re:Sound fine, but... by Bonker · · Score: 5, Informative

      As someone who works in IT in the financial industry, let me tell you a little bit of what kind of requirements we fulfill. First of all, every system is backed up on a regular basis. For critical systems (systems that handle account numbers in any way), that schedule is daily or even hourly.

      All systems have live fail-overs. When not required by law, and they frequently are, such systems are required by the demands of profit. If financial transactions falter for a *second*, it means money lost.

      Back-up media is triple redundant and incremental over 5 days. Backup irregularities of any kind are logged, investigated, and acted upon by at least 3 individuals.

      Copies of backups are stored both on site and off-site in a secure location provided by our insurance provider. We make frequent trips to this secure location daily in order to deposit backups. These procedures are audited and reviewed on a regular basis by both internal auditors and regulatory board auditors.

      Tape is just a little more reliable than IDE in this kind of situation. Tape is going to be more recoverable, even in case of a long drop or serious auto accident between point A and point B. If necessary, teap will also survive shipping better.

      Sorry, guys. As reliable as IDE drives have become, they're just not as durable as a tape cartridge. With the sheer amount of backup we keep, it's also significantly cheaper.

      --
      The next Slashdot story will be ready soon, but subscribers can beat the rush and slashdot the links early!
    5. Re:Sound fine, but... by override11 · · Score: 2, Insightful

      OK, but look at how he has it setup: several systems with lots of HDD's in each. You could presumably setup one of those systems to store redundent information in it. You could designate 2 systems as hot spares, and swap them off site. :)

      --
      No I didnt spell check this post...
    6. Re:Sound fine, but... by Zathrus · · Score: 4, Informative

      High end mag tape cartridges store 50GB.

      Uh, I think you better look at tapes again. AIT-3 is 100GB uncompressed. Super-AIT is 500GB uncompressed. Transfer rates for Super-AIT are in the 30 GB/s range uncompressed. All of these numbers go up with compression, which is built into the tape drive hardware -- assuming you're storing compressable data.

      All in all, they're likely to have a higher sustained transfer rate than IDE drives, and are going to be more reliable, less costly in bulk, and easier to handle.

      Of course they're silly for small systems... but that's not what we're talking about at all.

    7. Re:Sound fine, but... by GigsVT · · Score: 4, Insightful

      Why do people in industries with strict uptime or reliability requirements always act holier-than-thou about the whole issue, as if their way is the only right way?

      Not all companies need five 9's. Not all companies lose much money if data or systems are not available for a short time. In fact, I'd say it's the majority of companies that fall into that category.

      Extreme reliability and availability are extremely expensive. For most companies, it's not worth it.

      I agree with you, Large ATA RAID probably isn't for your industry, it's not right for everyone. It does work fine for lots of people though. I expect to see it cover much of the 5TB range of near-line backups in the next few years.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    8. Re:Sound fine, but... by kcarlile · · Score: 2, Interesting

      4.7 GB vs. 300 GB AIT3 (or even 100 GB AIT2). That's a whole hell of a lot of DVDs. I've got clients in the graphic arts industry who archive to DVD-RAM (2.6 GB/side, in this case), and one job spans several of these. Imagine backing up the whole office (and all of the jobs) every night. Their backup (and truly rather insufficient at this time) is a 4 tape AIT 25/50 autoloader for each day. And I agree with the above poster about hd vs. tape. Remeber, MTBF mean Mean Time Between Failure. Not Mean Time Between It Might Fail. It means FAIL. As in, they will always fail. And what happens to the RAID when multiple drives fail?

    9. Re:Sound fine, but... by cymen · · Score: 2, Interesting

      Tangent: FreeBSD 5.0 has filesystem snapshots. Anyone interested in a more home-grown setup should take a look at that... Is there anything similar for ext3 or reiserfs?

  4. I don't know if that is a good idea by Tighe_L · · Score: 4, Insightful

    There has to be a better way than relying on anything stored in magnetic format, optical I think woudl be preferable, and resistant to EMP.

    1. Re:I don't know if that is a good idea by Alpha_Nerd · · Score: 3, Funny

      Yeah damn that EMP... If a neutron bomb goes off, I sure as hell don't want to lose my 100+ TB pr0n collection.

    2. Re:I don't know if that is a good idea by csnydermvpsoft · · Score: 3, Insightful

      Storing data offsite is the only good way - even optical media is still vulnerable to a nuclear strike. Of course, if you get nuked, your backups will probably be the least of your concerns.

      The best backup solution would be a bunker with hard drives, backed up via fiber in real time.

    3. Re:I don't know if that is a good idea by bmongar · · Score: 2, Funny
      There has to be a better way than relying on anything stored in magnetic format

      We could punch into paper tape.

      --
      As x approaches total apathy I couldn't care less.
    4. Re:I don't know if that is a good idea by angst_ridden_hipster · · Score: 5, Funny

      Etched stone seems to have a staying power of approximately 10,000 years, even with some outdoor exposure.

      Earthenware tablets, made of clay fired at low temperatures (1816F/991C), seem to do nearly as well, while stoneware tablets, made of clay fired at high temperature (2345F/1284C), last about the same as actual stone. Ceramics have relatively high resistance to moisture and thermal variation. Depending on the clay composition and the application of glazes, there is variable resistance to acid. Ceramics do not handle physical shock particularly well.

      Glass can last thousands of years, but is vulnerable to shattering or acid.

      None of these, however, are earthquake-resistant. Outside of the immediate blast radius, they're good against nukes.

      Etching into stainless steel is good, although in the event of a nuclear attack, this would be succeptible to melting (or self-destruction due to induced current) within a certain area. It handles thermal and moisture extremes pretty well, but doesn't handle acids well.

      Stamping into gold foil is expensive, but quite durable. It's immune to some of the chemical risks posed by steel, but is more likely to be stolen. It's also not as hard, thus leading to risk of data corruption or loss via impact.

      Parchment, preserved lamb or sheep skin, can last a very long time (on the order of 2,000 years) in the right conditions. It does well with exposure to electromagnetic radiation, but deals badly with moisture or excessive dryness, and is highly vulnerable to acid.

      Delay-line broadcast (reflecting your data with a laser off of a distant object, and rebroadcasting ad infinitum) is fairly reliable until occlusion of the data path occurs, or the transceiver is smashed, unplugged, EMPed.

      Yeah, data preservation is hard in the long haul.

      --
      Eloi, Eloi, lema sabachtani?
      www.fogbound.net
    5. Re:I don't know if that is a good idea by belloc · · Score: 4, Funny

      Etched stone seems to have a staying power of approximately 10,000 years...

      This reminds me of a formal budget proposal submitted by my predecessor many years ago (I run the IT dept. at a small college). He gave a very detailed cost breakdown of several means of replacing our then-current backup and recovery method for our file server's RAID array (we were very small way back then). He had costs for hardware, time, and manpower for just about every option available at the time.

      His last option, put forth just as seriously and fully as the rest, included the cost of having a team of monks write out the data by hand onto reams of paper, bit-by-bit. Then for recovery, the monks would re-enter the data back into the computer, bit-by-bit. On the pro side he argued that monks work cheap and are very dedicated to what they do. But the con was the time involved for this method was somewhat prohibitive. ;)

      Belloc

      --
      I got more rhymes than Jamaica got Mangoes.
  5. This would work for limited installations by Anonymous Coward · · Score: 3, Insightful

    But as large as harddrives are getting, the demand for backup will still be larger. I don't see this as taking over tape any time soon. People have been talking about how big harddrives are getting and about the demise of tape for a long time.

    Just remember, if you can build something like this for backup, you can also build something like this for regular storage... and then what will you do if you need to back it up? Especially if you need to have a 6 month rotating backup...

    I'm afraid it will be back to tape then...

    1. Re:This would work for limited installations by operagost · · Score: 2, Funny
      Everyone, read the parent post in "Dr. Evil's" voice for a chuckle:

      ----
      What we really need are really big ... "LA-ZERS".

      With a sufficiently powerful ... "LA-ZER", you could encode the data and beam it off to some distant object- like my Evil Base on the MOON.

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
  6. Surprised it didn't happen sooner by Dragonfly · · Score: 4, Informative

    I know of a lot of people (myself included) who use multiple external hard drives in rotation for their backups. Especially now with servers' hard drive capacities growing so fast. I just specc'd out a fileserver for a department at a cash-strapped public institution, and a tape drive big enough to backup the system's disk would have been more than 50% of the cost of the computer. Not to mention the cost of tapes. Instead I set them up with two firewire hard drives. For their needs, the reliability/longevity/cost equation made hard drives the best solution.

  7. Offsite? by da'+WINS+pimp · · Score: 5, Insightful

    One thing about tape systems that I didn't see mentioned was the portability of the media. Data recovery is still impossible if your backup burns up along with your server. I don't see anyone rolling one of these out to the offsite storage.

    Maybe you could do it with a big pipe between your backup location and your servers. But I bet that would cost a bundle in bandwidth.

    Also did anyone notice that typo on UPS (maybe they were on drugs USP)! It took me a good minute to catch it.

    --

    "I'm just here to regulate funkyness." - James Gandolfini, as Winston in The Mexican
    1. Re:Offsite? by jandrese · · Score: 2, Insightful

      That depends how "offsite" you want it to be. A fiber run across campus (say about a mile away) isn't too bad and can easily be Gigabit speeds.

      I've been using HDDs for backup for awhile now. Tapes were just way too much hassle, too expensive, and too fragile for my daily backups. I don't have protection against fire, but the whole setup can backup 650GB (usable) of data, survive disk failure, and cost me $1500, and I built this a year and a half ago with 80GB drives. My nightly backups are fully automated, and I never have to worry about swapping out tapes or having one streach on me, and it was far cheaper than the equivelent tape based system.

      --

      I read the internet for the articles.
    2. Re:Offsite? by kiwimate · · Score: 5, Interesting

      Well, they kind of flitted over it with one sentence:

      There's one aspect in which Dr. Koch's backup system can't keep up with tape solutions: storing the backup medium in another location after the backup has been completed.

      The article didn't address what to do in this case. Instead, they continued:

      As long as this isn't necessary, Dr. Koch's backup system offers some rather unique advantages.

      Given that it's hardware-focussed, maybe one can understand this omission, but here in the real world it's still important. So, yes, what does one do if one does need offsite storage? Realistically, I think your suggestion of a big pipe is about the only way. It's hardly feasible to hotswap loads of drives for your offsite storage every morning. (Yes, I know they're using IDE, but think Promise controllers.)

      The question then becomes a comparison of the cost of providing for offsite storage in this manner versus the saved cost of replacing your tape library with associated robots, etc.

      However, the article also discusses (very briefly) associated costs for specialized backup administrators, delays inherent in recovering from tape backups, etc., so they're not totally unaware of the real-world issue. I suspect they may have chosen to ignore this specific issue because (i) it wasn't an issue in this case study, and (ii) examining it would've been a touch difficult.

    3. Re:Offsite? by afidel · · Score: 2, Interesting

      Good point, he does talk about 100 nodes, why not have them on seperate ends of campus or even across town. Using longhaul fibre adapters they could go up 16 miles I believe without a repeater. So just devide the nodes into two groups and mirror the data to both sites, still be cheaper than tape. Sure it wouldn't work for a multinational corporation (for instance the telephone and transmitters in NY were often mirrored by being in each of the twin towers, this is now seen as being "not a good idea") but anything that takes out both ends of campus or two ends of town is probably so big that the universities last concerns will be the backup data.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  8. Help me understand... by sphealey · · Score: 4, Insightful
    There's one aspect in which Dr. Koch's backup system can't keep up with tape solutions: storing the backup medium in another location after the backup has been completed.

    As long as this isn't necessary, Dr. Koch's backup system offers some rather unique advantages.

    Out of curiosity, what is the use of a backup that is not periodically rotated off-site? And by "off-site" I mean at least 50 km away? What happens when a tornado takes out the building holding the critical data AND the building holding that nice array of IDE drives 2 minutes apart?

    sPh

    1. Re:Help me understand... by nick-less · · Score: 2, Redundant

      What happens when a tornado takes out the building holding the critical data AND the building holding that nice array of IDE drives 2 minutes apart?

      torados are quite rare in Tübingen ;-)

      however, a backup system like this protects you from (accidental) deletion of files and hardware failures. thats enough for many people..

    2. Re:Help me understand... by Anonymous Coward · · Score: 2, Insightful

      The use? You've got to be kidding.

      Of all the needs for backup, much is due to equipment failure. Not natural disasters.

      Some of my backup needs, and probably the major issue at universities, is due to the stray loss or erroneous deletion of files. Not a tornado.

      The closest thing to a natural disaster people will run into are fires. And another building is quite sufficient.

      Furthermore, to transport media, even tape, 50 km away, you've got have that infrastructure or, in using public infrastructure such as USPS, UPS, FedEx, and other delivery services, you then have to worry about security and that means crypto. For some, their files are not worth the added hassle of delivery and receipt of media, repackaging, the cost, and the extra steps to physically and computationally secure their data.

      You also have to consider purpose. For my computer backups, I keep all my backups on multiple media types (CDR and hard drives) in 2 firesafes (and water sealed) located in 2 areas of the building. The lower one is in the basement with cement blocks around it. In order to take that out, I'd be dead as well. There's no personal backup system for that (maybe in 50 years), and the data is of interest only really to myself, so it's survival is contigent and useless if I'm not around.

    3. Re:Help me understand... by egomaniac · · Score: 2, Insightful

      Out of curiosity, what is the use of a backup that is not periodically rotated off-site? And by "off-site" I mean at least 50 km away? What happens when a tornado takes out the building holding the critical data AND the building holding that nice array of IDE drives 2 minutes apart?

      Then you're probably out of business anyway, so what does it matter at that point?

      --
      ZFS: because love is never having to say fsck
    4. Re:Help me understand... by sphealey · · Score: 2, Interesting
      Then you're probably out of business anyway, so what does it matter at that point?
      I'll reply to this message, but cover several similar points.

      First, I should note that I do consider sanity checks and cost/benefit analysis when making backup/recovery plans. So I agree with many of these comments. BUT...

      (1) Disasters happen more often than people expect. And they can happen to you, not just the other guy. Wildpackets almost went out of business as a result of underestimating that.

      (2) Being out of business anyway - well - that's a discussion I had with the owner of one small company. I pointed out to him that one of his core values was loyalty to his employees. In the event of a big disaster, he and his family would collect the insurance check and sell the site, but his (former) employees' mortgage payments would continue. He got the point and agreed to improve disaster recovery plans.

      (3) "Both of our sites will never get hit at the same time". I had a friend in charge of DR for a large company who analyzed 10 years of data center disasters and came to the same conclusion. He put the backup right down the street from the primary. Ever hear of the Great Chicago Flood? Luckily for my friend he was working at another company when both his primary and backup were taken out by that event!

      That's my 0.02 anyway.

      sPh

  9. Tape technology not keeping pace... by jafo · · Score: 4, Informative

    The unfortunate thing is that tape technology just hasn't kept pace with disc technology. Back in my first job, we were backing up $1,000 20MB drives onto $40 200MB tapes. If that held true, today we would have $4 tapes that would hold around a terrabyte of data...

    But, we now have $100 tapes that hold as much data as a $100 hard drive.

    We switched over to hard drives for our backups at our (modest) server facility. Late last year we spent $2000 on a system with 600GB of RAID-5 protected storage. That holds current and historic backups, for around 6 months with our current load. We then weekly dump the current data-set off to a removable 120GB hard drive, which we take off-site.

    Tapes are SO dead...

    It works great.

    Sean

    1. Re:Tape technology not keeping pace... by sirwired · · Score: 4, Insightful

      No, you have $150 tapes that hold 1 TB of IT data. They can be written to at 60MB/s. Tape is compact, requrires no power, it is light, transportable and sturdy. The only major drawback as a backup method is the cost of the drives. (Which gets paid off quickly.)

      To backup a storage pool with under a couple of TB of storage, tape is indeed stupid. If what you need is truly massive amounts of storage that does not need to be accessed instantaneously, tape cannot be beat.

    2. Re:Tape technology not keeping pace... by override11 · · Score: 2, Interesting

      Low end dell server: about 500 bucks
      Escelade 4 port IDE RAID card: around 200 bucks
      200 Gig Drives X 5: say 249 each, = 1245

      Total Cost: 1945

      And thats 1000 GB of un-raided space, so will end up being more than 600 GB raid 5. :)

      --
      No I didnt spell check this post...
    3. Re:Tape technology not keeping pace... by JediTrainer · · Score: 2, Informative

      RAID-5 only requires 3 disks, although you can use more if you want better data protection.

      I get the feeling that you understand how RAID-5 works, but your statement is misleading.

      With RAID-5 it stores parity data across the array for each piece of information stored. So to store data that would fill N drives, you need N+1 drives for the array (1 drive extra for the parity info).

      Adding drives won't protect your data any more (although hot standbys are nice to have). RAID-5 fails if I lose two drives at once. On a 3-drive array, that would mean I'd have to lose 66% of my drives at once to lose my data. On a 10-drive array I'd have to lose only 20% of my drives to lose my data. Having the hot standby drive would be great, because the chances of two drives failing simultaneously are usually low, so hopefully the hot standby would be synched before another drive goes.

      A better solution would be to use striping and mirroring together, for maximum redundancy. Costs more, but a lot safer.

      --

      You can accomplish anything you set your mind to. The impossible just takes a little longer.
  10. Tape will be with us for a while yet... by sirwired · · Score: 5, Interesting

    Right now, Sony is shipping Super-AIT tapes. The cartridges are about 3/8 of an inch thick, and each holds 500GB, before compression (which is integrated in the drive hardware). The drive can read or write at 30MB/s, before compression. With typical IT compression of 2:1, you get just under 60MB/s. The cartridge goes for about $150. Just try and get a terabyte of disk for that much. No, the drives aren't cheap, but they get paid off quickly.

    Yes, disk is good if you need instant access to your backup, and for small installations of under a couple of TB, using disk backups make sense, but for larger data pools, tape is far more economical.

    Also, as mentioned in the article, disk is terrible if you need off-site backups. In addition, a tape library consumes far less power, takes up less space, and produces less heat than a drive array of the same capacity.

    Basically, the death of tape has been predicted for years, but it hasn't happened yet.

    1. Re:Tape will be with us for a while yet... by sirinek · · Score: 3, Insightful

      Right on. :) Most ./ers dont know anything about enterprise systems. Thats why you see them rail against commercial unices, because they only know Windows/Mac/Linux/*BSD. This carries over into tape backup strategy. They dont know anything about high-end tape technology, so you will see them suggest things like using large IDE harddrives because it sounds so simple on the surface. To do backups to disk right (and then to tape, because you really should) you need a real SAN though.

    2. Re:Tape will be with us for a while yet... by ziegast · · Score: 2, Insightful

      The advantage of backup to nearline disk is the near-instant access times for restores. You don't have to wait for a tape to load, and the read speed can be 50 megabytes/sec or higher if you use striping (RAID0,0+1,1+0,3,5) with multiple disks.

      On the down side, you need to keep spinning a disk in a RAID environment to make sure the data is still good. Drives with one-year warranties aren't designed to sit on a shelf for 5 years and be powered back on. When drives fail, the RAID takes over and rebuilds a spare. You then take out the bad drive and replace it. To protect data, you need to keeps the disks spinning, and that consumes power. With lots of drives, it's lots of power.

      One vendor has a hybrid solution that has disks both online and offline emulating a tape library. When disks aren't in use, they spin down. You get the best of both worlds - fast access time and storage that doesn't require power all of the time. It's great for nearline restores, but isn't designed (pricewise) for long-term storage.

      In an enterprise world, I see people use SCSI- or FCAL-based SAN/NAS storage with nearline recovery data on IDE farms and long-term archive storage on tape libraries. The software to manage the data can be complicated and/or expensive.

      In a budget world, I see people use IDE storage for both active and nearline and archive storage. The only difference between the storage is that the disks on the nearline or archive storage are larger and are used less frequently.

      If you have data that gets read frequently after it is backed up or which requires fast recovery times, use nearline disk. If you have data that needs to be archived without any immediate requirements to read it in the near future, use archive tape.

      -ez

  11. ack! by mhatle · · Score: 4, Insightful

    I looked at doing something similar (but on a smaller scale) for my home.. but the amount of power that a hard drive based storage system takes is amazing. In additional IDE hard drives arn't know for their reliability.. :P (I've had numerous IDE raids fail spectacularly to the point I won't do that again...)

    I ended up going on ebay and getting a StorageTek 9714 "Media Library" with 2 DLT 4000 drives in it. It takes a maximum of 2A of power.. (I've measured it much lower then that when the tape drives arn't in use..) This sucker will store up to 2.4 TB ( 1.2 TB uncompressed) in the 60 available tape slots..

    The electricity saves more then makes up for the cost of the tapes.. (Also I expect the tapes to last approx 5-10 years.. I wouldn't expect that with the hard drives.)

    --Mark

  12. Why not Quantum DX-30 by harmless_mammal · · Score: 5, Informative

    Instead of building a giant kluge, why didn't they buy a few Quantum DX-30s? Each one only takes up 4U, holds 20 drives, and the internal software emulates a tape library so it easily integrates with enterprise backup software from Legato or Veritas. If your environment requires off-site storage, you could attach a tape library to clone the backups and then store the tapes off-site.

    1. Re:Why not Quantum DX-30 by afidel · · Score: 2, Insightful

      Because I would be willing to bet that Quantum will charge a hell of a lot more per TB then this system cost (69TB for less than half a million is a bargain).

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  13. Optical isn't necessarily immune to EMP by SecGreen · · Score: 2, Interesting

    Haven't you ever put a CDR in a microwave? Pretty lights! (I take no responsibility for any damage to your microwave...)

    --
    Dupe posts are /.'s tacit protest on the rights of users to time-shift content...
    1. Re:Optical isn't necessarily immune to EMP by Zathrus · · Score: 4, Funny

      Uh, if you have enough voltage differential to be creating microwave level effects in free space then data backup has ceased to be a concern.

      In all likelihood the entire human race has also ceased to be a concern.

  14. I wouldn't want to support it... by Saint+Mitchell · · Score: 2, Insightful

    There is no way I would want to support that monster. I didn't see any mention of what happens when a drive fails. It's cake with most any SCSI Raid controllers. Look for the orange light, change the disk. Even promise makes IDE enclosures that do the same. With this system, do you have to take down the node when a drive fails? Sure it's a ton of space, but I'd give up some of the space for some easier administration. It only costs $70 per promise enclosure. That'll add about $12,000. So what. when you've spent $450,000 what's the big deal.

    1. Re:I wouldn't want to support it... by PerlGuru · · Score: 3, Informative

      It mentioned it several times spread pretty evenly through the article. The 3Ware controllers switch in the hot backup and that specific drive is replaced. It doesn't directly say but it sounds like the defective drive could be hot-swapped, perhaps a function of the controller? In addition from the rest of the article it would be no problem to take down a single node for the short few minutes to switch out the drive.

    2. Re:I wouldn't want to support it... by afidel · · Score: 2, Informative

      3Ware controllers support hotswap and hotspare, so on failure the data is recreated to the spare so need to rush in replacing the drive (in other words get around to it whenever you have time that week) and then you just unplug the drive and plop in a new one. Plus they have lost a total of 3 drives in the first year, I had to change out tapes every 2 weeks, this is a lot less work =)

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  15. Bad idea. by Znonymous+Coward · · Score: 3, Funny

    A hard drive is sensitive to vibrations and has too many moving parts. The only reliable backup media is punch cards. Just don't store them near liquids.

    --

    Karma: The shiznight, mostly because I am the Drizzle.

    1. Re:Bad idea. by HermanZA · · Score: 2, Funny

      What do you do when the punch card battery goes flat? ;-)

    2. Re:Bad idea. by sharkey · · Score: 2, Funny
      Just don't store them near liquids.

      Put a large fireplace in your card storage room. That should dry up liquids before they can cause any significant harm, right?

      --

      --
      "Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
    3. Re:Bad idea. by mfrank · · Score: 3, Funny

      And as long as you don't have your datacenter in Florida :)

  16. tape's still bigger by raduga · · Score: 2, Funny
    Let's see...

    576 x 160GB
    = 92 terabytes
    = .092 petabytes
    = .00009 exabytes
    So... my vintage EXB-8200 beats your puny RAID by a factor of more than ten thousand.
    --
    First, nothing begins if not opening
  17. Off Site Backup +/- by 4of12 · · Score: 3, Insightful

    Offsite backups, whether tape or disk, present some pros and cons.

    Pro: offsite is safer from local disaster effects.

    Con: data restoration takes longer from further away.

    Pro: high bandwidth connection makes moving data quick enough.

    Con: high bandwidth connections are expensive

    Con: high bandwidth connections are susceptible to disaster induced interruption

    Overall, though, I like the random access provided by disk drives over linear searches of tapes. In case the network connection is broken to the backup site, you can easily load a couple of terabytes on cheap IDE drives into the back of your station wagon and bring them to any site you like and the effective BW will still be pretty darn good.

    If you drive your station wagon across the continental U.S loaded with 3 TB of IDE drives in 3 days then you will be running faster than T1.

    safer away from local disaster access time is high when locals need restoration big net pipe to far away but disaster that kills the network pipe ? maybe hard drives can be couriered back.
    --
    "Provided by the management for your protection."
  18. Let's see... by stienman · · Score: 2, Insightful

    576 Hard drives.

    Assume 5 years MTBF.

    That end up being 100 Hard drive failures per year, about $10,000/yr, not counting labor.

    Or 2 per week. ($200/wk), if efficient to replace then add another $100/wk for ordering, shipping, storage, replacement and disposal.

    That's assuming good cooling and low usage (equivilant to an intermittant home user - which is what I expect a good backup system to get used to)

    So, ignoring the cost of the initial investment, they'll be paying up to $15,000 per year to maintain this backup solution.

    This is more expensive than many traditional backup methods, such as tape.

    However there were a few 'gimmes'. Firstly, the array only has to last 5 years. Secondly they are using 5400rpm hard drives - much cooler. Thirdly, these hard drives have a 3 year warranty, which is better than most places will give you now.

    So it's likely that the maintenance cost, in this case, is going to be low compared to the initial investment.

    The real problem, then, is the tendancy to keep an old system long past its prime and original intent. Someone in the future will say, "Instead of junking the system and upgrading to new technology, let's just throw larger hard drives in there each time one fails and up the capacity. Eventually it will cost $10k or more per year, and they won't know it.

    -Adam

  19. Tape is Dead. Long Live Tape! by BTM1001 · · Score: 2, Interesting

    Just a disclaimer to start things off - I am in the tape library business, so take what I say with a grain of salt. OTOH, I am a technical person, so it isn't going to be a polished marketing twist either.

    The article mentions one major drawback, the inability to do offsite storage. You could work something out with offsite mirroring, but bandwidth costs at 70TB would get excessive. Not to mention needing the same hardware setup on the other end.

    The other major advantage that tape has over disk is the archive ability. Once you write a tape, that data is static. I can have it sit in a slot in the library for a long time. Since this system is only designed for 5 years, archive is not a big deal, but a lot of industries it is huge. The ability to alter data on a disk drive seamlessly is a lot easier than to do on a tape.

    The person who mentioned the shock/vibe values for a disk drive VS a tape cartridge: #1 I have dropped PLENTY of cartridges, and have only has one chip a corner. That chip did not affect my ability to use the tape further. Additionally, if the housing is destroyed, the process to spool off the tape, and splice it onto a different tape is not that difficult. I would not loose the data permanently. If there is a major mechanical failure inside a disk drive, getting the data off the platters is a lot harder. .53 failure rate is good (I'm not sure what the published rates for new tape drive technology is) but the rate 5 years down the line is going to be much higher in my opinion.

    I would be interested in seeing numbers for throughput of the system, power consumption, backup window lengths, average restore time. Some of these might stack up favorably to tape, others might not.

    The comment on moving to optical as a backup medium - maybe someday, but for now the space needed/time to backup to optical does nto compare well with tape. A DVD of 4.5 GB VS a tape of 100GB (Currently available, yes I know blue lasers will improve that)

    As for a robot failure, worst-case scenario, you put the tape in the drive manually. Realistically, at least at our company, we have solved this problem for our customers by providing the ability to easily replace components. This can happen either with a field engineer, or even the customer themselves. Generally all you need is a Phillips screwdriver, 20 minutes max, and the ability to follow instructions.

    Again, I'm not in the sales department, so I can't quote costs, but a 435K total cost for 70TB is not that cheap. With tape systems, a lot of the cost depends on how fast the backups need to occur in. I could build out a 70 TB system with 1 drive, a SCSI connection and a huge wall of tapes relatively cheaply. As you add more drives, use fibre or gigabit Ethernet interfaces, etc costs go up, but access times go down. Cost can also be brought down by not going with the 500 lb gorilla of the field - StorageTek.

    Yes disk is growing, but generally it does not replace tape, it only pushes it back a layer. This won't change for a while.

  20. Optical tape? by gweihir · · Score: 2, Informative

    For some years there have been rumours of optical tapes with capacities in the several hundreds of GB or even several TB per cartridge, but no products that I am aware of so far.

    Still I think that this misbalance between tape prices and HDD prices cannot last.

    --
    Most ACs are not even worth the keystrokes to insult them. Be generically insulted and ignored otherwise.
  21. Cost effective? by the+kfc+avenger · · Score: 2, Informative

    I can appreciate the appeal of building a massive system from commodity hardware, but it states that the entire system was $435,000. After some rough calculations, a smilar system using apple xserve-RAIDs would run around $300,000, or $135k less before host computer costs, and would most likely be much easier to maintain. Plus, five racks of xserves would look pretty bitchin' :P