Slashdot Mirror


SCSI vs. SATA In a File Server?

turboflux asks: "I'm currently in the process of replacing an aging file server with something more robust. Company-wide, there will be about 100 people who could be using this server, but I don't imagine there being more than 50 concurrent users. Right now, I'm torn between spending alot on SCSI hardware, much like our other servers, or spending less, but getting more space, with SATA II drives. Whatever I decide, the server will be setup with a RAID 1+0 array for the numerous benefits it offers. Does Slashdot have opinions or suggestions on performance, reliability, and stability?"

303 comments

  1. Have you considered...? by Anonymous Coward · · Score: 5, Funny

    Have you ever thought about the benefits of RLL?

    1. Re:Have you considered...? by Nugget · · Score: 4, Funny

      I'm holding out for ESDI. All I need is the EISA configuration floppy for this controller card and I'm good to go!

    2. Re:Have you considered...? by techno-vampire · · Score: 1

      Personally, I've always preferred MFM.

      --
      Good, inexpensive web hosting
    3. Re:Have you considered...? by click2005 · · Score: 1

      Whats wrong with punch cards?

      --
      I am a free slashdotter. I will not be modded, blogged, DRM'd, patented, podcasted or RFID'd. My life is my own.
    4. Re:Have you considered...? by luvirini · · Score: 1

      Dropped decks are a real pain... as are dropped harddrives... go for C-Casette.

    5. Re:Have you considered...? by the+eric+conspiracy · · Score: 1

      That's why paper table is better than punch cards for reliability. Not as noisy either.

      You get round chads too.

    6. Re:Have you considered...? by SpaceLifeForm · · Score: 1

      And don't forget (this is important for your mood),
      you get to roll your own.

      --
      You are being MICROattacked, from various angles, in a SOFT manner.
    7. Re:Have you considered...? by Wonko+the+Sane · · Score: 1

      I love the beeping sounds the old seagate MFM drives made. 40 Mb in a 5.25" form factor

    8. Re:Have you considered...? by rakslice · · Score: 1

      Too reliable. The mean time between fires is too low (even after we redirected the halon system to the beer fridge.

    9. Re:Have you considered...? by Richard+Steiner · · Score: 1

      80-column cards with rectangular chads are the fault of IBM -- UNIVAC had 90-column punch cards with round chads. :-)

      --
      Mainframe/UNIX Bit Twiddler and long time Windows/Linux Hobbyist.
      The Theorem Theorem: If If, Then Then.
    10. Re:Have you considered...? by thebestanalyst · · Score: 1

      get a NAS device man. no one make cars out of parts no more.

    11. Re:Have you considered...? by Mattcelt · · Score: 1

      Hmm, I have a spare I'm not using around here somewhere. Tell you what, I'll give you a deal on it... say, $750.00?

    12. Re:Have you considered...? by ivan256 · · Score: 1

      RLL drives used MFM (multi-frequency modulation).

      Floppy drives still do.

    13. Re:Have you considered...? by ivan256 · · Score: 1

      You get round chads too.

      Any geezer geek (or young geek with a jargon file) should know that 'chad' is already plural. 'Chads' is a creation of the US media during the 2000 elections. Apparently american journalists don't know how to do research.

    14. Re:Have you considered...? by Anonymous Coward · · Score: 0

      "That's why paper table is better than punch cards"

      Paper table? Is that to go with your paper chairs?

  2. SCSI?? by ZachPruckowski · · Score: 3, Funny

    I don't mean this as a troll, but I was under the impression that USB 1.0 replaced SCSI? Or was that desktop only, and it still has server uses? I mean, I thought USB killed SCSI? Or am I thinking of something different?

    1. Re:SCSI?? by __aaclcg7560 · · Score: 1

      No, you're thinking Firewire. :P

    2. Re:SCSI?? by sr180 · · Score: 1

      That was for external devices like scanners.

      --
      In Soviet Russia the insensitive clod is YOU!
    3. Re:SCSI?? by digitallife · · Score: 1

      What the heck are you talking about? SCSI based drives are the fastest, most robust drives on the market. Put a couple 15k SCSI drives in RAID0 and you will have the fastest damn drive access times possible in todays market. Very sweet, if you can afford it.

      I have yet to see 15k USB drives... or RAID USB drives... or high RAM and processing power USB boards. I don't quite see how they compare.

    4. Re:SCSI?? by goarilla · · Score: 0

      hahahahahaha
      that really cracks me up, LOL

    5. Re:SCSI?? by Anonymous Coward · · Score: 3, Informative

      You people are really showing your (inadvanced) age! Back in the good old days, many external peripherals (such as scanners) were connected to your machine via a SCSI bus. Don't forget what SCSI stands for - Small Computer (Serial|Standard) Interface. I believe our friend here was referring to peripherals, and in that case he's right - SCSI was replaced by USB. As the rest of you seem to have been born in the 80s, you probably thought he was referring to SCSI hard disks - the most common use of SCSI these days. For this purpose, SCSI (especially its modern incarnation) is vastly superior to USB. So the answer is yes and no. :-)

      - Jeremy
      madscience AT mac DOT com

    6. Re:SCSI?? by Anonymous Coward · · Score: 4, Funny

      Well, here's the problem. FireWire, and in fact the entire Apple experience, is intuitive for a certain kind of person. Artists, fashion mavens, scientists, and other creative personalities can sit down with a 6-to-4 pin IEEE-1394-compatible digital video downlink cable and comprehend its sensitive, tasteful aesthetic. It's a rare instinct, this appreciation for beauty and truth; unimaginative, dogma-bound drones haven't a prayer.

      In summary, unattractive squares should stick to Linux and Windows.
      FireWire is for different thinkers.

    7. Re:SCSI?? by danbeck · · Score: 1

      Bravo sir! Bravo.

    8. Re:SCSI?? by __aaclcg7560 · · Score: 1

      Yes, I heard all that before. I'm getting my Mac Mini this week. Hmmm... Does the Mac Mini come with any nubile girls?

    9. Re:SCSI?? by morcego · · Score: 1

      Maybe you mean Fiber Channel ?

      --
      morcego
    10. Re:SCSI?? by Anonymous Coward · · Score: 0

      the second s stands for system. (also you should check with your doctor about your brain.... scsi absolutely is not serial, duh.)

    11. Re:SCSI?? by Kymermosst · · Score: 2, Funny

      You people are really showing your (inadvanced) age!

      And you are showing your senility:

      Don't forget what SCSI stands for - Small Computer (Serial|Standard) Interface.

      Heh. Wrong.

      Small Computer System Interface.

      --
      "Alcohol, Tobacco, Firearms, and Explosives" should be a convenience store, not a government agency.
    12. Re:SCSI?? by MikeFM · · Score: 3, Funny

      Yes, but they're four inches tall and made of white plastic. Enjoy.

      --
      At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
    13. Re:SCSI?? by Anonymous Coward · · Score: 0

      That girl in the first picture has some nice stockings.

    14. Re:SCSI?? by Lisandro · · Score: 1

      You, sir, just made my day. Standing ovation! :)

    15. Re:SCSI?? by HaMMeReD3 · · Score: 1

      ***Warning off topic generalization***
      Mac users are FUCKED

      That is all.......

    16. Re:SCSI?? by Kymermosst · · Score: 2, Informative

      scsi absolutely is not serial, duh

      While he did screw up the second 'S' in SCSI, you cannnot seriously expect anyone who knows anything about the evolution of SCSI to take you seriously after you stated the above.

      I will prove your statement false with a single counterexample: Serial Attached SCSI (PDF). Note the date of the document.

      Remeber that with SCSI-3, the standard became more modularized in order to do things like separate the SCSI command set and the SCSI physical interface.

      Here's the SAS FAQ from the SCSI trade association.

      --
      "Alcohol, Tobacco, Firearms, and Explosives" should be a convenience store, not a government agency.
    17. Re:SCSI?? by sumdumass · · Score: 1

      Lets not forget that SCSI wasn't replaced because USB was better, It is because USB is cheaper to manufacture then SCSI.

      I have a scanner that conects with a SCSI wide (or 3). It is the fastest scaner I have seen on a computer. It apears to process pages faster then the copy machine/kinoca printer we got a few years ago. I forget the actualt pages per minuute specs on it but I think it can feed 20+ page and have them displayed in a little less then a minute. Evedently, there are faster ones too. We just got a deal on this.

      As for the topic, Yea stick with SCSI if it is a high demand situation. SATA while having impresive specs, are at a complete duty cycle. The cannot sustain those speed for a long period of time (yet still better then IDE i believe).

    18. Re:SCSI?? by HeroreV · · Score: 1

      Links are great and all, but do you think you could tone it down a bit?

    19. Re:SCSI?? by Anonymous Coward · · Score: 0

      So, the story is that Apple has to resort to advertising pics of 'artsy' people instead of having strong technology in order to sell their computers. The only way they think "different" is that they all believe that form is more important than function.

    20. Re:SCSI?? by Anonymous Coward · · Score: 0

      What the fuck did I just look at?

    21. Re:SCSI?? by ameoba · · Score: 1

      ATTACK OF THE MAC HIPSTERS

      --
      my sig's at the bottom of the page.
    22. Re:SCSI?? by lord+aDam · · Score: 1

      SCSI cards also have internal ports that are used for hard drives. I used to think that SCSI cards only had external ports as well, until I had to start using them on the outdated machines where I work.

  3. SATA is fine by Bombcar · · Score: 5, Insightful

    SATA is fast and cheap; just make sure you spend a little bit more to get the "nearline" storage drives and not just desktop drives. Put them behind a 3ware 9550 and you'll fly.

    1. Re:SATA is fine by turboflux · · Score: 1

      I was looking at the Seagate NL35 drives and an Adaptec 2820SA SATA 2 controller this afternoon. I've used quite a few Adaptec cards and never had a problem with them.

    2. Re:SATA is fine by Bios_Hakr · · Score: 4, Interesting

      Some more tips for you:

      First, make sure you get a SATA2 controller. NCQ is a must for multiuser environments.

      Second, whatever controller you buy, grab 3 of them. RAID is great for disk failure, but people rarely think about what they'll do when the controller fails.

      Look at some of the stranger RAID options. If you just use RAID5, you'll be selling yourself short. RAID3 is worth a look. I'd actually suggest you put two controllers in a machine. Run RAID0 on 4 drives on a single controller. Run RAID0 on 4 drives on the other controller. Then use Windows or Linux software RAID to run RAID1 between the two RAID0 drives. Very fast performance and fully fault tollerant.

      Keep the OS on a small, slow hard drive seperate from the array. You can do funny things there, but I'd suggest you set it up properly and then use a disk clone utility to create an offline backup to store somewhere.

      Arrange for testing in the first few months. Unplug drives from the array and see what happens. Verify that you can restore from tape backup to the array. Veryfy that your cloned OS hard drive can actually get the array online. Extensive testing before you go live. Two tests in the first month after going live. One test every 6 months after that.

      If you are using commodity servers, get a spare for everything in there.

      If you want high avalibility, look into DRBD. It's like RAID1 over a network.

      Monitor the damn thing! My last job someone let the server die. It had RAID5 over 5 drives. One drive had failed and no one noticed. When the second failed, that was the end of it. Learn to use SNMP or get some good monitoring utilities that will notify you of problems. You need to know if a drive fails, if it reports SMART errors, drive temp, proc temp and usage, NIC utilization, drive utilization, system temp, and memory utilization. MRTG+RRD Tool and SNMP will give you pretty charts for all that crap. MotherboardMonitor will also give some nice readouts for Windows. If it's commodity server, look at installing a Crystalfontz LCD so that you can walk by and get a quick status without the need to login.

      --
      I'd rather you do it wrong, than for me to have to do it at all.
    3. Re:SATA is fine by Dante · · Score: 2, Interesting

      Having replaced sixteen drives and lost two raid 5s with hot-spares no less, in the last eighteen months. I can't recommend SATA for anything that is mission critical.

      --
      "think of it as evolution in action"
    4. Re:SATA is fine by schnurble · · Score: 4, Informative

      Look at some of the stranger RAID options. If you just use RAID5, you'll be selling yourself short. RAID3 is worth a look. I'd actually suggest you put two controllers in a machine. Run RAID0 on 4 drives on a single controller. Run RAID0 on 4 drives on the other controller. Then use Windows or Linux software RAID to run RAID1 between the two RAID0 drives. Very fast performance and fully fault tollerant.

      Uhh. Yes. Then you can lose one disk in each side, and you have lost all your data.

      This would perhaps be slightly less than fully fault tolerant.

      Perhaps you meant to set up 4 mirror pairs, 2 on each controller, and use software to RAID0 them together.

      I have successfully done this with a 24 disk 5U chassis, and it is an IO steamroller (our database server, right now).

      --
      "To err is human, to forgive is simply not my policy." --root
    5. Re:SATA is fine by Velox_SwiftFox · · Score: 1

      Mod parent up, real RAID 10 is a Sysadmin's friend.

    6. Re:SATA is fine by SpaceLifeForm · · Score: 1

      Close to Tandem Guardian.

      --
      You are being MICROattacked, from various angles, in a SOFT manner.
    7. Re:SATA is fine by Bios_Hakr · · Score: 4, Informative

      The chances of losing two disks at once are slim. RAID 0+1 will provide great performance and good fault tolerance if you react to problems as they happen.

      But I guess it depends on what your users need. If they need raw throughput, RAID 0+1 is better. If they need low latency, then RAID 10 may be the answer. Or maybe both systems would fall within the margin of error of each other.

      In any event, once you get into what-if situations, no RAID will be good enough. What if you lose a disk? What about two? Five? Well, what if lightning hits the chasis or the janitor unplugs it to buff the floor?

      The best you can do is roll the dice and play the odds. You'll see that I told him to use RAID 0+1. I also told him to use good monitoring setups to mitigate problems. I also suggested a tape backup. Actually, maybe I didn't, but I did tell him to verify his backups work and that he is able to restore from them, so that's kind of the same thing.

      When it gets down to it, oppinions are like assholes; everyone has one. And most people only care about their own and don't really want to look at their coworkers'. I guess I'm the same in that respect.

      --
      I'd rather you do it wrong, than for me to have to do it at all.
    8. Re:SATA is fine by penguinboy · · Score: 1

      Even the "enterprise" SATA drives are crappy. Out of more than 200 hard drives, I've replaced four times as many of the SATA disks vs. the SCSI disks (12 vs. 3).

    9. Re:SATA is fine by pyite · · Score: 1

      Uhh. Yes. Then you can lose one disk in each side, and you have lost all your data.

      Uhh. No. You'd need to lose two more disks to lose all your data. RAID 4 is striping with a dedicated parity drive. Losing one disk on each side means you're running in a degraded mode but still are keeping data. Losing a third disk means one array is shot but you still have your mirror. Only losing a forth drive would mean you'd lose everything. By then, I'd hope you'd have replaced some drives.

      --

      "Nature doesn't care how smart you are. You can still be wrong." - Richard Feynman

    10. Re:SATA is fine by Ponder · · Score: 1

      Maybe you meant to say RAID 4 but what you typed was RAID 0

      --
      -- Back to the shadows again...
    11. Re:SATA is fine by Anonymous Coward · · Score: 0

      Hey, poser!

      SATA-II is the name of the trade organization. NCQ is a feature which may or may not be supported on any particular controller.

    12. Re:SATA is fine by Anonymous Coward · · Score: 1, Informative

      Monitor the damn thing! My last job someone let the server die. It had RAID5 over 5 drives. One drive had failed and no one noticed. When the second failed, that was the end of it.

      That's nothing! I work for a Fortune 500 company, and we had a centralized database containing data from sites nationwide. This database was on a RAID 5 array with well over a dozen drives, including two hot spares, situated in a data center which is staffed 24x7. We lost the database.

      How? Simple -- nobody was monitoring the array. One drive fails -- no problem, rebuild it on a hot spare. Another drive fails -- rebuild it on the other hot spare. Yet another drive fails -- running in degraded mode but no data loss yet. Finally, a fourth drive fails. The entire RAID 5 volume is lost, unrecoverable. Of course, that was when the problem was first discovered! We had to rebuild the database from scratch.

      There was nothing wrong with the design of the hardware platform, but nobody had been tasked with monitoring the array and replacing failed drives!

      RAID can't protect you forever. Monitor those disk arrays!

    13. Re:SATA is fine by Bios_Hakr · · Score: 1

      An additional thing about RAID. Always reboot to rebuild your array. If at all possible, come in at like 3am on a Sunday to replace a failed drive. Shutdown and remove the failed drive. Insert a new drive and turn the power back on. Never trust the "hot swap" scam.

      Seriously, downtime isn't that hard to get. Especially if you tell them that not granting a reboot window could potentially cost them 5+ hours of unscheduled downtime.

      If you work in a place where they refuse to grant downtime, have your boss draft a memo stating that you advised him of the potential problems and he will accept all responsibility in case of an outage.

      If he won't do that, then you draft the memo and have a coworker sign statiing that the boss ordered you to hot swap agianst your wishes.

      --
      I'd rather you do it wrong, than for me to have to do it at all.
    14. Re:SATA is fine by innosent · · Score: 2, Insightful

      Slim might be correct, but it certainly CAN happen. What is important is that you realize that it is possible. Just because you have to lose 2 or more to lose data doesn't make it safe. Having 20 drives from the same lot number could mean that all 20 are affected by a manufacturing flaw that kills them at 20,000 hours. Also, there are power surges, fire, etc. Always back up (off-site), always keep spares both on and off-site, and buy drives from multiple lot numbers. Keep in mind that the MTBF for multi-drive systems is lower than a single drive (in fact, statistically it is the MTBF for a single drive divided by the number of drives). You should expect failures, and should expect more than you would a single drive. The idea is not to reduce the likelyhood of a failure (it does the opposite), but to reduce the damage caused by a failure. Keep in mind that failures can occur in the drive, controller, power supply, or any other component. Keep spares on hand, and use a high end (3ware 9500 series works excellent for SATA, Adaptec for SCSI, but not as good IMHO as 3ware for SATA) controller card.

      As for the original topic, I would go with SATA II drives, on a 3ware controller, and make sure both the drives and the controller support NCQ. Faster drives are of course better, but more drives is also better (more spindles = more platters = more read heads = faster aggregate reads), and keep in mind that RAID level is extremely important, and the right choice has a lot more to do with your application and supporting hardware than anything else. Generally, though, RAID 5 is sufficient for storage, RAID 1 for smaller application servers that you can't afford to have outages on, and RAID 10 (1+0) on databases (or really anywhere you can afford it). RAID 10 on a high-end controller allows you to read at double (or better) the speed of the RAID 0 subarrays, since the controller can read from the stripe set that is closest to the data/least busy. Also, be sure to get a controller with plenty of memory onboard, and battery backed memory if power loss could be an issue. Personally, I'd go with the 3ware 9550SX line on an Opteron-based system (eliminate the Intel-based northbridge I/O / Mem bottleneck).

      --
      --That's the point of being root, you can do anything you want, even if it's stupid.
    15. Re:SATA is fine by aiken_d · · Score: 2, Informative

      The chances of losing two disks at once are slim

      Not in my experience. I've worked with many, many systems over the years, and I'd say that about half of the time a drive in an array failed, at least one other one went with it either simultaneously or shortly thereafter.

      Sometimes the failed drive has literally melted, putting great load on the power supply and taking one or more other drives out at the same time. In arrays that stripe error-recovery information across multiple disks (RAID 5, etc), I've had the additional load be the final straw for a second drive. I've had hot spares that died the moment they were asked to actually do something.

      It may be that the chances of losing two out of three disks at once are slim. But I can tell you that the odds of losing two out of twenty disks at once are not slim at all. Either that or God just hates me. In either case (he may hate you, too), it's best to plan for multiple drive failures and at least one power supply and one SCSI bus failure happening at once.

      Cheers
      -b

      --
      If I wanted a sig I would have filled in that stupid box.
    16. Re:SATA is fine by Malor · · Score: 2, Informative

      Out of 8 disks, the chance of losing two at once is higher than you'd think, especially if he's using the cheaper SATA drives. I lost 2 drives out of a RAID5 in short succession just recently, and *just barely* managed to save the most recent data before the second drive died too.

      RAID 0+1 is much inferior to RAID10. 0+1 is what the GP poster said... stripe 4 disks in RAID-0, and mirror those. You're no more fault tolerant than a RAID5 array.. if ANY two drives fail, you're hosed. You lose 50% of the space to boot. (in RAID5, on 8 disks, you'd lose only 12.5%, though of course it's slower.)

      RAID10, on the other hand, is setting up four mirrors, and striping the mirrors. You still lose 50% of your space. However, you lose the whole array only if both 'sibling' drives in a given mirror fail. That means you have a pretty good chance of surviving a multi-drive failure. And it's very fast....just as fast as 0+1, but it's a lot more robust. Of course, both are very inefficient in terms of space lost, but drives are so cheap these days that it doesn't matter too much.

      Any good controller will do RAID10 nowadays... only the very cheapest/crappiest controllers are limited to the inferior 0+1.

    17. Re:SATA is fine by WoodstockJeff · · Score: 1
      The chances of losing two disks at once are slim. RAID 0+1 will provide great performance and good fault tolerance if you react to problems as they happen.

      Say that after the airconditioning fails in the server room on a weekend, and you discover that the alarm company set the thermal alarm to 120F, instead of 80F.

      Two drives in one server, another in a second, with two more drives failing over the next 5 weeks due to overtemp. Fortunately, they were running as RAID5 arrays with a hot spare, so the worst that should have happened was one array ran degraded. Unfortunately, the second drive failed before the spare could be fully-syncronized, leaving a non-recoverable array.

      But, that's why there were 4 mirrored servers, including one off-site!

    18. Re:SATA is fine by arivanov · · Score: 1
      Fly into a wall at 100mph to be most exact.

      3ware used to be extremely sensitive to bus noise. As a result on at least some motherboards it will lead to memory corruption and outright crashes. If mounting on a riser card ensure that the riser is grounded on all ground contact points. There were discussions of the underlying problem (initially erroneously blamed on driver bugs) all over the FreeBSD mailing lists.

      Compared to a decent modern SCSI system like one of the more recent Compaq smartraids or a recent Adaptec, 3ware with SATA is quite slow. From the ones I had to work with, it is the only hardware RAID adapter which is so slow that it triggers the silly OOM killer mistakes in Linux 2.6.10 even on RAID1. As a measure of how slow it is - reading off a 5 year old SCSI based SmartRAID on a P3 600 over 1Gbit NFS and writing out onto a quad Opteron with 3ware congests the 3ware on IO. The system goes into the twighlight zone of 12+ loadavg.

      So on, so fourth. The only advantage 3ware has over other systems are the reasonably well supported and well working drivers on both Linux and FreeBSD. If you need performance SCSI is still the answer.

      --
      Baker's Law: Misery no longer loves company. Nowadays it insists on it
      http://www.sigsegv.cx/
    19. Re:SATA is fine by Anonymous Coward · · Score: 0

      Having not replaced any drives and lost no raid 5s with hot-spares no less, in the last eighteen months. I can't recommend SATA enough for anything that is mission critical.

    20. Re:SATA is fine by HuguesT · · Score: 1

      No,

      In a RAID10 configuration like you describe, i.e two sets of 4 mirrors that are striped, you lose 75% of your space, and this is where your extra redundancy comes from.

    21. Re:SATA is fine by Qzukk · · Score: 3, Informative

      Eh, his description is funky. I think he meant 4 sets of 2 mirrored drives, that are then striped. You could do it the other way, I guess, but that IS a lot of wasted space.

      As for "extra redundancy" The difference between RAID 10 and RAID 01 is in the failure mode, not strictly in the redundancy.

      In RAID 01, the data is stored like this:
      [ABCD] - four drives striped
      [ABCD] - four drives striped ... and then mirrored. If a drive C fails, that entire mirror becomes useless since you can't mirror ABCD to ABD, making the state:
      [ABCD] - four drives striped
      [XXXX] - four drives offline ... and the next drive failure kills it, assuming that offline drives don't count. Some hardware raid systems will continue to mirror ABD, essentially converting it to RAID 10 on the fly.

      In RAID 10, the data is stored like this:
      [AA] - two drives mirrored
      [BB] - two drives mirrored
      [CC] - two drives mirrored
      [DD] - two drives mirrored ... and then striped. If a drive fails only that drive is offline...
      [AA]
      [BB]
      [CX] - one drive offline
      [DD] ... if the remaining drive C fails, then the array is lost. However, any other drive could fail without destroying the array (in fact, up to three more, if you're lucky).

      --
      If I have been able to see further than others, it is because I bought a pair of binoculars.
    22. Re:SATA is fine by Zapman · · Score: 1

      Not quite.

      Raid 5 is fantastic in read heavy environments. Mathmatically speaking it will perform nearly to the level of either RAID10 or RAID01 (only slower because the controler has 2 disks to choose from when reading RAID10... it'll use the more idle one). In write heavy environments (like most DB's, especially transactional ones) the 'write, read, read, read (...), calculate parity, write' cycle can eat you alive.

      RAID3 is a waste of time, unless you're serving HUGE files that will only be accessed sequentially (ex: full length movies). In almost all cases, RAID5 is ALWAYS superior to RAID3 (raid5 spreads the parity information across all disks, raid3 only uses one disk for it).

      The only difference in performance between RAID10 and RAID01 is restore time. Everything else comes out a tie. RAID10 takes an even number of disks (for example: 10), makes #/2 (5 in this case) mirrored pairs of disks (5 whole disk raid1 volumes) and then concatinates them together with raid0. Raid01 takes an even number of disks, makes 2 raid0 volumes, and then mirrors the large volumes.

      The reason RAID10 is better for restores is you only have to sync 1 disk worth of data to restore one failed drive. In RAID01 you have to resync the whole volume to validate it (in general).

      --
      Zapman
    23. Re:SATA is fine by Malor · · Score: 1

      Ah, cool, I hadn't thought about that... the resyncs in RAID10 are faster too... another win.

      I knew the RAID5 stuff, but that wasn't really that relevant to a RAID0+1 vs. RAID10 comparison. "Slower but you lose less space" seemed like a pretty fair compromise for one sentence. :)

      Note also that if you have a GOOD RAID5 controller, it'll work at pretty much the full speed of the spindles anyway. Controllers that good are often several grand, but they do exist. As an example, I have an old ICP Vortex 32-bit PCI controller. If I recall correctly, it benched writing about 60 megs/second, running RAID5 across six disks on two channels. The response time in normal usage is lightning-quick. It can take one hell of a beating.

      Newer cards would be faster still, though you'll generally need better than vanilla PCI to support them.

    24. Re:SATA is fine by Malor · · Score: 1

      Very good explanation. I didn't realize my original wording was confusing... thanks for the clarification.

    25. Re:SATA is fine by pyite · · Score: 1

      Maybe you meant to say RAID 4 but what you typed was RAID 0

      No. Do you know what parity is? RAID 0 has no parity. RAID 4 has parity data stored on a single disk. This is in contrast to RAID 5 where the parity information is written across all the disks.

      Educate yourself.

      --

      "Nature doesn't care how smart you are. You can still be wrong." - Richard Feynman

    26. Re:SATA is fine by Anonymous Coward · · Score: 0

      And, so long as you are monitoring the condition of your RAID arrays, your enterprise is coming out way ahead on the cost of storage by using the SATA solution instead of SCSI. (Sorry you are having to spend time replacing drives, but if you weren't doing that, you'd be compiling TPS reports. Call it a blessing...)

    27. Re:SATA is fine by ocbwilg · · Score: 4, Insightful

      The chances of losing two disks at once are slim. RAID 0+1 will provide great performance and good fault tolerance if you react to problems as they happen. But I guess it depends on what your users need. If they need raw throughput, RAID 0+1 is better. If they need low latency, then RAID 10 may be the answer. Or maybe both systems would fall within the margin of error of each other.

      He has up to 100 users and says that there will probably only be 50 or so concurrent users. Reasonable performance for such a system doesn't require lots of crazy tweaking. Implement RAID5 with a hot spare and be done with it. If you have a drive failure it automatically rebuilds and you're safe. If you have another drive failure after that before replacing the dead drive, you're still running. If you are concerned about drive performance, then spread the array across as many spindles as possible. If you have any sense you will already have a decent monitoring system in place and will know the drives have failed.

      I find myself saying this often on Slashdot, but for the average IT department it makes far more sense to buy a business line server that comes with proper support for everything that you need than to try and cobble it together yourself out of parts, and then try to keep enough spare parts around in case of failure, and try to get warranty service from 5 different parts suppliers with different warranty lengths. I mean really, who does that kind of thing?

      Go to HP, buy a Proliant server that fits your needs and price range, and use the included management software to set up email alerts when there is a hardware problem (like a drive failure or imminent drive failure). HP has the replacement part at your doorstep next day (unless you buy a warranty with faster turnaround, and next-day is still faster than you'll get from most part suppliers), and you don't have anything to worry about. I'm sure IBM and Dell can do something similar too.

      Back in the day it actually used to be cheaper to build your own computer. Not only would you save money, but you get to choose exactly the components you wanted. Nowdays the computer market has been so commoditized that it's actually much more expensive to build your own. You don't get any of the advantages of economies of scale, and the profit margins are so slim on retail models that the savings of eliminating it is negligible. And of course, now you can have your system custom built to your specs anyway. The only reason to build your own is if you want to be able to tweak and upgrade it piecemeal, like the "enthusiast" market does. That's what I do with my home PC, but I would never consider doing that with business PCs, especially a server. A server should be deployed, and after that it should pretty much sit there with zero hardware maintenance (except in the case of hardware failure).

    28. Re:SATA is fine by harrkev · · Score: 1
      Never trust the "hot swap" scam.
      Huh?

      First, a disclaimer: I am a hardware design guy. I know VHDL and circuit board design. I don't do IT.

      This is not my area, but it seems to me that if a vendor advertises "hot swap" then they should be able to live up to it without any data loss. If they can't then they are a bad bad vendor, and should be dumped for somebody more respectable. Serious storage costs a LOT of money. A vendor cannot afford a ding to their repuatation by having customers loose data. With the internet, stories of bad hardware can spread quite rapidly and cause a company to lose customers.

      So, it comes down to one question: can you trust what the vendor says? If so, go for it. If not, go to another vendor.

      Please correct me if I am wrong.
      --
      "-1 Troll" is the apparently the same as "-1 I disagree with you."
    29. Re:SATA is fine by harrkev · · Score: 1

      It seems to me that this has little to do with the interface technology, and a lot more to with the build quality of the drives. Most SATA drives sold are intended for desktops, where cost is king and it is assumed that they are not stressed 24/7. Of course those will fail rapidly in a server.

      Do they make server quality SATA drives? And if so, what is the price difference between a server quality SATA and SCSI?

      --
      "-1 Troll" is the apparently the same as "-1 I disagree with you."
    30. Re:SATA is fine by feenberg · · Score: 1

      Actually the chance of losing two disks at once is pretty high. Here is the explanation from my posting at http://www.nber.org/sys-admin/linux-nas-raid.html

      =====

      Most of the drives in our NAS boxes and drive arrays claim a MTBF of 500,000 hours. That's about 2% per year. With three drives the chance of at least one failing is a little less than 6%. (1-(1-.98)^3). Our experience is that such numbers are at least a reasonable approximation of reality.

      Suppose you have three drives in a RAID 5. If it takes 24 hours to replace and reconstruct a failed drive, one is tempted to calculate that the chance of a second drive failing before full redundancy is established is about .02/365, or about one in a hundred thousand. The total probability of a double failure seems like it should be about 6 in a million per year.

      Our double failure rate is about 5 orders of magnitude worse than that - the majority of single drive failures are followed by a second drive failure before redundancy is established. This prevents rebuilding the array with a new drive replacing the original failed drive, however you can probably recover most files if you stay in degraded mode and copy the files to a different location. It isn't that failures are correlated because drives are from the same batch, or the controller is at fault, or the environment is bad (common electrical spike or heat problem). The fault lies with the Linux md driver, which stops rebuilding parity after a drive failure at the first point it encounters a uncorrectable read error on the remaining "good" drives. Of course with two drives unavailable, there isn't an unambiguous reconstruction of the bad sector, so it might be best to go to the backups instead of continuing. At least that is the apparently the reason for the decision.

      Let's repeat the reliability calculation with our new knowledge of the situation. In our experience perhaps half of drives have at least one unreadable sector in the first year. Again assume a 6 percent chance of a single failure. The chance of at least one of the remaining two drives having a bad sector is 75% (1-(1-.5)^2). So the RAID 5 failure rate is about 4.5%/year, which is .5% MORE than the 4% failure rate one would expect from a two drive RAID 0 with the same capacity. Alternatively, if you just had two drives with a partition on each and no RAID of any kind, the chance of a failure would still be 4%/year but only half the data loss per incident, which is considerably better than the RAID 5 can even hope for under the current reconstruction policy even with the most expensive hardware.

    31. Re:SATA is fine by Anonymous Coward · · Score: 0

      Seeing as he wasn't talking about RAID 4, but RAID 0+1, that's irrelevant.

    32. Re:SATA is fine by bobcat7677 · · Score: 1

      I agree SATA is fine. I can actually say that from Experience. A company I recently worked for needed a file server and I built one with PATA IDE drives in RAID 0+1. While the 3ware controllers are ideal, we actually used a cheaper raid controller and it worked fine. There was more load on the CPU from the raid controller since it was cheap but it didn't matter because all that box did was serve files. The CPU didn't have anything better to do:) This was pretty much an identical situation to yours. About 100+ or so employees with 50-60 simultainious users at any one time. That setup had more drive failures then the other SCSI boxen we had, but as long as you don't mind swapping drives once in a while to keep it running all is good. Most of the failures we had though were on the IBM deathstar drives. We eventually replaced the entire array with some bigger WD drives and only had one drive fail after that I think.

    33. Re:SATA is fine by Anonymous Coward · · Score: 0

      I'm sure that VHDL and circuit design is a rather formal sort of affair, what with actual engineers and all that. IT as a field is plagued by rampant incompetence, so a very high percentage of practices amount to little more than voodoo. Shutting down a RAID array to swap a drive is just one example.

    34. Re:SATA is fine by Anonymous Coward · · Score: 0

      You are wrong.

      Raid10 and Raid01 provide exactly 50% of the capacity of the disks you buy, but by the laws of probability, RAID10 is safer by a good margin, and gets relatively better the more drives you use.

    35. Re:SATA is fine by nosfucious · · Score: 1

      I have had zero problems with Big Blue's hardware based RAID cards. Both in servers and SAN.

      You never need to reboot to put in a disk, nor remove it.

      I think most of you are talking about software (Operating System) solutions. What a waste of resources! Let the damn card do the parity for you.

      IBM have both SCSI and SATA based hardware raid. Trust it and them. Or trust Dell, or HP, or whoever. 100 plus people? You have the budget and the means to get a decent solution. We even put those cards in servers where there are only four full-time staff members. Cost outweighs the the risk AND downtime.

      SCSI v SATA. Well, at least IBM thinks SATA is worth supporting. SCSI is what is recommended though. Wait and see ...

      --
      Q:I was listening to a CD in Grip and it sounded horrible! What's up? A:Perhaps you are listening to country music
    36. Re:SATA is fine by ivan256 · · Score: 1

      If you want high avalibility, look into DRBD. It's like RAID1 over a network.

      DRBD is great, but it's not high availability by itself. It's just one piece. Without additional software you still have downtime if the primary end dies.

      Monitor the damn thing! My last job...

      Best advice ever.

    37. Re:SATA is fine by ckaminski · · Score: 1

      Which is why I only use RAID1.

      Although the ongoing work on the RAID6 driver is interesting to me...

    38. Re:SATA is fine by Andor666 · · Score: 1

      Our last setup was better like:

      2 units of a 15 disk chassis.
      In each chassis 14 RAID5 disks + 1 spare (so 2 disks can fail a time in each chassis)
      And tied together both chassis with software RAID0 on LINUX.

      An IO rocket, you know, and you can do something similar with less discs and a pair or three 'cheap' controllers.

      Have a nice day.

    39. Re:SATA is fine by TClevenger · · Score: 1
      If you have another drive failure after that before replacing the dead drive, you're still running.

      Don't most RAID controllers allow you to specify more than one hot-spare? If so, putting in a second or even third hot-spare might be better for a situation where the server isn't monitored 24/7 or might have to go the weekend with a failed drive.

  4. The King by FatSean · · Score: 1

    SCSI slaps SATA arround like nobody's bidness. It's not just about bus bandwidth. SATA sure beats IDE, but that isn't really saying much.

    --
    Blar.
  5. Re:What's this SCSI you speak of? by Anonymous Coward · · Score: 1, Informative
    What's this SCSI you speak of?

    You know, those are the drives offering logs of models running at 15,000 RPM. The ones with controllers which can often take large amounts of RAM. You know, drives for servers.

  6. SCSI by armanox · · Score: 2, Funny

    Clearly the answer is SCSI - go with what you know, servers are not the best computers to experiment with random equipment.

    --
    I'm starting to think GNU is the problem with "GNU/Linux" these days.
    1. Re:SCSI by humphrm · · Score: 4, Insightful

      Dude, that thinking is the difference between an SA and an Engineer. Thinking like that would have us all running MFM drives, and these newfangled "SCSI" disks would be too risky random equipment to test out on a server.

      --
      -- "In order to have power, I must be taken seriously." -Mojo Jojo
    2. Re:SCSI by Yoweigh116 · · Score: 1

      I disagree. The RAID array negates the dependability advantage of SCSI through data redundancy. If the question submitter isn't worried about bus bandwidth, I don't see a reason why SATA shouldn't be a viable alternative. I do agree that experimenting with servers isn't the greatest of ideas. However, this isn't really all that much of an experiment, and the hardware certainly won't be random. That's why he asked the question, and other posts have already made controller recommendations. The drives themselves are the same as good old ATA drives, mechanically, only with a different interface.

    3. Re:SCSI by Anonymous Coward · · Score: 1, Informative

      I personally disagree with the view that SCSI is the way to go in every scenario. You pay more for SCSI and while a fair argument can be made that you tend to get what you pay for, there's also the scenario where you end up swatting flies with a howitzer, so to speak.

      If there won't be more than 50 concurrent users and the file access is relatively lightweight(as opposed to a heavyweight oracle database for example), then some money can be saved and good results realized with a SATA solution.

      I personally have had very good results with a moderately used Filemaker server sitting on SATA drives with about 40 concurrent users at peak, and about 70 users total hitting it throughout the day. The application is for clinical trial information tracking, if that helps anyone.

      But I definetley remain firmly in the SCSI camp for heavyweight applications.

    4. Re:SCSI by NutscrapeSucks · · Score: 2, Informative

      Even back in MFM's heyday, SCSI was the standard for workstations and servers.

      --
      Whenever I hear the word 'Innovation', I reach for my pistol.
    5. Re:SCSI by A+Commentor · · Score: 1
      SCSI will also give you better I/O performance (in part because it can read and write at the same time

      WOW, that is a pretty impressive trick to do when drives (yes, even SCSI drives) have only one physical head...

      --

      Looking for any old 8-bit Heathkit/Zenith software/hardware - http://heathkit.garlanger.com

    6. Re:SCSI by WgT2 · · Score: 1

      True, but the SCSI bus is capable of simultaneous read/writes while the IDE bus is no so. (IDE RAID sort of works around this limitation.)

  7. BACKUP! by Anonymous Coward · · Score: 4, Interesting

    Think about the fact that you need a sufficient Backup. You can buy lots of cheap storage with SATA Disks, but Ultrium 3 Tapes (400/800) are still expensive as fuck. Never forget that cost when calculating.

    OTOH, there are 300GB U320 Disks now, which you could use if latency is not an issue. Otherwise, go with lots of disk arms (72GB or 36GB U320 Disks)

    1. Re:BACKUP! by Spazmania · · Score: 3, Informative

      ATA and SATA drives are a great choice for online backup. Its pretty easy to put several terabytes worth in a box these days, software raid-5 them with Linux and then use tar and gzip. The price is not exceptionally higher than tapes either and the reliability (i.e. your success rate restoring data) is superior.

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    2. Re:BACKUP! by kahanamoku · · Score: 5, Informative

      I've seen more dead HDD's than backup tapes, and have seen 60 times as many backup tapes than HDD's...

      and last time I checked, an Ultrium 3 tape was half the price of a 400GB Drive.

      I wouldn't use disks for backup, unless they're to be used as live backups, and then I'd still archive to tape (provided it was affordable).

      --
      ----- Concentrate on promoting more than demoting.
    3. Re:BACKUP! by eric76 · · Score: 2, Interesting
      ATA and SATA drives are a great choice for online backup. Its pretty easy to put several terabytes worth in a box these days, software raid-5 them with Linux and then use tar and gzip. The price is not exceptionally higher than tapes either and the reliability (i.e. your success rate restoring data) is superior.

      That depends on how long you need to store the data.

      If you need it for a short time, you might be correct.

      But if you may need the data 5 years or more from now, tape is clearly far superior.

    4. Re:BACKUP! by eric76 · · Score: 2, Insightful

      You are absolutely correct. In addition, with a tape, you can much more easily take copies off-site for storage. I frequently suggest that people get a safety deposit box in a bank at least 20 miles away from their facilities and store a copy of their backups there.

    5. Re:BACKUP! by Spazmania · · Score: 4, Insightful

      But if you may need the data 5 years or more from now, tape is clearly far superior.

      You have much luck getting data back from a tape five years later?

      First you have to find the tape. You can't have misplaced it and you can't have reused it due to the damn high cost of magnetic tape.

      Then you have to find a drive that can read the tape. The one you wrote it with died two years ago, its no longer manufactured and oh darn none of the three you picked up off ebay use the same compression format.

      Next you need the old backup software. You've been using Acme Archiver for the past three years; It doesn't understand the old SuperBackup format and unfortunately SuperBackup only ran in DOS with an 8-bit ISA SCSI card.

      Finally you have to pray that the tape is still good. They're like floppy disks; they go bad just sitting on the shelf.

      Buddy, I've been there. It ain't pretty. So for the last 7 years I've stored my backups on hard disks. No pain! No pain!

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    6. Re:BACKUP! by eric76 · · Score: 2, Insightful
      You have much luck getting data back from a tape five years later?

      Yes.

      First you have to find the tape. You can't have misplaced it and you can't have reused it due to the damn high cost of magnetic tape.

      That is no problem at all. I keep detailed listing of what backup set is stored on what backup media.

      As far as the "damn high cost of magnetic tape", you must be talking about those cheap tape drives that use expensive tapes. We have a couple of those around here, but we don't use them much at all.

      Interestingly enough, individual tapes don't really vary all that much whether they carry 4 GB or 400 GB. But IMHO the tape drives that use 4 GB tapes are not trustworthy enough to use for backups and I would never suggest using such.

      Then you have to find a drive that can read the tape. The one you wrote it with died two years ago, its no longer manufactured and oh darn none of the three you picked up off ebay use the same compression format.

      If it is no longer manufacturered in two years, then you made a very poor choice of choosing a backup system.

      Go to LTO Ultrium. It will still be around in 5 years.

      Next you need the old backup software. You've been using Acme Archiver for the past three years; It doesn't understand the old SuperBackup format and unfortunately SuperBackup only ran in DOS with an 8-bit ISA SCSI card.

      Why would anyone use something like that for a backup? The first critera should be that whatever you write to the tape should be readable using any device that can read and write to that tape.

      Finally you have to pray that the tape is still good. They're like floppy disks; they go bad just sitting on the shelf.

      I've had very little problem with bad tapes. But just in case, having just one copy of the file on a backup tape is an amateur error.

      Buddy, I've been there. It ain't pretty. So for the last 7 years I've stored my backups on hard disks. No pain! No pain!

      I've used tapes, disks, CDs, DVDs, and even in a few cases printouts of really important data. And tapes are still my favorite.

    7. Re:BACKUP! by mrchaotica · · Score: 1
      last time I checked, an Ultrium 3 tape was half the price of a 400GB Drive.
      That doesn't include the ~$2000 cost for the tape drive itself. The real cost for tape is $2000 + $75 per tape, while the real cost of SATA is just $200 per disk (or $240 if you want an external enclosure for each one (see below)). Unless you've got more than 6.4 TB of data to back up (the point where 2000+75x = 200x, where x is a 400GB-capacity tape or disk) (or 4.8TB assuming enclosures), hard disks are cheaper.

      (Incidentally, I checked pricewatch, and I'm intentionally giving low figures for the tape equipment. In most cases, hard drives would probably be even more favorable.)

      Besides, what if you just put the hard drive in a portable enclosure and pretended it was tape (i.e. plug it in, do the backup, and take it out, rather than leaving it running all the time)? Surely that would reduce the failure rate of the hard drives by a lot. Plus, then you could store it off-site, as the other person replying to you mentioned.

      Considering all that, hard drives might actually be a better choice than tape, unless you've got a whole bunch of data.
      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    8. Re:BACKUP! by misleb · · Score: 1

      The problem with disk backup is that it is difficult to maintain multiple archives. And maintaining offsite backups can be a pain. Several times I've had to go back months to retrieve old data. And unless you know exactly what you might need to archive and can put it on DVD (pain in teh butt compared to regular tape rotations), you're going to want archives of everything. There is a reason why people still use tape and it isn't because they are stubborn.

      -matthew

      --
      "THERE IS NO JUSTICE, THERE IS ONLY ME." -Death
    9. Re:BACKUP! by Rickler · · Score: 2, Funny

      /walks by backup storage room with high powered magnet. ololo they fired me?!

      --

      The human race is artificial intelligence created using object orientated programming.
    10. Re:BACKUP! by Anonymous Coward · · Score: 0

      "I frequently suggest that people get a safety deposit box in a bank at least 20 miles away from their facilities and store a copy of their backups there."

      Ok, this one sounds good, in theory... But in practice, what happens that causes everything in a 20 mile radius to be rendered useless, gone, destroyed? Nuclear attack? Meteor strike?

      Guess what, those fucking backup tapes are the LAST thing on my goddamned mind if a 20 mile radius of Earth just got blasted to smithereens. If I wasn't killed in the incident, damn sure there was many friends and family that were. I'd be spending my time rebuilding my life, not giving shit about some pissant company's data.

    11. Re:BACKUP! by HuguesT · · Score: 1

      Usually your drives will be arranged in some kind of RAID. The truly redundant ones cost you 50% to 75% of your capacity, so it really become cheaper around the 2-3TB mark.

      On tape you can have multiple incremental backups. Incremental backups are very cheap and fast, you really want those. Multiple backups are essentials too.

      Then there is the offsite aspect. If you want to backup your server room to somewhere 20 miles away on a regular basis (every night?) you're up for a hefty network access bill.

      If we are talking a server room full of a few TB of data, 2-5k$ is chump change. Pros use everything, RAID, disk backups, CD/DVDs, printouts, and tape. Tape is not going away.

    12. Re:BACKUP! by mrchaotica · · Score: 1

      You seem to have missed the key point of my idea. Think about it: is a hard drive significantly larger than a tape? No! So why not put the backups on the hard drive? As in, big tar files and incremental backups and whatnot. After all, the medium and file structure have nothing to do with each other, really. And "bandwidth" means "N hard drives, each in an external SATA/FireWire/whatever enclosure, in a box that somebody carries to the safe-deposit box at the bank" -- just as if they were tapes.

      Does this make sense, or am I missing something?

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    13. Re:BACKUP! by Savantissimo · · Score: 1

      Well, if you live near a coast or a fault line it's sure a good idea.

      --
      "Is life so dear, or peace so sweet, as to be purchased at the price of chains and slavery?" - Patrick Henry
    14. Re:BACKUP! by afidel · · Score: 1

      I work for a midsized company, we have about 600GB of online data which is not that large for a firm with around 200 employees. 4.8TB is only 8 monthly backups of our entire systems if you assume zero data growth. So in less than a year LTO3 becomes cheaper than the HDD solution and is a vastly superior solution. Not only that but we take our daily tapes offsite which would probably lead to eventual failure of the HDD's as they are not designed to be carried like that on a regular basis. Plus startup and spindown is by far the hardest on HDD's, I've seen about 10x as many failures on HDD's during power up then on running systems.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  8. Hmm by PrvtBurrito · · Score: 4, Informative

    We have both SCSI raid (2 1TB arrays with 10k RPM SCSI drives on a dell powervault) and a several arrays with 3ware cards (an 8 way and a 12 way both with 200 or 250GB drives). We run Red Hat WS. We find that the 3ware cards are excellent for large data storage but have latency issues compared to the SCSI raid array. We are happy with both systems, but the price break on the 3ware shows, and I wouldn't recommend for really heavy use.

    --
    Laboratree - Scientific collaboration based on OpenSocial.
    1. Re:Hmm by Unknown_monkey · · Score: 1

      No offense, but I'm not sure which one is not recommended for heavy use.

    2. Re:Hmm by Malor · · Score: 1

      I have an 8500-series 3Ware card, and it really doesn't perform well under load. If the array is under heavy write I/O, the entire machine drags to a near-halt. It's very frustrating. I ended up using the fairly expensive card as a JBOD controller, and using software RAID instead. On an Athlon 1900+, it's MUCH faster in software. I'd really prefer hardware RAID, but it's just too damn slow.

      The 9XXX series are supposedly better, but reviews I've read suggest that they're still pretty underpowered for heavy writes.

    3. Re:Hmm by raynet · · Score: 1

      Are you using also 10000 RPM SATA drives in the array or only 7200 RPM drives (that would explain some of the latency difference)? Too bad there aren't any 15.000rpm SATA/SATA-II drives available.

      --
      - Raynet --> .
    4. Re:Hmm by dougmc · · Score: 1
      I've got a 3w-7810 card. With 6 200 MB IDE drives, it's sequental RAID-5 write performance is about 13 MB/s. A single drive could do better than that!

      So I did the same thing you did -- used it as JBOD and did RAID 5 in software. It's still slower than I think it should be, but at least it can write at 40 MB/s now.

  9. I'd say SCSI by SocialEngineer · · Score: 1, Insightful

    It's more reliable, as far as I know, compared to SATA. SATA is good enough for desktop performance, but I have yet to hear any glowing reviews of it in the server market.

    --
    "Better to be vulgar than non-existent" -Bev Henson
    1. Re:I'd say SCSI by dubdays · · Score: 1

      Yes, SCSI is far more reliable, but it really all depends on the application. If the file server is going to be used only intermittently, SATA may be okay. However, if the server is I/O intensive, you really need to go with the SCSI drives. Basically, SATA drives are rated for desktop use (something like a 10% duty cycle...don't quote me), while the SCSI drives are rated at a 100% duty cycle. This is why SCSI is recommended for database servers, while SATA is often recommended for nearline use, like a disk-to-disk backup server. Plus, if you get the higher end SATA drives (e.g. WD Raptors), you're getting up toward the SCSI price-range, anyway.

    2. Re:I'd say SCSI by GigsVT · · Score: 5, Interesting

      OK, hear mine then.

      We have several terabytes of SATA storage at work to hold our main business-critical digital asset archive.

      We've been using a ATA/SATA disk-only strategy for over 5 years now. It's worked great, and eliminated our slow and unreliable tape robot, which has greatly improved productivity.

      Back in 1999/2000 SCSI wasn't an option for the main archive because a terabyte of SCSI would have broken the bank. We went ATA back then. It was a mess trying to route 24 ATA cables in a case, I admit. SATA fixes that nicely.

      We keep three copies of our data, two onsite and one offsite. We use rsync-incremental snapshots to do disk-based incremental backups. Because the cost of SATA is less than 1/3rd the cost of SCSI, we get a high reliability solution for less than the price of a single SCSI RAID.

      One more advantage of SATA is that the disks are so cheap, it's easy to just replace all of them every two or three years. The disks you replace them with generally are twice as large after 2 or 3 years, so every cycle your RAIDs get more reliable as the number of disks is slashed in half.

      Most companies wouldn't replace every SCSI disk every two years, it would cost way too much. And considering the slow pace of SCSI size growth, you wouldn't see as much gain, a double hit against SCSI.

      So basically unless you need the excellent latency performance of SCSI, higher than even the WD Raptor can offer, I see no compelling reason to use SCSI for anything anymore.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    3. Re:I'd say SCSI by Zemplar · · Score: 1

      "So basically unless you need the excellent latency performance of SCSI, higher than even the WD Raptor can offer, I see no compelling reason to use SCSI for anything anymore."

      Unfourtnately you are mostly correct, save one concern of mine. I've personally had horrible luck keeping many Western Digital drives last until the term of their warranties. I've never had a Seagate drive fail on me. Though the Raptors offer good performance and have a better than usual warranty from WD, being produced by WD drives scares me. Seagate SCSI drives are now what I use and am very pleased. Now if only Seagate would make a similar drive as the Raptors...

  10. Re:What's this SCSI you speak of? by Anonymous Coward · · Score: 0

    More like "harbl harbl harbl!"

  11. SATA? I don't know.... by toofast · · Score: 5, Informative

    I use SATA on our smaller, non-mission-critical servers. For our data backend, I wouldn't touch it with a 10-foot pole.

    Here are some scenarios where I wouldn't hesitate to use SATA:

    - You have redundant servers. Using LVS and/or Heartbeat and your favorite tools, you can get full server redundancy using less expensive hardware. The overall solution can be quite elegant, with hot failover. Why just cover the drives?

    - Front-end cluster nodes. You have a powerful, expensive backend server (with a cheaper failover) and you use inexpensive front-end servers for serving client requests. Sounds like overkill for what you want, but with the right server load balancing technology, it can give you a scalable, fault-tolerant and damn fast solution.

    - You can live with downtime. Install a server with a couple of SATA disks in a RAID configuration and hope for the best.

  12. Re:What's this SCSI you speak of? by jhunholz · · Score: 1

    SCSI is great for a database server, but SATA will be much more practicle in a file server. That's the way I would go.

  13. SCSI file servers by juicy · · Score: 2

    Go SCSI. It's pretty amazing that there are so many people out there whose only exposure to SCSI has been a mention in a textbook or having read about it, considering it was the standard for performance for so long. The speed and reliability is definitely better than SATA and as you well know, price isn't much of an issue when the RAID fails is it?

    --
    -- Eli Juicy Jones
    1. Re:SCSI file servers by Spazmania · · Score: 1

      price isn't much of an issue when the RAID fails is it?

      The price is a huge issue. The cost of regenerating the lost data is enormous.

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    2. Re:SCSI file servers by rizzo420 · · Score: 1

      are you saying that raid has a bigger chance of failing on a SATA drive than SCSI?

      i'm confused...

      also, the guy with the question mentioned that at most, there will be 50 concurrent connections, and the place only has 100 employees. i've run labs and had well over 50 machines connecting to a single symantec ghost server... no raid, standard ATA. no trouble (yes, i realize they don't connect and transfer large amounts of data all the time like a fileserver). if he was really worried about speed, i'm sure SCSI would be the answer. SCSI is pretty old technology... modern hard drives are pretty robust (i'd put the best ATA or SATA drive up against the best SCSI drive, i bet they'd both last just as long). the only real difference is speed, and i bet his users won't notice much difference with SATA. and if, per chance, a drive fails, it's a cheaper to replace that SATA drive than a SCSI drive. and don't tell me SCSI drives don't fail.

      --
      please me, have no regrets.
    3. Re:SCSI file servers by timmy+the+large · · Score: 1

      I bet they wouldn't. SCSI is made to last in a high stress enviroment. ATA and SATA are made for desktop usage. There is a definite quality difference. There is also a huge price difference, which is why everyone uses SATA or ATA unless they have to use SCSI.

  14. SCSI for tier 1, SATA for tier 2... for now by Anonymous Coward · · Score: 2, Insightful

    For any tier 2 storage, SATA is the future. I'd go so far as to say that in about 80% of all instances, good SATA-II drives with NCQ and large caches in a proper RAID setup will be far, far more than adequate for most production servers unless you do tons of non-sequential I/O, need tons of iops, etc.

    For a file server, you'll be fine with SATA.

    For my tier 2 servers, I am moving a ton of stuff off of my EMC gear (because fibre channel drives are damned expensive) onto SATA-II drives in an iSCSI setup. I'm already running servers with trunked gigabit NICs... might as well let Win/*NIX boot from a local drive and mount block-level iSCSI devices over the gigabit fabric. Save a shit ton of money and get tons of space on the cheap (7.5 terabytes of fast SATA-II in an iSCSI chassis for well under $10K... make your RAID groups, carve up some LUNs, and par-tay!).

  15. The real info by sabreofsd · · Score: 5, Informative

    There might be some benefits to you to sticking to SCSI vs. SATA, it really depends on your preference. Both SCSI and SATA offload the main processor from the duties associated with reads and writes. SATA also now has optimized reading patterns just like SCSI. The only real advatages SCSI has right now are the speeds (SATA 150 (there is a newer faster one coming) vs SCSI 320). Also, most SCSI drives are desgined for 24/7 use, whereas most SATA drives are designed for desktop use. Just make sure the SATA drives you buy are made for Enterprise level operation. So it really comes down to compatability/speed vs. cheap/larger. Hope this helps!

    --
    Sabre
  16. Be sure you get NCQ support by bradleyland · · Score: 1

    If you go with SATA 150, make sure that the drives and the controller support NCQ. This is incredibly important for a server. If you want to split the difference between SATA and SCSI, go SATAII with a PCIx 64-bit or PCIe RAID card with expandable on-board cache of around 128MB.

    SCSI what?

  17. Re:What's this SCSI you speak of? by fodi · · Score: 2, Insightful

    Why? Why? Why?

    At least give us a 2 line explanation, so I don't think you're speaking crap that you read in an advertisement! Please, if you have some justification, I'd love to know what it is... seriously.

  18. Re:What's this SCSI you speak of? by Deliveranc3 · · Score: 1, Insightful

    Parent is right, for an admin the scariest thing you'll ever here is.

    "I can't find the network drive"...

    SCSI will seem a bit more expensive at first but that cost isn't just for the interface most of that cost is for the extra testing and hardware reliability you get with SCSI.

    I am a pir8 and I back up everything important so I can run an SATA raid-0.

    But you want something with modular controllers hotswappable raid arrays and reliability, hell if I was running a businnes off my home PC that would be something I would invest in.

    Find a way to make do with less space on the netword and greater reliability, you will sleep A LOT easier.

  19. Re:What's this SCSI you speak of? by turboflux · · Score: 1

    That thought had crossed my mind as well. I plan to setup the Active Directory equiv of home directories for each user and have them use that rather than My Documents. Plus there will be some common shares with high resolution product images, brochures, and so forth.

  20. SATA II is not your father's SATA by xusr · · Score: 5, Informative

    the SATA II spec is quite a bit different from the original SATA. SATA II adds port multiplication, hot plugging, native command queuing, external enclosures, and port selection. Also, with a theoretical peak of 3Gbps, it's twice as fast as the old SATA. here is a decent article with more explanation.

  21. SAS by Punboy · · Score: 3, Interesting

    Serial-attached SCSI. 15K RPM drives, SAS RAID 1+0. Heaven.

    --
    If you like what I've said here, and want to read more, go to http://www.krillrblog.com
    1. Re:SAS by Taimat · · Score: 1

      I'm drooling.. stop it.....

      --
      The above comments are not guaranteed to make sense to anyone other than the author...
    2. Re:SAS by Wesley+Felter · · Score: 1

      Sure, if you're made of money.

    3. Re:SAS by Anonymous Coward · · Score: 0

      Little do you realize, he is in fact, composed entirely of money, a sort of paper maché man made out of assorted bills with quarters for eyes

  22. Re:SATA? I don't know.... by GigsVT · · Score: 1

    You have redundant servers.

    Exactly.

    SATA is so much cheaper that it's easy to afford completely 2 or 3 redundant storage servers for the same price as a single SCSI array. We do this at work in fact.

    With SCSI around $3-5 per gig, and SATA around 35-70 cents per gig, it's a no brainer if you need lots of storage and latency doesn't have to be top of the line.

    If it's latency bound then SCSI or more expensive SATA 10,000 RPM Raptors would be better.

    --
    I've had enough abrasive sigs. Kittens are cute and fuzzy.
  23. obgligatory ballmer... by voxel · · Score: 3, Funny

    USB 1.0 didn't kill SCSI, Steve Ballmer F**KING KILLED SCSI!

    Now Ballmer says he is going to F**KING KILL USB.

    Man I need a T-Shirt that says that.

    "Steve Ballmer says, "I'm going to f**king kill you!"".

    --
    Modesty is one of life's greatest attributes
    1. Re:obgligatory ballmer... by Savantissimo · · Score: 1

      See if Cafe Press will do it. I'd buy one if it came in hat form.

      --
      "Is life so dear, or peace so sweet, as to be purchased at the price of chains and slavery?" - Patrick Henry
    2. Re:obgligatory ballmer... by GungaDan · · Score: 1

      A shirt that comes in hat form... a shat?

      --
      Eloi are stupid, throw morlocks at them!
  24. These are different media for diff jobs by postbigbang · · Score: 2, Informative

    SCSI is very fast, and usually more expensive. You can get really fast, highly cached drives in SCSI with high-RPM spindles, and cool controllers. But they're $$$$. Do you need the speed?

    If not, SATA is still pretty fast, much less expensive, less clever controllers, but still very reasonable for things like archiving, steady low-concurrency-demand streaming, and so on.

    SATA also has the advantage of not needing loads of austere cables with distance limitations imposed on them; it's a serial rather than a parallel bus-- hence the S in SATA. Use SATA when you don't need the absolute fastest you can get-- and you won't have to spend the most on the controller (which is hopefully a SCSI PCI-X controller or other fast clocker), the drives, the pricey cables, and so on. But if you need the speed, there is no faster than SCSI except for flash drives, which are still hideously expensive.... and not writeable as much as we'd like them to be.

    --
    ---- Teach Peace. It's Cheaper Than War.
  25. SATA and Linux will be much faster... Soon. by MarcQuadra · · Score: 3, Informative

    I recently did all this research myself. SATA on Linux is going to get MUCH faster, probably as fast as SCSI, but you'll have to wait for the libATA improvements to take hold. Right now NCQ isn't implemented, and neither are 'multiple sector transfers'. I bought hardware that WILL support those features because I know that NCQ will dramatically improve speed and latency (under high-use conditions) when it is finally fully-baked.

    The site to track progress on the library and driver status is here: http://linux.yyz.us/sata/

    The project has been moving along quite well. I think their goal is to completely modularize, simplify, optimize, and consolidate the ATA, ATAPI, and SATA kernel pieces into one overarching (underlying?) library. I like this kind of work. I can't see why ALL disk-like I/O isn't under one big modular kernel library, it seems like it would make adding new transport types and drivers a lot simpler and reduce maintainance all-around.

    --
    "Sometimes, I think Trent just needs a cup of hot chocolate and a blankie." -Tori Amos on Nine Inch Nails
    1. Re:SATA and Linux will be much faster... Soon. by Wesley+Felter · · Score: 1

      Of course, you can bypass all those shortcomings if you get a good hardware RAID controller.

    2. Re:SATA and Linux will be much faster... Soon. by GigsVT · · Score: 1

      3ware shows up to the OS as a SCSI controller, so its performance does not depend on the SATA kernel parts.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    3. Re:SATA and Linux will be much faster... Soon. by Aneurysm9 · · Score: 1

      Yes, actually, it does. All SATA drives show up as SCSI devices. They still use the SATA subsystem.

      --
      There was Cowboy Neal at the wheel of a bus to never-ever land.
    4. Re:SATA and Linux will be much faster... Soon. by GigsVT · · Score: 1

      Not all. It depends on what kernel module you use. The newer set of modules make drives show up as hdX.

      3ware in Linux uses no SATA kernel code. The 3ware module has been around since before SATA was even supported at all in Linux and it hasn't changed significantly since then.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    5. Re:SATA and Linux will be much faster... Soon. by SpaceLifeForm · · Score: 1

      Differences in Error reporting up the call stack
      can lead to complexity and bugs. It's not as simple
      as it may first appear.

      --
      You are being MICROattacked, from various angles, in a SOFT manner.
    6. Re:SATA and Linux will be much faster... Soon. by mishehu · · Score: 1

      Uhm, what newer set of modules? Certainly not what's in the kernel tree as of 2.6.15.x. Sil & nvidia SATA controlers on this motherboard, and drives show up as scsi type.

    7. Re:SATA and Linux will be much faster... Soon. by misleb · · Score: 1

      Umm, isn't libata for individual disk access not hardware RAID? Any serious (PC based) storage implmentation is going to be hardware RAID and that pretty much makes libata irrelevent. These devices either show up as SCSI or generic block devices. The NCQ and all that would be implemented on the controller, AFAIK.

      -matthew

      --
      "THERE IS NO JUSTICE, THERE IS ONLY ME." -Death
    8. Re:SATA and Linux will be much faster... Soon. by GigsVT · · Score: 1

      The very first SATA module set showed up as sdX, then they put in the libATA to hdX, it does look like as of 2.6.7 or 2.6.8 and later everything's back to sdX.

      3ware still doesn't use any kernel SATA code.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
  26. Re:What's this SCSI you speak of? by DaedalusHKX · · Score: 1

    Sata has extremely fast sustained throughput... scsi, as I recall can perform more drive intensive tasks while keeping them OFF the cpu mix. When many requests for files come in, either of these will do, but personally I'd put money in SCSI unless you're really a badass geek, have some 3ware hardware, and as in "badass geek" you eat, drink and sleep HDD troubleshooting. (I'm still using parallel drives for mass storage, but they're mirrored on a pair of RAID cards. About 400 gigs, redundant, or almost 1 TB if the drives were separate)... if any of mine fail, its a quick trip to the store for another 160 GB seagate :)

    ~D

    --
    " What luck for rulers that men do not think" - Adolf Hitler
  27. Fibre Channel by MoFoQ · · Score: 3, Informative

    I'd say Fibre Channel.

    One benefit that SATA does have over SCSI is the cabling....it's smaller and blocks less airflow (and easier to do the cabling).

    SCSI on the other had has other benefits....like it's used in enterprise servers now. Faster, daisy-chained, more RAID options, etc.

    Of course, Fibre Channel is basically SCSI on steroids and has the cabling benefits that SATA has.

    With more room thanks to less data cabling, u can add watercooling to reduce the heat generated by the 15k+ rpm drives.

    1. Re:Fibre Channel by manalishi · · Score: 1

      I set up a four-drive FC array on OpenBSD and, yes, it is fast. But the thing is LOUD!

  28. As with all things, it depends on the usage... by thesandbender · · Score: 2, Insightful

    I am a huge fan of efficent, cheap systems. The bulk of our server load is handled by dual opteron machines with 3ware cards and a 10k rpm system spindles and 7.2k rpm data spindles. However, even the best sata drives choke under file system and database loads and our primary data stores are U320. StorageReview.com has a good review of the new 150gig 10k rpm WD drive that shows it gettting spanked by SCSI drives under non-linear server loads. Long story short, if you expect a lot of drive activity you might be able to eek by for a while with a well tuned SATA system but you will have to pony up for a SCSI system at some point and you might as well do it now and save yourself the hassle of migrating later.

  29. Serial Attached SCSI by klparrot · · Score: 4, Interesting
    You could go with Serial Attached SCSI (SAS). SAS drives offer the high-end performance of traditional SCSI, and you can also hook up regular SATA drives to an SAS controller if you want to go chep for now and upgrade later, or if you only need some of your drives to have high performance.

    SAS hardware is currently a little harder to find than SCSI or SATA stuff, but I'm sure there's a good selection out there if you take the time to look.

    I was checking out the Sun Fire 4100 a while ago, and it takes SAS drives, however the form factor is 2.5", and I haven't yet seen any 2.5" SATA drives (I wanted that compatibility). Also, I've heard SATA drives don't work with the Sun Fire 4100's SAS controller anyway. Not sure about that, since the SAS spec says they should work, but just something to keep in mind when you're looking for a server or mobo or controller that supports SAS.

    1. Re:Serial Attached SCSI by GigsVT · · Score: 1

      I'd wait on SAS. It remains to be seen whether it will even go anywhere. It would suck to dump that much money into something that never takes off.

      Not to mention you can't ensure that you'll even be able to get replacement drives if some fail, considering the extremely limited choices and supply of SAS drives and components.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    2. Re:Serial Attached SCSI by wild_berry · · Score: 1

      I think that high-end notebook computers are beginning to use SATA disks (Alienware, while using SATA, doesn't explicitly say that their m7700 uses 2.5" disks but I'd doubt they fit two 3.5" drives in a notebook), so they exist. Froogle has a U.S.A. search and A U.K. search yielding a few results for 2.5" SATA disks, though I doubt these drives have a decent amount of cache or spindle speeds above 7200rpm.

  30. Stick with SCSI by dFaust · · Score: 3, Informative
    SATA drives have definitely improved, and for file servers NCQ definitely helps out alot.... but for the absolute best performance in a (true) multi-user environment, 15k SCSI drives still offer gobs of performance over even the new 150gig 10k Raptor SATA drive. Ultimately it will come down to how important price vs. size is to you... but speaking purely on performance, 15k SCSIs are the way to go.

    One way to curb some of the cost, I might add, would be to switch to something like RAID 5... you won't have as high throughput, but you'll still see performance gains and end up with more usable drive space. The throughput likely won't be your problem, anyways... typically it would be the drive's ability to handle multiple simultaneous requests, which heavily relies on low access times (which is why SCSI dominates in this type of environment).

    Here's a quick reference of some IOMeter benchmarks using a file server test pattern. You'll see what I mean. Wealth of info on drives on that site.

    1. Re:Stick with SCSI by Trepalium · · Score: 1
      Just keep in mind that you will see performance degradation with RAID 5 when you're doing a lot of writes to the drive. You can get significant read performance increase from RAID 5, but parity generation can take quite a performance toll on writes (each write must be accompanied by a disk/cache read from each non-parity drive, and a write to the corresponding parity block).

      If you get SCSI/SAS RAID 5, strongly consider getting a controller with a battery backed cache module. First, it'll help prevent bad blocks from appearing due to power failures during write operations by retrying any writes that were pending before the power failure (drives with unreadable sectors may be kicked off the array, and you don't want that happening unnecessarily). Second, it'll also reduce the likelihood of data corruption due to a power failure, although you may not want to trust in this too heavily. A reliable UPS is very important, but mistakes happen - battery replacement indicators get missed, power failure shutdown doesn't get tested after a configuration change, cables get knocked loose, etc. If it wasn't bad enough that your server went down when it wasn't supposed to, how are you going to feel if the array is out of order, too?

      --
      I used up all my sick days, so I'm calling in dead.
  31. Depends on load by darkwiz · · Score: 3, Interesting

    You cannot buy the same performance class of drives in SATA as you can with SCSI. Some people call this a market segmentation scheme, I call it catering to the market. People who demand top class drive performance typically also want the other benefits of SCSI as well. Whether those benefits are needed for your requirements, well, depends on your requirements.

    SCSI can (depending on which particular SCSI) provide you with more devices per controller without sacrificing (any noticeable) performance. If you need to shove a ton of drives into one server, this will add up quickly. Since you are talking about RAID 0+1, depending on how much storage you are shooting for, this may be a strong factor (but you may be able to skin by on the 4-6 SATA ports you'll find on most mobo's).

    SCSI is more mature. So drivers are likely to be more robust, more efficient, and more stable than those you'll find in your garden variety SATA.

    You'll typically find that under heavy load, SCSI performs better. Again, this is mostly due to so called "market segmentation" schemes, but that is why you pay more. If your users are going to be mostly dealing with the usual, periodic saving of word processing documents, spreadsheets, and a couple of light media files - you probably don't need to handle really heavy loads. The RAID controller will eat the peaks of write demand in cache (if you get a decent RAID controller - see later), and you should have fairly smooth performance. Then again, if your users are constantly running large installers (development test environment) or working with large remote files - you should really go SCSI.

    All that said: I think you would be served best by investing in a better RAID controller rather than investing in top of the line drives. The RAID controllers they integrate on to most motherboards are crap (for what you are trying to do, desktop use - meh). You want something with a ton of cache, and good management soft/firmware. If you buy a real server class motherboard, you may get a better onboard RAID, but however you go about this - pay the most attention to this detail. Unless you really need low latency for high demand, random access applications, top end drives probably won't give you much over the usual network latencies.

  32. Go with SCSI by dclxv · · Score: 3, Interesting

    I bought into the whole "SATA is the new less-expensive SCSI" and put in two new file servers using SATA last spring. I can say that I'm unimpressed with the SATA servers as compared with our SCSI servers. I now wish we'd spent the extra $1000/server and gone with SCSI. I recommend SCSI -- you won't second guess your decision down the road.

    BTW, we used 3ware controllers with WD RAID Edition HDs. We're supporting approximately 75 users per server.

  33. avoid sil3112 + seagate barracuda by Squigley · · Score: 1

    Might not even be SATA II (I'm not sure), but avoid using Seagate Barracuda drives with RAID controllers based on the SIL3112 chipset.

    There's an incompatibility with some of the drives, Seagate won't say which ones, and it will result in reduced performance, or the risk of data corruption.

    There's a really good page about it, explaining the issue, but I can't find it at the moment. "15m" comes to mind, but that probably doesn't help. Google for references to "seagate errata fix".

    1. Re:avoid sil3112 + seagate barracuda by allankim · · Score: 1

      Close ... Google for "Sil m15w" and you should find a series of 2.6 kernel patches by Tejun Heo ... The patch improves disk I/O quite a bit, but I still wouldn't want to use that setup in production. If you have some extra bucks lying around, just get some newer drives.

  34. Re:SATA is fine ... for some things by abcess · · Score: 5, Informative

    As a matter of fact, you may not be flying at all. It all depends what you're using it for. The problem with SATA is latency, and there's not much that controller is going to do about it. If you've got a server that is performing latency sensitive tasks, then SATA can cause performance problems.

    In my experience, if you've got alot of random I/O, SATA is not a viable solution. That said, even if your I/O is mostly random, if there's not a heavy load on the disk, then you're probably ok. If you've got 200 people hitting a database or email server, you're probably going to have some performance problems. Swap it out with SCSI drives, or a quality disk array, and you'll be doing much better. If you've got a web server, or a database server that is exclusively reading, you can probably get away with SATA. Again, it all depends on how much and how random the disk I/O for your application is.

  35. Speed differences by BobPaul · · Score: 1

    Well, SATA 300 is already out. We're using that for 3 1TB file storage servers on our domain, using a PCI-X x16 Raid controller with RAID5.

    And to be really honest, SATA 150's 150MB/s is not shared with any drives, where as SCSI 320's 320MB/s is shared among all the drives on the ribbon cable, AFAIK. I'd personally rather have 8 drives with 300MB/s a peice than 8 drives sharing 320MB/s. The former makes the PCI-X port the bottle neck (or perhaps the raid controller card itself).

    1. Re:Speed differences by rsax · · Score: 1
      Well, SATA 300 is already out. We're using that for 3 1TB file storage servers on our domain, using a PCI-X x16 Raid controller with RAID5.

      Just out of curiosity which RAID controller are we talking about here and which Linux distro?

    2. Re:Speed differences by BobPaul · · Score: 1

      They're 3Ware 9550SX-8LP controllers. We're actually running them on Windows Server 2003 to avoid problems tying them into our domain as they act as active directory profile servers, but they're supposed to support linux fedora, suse, etc. I see there's a module in the gentoo-sources on my home desktop (it's under scsi low-level drivers).

      Also, I just got smacked in the head by a co-worker for not knowing what I'm talking about ;) They're actually PCI-X 64bit, not PCIe x16. The my origional post was ambiguous as I saw the big long port and associated it with the PCIe slots I know... I'm new to department this year, and they had the machines built a while ago... :$

      But they are great. We have all the drives in hot-swap bays running with a hot spare in each server, staggered spin up, online expansion, etc.

  36. SATA by Andy+Dodd · · Score: 5, Informative

    SATA's peak raw transfer rate (150 MB/sec) is half that of the peak raw transfer rate of SCSI (320 MB/sec), but you're going to be limited by the individual hard drive's transfer rate anyway. Keep in mind that a proper SATA implementation will be 150MB/sec PER DRIVE, since each drive is on its own channel. SCSI is 320 MB/sec per channel, but you're in for a cabling nightmare if you want only one drive per channel. Note that there is a 300 MB/sec SATA standard, although few drives and controllers seem to support it.

    If you buy the right model, you can get SATA drives that have gone through the rigorous quality control testing that has historically been reserved for SCSI drives. Many of the higher end server-grade SATA models are warrantied for 24/7 operation. SCSI has lost its advantage there.

    SATA has Native Command Queueing, formerly a SCSI-only performance feature. Note that it's optional for SATA drives though, so make sure you get a controller and drives that support NCQ. Again, one of SCSI's few advantages has disappeared.

    Last, but most definately not least, SATA cabling is far simpler and robust than SCSI cabling. SCSI cabling is a finicky nightmare where even high-end cables can cause data corruption if you're not careful, whereas even the cheapest SATA cables I've seen worked reliably. I've had hardware related data loss on hard drives twice in my life. One case was an IBM Deathstar, the other was a SCSI cable that started flaking out and corrupted data on three drives at once. I haven't touched SCSI with a ten foot pole since that incident.

    --
    retrorocket.o not found, launch anyway?
    1. Re:SATA by GigsVT · · Score: 1

      You aren't kidding.

      We use a couple ACNC external RAID units at work, they have SATA drives inside and use a SCSI interface outside to the server.

      That SCSI link to the server is the most problematic part of them! A 7 foot high quality SCSI cable is damn expensive, and we go through them often (about 1 per year) as they flake out when you breathe on them.

      Not to mention apparently there's no standard for the way the thumbscrews to fit, so some cables on some controllers can't even screw in, they just have to sort of hang there!

      We also run SATA 3ware internal RAIDs, those at least don't have to involve SCSI at all.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    2. Re:SATA by Anonymous Coward · · Score: 0

      SCSI cabling is a finicky nightmare where even high-end cables can cause data corruption if you're not careful...

      Bullshit! Your comment is like the image word: dumber

      You had a bad experience once and you are down on SCSI. Has a girl ever thrown you? Given up on women?

      SCSI is very reliable, very well done, and if you buy crap you get crap. Do things right and you'll be fine.

      I speak from 15 years of experience with it, from low end Macs to stuff so expensive it is pointless to mention it. SCSI is awesome.

      Here is what I recommend for the OP:

      Don't leap in where you don't know what is going on. Get the OK to trial the SATA separately, but plan for SCSI for the new server. Try different SATA drives, try different controllers, try software RAID (especially if you can use something reliable like Solaris). See what works for you. Call the vendors up and see how they treat you, try simulating drive failures under load. Heck, if it looks good, make it a failover server and work out tests hopefully culminating in trying it out for a week when things are slower than usual. But start with what is known and reliable.

    3. Re:SATA by Andy+Dodd · · Score: 1

      I had one bad experience that resulted in massive data corruption.

      I had routine bad experiences with systems that wouldn't boot due to something in the SCSI chain flaking out.

      It's too damn easy for a single failure somewhere in the chain to take down the entire bus.

      Meanwhile, if there's a failure on an SATA channel, only one drive is taken out of commission (unless you're using an SATA port multiplier, which you shouldn't be if you care about performance or reliability).

      The fact that SATA cables have fewer conductors and hence are much thinner means they can be routed more easily within a case in a manner that does not produce stress on the connectors. Stress on the connectors reduces reliability. Also, every SCSI connector I've seen that was designed for internal system use had no inherent method of securing it to the device. (Neither did IDE by the way, although IDE connectors always had a tendency to fit tightly, unlike those horrific SCA connectors on many SCSI drives, which are loose as hell.) SATA connectors are designed to clip into whatever they're plugged in to.

      In short, while you may be able to come up with a reliable SCSI cabling setup, it's going to require a huge amount of money, research, and trial and error. SATA, on the other hand, will Just Work. Don't forget the fact that with SATA, you no longer need to deal with SCSI IDs, termination, or in the case of IDE, master/slave issues.

      Here's a good analogy: SCSI is like 10base2 Thinnet (remember, where one failure or improper termination would take down the entire network?), and SATA is switched 10baseT (failure in the equipment connected to one port has no effect on the other ports). 10base2 was technically superior when it came out, but does anyone use it anymore? I've seen it in use once in the past decade (back in 1999 or so), care to take a guess how I found out it was in use? Yup, an entire network segment failed at a place I was working due to a loose connector.

      --
      retrorocket.o not found, launch anyway?
    4. Re:SATA by Echnin · · Score: 1

      Hey, I'm late in the discussion, but my mobo has what it calls SATA2--I assume this to be 300 MB/s--and it's even a cheap ASRock card (the 939Dual-SATA2). ... Reading up on it, though, it's on a PCIe bus (same as the PCIe and PCIe x1 slots?), meaning the highest throughput would be 250 MB/s. And it's not supported by libata yet, so no Linux. So, meh. Still, I think we'll see a lot more of 300 MB/s SATA from now on. Love my SATA drive: no more ATA cables for me. :)

      --
      Lalala
    5. Re:SATA by jeavis · · Score: 1
      Andy Dodd wrote:
      Keep in mind that a proper SATA implementation will be 150MB/sec PER DRIVE, since each drive is on its own channel. SCSI is 320 MB/sec per channel, but you're in for a cabling nightmare if you want only one drive per channel. Note that there is a 300 MB/sec SATA standard, although few drives and controllers seem to support it.
      This hardly matters, since the SATA controller is only going to speak to one drive at a time anyway. Don't you think SCSI vendors would have abandoned it by now if this were really an issue? As for top speed, SAS is the next generation of SCSI, and is available today, running at 375 MB/sec.
      Many of the higher end server-grade SATA models are warrantied for 24/7 operation. SCSI has lost its advantage there.
      All SCSI drives are designed with this in mind, not just the high end ones. You don't need to pay top dollar for 15K U320 drives to get this sort of reliability.
      SATA has Native Command Queueing, formerly a SCSI-only performance feature. Note that it's optional for SATA drives though, ...
      Again, standard on SCSI. You don't need to pay extra for it.
      Last, but most definately not least, SATA cabling is far simpler and robust than SCSI cabling.
      For a desktop system with several drives, sure. For a modern server, not at all, because you won't be using internal 68-pin connectors. You'll be using a hot-swap SCA backplane, which has a single connection to the SCSI controller. Hot-swap SATA backplanes require a separate connection for each drive, resulting in a lot more wire in the box.
      I've had hardware related data loss on hard drives twice in my life.
      I've had dozens, and I can count the ones that were SCSI on one hand. One of my colocation customers has lost four 250 GB Maxtor SATA drives in the past 6 months. I have a SAN device that uses SATA drives, and while it's been reliable, the vendor insisted that we keep spares on hand. ATA drives, serial or otherwise, simply are not engineered to the same high standards that SCSI drives are. Their sticker price reflects this. You get what you pay for.
    6. Re:SATA by clarkc3 · · Score: 1
      unlike those horrific SCA connectors on many SCSI drives

      Horrific? maybe if you are using a 80 to 68 pin adpater and trying to mount them internally, otherwise I've never had a problem with any SCA drive. Every Server I've ever used them had rails on the drive and it would slide and click into place. I dont know how you could have any cabling issues since every Sun, Compaq/HP, and Dell Server that I've ever used that had them just has you slide them into an the array.

      Also, every SCSI connector I've seen that was designed for internal system use had no inherent method of securing it to the device.

      Again, see every just about every Sun, Compaq/HP, or Dell Server made in recent years. They slide in and click in. Sure you have to make sure you have the rails on properly, but thats easy

    7. Re:SATA by Anonymous Coward · · Score: 0

      Last, but most definately not least, SATA cabling is far simpler and robust than SCSI cabling.

      Boy, do I have to disagree with this. I don't have experience with SATA except on desktop systems, but every problem I have seen with SATA on a desktop has involved the cable. I think the connectors are just too damned chintzy to stand up. I have seen 6 cases where replacing the SATA cable cleared up intermittent problems.

      SCSI cabling is a finicky nightmare where even high-end cables can cause data corruption if you're not careful, whereas even the cheapest SATA cables I've seen worked reliably.

      I've been using SCSI for years and, except for one case that involved a flaky active terminator, have never seen any data problems related to cabling. Whereas SATA, which hasn't even been around that long, has caused me fits with cabling in at least 6 cases (see above).

      Now, the caveats: I do not use cheap SCSI cables. Cheap cables are data death! Most of my work with SCSI arrays doesn't even involve cabling per se. For instance, in HP Netservers, most of the connections between drives involves an actual circuit board with connectors that drives are plugged into. No wires, no wear and tear on the connections. But I have used a lot of redundant server connections to the same array that did use cabling between the servers and the drive array. Never a single failure that I traced to cables.

      SCSI drive arrays run for years and years and years with absolutely no problems. Most of the systems I see replaced are due to upgrades, not any kind of system failure. In fact, my personal server is a 7 year-old used HP LH3 with the original HP drives. It ran for 5 years 24/7/365 before I bought it and has run for me for another 2 years 24/7/365.

    8. Re:SATA by Reziac · · Score: 1

      I've talked to a couple people who had a RAID controller do the same thing -- flake out and corrupt hell out of stuff. In one case it wasn't obvious what was doing it til they'd replaced practically the whole rest of the server.

      One has to wonder how common this might be, since we hear so many more horror stories about "failed HDs" on RAID setups. How often was it not the HDs at fault, but rather the RAID controller? (Does anyone take those "failed HDs" home and test them in an ordinary standalone PC?)

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    9. Re:SATA by ckaminski · · Score: 1

      My recent favorite was a Dell Powervault that flaked out, finally dying. Putting the drives in another Powervault to recover them caused a number of other spare drives, plus this other Powervault, to also fail catastrophically. Thanks to one Powervault we lost 21 disks in the span of 12 hours.

      Thankfully I made a network copy before I tried the disk to disk copy. You know your SCSI setup is aged when you can copy data via GbE faster than you can copy volume to volume on your controller.

    10. Re:SATA by Reziac · · Score: 1

      Very very bad; not good!! How did the trashed drives being transferred into the other Powervault cause more drives to fail... or was their sending corrupted data thru the controller causing the controller itself to spew garbage? (never heard of such a thing, but the idea came to mind, so...)

      Anyway.. yeah, that's enough to make you HUG your backup mechanism...

      I've got some SCSI cards so old that a CDR attached to 'em will only do 4x, cuz that's their max thruput. Ouch!

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    11. Re:SATA by ckaminski · · Score: 1

      We're not entirely sure. We are sure that the backplane on the original Powervault was faulty, which explained some mysterious Cluster behavior we'd been noticing. All the drives and the two chassis are now in the tip, since we don't want to trust anything to them. This is where it gets weird: the second Powervault was our nearline storage and catalog storage location for our Netbackup install. I've spent the last 4 weeks recovering catalog backups and doing tape inventory because that server wasn't so well backed up. Oops.

    12. Re:SATA by Reziac · · Score: 1

      I'd be loathe to trust 'em again myself... certainly not for mission-critical data!! Sorta like some HDs I've got here that have made funny noises at one time or another... yeah, they still work, and haven't thrown any errors, but they ain't goin' near my Real Work machines.

      How'd you get so lucky as for the netbackup unit to be the next one to go?! Goes to show.. make backups of your backup catalogs TOO!

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    13. Re:SATA by ckaminski · · Score: 1

      I learned one very good lesson during this event. Always be sure you know which tape is holding your catalog backup before you toast the backup server.

      We needed it's Powervault, since the database server was MUCH more critical to restore, and they had identical hardware. We recorded the drive arrangements and numbered all the disks we took out with Sharpies so that we could restore the array once we saved the databases.

      Problem was we never got the chance to put the disks back online before the second array failed. :-/

      This is another reason I've sworn off anything but RAID1. Screw cheap drives. Give me mirrors or give me death. It's a waste, I know, but there's something primal about knowing I can just pull disk B and run over here and WHAMMO, instant machine copy.

      Good plan on the dodgey disks. Anything that sounds off-nominal should never get reused.

    14. Re:SATA by Reziac · · Score: 1

      Yeah... people forget about those backup catalogs. Especially critical for spanned tapes. -- Back in the Olden Daze of QIC80 tapes, most backup software only put the catalog on the LAST tape of a set. If you lost ANY tape of that set, the whole set was good as toasted. Fastback6 was an exception; it put the catalog on EVERY tape, so any single tape of a set was still usable by itself.

      At one time I was looking at some sort of RAID for backup duty for my own mess o'data... but heard too many horror stories. I've since decided the only sane method is full mirroring to an independent system. (I don't have a formal setup here, but critical data is duped across 3 systems, a couple of loose HDs, and a stack of CDRs, with some CDR copies elsewhere for remote safekeeping.)

      Tape drives of sufficient capacity are still too pricey for me, and through evil experience I'm also soured on any backup that doesn't offer full random access.

      --
      ~REZ~ #43301. Who'd fake being me anyway?
  37. REALLY depends on the task at hand by prantik · · Score: 2, Insightful

    For some things you NEED SCSI, for others you don't. That much is obvious.

    Large files/streams that require heavily mixed-mode I/O beat the balls off of SATA. E.g. Correct me if I'm wrong, but my partial understanding of SATA is that if many writes are cached and a read enters the queue, the cached writes are trashed.

    so if you are working with check-in/check-out I/O type such as Samba profiles, SVN stuff, or (Samba|N)FS on a small-medium number of small-medium size files, or web stuff, SATA offers best price/performance ratio, with RAID or whatnot.

    If you are working with large files that get a lot of unpredictable I/O, or databases, you really want SCSI.

  38. my tuppence by Anonymous Coward · · Score: 0

    SCSI drives should be more reliable and come with faster spindle speeds giving low latency and faster random access.

    I've heard that some h/w raid controllers use nice fast cache (similar to graphics cards), but, if you design the system correctly, you can get a (much) faster array using s/w raid. On the other hand, some h/w raid controllers have battery backups on their cache and (supposedly) cope ok with an OS crash, which means you can use write-back more reliably.

    Spec the CPU highly if you go for s/w RAID, and get lots of RAM. Also, I prefer an aggressively caching filesystem like XFS, but make sure you have a UPS, and don't piss about with s/w that can easily crash the system (like kernel models/drivers/etc).

    I've been told that RAID10 is best for random accesses (ie multiple applications/users).

    I think my choice would be 15K SCSI drives on multiple controllers with s/w RAID 10. Perhaps, 2 contollers each with 3 drives. RAID1 over 3 drives on each controller, then RAID0 over the 2 controllers. Buy more pairs of drives and you can hot-replace one mirror (a pair of drives) and use it as a backup, just like tapes.

    If that gets too pricy, try SATA.

    Be warey of benchmarking sequential performance - SATA/EIDE will look good compared to SCSI, but the latter should be (much?) better for random accesses.

  39. SCSI if you at all can afford it by Reapman · · Score: 1

    Depends on how important reliability is... generally SCSI means better support and warrenties, and hard drives designed to run 24x7. Hard drives that are targeted to the desktop user are designed for desktops, not servers. However if you have a good backup system, and you simply can't afford SCSI, then ya SATA would be your best bet. An example of that is my home server... no WAY I'm forking over a few thousand dollars just to serve my personal files :P

  40. Re:Only a rookie would suggest RAID 0+1 by georgewilliamherbert · · Score: 2, Insightful
    The fact that you mentioned RAID 0+1 shows that you really don't know anything about true storage. That is probably the most inefficient way to go, congratulations!
    0+1 is a mistake, but 1+0 isn't. 0+1 loses the data set if any one arbitrary drive fails in both sides of the mirror; 1+0 only if you lose both disks in a single mirror pair.

    RAID 5 is noticably slower disk performance for writes, and radically slower performance for reads and writes if you lose a disk. In many cases, the performance during a failure for RAID 5 can reduce system capacity below the required level, and thus RAID 5 is simply not an acceptable technology for those environments.

    If just sticking more data on disk is the requirement, and you don't care how slow it gets if you have a drive fall over, then RAID 5 is great. But real world enterprise environments exist where losing half your disk throughput will cause the company's service to go down, and then you're out of business. Those guys don't RAID 5 if they know what's good for them.

  41. Either way, not RAID 1+0 by T-Ranger · · Score: 1

    RAID 5, unless you like 50% waste, rather then 1/n (n>=3) waste.

    1. Re:Either way, not RAID 1+0 by Wesley+Felter · · Score: 1

      When you consider that a SATA RAID 10 array is cheaper (and maybe faster and more reliable) than a SCSI RAID 5 array, maybe the extra peace of mind is worth a little "waste".

  42. RAID0+1 != RAID1+0 by Anonymous Coward · · Score: 0

    > Since you are talking about RAID 0+1

    Actually, he said RAID1+0, which is not the same thing...in RAID0+1 (with 2 mirrors), if you lose a drive, you're back to RAID0 (no redundancy)...in RAID1+0, you still have a mirror on half your stripe.

  43. The very definition of RAID... by dbarclay10 · · Score: 5, Insightful

    The very definition of RAID is "Redundant Array of INEXPENSIVE Disks". Emphasis mine.

    I've already read a bunch of posts about how SCSI is more reliable than SATA. Well, they actually mean SCSI drives are generally more reliable than SATA drivers (and some actually say so). They're quite correct for the most part.

    Here's what storage vendors don't want you to know: It doesn't matter.

    Use RAID. With SCSI or FC disks, you'll have to use RAID5. At that point, two disk failures in a given array and you're screwed. You REALLY care that two disks don't fail at the same time. And when you're using low-end or even mid-range drives, it happens.

    Why do you have to use RAID5? Because with SCSI or FC disks, RAID5 is the only economical option. With a 300GB SCSI drive going for at least $1200USD, and FC drives of that size going for $2500USD, even the biggest corporations end up using RAID5.

    Of course, RAID5 isn't the only level of RAID. It's the least redundant of any level of RAID, as a matter of fact.

    Go SATA with RAID10, at least 4 drives, ideally six or more. With six drives, the likelyhood of having two drives fail before you can replace the first one is somewhat higher than if you're using SCSI, but the likelyhood of that second drive causing you data loss due to a failed array is infinitesimally smaller. It's guaranteed with RAID5, and the chance for RAID10 is inversely proportional to the number of disks in the array. So first the first drive has to fail, then the second drive which fails has to be of the same RAID1 set. Add onto that that drives do indeed "go old", and the heavier you work them, the faster they get old. With RAID5, disks tend to get worked a lot harder (without any cache, or if the cache misses, each write requires n-2 reads, and 2 writes).

    Of course, you've pretty much decided that RAID10 is the way to go. At that point it's cost. If you're looking for 50GB of fast redundant storage, SCSI is going to be slightly cheaper. If you need any amount of storage though, SATA is going to be a whole lot cheaper for the same level of reliability (which requires more spindles), and typically better speed (more spindles means more seeks per second and more megs per second, though one needs to be mindful that big SATA disks are only 7200RPM, while the slowest SCSI disks you're going to get are 10kRPM).

    Summary? I'm value-concious. I'd go the SATA route. RAID10, four disks minimum to start, a pair of 4-port 3ware SATA cards with 128MB+ of battery-backed cache. I'd do the RAID entirely with software (Linux MD), with each RAID1 set split across two controllers. We get cheap disk redundancy, cheap disk speed, cheap I/Os, and cheap controller redundancy. I'd consider using less fancy controllers, the 3ware jobbies tend to be expensive, but when you're doing big writes the cache makes a massive difference (75MB/s across four disks of RAID10 versus 20MB/s). I've considered putting together a dedicated storage appliance, exporting via SMB/NFS/NBD/GFS/what-have-you, without the battery-backed cache, but with a pair of 1U UPS units (one for each power supply). Then I'd go around turning off all the application-level fsync()ing, and see what happens with 4GB of disk cache. Bet it'd be fast. And with shutdown initiated via UPS trigger, almost as safe as a battery-backed cache. Remember: "Redundant Array of INEXPENSIVE Disks."

    God I ramble.

    --

    Barclay family motto:
    Aut agere aut mori.
    (Either action or death.)
    1. Re:The very definition of RAID... by greg1104 · · Score: 1

      Couple of comments on this message, which I generally agree with the theme of but with have some caveats on implementation.

      First, one of the benefits of buying cheaper drivers is that you can afford to buy an extra one that sits idle most of the time to use as a hot spare. The expected worst-case scenarios are much less serious if you start getting a rebuild to the spare the minute any one drive fails; you need two failures in the amount of time it takes to copy a disk to be dead, rather than two failures before someone can get to the server to fix the first bad drive. The real worst case scenario is of course a power surge or something that fries all the drives at once.

      Second, I'm not that big of a fan of the Linux software RAID. It's fine in an environment where the server can afford to be down until an admin comes in to deal with failed drives, but the ability of it to automatically repair itself and keep going isn't up to the standard set by the better hardware RAID. The main issue I've had with it in the past is that I've lost the ability to boot far too often. This is not a fault of the Linux software, but of the crappy limitations of the PC BIOS and how it impacts the bootloader. I've found that the real RAID cards are much better at segmenting off a failed drive so that the system boots anyway, and then remapping so the OS doesn't even notice. Too many times I've seen pure software RAID configurations where the boot drive goes, and the PC won't even boot to the point where it should switch to the alternate boot drive because the bad drive makes the BIOS insane. Also, I have my doubts about the feasibility of automatically switching to a hot-spare for the boot disk and having that work transparently. I aim at RAID setups where I can have any random person replace a failed drive while I'm sitting at the beach on my vacation; Linux software RAID doesn't meet that standard in my experience.

      Finally, one point I throw out at random while I'm typing here. Hard drives nowadays have gotten pretty good at throwing out SMART errors in advance of actually falling over dead; the last two massive disk failures on recent hardware I suffered from both gave me enough advance warning to save my critical data before the disk became unusable. In order to take advantage of this, you need a solution that lets the drive SMART data flow to the operating system where it can be caught and acted upon. I have found that many cards on the market, both SCSI and SATA, don't do a very good job of passing SMART early failure notices onto the OS so you can act on them.

      When I get a new controller in here, I make sure I can use SMART to see details like the drive's temperature (which is also handy to monitor) via the tools in the operating system. If I can't, that means it's unlikely SMART errors are going to make it through as well. 3ware is one of the few companies that seems to recognize the importance of SMART information even in an SATA context. The majority of the SATA cards I've seen (insert Silicon Image and Promise rant here) just throw all that data away. The better SCSI RAID cards just toss the drive out of the array the minute funny smells start coming from SMART, which is a perfectly reasonable approach in an environment where you're always going to have another drive to take over.

    2. Re:The very definition of RAID... by _generica · · Score: 2, Informative

      > The very definition of RAID is "Redundant Array of INEXPENSIVE Disks".

      Actually, the definition has been back-formed to "Redundant Array of Independent Disks, since you won't necessarily be using inexpensive drives any more.

      Just because you put 500gb drives in a RAID array, doesn't suddenly make them inexpensive, but they are each independent.

    3. Re:The very definition of RAID... by TheSkyIsPurple · · Score: 1

      Of course, RAID5 isn't the only level of RAID. It's the least redundant of any level of RAID, as a matter of fact. Guess that depends on what you mean by RAID... RAID 0 is not redundant at all, so it eeks out over RAID 5. (But since it's not redunant, is it RAID? Well, it's listed in RAID tables...) Nice thing with 5 though, is that most controllers I've used (on little servers) have hot spare options for RAID-5, but not 0 or 1. Load up a 6 disk array with RAID-5, and leave another 3 disks set as hot spares -> Now you can have 4 drives fail (as long as they don't all do it at once)

    4. Re:The very definition of RAID... by Anonymous Coward · · Score: 0

      well. I agree for the most part. now, I'm looking at what the marketplace is offering, and find that the Areca controllers offer RAID-6, which allows for 2 dead drives.
      this now provides for an interesting n+2 formula.
      now, you can find some rack cases that allow for 24 drives, that's 22 useable drives. with the latest 500G (some companies warranty them for 5 years), that makes 22x500, or 11Tb.
      as for the price point, 24x350 = 8400 (EUR or USD, doesn't matter)
      compared to SCSI, it's rather cheap

    5. Re:The very definition of RAID... by swillden · · Score: 1

      The main issue I've had with it in the past is that I've lost the ability to boot far too often. This is not a fault of the Linux software, but of the crappy limitations of the PC BIOS and how it impacts the bootloader.

      I think this is both not as big a problem as it appears and fairly easy to work around. It's not as big a problem as it appears because if it's rare that you need to reboot the machine, and it's also rare that your boot disk dies, then it should be very rare that you need to reboot the machine when your boot disk has died.

      To further mitigate the risk, however, there are some things you can do. Most PC BIOSes have the ability to boot from any of multiple drives. They try each one in turn, in a specified order. So step 1 is to make sure that each of the drives that your BIOS can start from is listed in that boot order and has a good MBR installed. If the first disk listed doesn't respond, the BIOS will try the next.

      Next, mirror your boot partition across all of the bootable drives, so that each still-good bootable drive has the boot block, the bootloader and the kernel. There are two ways to do this. The slightly more reliable but harder way is to actually copy the data to each boot partition, rather than using RAID-1 to do the mirroring, which allows you then to configure the bootloader on each differently. In this case, the bootloader on each disk should be configured to boot the copy of the kernel on that disk. Every time the kernel is upgraded or the boot configuration changes, all of the boot partitions have to be updated manually (or you need to create something to automate it).

      The other way to do it is to use MD to mirror the boot partitions, so that each will always have the same kernel and bootloader configuration (GRUB menu.lst). Then, configure that menu.lst to use the "fallback" option to try each of the boot drives in turn (I think you're limited to one fallback, so you can only have two bootable drives with this approach). That way, if the first boot device fails, when the BIOS boots the MBR on the second, GRUB will try booting the first device, also fail, then fall back to the second.

      In either case, once the bootloader manages to find and load the Linux kernel, you're home free, because the root partition can be a regular MD RAID volume.

      There's is still a possibility that the primary boot device could die in such a way that the BIOS still thinks it's good, but the disk fails to provide needed data from the GRUB image. In other words, the MBR is good, but the GRUB second stage is not readable. What would be really nice is if the running Linux kernel could re-configure the BIOS boot order on the fly... that way smartd could update the boot order when it noticed that the primary boot device was failing. Since I don't know of a way to do that, the next best option is to figure out how to change the boot device in the BIOS, document it, and tape it to the server display with a big heading at the top saying "IF THE SERVER WON'T BOOT, DO THIS". If you document it thoroughly, any clueless manager should be able to convince the machine to boot from the other device. Or, better yet, don't bother because the odds of the disk failing in exactly the right way to make this necessary are negligible.

      Or you could just pay a few hundred dollars and get a RAID controller, plus a spare to keep in the cabinet. Which option is better depends on a lot of things.

      Personally, my home file server is configured like this:

      • Five ATA-133 disks (four 200GB, one 250GB), on four IDE controllers.
      • My boot/root partition is /dev/md0, composed of hda1 and hdg1
      • Each of hda and hdg have a GRUB MBR
      • menu.lst sets hda1 as the first boot disk with hdg1 as the fallback
      • The rest of my storage is RAID5 (actually LVM over RAID5) across hda, hdc, hde and hdg, with hdh as a hot spare.

      The reason I go to such lengths on a home file server is that I travel quite a

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
    6. Re:The very definition of RAID... by greg1104 · · Score: 1

      If you document it thoroughly, any clueless manager should be able to convince the machine to boot from the other device. Or, better yet, don't bother because the odds of the disk failing in exactly the right way to make this necessary are negligible.

      "Negligible" in this case meaning that it will only happen when you're on vacation out of the country and can't be reached. I was in Canada the last time I lost a software RAID server this way. Remember the northern East Coast blackout of August 2003? I was trapped in Toronto, unreachable because my hotel's power was out, and the power fluctations at my office in New York caused exactly the scenario you describe.

      I am admittedly less lucky than most. Thanks for filling in the details, I didn't remember the trivia beyond recalling why I vowed never to do that again.

    7. Re:The very definition of RAID... by swillden · · Score: 1

      I was in Canada the last time I lost a software RAID server this way. Remember the northern East Coast blackout of August 2003? I was trapped in Toronto, unreachable because my hotel's power was out, and the power fluctations at my office in New York caused exactly the scenario you describe.

      Really? You actually had a disk with a readable MBR, but the GRUB images were unreadable?

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
    8. Re:The very definition of RAID... by greg1104 · · Score: 1

      Some number of the early sectors on the 1st disk got trashed by the power issues (of course Murphy assured that someone had disconnected the UPS serial cable the week before to move something and not plugged it back in); they showed up as bad blocks when tested later. The MBR started but GRUB died somewhere in the middle of the boot process. Since I wasn't there to do a complete post-mortem and had to go by what I was being read to me over the phone, I can't say for certain that it was the GRUB files, it may have been one of the sectors in the Linux kernel GRUB was running that took a hit.

      Once the drive was disconnected the server fired right back up again. The only thing that saved me in this case was that I had every drive in a removable drive bay, all very clearly labeled for such an emergency. Turn the key and the drive doesn't get power.

      As you suggested it may have been possible to fix it by adjusting the BIOS boot order. I hesitated to recommend that because even though I know it shouldn't happen, I'm afraid data from the bad drive is going to get mirrored to the good one and corrupt it when I do that.

  44. Re:SATA II is not your father's SATA by NetRAVEN5000 · · Score: 1

    I've never understood. . . why the hell would you want hotplugging for internal components? Isn't it always a smart idea to turn your PC off before you reach your hands inside the case?

  45. Re:What's this SCSI you speak of? by Anonymous Coward · · Score: 0

    Yes. All sysadmins should take advice from a slashdot-posting "pir8" that's dumb enough to use RAID-0 on consumer level drives.

  46. What about SAS? by GuyverDH · · Score: 1

    Or Serial Attached SCSI.

    Higher throughput than standard SCSI, easier to manage and daisy chain and somewhere I'd read that you could attach SATA drives to SAS controllers - although that's never been confirmed.

    --
    Who is general failure, and why is he reading my hard drive?
    1. Re:What about SAS? by SebNukem · · Score: 1

      Yes, you can attach SATA disks on SAS controllers. I am using it at work.

  47. market differentiation by r00t · · Score: 1
    SCSI is fast and reliable for non-technical reasons.

    It's not the interface itself. Interface incompatibility is used to split the market into regular users and those who don't have to spend their own personal money.

    Think about rotation speeds, seek times, and bearing wear. How could these specs possibly have anything to do with the data cable? They don't, except in the mind of a VP of marketing. Despite the obvious lack of technical reasons, SCSI drives often come with better specs. That's life.

    1. Re:market differentiation by Wesley+Felter · · Score: 1

      This market segmentation is quickly going away, though. Several companies are making reliable SATA drives, and the SATA Raptor is just as fast (and supposedly as reliable) as a 10K RPM SCSI drive.

    2. Re:market differentiation by Anonymous Coward · · Score: 0

      It's not going to go away until Serial-SCSI (which is basically SATA++) starts to come out. One gamer-oriented 10K drive doesn't really stack up against dozens of 15K server drives.

  48. Do both by eric76 · · Score: 1

    Use SATA and SCSI.

    There are devices available that appear to the comuter to be a SCSI drive when it is really a RAID array of SATA drives.

    Something like the the Maxtronic Arena Sivy SA-4830/SA-4831 could give you a 2 TB SCSI drive.

    1. Re:Do both by Wesley+Felter · · Score: 1

      That's more complex, more expensive, and slower than just using a SATA (or SAS) RAID card.

  49. Controller matters much more than the drives by Pygmy+Marmoset · · Score: 2, Informative

    All hardware (and software) sucks, and it breaks, it's a fact of life. No matter if you go with SCSI or SATA, the important thing is that you can find out when a drive dies so that it can get replaced.

    Many low to mid range SCSI raid cards (most? all?) either don't have any sort of interface to find the raid status when the server is up (they just beep at you and expect that somehow that's going to be hard over the AC and server noises when you're walking by the machine), or the tools for checking the raid status are so poor that they'll lock up the shared memory segment after checking a certain amount of times (ADAPTEC, I'M TALKING TO YOU). Since being certain about your raid status means checking it via something like nagios, that means that it gets checked many times, and will thus eventually lock up.

    While SATA is nowhere near the performance of scsi (despite what SATA fanboys will tell you), 3ware cards are actually really good at:
    a) letting you know when a drive has failed
    b) letting you check with their tools as many times as you want without locking it up

    And since the SATA stuff is so much cheaper, you can buy multiple servers, so even if the card fails, you have a hot backup.

    If you absolutely have to have the fastest, go for a raid 10 of 15krpm drives.

    If you don't, and want peace of mind, get at least 2 SATA setups with 3ware cards.

  50. There is no SCSI anymore by SebNukem · · Score: 1

    SCSI is obsolete and being actively replaced with SAS (Serial Attached SCSI). So the question should be whether SAS or SATA should be used instead, that would be a step in the right direction.

    Also note that SATA drives may be used on SAS controllers. So the answer to your question could ultimately be something like: a mix of SAS and SATA.

  51. The actual drive mechanisms are the same... by Anonymous Coward · · Score: 0
    They come off the same lines. Made the same ways. So I don't know that I'd buy this "SCSI drives are designed for this or that" bullshit. The controllers are different though but semiconductors are semiconductors and the price differential isn't large enough for me to buy that one is "more industrial grade" than another. The actual mechanics and recording mechanisms are nearly commodity parts. Similar to CPUs, different ones test out at different rates, SCSI commands a premium but there isn't anything too substantially different about the drives.


    The one thing that is really different and might be substantial depending on your application is that SCSI has been shoe horned in to more exotic configurations and because of that there are more standards for doing "enterprise" type things. Like running a heart beat over SCSI, or have a SCSI array attached to redundant hosts. I wouldn't kid myself in to thinking you could do anything of the sort with SATA in a remotely standard or supported way. Some of the big storage players are getting in to SATA though.


    Performance between best SCSI and best SATA is nearly identical. SCSI is more configurable. SATA gives you more bits per dollar. SATA is probably more "leading edge" where as SCSI is old technology that is tried and tested. Depending on the applications, maybe what you really want is a storage device and a server with whatever the boot drives it comes with and a dedicated high speed interconnect. If you're really serious about storage, quit fucking around with SATA and SCSI type questions and really solve the problem.
     

  52. Re:Only a rookie would suggest RAID 0+1 by NoMoreNicksLeft · · Score: 1

    Yes, that's why many big NOCS end up mirroing raid5s and other fancy things. The level of performance you get is directly related to how much you're willing to spend per gig of storage.

    Are you willing to spend for 10 terabytes, just to get 1 terabyte of usable space? Well, then don't expect to have incredible throughput while experiencing the most catastrophic failure imaginable...

  53. Re:What's this SCSI you speak of? by Wonko+the+Sane · · Score: 1

    You'd be crazy to do sata raid without a dedicated controller card. In that case, ide/sata has the same CPU utilization advantage scsi.

  54. SAS by SebNukem · · Score: 2, Interesting

    Serial Attached SCSI takes the best of both world together:

    SAS has:
      - lean SATA cables
      - 3Gbps transfer, soon to be 6Gpbs. Better than U320
      - 15,000 rpm disks
      - NCQ like SATAII
      - RAID-capable controllers
      - SATA on SAS possible

  55. Re:SATA II is not your father's SATA by Wonko+the+Sane · · Score: 1

    You can buy drive enclosures that allow you to hot swap hard drives.

  56. Re:SATA is fine ... for some things by Jeff+DeMaagd · · Score: 1

    So NCQ or whatever doesn't cut it?

    I guess I can understand what you mean though. It should also be clarified that it's not just any SCSI drive that you need, that it would need to be an array of 15k RPM drives if latency is the real issue because there do exist 10k RPM SATA drives with decent capacity that is more or less the same drive mechanicals with a different interface as a SCSI counterpart.

  57. Re:Teh what? by Anonymous Coward · · Score: 0

    LOL these kind of post always make me laugh to tears.

    OMFGROTFLMAOLOL!

  58. use SCSI... by Malor · · Score: 5, Insightful

    50 concurrent users is a LOT. You may not really mean concurrent, as in "50 people actually reading from or writing to this drive at the same time". If you DO mean that, you desperately need SCSI, the fastest you can find. You'll need seek time more than anything else; the drives need to respond as fast as possible to multiplexed requests for data. Rotation speed, which improves seek time and transfer rate, is good too, but it's seek time that's most crucial in heavy multitasking environments. If by 'concurrent' you mean '50 people occasionally hitting the disk', then yeah, you could probably do SATA.

    However, you already have SCSI. Management is used to paying for SCSI machines. If you have 50-100 people depending on something, and it's slow, that's a productivity drag. If you assume that all those people cost $100k/year each (not at all unreasonable with benefits), 50 people are getting paid about 2,500 bucks an hour, or about 20,000 dollars a day. In other words, if you speed them up by just 5% with better hardware, you're saving the company a thousand dollars a day. Even if it's a tiny 1% speed gain, that's still 200 bucks a day. Saving six grand a month for an upfront investment of ten grand is a total no brainer.

    Buy SCSI.

  59. Something to remember.. by deep44 · · Score: 4, Funny

    .. if you do end up with SATA, make sure to get some neon lighting for inside the case.

    --
    Current setup - 4x Seagate 400GB SATA, NVRAID-0, ThermalTechno 4000 1U Case w/ Ground-FX, 3x Zalmat 80mm SilentKiller Fans (soon)

    1. Re:Something to remember.. by Achromatic1978 · · Score: 1

      You list your "current setup" with items bracketed "soon". Doesn't this defeat the purpose?

  60. Take whatever's cheapest. Buy two. by defile · · Score: 2, Insightful

    If my limited experience has taught me anything about computer reliability, it's that a single mis-set bit somewhere can bring down a system. Maybe the bit got there by user error, maybe it got there because of RAM or disk failure, maybe it got there from a bug in the application, OS, or firmware. Maybe a component on the motherboard shorted out. Maybe it's the climate. Maybe it's the phase of the moon.

    I've seen it happen with discount ghetto hardware, I've seen it happen with high end hardware. I've seen it happen on Windows. On Linux. On FreeBSD. On Solaris. I've seen servers go down due to catastrophic hardware failure and I've seen them go down because a $2 fan died. I've seen people come inches from major power supply caused injury working on a desktop PC.

    Everything will break.

    There's just too much freaking complexity. Now I just buy whatever's cheapest so I can buy way more than I need. Mix up the configurations a bit so you get some bio-diversity; if one drive manufacturer has a bad year, you don't want all of your eggs invested in them.

    Most important of all, at the first sign of trouble, throw it away.

    Try to resist the urge to fix it. I mean it. You cost more than that piece of junk. Put in a purchase request and move on.

  61. But you still gotta use the slow, low-buck drives. by FatSean · · Score: 1

    Where are the 15k SATA drives? But other than that, SATA RAID card is pretty good if you just want one giant volume for storage and maybe a disk or two more for OS and swap.

    --
    Blar.
  62. Re:SATA II is not your father's SATA by Blackforge · · Score: 2, Informative
    I've never understood. . . why the hell would you want hotplugging for internal components? Isn't it always a smart idea to turn your PC off before you reach your hands inside the case?


    It's not necessarily for "internal components". It's also for entry-level servers and raid arrays. You can get hotplug bays that fit in 5 1/4" slots on a machine. This provides an easy way to swap out the drives in case of failure. If you're running a server you nor your users want downtime. If you're running RAID 1, RAID 5 or RAID 10, you want to be able to rebuild to the replacement drive before you lose another drive and lose the whole array. Its a lot faster to rebuild a single drive from the existing data on the other drives, than having to restore from tape or other backup media.

    Also motherboard manufacturers are now starting to include external SATAII ports to "hotplug" external SATA drives.
  63. SCSI by WgT2 · · Score: 1

    Unless price is that big of an issue. It's been my experience that SCSI will also give you better I/O performance (in part because it can read and write at the same time and that it can have rpms up to 15k).

    SATA would be a second choice only if I felt like worrying about when the first disk in the array was going to fail and (hopefully) be replaced.

    The short of it, even in hard drives, is: you get what you pay for.

  64. SATA is not your friend by lucm · · Score: 1

    SATA drives over 80-100GB will burn incredibly fast. Relying on those drives for mission-critical data is not a good idea, even if you have a top-notch RAID controller.

    Go with SCSI.

    --
    lucm, indeed.
  65. Re:Only a rookie would suggest RAID 0+1 by Anonymous Coward · · Score: 0

    Wrong. 1+0 or 0+1 is the most inefficient way you can to store your data. You are doubling your hardware storage when doing something smarter will solve your problem.

    You are completely wrong when you say that:

    >In many cases, the performance during
    >a failure for RAID 5 can reduce system
    >capacity below the required level, and
    >thus RAID 5 is simply not an acceptable
    >technology for those environments.

    I beg of you to show me an environment where performance is a necessity, and they use RAID 0+1. IT IS NOT USED IN PRODUCTION. The only place where 0+1 or 1+0 is used is at the consumer level.

    When you are dealing with a single file server, such as the original poster's environment, you use RAID 5 because performance hit will be negligible. When a drive fails you take down the server and go to CompUSA and buy one immediately before you lose your pr0n and illegal mp3s.

    When you work for a bank, you have racks and racks of hard drives, and they heads they are connected to use one of RAID 4, 5 or 6, or Netapp's vastly superior RAID DP. These filers are specially designed to be used by thousands of end-users, and they all use SCSI for the highest performing models. If they want to back data to tapeless backup, they will use ATA drives, but if this was really an environment where performance is required above a certain level as you suggested, then they would use SCSI.

    Unlike you, I actually have experience , so I know what I'm talking about and more importantly, what technology is suited for which type of environment. I don't talk out my ass like you do, giving bullshit statements, dropping misnomers like performance when it is obvious that RAID 5's performance characteristics won't make a bit of a difference to someone who is asking about the difference between SCSI and SATA for a home file server.

    Like I said, the difference will be negligible. The performance impact from either the network or from having a fragmented hard drive will vastly outweigh any perceived performance hits from using RAID 5.

  66. Warning: NCQ is not sufficient for SCSI-like perf by Anonymous Coward · · Score: 0

    Most people don't realize that SATA's NCQ is NOT the same as SCSI's command queueing. SCSI's command queueing has two parts, the command queueing itself, and the disconnect/reconnect. SATA has a spec for disconnect/reconnect, but I'm not aware of any drives and controllers that implement it yet. That's why all the benchmarks of SATA drives supporting NCQ show much lower performance versus otherwise identically spec'ed SCSI drives on fileserver loads. The NCQ drives are much better than SATA/PATA drives without however on fileserver loads.

    This may not matter for you if you aren't pushing the limits of performance or cost is important enough, but don't make the mistake of thinking that a drive and controller supporting NCQ mean you get SCSI performance for IDE prices, because you most assuredly do not. Wait a year for drives and controllers supporting disconnect/reconnect (probably not standardized until 600 Mbit SATA comes out late next year) Then you really will get SCSI performance at SATA prices (other than being able to get 15K drives for SCSI and not SATA, but that may change by then)

  67. Lies, Damn Lies, and Statistics by jgoemat · · Score: 1
    but the likelyhood of that second drive causing you data loss due to a failed array is infinitesimally smaller. It's guaranteed with RAID5, and the chance for RAID10 is inversely proportional to the number of disks in the array.
    Assumption: You are talking about the six drives in the RAID 10 array being three sets mirroed drives striped, if you are talking about using three drives for each mirros so that the data is double-redundant, you are taking away the price benefit.

    So, while adding more drives makes the chance that a second failure will hose data less likely, it makes the chance of such a failure occurring more likely. If you have 4 drives, you have two mirrors. Take the probability of any single mirror having both drives fail and you can multiply that by the number of mirrors you are striping to get the probability of data loss occuring. Having a second drive fail in a different mirror merely means that you came 1/2 way to failure twice.

    Some SATA drives are approaching SCSI's reliability, but you really don't know what you're getting most of the time. I just two Maxtor drives fail in a RAID-1 array last year in a Dell computer I had. Maybe one had failed and I just didn't get informed, maybe they both blinked out around the same time, I don't know. I decent SCSI setup will have tools to alert you when there's a problem. I think most SATA drives are around 600,000 hours MTBF, but I think that is measured as part-time usage (i.e. drive on for 12 hours, drive off for 12 hours) where SCSI's 1,200,000 hours MTBF is measured as 24x7 operation. If SCSI drives are twice as reliable and you get notifications of problems, then the chance of having two drives fail before you can get one replaced is much smaller with SCI, even in RAID 5.

  68. Write cache by loony · · Score: 1

    So many replies about performance and all the great new features in SATA IO... Too bad no one has mentioned the real issue for high end use... All modern scsi drives allow the HBA to make sure data has actually been written to disk. SATA does not. If you lose power, you lose the data in cache. Worse even in a raid - there the data might be written to two of the data disks but your redundancy disk(s) have not. Now, if a drive fails, you'll restore from parity and boom - you have wrong data that according to all drives, controller and filesystem status is good. That's a real bad thing(tm).

    SCSI controllers with battery buffered cache will take care of that - Unless all drives have written the data to disk, the line is not released in the controller cache and when power comes back on, it can be written to the drive. That will not keep you from having to check consistency but it will prevent you from seeing corrupt data as good.

    Peter.

    1. Re:Write cache by Anonymous Coward · · Score: 0

      Proper SATA RAID controllers such as the Areca have cache and battery backup.

  69. SATA RAID by Anonymous Coward · · Score: 0

    If you build a RAID from X SATA drives and Y SCSI drives both at the same cost points, you'll get better fault tolerance and performance from the SATA set, simply due to the low cost of the drives. Although the individual drives are less reliable, as a set they are more reliable thanks to the redundancy.

    You just need to set it up so if several drives fail at once you won't lose data or hurt performance (RAID5 is notorious for this).

  70. DP Raid? by TheSkyIsPurple · · Score: 1

    Related question... anyone using DP Raid in anything Linuxy? We're using NetApps all over the place where I am, and DP Raid is pretty nifty. I'd like to get out from under the NetApp burden though

  71. how about ULTRA ATA by goombah99 · · Score: 1, Troll

    I've been looking at a mac xraid myself. in 3U (plus 1 more U for the server) it's got 14 hot swap bays each running ULTRA ATA 100 with a separate controller on every disk. redundant logic, hot swap power supplies, hot swpa disks, and hardware raid 0,1,3,5. dual fiber channel to the server.

    even if you stuff it with apple disks the whole thing including a high performance dual gig dual processor xserve is about 15K$ for 7 terrabytes running at 7200 rpms. You can get it slightly cheaper if you populae it yourself and throw together your own server. but it hardly seems worth it since a major attraction is that you can also get a 3 year apple warantee on it from a reliable company.

    So if your network is going to be distributed over ethernet then I can't see why this is not one of the best things going.

    I'd like to hear why sata or scsi linux is a better solution.

    --
    Some drink at the fountain of knowledge. Others just gargle.
    1. Re:how about ULTRA ATA by Anonymous Coward · · Score: 0

      ULTRA ATA? LOL

      This is a battle strictly between Sata II and SCSI. Know what you're talking about before you open your mouth. You don't put PATA drives in a server, kid.

  72. SCSI. Still. by aussersterne · · Score: 4, Informative

    SCSI still tears the alternatives to shreds for price/performance at the heavy end of the load curve, no doubt about it.

    If you doubt it, try both.

    For going on twenty years it's been the same: those who haven't tried SCSI claim that there's no or little difference. Those who have used both SCSI and [MFM,RLL,IDE,ATA,SATA] in high-load environments hate to try to make due with anything but SCSI.

    For performance and reliability reasons both, you want SCSI if you're dealing with high-random-access-load or high-throughput situations. ATA/SATA is fine if you're just offering up noncritical bulk network storage but for the rest you want the real deal, and you will notice the obvious difference if you try both in a stressed environment.

    --
    STOP . AMERICA . NOW
  73. My suggestion. by Da+Twink+Daddy · · Score: 1

    Check out the Areca 1160 or 1260 controller card; it's also sold under the Tekram brand. I supports RAID6 (as well as the standards 0, 1, and 5), SATA II (or SATA 3.0Gb/s; whatever), and NCQ. As far as drives go, I'm very happy with the Hitachi 7K500, just make sure you turn on SATA II support and other advanced features that they ship turned off. It has good support under Linux and Windows (probably OS X, too), and is real hardware RAID. It supports a BBM (72 Hours, IIRC) that will allow you to turn on writeback caching for extra performance. The cache is a standard laptop SODIMM; it ships with 256M cache and expandable to at least 1G; I believe it will support a 2G SODIMM if you can find it.

    I'm running 1 TB (RAID 5) of data in this setup right now, soon to expand to 2.5 TB (RAID 5). At some point I'll upgrade to RAID 6, but the card supports 16 drives, so that will still give me 7TB of data. If you need more data than that, I believe Areca also sells versions of the controller that support 23 or 32 drives.

    (If you wanna do SCSI, Areca has controller cards for that, too.)

  74. Huh? 1+0 is great! by Anonymous Coward · · Score: 0
    The advantages of mirrored systems
    • You get extra read bandwidth. Either drive can service the load, while in RAID 5, the parity is pretty useless.
    • Degraded-mode performance isn't very degraded.
    • You can split the mirror for special occasions. In particular, our emergency evacuation plan involves grabbing half of the main file server's drives on the way out the door. Yes, there's a risk of a failure at just the wrong time, but having an up-to-the-minute backup and not having to shoot the server in the head is very nice.
    • No partial-stripe write problems.
    The downside is, it takes more disks for a given capacity, which is a problem when you get to the 5-10 TB range.

    But it emphatically is used in serious applications.

  75. Generational Difference by Anonymous Coward · · Score: 0

    I've used tapes, disks, CDs, DVDs, and even in a few cases printouts of really important data. And tapes are still my favorite.

    If the "76" in your username is indicative of your DOB, then I suspect this is a generational difference. I can actually agree with both of you, but having lived through exactly what the parent was talking about due to the problems of tape evolution in the DOS era, I tend to think you haven't.

    Personally, I'm a huge fan of using drives, with weekly or monthly tape archives for the longer haul, depending on the needs of the environment. The number of times I've had to pull a 5 year old record versus the number of times I had to get a quick restore from the previous day or week, is extremely small.

    The only real downside to drives would be the greater risk of virus infection or intrusion, which is why I prefer to keep weekly tapes. Most of the time I find myself doing restores, is because some user did something stupid, not because of data integrity compromise. I have had to do it a couple of times due to virus outbreak, but it never was able to hit my backup drives. Only use mirrored or striped Raids, mount and unmount them as you need them, file systems encrypted, keep data in tgz format, etc...

    1. Re:Generational Difference by eric76 · · Score: 1
      If the "76" in your username is indicative of your DOB, then I suspect this is a generational difference.

      If it is generational, it might be the other way around.

      I still have some old 9 track tapes and a 9 track tape drive. Maybe I ought to verify those tapes.

      For what it's worth, I never would have thought about appending my birth year to that. 76 is actually my class year of my undergraduate degree.

      Eric, Texas A&M University - Class of '76.

    2. Re:Generational Difference by Anonymous Coward · · Score: 0

      Eric is that you? Remember that gay porn you used to beat off to? Turns out I am gay too, sorry for kicking the shit out of you back then.

  76. Do RAID 5 ! by this+great+guy · · Score: 5, Insightful
    Whatever I decide, the server will be setup with a RAID 1+0 array for the numerous benefits it offers.

    No, choose RAID 5 instead of RAID 1+0. Here is why:

    • RAID 5 offers more usable disk space. With N disks of X GB, RAID 5 gives you (N-1)*X GB while RAID 1+0 only gives you (N/2)*X GB.
    • The maximum theoretical I/O throughput is better with RAID 5 than with RAID 1+0. With N=4 it is 1.5 times better, and when N is large (>= 8) it tends to be twice better.
    • RAID 5 is more customizable than RAID 1+0, giving you more control on the usable space / total space ratio. For example with N=10 you can choose to create 1, 2 or 3 RAID 5 arrays while with RAID 1+0 you only have 1 choice (1 large array, creating multiple smaller arrays is equivalent to a large one).
    • Linux's RAID 5 implementation rocks and consumes MUCH less CPU than what people think especially with today's 2+ GHz processors. Kernel hackers have found their implementation to be WAY MUCH FASTER than most expensive RAID 5 hardware cards.

    To give you a datapoint, I have set up multiple Linux software RAID 5 arrays on various servers with 10+ SATA disks, and the I/O throughput is over 500+ MB/s (enough to saturate 2 full-duplex GigE links !). At my previous work we had about 200 servers, all using Linux software RAID 5. And we have been MUCH MORE HAPPY than the previous setup where all of them were using hardware RAID 5. Moreover, Linux's software RAID 5 is more flexible (create arrays on ANY disk on ANY SCSI/SATA card in the system), more consistant (one and only one control software to learn: mdadm(8), no need to use crappy vendor tools or reboot into vendor BIOSes), cheaper (no hardware to buy), more reliable (no hardware card = 1 less hw component that can fail), easier to troubleshoot (plug the disks on ANY linux server and it works, no reliance on any particular hw card) and more scalable (spread the load across multiple disk controllers, multiple PCI-X/PCIe busses, or even multiple SAN devices).

    It's amazing the amount of misinformation and misconceptions about RAID that is spread around the world. I hate to say it but 95% of IT engineers don't make good choices regarding RAID servers because of all those misconceptions.

    1. Re:Do RAID 5 ! by ePhil_One · · Score: 1
      It's amazing the amount of misinformation and misconceptions about RAID that is spread around the world. I hate to say it but 95% of IT engineers don't make good choices regarding RAID servers because of all those misconceptions.

      Can we add you to the list?

      RAID 5 has great READ performance, since it is effectively an n-1 stripe on read. However, WRITE performance can be abysmal, especially in random I/O situations like the one the article is proposing. In order to change a one byte on a disk, n-1 disks must be read and new parity data calculated, then two disks (data + parity) must be written to. A simple RAID 1 only requires two writes.

      RAID 1 can also show gains in random reads, since a smart contoller (or linux's MD drivers) can read the data drive and the mirror drive simultaneously, meaning no time is wasted thrashing drive heads between two tracks.

      I've tested RAID 5 in real world situations, and in real random I/O environments like databases the performance differences are eye popping. I use RAID 5 when efficiency counts over performance, RAID 1 (with lots of independant volumes when possible, a single FS structure can also be a bottleneck) for performance.

      --
      You are in a maze of twisted little posts, all alike.
    2. Re:Do RAID 5 ! by morzel · · Score: 1
      RAID 5 has great READ performance, since it is effectively an n-1 stripe on read. However, WRITE performance can be abysmal, especially in random I/O situations like the one the article is proposing. In order to change a one byte on a disk, n-1 disks must be read and new parity data calculated, then two disks (data + parity) must be written to. A simple RAID 1 only requires two writes.
      Assuming that the RAID5 array is in sync (which should be), only two extra reads are required in stead of (n-1): the old value of the byte that is being replaced and the old value of the parity that should be updated. XORing the old byte, new byte and old parity should then give you the new parity byte.
      Of course that doesn't help you much in a three drive array, but it will perform better in arrays with more devices.

      You should also look at it a bit differently: writes of a single byte (as outlined above) are not occurring that often. With bigger writes where the data is spanned over all the drives, those read operations are no longer required; just the calculation of parity (which can be done really fast).

      Comparing a RAID5 array of 6 drives (4+1, 1 hotspare) with a RAID10 array of 6 (3 stripes of mirrored drives, no hotspare) an write of 3072 bytes of data could result in the following disk operations:

      • RAID5: 4 x 768 bytes data + 1 x 768 bytes parity = 5 x 768 bytes in parallel.
      • RAID10: 3 x 1024 bytes + 3 x 1024 bytes mirror = 6 x 1024 bytes in parallel.

      In this case, RAID5 requires less writes on less disks while offering more capacity and having the advantage of a hotspare. Parity calculations are not an issue on good raid controllers and have a negligible impact. With a good controller, using the right filesystem and configuration RAID5 is actually pretty good.

      The most important downside of RAID5 is that while adding disks to an array makes it more economical and more performant, it also increases your risk of catastrophic failures (read: two disks fail at once == array is toast). Once it grows above a certain number of drives, it is often wiser to go for a RAID6 setup where you use two parity disks instead of one. The writes required to disk remain more or less the same but the parity calculations are more complex (still not an issue on good controllers). It is far less common though, and not all vendors call it RAID6.

      --
      Okay... I'll do the stupid things first, then you shy people follow.
      [Zappa]
    3. Re:Do RAID 5 ! by Anonymous Coward · · Score: 0
      Can we add you to the list?

      Well it seems you are the one who doesn't understand how write operations are commited to a RAID 5 array... See morzel's reply: only 2 reads + 2 writes are required (compared to 2 writes for RAID 1). I would like to also point out that, although random small writes are indeed faster with RAID 1:

      • Modern OSes have efficient fs caching mechanisms, increasing the chance of already having the 2 blocks in the buffer cache, so that only 2 writes are required for RAID 5.
      • Random small writes usually represent a small fraction of all the I/O operations in a typical file server scenario (what the OP wants to build).

      - this great guy
    4. Re:Do RAID 5 ! by ePhil_One · · Score: 1
      Modern OSes have efficient fs caching mechanisms, increasing the chance of already having the 2 blocks in the buffer cache, so that only 2 writes are required for RAID 5.

      That would be a non-random write. Writing data that has been relatively recently read, and noatbly the entire "stripe" would have to be in cache for both parity and data. "There are some cases where the performance is the same" is not a compelling arguement. I find the caching algorithms are far better at allowing RAID 1 to match RAID 5 read performance than letting RAID 5 match RAID 1's write performance. But perhaps thats because I'm wasting my time with real world performance metrics than theoretical models

      Random small writes usually represent a small fraction of all the I/O operations in a typical file server scenario (what the OP wants to build).

      It depends on what the users will be placing there and how they are using it. Are they storing digital photos in the space? Are they dumping backups there occasionaly? Then you might be right. But if he's expecting a LOT of concurrent usage, I'd expect more small random I/O's of users actively using the space. MS Office Auto saves, etc.

      Well it seems you are the one who doesn't understand how write operations are commited to a RAID 5 array

      What I understand is real world performance, I'm an operations guy. I keep trying to convince myself that RAID 5 should be able to compete (n-1/n efficiency is so much nicer than n/2), but when the bits hit the bus, with real data and real users, RAID 5 tanks when pushed to the limit.

      Note I'm not saying RAID 1 is always better than RAID 5. But some newcomers to the world of RAID seem to think its the ultimate solution, I've seen lots of inappropriate applications of the technology.

      --
      You are in a maze of twisted little posts, all alike.
    5. Re:Do RAID 5 ! by this+great+guy · · Score: 1
      the entire "stripe" would have to be in cache for both parity and data.

      It happens more often than you think (because of read ahead and because of the small size of stripes).

      I find the caching algorithms are far better at allowing RAID 1 to match RAID 5 read performance than letting RAID 5 match RAID 1's write performance. But perhaps thats because I'm wasting my time with real world performance metrics than theoretical models.

      Let me repeat it again: the vast majority of poor performances observed with RAID 5 is due to BAD HARDWARE IMPLEMENTATIONS, it's not due to the design of RAID 5 itself. That's why I hate hearing people saying "RAID 5 is slow" when actually the culprit is their hardware RAID 5 card. And even when using Linux's MD drivers, in some particular workloads RAID 5 may be slower than usual because it is very sensitive to the I/O scheduling algorithm and to the way the scatter/gather mechanism acts. And guess what ? Here again people think "RAID 5 is slow" when actually the culprit is a side-effect of the I/O scheduling algorithm and or scatter/gather mechanisms (in other words the kernel is not tuned correctly). You really have to trust me on this. I have profiled so many RAID performance issues (and fixed so many) that I know for a fact that RAID 5 in itself, RAID 5 as a design, RAID 5 as a way to use block devices, does not deserve the current disrespect so many people show towards it.

    6. Re:Do RAID 5 ! by ePhil_One · · Score: 1
      Let me repeat it again: the vast majority of poor performances observed with RAID 5 is due to BAD HARDWARE IMPLEMENTATIONS

      Let me repeat, I'm an operations guy, and BAD IMPLEMENTATION = DON'T USE. Theoretic performance that does not map to the real world is of little use outside the classroom.

      Now also note: I have never said RAID 5 is generically bad, there are places where its works quite well. I use it a lot myself, after modeling the I/O I expect the volume to see. The OP was stating more "RAID 5 is the cat's pajama's, always use RAID 5", and I suspect given the ASK SLASHDOT senario that he might be better off with the RAID 1 (or 1+0 ask he stated) than RAID 5. When pushed hard with -random- I/O, I've seen orders of magnitude better performance, though I'll admit I always stick to high end hardware controllers over software implementations in those tests. And notably, my EMC is configured with RAID 5 because the battery backed cache allows confirmations of write to be sent as soon as the data is in RAM, negating a lot of the random write issues

      --
      You are in a maze of twisted little posts, all alike.
    7. Re:Do RAID 5 ! by this+great+guy · · Score: 1
      I'm an operations guy, and BAD IMPLEMENTATION = DON'T USE.

      I totally agree with you. I am also someone who had to go with RAID 1 instead of RAID 5 a few times because of some external constraints I did not have control on ("client: this server must use windows with this hardware RAID card", no Linux, no MD).

      The OP was stating more "RAID 5 is the cat's pajama's, always use RAID 5".

      I was the OP and more precisely what I meant is that RAID 5 should be the first option considered by a system designer (because of its good usable/total disk space ratio and good sequential read/write speed). But then of course, if small random writes are common and performance really matters, or if using a bad hardware implementation is a constraint of the project, then a system designer should consider other RAID strategies.

      People tend to discard the RAID 5 option too early. Here is a typical scenario I have seen too many times: they test RAID 5 with their favorite hw card but see it is slow. So they try to invent false arguments explaining this (like "writing a byte require N-1 reads + 2 writes", while in reality 2 reads are sufficient), whithout understanding the real reasons of this slowness (bad hw implementations). Then their incorrect reasoning leads to those misconceptions that I hate so much ("I tried RAID 5 with my hw card, it was slow, by consequent any other RAID 5 implementation shall be slow"). No wonder why some of them are then surprised by the performances of Linux's RAID 5 MD driver...

      Here is another thing most people don't know: large sequential writes on a RAID 5 array do not require previous reading of parity data.

  77. Storage levels by grendel_x86 · · Score: 1

    I would avoid a one-sized fits all solution to this.

    Use SCSI RAID (1+0), 15k drives, for your frequently accessed stuff. A second pool of less used, but still accessed drives, say 10k, still in RAID (1+0) for some less used stuff, and then SATA for hot backups, and *really* old stuff that no one ever uses, but you still need around.

    --
    Im glad /. isnt the real world, that would really suck..
  78. Re:SATA is fine ... for large sequential IOs by Terje+Mathisen · · Score: 1

    I am currently piloting a setup intended to replace 35 TB of disk+tape used for sesmological data:

    Each array will consist of a pair of full (42-disk) nexsan SATABeast 4U boxes. These use RAID5 + hot spares and connect to the host systems with FC.

    The two pairs will be mirrored (over a fiber connection) between two geographically separated locations.

    The payback time for this setup is so short that we can plan on replacing half the gear every year, using a staggered schedule. (Replace A after year 1, B after two years, then A/B/A/B etc.)

    Terje

    --
    "almost all programming can be viewed as an exercise in caching"
  79. Its simple really by darkdante · · Score: 1

    If you want speed speed speed, have many users, want hardware that lasts long and have multiple requests to your file server, and can afford it: Go SCSI If you want cheap speed, hardware that wont last long, and a small number of users and requests from the file server: Go SATA The biggest difference being cost. In other words if you can afford it go SCSI. If you can afford it then you'll finally understand the benefits of SCSI, because you'll see it with your own two eyes. If you can't afford it then well..... you just answred your own question.

    1. Re:Its simple really by darkdante · · Score: 1

      If you want speed speed speed, have many users, want hardware that lasts long and have multiple requests to your file server, and can afford it:

      Go SCSI

      If you want cheap speed, hardware that wont last long, and a small number of users and requests from the file server:

      Go SATA

      The biggest difference being cost. In other words if you can afford it go SCSI. If you can afford it then you'll finally understand the benefits of SCSI, because you'll see it with your own two eyes.

      If you can't afford it then well..... you just answred your own question.

      Damn formatting *&^^%$$

  80. Go for more, cheap servers by Otis_INF · · Score: 1

    You're focussing on 1 server with a lot of horsepower. But that single server's going to serve 50 people at any given time. That will make the mainboard/network channels the bottleneck, if not the HDD controller's ability to reach main-memory.

    Why not go for 2 servers, use cheap SATA RAID1+0 disks? Then you can move the load to multiple servers and if more demand is required, you're not facing bottlenecks, just add another box.

    --
    Never underestimate the relief of true separation of Religion and State.
  81. MTBF usually better on SCSI by abdulwahid · · Score: 2, Informative

    If you want reliability for the disk you had better check what the manufacturer claims for the MTBF (mean time between failure).

    Many SATA drivers have a MTBF of around 0.6 to 1 where as SCSI have between 1 and 2. Your SCSI disk therefore has about twice the life expectancy. If you couple this with the speed of the SCSI I guess for the moment if your budget allows for it then go for SCSI

    If your budget doesn't allow for it...just make sure you have good redundancy in your RAID with at least 2 redundant disks

    --
    perl -e 'print $i=pack(c5, (41*2), sqrt(7056), (unpack(c,H)-2), oct(115), 10);'
  82. ZFS and RAID-Z anyone? by BoxedFlame · · Score: 1

    ZFS offers a more secure storage medium than RAID, no matter if you use SATA or SCSI. Here's an article on RAID-Z and the advantages of this system.

  83. Performance and reliability differences by Anonymous Coward · · Score: 0

    There are a number of performance and reliability differences between SCSI/FC and SATA drives.

    SCSI (or FC) drives usually spin at 10k-15k RPM, vs. a SATA drive's rather leisurely 7.2k RPM.

    This gives the SCSI drives a significant advantage in random access performance (measured in IOPS).
    However, the two technologies aren't that different for sequential performance (measured in MB/sec).

    Therefore, SCSI drives excell for high intensity workloads such as database servers, while SATA drives suffice for tasks such as large file repositories or reference data.

    Another difference is in the reliability (MTBF) and unrecoverable bit error (BER) rate of the SCSI and SATA.

    SCSI drives generally have an MTBF of about 1.5M hours, vs. a typical SATA drive with 1.0M hours.
    And the BER of SCSI drives is usually 1^15, vs. 10^14 for SATA drives. That's a lot different.

    From this you can calculate the mean time to data loss (MTTDL) for a given RAID array configuration.

    This is OK for smallish RAID arrays, but SATA gets scary for larger arrays (which start to require dual parity configurations). Because of their higher reliability SCSI arrays don't normally have this concern.

  84. SIL3112 by StupidKatz · · Score: 1

    IIRC, the SIL3112 chipset is on the low end of the controller spectrum. Unless someone slipped him a fifty and said "make it happen", he might want something a bit more upscale.

  85. Re:SATA is fine ... for some things by X · · Score: 1

    The problem with SATA is latency

    Huh?

    I'm more than a little disappointed that this got a "5: Informative". Please. There is little to no difference between the SATA protocol and SCSI protocol in terms of latency. To the extent there is, one could easily argue it favors SATA.

    SATA *drives* tend to have lower rotational speeds, which tends to lead to slower responses to requests, but you can get SATA drives with zippy response rates that match or exceed the numbers for most SCSI drives.

    The SCSI protocol is traditionally preferred for heavy loads and random access patterns not because the low latency it provides, but rather the command queuing (which technically can actually increase the latency of a given request) which allows the disk to minimize how much it jumps around under such circumstances. Of course, unlike traditional IDE devices, the SATA protocol supports this kind of functionality, and it is supported by a lot of SATA devices.

    --
    sigs are a waste of space
  86. Re:SATA is fine ... for some things by Anonymous Coward · · Score: 5, Informative

    Assuming equal storage sizes, SCSI drives would have way better throughput and latency than a SATA drive because you can get 15K SCSIs. However, the sizes are NOT equal. Fact is that for the price of a 147GB 15K SCSI drive, you can get about 2TB of 7200RPM SATA space.

    What you end up with is the following throughput when disks are empty:

        1x147GB 15K SCSI -- 150MB/s
        8x250GB 7200 SATA -- 275MB/s to 550MB/s depending on exact RAID configuration

    Now fill up both configurations with 140GB of data and the throughput of the 15K SCSI has dropped in half to 75MB/s because the heads are now positioned at the "slower" inner portion of the disk. Meanwhile, the 2TB SATA config is 7%-15% slower depending on the RAID config.

    Latency also benefits from many disks for the same reason. Fill up a disk and you possibly have to traverse the entire disk. So while a 15K drive has a seek time of 2-3 times faster, you end up having to move 10X-15X farther than in a mega array where the heads pretty much just hover over the 2X faster outer portion.

    The big advantage for SCSI is the better TCQ algorithms for multi-user access. This can be mostly negated if you use a SATA RAID controller with enough onboard RAM to reorder IO at the controller level versus depending on the drive's NCQ.

    This is the route we've taken -- we went from a LSI MegaRAID 320-1 + 4-drive SCSI RAID config to an Areca 1170 + 1GB RAM + 24-drive SATA RAID. Every aspect of performance is up by big amounts -- throughput, latency, multi-user access. The drive array is actually TOO fast for our 2x244 Opteron server to drive. We ended breaking the array into 3 8-drive volumes and mirroring 2 volumes against each other for more redundancy. One of these days, we'll upgrade to faster CPUs and retest a 16-drive volume.

  87. SATA vs SCSI by Anonymous Coward · · Score: 0

    There's a thorough comparison of SATA vs SCSI drives at http://www.storagereview.com/articles/200601/WD150 0ADFD_6.html. For multiuser use 10000 RPM SCSI (or its close relative, Fibre Channel), typically performs 150 - 200% more I/Os per second than the WD Raptor 10000 RPM SATA drive. SCSI drives also come in 15000 RPM flavors, which widens the performance gap even more.

  88. Re:But you still gotta use the slow, low-buck driv by Wonko+the+Sane · · Score: 1

    I didn't say that sata raid could achieve the same speed as scsi, only that it can achieve the same low cpu utilization.

  89. Your chances argument are flawed by aliquis · · Score: 1

    "Go SATA with RAID10, at least 4 drives, ideally six or more. With six drives, the likelyhood of having two drives fail before you can replace the first one is somewhat higher than if you're using SCSI, but the likelyhood of that second drive causing you data loss due to a failed array is infinitesimally smaller. It's guaranteed with RAID5, and the chance for RAID10 is inversely proportional to the number of disks in the array. So first the first drive has to fail, then the second drive which fails has to be of the same RAID1 set."

    The totalt amount of drives doesn't effect the chances that two drives within the same array will die. Sure it's less likely that both drives in the same array has died when only two drives have died in total, but also since you have more drives there will be a higher chance that they do die. If it's 1/2 chance that one drive dies and 1/4 that both dies at the same time that will still be true even if you have 1000 similair arrays.

    (You seem to look at it in a what-if-i-wait-so-long-that-two-drives-might-die-p erspective, but why would you wait so long if you can fix it earlier? So it's more likely a situation where you have to look at how large the risk are that one or more drives in a specific amount of time. And sure with 4 disks it's less likely that 2 dead disks means a broken array than with 2 disks, but the chance that any disk dies are 2 times larger.)

    Whatever :)

    1. Re:Your chances argument are flawed by Anonymous Coward · · Score: 0

      RAID 10:
      Assume 4 drives (mirrored pairs striped)
      Odds that a 2nd disk will die before you fix the first: 1 in 1000
      Odds that both disks will be in the same array 1: 1 in 3 (First disk dies, 3 disks remain which can die, only one of which is in the same array)

      That means you've got a 1 in 3000 chance of a drive failure that RAID 10 won't compensate for.

      RAID 5
      Assume 4 drives
      Odds that a 2nd disk will die before you fix the first: 1 in 1000
      Odds that both disks will be in the same array 1: 1 in 1

      That means you've got a 1 in 1000 chance of a drive failure that RAID 5 won't compensate for.

      Sure, your RAID 5 array gives 50% more space, but at the cost of triple risk to the data.

    2. Re:Your chances argument are flawed by aliquis · · Score: 1

      Raid 5 haven't got anything to do with it, I where wondering about the RAID10 part with more or less disks. But maybe with 6 disks it's a strip of 3 mirrored disks? In that case yes it's more secure. I thought it where just 3 sets of raid 1 arrays.

  90. Uhm by aliquis · · Score: 1

    But maybe you mean the same data are also stored on all arrays?

  91. Re:Only a rookie would suggest RAID 0+1 by Jesus_666 · · Score: 1

    0+1 is a mistake, but 1+0 isn't. 0+1 loses the data set if any one arbitrary drive fails in both sides of the mirror; 1+0 only if you lose both disks in a single mirror pair.

    Shouldn't that be "01" and "10"? Wasn't the "m+n" notation used for generic RAIDs á la "m drives total, n can fail before the array dies"? In that case a 1+0 setup can even run without a RAID controller - and a 0+1 one is kind of hard to implement...

    --
    USE HOT GRITS WITH STATUE OF NATALIE PORTMAN (NAKED AND PETRIFIED)
  92. Re:What's this SCSI you speak of? by Savantissimo · · Score: 1

    From own painful personal experience, do not use a motherboard RAID controller for anything other than simple mirroring. Motherboard RAID controlers aren't even consumer grade, and that goes double for Soyo.

    --
    "Is life so dear, or peace so sweet, as to be purchased at the price of chains and slavery?" - Patrick Henry
  93. Why use either? by Anonymous Coward · · Score: 0

    I'm surprised nobody mentioned the obvious. I mean, OK, if you want to limit yourself to drives directly connected to the server, then SCSI wins hands down if for no other reason than you can have 15 of them on a (high bandwidth) cable. Then there's the fact than no SATA drive is designed for enterprise operation (24x7).

    But take down the server and all the drives are inaccessible during the downtime.

    A smarter solution IMO is a SAN. The whole concept of a "server" is kind of outdated now with SAN (or even NAS). Plus expandibility isn't as big an issue as it is with a "server".

    So why in this day and age do people even mess with "servers" for simple data storage? Or did I miss something?

  94. Get some background by Kieckerjan · · Score: 1

    Whatever you do, be sure to read the following article first. It's a bit dated, but it's written by people who definitely know what they are talking about (Seagate engineers) and may help put things in perspective. The hefty pricetag of SCSI does not depend on the interface. Read this and you'll now why.

    http://www.seagate.com/content/docs/pdf/whitepaper /D2c_More_than_Interface_ATA_vs_SCSI_042003.pdf

    --
    Being well balanced is overrated. -- John Carmack
  95. I'd say tempremental SCSI by Anonymous Coward · · Score: 0

    I have both. They both work, but I've found SCSI to be more tempremental. You have to get everything just so, or it will have you tearing your hair out. SATA is simple plug-and-play, and a tad more forgiving.

  96. RAID10 may not be that slow by TheLink · · Score: 1

    Assuming 4 drives and a smart RAID10 controller, reads can actually be faster than RAID5.

    This is because you can actually interleave reads from mirrored drives to get better read performance for single sequential reads. 3ware has controllers that do that. In fact it should be better than stripes, because the same data is present on all mirrored drives so you can read the first chunk from whichever drive with its head closest to the chunk and schedule the subsequent chunks from the rest of the drives, thus minimizing latency and still maintaining throughput. Whereas with stripes, the first chunk will only be on one drive, so you have no choice of drives to read from.

    So for sequential access you'd get the combined read throughput of four drives, and the write throughput of two drives.

    Whereas with RAID5, you only get the read throughput of N-1 drives and the write throughput is poor (especially for sustained writes).

    Of course if your raid controller isn't smart or you are using Linux software raid then for RAID10, you'd get single sequential read throughput of N/2.

    Still, Linux software RAID is smart enough to balance reads in mirrors. Say you have a mirror with two drives A and B, and two sequential reads going on, one read typically ends up on A and the other on B. So in many multiuser environments the lack of read interleaving might not be such a big deal. I haven't done tests for RAID10 yet though.

    I agree that RAID controllers fail way too often.

    --
    1. Re:RAID10 may not be that slow by this+great+guy · · Score: 1

      You seem to belong to the 5% of engineers that actually understand RAID, good :-) However I disagree with you on 2 points:

      • Assuming good implementations of RAID 1+0 and RAID 5, reads from RAID 1+0 arrays are NOT faster than reads from RAID 5 arrays. This is because in order to interleave reads from mirrored disks, the heads have to constantly seek to the next block of data to read, read it, then seek again, read, etc. So you can NEVER reach the combined output of N x individual_disk_read_speed, but more something like N x f x individual_disk_read_speed where f is a factor in the range [0..1] and depends primarily on the chunk size and the disk seek time (e.g. f=0.8 or f=0.9). Whereas in RAID 5 arrays the parity data is always read (in order to be verified), so the heads don't have to seek when doing sequential reads => you end up with EXACTLY (N-1) x individual_disk_read_speed. Of course when N increases, RAID 5 read speeds increase more than RAID 1+0 read speeds, because the later is always limited by this f factor.
      • You said "RAID 5 write speed is poor". I disagree here too, because for the same reasons than explained above, RAID 5 write speeds can theoretically be AS FAST AS RAID 1+0 write speeds. Most disapointing RAID 5 write speeds experienced in the real world are due to poor implementations, or inadequate I/O scheduling, or hardware cache bottlenecks. For example NONE of the hardware RAID 5 cards I have seen in my whole life offer decent RAID 5 write speeds, even the most expensive ones from Adaptec, LSI, etc. AFAIK Linux software RAID 5 implementation seems to be the only one offering decent RAID 5 write speeds. One of my big project is to rewrite FreeBSD's RAID 5 code, so that we have at least two competitors in the area :-)
  97. My Supporting Anecdote by Anonymous Coward · · Score: 0

    You'll need seek time more than anything else; the drives need to respond as fast as possible to multiplexed requests for data.

    I have a site that uses 15K RPM U320 SCSI drives in a SAN. I also have a very nice high speed streaming tape library to back it all up. The backups were running around two hours and I felt that, with my disk and tape bandwidth, the backups should have been much faster. Looking for the problem it finally occurred to me that there were over one million files being backed up. Doing the math, I determined that with a 4ms seek time I was spending at least one hour or 50% of my backup time just seeking. Disk head seeks were consuming one hour of backup time on this SAN! I switched to image level backups, where the disks are copied bit by bit, rather than file level backups. This eliminated the more than 1,000,000 seeks (assuming little or no fragmentation) and in turn reduced the backup time to just under one hour.

    The point is that most people don't think about seek time and in many cases it is not an issue but, if you have a lot of files or severe fragmentation, disk seeks can be a significant factor.

    Of course the same SAN using SATA drives would increase the under one hour backup time to at least four hours but, who's counting.

  98. What model drives? by Andy+Dodd · · Score: 1

    Did you buy desktop-targeted SATA drives (like the Maxtor DiamondMax series, known for having major problems), or server-targeted ones like the Maxtor MAXLine nearline drives, WD Caviar RE2 nearline drives, or the WD Raptors?

    SATA itself blows SCSI out of the water in terms of cabling reliability. It's even in terms of performance features such as NCQ. You just have to be careful not to buy a desktop grade drive. (It's impossible to buy a desktop grade SCSI drive. It IS possible to buy a server grade SATA drive that has been tested as rigorously as its SCSI counterpart, you just have to be more careful with the drive purchase.)

    --
    retrorocket.o not found, launch anyway?
    1. Re:What model drives? by Dante · · Score: 1

      WD2000 for the older raids.
      and
      WD3200SD for the newer ones.

      wd2000 series drives all fail with bad sectors.

      wd3200SD fail with drive time outs, that take reboots to fix.

      --
      "think of it as evolution in action"
  99. MODERATION ABUSE. mod back up by goombah99 · · Score: 0, Troll

    I was totally serious. what is wrong with my post?

    --
    Some drink at the fountain of knowledge. Others just gargle.
  100. More important considerations than SCSI-vs-SATA by ziegast · · Score: 1

    Regardless of whether you use SCSI or SATA or EIDE or FC, you need to know and trust your RAID controller as if your job depended in it. Drives will fail. Some drives will fail more than others. As long as your RAID controller can recover quickly on failed drives, not lose any data, and can be monitored from your OS so that you can detect when a drive fails so that you can replace your hot-spare ASAP, you'll be fine.

    Consider how long it will take to rebuild a 400GB drive as part of a RAID5 array compared to a faster-spinning 73GB drive part of a RAID1+0 array. While your controller is rebuilding, your array is at risk for data loss if another drive fails.

    If you cannot monitor your RAID controller form the operating system, you ma never know that a drive failed and rebuilt using the spare. You could go days or weeks without knowing that you have no spare available for the next failure.

    A RAID controller without a cache battery is slower than one with a cache battery. If your RAID controller safely buffers writes, your operating system won't have to wait for the slow disks to write the data. Turn off write buffering on the hard drives. The write buffer on a hard drive is an area where data can get lost on a power failure that neither your operating system nor RAID controller can recover from.

    Keep your disks cool and power-conditioned. Don't put your file server in an 80-degree closet. Don't just stick the drives in a cheap case that doesn't have proper airflow over the drives. If the data is important to you, you will make sure your file server is on a UPS protected from brown-outs, power spikes, idiot employees or even cleaning staff. If you can monitor your UPS from the file server, so much the better.

    All of this is more important than SATA-vs-SCSI and applies to both.

    When it comes to SATA vs SCSI, it is a matter of low-cost capacity versus performance. If you need high-RPM drives to support low latency times for random read access, you're stuck with SCSI. If just having enough capacity for your data-hungry staff is all you need, then SATA might be a better fit for your budget.

  101. SAS and SATA both still lacks... by Anonymous Coward · · Score: 0

    ...true multithreaded i/o like SCSI supports natively. Yes, NCQ approximates SCSI-esque multithreaded i/o but under truely heavy loads of random i/o, a raid10 array made of SATA-II drives will definitely hit the performance brick wall long before a similar array made of u320 scsi drives on a good hardware caching craid ontroller card like those made by LSI, Intel, IBM or HP for their big wintel servers.

  102. The real question is. by LWATCDR · · Score: 1

    What kind of file server?
    Are you going to be running any database applications on it?
    Is seek time going to be important?
    If you are just going to store a lot of documents, applications, and static data then SATA is going to be a good solution.
    If you are going to do a lot transactions then SCSI is probably worth the money.
    My office is about half the size of yours and our file sever has worked just fine using ATA drives for years.
    For our next server I am thinking about using a mini-itx system with SATA RAID. We are having space and heat issues at our current office. Now our database server is a different matter.

    --
    See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
  103. SATA and SMART by Anonymous Coward · · Score: 0

    Fyi: With Linux the SMART function of the SATA drives will be broken. It isn't supported with the open source drivers and tools and that's it.

  104. Bauer killed SCSI by duncan · · Score: 1

    CTU had certian information that SCSI was working with the terrorists and Jack got to SCSI first...

  105. Re:Only a rookie would suggest RAID 0+1 by georgewilliamherbert · · Score: 1
    I beg of you to show me an environment where performance is a necessity, and they use RAID 0+1. IT IS NOT USED IN PRODUCTION. The only place where 0+1 or 1+0 is used is at the consumer level.
    I'm sorry, but your statement is profoundly ignorant.

    I have worked for major system integrations, system vendors, and VARs, and I have architected and installed and benchmarked major storage systems on and off for 10 years. I've installed more full-rack-plus sized equipment than I can conveniently count, including work with Hitachi 9900 series and smaller stuff, EMC, Sun, Storagetek, IBM, as well as NAS gear from NetApp and some exposure to BlueArc.

    When I say RAID 1+0 is in use in production, I mean it, because I've architected and built and operated it. At major ecommerce websites. At banks and financial institutions. At retail companies. At ASPs. At biotech companies.

    When you work for a bank, you have racks and racks of hard drives, and they heads they are connected to use one of RAID 4, 5 or 6, or Netapp's vastly superior RAID DP. These filers are specially designed to be used by thousands of end-users, and they all use SCSI for the highest performing models. If they want to back data to tapeless backup, they will use ATA drives, but if this was really an environment where performance is required above a certain level as you suggested, then they would use SCSI.
    Up until a couple of years ago, large customers used either A) direct connect fiberchannel drives, with FC RAID controllers, and large SAN environments, or B) large NAS systems with a wide variety of RAID configurations. NAS was cheaper but lower performance. For highest performing models you buy Fiber Channel, period. Claiming that SCSI was what the highest performing models use indicates you've never worked with true high end enterprise equipment.

    Now, SATA drives connected to FC raid controllers in SANs are where the action's at, but all the major vendors still sell FC drives and SCSI drives for applications where the command tag queuing matters (though SATA 3 will eventually make that common across the technologies).

    Unlike you, I actually have experience , so I know what I'm talking about and more importantly, what technology is suited for which type of environment. I don't talk out my ass like you do, giving bullshit statements, dropping misnomers like performance when it is obvious that RAID 5's performance characteristics won't make a bit of a difference to someone who is asking about the difference between SCSI and SATA for a home file server.
    I really don't think someone who didn't even think of FC systems or SANs is in any position to be lecturing me about RAID performance.

    I know how RAID 5 and RAID 10 perform; I've been working with them continuously since the hardware vendor I was working for started building RAID boxes based on the Mylex DAC-960 controller back in 1995-6 and I got to both write performance benchmark code and use standards like SPEC's LADDIS SFS93. I've talked code with the RAID controller vendors. I've been building RAID systems since 4 gig SCSI drives were brand new, and I use both SANs and SAN attached SATA storage in bulk today.

    There are people more expert than I am in this stuff; I haven't built a large storage system using the highest end stuff in a couple of years, and there are clearly people at integrators/VARS/vendors who live and breathe storage every day and are tip-top current experts. But I know what I'm talking about.

    And, clearly, you don't.

  106. Re:Only a rookie would suggest RAID 0+1 by _damnit_ · · Score: 1
    Wow, I hope the parent was a troll. I could have sworn that I heard him say:
    The only place where 0+1 or 1+0 is used is at the consumer level.


    Hmmm... let's look at an Hitachi USP. This is their high-end storage unit right now and comparable to anything you can buy from EMC or IBM. They support RAID 5 in 3+1 and 7+1 configurations. They also support RAID 1 in 2+2 fashion (haven't seen a 4+4 setup). I have personally consulted at at least 8 of the Fortune 100 and setup RAID 1 for high i/o environments. RAID 5 is generally used for lower demand hosts. They also use fibre channel drives although SATA may be an option sooner or later since you can get them on their modular storage now.

    I can tell you I DO HAVE EXPERIENCE and you are full of shit. You're also a bit combative. Have a drink. Relax. It's slashdot.

    --


    _damnit_

    It's my job to freeze you. -- Logan's Run
  107. Hotswap can work, SATA's fine, check RAID levels by MagicMike · · Score: 1


    I just built and tested RAID1 arrays on the Ultra320 SCSI bus of a couple Dell 1750s.

    More than hotswap even, I was able to hot-kill one drive, pull it, replace it with a bigger drive, hot-enable it and have it sync. *Then* I was able to do the same with the original drive, giving me two bigger drives in the machine.

    For the final trick, I was able to online resize the software raid partition and ext3 filesystems on the device, the whole time leaving the machine up and running.

    The lesson here is: never trust anecdotal advice, especially from goofballs on slashdot that think they know everything.

    If you have a requirement ("we need hotswap"), test the requirement and set your operational plans based on the test results. Not that hard...

    Also, and back on topic, the software raid driver has a couple new modes, raid10 (similar to raid1+0, but with some extra features) and raid6 (essentially raid5 with an extra synced up spare). Those bear looking into.

    Additionally, in the raid1 and raid5 modes, the newest versions of the software driver have the ability to auto-fix the bad sectors that are so common on the newest consumer drives.

    The last advice I can give is to never, ever, ever, ever forget that RAID keeps you running when things go bad, but it IS NOT a substitute for backups.

    Back your data up. Test restoring it semi-frequently. If you already know that from experience, you'll at least appreciate why I'm yelling about it :-)

  108. Re:(Serially attached) SCSI. Still. by ivan256 · · Score: 1

    It has little to do with the actual interconnect at this point, but...

    The best drives are SCSI. Actually these days the best drives are SAS. You won't find higher performance/more reliable 2.5" drives with any other interconnect. 2.5" is the future it seems. It's not even that expensive anymore. 2.5" 15kRPM 36GB drives for around $200. 100+GB for $200 if you only need 10k RPM. You can't even get 15k SATA drives, and you're going to get the same capacity for the same price with SATA at 10kRPM, but you're going to get a 3.5" drive instead of a 2.5" drive.

  109. Re:SATA is fine ... for some things by ivan256 · · Score: 1

    At 10k RPM, the prices are equal per MB, but the SAS (serially attached SCSI) drives will be 2.5" and use less power. Plus, the SAS RAID array would be faster than the SATA RAID array.

    Latency also benefits from many disks for the same reason.

    Not for writes. All RAID increases write latencies unless you have a battery backed cache.

    we went from a LSI MegaRAID 320-1 + 4-drive SCSI RAID config to an Areca 1170 + 1GB RAM + 24-drive SATA RAID. Every aspect of performance is up by big amounts -- throughput, latency, multi-user access.

    Gee, you installed modern technology to replace something 4 years old and you got a huge performance boost? What a shocker.

    We ended breaking the array into 3 8-drive volumes and mirroring 2 volumes against each other for more redundancy. One of these days, we'll upgrade to faster CPUs and retest a 16-drive volume.

    You likely won't see a difference with faster CPUs. You're bottleneck is either your memory bus, your PCI bus, or your controller. Your controller is the likely suspect, and it has nothing to do with throughput. It can probably only process a certain number of IO operations per second, no matter how much data is in each one, and you're probably hitting that limit. With the right benchmark, there is no reason to believe your server couldn't push several hundred megabytes per second, (450MB/second if you've got a dedicated 133Mhz PCI-X bus for your controller), but your cheap SATA controller probably can't handle the IOPS. I'd be surprised if it could push 15,000 IOPS, and well, unless all your IOs are 32K, you're PC isn't the problem... Even if you are, your PCI-X bus is the bottleneck, not your CPU.

  110. Re:SATA? I don't know.... by ivan256 · · Score: 1

    SATA is so much cheaper that it's easy to afford completely 2 or 3 redundant storage servers for the same price as a single SCSI array. We do this at work in fact.

    That depends on how much performance you need.

    With SCSI around $3-5 per gig, and SATA around 35-70 cents per gig

    At 10k RPM, SATA and SAS prices are almost the same. At 15k RPM you can't even get SATA drives. If you're going to pay the higher price anyway, you should pick the technology with the superior controllers.

    If you only need 7200RPM, by all means, use SATA.

  111. My experience with IDE/SATA arrays by NerveGas · · Score: 1


        I have a number of 1+ terabyte arrays made from IDE and SATA drives, and my experience has been that failures come quick and often. Each of the machines has fans blowing large amounts of air over every drive, and the rest of the machines are top-notch quality. Whether the arrays were made from Maxtor, Seagate, or Western Digital, most have at least one drive die within the first six months.

        In fact, about a week ago, a Western Digital RAID-edition drive died in one of my arrays - and that particular set of disks had only been in use for a week or two.

        Of the SCSI arrays I've built and maintained, I've had three or maybe four drives fail in 5 or 6 years. It's a tough call, because it would be nice to build these arrays from SCSI drives - but when you're talking about 3 or 4 terabytes in an array, the number of zeroes on the cost of a SCSI solution makes you a bit dizzy. Because of that, I've continued to use SATA drives - but in RAID 6, with one hot spare, and two extra drives sitting on a shelf, waiting for another to fail. That might be overkill, but in two years if they don't make the same model of drive, you don't want to replace an entire array because one drive went out on you.

    steve

    --
    Oh, you're not stuck, you're just unable to let go of the onion rings.
  112. Raid 6 vs Raid 10 by rthille · · Score: 1

    Question about raid6 vs raid10. Given that a two drive failure can kill a raid 10 setup, but not a raid 6 setup, when and why would you recommend a raid 10 setup over a raid 6 setup?

    --
    Awesome furniture, accessories and cabinetry in Santa Rosa, CA: http://humanity-home.com/
  113. Re:SATA is fine ... for some things by phoenix_rizzen · · Score: 1

    You can get 10K RPM SATA drives from Seagate if you need SATA and want low-latency access. The Seagate SATA Raptor is just a re-packaged SCSI Raptor with a SATA interface.

  114. Re:SATA is fine ... for some things by Fweeky · · Score: 1

    "we went from a LSI MegaRAID 320-1 + 4-drive SCSI RAID config to an Areca 1170 + 1GB RAM + 24-drive SATA RAID. Every aspect of performance is up by big amounts -- throughput, latency, multi-user access."

    What's that, a cheap SCSI card and 4 platters is slower than an expensive SATA card with gobs of cache and 6x as many disks on $unspecified_workload? Truely shocking ;)

  115. Worry about the controllers, not the drives by Anonymous Coward · · Score: 0

    I have yet to see a SATA RAID controller that is as stable as the SCSI RAID controllers. Even the latest generations have hangs, crashes and corruption that disappeared from the major SCSI RAID controller venders many years ago. There is a significant learning curve for writing good drivers and firmware, and they just are not there yet.

    The name brand RAID subsystems have a level of stability and flexability above that of the SCSI RAID controllers.

    You get what you pay for ...

    So, you need to decide how important is your data? I wouldn't trust much to a SATA RAID controller yet.

  116. Re:SATA II is not your father's SATA by Nutria · · Score: 1

    Isn't it always a smart idea to turn your PC off before you reach your hands inside the case?

    Yes, turn off your PC before opening the case.

    A server, though, is not a PC. It's a set of rack (or cabinet) -mounted enclosures that are designed to be accessed while the power is on.

    --
    "I don't know, therefore Aliens" Wafflebox1
  117. Re:What's this SCSI you speak of? by Anonymous Coward · · Score: 0

    "its a quick trip to the store for another 160 GB seagate."

    And you'll be makin' that trip too.

  118. Re:Take whatever's cheapest. Buy two. by Anonymous Coward · · Score: 0

    "Take whatever's cheapest. Buy two."

    You work for the government, don't you?

  119. Re:Take whatever's cheapest. Buy two. by Darth_brooks · · Score: 1

    No, Government would be:

    "Take whatever's most expensive, even if it's not suited for the job. Buy four, use one."

    --
    There are some people that if they don't know, you can't tell 'em.
  120. First hand experience.... by jess_wundring · · Score: 1


    I'm coming to this conversation pretty late, so I hope somebody who cares gets a chance to read this. :)

    We're 3 to 4x the size of turboflux's org and both our 2TB 320 LVD scsi DAS (direct attached storage) units started getting wiggy early last year. After looking at our options we ended up with a couple of ExcelMeridian's (SecurStor XRS, if you care) DAS units populated with Western Digital SATAII drives in a Raid 6 config and connected back to our servers via 320 scsi controllers.

    Western Digital has had a line of enterprise SATAII drives with extra layers of testing to certify that they're good enough for RAID duty for awhile; I think Hitachi has a line of these as well now.

    The new units have been in service for about 10 months now without even a hiccup and they came with almost 3x the capacity at about 25% of the price that we paid for their scsi predecessors. One of our techs pulled a drive out of the config as a test...it kept humming and immediately started blasting us with email alerts, just as we had it set up to do. With 2 drives missing from the 16 drive config, performance degradation was noticeable but it still recovered okay.

    FWIW.

  121. Problems with raid0+1 by Stephen+Samuel · · Score: 1
    The chances of losing two disks at once are slim.

    It may be slim, but I've had it happen. The problem turned out to be a bad batch of fiber channel tranceivers, but it made my life hell for a month and finally cost me the entire array before we figured it out.

    Raid 0+1 has a number of problems relative to true raid-10 in terms of reduncancy:
    First of all, there's the problem that if you do lose 2 disks at close enough to the same time (e.g. a bad batch), that you don't have enough time to replace the first before the second goes, then you've got almost a 50% chance that the second bad drive is going to break your data.

    The second is that when one drive goes bad, it breaks the entire plex. This means that, if you've got a 1TB of data, you end up copying the entire 1TB when the replacement drive kicks in. This hurts both your redundancy and your performance for a lot longer than just replicating the data that was on the bad drive in true raid-10.

    The one problem with doing raid-10 is that I don't know of any drivers that let you do that in hardware and across controllers... (i.e. the solution of putting mirror pairs on the same controller means that, if the controller dies, you've got a broken array).

    So for best redundancy, you want:
    ctrl1 . ctrl2
    dska1 . dskb1
    dska2 . dskb2
    dska3 . dskb3
    .... . ....

    but you're almost guaranteed to do it in software.

    For best perfomance, you can do some of the raiding in hardware (how, exactly, may depend on your application), but at the cost of either performance or redundancy.

    One good tradeoff is RAID 51 -- raid-5 on the controllers and mirror in software. This means that you can, guraranteed, suffer 3 dead drives without losing data, and one dead drive means simply regenerating the replacemant drive on rebuild -- not the entire plex. At the same time, you have the advantage of offloading some of the work to the controller but you can still lose an entire controller (and a disk) and keep running.

    --
    Free Software: Like love, it grows best when given away.
    1. Re:Problems with raid0+1 by afidel · · Score: 1

      I've always calculated that if the data is valuable enough to do RAID-51 then you're better off going with an offsite mirrored system.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    2. Re:Problems with raid0+1 by Stephen+Samuel · · Score: 1
      It depends on how much data you have. If you are already looking at raid 01/10, then going from striping to raid-5 isn't that (relatively) expensive. In terms of added survivability, I'd say that it's a cheap add-on.

      I'd expect off-site mirroring to increase the cost by almost an order of magnitued -- and the only thing it gives you is protection from a site disaster. Having done that, I'd still suggest using raid 5 on each plex and/or true raid-10.

      --
      Free Software: Like love, it grows best when given away.
  122. Re:SATA II is not your father's SATA by NetRAVEN5000 · · Score: 1

    Oh. I was under the impression that the question was about a home file server (which might be in a normal PC case, depending on how many hard drives he puts in there - one or two 250GB HDs would be just fine for home use, unless you're doing some real serious stuff).

  123. Sick advertising by ManDude · · Score: 1

    This is so fake. You've been had by a slick Mac ad. You geeks can't see it? I thought there was brains before beauty in this club.

    LrM

  124. Re:SATA is fine ... for some things by X · · Score: 1
    Okay, let's get the fact straight here.

    1. Seagate doesn't make 10K RPM SATA drives.
    2. Seagate doesn't make Raptors, Western Digital does.
    3. SATA Raptors aren't repackaged SCSI Raptors.
    4. SCSI Raptors don't exist.
    5. repackaged doesn't have a dash in it.


    Other than that, I agree with a lot of what you say. ;-)
    --
    sigs are a waste of space
  125. Re:SATA is fine ... for some things by danpritts · · Score: 1

    > SAS (serially attached SCSI) drives will be 2.5" and use less power.

    I was under the impression that current 10k rpm 3.5" form factor drives actually have 2.5" platters inside. If this is the case, the SAS disks are unlikely to use less power.

    > All RAID increases write latencies unless you have a battery backed cache.

    I suppose that with a fast RAID1 controller there would be some increase in latency but it is likely to be negligible (he says, pulling it out of his butt since he hasn't actually tested it). Certainly RAID5 will be slow on writes, which is what you were mostly talking about.

  126. Re:SATA is fine ... for some things by ivan256 · · Score: 1

    I was under the impression that current 10k rpm 3.5" form factor drives actually have 2.5" platters inside.

    I wasn't aware of that, but it makes sense.

    I suppose that with a fast RAID1 controller there would be some increase in latency but it is likely to be negligible

    It isn't negligible. Your write can't be considered complete until your data is on both disks. Unlike old slow SCSI drives, disks on the same bus don't syncronize their rotations anymore, so you have to wait for the worst case rotational latency of both disks. It's about a 25% latency hit on average for a single IO. For linear writes the effect is less, but if you're running a database with lots of random writes, it's a big deal.

    High end arrays and controllers get around this with a battery backed up cache, which allows them to call the IO complete once it is in cache. Low-end controllers, almost all SATA controllers, and many on-board controllers (even on server boards) don't come with a battery backed up cache though.