Slashdot Mirror


RAID Problems With Intel Core 2?

Nom du Keyboard writes "The Inquirer is reporting that the new Intel Core 2 processors Woodcrest and Conroe are suffering badly when running RAID 5 disk arrays, even when using non-Intel controllers. Can Intel afford to make a misstep now with even in the small subset of users running RAID 5 systems?" From the article: "The performance in benchmarks is there, but the performance in real world isn't. While synthetic benchmarks will do the thing and show RAID5-worthy results, CPU utilization will go through the roof no matter what CPU is used, and the hiccups will occur every now and then. It remains to be seen whether this can be fixed via BIOS or micro-code update."

284 comments

  1. don't worry by sum.zero · · Score: 4, Funny

    it's not a bug, just errata ;)

    sum.zero

    1. Re:don't worry by Tackhead · · Score: 5, Funny
      > it's not a bug, just errata ;)

      Where's the bug? My RAID 0+0.999999998 works just fine Intel Core 1.99904274017.

    2. Re:don't worry by wangxiaohu · · Score: 1
  2. Why aren't you running a dedicated controller...? by saleenS281 · · Score: 5, Insightful

    If you're running raid5 it's probably in an enterprise setup. If so, why aren't you running a dedicated controller? The CPU should have little to no impact on the raid subsystem...

    Seems odd to me that the inquirer is the only one reporting this. How about a real hardware review site?

  3. Xserve? by mrxak · · Score: 1

    Does this mean that Apple won't be using Intel chips in their Xseve's for a while?

    1. Re:Xserve? by AlexCV · · Score: 1

      No, they'll use a hardware raid chipset on the Xserve.

    2. Re:Xserve? by SaDan · · Score: 1

      Just like the IBM ServerRAID controller mentioned in the article?

  4. Small subset? by Anonymous Coward · · Score: 0

    Just because you don't use RAID in your basement doesn't the market is small.

  5. Problem by laffer1 · · Score: 3, Insightful

    I don't get what the problem is. Are there specific instructions used often in raid 5 algorithms that are slow on the new chips? Is it bus contention?

    1. Re:Problem by afidel · · Score: 1

      My guess is it's speed throtteling introducing delay into the occassional execution of these instructions whereas the chip is running full out when running through an artificial benchmark. That's pure speculation on my part though.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    2. Re:Problem by martok · · Score: 1

      Afaik, raid5 is heavily dependant on a xor algorithm. I know with Linux md, it has several ways of doing it. It can use sse, sse2 etc. I'd be interested in seeing results of the actual xor thruput.

    3. Re:Problem by TheRaven64 · · Score: 2, Interesting
      Software RAID 5 does:

      Load byte 1.

      Load byte 2.

      XOR bytes 1 and 2.

      Store result. There are a few things that could be wrong here. The XOR performance could be bad. This seems a bit unlikely but XOR is not an incredibly common operation so it wouldn't slow down too much else.

      It could be that the pattern of data was bad for cache usage. This would be slightly odd, since it should be a series of 4K linear blocks.

      It could be low I/O performance between the chip and the on-board controller. This seems the most likely; there could well be some multiplexing issues too. I would be interested to see what the results are using a Core Solo.

      --
      I am TheRaven on Soylent News
    4. Re:Problem by ch-chuck · · Score: 1

      I think it's more a function of the mobo. Quoth TFA:

      "If you are using an Intel's own baby, D975XBX motherboard and put four drives in RAID5 array, an interesting overhead and a slowdown will occur on all upcoming Conroes."

      raid-5 is supported in an optional mobo sata controller. Still don't get the connection with the processor, but probably a mobo glitch that only shows up with certain ones.

      --
      try { do() || do_not(); } catch (JediException err) { yoda(err); }
    5. Re:Problem by Mr+Z · · Score: 1

      I'd be more likely to bet there's SMP locking issues in the driver. The performance of XOR is negligible in the equation here.

      --Joe
    6. Re:Problem by javaxjb · · Score: 1

      But at the circuit level, XOR is one of the basic gates (http://en.wikipedia.org/wiki/Xor_gate). It's central to other operations such as add and graphics masking operations (and, as already mentioned RAID calculations). The engineers would really have to go out of their way to make this slow. There's an example of combining NAND gates to make an XOR in the linked Wikipedia entry, but who would want to waste that kind of real estate on the CPU?

      --
      Programmers in mirror are brighter than they appear
    7. Re:Problem by ivan256 · · Score: 2, Interesting

      Seems more likely to be a scheduling issue to me...

      Core 0 loads byte 1, Core 1 loads byte 2, Core 1 or Core 2 has a cache miss on the XOR...(Do the cores share a cache?) Or it could be a locking problem. XOR is very common, and it would surprise me if it was slower than on previous intel chips.

    8. Re:Problem by TheRaven64 · · Score: 1
      There is a difference between XOR and the XOR operation. There are a lot of different components to the XOR operation; the instruction needs to be decoded into micro-ops, dispatched, executed, reassembled and then retired. Any one of these steps could have a bug, and that bug might only exhibit itself when the chip in in a particular state, one which happens to be used a lot by the RAID firmware.

      It is, however, a lot less likely that some of the other alternatives.

      --
      I am TheRaven on Soylent News
    9. Re:Problem by Anonymous Coward · · Score: 0

      RAID5 (could) work like this (for example):
      parity = data0 ^ data1 ^ data2

      Or put more generally:

      d = a ^ b ^ c

      If you know three of those, you can construct the fourth, or even MORE generally, if you know N you can re-construct the (N+1):th trivially.

      But it doesn't have to be ^ (xor). It could just aswell be subtraction. Or addition.

      Expressed in C,

      d = a + b + c;

      Again, if three are known, the fourth is trivial to re-construct. Example, data b is lost, we can recover it by doing:

      b = d - a - c;

      It doesn't really make any difference what is used. The problem is not likely to be what the cpu is doing but how the data it is doing something with gets there. If there's latency, that can be hidden: the cpu could process jobs in queue, not immediately when the jobs are dispatched. What's the bottleneck anyway? If it turns out to be the *newer* and *faster* cpu that would be surprising. Now, five-years-old cpu will not be much impressed by the workload of couple of cheap (E)IDE/(S)ATA(1/2) drives can push through. Even from the frikkin' cache. That's a fact you can take to the bank, come on, 100MB/sec or what can a one of these drives pull up? Multiplied by some factor like four, still doesn't add up does it?

      There's something fucked for certain. If old P2-400 can do the frikkin' job, order of magnitude faster processor with multiple cores shouldn't even frikkin' flinch. Maybe it's programmed like this:

      char read_byte(void* block_device_context, size_t offset, blablabla...)
      {
          char data[BLOCK_DEVICE_MAX];
          int i;
          for ( i = 0 ; i g_num_device; i++ )
          {
              block_device_seek(block_device_context,i,offset); /* !!! LOL !!! */
              data[i] = block_device_read_data(block_device_context,sizeof (char),i); /* OMG ROTFL? */
          } /* more horror here ... */ /* TODO */
          return ...;
      }

      Notice the char-at-time reading, lots of overhead for function calls, etc.. that would explain it, but then, how the heck that kind of clusterfuck would ever have ran decent on slower processors. Must be sunspots. That's it, mystery solved, move along.

    10. Re:Problem by level_headed_midwest · · Score: 1

      The two cores on a Core 2 Duo *do* in fact share a cache- the Pentium Ds have two chip-exclusive (but L1-inclusive) L2 caches. Maybe you are on to something.

      --
      Just "gittin-r-done," day after day.
    11. Re:Problem by ivan256 · · Score: 1

      Except that's the opposite behavior I would have expected... The shared caches should eliminate that problem. A different cache structure could have a negative effect on a hand tuned algorithm though.

    12. Re:Problem by ivan256 · · Score: 1

      Oh, but it adds a third possibility...

      Core 0 does reads for RAID 5, core 1 consumes the cache lines from core 0's reads, core 0 has a cache miss for the XOR...

      In fact... I can think of lots of code that would perform poorly because it assumes that two cores means two caches.... Most of it has been fixed with the introduction of Hyperthreading though.

  6. These are the cheesy RAID cards, right? by FatSean · · Score: 1, Troll

    Article implies that these are the lame RAID systems that the l33t G4m3rz use...the ones that use the CPU to do the real work, rather than having the card do it.

    I didn't realize they were so wide-spread...especially in government! Seems like just an additional avenue for data corruption should the software hiccup and lunch your data.

    --
    Blar.
    1. Re:These are the cheesy RAID cards, right? by amorsen · · Score: 4, Informative

      Software RAID is faster and more reliable than hardware RAID. Should your non-RAID controller fail, you just chuck it and get a random new one. If your RAID-controller fails, you have to get another controller exactly the same, sometimes even the same firmware revision, or kiss your data goodbye. And RAID-controllers are notoriously underpowered (SmartArray, I'm looking at you!)

      --
      Finally! A year of moderation! Ready for 2019?
    2. Re:These are the cheesy RAID cards, right? by WoodstockJeff · · Score: 1

      They are "wide spread" because a lot of SATA-based boards have these "RAID Controllers" built in, whether you want them or not. Something like 80% of the popular A64 boards have "RAID chips" on them, usually just the RAID 0/1 variety. And there are a lot of $30 add-in cards that are of the same ilk.

    3. Re:These are the cheesy RAID cards, right? by afidel · · Score: 4, Insightful

      Actually I would trust the Linux RAID5 software setup more than a LOT of the RAID controller firmware setups which I have had no end of problems with over the years including a card that rebuilt an array from the new drive on insertion instead of the other way around! Firmware is after all simply software, and software that tends to get a lot less scrutiny then alot of other classes of software, especially potentially data eating code in a project like Linux or one of the BSD's.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    4. Re:These are the cheesy RAID cards, right? by InsaneGeek · · Score: 3, Informative

      Software is more reliable+performance, what are you smoking? To get the performance you've got to turn on write caching, system goes down with write caching you're very likely (almost guaranteed) to have a corrupt filesystem. To get the reliability you turn off write caching, and performance plumments. Any hardware raid worth more than 3 cents has battery backed cache that allows you to have write caching and maintain reliability, not even taking in account being able to do some Raid5 operations with only 1 disk iop.

    5. Re:These are the cheesy RAID cards, right? by Professor_UNIX · · Score: 1
      If your RAID-controller fails, you have to get another controller exactly the same, sometimes even the same firmware revision, or kiss your data goodbye.
      Depends on your hardware controller. Some cards like the 3Ware controllers store the RAID info on the disks themselves so you can switch out the card with a completely different one and it should still work fine.
    6. Re:These are the cheesy RAID cards, right? by Anonymous Coward · · Score: 0
      Software RAID is faster and more reliable than hardware RAID. Should your non-RAID controller fail, you just chuck it and get a random new one. If your RAID-controller fails, you have to get another controller exactly the same, sometimes even the same firmware revision, or kiss your data goodbye. And RAID-controllers are notoriously underpowered (SmartArray, I'm looking at you!)


      You're a fucking retard.

      As far as benchmarks, show me the numbers.

      As far as reliability, in our operation (you know, hundreds of machines, not your mom's basement) I can pull a spare or a float of the same rev and pop it in.

      If you're a smaller operation you should be running dual raid cards each hooked to arrays which mirror each other.

      If you can't do that when you get a replacement you'd better damned well know what rev of firmware the old card was running there, sysadmin boy... along with its configuration. It's part of your job. Then flash the card and pop it in.

      Either you've got the money to be a production outfit or you don't.
    7. Re:These are the cheesy RAID cards, right? by Billly+Gates · · Score: 2, Interesting

      I was going to recommend 3ware as well. I have done any administration work in years but one of my former employers use them for their servers for that reason. If it dies you can replace the board and we even have a few stored in case of a failure.

      Organizations should look into this and not the vendor for their server for any raid setup. It would be nice if they all did as a server is not a desktop and the data is needed NOW when it goes down.

    8. Re:These are the cheesy RAID cards, right? by MerlynEmrys67 · · Score: 1
      Well, software CAN NOT be faster if you are on a high performance RAID system.

      Lets say you have a disk subsystem that can perform 10 GBPS. Now I will take all of that and run it over a PCI-X bus (same thing applies to PCIe except the numbers are higher) and get throttled to 6.8Gbps for a throughput of 4.3Gbps.

      Now I have a hardware RAID solution. I get 10Gbps from the disks, process it on the card - ship 6.6 Gbps over the bus, no problem.

      Now I haven't even mentioned the CPU utilization. Lets not go there, but yeah - if you are benchmarking Disk I/O Host CPU utilization doesn't matter. In real world scenarios where I actually want to process something from that I/O, I need CPU power available to do the real work.

      Low end RAID - I am right there with you... might as well waste the extra CPU cycles available on your desktop system. In real server environments - no way... Hardware RAID all the way (oh yeah - and I do backups on my raid array so if the controller fails, just get a new one and restore)

      --
      I have mod points and I am not afraid to use them
    9. Re:These are the cheesy RAID cards, right? by rbanffy · · Score: 1

      Software may be faster if your RAID controller is very dumb, your CPU is very fast and your computer is under a workload that's very easy on the processor.

      If it's a server, with heavy processor load and lots of disk reads and writes, a good, intelligent RAID controller is the way to go.

      And having a second identical unit on the shelf is very smart. After all, how much is your data worth?

    10. Re:These are the cheesy RAID cards, right? by h4ck7h3p14n37 · · Score: 1

      Sorry, but I have to disagree; software RAID is potentially slower and less reliable. Software RAID systems are going to use the CPU, stealing cycles from your other applications. I've seen utilization as high as 30% during resync operations using Solaris Volume Manager on an E-250 (granted, it's an old machine). Hardware RAID systems are going to contain embedded processors that are responsible for computing checksums, among other things, and take the load off of the CPU. They'll also contain additional disk controllers and power supplies, most likely with redundant components. You also typically get additional features like the ability to cross-connect your RAID system to multiple servers.

      Soft-RAID systems typically store the RAID metadata on the disks themselves, thus they're vulnerable to damage/destruction if someone screws up a format operation. Hard-RAID systems are going to store this metadata on separate, battery backed, storage.

      I'm not sure why you're concerned about not being able to find a new controller to replace a failed unit. Sun and IBM will ship you a replacement unit almost immediately; are you concerned about finding replacement for EOL'd units? If you're into running EOL equipment (I am), then it's your responsibility to track down replacement parts (yay eBay). I'd suggest keeping at least one replacement on-site.

      I can't speak to HP controllers being underpowered, I only have experience with SUN and IBM. I get the impression that you're used to running soft-RAID on small, non-critical systems?

    11. Re:These are the cheesy RAID cards, right? by Puff65535 · · Score: 2, Insightful

      No you're the retard, I have a SAN with all the production data, dual channels, dual switches, bazillion raid arrays, whole nine yards. I still use software raid, for the system disk, nothing on there that needs backing up, all the config files are under version control (repo elsewhere). Its a waste of dual channel fibrechannel disks to store the system image on the SAN, so I use a pair of cheap SATA disks in the system. Raid 1 means I don't have to re-install if I lose a disk, if the fuzz shows up with a warrant for my logs I just hand them one of the mirror disks, cloning a server for a quick cutover is cake, none of which justifies "real money".

    12. Re:These are the cheesy RAID cards, right? by Fweeky · · Score: 1

      "might as well waste the extra CPU cycles available on your desktop system. In real server environments - no way... Hardware RAID all the way"

      Right. The extra few fractions of a percent of CPU and memory bandwidth consumed by software RAID-1(0) can cripple even the most powerful of servers. And of course by "cripple" I mean "go almost completely unnoticed while reducing dependencies on notoriously buggy hardware and poorly supported drivers and management tools you can't even get the source for". Clearly having it in every server unconditionally shows great reasoning and use of resources.

    13. Re:These are the cheesy RAID cards, right? by AdamTheBastard · · Score: 2, Interesting

      "If your RAID-controller fails, you have to get another controller exactly the same"

      This is why we always by a spare card whenever we get a new RAID controller. That way we know that there will be something that will read the disks and know how the data is setup. Next time you by a RAID controller remember to get another one just like it. Otherwise you might be stuck with disks that will only be read by a card that is out of production.

    14. Re:These are the cheesy RAID cards, right? by Xofer+D · · Score: 1

      He's probably smoking some of the numerous benchmarks on Linux systems demonstrating that hardware RAID from vendors like 3ware perform similarly to plain SATA controllers with a software RAID configuration. Most of the free ones online are a little old, but ten seconds with google gives me this cheap-ass solution for a start). It's true that you can choose to spend enough to buy a controller that does perform better, but they're incredibly costly. For all but front-line high-performance computing machines or similarly demanding tasks where every CPU cycle matters, software RAID is a good choice. I support several small shops whose IT budgets are such that a RAID configuration for their intranet servers isn't really justified. With software RAID they can have good performance as well as reliability on any Linux system, practically for free. The code is quite mature and flexible (the latest revision of it, which I plan to test next quarter, now supports hot array re-shaping) and well supported on many hardware platforms.

      You're correct that a battery backed cache makes a real difference in write atomicity, and I suppose technically I agree that the cheaper controllers aren't worth anything... but only because there are nearly-free alternatives that are just as good for all but really specialized purposes.

      --
      The Signal/Noise ratio can be improved in two ways. Remaining silent is the OTHER way.
    15. Re:These are the cheesy RAID cards, right? by Anonymous Coward · · Score: 1, Interesting

      The performance penalty for software RAID is minimal unless your file server is also your build server or render farm, in which case it probably won't matter much because you're accessing data from local disk..

      I have a software RAID 5 with encryption on 12 PATA disks attatched to a dual-core AMD X2 3800. It has over 2.1GB on a single filesystem and can easily saturate 100Mbps for about $2000. It would cost 2x the price to build this system with hardware RAID.

    16. Re:These are the cheesy RAID cards, right? by InsaneGeek · · Score: 1

      The slowest part of any system is phsyical disks, mathematical algorithm comparison between hardware and software is negligible (and basically everybody does it the same) which is why with hardware raid you can do additional tricks to get around that limitation that software is unable to do without basically guaranteed corruption.

      Performance gets increased significantly by decreasing the number of iop's to phsyical media or decreasing the ammount of time to ack to the host write complete. How do you do that? Well you allow the storage to delay a write, i.e. turn on write caching, ack a write complete without actually getting it to the media. If you do that with software you are going to get corruption, if you do it with any reasonable hardware you are not. Without write caching you are bound by the speed of the physical disks, which like I said is always the slowest part of a computer. That is why hardware raid is faster than software raid, you can't make a physical disk spin faster so how do you go faster you defer writes and front it with enough cache to support the write ingest rate (doesn't take much, even a small 16-32 MB will give noticeable difference).

    17. Re:These are the cheesy RAID cards, right? by MerlynEmrys67 · · Score: 1
      You are cute...

      You really think a 6.8 Gbps stream will take a "fraction of a single percentage" of a CPU. Lets look at this... Round down to 800MBytes/Sec. How many instructions do you think have to be spent processing EACH byte. Realize that there will at least three read, a XOR, and a write operation for each word. lets make it an even four to keep the math simple so there are 800M operations that need to be performed every second on a stream of this size. Lets say a Nice 4GHz processor, running with 2 IPC (heck, lets make it four with dual core) and that is 5% right there.

      Now throw in the extra memory usage needed (remember I am processing 6.4 Gbps instead of 4.2) and I get quite a few more cache misses making my "production" code miss cache a lot more often.

      Now that we have gotten over the fraction of a single CPU myth... try again.

      --
      I have mod points and I am not afraid to use them
    18. Re:These are the cheesy RAID cards, right? by aschlemm · · Score: 1

      When I deploying critical servers I almost always had them setup with hardware RAID-5. In some cases with some IBM Blades I only had space for two disks in the array and settled on hardware RAID-1. The reason I always went with a hardware RAID solution was so I could hotswap a failed drive and not have any downtime with our systems. Sure it would be slow with a drive out in the RAID-5 array but so what since the system stayed up. Hardware RAID wasn't a subsitute for regular backups though and we had nightly tape backups going for those systems that needed it with the tapes be stored offsite for disaster recovery purposes.

    19. Re:These are the cheesy RAID cards, right? by Fweeky · · Score: 1

      Right, and what "real server" workload are you describing here? Believe it or not, most servers do not need to perform anywhere near 6.8Gbps of IO to huge disk arrays theoretically capable of that. If a particular server does, fine; hardware RAID, gobs of memory and tonnes of disk shelves for that. But most will do just fine with a small number of disks performing small IO's that don't even come close to entertaining a modern CPU at any appreciable level; if work is IO bound, it's bound by seek time, not transfer rate.

      You might as well be arguing that all "real" servers should have hardware crypto; sure, some servers do a lot of crypto ops and will benefit significantly from it, but most are going to be using it for SSH and little else, making it a complete waste of money, power and admin effort.

    20. Re:These are the cheesy RAID cards, right? by drsmithy · · Score: 1
      You really think a 6.8 Gbps stream will take a "fraction of a single percentage" of a CPU. Lets look at this... Round down to 800MBytes/Sec. How many instructions do you think have to be spent processing EACH byte. Realize that there will at least three read, a XOR, and a write operation for each word. lets make it an even four to keep the math simple so there are 800M operations that need to be performed every second on a stream of this size. Lets say a Nice 4GHz processor, running with 2 IPC (heck, lets make it four with dual core) and that is 5% right there.

      [...]

      Now that we have gotten over the fraction of a single CPU myth... try again.

      A 2.8 Ghz Pentium 4 has a RAID5 checksumming throughput of around 3.5GBytes/sec. So your 800MBytes/sec (which is a truly massive amount of data to writing to disk - nearly 50 gigs a minute) will use around 22% of it.

      There *are* scenarios where hardware RAID can do a faster job than (modern) software RAID - but they're few, far between, specialised and very expensive.

    21. Re:These are the cheesy RAID cards, right? by drsmithy · · Score: 1
      Sorry, but I have to disagree; software RAID is potentially slower and less reliable.

      Benchmarks almost always show (modern) software RAID to be faster than hardware RAID, in the common case.

      Soft-RAID systems typically store the RAID metadata on the disks themselves, thus they're vulnerable to damage/destruction if someone screws up a format operation. Hard-RAID systems are going to store this metadata on separate, battery backed, storage.

      Are you seriously trying to argue moving from multiple points of failure (multiple drives) to a single point of failure (the RAID controller) is going to provide a more reliable system ?

      There's a reason why good hardware RAID controllers store the array metadata on the physical drives and it's the same reason software RAID does it.

      I can't speak to HP controllers being underpowered, I only have experience with SUN and IBM. I get the impression that you're used to running soft-RAID on small, non-critical systems?

      Unless you're dealing with large numbers (10+) of drives, combined with limited bus bandwidths and extremely high-end RAID controllers with gigabytes of cache, software RAID is almost always going to be faster than hardware RAID. Even then, the extra flexibility of software RAID will probably make it the better overall choice.

      This is not to say hardware RAID isn't useful even when "slower" - the abstraction of only having to deal with a single device at the OS level can be very handy.

    22. Re:These are the cheesy RAID cards, right? by circle+empty+roll+wh · · Score: 1
      Software is more reliable+performance, what are you smoking? To get the performance you've got to turn on write caching, system goes down with write caching you're very likely (almost guaranteed) to have a corrupt filesystem.

      No. Any modern filesystem (NTFS, Ext3, XFS, JFS, etc) is a journalling filesystem. The whole point of using a journal is to prevent corruption in such cases, since the filesystem is always consistent at any point in time. So when using, for example, the Linux MD software RAID implementation, with write caching enabled, with Ext3/XFS/JFS, you are guaranteed in theory to never experience a fs corruption. In practice, corruptions happen because of faulty hardware (bad RAM, system too hot, etc) or bugs (in early 2.6 kernels, etc).

      Regarding performance, kernel hackers have repeatedly found that their software RAID 5 implementation outperform most hardware RAID cards (even some very expensive ones). Have a look at the Linux Kernel Mailing List, or linux-ide mailing list.

      Moreover, Linux's software RAID 5 is more flexible (create arrays on ANY disk on ANY SCSI/SATA card in the system), more consistant (one and only one control software to learn: mdadm(8), no need to use crappy vendor tools or reboot into vendor BIOSes), cheaper (no hardware to buy), more reliable (no hardware card = 1 less hw component that can fail), easier to troubleshoot (take the disks with you and plug them in ANY linux server, it will work, there is no reliance on any particular hw card) and more scalable (spread the load across multiple disk controllers, multiple PCI-X/PCIe busses, or even multiple SAN devices).

      (The above paragraph is something that has been explained here in the past.)

    23. Re:These are the cheesy RAID cards, right? by dnoyeb · · Score: 1

      RAID manufacturers are certainly aware of this. Much of their new product is backwards compatible. Is this really an issue?

    24. Re:These are the cheesy RAID cards, right? by InsaneGeek · · Score: 1

      Umm... no, just no. A journaled filesystem does not protect you from a write-cache situation where you've lost all uncommited I/O's. If an I/O is in cache and the box reboots, it will be guaranteed that you will not have the data on the disk. Furthmore, the problem is that some of the drives can have the change and others not, the drives have acked to the host of the i/o completion so it has gone on. Half of your raid5 drives have the committed data, the others don't is bad enough, but because it's been acked to the host you can have multiple commited i/o's to some disks while their parters still have them in cache.

      To keep up the performance you use hardware raid that has battery-backed write cache to destage to disk, write caching gives significant performance increases (an order of magnitude 2x+) larger than algorithms. Linux may have the fastest algorithm in the world, but it can't get around the physical limitation of how fast the drive rotates, you use write caching to get that. I'll use exagerated numbers to more clearly explain this, the software raid5 algorithm in the world may run 50% faster than the hardware in the world, but using write caching is 500% faster than non. Hardware raid will than run 450% faster than software, while actualy having a slower algorithm.

      This page basically shows exactly this (corruption) in ext3 & reiserfs filesystems using software raid: http://sr5tech.com/write_back_cache_experiments.ht m

    25. Re:These are the cheesy RAID cards, right? by kwalker · · Score: 2, Informative

      That's why we have UPSes. It doesn't matter if your RAID controller has battery-backed write cache if the power to the drives dies.

      --
      ... And so it comes to this.
    26. Re:These are the cheesy RAID cards, right? by InsaneGeek · · Score: 1

      Actualy it wouldn't be a problem, normally the batteries on the chip last significantly longer than any UPS (and then there are those that use NVRAM which never would be a problem). When the drives come back online the pending I/O's are destaged to disk. Let alone the real-life examples of full-on datacenters with UPS's going dark during the eastern US blackout.

    27. Re:These are the cheesy RAID cards, right? by MerlynEmrys67 · · Score: 1
      Again... I am talking about big iron servers - not toy PCs...

      I will easily agree that for Desktop PC environments, you might as well use software RAID - there just isn't any reason not to. However, for big database servers, and things doing massive ammounts of I/O (yeah - stuff that you can't even do with a standard PC based server)

      The claim was made first that Software RAID is faster (I claim it isn't on the high end) then the claim was made that it would only take a fraction of a percent of a CPU - and I claimed it would take at least 5% of a high end CPU, and you are backing that claim up saying it is 22%.

      Kind of like the people saying you don't need any fancy routers, just throw a few interfaces on a Linux box and route to your hearts content. Yeah - it works great for T-1 speeds, and a couple of other slow networks, but if you want to be routing multiple 100mbps interfaces (Gig now, if not 10 gig) you really nead custom ASICs to get you over the limitations of toy PCs and their bus architecture.

      --
      I have mod points and I am not afraid to use them
    28. Re:These are the cheesy RAID cards, right? by macmouse · · Score: 1

      Then what happens when the RAID card dies?? The data that is left sitting on that battery-backup memory still hasn't made it to disk. So when you swap the card out, you *still* have to run FSCK (or equivilant) because the last bits of data are still not on the disk. This happened with fastmail (http://status.fastmail.fm/). This caused them major grief for them.

    29. Re:These are the cheesy RAID cards, right? by InsaneGeek · · Score: 1

      If they are non-redundant this is when the perverbial shit hits the fan, you'd be in a position like I've mentioned one would be with software mirroring with write cache, you'd have lost your transactions in flight and cannot even guarantee a fsck would recover the filesystem as you might have commited writes on some drives but not others.

  7. Re:Why aren't you running a dedicated controller.. by andrewman327 · · Score: 4, Interesting
    I agree with this. For most people, backing up your data every week is a LOT better option for data security. Users who should be using RAID 5 should also have dedicated controllers.


    Still, this is a problem for Intel. Their products are supposed to do what they do extremely well under all conditions. I hope that they find a way to fix this admittedly niche problem.

    --
    Information wants a fueled airplane waiting at the hangar and no one gets hurt.
  8. Re:Why aren't you running a dedicated controller.. by Pyrowolf · · Score: 1

    Exactly. I'm baffled how a separate controller would cause such an issue. You would expect more than just raid controllers to exibit this behavior considering even non-intel cards are causing woes.

  9. hi man, do you forget SO/HO users? by Anonymous Coward · · Score: 0

    I know a lot of people are using on-board RAID-5 feature.

  10. Re:Why aren't you running a dedicated controller.. by Shoeler · · Score: 1

    The point TFA makes is not that a RAID5 setup would be used on a desktop, but that real-world performance seems to suffer on this chip.

    Am I halucinating to recall something happening like this a long time ago with Intel?

  11. Re:Why aren't you running a dedicated controller.. by lukas84 · · Score: 2, Informative

    Yes, i tought about this too.

    Using RAID5 in software (be it completely in software like Linux MD or Windows Dynamics Disks, or 99% in Software, like most Onboard RAID Controllers out there) isn't a good idea if you want to run an "enterprise" setup. It might be okay for your mom's basement, or for test systems.

    But productive systems should be using real raid controllers, equipped with half a gig of cache memory, a battery backup in case of a power failure for the cache, and dedicated processor for the raid5 overhead.

    Intel might've screwed this up, but it will only affect non-professional IT.

  12. Re:Why aren't you running a dedicated controller.. by moggie_xev · · Score: 5, Insightful

    Reading the article it's all about software raid and the performance they get.

    The interesting question is what other peices of software that we run will get unexpectedly bad performance.

    ( I have > 2TB of hardware RAID 5 at home so I was wondering ... )

  13. It's only onboard RAID by b00m3rang · · Score: 2, Insightful

    You should be using a controler with a dedicated processor, anyway.

    1. Re:It's only onboard RAID by catch23 · · Score: 1

      I disagree. I've had lots more successes with open source software raid over hardware raid. Usually the raid overhead is relatively minimal anyway. With hardware raid, if your raid card somehow dies (it happens more often than you'd think) you'd have to get the exact same one... which is usually hard when the company that created your raid card went out of business a few years ago. At least with open source software raid, you don't have to worry about those kinds of problems.

    2. Re:It's only onboard RAID by Billly+Gates · · Score: 1

      RTFA

      It happens with other hardware based non intel based boards as well. IBM raidserve as an example

      Another poster mentioned that it could be DMA related or the way the new speculative branch prediction algorithms work. Perhaps a race condition exists that rarely happens with earlier chips but is executed with the newer ones often which would slow it down.

    3. Re:It's only onboard RAID by aachrisg · · Score: 1

      Echo this. I've got terabytes of software raid5. Performance with dual cpus matches what the actual disks can deliver. The good thing, is if my disk controller dies, or the machine the drives are in dies, or other mishap, I can take those drives, hook them up to any random pc, boot linux off cd, and my data will be accessible.

    4. Re:It's only onboard RAID by lukas84 · · Score: 2, Informative
      which is usually hard when the company that created your raid card went out of business a few years ago

      Professional IT doesn't work like that. You have a maintenance contract on your machine, usually from the machine manefacturer itself (like IBM, HP, DELL, whatever floats your boat). You buy this maintenance contract depending on the time you will need the machine (they're usually available from 3-5 years).

      You renew the machine before the contract runs out. IBM, HP, DELL running out of Business seems very unlikely to me.
    5. Re:It's only onboard RAID by b00m3rang · · Score: 1

      Aah... I'd only read the second article earlier today, didn't notice there was another posted here.

    6. Re:It's only onboard RAID by h4ck7h3p14n37 · · Score: 1

      Why are you using a RAID card from a defunct manufacturer? Either buy a replacement before this happens, or stay away from EOL hardware.

    7. Re:It's only onboard RAID by pdbaby · · Score: 1
      RTFA
      Hi, you must be new here!
      --
      Global symbol "$deity" requires explicit package name at line 2. - If only $scripture started "use strict;"
    8. Re:It's only onboard RAID by Anonymous Coward · · Score: 0

      ... or Compaq. And don't tell me HP supports all of Compaq contracts, because they don't.

  14. Re:Why aren't you running a dedicated controller.. by cyanics · · Score: 2, Informative

    I totally agree. If this is actually a RAID-5 setup, then it requires at minimum 3 drives. Most onboard (intel) RAID controllers are only setup for 0,1,0+1, or 10. And not RAID 5. I don't see how it could possibly be correlated to the CPU. It seems much more likely that if it is a new North/South bridge, that the problem is the with IO controller.

    CPU utilization in RAID5 configurations is almost entirely offloaded to the RAID controller.

    The article (including spelling errors) fails to mention a lick about the RAID controller. Only that "it's a cpu problem."

  15. Won't matter for most users by Anonymous Coward · · Score: 1, Insightful

    If you don't care about data integrity, throughput, or CPU utilization, like me. Most users will be buying the core 2 for household heating, and not superfluous stuff like data access.

    1. Re:Won't matter for most users by Anonymous Coward · · Score: 2, Funny

      cat coke | nose > keyboard

    2. Re:Won't matter for most users by firl · · Score: 3, Funny

      Well, my 60 hard drive array provides much better heating than my 4 processor sparc server

    3. Re:Won't matter for most users by RightSaidFred99 · · Score: 2, Informative
      "This is 2002. I want my joke back."

      In the real world, Core 2 has a comparable or lower thermal envelope than AMD's chips.

  16. Gee, let me think. by Anonymous Coward · · Score: 1, Insightful
    Can Intel afford to make a misstep now with even in the small subset of users running RAID 5 systems?"
    Yes. Yes, they can. They have a shitload of money, and survived the Pentium dividing by zero.
  17. Re:Why aren't you running a dedicated controller.. by Ruie · · Score: 1
    If you're running raid5 it's probably in an enterprise setup. If so, why aren't you running a dedicated controller? The CPU should have little to no impact on the raid subsystem...

    This test is interesting for two reasons:

    • Cheap cluster nodes or desktops - one might not want to shell out $300+ for a dedicated controller
    • RAID code basically just munches data around. If software RAID performance is bad, it is likely that the performance of interpreted and bytecode/JIT languages (such as perl, python, tcltk, java, etc) suffers as well. This would be a good reason to avoid the cpu completely. I really hope this is just some sort of driver issue and not an inherent cpu weakness.
  18. Re:Why aren't you running a dedicated controller.. by RingDev · · Score: 1

    Not to mention that most workstations and home PCs don't run RAID 5. If the Core/Core2 chip sets are targeted for machines that don't run RAID, it's not a big deal. If you are running RAID 5, it's likely in a server environment where you would probably have a RAID controller and a Opteron or Xeon based chip.

    -Rick

    --
    "Most people in the U.S. wouldn't know they live in a tyrannical state if it walked up and grabbed their junk." - MyFirs
  19. Because software raid outperforms controllers. by Anonymous Coward · · Score: 1, Informative
    If you have a dedicated file server, you're likely to find that software raid will significantly outperform a "HW" controller.
    Remember that all a HW raid controller is, is a low end (compared to Xeons, etc) embedded CPU running software not unlike what your software raid solution would run.


    This extra coprocessor for RAID is great when you have a box doing many different things like rendering, etc. But on a dedicated fileserver you'll be better off using the really fast CPU rather than the much slower raid controller chip to do the RAID logic.

  20. Re:Why aren't you running a dedicated controller.. by saleenS281 · · Score: 1

    Python/perl/java have not suffered in any tests I've seen. I guess that leads me to question these findings even more.

  21. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 1, Interesting

    Intel might've screwed this up, but it will only affect non-professional IT.

    Or those with budgets.

  22. Re:Why aren't you running a dedicated controller.. by martok · · Score: 2, Informative

    Because it's often slower to do so. We ran tests on a good Adaptec u320 raid controler about a year back and though cpu usage was good. We got much better performance out of Linux softraid5. I would suspect this was because the host cpu was faster than that on the controler.

    Not to mention there is a huge cost savings in going with a softraid solution.

  23. Raid 5 - Non Issue by Iberian · · Score: 0, Troll

    Raid 5 is 3 hard drives at the minimum. Most gamers will have 2 that run in striped configuration as that is the best price/performance ratio saving money for the ever expensive video cards. Corporations on the other hand have servers with dedicated raid cards.
    Short answer: Move along nothing to see here.

  24. Talk about Fear Mongering by hcob$ · · Score: 1

    If it truely is a problem with the CPU, then that's what Intel has the Microcode Upgrades. A simple little upgrade and voila... no more AMD people shouting and pointing "OHH! OHHH!!!! Intel made a MISTAKE!!!!! Lookie Lookie! HAHA!!!! AMD NEVER MAKES MISTAKES!!"

    Of course, this is definately a case of the pot calling the kettle black. I mean, name one processor in the last 20 years that HASN'T had a bug in it.

    --
    Cliff Claven
    K.E.G. Party Chairman
    Founding Leader of: Koncerned for Egalitarin Governance
    1. Re:Talk about Fear Mongering by WilliamSChips · · Score: 1

      Name me one AMD processor nearly as buggy as Intel's offerings.

      --
      Please, for the good of Humanity, vote Obama.
    2. Re:Talk about Fear Mongering by hcob$ · · Score: 2, Informative

      K6. As a bonus answer... Comparable Intel Processors at the time also had the Microcode Upgrade ability as well.

      Next QUESTION!

      --
      Cliff Claven
      K.E.G. Party Chairman
      Founding Leader of: Koncerned for Egalitarin Governance
    3. Re:Talk about Fear Mongering by plague3106 · · Score: 1

      How do you know the microcode update doesn't also have bugs in it? Its a trust thing. This isn't the first time Intel has a chip that raises questions about trust.

    4. Re:Talk about Fear Mongering by cnettel · · Score: 1
      Even after the already mentioned K6, there certainly were times where the only affordable and performant chipset was a VIA offering with definite problems. The 686C southbridge problem, for example, was naturally completely out of control for AMD, and was also used with Intel chips, but it was more common on the AMD ones. I'm mostly mentioning it now as that was also a performance and data corruption problem showing up much more easily in RAID configs and HD-HD transfers in non-RAIDed systems. The Inquirer articles are mentioning Southbridge problems in one sentence, but they do not elaborate conclusively on what chipsets have been tried this time.

      (And, yeah, while Intel was most heavily touting RAMBUS, VIA was the primary source for those who were more afraid to overclock 440BX like crazy on the Intel side, as well.)

    5. Re:Talk about Fear Mongering by Hymer · · Score: 1

      "I mean, name one processor in the last 20 years that HASN'T had a bug in it."
      Like the Digital Alpha 221164 and 221264 ?
      ...I don't care it's considered dead... it is the best CPU...
      --
      ...and a Tru64 cluster is what clustering should be like.

    6. Re:Talk about Fear Mongering by Dr.Ruud · · Score: 1

      RAID by language: also ... as well.

  25. Re:Why aren't you running a dedicated controller.. by cnettel · · Score: 2, Interesting

    My personal "analysis", is that this sounds much more like a DMA issue, either in chipsets, in the processors, or in OSes. Core 2 is doing some speculative prefetching and a quite different cache management scheme, so some naive ideas would be that some piece of code or hardware got away with doing things improperly before, a very rare race condition might have become commonplace. If that's the reason, it might be easy to fix. Of course, it might also mean that the prefetching or cache sharing between the cores (or a couple of other things) are actually faulty...

  26. Re:Why aren't you running a dedicated controller.. by Kadin2048 · · Score: 4, Insightful

    I'm slightly confused.

    The articles are both very light on technical details, and somewhat vague as to what's really going on. (Admittedly, maybe they don't know it.) In the first article, they allude to the problems being the result of the "softmodem"-like RAID systems that modern integrated motherboards use, which would remove some of the blame from the processor. But then they also suggest that the same problem occurs with dedicated RAID controllers (IBM ServeRAIDs -- I think these are dedicated controllers), which don't cause too much CPU load at all ... further implicating the mobo. However, similar mobos with AMD processors didn't experience the problem, so there's obviously something going on that's Intel's fault.

    It doesn't seem like it would be that difficult to pin the blame down to the particular component: is it the integrated RAID subsystem utilizing the processor inefficiently? Or is it the processor itself, being slow? And if it was the processor, why wouldn't this slowness be exhibited in other situations?

    Seems to me that what needs to happen, is for somebody to do a test with a Conroe processor in a motherboard that doesn't include any of the integrated, offload-work-to-the-processor type of integrated subsystems (RAID, sound, Ethernet), use a 'real' hardware RAID controller, and see what the results are. If there are still problems in that scenario, then there would seem to be something wrong with the processor, and this could be confirmed with simulative benchmarks.

    As a criticism of Intel's complete "systems" (processor plus chipset) I suppose this is a valid criticism, but I'd like to see more of a breakdown as to where the performance hit is coming from.

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
  27. Re:Why aren't you running a dedicated controller.. by Karma+Farmer · · Score: 1
    Seems odd to me that the Inquirer is the only one reporting this.
    Do you consider it equally odd when a news article is only reported in Pravda, The Sun, the Washington Times, or WorldNetDaily?
  28. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    If the Core/Core2 chip sets are targeted for machines that don't run RAID, it's not a big deal.

    Everyone's downplaying this, but nobody's asking the real question: Why does it suck at RAID? If we don't know why it's doing this, what if there are other things that will cause the Core to likewise suck?

  29. doesn't matter by dJOEK · · Score: 1

    Real world businesses use External Hardware RAID boxes, or SANs
    using your CPU to play RAID5 controller is just plain dumb

    now, as long as mirroring works fine, i'm happyh

    --
    Exercise caution when modding this message up: the author acts like a jerk when his karma is excellent.
    1. Re:doesn't matter by Anonymous Coward · · Score: 0
      --
      Exercise caution when modding this message up: the author acts like a jerk when his karma is excellent.
      Mod parent -2: One for signature karma whoring, and one for referring to himself in the third person.
    2. Re:doesn't matter by Nom+du+Keyboard · · Score: 1
      as long as mirroring works fine, i'm happyh

      Mirroring uses 1/2 the disk space for redundant storage. It works with even numbers of hard drive 2 and above.

      RAID 5 puts all the redundant storage on a single drive, and works with numbers of hard drives 3 and above.

      Example: with 4 equally sized hard drives, mirroring would only give you .5 the storage capacity of all the drives. RAID 5 would give you .75 the storage capacity of all the drives. A much better buy, especially considering that many computers only accept 4 SATA drives.

      Class dismissed!

      --
      "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
    3. Re:doesn't matter by Covener · · Score: 1

      RAID 5 puts all the redundant storage on a single drive, and works with numbers of hard drives 3 and above. ...
      Class dismissed!


      No, parity is distibuted across all disks in a raid 5 array. raid3/raid4 use a single drive for parity.
    4. Re:doesn't matter by dJOEK · · Score: 1

      you're an idiot.

      raid5 and raid1 serve completely different purposes
      for example, i want my backend storage deviced to employ some sort of raid5, to protect against hardware failure
      then i want my cluster application nodes to use 2 such disks and write data mirrored, and managed by the os?

      why? failover scenarios work much better then the software is aware of the mirror disk

      Class Dismissed, and i give you a C-, and that's coz i'm generous

      --
      Exercise caution when modding this message up: the author acts like a jerk when his karma is excellent.
    5. Re:doesn't matter by Nom+du+Keyboard · · Score: 1
      No, parity is distibuted across all disks in a raid 5 array. raid3/raid4 use a single drive for parity.

      You missed my point. Taking any number of drives and adding one additional one for RAID 5 works because the additional drive, in the RAID 5 setup, contains all the additional amount of storage necessary to rebuild any failed drive in the array, including itself. I wasn't dealing with the details of how RAID 5 accomplishes this feat, which is by parity of all the other drives.

      --
      "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
  30. Re:Why aren't you running a dedicated controller.. by whoever57 · · Score: 1
    Reading the article it's all about software raid and the performance they get.
    Err, NO! It's about FAKERAID, which is a H/W S/W combo.
    --
    The real "Libtards" are the Libertarians!
  31. *NOT* just on-board RAID by Anonymous Coward · · Score: 2, Interesting

    From TFA:
    The reason was that there were severe problems when Woodcrest was paired with a 1E RAID field when using IBM ServeRAID controllers. The problems didn't occur just in benchmarking, it was the every-day usage model that produced unexpected errors.

    ServeRAID controllers aren't some cheapo CPU-based RAID, it looks like this might be a more serious problems.

  32. Re:Why aren't you running a dedicated controller.. by lukas84 · · Score: 1, Interesting
    Or those with budgets.


    If you can't afford to do something right, don't do it.
  33. XOR is very common by HaeMaker · · Score: 3, Informative

    You use XOR to clear a register. XOR CX, CX sets the CX register to 0. It is faster than MOV CX, 0.

    1. Re:XOR is very common by cnettel · · Score: 1
      The actual opcodes are certainly also more compact, which means more room in the L1 instruction cache. I'm not sure how relevant the speed argument itself would be these days, I would have guessed that they would use 1 cycle on average in any case, but maybe with different characteristics regarding register file usage and whatnot. As XOR has been "The" way to do it on x86, I guess it should continue to work well. On the other hand, Netburst suddenly made shifts relatively expensive. As this (XOR y, y) is so common, I would guess that the register dependency analyzer will class this as a "non-dependency". A naive parsing of the instruction would otherwise naturally conclude that CX was an operand used for both reading and writing, and that the instruction would be dependent on any earlier store to CX.

      If there is special handling to fix the register assignment, then it's reasonable to guess that the execution of the instruction is never really done as an actual XOR, either. To speculate even further, one could start imagining all kinds of bad things happening if a "pre-parsing" treats all XORs as "reset register" with some kind of pipeline stall taking place when the XOR actually has to be executed. Quite unlikely, but an entertaining idea, at least to me, right now.

    2. Re:XOR is very common by Anonymous Coward · · Score: 0
      I think I read somewhere (although I cannot find it now) that
      mov cx,0
      and
      xor cx,cx
      produce the exact same micro-ops on modern x86 CPU's. If that's correct,
      mov cx,0
      should be preferred, as it is more descriptive.
    3. Re:XOR is very common by neirboj · · Score: 1

      XOR is also very handy for computing parity. If you think of this bitwise as applying one bit to another by way of the exclusive OR operation, then the application of a 1 toggles the bit to which it is applied, and a 0 doesn't.

      Looking at a serial example, we start with a 0 if we want EVEN parity or a 1 if we want ODD. Then we apply each data bit against that in sequence using XOR and the result after every operation is the parity bit for the bits examined so far. So with EVEN parity (0) and the octet 0x6B (01101011b), the result is a 1 after three holds and five toggles, or rather the value which must be appended to the octet such that the number of 1's is even.

      RAID5 is striping with parity, which means (roughly speaking) that each nth stripe is the XOR of the n-1 stripes behind it. In this way, any one stripe can be regenerated by combining the others... with a whole load of XOR's.

    4. Re:XOR is very common by Anonymous Coward · · Score: 0

      They don't generate the exact same micro-ops since xor has the side effect of changing the flags registers.

      Although both instructions take the same amount of time to execute, the reason xor is used in preference to mov is that the opcode is shorter. You end up with a smaller executable and a larger percentage of it can fit into cache thereby increasing performance.

    5. Re:XOR is very common by Kaenneth · · Score: 1

      I would classify XOR'ing a number against itself to obtain Zero as a 'hack' (clever, but obfuscated) no matter how frequently it was used...

      But I suppose making a special instruction just to Zero a register would be just as much a hack (hardcoded values...)

      But depending on a workaround to be faster than the direct way of setting the register to Zero just seems to me to be a bad idea... But as often as it's done, it's probably saved years, if not centuries of cumulative CPU time.

      I don't envy CPU designers work.

    6. Re:XOR is very common by Anonymous Coward · · Score: 0

      When you say XOR CX,CX, you're explicitly creating a partial register stall unless one of the very previous instructions were XOR ECX,ECX (or MOV ECX,0), to clear the whole register.

      Heck, for all I know the Intel-ish 64-bit CPU's might even need full 64-bit register clears: XOR RCX,RCX.

      Also remember Intel is deeply in love with deep, and I mean deep, pipelines. A stall is something that can take more CPU cycles than even a DIV. If the code is optimized for 386SX, it will run like in molasses on one of the later beasts.

      I'd say rewrite it to use even Screaming Cindy 2 (SSE2) where available. That way one could XOR 16+16 bytes in one instruction, and then copy the 16 resulting bytes to the buffer using the next instruction (not in one clock cycle though, obviously).

    7. Re:XOR is very common by be-fan · · Score: 1

      This is no longer true. On modern processors, both take exactly one cycle, and the MOV sequence is shorter when working with the 32-bit registers.

      --
      A deep unwavering belief is a sure sign you're missing something...
    8. Re:XOR is very common by Anonymous Coward · · Score: 1, Insightful

      1. You use xor to do *exclusive-or*

      2. Talking about faster in context of 16-bit 0x66 prefixed instructions is hilarious

      3. No, it is not "faster", it just doesn't need the constant so it encodes the instruction into smaller space, however, that doesn't say anything about being faster. You really need to know the x86 implementation and the relevant details, how it decodes into microcode, is there tracecache, is there instruction macrofusion and so on.

      4. xor reg,reg would be relevant in 1990's

      5. Decoder bandwidth is not an issue unless the code is generated on-fly and most of the code being executed is 'new' (which doesn't happen very often in binaries which are generated offline)

      6. Wow.

    9. Re:XOR is very common by Anonymous Coward · · Score: 0

      >But I suppose making a special instruction just to Zero a register would be just as much a hack >(hardcoded values...)

      There are architechtures out there, where some register is "hardwired" to value zero.

      move r0,r1 ; r1 = 0
      move r1,r0 ; nop (writing to zero register is no-operation)

      Usually number of registers is a power of two, for the simple reason that the register names are encoded in specific number of bits (wasteful but works anyway, and much simpler and logical and other arrangements like "table" driven decoding). SO, say, we got 16 registers that would mean encoding would need 4 bits for the distinction. If "wasting" one register for zero, we'd end up wasting 1/16 of the encoding real-estate for this job.

      On x86 instructions often have fixed source or destination ports (mul, imul, div, etc..) which saves encoding space. This means the encoding is *compact* but also restricted.

      The real problem with this arrangement is that there is always some aliasing and dependencies that are *arbitrary* due to limitation in number of nodes in a typical dependency graph. But that's not a real serious problem all the same until we can issue more instructions that we can decode, we are not limited by that - yet. ;)

    10. Re:XOR is very common by level_headed_midwest · · Score: 1

      A few things:

      1. The Core 2's pipeline is 14 stages- Intel's love affair with long pipelines has at least taken a hiatus.
      2. SSE2 calculations can be and are used for RAID 5 and 6, at least in the Linux MD kernel RAID. When I boot, MD does a short CPU benchmark of how fast different SSE and SSE2 algorithms do RAID 5 and 6 checksumming and then picks the fastest one to use. (I do not have MD set up, but it is included in the later kernel versions and runs its little test at boot.)

      --
      Just "gittin-r-done," day after day.
    11. Re:XOR is very common by Anonymous Coward · · Score: 0

      It used to be faster when XOR CX, CX took a byte and MOV CX, 0 took three. Not sure nowadays.

  34. Timing problem by toybuilder · · Score: 3, Insightful

    This sounds like a timing problem -- the processors are too fast, causing the system to slow down.

    There was a similar problem that I had to wrestle with on a Linux when runnig 3Ware RAID controllers w/ RHEL3 on fast dual-processor servers. When battery backed write caching was turned on, the fast acceptance of IO requests (by the CPU's and then by the hardware RAID controller) lead to awesome sustained performance for short bursts, but under constant load would suddenly hit a wall and then IO would practically hang. (https://bugzilla.redhat.com/bugzilla/show_bug.cgi ?id=121434)

    1. Re:Timing problem by Anonymous Coward · · Score: 0
      This sounds like a timing problem -- the processors are too fast, causing the system to slow down.

      Saying "the processors are too fast" makes this sound like the fault of the processors.
      CPUs waiting for an I/O device just sit in an idle or I/O wait state until the device is ready.

      What's an example of a non-buggy algorithm that breaks when your modern, general purpose PC runs "too fast"?
    2. Re:Timing problem by toybuilder · · Score: 2, Insightful

      I was implicitly blaming bad driver/kernel design when I said that there was a timing problem caused by the processor being too fast. I'm not talking about a timing problem within a single IO/memory bus access cycle -- I'm talking about driver code that breaks because some critical section of code used to run slow enough while a peripheral was processing a request, but due to faster processing, was now allowed to livelock.

    3. Re:Timing problem by Anonymous Coward · · Score: 0

      OK, thanks. So you're talking about software bugs in the kernel or drivers.
      Makes me wonder about the title of this entire article...

    4. Re:Timing problem by Anonymous Coward · · Score: 0

      We've definitely found our share of livelock problems in both the processor and the chipset during both pre-silicon testing and testing of the first production parts. However, I'd be surprised if the existing chipset mechanisms for breaking livelock were failing. But the logic that detects and breaks out of livelock scenarios most definitely has a performance impact. Sometimes the only way to guarantee the system can make forward progress is to halt all other processor agents that are making forward progress until the agent that isn't making progress sucessfully does so. If this happens often enough in a multiple processor platform, overall performance will definitely degrade.

      However, for your average software flow, I'd be surprised if livelock handling logic was triggering that often to have a noticeable performance penalty. All of the livelock scenarios I've seen in pre-silicon or post-silicon testing involved a couple things:

      1) Self-modifying code scenarios (Does any modern O/S software do this kind of stuff?)

      2) Bus lock requests (I could see these happening in a device driver for an I/O card, although normally locks are handled at the cache level and not by the chipset)

      2) Multiple processors fighting to have cache ownership of the same memory location. Since I/O memory isn't cached, that would prevent a lot of the scenarios. Also for the average driver, I wouldn't expect it to be multi-threaded and having multiple processors working on the same memory locations.

  35. Is the sky falling? by DysenteryInTheRanks · · Score: 4, Funny

    Can Intel afford to make a misstep now with even in the small subset of users running RAID 5 systems?

    No. No, it cannot. Sell your stock. Rip the CPU out of your boxen. One hundred ten billion dollars in market capitalization has disappeared in a flash with the publication of this groundbreaking article in the Inquirer.

    Intel has signed its own death warrant. As goes RAID5, so goes the world.

  36. Re:Why aren't you running a dedicated controller.. by Albanach · · Score: 5, Insightful
    You are correct that RAID isn't a backup solution, but incorrect when you say if you're using RAID5 you should be in a data centre.

    What if you have a lot of photos, music or movies - these aren't unusual things these days. I don't want to go rummaging through DVDs to find the picture I want, I want to fire up f-spot and see it there straight away.

    RAID5 provides sensible protection against data loss when using consumer hard disks - software RAID5 is readily available on linux and hard disks in the 2-300GB range are easily affordable. You can often pick them up for $50 after rebates. So I can get a TB of storage for a few hundred dollars, but to use hardware RAID5 would probably double the cost. Fine if you're an enterprise, but not fine if you're using it at home.

  37. The Inq by yem · · Score: 1, Flamebait

    Every Inquirer story I've clicked through to from slashdot has been subsequently debunked.
    Anyone got independant verification of this startling discovery?

    --
    No, I did not read the f***ing article!
    1. Re:The Inq by rrohbeck · · Score: 1

      Every Inquirer story I've clicked through to from slashdot has been subsequently debunked.

      That's because only the crappy ones are mentioned here.
      The important ones (e.g. http://www.theregister.co.uk/2005/11/04/black_heli copter_hybrid/) don't even make it here.

  38. Re:Why aren't you running a dedicated controller.. by jelle · · Score: 5, Insightful

    "I agree with this. For most people, backing up your data every week is a LOT better option for data security. Users who should be using RAID 5 should also have dedicated controllers."

    You're generalizing a little too much. For example: I have >1TB storage on my mythtv box (I just like to have a good selection of stuff to watch when I finally get to watch tv, and I'm never at home when the shows I like are being broadcasted), and I'm using software RAID5 on that. That is, software raid5, on shared controllers: All together seven disks off the mainboard, from a mixture of pata and sata connectors. I wouldn't do this on something like a server, but it's plenty fast enough for mythtv. It also gives a lot of protection for the array of disks, and it's a much, much better option than the weekly backup you suggest (first of all, a backup would take ages, cost waay more in disks (which wouldn't even fit in the HTPC), and last but not least: without raid5, if one disk dies, I could lose up to 7 days of recordings...).

    --
    --- Hindsight is 20/20, but walking backwards is not the answer.
  39. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    The problem persists even if you use a dedicated controller.

  40. Re:Why aren't you running a dedicated controller.. by kimvette · · Score: 1

    Not any more. With even consumer-level boards offering embeddedRAID5 and RAID 1+0/0+1 support at the $100-$150 price level, and with hard drives being uber-cheap nowadays, there is absolutely no reason you won't see an explosion of growth in the use of RAID5 in at least higher-end home and SOHO machines.

    --
    The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
  41. Re:Why aren't you running a dedicated controller.. by Nicolas+MONNET · · Score: 1
    If you're running raid5 it's probably in an enterprise setup.

    I have installed a software Raid5 at work for online backups of workstations. 250GB SATA disks cost nothing (~80?); it'd pain my anus to fork out a kilobucks or two to pilot them. Sorry if that's not enterprisy enough for you!
  42. Re:Why aren't you running a dedicated controller.. by Jedi+Alec · · Score: 1

    If you can't afford to do something right, don't do it.

    hope you're happy with the way your life is going ;-)

    --

    People replying to my sig annoy me. That's why I change it all the time.
  43. Re:Why aren't you running a dedicated controller.. by myz24 · · Score: 5, Insightful

    I agree, it seems on slashdot (and actually, some of my friends) that you're an idiot if you're not running RAID but your equally dumb if you're running RAID5 because it's not a backup solution. It's as if there can't be any gray area in the matter. People make it seem like RAID5 has no purpose or benefit and everyone should just be using striping+backup. To me, the point of RAID5 or other redundancy RAID setups is it's your first line of recovery for a disk failure. If a disk fails, you replace it and you've suffered little downtime. If something major happens then yes, you restore from backup.

    My other issue is with people forgetting the idea behind being sensible about what needs to be protected and how much it should cost. There is no reason why my personal collection of photos, music and video should cost me so much. Software RAID is way more than adequate for providing a cheap way to store my files. If data protection AND peak performance are what you need, then yes you need to go full hardware. WHERE'S THE MIDDLE GROUND PEOPLE?

  44. Re:Why aren't you running a dedicated controller.. by andrewman327 · · Score: 1

    I go to school in Washington DC and I live the rest of the time in Southeast Pennsylvania. Both places have been hit by floods recently, which should alert you all to use things other than RAID for backup. RAID does have its palce, as a devestating hard drive failure can become a minor 45 minute annoyance.

    --
    Information wants a fueled airplane waiting at the hangar and no one gets hurt.
  45. Re:Why aren't you running a dedicated controller.. by dfn_deux · · Score: 1
    You are talking out your rear if yout think that going to a hardware raid is going to "double the cost" of a TB+ storage box. You can pickup an nice 3ware card for a few hundred bucks and if you want to go on the cheap a tekram or similar card is less than 100 bucks now days.

    /me uses hardware raid at work (DataCenter) and at home and is perfectly satisfied with the price/performance ratio...

    --
    -*The above statement is printed entirely on recycled electrons*-
  46. Re:Why aren't you running a dedicated controller.. by lukas84 · · Score: 1
    hope you're happy with the way your life is going ;-)

    This isn't about life. It's about a professional decision as someone working for a company. Before deploying half-assed solutions, it's usually better to deploy nothing at all.
  47. Re:Why aren't you running a dedicated controller.. by Jeff+DeMaagd · · Score: 1

    It probably should be pointed out that many software RAID systems use a dedicated channel for every drive. RAID-5 on a SCSI hardware RAID adapter doesn't do this. The more drives you operate on the same channel, the bigger the issue can get. This could be an easy explaination as to why software RAID would be faster in your circumstance.

  48. Re:Why aren't you running a dedicated controller.. by Nom+du+Keyboard · · Score: 1
    most workstations and home PCs don't run RAID 5.

    I, for one, intend to run RAID 5 on my next home system, and not just for Geek Pride. Three hard drives with the advantage of mirroring for reliability, striping for performance, and the loss of only 1/3d of my hard drive space for this redundancy, verses 1/2 for mirroring alone, and no redundancy with striping alone. And since I'm doing this for improved performance, I want that performance. Three modern moderate performance hard drives are hardly expensive, and RAID 5 can make up for average performing of 7200rpm drives otherwise.

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
  49. Re:Why aren't you running a dedicated controller.. by Nom+du+Keyboard · · Score: 1
    If the Core/Core2 chip sets are targeted for machines that don't run RAID, it's not a big deal. If you are running RAID 5, it's likely in a server environment where you would probably have a RAID controller and a Opteron or Xeon based chip.

    If you'd read TFA you'd see that the problem has shown itself with a Woodcrest (the next Xeon) CPU using an IBM RAID controller.

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
  50. Re:Why aren't you running a dedicated controller.. by kimvette · · Score: 2, Interesting

    Actually the market has become so diluted with everyone's jumping into the RAID game (thanks to Highpoint Tech and Intel with their hybrid solutions) that it's becoming increasingly difficult to discern the true hardware RAID controllers from the hybrid models. Of course there are the companies that won't so much as touch software RAID (namely 3ware) but Promise, Koutech, and even Adaptec all are very slick with their descriptions of the controllers and make it unclear as to whether or not their products are actual RAID controllers of if they offload the processing to the CPU. If you want to give a small business (mom & pop or a larger business with a tightass PHB who sees IT as solely a cost center rather than an essential tool to keep things going) better assurance of data integrity than a single HDD will provide, and they're NOT willing to back up regularly, and obviously won't spend $300-$700 for an entry level GOOD RAID5 controller, then a hybrid solution may be all that you can offer them. Given that these controllers are being implemented on motherboards more and more now, the performance they provide has to be reasonable, without hogging the processor.

    Also, when you do find a hardware controller: will it run in your board? In other words, if it's PCI, do you actually have a PCI slot to fit it? This is especially a problem in a high-end consumer box or in a lower-end workstation, where you might have one or two PCI slots and the rest are PCI-E x1 slots. Where you're likely going to have a GOOD sound card and a capture card in your legacy PCI slots, or maybe a multi-port Firewire 400 card, where is the hardware RAID controller going to live? Obviously the solution is going to be to go with an embedded solution on the motherboard, hopefully with a model that doesn't totally suck.

    --
    The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
  51. Oh Ye of Little Faith by muffinass · · Score: 1

    You seem to be resisting the group-think... surely you didn't read TFA, as your sig suggests. Here ya go.

    1. Re:Oh Ye of Little Faith by yem · · Score: 1

      Hah! I forgot about that .sig - I have them hidden, usually.

      For the record, no I still haven't read TFA. Though I did break my unwritten rule of not reading slashdot stories whose title ends with a "?".

      --
      No, I did not read the f***ing article!
  52. Re:Why aren't you running a dedicated controller.. by DudemanX · · Score: 2, Insightful

    I'd also like to see how a Pentium D would perform in the same system seeing as how it's socket compatible with Conroe. This will help isolate whether it is indeed the CPU or if it's more to do with the I/O subsystem(chipset).

  53. Re:Why aren't you running a dedicated controller.. by norton_I · · Score: 1

    This isn't really reasonable. Software RAID can be as good as many hardware RAID configurations. Modern CPUs are very, very fast, and in many cases can calculate parity faster than dedicated controllers, at the cost of some CPU overhead. However, the cost of the CPU can be much less than the controller. Also, large disk cache helps reduce the read-pairty-write overhead from RAID 5, but most systems have more cache than the drive controller you would pair them with. Finally, if my computer has a UPS, the value of battery backed cache is limited.

    Obviously there are many cases where hardware raid is desirable, but to say that any "real" system must use the most expensive hardware possible is just wrong.

  54. Oops. by Jerk+City+Troll · · Score: 1

    Did you get a refurbished drive that was once part of a RAID-5 array? ;)

  55. I dunno I've had bad luck with Raid5 by vertinox · · Score: 1

    I tried Raid5 once with 5 hard drives but one drive had a failure while I was swapping out one and the whole thing went kaput so I had to do the image all over again.

    For saftey's sake, I just used 6 hard drives with 2 pairs striped and then had those drives mirror each other with 2 extra that would go to a drive when one failed. So in theory I could loose 3 drives instead of two and still keep my data.

    And yes... This was on my personal setup for no good reason other than a big ego, but in reality Raid5 isn't that useful ore efficient unless you are using enteprise applications that requires 100% uptime and you have way more than 3 hard drives (just in case two of them fail on you at once for no particular reason) and then you should have that server mirrored by another one so if one server fails because of a bad motherboard/powersupply etc, you'll have another server ready to go.

    --
    "I am the king of the Romans, and am superior to rules of grammar!"
    -Sigismund, Holy Roman Emperor (1368-1437)
    1. Re:I dunno I've had bad luck with Raid5 by Gazzonyx · · Score: 1

      I keep my source code on a linux box. I'll trade 1/3 disk space as parity for the ability to sleep well at night. Effeciency takes a back seat to sleep.

      --

      If I mod you up, it doesn't necessarily mean I agree with what you've said, sorry.

    2. Re:I dunno I've had bad luck with Raid5 by fdisk3hs · · Score: 1

      Score:0 more like it. Hey vertinox, if you run RAID5 and you lose two drives that have both copies of one chunk, you have lost that chunk.
       
      2 chunks minus 2 chunks equals zero chunks, get it?
       
        "...in reality Raid5 isn't that useful ore efficient unless you are using enteprise applications that requires 100% uptime and you have way more than 3 hard drives (just in case two of them fail on you at once for no particular reason) and then you should have that server mirrored by another one so if one server fails because of a bad motherboard/powersupply etc, you'll have another server ready to go."
       
      So, what you are saying, is if you lose your data, you should do something better so you won't lose your data? Thank you, John Madden of sysadmins. "If the referee throws a flag, and he didn't make a mistake, there's probably going to be a penalty."

    3. Re:I dunno I've had bad luck with Raid5 by drsmithy · · Score: 1
      For saftey's sake, I just used 6 hard drives with 2 pairs striped and then had those drives mirror each other with 2 extra that would go to a drive when one failed.

      You are describing RAID 0+1 (mirrors over stripes). RAID 1+0/10 (stripes over mirrors) is more reliable as it potentially allows more drives to fail before data is lost and takes less time for replaced drives to resync.

      (Although with only four drives there is no difference.)

    4. Re:I dunno I've had bad luck with Raid5 by toddestan · · Score: 1

      I tried Raid5 once with 5 hard drives but one drive had a failure while I was swapping out one and the whole thing went kaput so I had to do the image all over again.

      The problem with RAID 5 is that when you have one drive die, you have to do some pretty intense I/O operations over several hours on the remaining drives in order to rebuild the array. If a drive is going to fail, it will probably fail during this process. Add to the fact that the drives in a RAID 5 are almost always identical models, and quite often have sequential serial numbers, and have been used under the same conditions for the same amount of time - you can pretty much assume that when one drive dies, the rest of them in the array are about to succumb to the same failure. So in other words, be sure to keep backups.

    5. Re:I dunno I've had bad luck with Raid5 by Bishop · · Score: 1

      (Although with only four drives there is no difference.)

      There is a difference with only four drives. A stripe of mirrors, RAID 1+0/10, can sustain two failed drives provided the second drive is part of the second pair. The result of the two drive failure is a stripe of two drives. A mirror of stipes, RAID 0+1, can only survive a single drive failure. When a single drive fails the result is a stripe of two drives and a third drive doing nothing. Draw a picture and it will make sense.

      I can't think of any advantage to RAID 0+1.

    6. Re:I dunno I've had bad luck with Raid5 by drsmithy · · Score: 1
      There is a difference with only four drives. A stripe of mirrors, RAID 1+0/10, can sustain two failed drives provided the second drive is part of the second pair. The result of the two drive failure is a stripe of two drives. A mirror of stipes, RAID 0+1, can only survive a single drive failure. When a single drive fails the result is a stripe of two drives and a third drive doing nothing. Draw a picture and it will make sense.

      I've drawn many pictures teaching others. You can sustain two drive failures with a RAID 0+1, as long as both drives are the same stripe pair. I can see your argument that the second drive in a stripe is pointless once its partner dies, but it's still not a failure of a physical disk.

      Maybe I should have been more specific :). In terms of physical disk failures, there's no difference between a four drive RAID 0+1 and a four drive RAID 10. Both can sustain two failed drives, although the chance of the second drive taking down the entire array is substantially higher than it is with >4 drives.

      With four drives - if write performance is not critical - you're better off with RAID6, or to a lesser degree RAID5 with a hot spare.

    7. Re:I dunno I've had bad luck with Raid5 by miro+f · · Score: 1

      if you need more security then you could always just go RAID6 which has two parity bits allowing for two drives to fail without destroying the system.

      The odds of three drives failing at a time are pretty low.

      --
      being vague is almost as cool as doing that other thing...
  56. Mod up, please by SaDan · · Score: 1

    No kidding. That part caught my eye, as we use IBM where I work, and everything is RAID-5. If there's some kind of problem with even the lower end xSeries machines, this will affect our purchasing.

  57. Re:Why aren't you running a dedicated controller.. by RingDev · · Score: 1

    Sorry, my bust, I'm illiterate.

    -Rick

    --
    "Most people in the U.S. wouldn't know they live in a tyrannical state if it walked up and grabbed their junk." - MyFirs
  58. Re:Why aren't you running a dedicated controller.. by bano · · Score: 1

    Your looking for "improved performance" out of raid5?
    Never used it, or even read about it I'm guessing.
    Raid5 while excellent for redundancy, is quite slow when you are talking about alot of writes.

  59. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    "I could lose up to 7 days of recordings..."

    Oh the humanity..

  60. Re:Why aren't you running a dedicated controller.. by temojen · · Score: 4, Insightful
    If so, why aren't you running a dedicated controller?

    Because if your dedicated controller goes you have to find the same make & model of controller. On no notice. Possibly a few years after that make and model has been discontinued.

    With software RAID-5, any controller that works with your host bus (PCI) and HDD bus (ATA, SATA, or SCSI) will do just fine.

  61. Re:Why aren't you running a dedicated controller.. by ocbwilg · · Score: 3, Insightful

    Most onboard (intel) RAID controllers are only setup for 0,1,0+1, or 10. And not RAID 5. I don't see how it could possibly be correlated to the CPU.

    That's because you can do RAID 0, 1 or any combination of 0 and 1 without needing parity data. The performance killer on RAID 5 (and any other form of RAID that requires parity) is in the XOR operations used to compute and verify the parity information. In order for RAID 5 to perform at a satisfactory rate and not totally bog down your CPU, the XOR calculations should be handled on a dedicated hardware controller, not in software.

    However, for non-parity RAID setups the amount of CPU overhead is almost trivial, so referring to "fake RAID" or "software RAID" with the integrated RAID controllers on most motherboards is a misnomer. That being said, at least one of these articles is talking about servers using third-party RAID controllers.

  62. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    RAID5 for less than 100$? Perhaps if you go with PATA (or those dinky 2 drive cards), but all SATA RAID (or SCSI) cards I've seen were all a LOT more than that...

    Yes, Tekram has a TR-824 card for 50$, but it's not RAID5, and it's only old PCI (not PCI-X or PCIe) so not too fast.

    Find me a 4 port PCIe SATA RAID card that does level 5 for ~100$ and I'm buying 4 today for myself. Willing to pay a bit more than that for more ports or a nicer/better card, but not the extorsion prices for the 3ware and such cards.

  63. Question by ktlewis02 · · Score: 0, Offtopic

    Why does Slashdot still accept article submissions from that piece of shit excuse for a website..?

    1. Re:Question by Ancil · · Score: 1

      Are you talking about dupes? Or the Inquirer?

    2. Re:Question by ktlewis02 · · Score: 1

      TheInq..

  64. Re:Why aren't you running a dedicated controller.. by (negative+video) · · Score: 1
    It doesn't seem like it would be that difficult to pin the blame down to the particular component: is it the integrated RAID subsystem utilizing the processor inefficiently? Or is it the processor itself, being slow? And if it was the processor, why wouldn't this slowness be exhibited in other situations?
    I suspect it's a driver problem: programmed IO instead of DMA. It's a standard gotcha in the cesspool that is commodity computers.
  65. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    Given hwo cheap disks with rebates are (and since we probably all have extra computers) a few hundred dollars probably IS doubling the cost of such an array. As for the cheap cards? Well have fun finding 3 cheap 333+ gb drives to get that 1TB array, because I doubt any of them will support more than 4 drives per controller.

  66. Re:Why aren't you running a dedicated controller.. by SlashDread · · Score: 1

    Because hardware controllers add a failure point.
    If your not going for ultimate disk performance, and your cpu isnt overly burdened, the use of software raid is sensible.

  67. Re:Why aren't you running a dedicated controller.. by ocbwilg · · Score: 1

    If you're running raid5 it's probably in an enterprise setup. If so, why aren't you running a dedicated controller? The CPU should have little to no impact on the raid subsystem...

    The first of the two referenced articles talks about them using Woodcrest CPUs (the Conroe-based replacement for the Xeon server CPUs) on IBM systems that used IBM ServeRAID controllers. They obviously aren't talking about the Intel RAID controller integrated into the chipset. And while I agree that the CPU should have little to no impact on the RAID subsystem, especially if they are using third-party controllers, apparently that's not the case here. As to whether the problem lies in the CPU or in the mainboard chipset itself isn't mentioned, but it's certainly possible that there is a chipset bug related to high-bandwidth operations. The FSB architecture is relatively starved for memory bandwidth to begin with. Generally speaking, for a given number of hard disks a RAID 5 array will have the highest level of read throughput of the commonly deployed RAID versions. Could so much data be flooding the bus?

  68. Why aren't you reading the article? by SaDan · · Score: 1

    They specifically stated this was seen with an IBM ServerRAID controller. Not exactly a non-dedicated controller.

    It IS the Inquirer, though, so the required grain of salt has been taken at my end.

    1. Re:Why aren't you reading the article? by lukas84 · · Score: 1

      Unfortunately, you're wrong.

      There's the 7e and similar lines of ServeRAID Controllers, which are Fakeraid.

      The 6i and 7k are "real" raid controllers, the 7e is not. Its even labelled as "Adapted HostRAID" in the BIOS.

    2. Re:Why aren't you reading the article? by SaDan · · Score: 1

      They said dedicated card, which was used in another system (non-Intel). This means it was a PCI or PCI-X board, not the onboard Fake RAID, and not a 7e.

      Thanks for checking some of these facts out before running your virtual mouth.

    3. Re:Why aren't you reading the article? by lukas84 · · Score: 1

      From the article:
      The reason was that there were severe problems when Woodcrest was paired with a 1E RAID field when using IBM ServeRAID controllers. The problems didn't occur just in benchmarking, it was the every-day usage model that produced unexpected errors.

      I don't see anything mentioned about a dedicated card...

      The second article even specifically mentioned fakeraid

      It seems to us that Intel's Core 2 is suffering from a specific overhead when on-board RAID controllers are used. If you're unfamiliar with current on-board RAID5 controllers, let's just say that they really look like soft-modems and soft-sound cards, utilising power of the CPU for everyday work.

    4. Re:Why aren't you reading the article? by SaDan · · Score: 1

      My apologies, I apparently read too much into the part where they tested a similar configuration on the non-Intel hardware, and not enough in the section where they mentioned the fake RAID controllers onboard in the systems.

      I've been building x346 systems lately, so I must have had hardware controllers on the brain.

  69. Likely a driver bug by jgarzik · · Score: 3, Insightful
    This crap does not happen on Linux, on the same hardware. Most likely *BSD is not affected either, though I have not tested such.

    It's almost a certainty that this is a software problem of some sort. Driver bugs are the most common source of "hardware" instability, particularly on Windows. Drivers are often written by clueless intern-level engineers, and quickly forgotten once the drivers initially pass based Windows hardware quality tests.

    Jeff, the Linux SATA driver guy

    1. Re:Likely a driver bug by fdisk3hs · · Score: 2, Insightful

      Yeah, so if this is software RAID, what OS are they using? XP I guess?

      Why is Intel's hardware bad just because Windows performs poorly on it?

      I'm a little surprised that, all the way through BOTH articles and this thread, it took this long for someone to ask if it was a HARDWARE or SOFTWARE issue.
       
      As Carlos Mencia would say, "Dee Tuh Dee".

    2. Re:Likely a driver bug by ArbitraryConstant · · Score: 1

      Indeed. A driver issue could easily be causing these problems, particularly if it's a new chipset and so forth as well.

      --
      I rarely criticize things I don't care about.
    3. Re:Likely a driver bug by Anonymous Coward · · Score: 1, Interesting

      I oversaw testing of those drivers at Intel in Hillsboro, OR. Believe me when I say, the practices at the WHQL labs and testing for the RAID stuff is scary. When we were doing the bulk of the RAID work on the chipsets, we were transitioning over to a new bug tracking system, nobody knew what was going on, and many many bugs and issues were lost or simply not filed or followed-up on. I raised hell with several people over the situation and the development deadlines - we had to adhere to the workweek schedule no matter what, no matter what, no matter what, the Intel Borg says, and you jump. I dropped it. Personally I am so SO not surprised. I saw this whole thread... my initial reaction was Ha Ha. It's smug face day around the office as this is making the rounds.

  70. ServeRAID controllers... by Anonymous Coward · · Score: 0

    ...are shite. Use an HP (compaq) SmartArray 64xx or an LSI MegaRAID 320 series controller. They're fast and quite reliable. I especially like how the MegaRAID will store it's configs on each and every array member hard drive as well as the card's eeprom itself.

  71. Great, get rid of RAID5 by Anonymous Coward · · Score: 0

    Check out http://baarf.com/

    BAARF = Battle Against Any RAID Five

    Doesn't seem like a good idea anyway.

  72. Re:Why aren't you running a dedicated controller.. by SaDan · · Score: 1

    It wouldn't explain why he was faster if he was doing software RAID with SCSI, though.

    I've done software and hardware RAID-5 with SCSI, using the same drives and cable, and can confirm that software RAID can be faster than a dedicated hardware controller, all drives and cabling remaining the same. Some hardware controllers can be pretty darn slow.

  73. Re:Why aren't you running a dedicated controller.. by Some_Llama · · Score: 2, Informative

    "3ware card for a few hundred bucks "

    The nice 3ware cards for 100 bucks are NOT hardware raid, they use the CPU to calculate the RAID, it might even say it is in the literature but working at company (tech support) who sells servers that use 3ware for 80% of it's business, I can definitely tell you this isn't the case.

    You CAN get a hardware based 3ware card, but then you are looking at 400-500 bucks (+some for the battery backup unit).

    Plus if you read the parent correctly, 4 300GB hard drives for 50 bucks totals 200 bucks, a "3ware card for a few hundred bucks " WOULD double the cost.

  74. Does B-2 Step Fix This? by Nom+du+Keyboard · · Score: 1
    Interesting post from: www.warp2search.net

    Intel hints that Conroe is going to release at B-2 Stepping as Intel Core 2 Duo processor. As for the previous version, a problem was found to make the system full loaded. It's only solved in the new stepping. We don't encourage anyone to buy the engineering sample from the web. The retail version is going to release in the end of this month, and it's much stable.

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
  75. Re:Why aren't you running a dedicated controller.. by GoRK · · Score: 2, Informative

    I've never heard it called FAKERAID maybe it should just be called FAID? I'll file that one back for use later...

    Anyway, it's not entirely a hw/sw combo. These types of raid controllers are entirely software based. They consist basically of an ata or sata controller and an interrupt handler. When the disk is being accessed in legacy bios mode (ie during an os install, etc) the cpu pulls the interrupt to write to the disk and the BIOS calls the software stored on the card. This software is executed by the BIOS on the CPU and this code basically does whatever reading or writing to disk is necessary to keep the array consistent. Performance is improved after an OS's native driver is loaded since the software raid is done entirely in the driver.

    It is arguable that software raid done at the OS level actually can have a performance advantage since it can know about file tasks at a much higher level than the driver itself, which basically only knows about block accesses. In the example presented in the article it seems that most of the tricks that software raid has been using do not perform well on the Core2 chips; I don't really see what the big deal is though; nobody has had a chance to write optimized routines for Core2. Probably their legacy BIOS handling ability is poor; so be it. There are a lot of alternatives out there from completely managing the array inside of software to full on ASIC-based hardware RAID.

  76. Re:Why aren't you running a dedicated controller.. by Some_Llama · · Score: 1

    "Modern CPUs are very, very fast, and in many cases can calculate parity faster than dedicated controllers,"

    Especially given the fact that most CPU usage is less than 18% when calcualting parity for RAID 5, so compare 18% of your CPU cost and see if it is worth it to lose that much over head (or buy the next higher model) OR pay 400+ for hardware RAID.

    I'd say RAID 5 is the best solution for home backups, you have redundancy for disk failure and with monthly or even quarterly backups you should be fine. (who wants to spend 12 hours a week backing up to tape?).

  77. omfg by Anonymous Coward · · Score: 0

    Can you niggers post real stories instead of spam/trash/rumors of beta software from spam/trash/rumor websites like theinquirer.net?

  78. Re:Why aren't you running a dedicated controller.. by Some_Llama · · Score: 1

    "Of course there are the companies that won't so much as touch software RAID (namely 3ware)"

    Until the 9500 (and MAYBE 9000) series ALL of 3ware's offerings were software RAID. this is why you couldn't initalize a RAID array until the OS booted (and 10 minutes after that), it even says this in the manual.

    I worked in tech support for a company that sold these for 2 years and talked with the 3ware techs quite extensively on the subject.

    As a company they don't officially admit it, but from a technical standpoint it is obvious they only supported software RAID.

  79. Re:Likely a driver bug - Excuse Me, But!!! by Nom+du+Keyboard · · Score: 0
    This crap does not happen on Linux, on the same hardware...It's almost a certainty that this is a software problem of some sort.

    Excuse me, but isn't the function of the CPU to efficiently execute software written in its instruction set? If some CPU instructions are more equal than other CPU instructions, Intel should have said so a long time ago. That's like saying that its the fault of the road because some cars traverse it better than other cars. If the road is optimized for certain car types, it should be publicly disclosed.

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
  80. Re:Why aren't you running a dedicated controller.. by dfn_deux · · Score: 1
    If you are demanding pci-e then you are going to pay more... Duh...

    Although if you really want someone to do your legwork for you I'd point out that there are cards in the 100 dollars range with 4 channel hardware raid 5 from LSI, Highpoint, and Promise. I can't really speak to the quality of these cards as I use 3ware cards myself. But as with most things the more $$$ the more features, support, and reliability. Then again I don't really understand the need for super high storage speeds for TB+ systems in a home environment. Most TB+ home users are probably going to use that space for a big ass file dump where they toss all their music and video files and perhaps stream them to other devices... This isn't really heavily IO intensive...

    --
    -*The above statement is printed entirely on recycled electrons*-
  81. Re:Why aren't you running a dedicated controller.. by SigILL · · Score: 1
    why aren't you running a dedicated controller?

    Because such controllers use a proprietary disk format. So when (not if) your controller breaks, you need to get a new one to be able to read your data. Whereas when using a free (libre) operating system and software raid, you'll always be able to get to your data.
    --
    Error: password can't contain reverse spelling of ancient Chinese emperor
  82. Re:Why aren't you running a dedicated controller.. by pH7.0 · · Score: 1

    "Why aren't you..."

    May be you should ask Intel?
    The D975XBX motherboard have on-board RAID5 and sound. Is it an enterprise setup or not?

  83. Re:Why aren't you running a dedicated controller.. by Some_Llama · · Score: 1

    I dunno about that, with U160 and U320 you would have to use at least 5 drives or more to max out the bandwidth that U160 provides and upwards of 8 or more for U320.

    No RAID systems use one channel per drive, maybe you are thinking connector, in which case that would be SATA, and possibly IDE to maximize bandwidth.

    Most RAID cards have more than one channel per card, but multiple connectors per channel (7-15 on SCSI, up to 12 that I have seen on SATA, and 2 per channel for IDE).

  84. Re:Why aren't you running a dedicated controller.. by drinkypoo · · Score: 1

    RAID5 is extremely common in the home media appliance/server area. It's a small part of the market, but also a part of the market fairly likely to buy a core duo - they're lower-power than most competitors and they have enough horsepower to do full-HD video.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  85. Re:Why aren't you running a dedicated controller.. by drinkypoo · · Score: 1

    It makes more sense to use ordinary software RAID than to use half-hardware RAID. Put the OS on its own (small) disk and stuff all your data into the software RAID. The funny thing is that software RAID can often give better performance than hardware RAID, so long as the box isn't using all its CPU for something else, because your primary CPU has more powerful than almost any RAID controller.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  86. Re:Likely a driver bug - Excuse Me, But!!! by Anonymous Coward · · Score: 0

    Linux actually has a number of different algorithms for doing the software RAID5 XOR operation and tries which one performs best on the given CPU during bootup. On certain other operating systems, this is typically implemented in each device driver, so the driver author may have chosen an algorithm that performs well on Pentium4, but not on Core2.

  87. Re:Why aren't you running a dedicated controller.. by jelle · · Score: 1

    "I could lose up to 7 days of recordings..."
    Oh the humanity..


    True, it's only TV, but still, better with raid5 than trying to do weekly backups...

    ps: TV does become a whole lot more interesting when you can choose from a large set of recorded shows, plus when you can play it back at 1.4x real-time speed with mythtv's fantastic 'time stretch', which speeds up TV shows without distorting the sound.

    --
    --- Hindsight is 20/20, but walking backwards is not the answer.
  88. Re:Likely a driver bug - Excuse Me, But!!! by ArbitraryConstant · · Score: 2, Insightful
    If some CPU instructions are more equal than other CPU instructions, Intel should have said so a long time ago.
    Yeah, I'm outraged!

    Oh wait, they did. Not only did they say so a long time ago, they publish documentation and maintain a compiler to help you optimize for the way their processors work.
    --
    I rarely criticize things I don't care about.
  89. Re:Why aren't you running a dedicated controller.. by dfn_deux · · Score: 1
    Not to be a total pedant, but if you are going to say that because you can buy *A* 300GB disk for 50 AFTER rebates that 4*300GB is going to be 200 bucks then you have a very perilous grip on reality... That said, he said "double the cost of the box" so eve with your $200=1.2TB of storage arguement you are still not factoring the cost of the MB, PSU, chassis, etc... Which will certainly add another few hundred tot he cost of the box... So this theoretical box is still like $400-600 before the cost of a 4 channel SATA raid solution comes into the equation... There are certainly raid solutions which are cheaper then $4-600 which are of high quality and do not offload the proccessing...

    I'm just trying to be real here folks....

    --
    -*The above statement is printed entirely on recycled electrons*-
  90. Software RAID not CPU-bound on modern hardware by ottffssent · · Score: 1

    I just built a new file server for home use, using 4 300GB SATA drives and the slowest socket-939 CPU I could get (1.8Ghz Athlon 64) in a RAID5. I'm getting pretty close to the limits of the individual drives during testing, with just over 100MB/sec writes and just under 200MB/sec reads. CPU usage remains comfortably under 50% (that's the spikes; average is more like 20%) in that configuration, and dmesg reports my RAID6 checksumming speed at over 4GB/sec.

    With the drives on the PCI SATA controller, I'm bus-limited (writes much faster to a degraded array, because only 3/4 as much data is actually being written to disk). With the drives on the chipset controller, I seem to be limited by the drives themselves. Hopefully later this summer I will be building a larger array and will be able to do more tests, but even with 8 drives I doubt I will be bottlenecking on the CPU.

    1. Re:Software RAID not CPU-bound on modern hardware by TwinkieStix · · Score: 1

      I thought that onboard controllers still attached to the system bus, but through hard-wire instead of through connectors. Is that not the case? If it is, then the PCI adapter should go just as fast as an onboard IDE controller assuming all other things are equal. Again, if this is the case, then you are not limited by the BUS; you are limited by the PCI controller itself.

      Am I incorrect here?

    2. Re:Software RAID not CPU-bound on modern hardware by ottffssent · · Score: 1

      The four SATA ports provided by the southbridge do not go through the PCI bus. They are part of the chipset, and connect to the CPU via hypertransport. The PCI SATA controller is onboard as well, but is a separate ASIC, connected to the southbridge via the PCI bus. On my particular board, there is an extra PCIe lane available which would have been a better choice for the extra SATA ports (250MB/sec each way theoretical bandwidth), except that Silicon Image currently makes only a 2-port PCIe chip, whereas the Sil3114 is a 4-port PCI part.

  91. Re:Why aren't you running a dedicated controller.. by gonk · · Score: 1

    RAID is not a backup solution. Anyone who tells you differently is either a fool or trying to sabotage your data.

    robert

  92. Don't use RAID5 by elronxenu · · Score: 1
    Don't use RAID5. It's slow to write, if you lose a disk then read performance is significantly degraded until you get another, rebuilding is slow, and if you get an unreadable sector during rebuilding you'll probably lose your entire array.

    See www.baarf.com

    1. Re:Don't use RAID5 by Anonymous Coward · · Score: 0

      So, it's better to use one disk, and KNOW that you'll lose everything? Come on. What is your point exactly?

    2. Re:Don't use RAID5 by Ayanami+Rei · · Score: 1

      It's slow to write
      Only if you allow partial writes (make sure you use a correctly configured filesystem). Okay so it's still N% slower, but you've got caching and you'll probably hit the limit of your bus first anyway.
      if you lose a disk then read performance is significantly degraded until you get another
      Always have spare disks. I like to use the baker's dozen rule... for every 12 you buy, get a 13th.
      rebuilding is slow
      True. But modern systems (software RAID) let you tune the rebuild performance.
      Also, you should consider the following:
      If you get a failed disk in a RAID (any RAID), the next thing you should do BEFORE a rebuild is to MAKE A BACKUP.
      Because a rebuild is write-intensive and could trigger further failures if you're having disk remapping issues on your drives.

      and if you get an unreadable sector during rebuilding you'll probably lose your entire array.
      For this reason (and the clustered failure scenario mentioned above) you want to use drives that have functioning SMART implementation and that you know do internal sector remapping. Do research on your drives before you buy them. And replace disks as before they reach the 50% mark on remapped data.

      --
      THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
    3. Re:Don't use RAID5 by Lehk228 · · Score: 1

      the documents cited by baarf are like a thousand years old, they focus on SCSI and claim "recently popular" IDE/ATA drives do not have sectors reserved for remapping

      --
      Snowden and Manning are heroes.
    4. Re:Don't use RAID5 by Anonymous Coward · · Score: 0

      That raid 5 is the worst of the available raid options?

  93. Re:Why aren't you running a dedicated controller.. by d-rock · · Score: 1

    Sorry, I don't know about the whole line, but the 7500 and 8000 series are definitely hardware. The 7500 page specifically talks about the hardware XOR engine:

    http://www.3ware.com/products/parallel_ata.asp

    The 8000 page also says: "True hardware RAID protection for all your valuable data"

    http://www.3ware.com/products/serial_ata8000.asp

    Not to beat a dead horse, but the Escalade 7000/8000 confiuration manual specifically discusses using the BIOS utility (invoked through Alt-3 during boot) to create/modify arrays:

    http://www.3ware.com/support/UserDocs/7000IG_04290 3.pdf pages 25-27

    Where do you see that any of their products are software?

    Derek

    --
    Don't Panic...
  94. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    No kidding? Did you read his comment? RAID is a good first line of recovery against disk failure. Then he goes on to say that if you have something more serious you restore from backups.

  95. Re:Why aren't you running a dedicated controller.. by Salamander · · Score: 2, Interesting

    If it happens with a dedicated controller such as ServeRAID, then my first hunch would be that the chipset isn't handling memory contention very well. We used to see this at Dolphin a lot; the Intel chipsets at the time would behave terribly if there was any kind of serious memory traffic coming from the "far side" of the memory controller. This could also be a problem on the "softmodem-like" RAID controllers, where one core is trying to bring previously DMAed data in for its XOR while the other is trying to do its normal stream of memory accesses. It wouldn't be quite the same kind of thrashing as in the previous case, but it's very easy to imagine that it would still occur.

    --
    Slashdot - News for Herds. Stuff that Splatters.
  96. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 1, Funny

    "It's as if there can't be any gray area in the matter."

    No, the problem is that there's no grey matter in the area.

  97. The middle ground here... by Anonymous Coward · · Score: 0

    ...is a dvd writer. If you're generating more than 8 gig of backup-worthy data a week as a home user then you need to put down the digicam and get another hobby.

  98. I must strongly disagree. by Ayanami+Rei · · Score: 1

    We run very "enterprise" setups where I work and you're right, it's "enterprisey" to go full on hardware RAID.
    And deal with all the headaches that leads to:

    1) Poor OS integration
    ** 3Ware
    ** Dell PERC
    ** Adaptec
    need I go on?
    You get alarms coming from inside the unit and you have no idea what the fuck is going on until you reboot and drop into the BIOS.
    It's only recently that 3Ware started shipping a WBEM data source that could let you know what was going on with an array from MMC, etc. etc.
    With Dell you have a propietary MMC snap-in which is a pain in the ass to keep straight depending on the particular PERC flavor you're using and firmware revision of your card.
    *nix or BSD support? A shitty CLI or nothing. Or maybe a web management tool (WTF?)
    There is no standard for this sort of thing and it pisses me off to no end. Hell, there's a fucking standard for removable drive enclosures, but not for offloaded RAID volumes.

    2) Shitty firmware / embedded system design
    I've been burned one too many times by firmware bugs and NVRAM failures.

    3) Propietary on disk layout
    * No way to swap drives between systems with different RAID engines
    * Have to prepare unlabeled disks in BIOS/firmware tool before it can be used
    * Boot floppies for OS installs onto array. God, what is this, 1992?

    The only RAID offload I ever use is complete offload. Seperate NAS/SAN devices with an embedded OS that present the volumes as LUNs or network shares. I've found these to be engineered to a much higher tolerances and you can always connect a TTY to it or SSH in and figure out what's going on with it.

    Other than that we completely standardize on software RAID. CPU performance these days is sufficient enough that RAID-5 isn't a particularly notable hit. We'll actually buy 3WARE 95xx cards, and put in 12 JBOD disks just so we can have hotswap. :-/
    It's really nice to be able to pull a subset of disks set up for softraid in Linux, and plug it into any old machine and have it magically appear.

    So yeah, software RAID is your friend.

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
    1. Re:I must strongly disagree. by Nutria · · Score: 1
      You get alarms coming from inside the unit and you have no idea what the fuck is going on until you reboot and drop into the BIOS.

      That's a pathetic enterprise, then.

      A Real Enterprise doesn't use Dell Crap. It uses Tier-1 h/w and proprietary operating systems that the vendor has certified all works together. And if it doesn't work right, your EVP calls their EVP, and Action Is Taken, even if that means the Senior OS Engineer lives at your DC until it's fixed.

      I've found these to be engineered to a much higher tolerances and you can always connect a TTY to it or SSH in and figure out what's going on with it.

      That's sooo 1980s. DEC (then Compaq then HP) has let you "telnet" into storage controllers since 1985. Their original CI controllers were really custom-configured PDP-11s running an RTOS, and all of it's storage controllers (not cards) since then have let you plug Ethernet or a serial VT into them.

      I'm sure that IBM mainframe controllers let you do the same thing, but have no direct knowledge.

      --
      "I don't know, therefore Aliens" Wafflebox1
    2. Re:I must strongly disagree. by lukas84 · · Score: 1

      1) I agree. That's why you standardize hardware in an environment. We use HP CCISS for our linux boxes, and the "real" variants of IBMs Serveraid. For the first, we have a Nagios Plugin, and for the second we have a MoM Plugin. Everything goes through our centralized notification and network managment system.

      I agree that software raid isn't bad. We usually use it in "cheaper" machines like ALGs, Firewalls, VPN Concentrators, Domain Controllers (i.E. anywhere where disk speed doesn't matter).

  99. Lot's O' Pr0n? by Anonymous Coward · · Score: 0

    Damn, let me download some!

  100. Re:Why aren't you running a dedicated controller.. by courtarro · · Score: 2, Funny

    I've never heard it called FAKERAID maybe it should just be called FAID? I'll file that one back for use later...

    No, it's aFRAID array. Oops, forgot the space.

  101. Watch out though... by Ayanami+Rei · · Score: 1

    RAID5 can cause extra bus traffic to and from the controller. You need to check to see if your bottleneck is the card itself when you like 8+ drives attached.
    It helps if you have more than one controller on seperate busses and you can stripe across them RAID50-style.
    (Recommended on Opteron based systems with multiple CPUs and MCPs)

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
    1. Re:Watch out though... by Joe+The+Dragon · · Score: 1

      Sound like it may be a good idea to have a HTX based raid card some day.

  102. Where are you getting 10gbps from? by Ayanami+Rei · · Score: 1

    Even 1gbps is a whole fucking lot of sustained disk traffic.
    The example is hyperbolic.

    You're better off using multiple RAID5s instead one large one, or better yet, using JFS or something that understands stripe sizes so it doesn't cause unnecessary parity reads for partial writes.

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
  103. Re:Why aren't you running a dedicated controller.. by Nefarious+Wheel · · Score: 1
    IBM ServeRAID units described in the Inquirer article have "Adaptec" written on the card. And we've had a number of problems with them on our IBM x226 servers, which we buy by the 100's -- not reliability or performance problems, but watch your BIOS compatibility as IBM shifts part numbers mid-delivery and blows up your builds.

    Backup is a simple issue and a huge issue, and the playing field drifts. Intuitively simple, but very complex to manage when you grow into the enterprise space.

    An interesting issue for us, because the moment we did away with MT backup and replaced them with nice little SEAGATE USB external hdd's (we're *very* distributed) they started getting nicked, so the security people keystone locked the little items next to the server. Result is we're keeping the backup media next to the server, even after one site burnt to the ground. Comedy is not pretty. I suggested burying a unit in a floor safe underground (cable through the sealed cash slot) but the only people who thought that was a good idea was the Fire Department. RAID ain't backup, it's hardware fault-tolerance. There's a difference.

    --
    Do not mock my vision of impractical footwear
  104. Re:Why aren't you running a dedicated controller.. by drsmithy · · Score: 5, Informative
    That's because you can do RAID 0, 1 or any combination of 0 and 1 without needing parity data. The performance killer on RAID 5 (and any other form of RAID that requires parity) is in the XOR operations used to compute and verify the parity information. In order for RAID 5 to perform at a satisfactory rate and not totally bog down your CPU, the XOR calculations should be handled on a dedicated hardware controller, not in software.

    No, no, no, no. The processing overhead of parity calculations is miniscule on any remotely modern CPU (even a paltry 300Mhz Pentium 2 has a parity throughput of ~700M/sec).

    The performance killer on parity-based RAID configuration is the additional disk reads required to calculate the parity, *not* the parity calculations themselves. Which is why modern software RAID is typically faster than hardware RAID until you get into larges numbers of disks and/or machines with limited bus bandwidth.

    This "RAID 5 is slow because of parity calculations" meme must die (although, admittedly, it's a good indicator of whether or not someone really understands what's going on).

  105. Re:Why aren't you running a dedicated controller.. by drsmithy · · Score: 0
    Because it's often slower to do so. We ran tests on a good Adaptec u320 raid controler about a year back and though cpu usage was good. We got much better performance out of Linux softraid5. I would suspect this was because the host cpu was faster than that on the controler.

    It's got nothing to do with CPU power. It's because the OS almost certainly has vastly more memory available (how much RAM in the system vs on the controller ?) and is able to better utilise that to cache data and avoid physical disk activity.

    The (relative) slowness of RAID 5 does *NOT* come from calculating parity, it comes from the additional physical disk activity getting the data to caclulate the parity (and the subsequent disk activity to update parity and/or data blocks on disk) entails.

  106. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 1, Insightful

    Users who should be using RAID 5 should also have dedicated controllers.
     
    Unsupported assertion.
     
    Here's mine: "People who are self-described social conservatives should be beat with bricks."
     
    I like mine better.
     
    The fact is that my friend, who knows more about RAID than everyone else reading this (if you tie his knowledge you are very, very good) told me that RAID 5 in software doesn't take that much CPU and hardware RAID is pointless for me in my NAS.
     
    Look at what the Internet Archive uses (no link because if you can't find it, you are probably a social conservative and should report immediately to the brickyard).

  107. Re:Why aren't you running a dedicated controller.. by rcw-home · · Score: 1
    Especially given the fact that most CPU usage is less than 18% when calcualting parity for RAID 5

    CPU usage is 100% when calculating anything. In fact, at any given time, CPU usage is either 0% or 100%. It's either doing something or it's not. Same with network links - an ethernet card is either transmitting a packet, or it's not. It's only when you average these things out over periods of time that you see percentages like 18%. In the meantime, your other processes wait until the interrupt handler finishes or the scheduler gets to them. A 100% idle system is not merely 122% as responsive as an 82% idle system. Also, as another poster mentioned, software RAID5 may potentially be slower for reasons other than the CPU.

    BTW, regarding one of your other posts, I own an old 3ware 6410, which you allege is software RAID, and I distinctly remember going into its BIOS to create and initialize its RAID5 array. The Linux 3ware driver source is public - show us the part where it does the parity calculations.

    (who wants to spend 12 hours a week backing up to tape?)

    Consider using rdiff-backup to automate syncing to an external hard drive.

  108. Oh Great by aflat362 · · Score: 1

    and I just spent all this money on my Core 2. Or wait . . did I get the core Duo . . . er . . maybe the Core 2 Duo. Double Duo 2?

    --

    Conserve Oil, Recycle, Boycott Walmart

  109. Re:Why aren't you running a dedicated controller.. by Joe+The+Dragon · · Score: 1

    SAS and SATA RAID cards have one driver per channel
    look at Tekram ARC and 3ware

  110. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    "Before deploying half-assed solutions, it's usually better to deploy nothing at all."

    You obviously aren't a manager who "spearheads a new e-commerce initiative in customer motivation" while telling the staff to buy $299 servers from Dell or whatever, refreshes the ol' resume the week before the system goes into production, and "transitions" to a new job using the well-worded but half-assed system as a major bragging point during the interview.

  111. Re:Why aren't you running a dedicated controller.. by Joe+The+Dragon · · Score: 1

    most of the on board raid and some raid cards are part hard ware and part software also you can not boot form software raid 5 useing windows build in raid.

  112. Re:Why aren't you running a dedicated controller.. by Firehed · · Score: 1

    I'm inclined to think that flooding (or fire, hurricane, or other insurance-termed Acts of God) will cause problems for most backup mediums, not just RAIDed hard drives. In fact, if what they say about the life of a burned DVD is true, then hard drives are likely the best solution for most home users. Any solution can naturally fail, but hard drives under normal use have pretty good lifespans, and that being combined with a parity bit or proper mirroring should be quite reliable within the home. I don't bother with RAID in any form, though if I were to set it up on my fileserver, I would go for RAID5. I've got enough of my stuff replicated in six places that I'm not especially worried. Hell, even if my house burned down and cooked every bit of data I posess, a decent amount of my photos and whatnot would be recoverable as they're all uploaded to photobucket, and then Google Cache has probably made a mirror of them from wherever they were posted.

    --
    How are sites slashdotted when nobody reads TFAs?
  113. Re:Why aren't you running a dedicated controller.. by dnoyeb · · Score: 1

    What you described has come to be called FakeRAID. This is basically when there is some additional software in the BIOS, but that also requires a special Driver in the OS to handle it. Software raid is on the partition level. FakeRAID can be done on the disk level.

  114. Re:Why aren't you running a dedicated controller.. by Nutria · · Score: 3, Insightful
    WHERE'S THE MIDDLE GROUND PEOPLE?

    There's no "middle ground", there's cost-benefit analysis.

    I.e., is it worth my time to spend $50, $100, $200, $500, etc, and an hour a week to mirror a pr0n collection? Some people would say $50 and 5 minutes, and others would say $500 and 6 hours a week. And some would say, "Chunk it. If the disk dies, I'll just download it all again."

    --
    "I don't know, therefore Aliens" Wafflebox1
  115. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    I hear you talking but it sounds like gibberish.

  116. Re:Why aren't you running a dedicated controller.. by dnoyeb · · Score: 1

    Here http://www.newegg.com/Product/Product.asp?Item=N82 E16816116036 a nice 3ware, hardware RAID controller for ~$100.00. I use this.

    Software RAID is good for speeding up your disks, but I would never do Software RAID for data redundancy. Nor would I use it for east of recovery after a failure. I have used it for several years and the failures are nothing but trouble to get going again.

    Hell there are only a few OS that can boot from software raid partitions. And your own personal education level will have to go way up to get it working anyway.

    This $100 hardware raid card is no worries. Disk crash? put new disk in, reboot, tell RAID in BIOS to add disk to array. Finished. BTW, you get an email when the disk is acting funny or when it drops from array. Also supports smart. Can't be beat.

  117. Re:Why aren't you running a dedicated controller.. by dnoyeb · · Score: 2, Informative

    These 3ware cards are definitely hardware RAID. You are spreading FUD.

    The parallel card is the $110 on newegg.

    From Newegg StorSwitch switched architecture delivers the full performance benefit of Parallel ATA's pointto- point architecture up to 133MB/sec per port On-board processor provides true hardware-based RAID and intelligent drive management functions BIOS set up utility and 3ware Disk Management (3DM) web-based management software Bootable array support for greater system fault tolerance

    http://3ware.com/products/parallel_ata.asp

  118. Re:Why aren't you running a dedicated controller.. by Nutria · · Score: 1
    This isn't about life. It's about a professional decision as someone working for a company. Before deploying half-assed solutions, it's usually better to deploy nothing at all.

    That's a crock.

    Every non-trivial decision needs some form of cost-benefit justification.

    Sure, U320 SCSI controllers with a GB of batter-backed RAM are fast and durable, as are 15K RPM 300GB Cheetahs, 4Gbps fiber and Hitachi 9980 SANs with 128GB of cache.

    However, not all data deserves to be stored in an Hitachi 9980V. And not everyone can afford them. Some organizations can only afford FAKERAID.

    And some managers are stupid, cheap bastards.

    --
    "I don't know, therefore Aliens" Wafflebox1
  119. Re:Why aren't you running a dedicated controller.. by Nutria · · Score: 1
    This "RAID 5 is slow because of parity calculations" meme must die

    Yes and no.

    Yes, because the mathematics is trivial.

    No, because, as you said, in order to do the calculations, all N disks must be read.

    That's why, though, Big Controller Cache makes RAID-5 speed just about on par with RAID-0.

    --
    "I don't know, therefore Aliens" Wafflebox1
  120. Software RAID 5 is useful! by WoTG · · Score: 1

    Folks here are dissing software RAID a tad too much. This isn't the dark old days of critically limited CPU and bus bandwidth. With dual core CPU's, more often than not CPU processing capacity is far in excess. So, for many reasonably light loads, why not save a few bucks on the RAID controller? Heck, for the cost of a "real" hardware RAID card (never mind a battery backed one), you can buy a 2nd CPU which will be more flexible since it'll help both RAID speed calculation times and regular processing tasks. Sure, you wouldn't run top end database servers on software RAID 5, but there are many applications where speed is not important, but downtime is.

  121. Re:Why aren't you running a dedicated controller.. by Nutria · · Score: 1
    SCSI hardware RAID adapter doesn't do this.

    Sure, an adapter card.

    SCSI disk controllers let your RAID-5 sets stripe across channels to get extra bandwidth.

    Maybe this is just a DEC feature, though.

    --
    "I don't know, therefore Aliens" Wafflebox1
  122. Re:Why aren't you running a dedicated controller.. by TACNailed · · Score: 1

    Buyer beware: That's a IDE (rather than SATA) controller card.

  123. There's enterprise, then there's Enterprise. by Ayanami+Rei · · Score: 1

    That's big-E Enterprise.

    I'm talking about sub-1m yearly IT budgets here *eye roll*.

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
  124. It might be nice to have... by Ayanami+Rei · · Score: 1

    HTX based _anything_ some day.

    That being said, that means every device would have to have carry a PCI bus emulation layer with it.
    OS's don't know how to negotiate resources unless it looks like its connected on a well-known logical bus.

    Unless you've got some magic "platform" driver that probe-detects it and works without resource allocation (you'd have to negotiate your IRQ for OOB notification through some OS-negotiated process... Hello PnP!)

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
  125. Re:Why aren't you running a dedicated controller.. by martok · · Score: 1

    Well that's certainly possible. The card only had something like 128mb of onboard ram where the server had 2gb. However, I don't see how that could be a major factor. If I do a random read and that data is cached on the system, I get it from cache regardless of whether I am using softraid or hardware. Same for a write, it's gonna use system ram for a bit and flush it out when it's good and ready regardless of the underlying hardware.. In otherwords, the raid should still benefit from the 2gb of ram even in a hardware raid configuration.

  126. Re:Why aren't you running a dedicated controller.. by lukas84 · · Score: 1
    However, not all data deserves to be stored in an Hitachi 9980V. And not everyone can afford them. Some organizations can only afford FAKERAID.

    I disagree. Usuable RAID-Controllers don't cost that much. They are in the price range from 750 to 1000CHF.

    That's not free, but it isn't expensive either. Think about it this way: An Ultrium 3 Tape Library costs about 20'000CHF. Does it really matter if you spend 1k CHF more on your server, when your Backup costs 20kCHF anyway?
  127. Re:XOR is very common - Wrong by wtarreau · · Score: 2, Insightful

    You use XOR to clear a register. XOR CX, CX sets the CX register to 0. It is faster than MOV CX, 0.

    wrong !
    The instruction is shorter (2 bytes instead of 3 or 5, depending on the word size). But it sets flags on return while MOV CX,0 will keep them intact, so the CPU cannot reorder instructions around a XOR. In the best case (no dependency), they will all need 1 cycle. In the worst case, XOR will stall the pipeline while MOV will not.

  128. Re:Why aren't you running a dedicated controller.. by SwellJoe · · Score: 4, Insightful

    Err, NO! It's about FAKERAID, which is a H/W S/W combo.

    RAID stands for Redundant Array of Inexpensive/Independent Disks. Nowhere does it say "Controlled By A Dedicated CPU" ("RAIDCBADC"? Doesn't quite sing like "RAID"). Software RAID is as much RAID as a top of the line server RAID controller with RAM and a battery backup. It isn't as fast, sure, and it loads the system CPU, but it is still RAID. Calling it "FAKERAID" is just pretentious and misleading. The data integrity benefits are still present, as are some performance benefits in some circumstances (in fact, Linux RAID is demonstrably faster in some workloads than a top end Adaptec hardware RAID controller, though this is the exception rather than the rule)

    That said, I hate pretty much all RAID controllers (whether software or hardware). Linux software RAID means that I can drop the disks into any PC and access the data. Every RAID controller from Promise, Adaptec, and Tektronic requires me to use their disk format, and if I lose the controller I lose the data until I can get another controller. Sure, in high availability environments, you keep a spare...but with Linux software RAID, every PC in the office is a spare controller. That's my kinda redundancy. I've even had two identical Adaptecs with different firmware lead to pretty massive data loss during a server migration. Thankfully there were good backups. I've never had similar problems moving Linux software RAID disks into a new Linux box.

  129. Re:Why aren't you running a dedicated controller.. by lukas84 · · Score: 1
    I'd say RAID 5 is the best solution for home backups, you have redundancy for disk failure and with monthly or even quarterly backups you should be fine. (who wants to spend 12 hours a week backing up to tape?).

    We create snapshot based online backups from our system nightly. Takes about 6 hours to backup our systems to an ultrium 3 library with multiple drives. Data is about 1.5TB now. You can access the system at that time without problems, performance is a bit limited though.
  130. Re:Why aren't you running a dedicated controller.. by drsmithy · · Score: 1
    However, I don't see how that could be a major factor. If I do a random read and that data is cached on the system, I get it from cache regardless of whether I am using softraid or hardware. Same for a write, it's gonna use system ram for a bit and flush it out when it's good and ready regardless of the underlying hardware.. In otherwords, the raid should still benefit from the 2gb of ram even in a hardware raid configuration.

    The advantage comes from fewer disk reads necessary to do parity calculations during disk writes. With more cache, there's more chance all the disk blocks that would otherwise need to be read and/or updated, are already in the cache. So, when your system has 16x as much RAM as your disk controller, it's far, far more likely the software RAID won't need to do as many physical disk operations due to the disk blocks necessary for updating parity already being cached.

    It's not the data on the filesystem level being cached that's delivering the benefit (although that is somewhat relevant, as software RAID can use knowledge of the filesystem to improve caching, whereas hardware cannot), it's the data on the disk block level.

    You would be surprised how much difference this can make. One of the main reason SANS and (good) NASes are so fast (relatively speaking) is because of the massive amounts of cache memory they have in them negating the additional disk activity inherently required for parity-based RAID configurations.

  131. not new, but worse by Chris+Snook · · Score: 1

    This isn't new. Hyperthreading puts a performance hit on software RAID 5 as well, but the impact isn't nearly as noticeable. The problem is that shared caches do poorly with the kind of workload that RAID 5 imposes on a system. HPC users have been disabling HyperThreading for years for the same reason. The higher degree of separation between execution threads on a multicore chip magnifies the problem several times.

    They might be able to improve the dynamic cache sharing policy with a microcode update, but they should probably also work with OS vendors to optimize their RAID 5 drivers to be friendlier to chips with dynamically shared caches.

    --
    There's no failure quite as dissatisfying as a complete and total solution to the wrong problem.
  132. Silly by m.dillon · · Score: 1

    This is all rather silly. Nobody who needs RAID performance uses software raid, especially the crappy software raid found in most MB chipsets. It's just plain stupid. I wouldn't trust that junk even if I didn't care about performance! Use a hardware raid solution like a 3ware card or something similar (in particular cards with battery backup capabilties) and wash your hands of the whole affair.

    -Matt

  133. Re:Why aren't you running a dedicated controller.. by (Score.5,+Interestin · · Score: 1

    >I've never heard it called FAKERAID

    It's WinRAID, like WinModems, WinPrinters, and other pretend technology.

  134. Re:Why aren't you running a dedicated controller.. by norton_I · · Score: 1
    A 100% idle system is not merely 122% as responsive as an 82% idle system.


    This can go both ways. A 0% idle system can be nearly as responsive as a 100% idle system. A purely CPU bound task (such as SETI) will not interfere with an interupt driven task (such as IO) (neglecting cache flushing effects).

    Regardless, for most non-desktop uses, the latency caused by scheduling overhead is a negligible contribution to total system performance. The important factors are CPU throughput and IO latency. 18% parity overhead reduces throughput 18% (or requires a 22% faster CPU to maintain throughput) and the IO latency and bandwidth requirements of RAID5 are independent of whether the pairity is done on a controller or by the system CPU, though they *do* depend on where the cache is.

    Personally, I use software raid5, raid1 and single disk systems which rsync cross-backup each other to provide tolerable limits on downtime due to disk failure as well as backup for data corruption/deletion failures. I have considered getting 3ware cards on occasion, but for the uses I have it just didn't make sense. Plus, software raid has the advantage of being independent of controller type.
  135. From the "2GHZ is entry level" school of thought.. by ratboy666 · · Score: 1

    Implications: "Home users shouldn't use RAID5" because its not "backup". Enterprise customers should use "hardware RAID5" because its somehow superior.

    What utter bunk.

    I like my data; I back up my data; and I use RAID5. NOTHING is more important than my data. Now, the big thing about RAID5 is computing the checksums. And how fast is that? Lets have a look (from ganymede, my storage box):

    Adding RAID1 or RAID5 is a graid5: measuring checksumming speed
          8regs : 430.800 MB/sec
          32regs : 219.600 MB/sec
          8regs_prefetch: 430.800 MB/sec
          32regs_prefetch: 219.600 MB/sec
          pII_mmx : 532.000 MB/sec
          p5_mmx : 554.800 MB/sec
    raid5: using function: p5_mmx (554.800 MB/sec)
    md: raid5 personality registered as nr 4

    Look at that carefully -- we are computing checksums at a rate of 555 MB/sec. And what is this mysterious box?

    A Pentium II/266. That's right, a bleeding Pentium II/266.

    Quite fast enough to do "software RAID". So, would adding a dedicated RAID5 card help? Nope, you would never see the difference. We are faster than the drives, and the network. The bottleneck is NOT the RAID5 software. Go back and look at the machine spec again.

    What happens if I put a RAID5 card in this system? Um... right, another potential point of failure. By Crom! Lets not do that, and increase reliability! If you need 24x7, you would need redundant hardware anyway (including CPU and disk controllers). If you DON'T need 24x7, do what I do, and invest in a "cold spare".

    Ratboy

    --
    Just another "Cubible(sic) Joe" 2 17 3061
  136. By Crom! by ratboy666 · · Score: 1

    Running RAID5, you NEED to do the parity calculation when writing to the array.

    When reading? The data rolls off FASTER because it is effectively striped. And... heres the kicker. UNLESS there is an error during reading, you don't need to do the parity calculation.

    So, for reading, RAID5 is almost on par with RAID1, *not* RAID0.

    For writing? The largest bottleneck in a RAID5 is (normally), the need to read the entire block for a small update. Why? because the checksum has to be recalculated.

    And this is where a "software raid" can have a MAJOR advantage over "hardware raid". Specifically, if the FILESYSTEM layer knows that a block is "fresh", is doesn't have to be read before the checksumming. A hardware implementation can never know this, and is forced to do the block reads.

    Ratboy

    --
    Just another "Cubible(sic) Joe" 2 17 3061
  137. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    Going into the BIOS prooves nothing. I have promise cards that are software RAID and yet I can go into the BIOS and initalise an array. In fact, it is through this BIOS that software raid cards are bootable.

    The trick is that the card has an "extenion ROM" on it that hooks interrupt 13H (yes, this is all old-school real mode stuff!) which is the interrupt used when BIOSes, real-mode OS's and most (all?) bootloaders call to load a sector off disc.

    This hooked int13 of course runs the code stored on the card's ROM which can then do any parity calculations/mirroring or whatever else. This piece of trickey is what lets you boot from a software raid card as the BIOS uses INT 13H to get the boot process underway, and what lets MS-DOS or any other such OS access the raid without any special drivers.

    However, once your OS transitions to protected mode, it's usually quite a hack to continue to use the real mode 13H interface, so your OS specific protected mode driver normally does the parity calculations. And even if possible, there is a performance hit.

    (I believe that Windows 3.1 did do something like this as it still used MS-DOS for disc access. "32bit disc access" in Windows 3.11 was all about using protected mode disc controller drivers instead. I think that NTLDR, the WinNT/2k/XP bootloader, may also call INT 13h from protected mode mainly to maintain compatibility with situations like this. :-) )

    You're right though that auditing the linux driver will proove the matter. But I am aware that some linux drivers for such cards just exposed the IDE interfaces of the card as block devices to the OS - effectively, only supporting JBOD. Linux software mirroring could then carry it the rest of the way to get it all working.

    (An extension bios is a rom that has a certain magic checksum and is thus automatically executed by the main system BIOS during POST. Often video cards have one too, and they perform rudimentary checking at boot time and/or display annoying pictures. When initially executed, the extenion bios may hook an interrupt allowing the extension bios to intercept calls to that interrupt in the future.)

    In theory, a true hardware RAID card could hide all the details from the OS, making it look like just another HDD controller as far as the OS can see - so no specific driver is needed.

  138. Re:Why aren't you running a dedicated controller.. by Cicero382 · · Score: 1

    "If you're running raid5 it's probably in an enterprise setup. If so, why aren't you running a dedicated controller?"

    'Cos it's cheaper not to, that's why. I've seen plenty of installations that have large disk arrays running under Veritas Volume Manager and the performance is perfectly adequate for the client's needs. Disks are cheap, controllers are not. It's RAID - Redundant Array of INEXPENSIVE Disks - not RAIDVEC (RAID with Very Expensive Controller).

  139. Re:Why aren't you running a dedicated controller.. by dnoyeb · · Score: 1

    Lol, beware? I guess its kind of a problem that my old computers don't die. I bought the Harddrives first. I think I ran into a drive size barrier on my computer so buying the hardware PATA RAID card solved two problems.

    Beware: They have SATA versions as well. Not all of their SATA support hot plug though I don't think. Pay more get more...

  140. Re:Why aren't you running a dedicated controller.. by Vajsvarana · · Score: 1
    IBM ServeRAIDs -- I think these are dedicated controllers

    Beware! Not all IBM ServeRAIDs are dedicated RAID controllers. In fact most of built-in ServeRAIDs are software ones: the (in)famous ServeRAID "e" series (ServeRAID 7e and ServeRAID 8e, that you can find in most xSeries 206/206m) as you can see in the ServeRAID Red Book:
    http://publib-b.boulder.ibm.com/Redbooks.nsf/Redbo okAbstracts/tips0054.html?Open

    I don't think a real dedicated controller (like ServeRAID 6m) could have some impact on the CPU apart from saturating some I/O channel.
  141. Re:Why aren't you running a dedicated controller.. by swmccracken · · Score: 1

    Argh. As I posted elsewhere, the presence of a BIOS does NOT proove they are not software RAID. In fact, the BIOS is integral to how software RAID systems boot! (At boot time, the BIOS hooks Int 13H allowing the card's BIOS - which is a program that runs on the main CPU - to do specific processing for the RAID, such as parity calculations or mirroring or whatever - but invisibly to the main system BIOS, operating system bootloader or old-skool OS's like MS-DOS.) Effectively, it hooks in its own disc controller driver.

    http://hardware.slashdot.org/comments.pl?sid=19048 1&cid=15674142

    (In fact, this same trick of hooking INT 13h is what allows boot roms on network cards to work - and that's 100% software as the disc access requests are being directed to an image in ram!)

    A hardware XOR engine doesn't proove it either - it's possible that the hardware XOR engine runs under control of the OS driver (driver says load this block from this disk, driver then says xor that with this) which is somewhere between fully hardware and software. Not saying this is the case, just that it's theoretically possible.

    Now, if you give me a RAID card that works under Linux, Windows, NetBSD and whatever else operating system because it emulates a standard SATA or IDE controller at the hardware level (so much so that you use the same driver for the RAID card as you do for an ordinary controller) *then* you definitely do have hardware RAID. So far, I'm yet to see a card that will work with standard drivers though. What also is definitely hardware raid is external boxes that pretend to be a single disk to a SCSI controller despite containing many drives internally.

  142. Re:Why aren't you running a dedicated controller.. by andrewman327 · · Score: 1

    I use one of the GMail file storage front ends, which is great for critical documents.

    --
    Information wants a fueled airplane waiting at the hangar and no one gets hurt.
  143. Re:Why aren't you running a dedicated controller.. by WuphonsReach · · Score: 1

    Easy enough to put the O/S on top of RAID1 as well (assuming that we're talking Linux).

    After working with Software RAID for a while, I definitely prefer it over more proprietary RAID solutions (dedicated RAID controllers). Software RAID is extremely flexible, easily recovered (move the disks to another box) and my RAID array doesn't depend on a single piece of hardware that may be difficult to replace if it fails. For smaller companies who can't afford to buy three of everything, that's a big advantage.

    Mdadm also does a good job of letting you run without a config file. That means you can move a disk from a failed controller to a spare port on another controller and immediately get back up and running.

    --
    Wolde you bothe eate your cake, and have your cake?
  144. Re:Why aren't you running a dedicated controller.. by theotherfish · · Score: 2, Insightful

    What about hotswapping? AFAIK, you still can't do that with software RAID and SATA drives... (at least with Linux 2.6.x - never tried with Windows or Mac OS X). A hardware (SATA) RAID controller is your only option, unless you spend $$$ on a SCSI disk subsystem

  145. Re:Why aren't you running a dedicated controller.. by jdew · · Score: 1

    Wrong.
    Same Make, maybe, but not same model.

    I didn't have to do anything when I upgraded from a Compaq Smart Array 2 to a Smart Array 4200 to a Smart Array 5302, it just worked.

  146. Re:Why aren't you running a dedicated controller.. by tigersha · · Score: 1

    Been there, done that. Then you have to hunt for one on EBay. NOW. Ask the seller to send it via hand-delivered overnight courier. NOW. For $200 extra.

    And the controller that got wasted had its RAID config stored in NVRAM and not on the disks... And your predecessor did not write it down somewhere... Which makes the new controller useless....

    Fortunately switching NVRAM chips works. Never been glad to see NT 4.0 boot in my life, except for that one single day.

    --
    The dangers of excessive individualism are nothing compared to the oppressiveness of excessive collectivism
  147. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    I used to run linux RAID 1 and RAID 5 at home for pretty much the reasons you cite. I now only use RAID 5 and that will be replaced when I'm able to do so. Why? Too many corrupted files. One of the "faults" with RAID is that it provides no protection from file system errors. Please note that I didn't rely on the RAID for data security, I have a regular backup plan as well to DVD that provides a historical archive.

    Because I was using software RAID 1 in linux converting to an rdiff solution was trivial (more or less turn off the raid, setup rdiff). That is one thing I really do like about software RAID 1. When I set it up I didn't have excess storage space to mirror, format and copy back. I got a second drive, told linux it was now a RAID 1 setup and it "repaired" to the new drive. Great stuff.

    The reason I used RAID 5 on one system was I had a supply of 9 GB SCSI drives. RAID 5 offers more efficient redundancy so... but unfortunately those 9 GB SCSI drives are aging. I've had two of them die and am running low on spares. I can, at a fairly reasonable price, buy a couple of ATA drives and have redundancy (either RAID 1 or using rdiff) with even greater storage capacity. And fewer drives taking less power and generating less heat.

    In sum? Although you can setup a RAID for a home setup it isn't necessarily the best way of introducing redundancy (all the file system eggs in one basket). Using something else, like rdiff, while not as efficient as RAID 5 offers greater redundancy and historical data.

    thoromyr

  148. Re:Why aren't you running a dedicated controller.. by kwalker · · Score: 1

    Cite? I'm looking at Newegg's site now and the cheapest 3ware card I can find is $175 and doesn't even do RAID5.

    --
    ... And so it comes to this.
  149. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    RAID has a history independent of the expansion of the acronym. That history means that RAID is associated with and means hardware unless the term RAID is modified with the term software. You may not like it, but you do not get to rewrite history just because of your preference.

    In the context of this alleged Intel problem, it is misleading to call this just a RAID problem and not specify that you mean software RAID.

  150. Re:Why aren't you running a dedicated controller.. by razholio · · Score: 1

    on the relative slowness of RAID-5:

    your right that it's not the parity calculations soaking CPU time that cost you, however, it's not the read latencies that get you. You calculate parity on write, from data you hold in memory. What kills you is every block written to the logical drive must first be written to memory, parity calculated, and only then written to disk. It isn't the CPU usage that is the problem here, but the latency you incur from those (albeit fast) calculations and memory operations. TO contrast that, RAID-1 requires only that each block be written to 2 physical drives instead of just one. The only added latency here is that you must wait for both drives to complete the write and you will always wait for the slower one.

  151. Re:Why aren't you running a dedicated controller.. by necrogram · · Score: 1

    Gee, I've been buying dell and hp comntrollers for years. Never had a problem with doing controller swaps. Both of them write the configs to disk as well as nvram, so its 2 minutes spent in the contorller's bios and i'm back on line. Plus I spare controllers, and in the 6 years i've run this shop, I have only 5 raid controllers in my inventory. I've even got in the habbit of ordering servers with hot spare drives in them, so i just rebuild and not worry. I've been hot sparring in my SAN for ages, and I sleep better with it. the short and sweet of it is, I'm not knocking linux software daid, but I just dont trust software at all for that stuff. and in all the years i've been doing this stuff in an enterprise environment, i've never had a major problem running hardware raid controllers.

  152. Re:Why aren't you running a dedicated controller.. by myz24 · · Score: 1

    For less than $100 I can the same things with mdadm and smart. Software RAID is built into the kernel so it's more likely to work better than the drivers for whatever card you have.

    Case in point. I ran software RAID on a file server at work for years. I started running of space and decided to upgrade the machine. I decided now was the time to "do it right" and ordered a hardware RAID card and new disks. After 2 months of running ok I started suffering from random system failures, one, two or more disks would fail at once. Rebooting the system would bring them back online. After 3 months of messing around with Dell and Adaptec I ripped out the card and started using software RAID again. I got better performance and no more crashes!

    I've also suffered disk failures under software and hardware RAID and honestly, neither works any better than the other one for recovery. The one really nice thing about hardware RAID however is the RAID set appears to be just one drive to the system and you're able to install without any funky paritioning or initrd's.

  153. Re:Why aren't you running a dedicated controller.. by Some_Llama · · Score: 1

    Like I said, it DOES say in their documentation that it is hardware, they can SAY anything, the proof is that if you read further into the documentation, you will see that any initialization occurs 10 minutes AFTER the OS has booted.

    The hardware RAID is just marketing BS, from years of supporting these cards I can definitely tell you they are NOT hardware based.

    Yes you can "create" an array in the BIOS, will it be initialized or useable until the RAID driver has loaded into the OS? No.

  154. Re:Why aren't you running a dedicated controller.. by Billly+Gates · · Score: 1

    Raid5 is expensive for all but servers and you need hot swappable drives to replace a bad drive and continue on.

    For cheaper you can purchase a DVD rw and Roxio that can archive your system. I have never had a hard drive fail on my home computers. I dont leave them on 24x7 but alot of my drives are older adn therefore better built.

    How many disks do you need for a raid5 setup? Its been years since I was in IT but I remember seeing at least 4 disks. Also joe six pack doesn't want to open his box and try to guess which drive failed in his raid and replace teh drive himself. So raid is not recommend for all but workstation users who know what they are doing or work in a place with an IT department.

    Raid5 also slows down access with all the mirroring and ecc parity stripping going on. This would annoy a home user and slow it down considerable. Fine for servers if you have a fiber connection and 5 drives spinning simultanously.

  155. Re:Why aren't you running a dedicated controller.. by Some_Llama · · Score: 1

    You are confusing channels with controller ports. SATA raid cards have one driver per controller port but multiple ports on a "Channel".

  156. Re:Why aren't you running a dedicated controller.. by Some_Llama · · Score: 1

    From the user manual of he 7000-8000 user guide (pdf) found on page 42:

    11 Press F8 to save your configuration and reboot the system.
    The rebuild will start within a few minutes of the 3ware driver
    loading, once the operating system has booted.

    The 3ware 7506 does not do hardware RAID by itself.

    I have talked extensively with 3ware engineers and can assure you this is the case. They might have aprocessor on the card but it is using the CPU to do RAID calculations.

    If you want to compare marketing blurbs with real life experience diagnosing and troubleshooting these products, then by all means go ahead, but saying I am spreading FUD is not informative.

  157. Re:Why aren't you running a dedicated controller.. by Some_Llama · · Score: 1

    BTW, regarding one of your other posts, I own an old 3ware 6410, which you allege is software RAID, and I distinctly remember going into its BIOS to create and initialize its RAID5 array."

    Correct, you setup the array in the BIOS, but it will not build or rebuild the array until the OS is booted and the driver is loaded:

    From the user manual of he 7000-8000 user guide (pdf) found on page 42:

    11 Press F8 to save your configuration and reboot the system.
    The rebuild will start within a few minutes of the 3ware driver
    loading, once the operating system has booted.

    I know this is from a later model, but I am certain it is the same way for your controller, they didn't start hardware RAID until the 9000 series.. (I have talked with their engineers extensively and worked with their hardware in real life situations for years).

    I'm sorry but I won't be spending any time pouring over code to find where they do the calucations, but that would be a great place to look, any coders out there wanna give it a try?

    http://www.3ware.com/support/windows_agree.asp?pat h=/download/Escalade7000Series/7.7.1/3w-xxxx.tgz

    Just click agree to get the 7000 series linux 2.4 source.

  158. Re:Why aren't you running a dedicated controller.. by Some_Llama · · Score: 1

    Should have read "one drive per controller port".

  159. Problem with testing at Intel by Anonymous Coward · · Score: 0

    Daniel, Fran, What say you guys? Is it the usual management chain crap, or is there a tech problem? Cache thrash? More test monkeys would help, automatic testing is only good for regression, not for new issues and unfound problems. Have Steve pull Melissa off, and put her on something she would be competent at.

  160. Re:Likely a driver bug - Excuse Me, But!!! by Nom+du+Keyboard · · Score: 1
    Oh wait, they did. Not only did they say so a long time ago, they publish documentation and maintain a compiler to help you optimize for the way their processors work.

    Well apparently what they previously published has changed radically, since the code in question seems to have no problems running on earlier processors. Does Conroe need everything recompiled just to run efficiently on it now? Do we need multiple binary distributions to run on different Intel IA86 processors? Did you even get the first point in the original post?

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
  161. SATA HW RAID controllers by Ender+Ryan · · Score: 1
    Of the 10 or so SATA HW RAID controllers I've purchased, at least 5 of them have failed on me. I will not trust another LSI Logic (megaraid) or 3Ware controller, ever again. A quick reboot to swap in/out a new/broken drive is far more preferable to me than the headache of a flaky HW RAID controller.

    If you're not spending more than $1,500 USD on a HW RAID controller, cross your fucking fingers :)

    --
    Sticking feathers up your butt does not make you a chicken - Tyler Durden
    1. Re:SATA HW RAID controllers by sumdumass · · Score: 1

      I use 3ware and lsi raid controlers all the time. I have several in production machines running a mirrored stipes on production machines servicing around 25 clients. Maybe the problem your using is in the amount of use your card is seeing. I guess it might be prone to failure if you drive it harder.

      But seriously, most of the raid i use is just drive insurance were i know it won't be too dificult to get a system functional if a drive fails. Some of them are for added speed but I belive the lan conection (10-100-1000 copper ethernet) to them is slower then 2 drives mirrored. Of course most of these systems are low usage (less then 10 clients)with the exception of two 2 25+ client shops. But 25 clients is hardly enterprise level so that might make a difference. I am very please with them so far. The 3ware were a little buggy a few years back on linux works greate now, they were always great on windows and windows server machines.

      For a cheap solution on a small network, i don't see any problem with them. Maybe with more volume, the more expensive solution might be better. I think that might be a reason these cards are so cheap, they are targeted at smaller networks(companies)

  162. Re:Why aren't you running a dedicated controller.. by sumdumass · · Score: 1
    I don't really see what the big deal is though; nobody has had a chance to write optimized routines for Core2.
    The big deal is several things. First, this signifies that the dual core intel processors might have an issue were standard code doesn't function as expected. This isn't a matter of optimizing either. If you purchased another computer that was supposed to be faster, you wouldn't think you had to upgrade your software in order to get the same speeds. This might lead some to think that the bigger-beter processor expense could have been avioded and therefore they were ripped off.

    The next issue is what else doesn't perform as expected. Is this processor actualy backwards compatible? Will any other thing creep up causing the system to not perform as expected. While this is more of an "lets hope not" or maybe a "not likley to happen" type situation, anyone making buying decisions should be aware that it could be a potential problem. Think about it as if you bought a new dell poweredge server using a couple of these processors and ended up getting less performance averall then if you had recycled a P133 workstation and converted it. Now, if your the one doing the selling (consulting firms resell these) and this problem creeps up, it can damage your reputation as a provider. If it makes your applications become unavailible, it can damage the business as well as COST for upgraded applications that are optimised for this processor. Sometimes this can be in the area of more then the server itself. The last acounting package I helped upgrade (because of XP incompatabilities) cost around $25,000 for software along then we had to puchase a support contract on top of that to get some data transfered to the new system. I don't think a company would want to do this because they purchased an intel dual core based server instead of an outdated intel P4 (itanium, P3, whatever) based server or even an AMD operatron based server.

    There are probably more scenarios to it then this. But these do seem to provide a big headacke potential. I guess the likley hood of any one of those scenarios happening are minimal but it is a potential disaster for someone. It could devestate small consulting firms if thier new server they provided didn't work correctly. It could also potentialy set back any other companie who was forced to upgrade this software because of this.
  163. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    Calling it "FAKERAID" is just pretentious and misleading.

    AFAIK, in the linux kernel driver code, it *is* called "fakeraid".

  164. Re:Likely a driver bug - Excuse Me, But!!! by ArbitraryConstant · · Score: 1
    Well apparently what they previously published has changed radically, since the code in question seems to have no problems running on earlier processors.

    Issues with the chipsets or much more likely their drivers haven't been eliminated, and that's entirely possible because they're new too. These are much more likely as candidates than a CPU being mysteriously slower than it ought to be.

    But, that being said... of course what they published previously has changed radically. That's what happens when you throw the old architechture out and introduce a new one. What did you expect? How could they or any other company ever make advancements if they weren't willing to change things? They have an obligation to maintain compatibility with previous x86 chips, but in no way can they be expected to give them identical performance characteristics. Netburst's performance characteristics suck; that's why they're throwing it away.

    Does Conroe need everything recompiled just to run efficiently on it now?

    No more than for every other implementation of the x86 ISA. Some performance sensitive things like video codecs have different code paths for Athlons, Athlon 64s, Pentium 4s, Pentium Ms, etc. This isn't exactly the first time a new iteration has been released. For most things it doesn't matter, and where it does matter the developers are used to having to perform new optimizations every few years.

    The more you optimize software for a particular chip, the more you'll run into problems on different chips. That's how optimizations work.
    --
    I rarely criticize things I don't care about.
  165. Write barriers have solved this pb by Anonymous Coward · · Score: 0

    What you said was valid... 3 years ago (the page you cite is 3 years old.) Since that time, Linux has implemented write barriers to precisely fix the problem you describe. With write barriers, the fs layer is now able to force data blocks to be committed to the device to enforce write ordering constraints (because the disk cache might choose to flush data in random order). And if that fails, it knows it and the journalled fs can act accordingly without corrupting the fs. The MD layer also implements write barriers, and advertises the capability to filesystems.

    All in all, as of today, raid cards with battery-backed cache are not needed for Linux to implement reliable filesystems over software raid arrays. These cards are only useful for other OSes whose filesystems to not use write barriers.

  166. Re:Why aren't you running a dedicated controller.. by SwellJoe · · Score: 1

    What about hotswapping?

    I don't do it, and I've never been in a situation where I felt like I needed to be able to do it. If it's a critical server, it has a hot spare disk (which is supported by Linux software RAID) so I can schedule downtime at the earliest convenient time. It it's not critical, I schedule downtime roughly immediately (i.e. end of business day, or whatever fits). I don't have any systems that must always run, forever and ever, amen. I just don't. I'm sure there are situations where it's necessary, but I've never seen one.

    I used to believe in hardware RAID, but I've come into enough situations where the prior admin had setup critical systems with hardware RAID and were using whatever controller Dell shipped them--and they didn't have extras on-site. When the RAID controller failed, and I've seen them fail enough to consider it a real threat, the wait was hours or days while Dell got a new controller on-site. Sometimes at great expense.

    Thankfully, I don't have to worry about that sort of thing these days (might have to again in the future, but IT admin is in my past for now). But at least a few of my most hair-raising IT experiences were due to problems with hardware RAID controllers. I'm not saying they don't do their job, or that a good RAID controller doesn't do a better job (for some definitions of "better") than software RAID. I'm saying that if you don't have a spare for every RAID controller in your office available, on-site, you're going to regret it one day. Even if you do, you might still regret it--as I mentioned, I've met two identical model Adaptec controllers that were incompatible due to different BIOS versions. Data was lost and had to be recovered from backups. I've also managed to screw up the RAID table due to human error (sometimes those Sunday late night upgrades go awry) in ways that would have been less likely to happen and more easily repaired in Linux, but the limited nature of the OS available on a hardware RAID controller made it far easier to screw up and more difficult and time consuming to repair. Having a whole OS with excellent partition and array management tools and a real shell to explore the situation with is a mighty fine thing when trying to figure out what stupid thing you did that led to a system not finding any system disks on boot.

    If you need hot swap, you'll need a controller that supports it...though I feel obligated to mention that hot swap is a part of the SCSI and SATA standards, not any magical RAID feature. It can be implemented on any SCSI or SATA controller, regardless of RAID capability, and some Linux drivers probably support it (though I've never needed it, so I don't know for sure). Though it's probable that the controllers that most often have support for the hot-swap features are the ones also equipped with RAID capabilities.

  167. Re:Why aren't you running a dedicated controller.. by dnoyeb · · Score: 1

    BAsically I got hardware RAID because its so easy to use on failure. I don't have time to worry about bootsectors and which OS can boot in RAID and which not. Hardware is flawless.

    I might also point out that you too have drivers for your drive controllers. And you have drive controllers. Using Hardware RAID does not inject more hardware into the equation than what you have.

    Besides, Software RAID is not the same as hardware RAID. one RAIDs partitions, one RAIDS drives. Sorry about yoru bad experience but I have been using software RAID on Redhat for about 4 years now. The hardware RAID setup was much simpler and have not had any issues as yet.

    I should also say that FakeRAID offers the worst of both worlds...

  168. Re:Why aren't you running a dedicated controller.. by dnoyeb · · Score: 1
  169. Re:Why aren't you running a dedicated controller.. by dnoyeb · · Score: 1

    Since i am using a 3ware card I am fully aware of this. I say your spreading FUD because you are saying its not a hardware RAID card.

    If you want to say its not the best because it uses more of the CPU that others, then OK. I'm not trying to make out like this card is up there with the more expensive cards...

  170. Re:Why aren't you running a dedicated controller.. by kwalker · · Score: 1

    Interesting, thanks. I wonder why it didn't come up when I browsed through the 3Ware category.

    Regardless, this one wouldn't help the situation mentioned in TFA because it doesn't do RAID5.

    --
    ... And so it comes to this.
  171. Re:Why aren't you running a dedicated controller.. by Anonymous Coward · · Score: 0

    Wait until that software raid 5 (in windows at least) fails a disk and then you see the CPU price you pay. Even duing rebuild you are hurting.