Hard Drives Made for RAID Use
An anonymous reader writes "Hard drive giant Western Digital recently released a very interesting product, hard drives designed to work in a RAID. The Caviar RE SATA 320 GB is an enterprise level drive without native command queueing and uses an SATA interface. In works better in RAID than other drives because of features like its time-limited error recovery and 32-bit CRC error checking, so it is an option when previously only SCSI drives would be considered."
Sheesh, this is a VERY thinly disguised ad. Here's a direct link to NewEgg $169. Has the same details as this "story."
If they would stop eating around the hard drives, leaving crumbs in them, we wouldn't need to use Raid to take care of the cockroaches in them. Ugh.
Advice for my fellow geeks: before seeking out that threesome you dream of, you might see what a TWOsome is like first.
Interesting that they don't have NCQ, whereas SCSI drives generally do (well, called TCQ on SCSI IIRC)
Is this just marketing speak, has it truly included scsi features, or could it actually be better performing than SCSI in a RAID array?
"In works better in RAID..."
You should change "In" to "It"
Thank you very much.
While I've been a proponent of SCSI for a long time -- Apple really was thinking ahead when it had it in Macs all those years -- it has been getting thread-worn. Ultra-wide-tall-double-hex-SCSI is just getting to be too much!
SATA is the right technology, especially for controllers since each channel is dedicated. The only alternative is Firewire, and there are no native controller drives.
I love my computer -- You make me feel alright (Bad Religion)
How does the lack of Native Command Queuing improve RAID performance? Generally I thought NCQ improved all drive's performance, and TFA says that NCQ is normally part of Enterprise High-Performance.
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
You can't sell your oh so cool hardware review site for millions of dollars and retire at age 12 just because Cowboy Neal posts your article on slashdot!
p.s. Pay attention in English class.
Summary of article:
The Good (+)
- Very good performance
- Looks cool (for a hard drive)
- Optimized for RAID use
The Bad (-)
- High initial investment
http://www.wdc.com/en/products/Products.asp?DriveI D=92
I bought one to replace what I thought was a bad drive in a RAID configuration about a year ago.
Proper TechReport's review here.
Go read. Now!
Here's an interesting quote from Tom's Hardware:
"In sum, we must state that all Command Queuing enabled drives have an advantage over those that do not support this feature. At the same time, CPU load is also slightly higher when Command Queuing technologies are used. However, considering the performance of today's processors, the additional CPU load is a marginal factor."
Basically, you put some load on the processor for increased disk performance... Why not include it?
Support alternatives to Paypal: http://www.e-gold.com
The manufacturer specifically says to only use these in a RAID-1 configuration (mirroring). They have a reason for this: The error recovery mechanisim is abbreviated. So what does Sal do... He connects two drives in a RAID-0 configuration. Now his data reliability has gone to about 1/4 of a regular drive.
Personal mod up to 2 yaro points!
Well, it seems the Roland experiment worked very well, so they've decided to move up to the Big Money.
NCQ allows hard drive to reorder various commands/accesses to suit its current head position. Depending on your app you might not see a lot benfits from it e.g when you do serial access all the time but lack of it will certainly cause degradations when multiple apps are active. Also by using one big hard-drive instead of multiple smaller ones its putting all eggs in one basket. Mechanical problems are more frequent than magnetic ones for a hard drive..
This sig doesnt exist.
These buggers are hard to find for anywhere near decent cash. I've found one model that is fairly popular, going by several different names and brands, but nobody seems to have them in stock. They look like a GREAT deal and loaded with most or alll of the best features of raid5. (hot swap, live rebuild, live GROW, etc) Has anyone seen one IN STOCK anywhere?
D =45&Cat=Producti harasyfor5.html9 19&refid=1057r e-mini.html
u cts_id/657
Same exact models:
http://www.raidweb.com/fb605fw.html
http://www.micronet.com/General/prodList.asp?CatI
http://www.firewiremax.com/fire-wire-1394-ilink/m
http://www.pcrush.com/prodspec.asp?ln=1&itemno=77
http://www.cooldrives.com/firewire-raid-5-enclosu
http://www.topmicrousa.com/combo-205.html
same internals, different enclosure:
http://fwdepot.com/thestore/product_info.php/prod
http://www.cooldrives.com/fii13toatade.html
Everyone I call says they have them in stock. Then I ask them to check and they suddenly change their mind and say no it's not really in stock, (despite what their web page says) and they expect it in the generic "1-2 weeks". (retail-speak for "we don't know when it'll be in, please call back later")
Two of them actually told me they have yet to receive any of these units, so I don't think they've shipped from the manufacturer yet? (vaporware?)
I work for the Department of Redundancy Department.
Is there a reasonable cost, relatively low power RAID-5 setup for home networks? I'd love to set up a file server with gigabit ethernet and RAID-5 to serve as the home directories for my multiple machines. Things like the Buffalo LinkStation are a step in the right direction, but no RAID, etc. Is my only solution a Celeron or Pentium-M based PC? If so, is it possible to set up such a system to act as home directories for a combo of Windows, Mac, and /or Linux machines?
Ooh, a sarcasm detector. Oh, that's a real useful invention.
EIDE drives are the cheapest type. But AFAIK, each drive has a controller card onboard, which seems redundant when all the drives are being controlled in conjunction. Software RAIDs seem to have parity (pun intended ;) with HW raid controllers, but wouldn't a real "Made for RAID" drive have nearly no controller logic of its own (maybe just data separator and head/spindle speed/position calibration)? Lots of logic for controlling the RAID drive will be on the central controller card, or running on the CPU. So why have more on the drive? The cheaper the drives, the bigger the array at the same budget (shared overhead of common controller).
Am I correct, or are some RAID drive makers already doing this? Or have I just got all the controller:drive economics wrong?
--
make install -not war
I got an alpha hardware version of one of these bad boys and it came with a very handy extra that they assumed the end users wouldn't need: a de-bug port. Now I don't even NEED raid, but it's nice to have the option.
Specificially RAID 0, mirrored. It would be nice to be able to split one of those oversized drives into a mirrored drive, using the opposite sides of the disk platter as mirrors. You'd get better reliability with the slight trade off of all that surplus disk space you never use.
Now, firstly, yay! I've loved the idea of RAID arrayed drives since I first heard of it about 2 years ago when my friend was writing linux drivers for a company's (obligatorily namelessness) RAID Card. The technology has some very high potential, but it has very much sat by the wayside. Seeing a major hardware manufacturer target a product to the RAID crowd means it's starting to get some industry clout, which I for one think is mucho overdue.
Secondly; however, the article seems very short on technical specs:
"In works better in RAID" according to.....?
A couple fans told me that my last journal entry was mint; give it a shot. Hope you like.
Is it just me, or did this review stink for lack of proper testing and comparison...
If I were comparing this product and it's performance, I certainly would not be benchmarking a SATA based RAID setup against a single Parallel ATA drive. Something in this arrangement just doesn't seem... well, logical.
If you were really going to try to impress me with it's performance, then you would have to show me how it compares to "non-RAID" optimized drives of near simular characteristics. Show me how this drive performs against, say, Hitachi SATA 320 gig drives using an identical test rig. Also show me how this drive compares to 320 gig SCSI drives. Show me the results as JBOD, RAID-0, RAID-1 and RAID-5. You know, like the real world.
While the graphs are pretty, I'm afraid that this "review" it fairly content-free.
Ron Gage - Westland, MI
Don't forget the "poor man's" RAID -> rsync. If you have 2 complete machines with 1 drive each, and you rsync them, you are *nearly* equal to one machine with dual power supplies, and mirrored disks. Of course, you probably would just mirror data, but you get the idea. Truly convenient if your backup fails one night.
Buffalo TeraStation
Supports RAID 5.
I emailed if external USB hard drives could be added and swapped to a raid 5 array, and if it can be done "on the fly"...
but all I got was this lousy message:
"Please call (800) 456-9799 x. 2013 between 8:30 and 5:30 CT and our presales guys will be able to assist you."
I'm one of those weird people that would rather communicate in writing. Oh well - no sale.
Spoon not. Fork, or fork not. There is no spoon.
IMHO, the biggest things manufacturers could do to make the drives more RAID friendly is to change the name (even with just a v1, v2, etc...) when they change platters.
Nothing is worse than buying a bunch of drives and a couple of spares and building the array and then discovering down the road that in fact one of your spares came from a different production run and has a slightly different (maybe 3 block smaller) geometry and can't be used on your array. Usually there is absolutely no indication on the box or the drive that one of your drives is different unless you decode the cryptic serial number.
For that matter, just printing the exact LBA count on the back of the box would be a huge boon.
This isn't limited to ATA drives either. I've seen it plenty of times in professional SCSI solutions too, especially as the arrays start to get older.
I read the internet for the articles.
I would think if these drives are really designed for RAID (like other drives have been in the past), then they would have support for synchronized spindles.
The idea behind synchronized spindles is that in order to read data from a disk, you have to wait for the platter to come around part of a revolution for your data to become available, just like picking up your suitcase on the luggage carousel at the airport. How long you need to wait is a matter of luck, because the disk can be assumed to be in a random position when you decide you want your data. When you have RAID without synchronized spindles and you want data that's bigger than the stripe width (or when you're writing and need to update the parity), you have to wait for multiple disks, and they will tend to be spread out so that you tend to wait longer than if you were just waiting for one. With synchronized spindles, as soon as the whole group hits the right position, you've got what you're looking for, and you're done.
So, the point is, not having synchronized spindles tends to increase average access time, so having synchronized spindles is a desirable feature for a drive designed specifically for RAID.
Buy their gear if you must but I would not put my data on it.
-- RLJ
I've had two of these drives in my server at home for almost year now. This is news??
No kidding. I've had a pair of 250GB IDE Raid Edition drives for over a year now.
That the RAID card handles it. Please remember I am just guesing here, I don't know. However I know there used to be PATA IDE RAID cards that did this. The discs didn't support any kind of special reading, they just processed requests in order. Ok, no problem, the controller, which had a processor, RAM, etc did all that. It would implement scatter-gather and so on. Basically it was a SCSI RAID controller with IDE connectors instead.
So perhaps the thought here is since you have a controller that'll handle it, leave the features off the disk. PErhaps it's cheaper, perhaps it works out to be faster, perhaps NCQ qould interfere with the RAID card's operation. You know, maybe the RAID card queues up operations and then dispatches them to the disk in the order they should be executed,b ut the disk then queues them up and does them in an order that makes more sense to it, but less to the actual RAID.
Or, maybe they are just ripping people off.
I'm no expert, but I look forward to mostly buying 2 platter drives from now on. Early failures seem to double when you add a third platter, and 5 platters is just scary. You can get 250gb SATA 2 platter Seagate drives for about $110 each, which seem to have a great record for reliability so far. But when I need real SCSI reliability I'll just get a real SCSI. The warranty for most SATA drives may be 5 years, but usually it's void if you put it in a server.
Hmmm ... NOPE!
ATA is good for 2nd teir storage but I wouldn't dare put anything "enterprise level" on it, just far to slow at the moment.
There have been speculations about all these 1207 pins in AMD's SocketM2 (due somewhere in 2006) will be used for an integrated PCIe controller.
http://www.theinquirer.net/?article=24756
throw new SuccessException("Sig read successfully");
Not if you are super-l33t and run RAID-0 striping (which is apparently what these people think "RAID" is). Then it's like using two bullets in Russian Roulette.
"Yeah, boss... drive, uh, died. I'll get a nice new one-- let me... uh... take this one home. I think it's, y'know, dead. Honestly this time."
Here's the direct link about the AAC driver mentioned in the story.
I really wish there was some sort of standardisation of reviews such that ones like this could be filtered out. How it got on slashdot is beyond me!
They review a RAID edition drive yet don't even test it in RAID5! Unless reviews are thorough how are we supposed to draw anything but the vaguest conclusions. This reviews testing set should have included all these combinations:
- software vs hardware RAID solution (including hybrid semi-hardware cards like RocketRaid 1820A)
- 2,4,8 drive tests for RAID0,1,1+0,5
- synthetic tests such as the one they used or HDTach or similar as well as real world tests such as a database benchmarch, file server test
i hate to rant but these thoughtless reviews really are a waste of time.
I think the slashdot poster got it wrong. NCQ is generally a good thing. Without NCQ, the only way to get really good random-write speed is to turn on write-back caching, which is unsafe. THere's no good way the raid controller can tell otherwise which block would be best to send to the drive next, based on where the head is - only the drive knows that, and NCQ lets the drive order the writes best.
From what I understand, when writing to ATA drives, you have three options:
- writeback caching on, in which case the raid controller can't really tell whick blocks made it to the drive and which ones are still sitting in the drive's (volatile) RAM.
- No caching - send a block, wait for the drive to write it, send another. Unless the raid controller can really figure out where the head is and time it just right, this will be slow, because the writes will have to wait for the block to move under the head each time.
- NCQ (which is in SATA only). The controller gives the drive a lot of blocks to write, the drive writes them in whatever order it chooses, and sends back acknowledgements for each block after it is written. These acks let the controller mark that block in its cache as clean.
If you compare NCQ and write-back caching, you'll probably find NCQ is slower due to the overhead of acknowledgements. If you don't care about the safety of the writes, then write-back caching is probably the best choice. If you do care about the data integrity, then NCQ is probably the best choice.
Some vendors (apple xserve for example, which doesn't support SATA), say you should use write-back cache but use a UPS to make sure the drive's cache is not lost. THis is not a "mission-critical" solution. Doing this means that you've now introduced single-point (the power connection between the UPS and the drive), where if there is a failure, you can wind up with bad data. Since RAID is usually intended to prevent data loss due to single points of failure, this is not a serious solution.
I have a serious question. I have had *VERY* bad experience with SATA drives. I had a two-drive SATA raid0 for temp and cache and video processing. One drive crashed within 2 years; click of death. So of course the raid is gone, but I reformatted the remaining drive and used it by itself. Gone in under another year.
I dont remember the manufacturer but I realize it could just be a bad manufacturer.
BUT just recently the SATA drive in a friend's factory-spec dell crashed hard. No click-death but seems like serious controller damage/failure; will boot into safe mode with command prompt in about 20 minutes. Won't DIR, MD, etc. I threw int in another machine on the second SATA channel and results were the same so it was the drive.
This is the third SATA drive I've seen crash hard in the last 2 or so years.
Am I just having bad luck?
Since I wanted some facts, Wikipedia ordered two systems for database service, both dual Opterons with 4GB of RAM and six drives. One with 10,000 RPM SCSI drives and one with 10,000 RMP SATA drives. The SATA system, without NCQ, was generally faster and ended up with a higher proportion of the site load assigned to it. The SCSI system was sometimes faster in mixtures which included lots of writes with lots of reads and that made it lag a bit less in replication of bulk update operations, so newer systems have been SCSI. If more drive bays had been available, adding another couple of SATA drives would probably have made the SATA set faster for that case as well and still cheaper.
:)
If lower access times are needed, SCSI drives beat SATA drives just because you can only get 15,000 RPM with a SCSI interface. May also make sense to have 15,000 RPM drives if you're already spending a lot of money on 16GB of RAM.
The question about this drive which interests me is whether drive write caching can be easily turned off and will stay off, so you don't lose database data when the database thinks the data has been flushed to the surface but it hasn't really been flushed. If you can't do that, it's unsuitable for a lot of database work - certainly unsuitable for use with RADI controllers with battery backed up write caches, where you have the battery to make sure you don't lose cached data if the power goes off. Anyone who things colo power and UPS will protect against loss of power hasn't suffered enough yet...
Redundantly Expensive Array of Disks.
So, I have seven database servers, all with identical copies of the data. Do I really care if I lose all the data on one of them because one drive in a RAID 0 set fails? The completely redundant systems do the job better than any RAID setup can.
You consider RAID 0 when you don't care about losing the data if there's a drive failure and want the benefits of striping and the extra space available for a given number of drive bays, compared to other RAID levels. RAID 5 can get you some of the space but it's slower for database work.
The units you listed (at least the first few) look like a drive cage inside an external enclosure. I do not know where you can buy one of those pre-assembled, but you can get the three major parts easily enough and just put it together yourself.
r ebridgeboards.htm
n closures.php
t egory=15
5 drive cage (built in raid - IDE/SATA with IDE drives):
http://areca.us/products/html/ide-ide.htm
1394 to IDE bridge board:
http://www.granitedigital.com/catalog/pg19_firewi
external 4 drive enclosure:
http://www.macgurus.com/productpages/scsi/mgscsie
if you don't already have drives, you could look here:
http://www.newegg.com/ProductSort/Category.asp?Ca
Alternately, you could get the areca unit and mount it inside the computer - just check that you have 3 free 5.25" drive bays without any protrusions.
I recently picked up one of the SCSI to SATA areca raid units and it has worked well so far - one of the fans went south, and they are shipping out a new one.
RAID-0 is equally as redundant as RAID-1. There are two disks performing a job that either one of them could do alone. The second disk is redundant. RAID-0 uses the redundancy for performance instead of continuous backup.
Who the heck mod'ed that up?
Say you have two identical disks striped without mirroring ("RAID 0"). The data is interleaved across the two disks. You usually get higher performance. You always get twice the disk space. There is no redundency here; your disk space is increased. Your reliability is much lower. If you loose either disk, you loose your entire logical volume.
Say you have two identical disks, mirrored ("RAID 1"). The same data is written to both disks. Data storage is redundent but reliability is increased. You can loose either disk but still have your logical volume. You usually get higher read performance (reads can be interleaved).
With mirroring, writes have to be done to both disks. Performance is complicated. Depending on the implementation, writes can be faster, slower, or the same as other RAID levels. With shared I/O paths, writes are slower then a single disk (you have to write one disk, then the other). In theory, if you have parallel I/O paths, writes should be the same as a single disk, but slower then a striped array (like RAID 5). That's because with striping, you can distribute the writes across multiple disks. In practice, many if not most RAID implementations (even those with dedicated RAID processing hardware) lack the processing power to make this happen. So mirroring is actually faster on writes then RAID 5, because no RAID calculations need to be done.
RAID 3 is striping with a dedicated parity disk.
RAID 5 is striped partiy (both data and partiy are distributed over all disks).
The best performance and redundency typically comes from using a striped array of mirrored disk pairs. You can loose either disk in a pair without loosing data. You distribute reads and writes over all disks. No fancy XOR calculations; just simple mirroring and interleaving. Very expensive, though.
dragonhawk@iname.microsoft.com
I do not like Microsoft. Remove them from my email address.
I have a serious question. I have had *VERY* bad experience with cars. I had a two-car setup for running to the store and doing errands and picking things up.
One car crashed within 2 years; tires went flat and I couldn't steer. So of course the car is gone, but I got in the remaining car and used it by itself. Gone in under another year.
I dont remember the manufacturer but I realize it could just be a bad manufacturer.
BUT just recently the car in a friend's garage crashed hard. No flat tires but seems like serious damage/failure; the engine will start with about 20 minutes of cranking. Won't drive or turn, etc. I threw int in another garage and results were the same so it was the car.
This is the third car I've seen crash hard in the last 2 or so years.
Am I just having bad luck?
I've had enough abrasive sigs. Kittens are cute and fuzzy.
It's "etc.", not "ect.".
_Why_ are they "made for RAID use"?
I mean why is it that they are claimed to be made for RAIDs? The article claims some speciall error correction without explaining it. OK, you might think that the target audience might be to advanced for that.
On the other side he reviews hardware under only one OS where much of the performance is actually limited by the drivers. (There are many popular commercial OSes where gigabit Ethernet cards run at only a fraction of the speed of other OSes) IMHO that's a very unprofessional way to evaluate something like that.
You're a dumbass.
If your car is crashing itself, while you're away, 3 times with 3 different cars, I'd see what the common variables are.
Let your daddy back on the computer, honey.
From RAID Explained. (See link for info on other RAID configurations.)
RAID 0 uses a method of writing to the disks called striping. Let's assume you have a server with three drives of 500 MB, 1 GB and 2 GB. Normally a server would treat each of these drives individually. By incorporating striping, the system would see all of the drives as only one drive for a total of 3.5 GB. Big deal, you say. Wait, there's more.
When the system writes data to the disk, the RAID 0 striping kicks in and automatically distributes the data across all three drives. Part of a file (chunks of data) will be written to the first drive, the next part to the second drive, the next part to the third drive and then it starts all over again until the entire contents of the file have been written.
What this does is greatly increase the speed of the reading/writing process. If you have two drives on your server, it increases the speed by about 25%. If you have three drives, it increases the speed about 33%. When you consider that the main task a server is performing is reading and writing data, any increase in speed is highly welcome.
Besides increasing speed, the other benefit is that the drives can be of different sizes.
Because RAID 0 only writes the data once, it does not achieve data redundancy. If one of the drives fails, the entire system has to be restored because all files are split or striped across all drives.
Because there is no data redundancy, there is no loss of disk space.
Deltron 3030 - Virus (music video)