Where are the High-Capacity SCSI Drives?

Always That Way by VJoseph · 2004-07-23 12:57 · Score: 1

Hasn't it always been this way? It's nothing new.

Re:Always That Way by schon · 2004-07-24 02:24 · Score: 1

Hasn't it always been this way?

No, traditionally, SCSI provided higher capacity than IDE.. For example, when the largest IDE drive you could get was 130MB, 2GB SCSI drives were available.
Re:Always That Way by OnTheMoney · 2004-07-25 09:34 · Score: 1

SCSI is nice when you really do need the speed, but I think they are missing a bet when it comes to capacity.

The cost of buying myself a personal server with 1 TB of a SATA RAID array is extremely less expensive than if I wanted to do that with SCSI. A large part of that is being able to buy decent 250 GB drives to create the RAID and hot spares from. I can actually fit 1TB nicely in a 4U multi-cpu server.

However, while I'm sure the makers want to sell bigger disks with the same speed and reliability as their current SCSI disks, I'm pretty sure it's a lack of technology rather than a lack of trying that's preventing them from doing so.

As the platters get bigger, the problems of speed, access times and reliability all just get worse.

--
Healthy Info

God that's just sad by aztektum · 2004-07-23 12:59 · Score: 0, Troll

Screw SCSI Srives. It looks like we need to work on spell checker technology.

Sorry sif sI sidn't so sit someone selse swould

--
:: aztek ::
No sig for you!!

Re:God that's just sad by arcanumas · 2004-07-23 13:10 · Score: 1

He is sriving man! He is sriving to provide a good story and all he gets are the spelling nazis.
man!

--
Slashdot Sig. version 0.1alpha. Use at your own risk.
Re:God that's just sad by Anonymous Coward · 2004-07-24 04:03 · Score: 0

If you can't figure out what he is saying, you are the bigger idiot. Who cares if he made a typo, or a spelling mistake?

Where are the High-Capacity SCSI Srives? by Johnso · 2004-07-23 13:00 · Score: 1, Funny

I don't know, but it srives me crazy!

--
I'm a signature virus. Please copy me to your signature so I can replicate.

SCSI capacity has always lagged by Anonymous Coward · 2004-07-23 13:04 · Score: 0

It was the same way a decade ago. It's simply a lack of demand for the drives in general.

What speed are most SCSI drives? by Sean+Johnson · 2004-07-23 13:10 · Score: 1

Isn't it because SCSI drives are for high speeds and generate more heat? I may be out of the loop, but I really haven't heard of 10,000 RPM & 15,000 RPM drives for SATA or IDE. Maybe there are design considerations that prevent these higher performing drives from allowing more capacity. Or maybe the market demand does not warrant the manufacture of such units due to economics. In any event, I want to know if there is a 15,000 RPM SATA drive out there somewhere. I am curious!

--
>>>>>> Chewie, take the professor in the back and plug him into the hyperdrive.

Re:What speed are most SCSI drives? by cfallin · 2004-07-23 13:14 · Score: 1

There are 10k RPM SATA drives - look for the Western Digital Raptors. I think that with SATA's speed and hotplug capability, the end has finally come for SCSI. Sure, SCSI drives may on average be of a slightly higher build quality, but 1) I'm sure that'll change with more demand for SATA in servers and 2) you should be using RAID anyway.
Re:What speed are most SCSI drives? by MBCook · 2004-07-23 13:24 · Score: 3, Interesting

SCSI will be replaced by SAS, or Serial Attached SCSI. That is basically a superset of Serial ATA (IIRC). All the benefits of going from ATA->Serial ATA apply to SCSI->SAS. The smaller cables, the longer lengths, the lower voltages, etc. SCSI has had command queueing and hot-plug for a while already though.
Everyone is going serial. USB, SAS, Serial ATA, etc. Time to invest in Kellogs.
Oops, wrong "cerial".
(sorry for the pun, couldn't help it).

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.
Re:What speed are most SCSI drives? by nomel · 2004-07-23 13:27 · Score: 1

What is max throughput? Aren't the nicest SCSI srives 320MBytes/s?
Re:What speed are most SCSI drives? by MBCook · 2004-07-23 13:31 · Score: 2, Interesting

Yeah, I was thinking IDE and a standard 33mhz 32bit PCI bus. Your right, SCSI goes up the 320 and the bus can handle it if you use PCI-X, PCI express, 66mhz PCI, 64bit PCI, etc. Nice catch.

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.
Re:What speed are most SCSI drives? by walt-sjc · 2004-07-23 13:45 · Score: 3, Informative

Check out the MTBF numbers. They look similar until you see that desktop drives are rated with a low duty cycle - the typical 8 hour day as opposed to the 24 hour day servers are deigned to run.

As for real performance, my old 18G 7200 RPM IBM scsi drives are faster than my brand-new SATA raptors in real world applications (compiling the linux kernel for example.)

So here's what I do. I use my scsi drives for my everyday stuff, and archive on the SATA drives (MP3's, old source / packages, etc.) That way I get my performance and reliability, and space. Since I have two of each, I just raid mirror.

As for real world server applications, we run some Large raid arrays. We don't need the space as much as we need the performance you get with dozens of spindles spread over multiple channels on 64bit controllers.
Re:What speed are most SCSI drives? by MarcQuadra · 2004-07-23 14:02 · Score: 3, Informative

Agreed, I've said this before, but my old 18GB Ultra2Wide (80MB/sec SCSI) drive can wipe the floor with my new DeskStar 180GXP (ATA-100).

It's all about those command queues, they let the computer spit commands at the disk without having to see their immediate completion.

I actually get better performance with my SCSI drive _mounted over NFS_ than I can with my previous local 40GB ATA-66 drive.

--
"Sometimes, I think Trent just needs a cup of hot chocolate and a blankie." -Tori Amos on Nine Inch Nails
Re:What speed are most SCSI drives? by BrookHarty · 2004-07-23 15:13 · Score: 1

This is one reason I like samba shares, loading/saving fles over TCP takes less CPU than IDE. (In windows)

IDE sucks the life out of PC, even newer 3ghz+ pc's still pause when you put in a floppy or eject a cdrom in windows.

I can put a bunch of slow IDE's in a pentium 2 box, and over TCP no performance hit loading/saving files. (In windows)

BTW, this is also how bootless terminals or low class cpu's can be so smooth. My laptop on the network has slow HD IDE issues, so I put apps on the server and load, quicker and more space.

I think this is the reason I like the new Intel HT cpu's, anything to get rid of the slow factor of IDE under windows, throw more time slices at it.
Re:What speed are most SCSI drives? by Micro$will · 2004-07-23 15:55 · Score: 2, Informative

IDE sucks the life out of PC, even newer 3ghz+ pc's still pause when you put in a floppy or eject a cdrom in windows.

That's a drive and/or Windows issue. When you insert a CD, the CDROM has to spin it up to read it, and then Explorer.exe (not Internet Explorer, Windows Explorer, A.K.A. the Windows "shell") immediately wants to know what's in it, so you have a slight lag, depending on background services, the drive, the media condition, etc. You can see what's going on by opening up Explorer while there's no CD in the drive, then watching the drive's reaction, and Explorer's reaction when the drive finally reads the CD.

Some performance can be gained by keeping CDROM drives on seperate IDE devices to prevent the CD drive from hogging the bus during moderate to heavy harddisk activity. As for floppy drives... you still use a floppy drive?!
Re:What speed are most SCSI drives? by innosent · 2004-07-23 16:36 · Score: 1

I believe that's related to a threading issue buried somewhere in the windows API. A similar issue happens when a background thread (like reading from a disk, network, etc) touches an object on another thread (like to change window contents). In certain cases, this moves the touched window to the background thread, which is a problem if the background thread is blocking for an interrupt (like waiting for disk or network I/O). One of those annoyances that have been around for years, and never got fixed, only to get covered up with more layers above it (i.e. .NET).

--
--That's the point of being root, you can do anything you want, even if it's stupid.
Re:What speed are most SCSI drives? by BrookHarty · 2004-07-23 16:36 · Score: 1

I did mention it was a Windows issue. ;) But IDE still spikes more CPU than loading of programs over ethernet. You can turn off auto-start and other registry commands to make explorer.exe more cpu friendly, but the system still has more CPU spike on loading off IDE.

The normal IDE drives are fore CDroms only, most HD's go on the Raid controller (even if you dont do raid). That seperates it, like you said.

USB floppy, so I can unplug it when I dont need it. Still OS's out there you need a floppy for.
Re:What speed are most SCSI drives? by shaitand · 2004-07-23 17:10 · Score: 1

Yeah but they can't handle 320mb/sec thoughput it requires faster pci slots but those are present on pretty well all server boards and some workstation/desktop boards as well.

Also a 10k RPM scsi drive is a cheapy, generally they are 15k RPM.

You also generally use scsi in raid configuration. SATA raid devices generally don't compare to intelligent scsi raid controllers.

There is also the quality and warranty on the drives, usually 5yr or more warranty with priority replacements. The replacements are usually advance RMA as well, meaning they send you the new one upon calling and you send them the bad when it gets there.

If the data is valuable enough to justify an expenditure for scsi, you want scsi, period. SATA doesn't change that. Serial SCSI may, but not SATA. SATA will pretty much replace IDE raid (is replacing I should say) and when it's done there will be no such thing as standard IDE raid.
Re:What speed are most SCSI drives? by innosent · 2004-07-23 17:34 · Score: 2, Interesting

Some SATA RAID controllers do support the advanced features offered by high-end SCSI RAID controllers, it's just that it seems strange to spend $700 on a controller for $1000 in disks, vs. spending $500 on a controller for $4000 in disks. Most recent server boards support 133MHz PCI-X, so bandwidth to SCSI devices is not an issue. The difference is speed and quality. Still though, if you don't need the absolute fastest array (ours is about 610MB/sec read, 300MB/sec write, RAID-10 striped within, mirrored across channels), SATA is a good solution. We use 6 15k U320 SCSI disks for our primary database system (54GB total array size), and 12 7.2k 250GB SATA disks for our document imaging system (2TB, RAID-5 with hot spares). Sustained transfers from either array are practically identical, but the access times to the SCSI array is much lower (though this is partly due to differences between RAID levels, as having a RAID-10 array means that the closest stripe to the data can get priority). Either storage method easily outperforms Gb Ethernet, so the only real difference is the access times for processes performed on the local machines (such as index and data lookups for a database), and aside from a few ms difference in query time, which is usually much smaller than the running time for the query, a remote user can't tell the difference.

--
--That's the point of being root, you can do anything you want, even if it's stupid.
Re:What speed are most SCSI drives? by Anonymous Coward · 2004-07-24 08:12 · Score: 0

Oops, still wrong "cereal".
Re:What speed are most SCSI drives? by Neillparatzo · 2004-07-24 09:36 · Score: 1

"IDE sucks the life out of PC, even newer 3ghz+ pc's still pause when you put in a floppy or eject a cdrom in windows"
That's because Explorer blocks until the media is picked up and recognized, regardless of the underlying hardware. Granted, it's stupid (they really should do that in a separate thread), but it's totally not a IDE vs. SCSI issue.
How do I know? I'm using a SCSI CD-RW drive and it does exactly the same thing.
Re:What speed are most SCSI drives? by compwizrd · 2004-07-25 02:33 · Score: 1

That's only the interface, the drives are nowhere near that speed.
Re:What speed are most SCSI drives? by nomel · 2004-07-25 17:59 · Score: 1

What's the sustainable speed from a 15k RPM SCSI drive?
Re:What speed are most SCSI drives? by compwizrd · 2004-07-26 00:01 · Score: 1

somewhere around 70-100 meg/s
Re:What speed are most SCSI drives? by afidel · 2004-07-26 02:18 · Score: 2, Interesting

storage review says about 64MB/s on the outside of the platter for the best performing 15K RPM drive (which is a 74GB drive not a 140GB one). So, to swamp an U320 bus with sustained transfers you will need at least 6 drives, not the 3 that some people keep spouting around here. So if you need 6 drives to saturate the bus, why have a few high capacity drives when more drives gives you lower latency and gets you to max sustained transfer.

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
Re:What speed are most SCSI drives? by homer_ca · 2004-07-26 13:28 · Score: 1

The reason you need that speed is because the whole bus of up to 6 srives shares that 320MB/s just like ATA shares its bandwidth between master and slave. Multiply 6 x 55MB/s and you have 330MB/s.
Re:What speed are most SCSI drives? by jrockway · 2004-07-29 19:38 · Score: 1

Reading from IDE disks should not use the CPU. Turn on DMA, for christ's sake!!!

hdparm -d 1 /path/to/drive

In Windows... uh... anyone care to enlighten him?

--
My other car is first.

Time to do some reading by bconway · 2004-07-23 13:16 · Score: 1

Instead, businesses (and some individuals) wanting greater storage capacities are required to buy more drives which takes up more space, generates more heat, provides more points of failure, uses more electricity, etc.

Are you unfamiliar with the concept of RAID? That's where all those SCSI drives are going, and it most certainly does not add more points of failure as it pertains to systems. Business do not want high-capacity single SCSI drives, especially when they can pile together 146 GB drives.

--
Interested in open source engine management for your Subaru?

Re:Time to do some reading by UnrefinedLayman · 2004-07-23 14:24 · Score: 1

I think the point is that they can pile 4 250 GB drives into a server to have a terabyte array, or they can pile in 7 146 GB drives to get the same result.

Being limited to 146 GB drives means you are limited in scaling, which, of course, is what RAID is all about, as you've pointed out.
Re:Time to do some reading by DAldredge · 2004-07-23 14:37 · Score: 1

But the 7 r/w heads in those 7 drives can make for a faster setup for some/most workloads.
Re:Time to do some reading by UnrefinedLayman · 2004-07-23 14:50 · Score: 1

While it's true that it will increase the speed of the array, it isn't true that the array scales as well. You're limited by the size of the disks for the total size of your array; it all depends on what type of controller you can get, of course, but you're hard pressed to find many controllers that will support more than ten drives.
Re:Time to do some reading by shaitand · 2004-07-23 17:16 · Score: 1

Which is where more advanced raids come into play, raids of raids and so forth.
Re:Time to do some reading by shaitand · 2004-07-23 17:24 · Score: 2, Insightful

"Business do not want high-capacity single SCSI drives"

Why? they make for higher capacity raids.

The more devices the controller has to be able to handle the more expensive, also although more drives means better overall performance, the overall efficiency goes down.

The big thing is like beggers home users have pretty much been locked out of SCSI. Even a single scsi drive yields better performance than an IDE drive.

If scsi drives were offered widely in home pc's, there would obviously be a performance increase, but also most of the artificial price increase would disappear from them. Manufacturing methods would also be improved to support the increased demand and controllers would be included onboard as IDE are now. Before you know it SCSI would be just as cheap as IDE is today.

Wouldn't that be better?

It would be good for other things too, for instance faster buses would have already taken off in your standard desktop instead of just being a secondary bus in a few desktop boards and server boards.

The result in the end would be cheaper, faster, more reliable storage and bus technologies for everyone. Too bad the drive manufacturers know that and do their best to keep scsi on the server ;)
Re:Time to do some reading by UnrefinedLayman · 2004-07-23 18:19 · Score: 1

Again, the need to implement more complex and advanced systems, which have their inherent pitfalls, can be averted simply by having larger disk sizes.
Re:Time to do some reading by shaitand · 2004-07-23 18:58 · Score: 1

Yes but larger disks don't provide the redundancy of a raid of raids.

However many disks worth of space you've lost in your raid to parity, that's more or less how many drives have to fail before you've lost data and the raid goes down.

Depending on the scheme you use, the same is true in a raid of raids, except with raids. That many raids, have to lose that many drives, before you've lost data.

I agree however, simply because there are benefits to large numbers of small disks, doesn't mean there aren't benefits to large disks as well.

People seem to have this idea that a large disk means just one. Hardly, you can have 15 large drives just as easily as you can have 15 small ones. And the price of every smaller capacity drive drops a notch when the larger capacity disk comes out. If 250gb scsi drives came out today, within 3months they would occupy the price position that 146gb scsi drives hold now and those who do want only 146gb would get it cheaper.
Re:Time to do some reading by Evo · 2004-07-23 22:32 · Score: 1

Lets see... with RAID 5+1, you get..

4x250GB = 500GB of storage
7x146GB = 730GB of storage

For the same redundancy, you get nearly 50% more space by using 146GB drives, using your numbers.
Re:Time to do some reading by gl4ss · 2004-07-23 23:16 · Score: 1

**Business do not want high-capacity single SCSI drives, especially when they can pile together 146 GB drives.**

that's failed logic. that's saying that businesses don't want more space. of course they want more space, higher capacity, more reliability and faster speeds and you don't lose reliability at all if the drives are as reliable and as many.

however.. as to the original poster: just buy some damn sata drives.

--
world was created 5 seconds before this post as it is.
Re:Time to do some reading by UnrefinedLayman · 2004-07-24 03:33 · Score: 1

And yet you choose to select a very rarely used RAID implementation?

With RAID 5 you get...

4x250GB = 750GB of storage 7x146GB = 876GB of storage

Now let's assume the backplane only supports a total of ten drives:

10x250GB = 2.25TB of storage 10x146GB = 1.31TB of storage

Gee, which solution scales better?

And let's say we wanted to be really crazy and implement RAID 6, a more practical solution than RAID 5+1, and more likely to be implemented in the real world:

10x250 = 2TB of storage 10x146 = 1.2TB of storage

The fact of the matter is the needs for storage space are growing, particularly for small to medium business who have a desire to retain data for longer periods of time. When they go to Dell and order their server, which supports a maximum of four drives, are they going to be happy knowing that their server is going to be limited to 438 GB, and that to upgrade from that would require ditching all four drives and purchasing four new drives of a higher capacity a year from now?

Not many small to medium businesses are jumping at the opportunity to purchase external storage solutions to give them the expandability that they need, either, primarily because of the cost. They're also not going to be implementing RAID 5+1 (or even RAID 6). 90% of them are going to end up using RAID 5.

Even with all that said, there's only one salient point: given an equal number of drives, larger disks translates to larger arrays. Larger arrays = GOOD, Napster = BAD.
Re:Time to do some reading by itwerx · 2004-07-24 06:38 · Score: 1

you're hard pressed to find many controllers that will support more than ten drives

Most SCSI RAID controllers have at least two channels. At 13 SCSI devices each (14 minus the controller's ID) that's 26 devices. A three channel (also very common), or four channel (much less common) would handle 39 and 52 devices respectively. Not to mention you can have more than one card (some even have cross-connects for a true failover configuration).
Worst case you have one el-cheapo single channel - you still can have up to 13 devices!
I think your thoughts of ten must be a holdover from the bad old days of narrow SCSI and RAID limited to 5 devices (2 channels, two RAID-5 = 10 devices)
Re:Time to do some reading by foidulus · 2004-07-24 08:31 · Score: 1

Heh, Apple used to use SCSI as the standard, but they switched(ironic) a while ago. For the home user, there isn't really the need for that much power.
I know, it's ironic that that when computers are advertised, they usually have the most blazing fast cpu and crap for the rest(thought this is improving). For the money, more RAM would probably help users out better than a SCSI disk(or 2 since the advent of high end video games, mp3s and movies has really caused the demand for storage to soar)
Re:Time to do some reading by Anonymous Coward · 2004-07-24 15:07 · Score: 0

Please explain how Apple switching from SCSI is ironic. You fucking pathetic sorry excuse for a humanoid MORON!!
Re:Time to do some reading by shaitand · 2004-07-24 23:52 · Score: 1

"For the money, more RAM would probably help users out better than a SCSI disk(or 2 since the advent of high end video games, mp3s and movies has really caused the demand for storage to soar)"

I doubt it, since 90+ percent of them are running windows and windows starts to swap before you hit the desktop, no matter how much memory you have. That means the hard drive is your bottleneck, not memory.

On a linux, bsd, or pretty much anything not windows I'd agree, you need to put in enough memory that it's rare for the system to need to swap. If you have less than that, you'll see a bigger performance increase from the added memory (since you've eliminated the disk bottleneck, memory is faster than a scsi drive).

Don't believe me? Boot up your old p1 200, then put a pci ide controller in it and ghost the drive to a fast modern HDD. Don't change anything else and see the performance difference it makes.

Apple used shitty scsi for starters, I won't go into why, you can look it up if you care to. But SCSI was also too expensive then, without the pc guys using it, the price never went down. One of the things I don't like about apple, they dropped the scsi, but didn't drop the price ;)
Re:Time to do some reading by router · 2004-07-26 05:54 · Score: 1

Yeah, but for windows you can turn off swap and run in RAM and get blazing (for windows) performance. You just don't have any overhead for leaky apps. I used to do that until my memory load exceeded the available RAM (which was maxed out for the system). Only time it was slow was for initial system load and app load, other than that it was pretty darn quick. Good SCSI does make a huge difference tho even on desktop systems, I have another box with a decent SCSI card in it and it is noticably faster on the subjective scale than the IDE boxes. Not sure about the cost trade off for it tho; RAM is cheap once and with SCSI you keep paying.

andy
Re:Time to do some reading by tepples · 2004-07-27 11:22 · Score: 1

Larger arrays = GOOD, Napster = BAD.

I understood you to this point. But what does an online music store have to do with hard disk arrays?
Re:Time to do some reading by UnrefinedLayman · 2004-07-27 12:40 · Score: 1

A long time ago, in a galaxy far, far away, back when IPOs were flush and the beer flowed like wine, there was a flash comic that depicted Lars Ultrarich and James Hetfield as two goons sue-happy against Napster users.

Hetfield, being a big, dumb idiot, was depicted as a big, dumb idiot relegated to simple phrases, such as "NAPSTER, BAD!" He was rather cave-man-like. I believe one full phrase went, "MONEY, GOOD! NAPSTER, BAD!"
Re:Time to do some reading by JWSmythe · 2004-07-28 17:01 · Score: 1

Actually, some businesses *DO* want the capacity.

We bought two Promise 15100 arrays, and put in 30 250Gb drives. Sometimes an array is wanted to be sometimes large, sometimes redundant, and in this case, both.

--
Serious? Seriousness is well above my pay grade.

My Guess by MBCook · 2004-07-23 13:20 · Score: 4, Insightful

My guess is a simple one. Who buys SCSI stuff? It's expensive so it's mostly businesses and others who need high reliability (which one of the major reasons SCSI is more expensive). Now while normal people can "afford" to lose 250, 300, or more GB of data, for a business that could be worth billions of dollars.

The solution to this reliability problem is the RAID. There are two RAID levels that are ideal (there are more, but this is a simple explanation). There is 1, which is just a mirror; and 5, which is striping with parity.

With RAID 1, if you have 500 GB of data, you would need 2 500 GB drives. You lose 50% of the capacity you buy. The other option is RAID 5, where you lose (1/number of disks). So you could store 500 GB of data on 6 100 GB disks. This way you've only lost 100 GB of storage to redundancy as opposed to 500 GB.

So when businesses want to store large ammounts of data, it's more economical to use many smaller drivers than to large drives. Even if you don't need the redundancy (for example the disk is just being used for temporary storage while working on large digital picture or video files) they it's still better to use many small disks. While using a 500 GB drive will only go so fast (lets just say 60 MB/s sustained), by using a RAID, you can mulitply that. So by using 5 100 GB drives, you might be able to sustain 300 MB/s (assuming the bus can keep up, etc). Even if you only scale at 50% (that would be 150 MB/s) that's still 2 to 3 times faster than a single drive. That performance can save you money.

So, if you can afford it you can get much better performance or economics from using multiple smaller drives from one large one.

That's my theory/understanding. Begin tearing it apart!

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.

Re:My Guess by Robbat2 · 2004-07-23 13:54 · Score: 4, Informative

RAID is a wonderful concept, but work needs to be on points of failure other than the drives.

Most decent external RAID units today have dual hot-swappable dual power supplies and fans. However there is still only a single backplane and RAID controller board (IBM PowerPC chips are very popular for this) involved. I've both a backplane and controller a fail on me in the span of 2 years, in both cases taking all the data with them. These units were 6x200GB IDE drives, 1TB usable, 1 parity drive, and we had several cold spares available to hot-swap in on a failure.

Sure I agree that statistcally your drives, fans and power supplies are much more likely to fail than the backplane or controller, but it can still happen.

Never forget the important of having backups, and make sure you can recover from them as part of implementing your backup solution. (1 month rotation of Ultrium tapes here).

There is a solution to the above, but it's very costly, and that's RAID over distributed storage (iSCSI and the like).

--
ICQ# : 30269588
"I used to be an idealist, but I got mugged by reality."
Re:My Guess by Alphanos · 2004-07-23 13:59 · Score: 2, Informative

I think it is more likely that high rotation speed doesn't combine easily with high capacity. If you are spinning the disks more than twice as quickly as standard ATA drives (15k vs. 7200 rpm), then having the same data storage density isn't going to work without new technological developments. In other words, when the disk reading head moves at twice the speed, the bits need to be roughly twice as large. This is why the first CD drives didn't read at 52x: they needed time to develop the technology that allows the reading of that data density at that high of a speed.

I'm not familiar with which mathematical formula is involved, but from this perspective, 150 gb scsi drives operating at twice the speed seems reasonable compared to 300 gb ata drives. I suspect a similar reason is responsible for the low capacity of the 10k WD Raptors (serial ata drives) which have capacities of only 36 or 72 gb!

--
Alphanos
Re:My Guess by keesh · 2004-07-23 14:05 · Score: 1

I guess you've never used an IBM 2105 Enterprise Storage Server. Totally redundant, dual controllers, SSA loops to avoid failures in SCSI cables. You *can* get fully redundant kit, but as it's aimed at mainframe and high end people, it's about half as much again as, say, a 7133.
Re:My Guess by lylonius · 2004-07-23 15:31 · Score: 2, Interesting

You make good arguments, but reliability and storage capacities are only two of the issues involved.

The largest benefit is performance. Gamers invest so much in their system bus, cpu, and memory, but disk i/o is 5 orders of magnitude slower. if performance is key, a small investment in SCSI improves disk intensive apps considerably.

1. IDE requires CPU cycles. SCSI buses have embedded ICs that handle queuing of data and such, freeing the CPU to perform other tasks.

2. IDE channels are shared. Most IDE ribbons allow for two devices, but one device can talk on the channel at a time, much like CSMA/CD, whereas SCSI allows you to daisy-chain 7 or more devices to simultaeneously talk on the same channel.

3. IDE is not bidirectional. similar to (2), this causes read/writes to wait.

one reason that you didn't mention, that falls under reliability is SCA. this interface combines signalling, power input and data i/o, which enables hot-swappable SCSI drives, critical to any non-appliance or diskless system that requires high availability.
Re:My Guess by shaitand · 2004-07-23 17:44 · Score: 1

The first problem with that is that scsi drives usually carry a 5yr warranty, as opposed to the 1yr warranty IDE drives carry.

The second is that in terms of performance there is a reduced efficientcy for every drive added to a raid.

The third is that controllers to handle an increased amount of drives are orders of magnitude more expensive and you can only have 15 devices on a scsi chain. With 146gb drives that gives you a max of 2.1TB on a chain, with 250gb drives that becomes 3.7TB on a chain, yeah it's only a terabyte, a terabyte is nothing right?

Also lets talk about speed, the max speed you can get on scsi is 320mb/s sustained, a single scsi drive can give you that, a 3 drive can give you that all the time. A 4 drive will only give you that. a 5 drive raid will only give you that, etc etc etc.

Once you have at least 3 drives will give you that 320mb sustained, what your really talking about now isn't sustained transfer, but latency. When I request data, how soon does the transfer start. This is critical for some applications, like most databases and webservers, in which requests will be for small amounts of data but need to be served quickly.

There is actually some overhead with a raid in every access request, but this is actually where a scsi raid excels and you want more drives. If you have 3 drives, and 3 simultaneous requests, each request will be served with transfer time + access time + minimum controller overhead. But if you have 4, the 4th will be queued. If you had 4 drives it wouldn't be. In theory, the more drives you have, the more simultaneous requests you can handle with a minimum delay.

So in short, there is no reason why people wouldn't want larger drives, there IS reason to want more smaller drives, but it depends on your needs. You have to weigh the amount of data being stored, the size of your average request, and the number of simultaneous requests.

People argue whether more smaller drives, or a multiple of larger drives is faster, and the truth is that it depends.
Re:My Guess by innosent · 2004-07-23 17:51 · Score: 2, Informative

Higher-end controller cards (read: NOT IDE) can share a bus with another controller, allowing two systems (with a controller in each) to share access to a single (external) array, with dual power supplies, and technologies like SSA allow even the cabling to be redundant.

Of course, by the time you spent the money on this type of setup, you could probably have purchased another complete machine, with another array in it, and used software to handle redundancy and updates to the array. We did this with our SQL server setup. We have two machines with redundant power supplies and RAID-10 arrays, setup in a load-balancing cluster with one as an updating subscriber to the other (updates are sent between the machines real-time, losing about 10% on write performance, but doubling read performance). If you share the array between machines, you will save the 10% or so on write performance, but won't gain the read performance, so it all depends on your primary usage for the storage system. Of course, you can't completely avoid downtime (something WILL happen eventually), but by having a separate system, you can reduce the chances of having downtime, and reduce the length (since only one has to be up).

--
--That's the point of being root, you can do anything you want, even if it's stupid.
Re:My Guess by innosent · 2004-07-23 18:03 · Score: 3, Insightful

Gamers? How many gamers really NEED large (>147GB) disks? SCSI drives are not produced for gamers, they are produced for business workstations and servers. I agree with you about SCSI being better, but the reasons you gave don't apply to all IDE controllers (number 1), and certainly not to all SATA controllers/disks (all reasons). A GOOD (i.e. usually not onboard, probably something from 3ware, etc.) SATA controller has a processor, command-queueing, separate, bi-directional channels for each device, and SATA connectors are designed for hot swapping (better than SCA actually, even to the point of connections being made in sequence due to staggered pins). I've got a 12-disk SATA RAID-5 array at work, and don't have any of the problems you listed, because I hand-picked the hardware to avoid those (and other) limitations. If you really want your games to run as fast as possible, then it's going to cost a few thousand dollars anyways, and if you really need that much space, maybe it'd be a good idea to buy a decent controller.

--
--That's the point of being root, you can do anything you want, even if it's stupid.
Re:My Guess by Anonymous Coward · 2004-07-23 18:03 · Score: 0

much like CSMA/CD

which is why ethernet (well most folks anyways) went Full Duplex.
Re:My Guess by innosent · 2004-07-23 18:07 · Score: 1

Hyundai's have great warranties, too, but you still have to walk if it's in the shop, whether it's covered by a warranty or not.

That is all, the rest is absolutely correct, except to mention that if you're running a Windows OS, the maximum volume size is 2.4TB anyway.

--
--That's the point of being root, you can do anything you want, even if it's stupid.
Re:My Guess by megabeck42 · 2004-07-23 18:43 · Score: 3, Informative

1. No Longer True.

This has, in large part, disappeared with the advent of UDMA. It was true that IDE was very cycle expensive a decade ago when the IDE really meant Internal Disk Eletronics. The IDE "interface" was just a set of tri-state latches and the CPU would be responsible for pushing and reading every single byte. If you ever look at the pinout for an IDE cable, it's no surprise that it very closely resembles the ISA bus. Another historical note, ATA means AT-Attachment because the first set of IDE drives that were really popular were designed to attach to the IBM PC AT (the successor of sorts to the IBM PC XT) bus.

Now, processors queue dma requests in and out of the drive and the "interface" really has grown up to be more of a "controller." They're not as complex as the SCSI adapters, of course, but then again, SCSI is a much more complex signaling system.

2. No Longer True.

What you're trying to describe is called as "bus disconnect." I'm not sure which side of the bus was responsible, however, the idea is that while a drive was processing a command, the bus was locked until the command finished.

Note, the first version of SCSI did not have Disconnect either. However, given many more devices sharing the bus, bus contention was more severe, especially using slow devices like tape drives and cdroms, that it became necessary rather than just a feature.

SCSI supports disconnection as well as Tagged Command Queueing. TCQ allows the host to issue multiple outstanding commands to the device. The device is allowed to complete these commands out of order. Many drives will reorder the requests to take advantage of the head movement.

Recent revisions of IDE include support for TCQ.

I will add, however, that it is still worthwhile to have only one device per channel. Compare this to putting more than two 15K drives on a U160 channel.

3. Not even remotely true. SCSI is a parralel bus, much like IDE, ISA, or half a dozen others. Its only possible for one device to drive the bus at one time. This is clearly evident since a few of the lines in the SCSI cable are used to indicate the Target of the bus transaction. There is only one set of these signals, therefore, there can only be one target.

Also, the electrical interface for Serial ATA is designed with hot-swap in mind.

While your first suggestion is accurate, disk i/o is very slow and SCSI equipment tends to be of better quality than IDE hardware. SCSI drives with higher spindle speeds have much lower latency, which can lend a dramatic difference to a similar computer with IDE drives. However, that difference is of no fault of IDE. I would encourage, you, in future to be more accurate with your information.

If you believe I have written inaccurately, I would recommend reading the draft documents from INCITS T13, the ATA technical comittee.

--
fnord.
Re:My Guess by shaitand · 2004-07-23 18:51 · Score: 1

"Hyundai's have great warranties, too, but you still have to walk if it's in the shop, whether it's covered by a warranty or not."

What sort of idiot runs anything that needs multiple TB of storage and doesn't keep extra drives on hand? Also in a raid 5 configuration, you haven't lost data, and the raid isn't down simply because a drive has failed.

"That is all, the rest is absolutely correct, except to mention that if you're running a Windows OS, the maximum volume size is 2.4TB anyway."

True that, although there really aren't many things you'd need 2.4TB that could be done with a windows OS, or if they could be, that anyone would.
Re:My Guess by MrResistor · 2004-07-23 20:30 · Score: 1

There's really not much you can do about having only one backplane. Most RAID manufacturers deal with this by not putting stuff that really matters on the backplane, they're typically just pass-through with maybe some passive filters.

However, every RAID unit I've dealt with has at least had a slot for a redundant controller. Of course, these are SCSI RAIDs. I guess now you know what the price difference is all about.

That said, unless there's something extremely screwed up about the design of your RAIDs, there's no reason you should lose data from the failure of a part that isn't a drive. You should be able to replace a faulty controller or even backplane, and as long as the drives get put back in the same slots, everything should be back to normal when you bring it back up. I strongly suspect that either you weren't careful about keeping the drives in the same order, or you remade your filesystem unecessarily.

--
Under capitalism man exploits man. Under communism it's the other way around.
Re:My Guess by Alex · 2004-07-23 22:54 · Score: 1

Higher-end controller cards (read: NOT IDE) can share a bus with another controller, allowing two systems (with a controller in each) to share access to a single (external) array, with dual power supplies, and technologies like SSA allow even the cabling to be redundant.

Of course, by the time you spent the money on this type of setup, you could probably have purchased another complete machine, with another array in it, and used software to handle redundancy and updates to the array. We did this with our SQL server setup. We have two machines with redundant power supplies and RAID-10 arrays, setup in a load-balancing cluster with one as an updating subscriber to the other (updates are sent between the machines real-time, losing about 10% on write performance, but doubling read performance). If you share the array between machines, you will save the 10% or so on write performance, but won't gain the read performance, so it all depends on your primary usage for the storage system. Of course, you can't completely avoid downtime (something WILL happen eventually), but by having a separate system, you can reduce the chances of having downtime, and reduce the length (since only one has to be up).

You can get fully redundant SATA RAID devices, controllers, PSUs, fans, etc - for less that 10000GBP.

Alex
Re:My Guess by innosent · 2004-07-24 10:11 · Score: 1

True, but if Hyundai made hard drives, chances are they would all fail at the same time (about the computer equivalent of 24,000 miles), and it would be down. Besides, replacing drives costs money too, warranty or not, because it takes up your time, especially when UPS loses the replacements, crushes them, and then delivers them to the wrong address (at least, that's what they do with my replacement DSL modems).

--
--That's the point of being root, you can do anything you want, even if it's stupid.
Re:My Guess by drinkypoo · 2004-07-24 10:31 · Score: 1

While perhaps IDE SHOULD be as fast as SCSI, it never is. while it SHOULD use no more CPU than SCSI, it doesn't seem to work that way.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:My Guess by Anonymous Coward · 2004-07-24 21:22 · Score: 0

SCSI is a parralel bus

I hope you're aware that SCSI stands for Small Computer Serial Interface.
Re:My Guess by shaitand · 2004-07-25 00:28 · Score: 1

"True, but if Hyundai made hard drives, chances are they would all fail at the same time (about the computer equivalent of 24,000 miles),"

Maybe, fortunately with harddrives it generally doesn't work that way :) Some will be DOA, some will drop off in two months, some in a year, some in two, some in ten. Depends on the conditions, the drives of course, the luck of the draw, whether the groundhog saw it's shadow, that sort of thing.

"Besides, replacing drives costs money too, warranty or not, because it takes up your time, especially when UPS loses the replacements, crushes them, and then delivers them to the wrong address (at least, that's what they do with my replacement DSL modems)."

It can, but most of the guys doing this are on salary, and would be reading slashdot getting paid if they weren't replacing a drive and getting paid.
Re:My Guess by compwizrd · 2004-07-25 02:41 · Score: 1

I hope you're aware you're wrong.

SCSI stands for small computer system interface

SAS (serial attach SCSI) will be serial though.
Re:My Guess by megabeck42 · 2004-07-25 03:13 · Score: 1

Not on this planet. Here, SCSI is an acronym for:

Small Computer System Interface

Check the ANSI document X3.131:1994[1999]

You may be thinking of some serial adaptations of SCSI, like SBP, Serial Bus Protocol, or SAS Serial Attached SCSI. ... next time, try google.

--
fnord.
Re:My Guess by innosent · 2004-07-25 04:35 · Score: 1

"It can, but most of the guys doing this are on salary, and would be reading slashdot getting paid if they weren't replacing a drive and getting paid."

Very true, except for the UPS issue. Seriously, I had 3 modems coming for some remote offices last week, one was lost, crushed, and delivered to the wrong address, and the other two were two days late.

--
--That's the point of being root, you can do anything you want, even if it's stupid.
Re:My Guess by MemRaven · 2004-07-25 07:36 · Score: 1

If the controller goes really wonky and starts writing bad data to the drives, then it doesn't matter if the drives are in good shape or not, if the data on them is wrong.
For example, if you send a block of 00000 to your RAID array, and the controller barfs and actually tells the drives to write 00100, then it doesn't matter if all your drives are okay, the data on them is actually wrong, meaning that your controller corrupted your data.
Re:My Guess by MrResistor · 2004-07-25 18:15 · Score: 1

That's true, but then that isn't a problem that's going to be solved with redundancy[1], which was what the grandparent was griping about the lack of.

Also, while this is theoretically possible, I've never seen it happen. I troubleshoot RAIDs for a living, and I probably average about a controller a day between my various fixtures (several different chassis from several different manufacturers) over the last 2 years, so I don't think that's due to lack of exposure. In my experience, controllers either work or they don't, though it may be that they only fail under a very specific set of circumstances. I've seen controllers do some wierd stuff, but I've never seen them write bad data (and yes, my tests would definately pick that up).

[1] I suppose you could add some ECC as part of the failover system. Maybe they do, and I've just never seen it come into play. Failover is typically the last thing I check for, and then only on units we intend to ship to a customer. The bulk of my testing is done without redundancy. Graceful failures are harder to isolate.

--
Under capitalism man exploits man. Under communism it's the other way around.
Re:My Guess by afidel · 2004-07-26 01:46 · Score: 1

HP high end servers DO in fact use redundant systems to check every calculation and every data transaction. Every calculation is done by two cpu's and if the results aren't identical it is run again, if they still aren't the same both CPU's are taken offline and the thread is migrated to another part of the system. Likewise all data path's are redundant and ECC'd. This costs LOTS of money, but if you are paranoid about data corruption/loss then that's what you pay for. Btw controllers going crazy isn't all that uncommon, every field tech I've met has at least one story of a crazy RAID controller necessitating going to tape, often from weeks before the repair call since the controller had been silently failing for some time before the corruption was noticed. Luckily I've never heard of it happening on a DB server, that would be one heck of a mess.

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
Re:My Guess by MrResistor · 2004-07-26 18:36 · Score: 1

HP high end servers DO in fact use redundant systems to check every calculation and every data transaction. Every calculation is done by two cpu's and if the results aren't identical it is run again, if they still aren't the same both CPU's are taken offline and the thread is migrated to another part of the system. Likewise all data path's are redundant and ECC'd. This costs LOTS of money, but if you are paranoid about data corruption/loss then that's what you pay for.

Interesting. That's a little beyond the scope of the kind of RAIDs we're talking about here, though.

Btw controllers going crazy isn't all that uncommon, every field tech I've met has at least one story of a crazy RAID controller necessitating going to tape, often from weeks before the repair call since the controller had been silently failing for some time before the corruption was noticed.

I'm sure all of our field service guys have a story like that, too. Trouble is, I'm the one they send those bad cards to for failure analysis, and I've never seen one do that. In my experience the data corruption has always happened elswhere in the system, like on the codec card (these are video servers).

Of course, I've also seen field service guys swap out entire RAIDs trying to solve a problem that could not possibly have been caused by the RAID, so I don't put a lot of stock in the stories of field service guys.

--
Under capitalism man exploits man. Under communism it's the other way around.
Re:My Guess by Bios_Hakr · 2004-07-27 08:19 · Score: 1

See DRBD. It's 2 computers that mirror each other in a RAID1 fassion...

--
I'd rather you do it wrong, than for me to have to do it at all.
Re:My Guess by alexburke · 2004-07-28 18:13 · Score: 1

You, my friend, know your shit.

I suggest you contribute this (and more) to Wikipedia.
Re:My Guess by Anonymous Coward · 2004-07-29 05:32 · Score: 0

Actually it stands for Small Computer Systems Interface, and when you have more than one drive on a single SCSI channel, those drives are in parallel series to each other.

SAS - development by johnjones · 2004-07-23 13:26 · Score: 1

would you put effort into product development when you where already spending money on Serial Attached SCSI ?

well the storage people do not think it wise to spend the money...

iSCSI and SAS are good things !
(pitty there is not a MacOS X driver for iSCSI...)

regards

John Jones

Re:SAS - development by Anonymous Coward · 2004-07-23 14:03 · Score: 0

Just to clarify your point, SAS is basically SATA++, and IIUC, you'll be able to mix SAS and SATA devices on the same controller.

Just a thought... by kernelistic · 2004-07-23 13:46 · Score: 1

Could it be that the various manufacturers have a large stock of the smaller drives that they're trying to get rid of before putting larger ones to market?

Maybe it is due to the fact that SCSI storage has typically doubled in size... 9.1, 18.2, 36.4, 72.8, 145.6... Could it be that they're currently testing 291.2GB disks?

My $0.02.

THE ANSWER by icandodat · 2004-07-23 13:47 · Score: 5, Informative

This info is from an IBM Magnetic Storage Engineer. The reason is that the IDE market is a retail home market and very competitive. He said "If an IDE manufacturer can save 5 cents on a component he'll buy the cheaper one". The time from R and D to store shelf is less than a year. For SCSI drives on the other hand are primarily for servers and they have expensive components and are tested for a long time before they reach the market. The time from R and D to store shelf is about three years for SCSI. what was the bigest drive you could buy three years ago (ide)? Thats right about the same size as the biggest SCSI drive today. So ... what does this mean? IDE drives suck, they are cheap they are the zip lock bag of the storage industry. If you are going to grandmas with your data thats ok but if its going to the moon... buy tupperware, (SCSI).

Re:THE ANSWER by Anonymous Coward · 2004-07-23 15:16 · Score: 3, Interesting

I always find comments like this amusing. It's just not true.

Not long ago I had to set up a several terabyte array (around 4 TB) using SCSI drives. We were constantly replacing the damn things. And this was supposedly quality hardware from Sun. Now, with as many drives as we had, there were bound to be failures. Eventually the failure rate stablized at about 1 or 2 drives per month. A rate which continues to this day, some 3 years later.

Previous to that array I had helped set up a similar system using PC components and IDE drives. The array was actually nearly twice as big at around 7 TB but cost less than the SCSI array. Guess what? In the last 3 years only 1 drive has failed. One drive.

Which one is more realible?

Fuck SCSI.
Re:THE ANSWER by hawkbug · 2004-07-23 16:02 · Score: 1

To my surprise, I have found the same results. I've had 5 SCSI drive failures and 1 IDE drive failure in some servers I maintain over the last 2 years. That absolutely shocks me from what I understand about the hardware. However, I do have some extremely high volume servers that have lasted 4 years - and they are still using the same SCSI drives since day 1. I continue to have faith in those servers while running RAID 5 like they are. I just think it's luck, or maybe the SCSI drives spin faster (10k vs 7.2k RPM) which makes them wear out faster, I'm not sure.
Re:THE ANSWER by Kevin+Burtch · 2004-07-23 16:56 · Score: 2, Interesting

I'm very curious which Sun array this is, and which drives you are using.

I've worked in the Sun market for well over a decade, and I haven't seen failure rates like you're describing since the old Seagate 2.9G 5-1/4" full-height drives they used to have in their "Mass Storage" cabinets (the ones that looked exactly like a SPARCcenter 2000)... and that was only after the drives were out of production for a few YEARS (all replacements were refurbs).

My guess is you have serious environmental issues... heat/humidity due to a non-datacenter environment (do you have raised-floor-cooling? is it under 70F?), or non-isolated air (is the A/C air-handler the same one used for the rest of the building?), or you have non-isolated power and have regular spikes (in Miami maybe?).
You need a _true_ UPS system, not an SPS labeled as a UPS.

--
- Preferences: Solaris 10 (servers), Ubuntu (desktops), Solaris 11 (personal servers) -
Re:THE ANSWER by Phillup · 2004-07-24 02:31 · Score: 2, Interesting

Were the IDE and SCSI drives rotating at the same speed?

--

--Phillip

Can you say BIRTH TAX
Re:THE ANSWER by nbvb · 2004-07-27 11:31 · Score: 1

I call BS.

You really should check the Seagate 18.2 GB FC-AL disks. They're crap. The firmware is crap, the drive is crap, and the failure rate is WAY WAY WAY too high.

I can't tell you how many times I've seen an entire loop on an A5200 go offline because a single disk was failing.

Piece of ..... drives. Not to mention the multi-initiator bug, where the drive locks up both the A and B bus. That does WONDERS for clustered environments.

The active system can't access the disks, so it attempts a failover. Can't deport the disk group, so it shoots itself in the head .... Then the secondary node can't import the disk group either, so it shoots ITSELF in the head, and now you have 2 downed nodes.

Lovely.

Yeah, Sun's storage sucks. It's sucked since the A3500 hit the market. The only thing they sell that isn't garbage is the D1000 (how hard was that to build? It was outsourced anyway...) and the HDS storage (9970/9980)...

They do exist! by MarcQuadra · 2004-07-23 13:53 · Score: 4, Informative

Hitachi/IBM produce the 300GB UltraStar 10K300, which is a mighty drive if I've ever seen one.

The real reason is that when you move up to higher rotational sppeds to reduce latency, you have to reduce density relative to the motion of the disk under the head, so a 10K drive can generally pack only 60%-ish as much data per-inch as a 7200RPM drive.

The same can be seen in 15K disks, which are much lower density than their 10K counterparts. The 15K platters are smaller too, to keep them from flying apart.

Do you remember when the 5400RPM disks had higher capacity than the 7200 ones? I sure do, it was for the same reason.

Until the latency of the read-write head improves this will be the case.

--
"Sometimes, I think Trent just needs a cup of hot chocolate and a blankie." -Tori Amos on Nine Inch Nails

Re:They do exist! by Pegasus · 2004-07-23 22:51 · Score: 3, Insightful

Heck, give me then 3600rpm disks with transfer speeds of 20mb/s and capacity of 2Tb! I'd gladly have dozen of them to put my dvd collection on.

I've heard some things about the new Hitachi 400gb drive being optimized for tv settop boxes. Does that mean that it's optimized for linear reads/writes? If so, why did they not decrease rpm in order to gain more capacity?
Re:They do exist! by AKnightCowboy · 2004-07-24 03:36 · Score: 0

Heck, give me then 3600rpm disks with transfer speeds of 20mb/s and capacity of 2Tb! I'd gladly have dozen of them to put my dvd collection on.
You can easily get a 7200 RPM 250GB (which is just around 2Tb) ATA disk drive for under $200. And 20mb/s?? My LAN transfers data faster than that. These drives will easily top out over 45MB/s. Also, I store my DVD collection quite easily on these 250GB disks in a RAID setup.
Re:They do exist! by MarcQuadra · 2004-07-24 05:21 · Score: 1

I think you'll be seeint this sort of thing once the 64-bit paradigm shift is complete. The entire system, disks and all, will be mapped-out to the address space. All you have to do is load the system with RAM, I'm talking about 64GB of RAM, and have the huge storage disk sync with the RAM every now-and-then. All of your OS and apps, and most of your recent documents will reside in RAM, the rest will shuffle off to the disk when it gets 'cold'.

--
"Sometimes, I think Trent just needs a cup of hot chocolate and a blankie." -Tori Amos on Nine Inch Nails
Re:They do exist! by Anonymous Coward · 2004-07-24 15:37 · Score: 0

That's a great idea, until there's a system crash due to flaky hardware, drivers, or software... or a power failure...
Re:They do exist! by SuiteSisterMary · 2004-07-26 04:10 · Score: 1

You can easily get a 7200 RPM 250GB (which is just around 2Tb)

You do mean '.25 TB,' yes?

1024 megabytes = 1 gigabyte; 1024 gigabytes = 1 terabyte.

--
Vintage computer games and RPG books available. Email me if you're interested.
Re:They do exist! by lewp · 2004-07-26 16:03 · Score: 1

Little "b" is bit. Big "B" is byte. 2Tb is 2 Terabits.

A byte is 8 bits. 250GB == 2Tb.

Pegasus probably meant TB, and AKC probably knew that.

--
Game... blouses.
Re:They do exist! by SuiteSisterMary · 2004-07-27 02:21 · Score: 1

Aye. Unfortunately, as so many people don't know the difference, I tend to assume, when they're talking about hard drives, that they mean bytes, and when talking about network speeds, they mean bits.

--
Vintage computer games and RPG books available. Email me if you're interested.

It's the RAID, silly... by wolf31o2 · 2004-07-23 14:41 · Score: 1

In most cases, companies are purchasing large numbers of drives and adding them to RAID arrays. They don't want a single large drive, but rather several smaller drives in a redundant configuration. Also, more spindles is usually a good thing in a server environment, where multiple data sets are being requested from multiple hosts, rather than a few sets from a single host as in the typical IDE setting.

Re:It's the RAID, silly... by shaitand · 2004-07-23 17:49 · Score: 1

Some companies want a number of LARGER disk in an array.

I dont know... by Cyno01 · 2004-07-23 14:55 · Score: 1

But its probably the same reason we dont have large capacity/high speed SATA drives either.

--
"Sic Semper Tyrannosaurus Rex."

Thin client servers by Gary+Destruction · 2004-07-23 15:06 · Score: 1

I would think that SCSI has been shifted toward thin client servers. Gigabit Ethernet is fast as it stands, but extra speed calls for faster drives and faster disk access.

More Drives = bad reliability? by Xepo · 2004-07-23 15:29 · Score: 1

Overall, a valid 'Ask Slashdot'. But one thing bugs me. "More points of failure." I guess that's technically true, but I'd think having half of your data saved is better than having all of it lost -- I fail to see how having more drives is a bad thing when it comes to reliability.

...provides more points of failure... by strabo · 2004-07-23 15:32 · Score: 2, Funny

...provides more points of failure...

Yeah, that's a problem. It's much better to reduce potential points of failure... preferably down to a single point of failure.

Or is that not what you meant?

Re:...provides more points of failure... by shaitand · 2004-07-23 17:53 · Score: 1

It's much better to logically reduce to a single point of failure and then introduce redudantcy for that single point... oh wait, that's why you have multiple drives in a raid.

Too slow to be useful? by TheLink · 2004-07-23 15:47 · Score: 5, Informative

Drive speeds haven't really gone up tremendously. Still too slow.

Imagine you have a 1TB drive, but were stuck at a 100MB/sec max seq transfer rate. It takes you 2.7 hours to read/write the entire drive. And that's for _sequential_ access. Gets ugly for random seek.

A similar speed 10TB drive will take you more than a day (27+ hours) to read sequentially.

Before the point where it takes too long to read an entire single drive you might as well start using multiple drives to add capacity rather than having bigger drives.

Taking too long is subjective, but I'd say this: how long can you make your boss/customer wait whilst you are restoring an entire disk image from backup? 27 hours or 2.7 hours? or 25 minutes?

So 70GB would be about the limit if you have impatient users and bosses.

Larger capacities are OK if they are to hold data that aren't important enough to be backed up, and don't require masses of data to be available quickly. Or you are doing mirroring and read speeds are important but write speeds aren't as important (but remember that restoring from backup = writing ;) ).

--

Too many replies beneath your current threshold

Re:Too slow to be useful? by shaitand · 2004-07-23 17:56 · Score: 1

Just because you increase drive capacity, doesn't mean you increase drive utilization.

if you have 20gb of data on your 100gb drive, and then ghost that to a 1TB drive, your copying 20gb of data. If you then ghost that 1TB drive to a 10TB drive, your still only copying 20gb.
Re:Too slow to be useful? by Cecil · 2004-07-23 19:18 · Score: 2, Informative

You forget the important fact that as the drive DENSITY increases, so does the amount of data read per revolution of the platters. Bigger drive, faster transfer rate. Unless you're talking about limits on things like ATA, but those are being replaced and upgraded as needed.

--
Random and weird software I've written.
Re:Too slow to be useful? by TheLink · 2004-07-23 23:12 · Score: 2, Interesting

"You forget the important fact that as the drive DENSITY increases, so does the amount of data read per revolution of the platters"

The _evidence_ of actual transfer rates is more important that your "important fact".

This might be helpful. Select WB99 transfer rate - Begin.

If you have evidence of significantly faster single drives do let me know.
--
- Too many replies beneath your current threshold
Re:Too slow to be useful? by Anonymous Coward · 2004-07-24 15:35 · Score: 0

"Your" also an IDIOT who doesn't know how to spell.
Re:Too slow to be useful? by shaitand · 2004-07-25 00:37 · Score: 1

Odd I've never found a correlation between spelling/grammar and intelligence.

Unless you consider the stupidity of spelling/grammar trolls, who live in such shame they have to convince their pathetic little minds of superiority. They do this of course by pointing out the mistakes of others in the most anal fashion, usually where those mistakes matter least.

Speed. by alienw · 2004-07-23 16:44 · Score: 1

If you are actually going to use those drives in a server (guess where they are used, for the most part), you need them to be fast. An array with fewer larger drives is much slower than one with lots of small drives.

Re:Speed. by shaitand · 2004-07-23 17:59 · Score: 1

Only to a point. Although you can handle more simultaneous small requests faster with more drives, you can only transfer the data from those requests at 320mb/s max, and 3 scsi drives in raid 5 can provide that sustained.

And there are always those who wants lots of larger drives.
Re:Speed. by alienw · 2004-07-24 08:40 · Score: 1

Although you can handle more simultaneous small requests faster with more drives

That's exactly the point. What do you think limits the bandwidth of, say, a database?

you can only transfer the data from those requests at 320mb/s max

I am pretty sure fiber channel is faster than that. That's what fast arrays are hooked up with, anyway. These are generally independent boxes, with their own highly sophisticated and intelligent controller and a really fat pipe.

And there are always those who wants lots of larger drives.

If you have a database that takes up 500GB, you don't need 15 terabytes. You do need reliability and extremely high throughput and low latency. If the 500GB is all on one drive, you will be limited by the speed with which that drive can move its head. If it's spread across 30 smaller disks in a RAID configuration, your latency will be greatly reduced and the throughput will be greatly increased.
Re:Speed. by shaitand · 2004-07-25 00:24 · Score: 1

"I am pretty sure fiber channel is faster than that. That's what fast arrays are hooked up with, anyway. These are generally independent boxes, with their own highly sophisticated and intelligent controller and a really fat pipe."

That is a little different, but there your still maxed at 1gb/s tops, because that is the fastest network link your looking at. Since the drives can each put out 320mb/s and in that case would be doing so in parallel, you still can't signficantly improve throughput beyond 4 drives.

"If you have a database that takes up 500GB, you don't need 15 terabytes. You do need reliability and extremely high throughput and low latency. If the 500GB is all on one drive, you will be limited by the speed with which that drive can move its head. If it's spread across 30 smaller disks in a RAID configuration, your latency will be greatly reduced and the throughput will be greatly increased."

*nod* That would be exactly what I said, as I said, that would be the ideal case, a database or webserver. Maybe it would be more useful to give a scenerio where you WOULD want fewer large drives.

What if you don't have a database at all. What if instead you have 15TB of worth of 100mb-1gig video and audio files, what if those files are used by 20 people?

Lets say we're looking at 500gb drives, we'd be talking 30 of them. Lets compare that to 146gb drives, now your looking at 103ish drives, just your network access fibrechannel device that supports at least a gigabit fiber link to the network and full 320mb/s drives at full speed is going to cost you 10x as much if you want it to handle 100 drives and in this scenerio there would be no benefit to it at all. That's not even considering the cost of disks, this isn't ram, a disk of twice the capcity doesn't cost twice as much.

Nobody wants it by jgardn · 2004-07-23 17:30 · Score: 1

The bottom line is nobody wants it. Right now, data storage is so abundant that having twice or four times the amount of data per disk won't solve any problems. We've hit the stage in my line of work that long-term data storage is a non-issue. Suck as much data as you want and store it anywhere, we're never going to run out.

--
The radical sect of Islam would either see you dead or "reverted" to Islam.

Re:Nobody wants it by DAldredge · 2004-07-24 12:18 · Score: 1

And what type of work do you do that data storage isn't an issue?
Re:Nobody wants it by Anonymous Coward · 2004-07-24 15:39 · Score: 0

Cashier at McDonald's.

SCSI not meant for high capacity by tute666 · 2004-07-23 18:12 · Score: 1

The whole point of SCSI was high speeds, not high capacity. With the growing size of DB nowadays bigger drives with higher speeds are needes, but in a close future non volatile RAM will be much cheaper. Hell google already hosts itself on ram.

Re:SCSI not meant for high capacity by innosent · 2004-07-23 18:24 · Score: 1

Google doesn't host their data on RAM, just the server portions. When you do a query, they still have to check their (albeit advanced and highly parallel) database. Find me 3TB of NVRAM for less than $5,000, and you've got a sale. With simple DDR at about $120/GB, that will probably be difficult to do. Even if NVRAM were that cheap, that's still $360,000, which is a hell of a lot more than you want to spend to store your data.

--
--That's the point of being root, you can do anything you want, even if it's stupid.
Re:SCSI not meant for high capacity by afidel · 2004-07-26 02:28 · Score: 1

Actually, yes Google DOES hold their entire dataset in ram. The HDD's are just there to repopulate ram should they lose power. If IDE HDD's weren't basically free they would probably just net boot the boxes and read the data from central storage. Each Google box is responsible for a dataset that is exactly the size of its ram minus the minimal overhead for the OS. They get redundancy by holding the entire database in chunks each of which is held on several machines at each datacenter and each datacenter has an entire copy of the whole dataset. Hell AFAIk there IS no central copy of the database, just lots of automagically updated data chunks at multiple datacenters worldwide.

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.

Reliability by MrResistor · 2004-07-23 20:38 · Score: 3, Interesting

My company was offering 180GB SCSI drives in one of our RAID products, but we had to stop due to reliability issues. There was a huge difference in reliability between the 180GB and 146GB drives (which we still offer).

--
Under capitalism man exploits man. Under communism it's the other way around.

Re:Reliability by crow · 2004-07-24 02:46 · Score: 1

Also, the 180GB SCSI drives were half-height, while the 146GB SCSI drives are low profile, so you can fit more of them into the same space.

I haven't seen any of the roadmaps recently, but it has been a while since the 146GB drives came out, so it's probably time for a bump in the next 3-6 months.

I actually benchmarked that one day... by leonbrooks · 2004-07-23 23:03 · Score: 1

...and at the time (4 years ago?) with equivalent-speed SCSI and IDE drives and controllers, dd if=/dev/hdc | dd of=/dev/sda and vice versa chewed about 3x as much CPU horsepower to work on the IDE drive (same ration in either direction).

--
Got time? Spend some of it coding or testing

Battle of the formats...IDE vs SCSI ? by splungent · 2004-07-24 02:40 · Score: 1

Could it be the VCR vs beta max syndrome? Or something more sinister..... I am going to put on my afdb ( http://zapatopi.net/afdb.html ) and think about this....

--
./what?

They're in... by GypC · 2004-07-24 04:09 · Score: 1

... the computers of people with high-capacity wallets.

THE ANSWER just isn't right by jgoemat · 2004-07-24 05:07 · Score: 1

I could see if SCSI was just behind IDE, but I've seen 146 gb drives available for years. In the same time, IDE has gone from 160 gb to 300+ gb.

small drives good -- big drives cheap. by Stephen+Samuel · 2004-07-24 11:02 · Score: 1

The third is that controllers to handle an increased amount of drives are orders of magnitude more expensive and you can only have 15 devices on a scsi chain. With 146gb drives that gives you a max of 2.1TB on a chain, with 250gb drives that becomes 3.7TB on a chain, yeah it's only a terabyte, a terabyte is nothing right?

If you're only worried about how much data you can put on a chain, SCSI has a two-level addressing scheme. Each 'target' (usually a drive), an have up to 8 or 15 Drives on it... It's not used very often, but I've seen some SUN enclosures that used that addressing method. In any case, that's pushing you up into the 16 and 32 terabyte range for a single chain.

As you pointed out, however, the bottleneck that you run into at that point is performance. When you're looking at performance, smaller drives have two advantages:
* More spindles means lower average latency
* more IO drives means faster overall transfer.

Simple fact of the matter is that it can take way too damn long to dump the contents of a 300GB drive -- and during that time, the drive is tied up. Far better to split that data among 4 80GB drives where only one is tied up at a time and/or you get a higher overall transfer rate if you have them properly distrubuted among controllers, etc.

At 30megabytes/second, a 300gigabyte drive is going to take almost 3 hours to dump/fill. If you're running a raid-5 system with data worth millions of dollars, thats a long time to be exposed to the possibility of a second drive failure. (I once suffered a 2-drive raid failure -- trust me, it's not pretty.).

From a manufacturer's point of view, 300GB RAID isn't a good bet. For the consumer market you can get away with a 1 year warranty... Few enough pay attention to the small print that you could probably get away with a 90 day warranty for a lot of them.

On the other hand, the kinds of businesses that like buying SCSI drives pay attention to their warranties. They'll want a 5 year term, and they'll put your feet to the fire if the drives fail in that 5 year span.

Given the lower volume, the lower profit margin and the lower reliability, it's just not worth building a 300GB drive with a 5 year warranty on a SCSI controller. It's just asking to et your foot blown off.

--
Free Software: Like love, it grows best when given away.

The reason is speed. by Dirttorpedo · 2004-07-24 11:05 · Score: 3, Informative

I will try to avoid the SCSI vs IDE flame war.

1) RPM. It is easier to spin a 2.5" platter at 15K than a 3.5" platter. (someone else can figure out the addtional energy but I would guess more than double the juice adduming uniform density.)

2) IOs per second. In large arrays the driving factor is not necessaraly throughput but IOs per second. Which leads to more transactions per second for your server farm. So more spindles = more IOs per second.

3) Access time. The bigger the drive the longer it takes the drive's processor to position the head. Therefore increasing access times. decreasing IO per second. I now its a trivial amount of time but it adds up over millions of IO.

4) Error correction. I cannot speak for IDE but each block on a SCSI drive has an Error Correction Code (ECC) which helps the drive recover from read errors. Again minimal.

5) Cynical answer. Smaller drives means your drive company sells more product to meet a given capacity.

educational point. SCSI is a protocol like IP or TCP. It can be tunneled through or carried by anything.
SPI -SCSI Parralel interface (old school).
FCP - Fibre channel protocol
SAS - Serial attached SCSI. SAS can also tunnel SATA.
iSCSI - scsi in TCP. (not ethernet)
SBP - SCSI Block Protocol. firewire.
ATAPI - yep SCSI ove IDE so your CDROM works.
many others.

Cost and reliability by Fiz+Ocelot · 2004-07-24 11:16 · Score: 1

SCSI is generally more reliable, that's part of what you are paying for. But some people have figured out how to get great reliability with still lower costs. For example, Google is based upon inexpensive and easily replaceable hardware. They have so much and such a robust system that hardware failure is not a problem.

Best of both worlds by chaoskitty · 2004-07-25 08:23 · Score: 1

I have been using the best IDE drives I can get (usually, Maxtor with 3 year warranty / 8 meg cache / 7200 rpm) in my servers with Acard SCSI-IDE adapters. The latest adapters, such as the 7726Q, even support command queueing, and all of them support all of the other SCSI goodies like disconnection / reconnection and speeds up to 160 MB/sec. No more adding multiple PCI IDE controllers, no more cabling nightmares! They're expensive ($90 USD an adapter, usually), but worth it for applications which require good performance (like servers).

http://www.acard.com/eng/product/scside/aec-7726q. html

Even my Amiga has one; the motherboard IDE is pretty slow and can only support drives up to 128 gigs, but a 160 gig drive on the 40 MB/sec UW SCSI bus is so much faster!

Lots of reasons.... by Anonymous Coward · 2004-07-26 09:43 · Score: 0

The big OEM HDD customers don't require larger SCSI/FC capacities. There are many reasons:

-Less data per disk means faster RAID rebuild times and less data dedicated to a single disk.
-Big RAID configs already provide plenty of capacity.
-Reliability is enhanced with lower data densities. Desktop always leads the density curve.

Re:Lots of reasons.... by Anonymous Coward · 2004-07-26 16:23 · Score: 0

Screw that, those are all just lame excuses.

the fact is that SCSI is yesturdays tech, pretty much obsolete.

This does exist! by Anonymous Coward · 2004-07-26 17:10 · Score: 0

This is what IBM calls a "Single-level store", and is used in their AS/400 architecture. The RAM in the machine is essentially nothing more than a cache for the DASD (disk array). Everything exists as an object within the 2^64 bit address space, which is mapped to the disk.

For more info, just Google for Single-Level Storage. It seems that this type of system causes more problems than it solves.

aQazaQa

Capacity vs. Speed by Detritus · 2004-07-26 17:34 · Score: 2, Interesting

I've read of companies that bought a bunch of SCSI drives and then set them up to only use half their normal capacity, by throwing away half the cylinders. This reduced the average access time of the drives. I'm not sure if they reconfigured the drives in-house or if the manufacturer did it for them.

--
Mea navis aericumbens anguillis abundat

OT: Use Konquer! by mabhatter654 · 2004-07-26 19:03 · Score: 1

yet another reason to use linux! In the Konquer browser, the spellchecker works in all text boxes...very cool!

Maybe that's why the /. programmers haven't felt the need to add spell checking to the site...

It's called "short-stroking" by Gothmolly · 2004-07-27 05:08 · Score: 1

You only format the outer half of the disk, so you reap the higher DTR.

--
I want to delete my account but Slashdot doesn't allow it.

"Every now-and-then", or "all the time"? by tepples · 2004-07-27 11:38 · Score: 1

I'm talking about 64GB of RAM, and have the huge storage disk sync with the RAM every now-and-then. All of your OS and apps, and most of your recent documents will reside in RAM, the rest will shuffle off to the disk when it gets 'cold'.

You just saved your document. I just pulled the plug. You just broke your monitor.

I'm just clarifying that in order to satisfy Durability, such a system would have to write out dirty data (which you called "cold") within milliseconds after it's marked dirty, with applications blocking on syncs of their VM spaces. Your "every now-and-then" would explode to dozens or hundreds of times a second.

Re:"Every now-and-then", or "all the time"? by MarcQuadra · 2004-07-27 14:51 · Score: 1

well applications would all run in 'disposable cold' states, where their memory spaces wouldn't have to be disk-backed because they could be started again easily. Documents, however, would be 'white hot' as soon as you hit 'save' because there would be a priority on getting them to disk ASAP in case of a failure. This CAN be done, you just need something like metadata for the stack space, or an area for data that needs to be flushed quickly.

--
"Sometimes, I think Trent just needs a cup of hot chocolate and a blankie." -Tori Amos on Nine Inch Nails

Parallel vs. series by tepples · 2004-07-27 11:40 · Score: 1

Whether a single point of failure is better than multiple points of failure depends on whether the points of failure are in parallel (e.g. RAID 1) or in series (e.g. RAID 0).

Size and Speed by neuroscr · 2004-07-28 05:57 · Score: 1

Its because we want more drives. The more drives the fast the Disk IO. So as corporate storage size requirements increase so do demand to access to that size. One extra drive means one extra set of reads per second in most raid arrays. Another reason SCSI has had 10k and 15k longer than IDE too.

Not all about SCSI.... by chbrules · 2004-07-28 22:02 · Score: 1

Well I'm not the guru on enterprise level buisness, but I think SCSI was intended for middle-ground? You're talking about Fibre, FICON, and ESCON up at enterprise. I've actually never got to mess with any SCSI devices in my short (4 years) as a pc guy. I've seen them, touched them, read up about them, but I've never got to impliment one or an array of them. Sounds like fun! ^^

--
-Conrad

Slashdot Mirror

Where are the High-Capacity SCSI Drives?

138 comments