Google Proposes New Hard Drive Format For Data Centers (thestack.com)
An anonymous reader writes: In a new research paper the VP of Infrastructure at Google argues for hard drive manufacturers and data center provisioners to consider revisions to the current 3.5" form-factor in favour of taller, multi-platter form factors — with the possibility of combining the new format with HDDs of smaller circumference which hold less data but have better seek times. Eric Brewer, also a professor at UC Berkeley, writes "The current 3.5" HDD geometry was adopted for historic reasons – its size inherited from the PC floppy disk. An alternative form factor should yield a better TCO overall. Changing the form factor is a long term process that requires a broad discussion, but we believe it should be considered."
Also, I thought the world was going SSD anyway, which is thinner, not thicker?
I just wonder if, by the time they agree on this (if they do) the price of SSDs will have dropped enough so that they can be used instead? Storage-wise they are already there, and then some.
But they need to convince someone that there is an economic case for making the fucking thing
I find 2.5" preferable for desktops/laptops. But for server farms — I doubt it matters at all what size is practical for PCs.
Multi-platter was always a good idea, I assume it stopped in a desperate attempt to cut costs.
8" hard drives often had 4 or even 8 double sided platters - and SCSI interfaces! Early 5.25" drives often had two, double sided platters. They desperately needed to access more data with less head movement because they had quite low areal bit density and used floppy-derived stepper motors for head positioning!
Sent from my ASR33 using ASCII
Bring back these removable 10 inch disk packs - 1970s tech
http://www.nf6x.net/2014/03/da...
Multiple heads on each side of the platter might be a better solution, one for the inner part and one for the outer.
I have a feeling that in a few years we'll be left with just expensive SSDs and even more expensive "datacenter" drives.
I apologize for the lack of a signature.
There are other form factors other than the typical low profile 3.5".
In particular there is the "half-size" thickness, witch is the thickness of 5.25" bays. It was a rather common form factor for 3.5" SCSI drives.
"In a new research paper the VP of Infrastructure at Google argues for hard drive manufacturers and data center provisioners to consider revisions to the current 3.5" form-factor in favour of taller, multi-platter form factors"
Google needs to fire this idiot. First off, we moved away from the larger drives BECAUSE THEY REQUIRE MORE POWER TO OPERATE AND GENERATE FUCKTONS OF HEAT. Secondly, SSDs are at and beyond the capabilities of spinning rust under a layer of vapor-deposited platinum. Third, SSDs are much smaller - even the typical 2.5" laptop drives are mostly empty space. Fourth, the energy required to manufacture an SSD is about 25% of what is required to make a comparable HDD.
Can someone tell me what 'University' this 'VP of Infrastructure' attended so I know not to send my children to such a moronic 'Institution of Education?'
Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
Realistically, there should be several size formats for different purposes and market segments. Google wants a new format for their purposes. That doesn't mean that it would necessarily be good for the needs of someone else.
I personally wouldn't mind seeing the return of the 5" HDD.
Given the more than doubling of area (capacity) in each platter by going from 3.5" to 5", I could live with higher seek times to have a 16TB HDD taking up one of the 5" bays in my PC case.
Surprised they haven't just gone with 2X or 4X height 2.5" drives. Same connectors, same platters, easy retrofit. You just need a different bracket.
apk has been saying this for years!!! Turns out this guy is brilliant, not a lunatic. -apk
Hey, Eric? Google needs to fire you. You're too stupid to be any visionary of the future.
First, spinning rust drives eat fucktons of power and generate fucktons of heat. This is a NO-GO for any datacenter wanting LEED or any other 'green' certification. Second, SSDs meet and beat platter drives in every way conceivable. Third, the cost of manufacturing an SSD is about 25% that of manufacturing a platter HDD. Fourth, even as drive sizes shrink, drive capacities are still increasing.
Someone please tell me where Eric Brewer attended university, so I may not send my children there to receive such a poor education.
Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
We dont have any 3.5" disks in our data center, all smaller SAS disks.
Are we still talking about hard drives and seek time? In 1999 Jim Gray said "These calculations suggest the simple rule: Cache web pages if there is any chance they will be re-referenced within their lifetime." Rules of Thumb in Data Engineering
Taller, more heads, smaller platter, less seek distance -- the logical end point is the drum! I'm sure we can do better than the FH-1782 today.
Everything old is new again...
We are not in nineteen-sixties any more. Instead of rw head on a moving hand, we should have full random-access rw disk placed above the platter. We have wafers, we have photolitography in tens of nanometres of resolution, so why are we pretending that we need to do things like they were when disks were concieved? It is not like anyone is going to remove platters from the housing, so that heads need to retract out of the way. We can have a multitude of rw "faces" per each track, eliminating not only the seek time, but also shortening latency till sought sector begins (or conversely, slowing down angular speed for longer lifetime). Besides, there is possibility of not only random, but also non-blocking parallel access!
The more platters they put in the more expensive the hard drive is going to get. These things don't exactly have perfect yields, much like a CPU in which not every core on every processor necessarily makes it through manufacturing in a state that is usable, not every platter makes it through manufacturing. Now increase the number of perfect platters you need and all you are doing is driving up the price of one of the most expensive parts. Far better off going with SSD drives since the quantity will help increase production of the drives and bring down the price for everyone. They have already made it clear that storage density is not their concern but seek times are. Typical VP that doesn't understand how things are made just saying: "put in more of them, that'll make it better!"
The current 3.5" HDD geometry was adopted for historic reasons --- its size inherited from the PC floppy disk.
The form factor of 3.5" floppy drives was decided during the early planning stage of the Great Data Railroad. You can place exactly 16 3.54" (90mm) bare floppy discs side by side within the standard railroad gauge of 4 feet 8.5 inches. For the original 1982 HP single sided format of ~280kB this yields roughly ~4.3mB along every 3.5" of railroad track, or 137 rows along the floor of a a standard 40-foot railroad boxcar without the use of stacking. Thus ~600MB was the capacity of a original single density data railroad car, though it was only only ~1mm in height.
While the floppy disc made data railroads possible, media stacking made them practical. A cylinder of bare floppy media ~10 feet high is roughly 3048 discs, so your standard railroad boxcar held ~1.8TB of floppy storage, in 1982! With an average rail speed of 18mph a single boxcar passes every ~1.5 seconds, which is ~1.2T terabytes or 9200 gigabits per second! By 1998 floppy media storage density had improved ~714-fold, yielding transfer rates of 6568800Gb/s or ~821 TB/s.
So why was floppy data railroad ultimately limited to this 'arbitrary' ~821 TB/s? Northern rail gauge of the US railway based on the English rail system which were based on tramways which used the same jigs used to build wagons whose wheel base was determined by ancient ruts that were left by Roman chariots which were sized to accommodate the width of two horses' asses. As not-quite debunked here.
So the short story is, any chain of decisions regarding technology leads back to some horse's ass.
<blink>down the rabbit hole</blink>
The research paper is not available. Any pointers ?
something like this?
http://www.101101.io/storageen...
I realize this actual device had terrible data density by today's standards but I'm sure they could do something similar with modern platters.
I always believed that the Oval (tm) format had the best reliability and bit density.
CAP === 'excels'
Most of the data within these companies is "cool" meaning it's not actively being accessed. Take for example the massive amount of photos within FB. When was the last time you looked at a photo from 6months ago? If it needs to be accessed frequently, as in it becomes viral, then you move the photo from HDD to SSD. Sure there's 3-4TB SSD coming, but they're still much more expensive $/GB than HDD.
Also, Google's point isn't so much about $/GB but rather that they don't need as much reliability of the drive. Why buy enterprise grade drives when you're making 3+ copies of the data? Consumer grade drives can work for this. If your operational processes are smooth enough, the physical replacement process becomes trivial. Sure there's a lot of them to replace, but it's like replacing books on shelves. So if the cost of a drive can go down even further because it's 99.5 instead of 99.9, even better.
I would think that adding additional platters would greatly lower the mean time before failure on the drives?
The disk's spindle motor and actuator are shared across platters, but the media and read/write heads are per-platter. With many small platters your seek times would go down, but the odds of a head crash or media failure would be greatly increased.
Multi r/w heads aren't a new concept. Some of the really old drives had them, and in fact the very original magnetic recording "disks" had a r/w head per track. I think in the trade off of more heads versus faster spinning, faster spinning won out.
I seems that there should be a market for more platters, in a slightly different form factor.
If my maths is correct, eight 2.5" hard drives will fit into the 5.25" full height form factor.
According to Wikipedia, the hypothetical 5.25"FH hard drive can have 4TB x 8 = 32TB capacity. Maybe even 36TB as extra platters could be squeezed in.
Because it is internally 2.5" platters, it should have very quick seek times.
I guess the problem is the MTBF will drop from 1.2 million hours to only 150,000 hours.
Any new solution would have to maintain backwards compatibility. The new standard would have to be ether 3.5" x 2, 3, or 4 bays; or 5.25" x 1, 2, 3, or 4 bays. The industry has 30 years behind existing bay standards, it would take them a long time to change their tooling.
Personally I thought the Sun Fire X4500 (a/k/a thumper) was a very efficient way to maximize storage density
they're using the cheapest consumer disks they can find, but now ask for a form-factor tailored for data centers. This does not compute. Why should such data center for factor disks get such consumer use that they can be had for the same low price?
Of course, consumers love everything that's called industrial quality, so maybe it does work out...
So Google wants higher performance conventional hard drives? This should have been proposed around 1995. In today's world this has no chance.
You want performance? Get high performance 15,000 RPM drives from Hitachi or Fujitsu or IBM, or whoever makes them now. Just know that all really high performance drives are going SSD.
Current drives are made to optimize a combination of price, performance, reliability, and the standardized drive bays. Google attempts to revise the priorities so that performance ranks higher in the optimization criteria.
The thing is, does anyone care what Google wants? Are they big enough to get custom runs of HDDs made? Maybe, but I doubt it. Seriously, hard drives are a mature technology. And if performance is really the issue you need to abandon mechanical storage and go all-electronic.
I assign this issue to the That Ship Sailed Long Ago category.
This is not bounded by reality, but just some back of the napkin types stuff.
Let's say you have a 3" platter w/ 1TB capacity. And you can get up to 7 in a 1-inch high 3.5" drive.
That's 7TB.
The spindle is about 1" in diameter, but from looking at the IBM microdrive, it may be possible to reduce that to 0.33"
Next let's shrink the platter to 0.75". Because we're talking single speed, the amount of data is proportional to r (instead of r^2). So it's 0.42/2.5 = 0.168 TB.
The drive is 5.75" deep, Assuming 1.75" for stuff, that leaves 4", assuming there wasn't extra space in the first part, we can fit in 28 platters for a 4.7 TB capacity.
Because the platters are smaller, the access times should be faster. It may even be possible to stick another spindle into the system (more than gaining back lost capacity), and maybe more platters in a stack for higher capacity.
Practical issues: power, heat, complexity. MTBF is probably lower. Non-failure error rates may be higher. Heat and power may be higher. The 0.75" platters may be impractical because of physical reasons. There may be some fundamental problem with having a 4-inch high 0.33" diameter spindle.
He effected a bored affect.
OK lets start thinking about what this means. So if you start making more platters and more heads that raises the cost. It also increases the chance of a head crash. More and more platters and more and more head eventually lead to a platter that doesn't even have to turn and just heads that right and read to a stationary platter.
Hence you essentially have a solid state disk. Why should they spend money trying to improve a dying technology. Solid state disks will eventually take over there too. They should be looking at different forms of solid state disks.
Now they are just going to make it taller and when it fails, will they just dump the whole "tower" of platters? Or are they going to fix it (expensive)? Seems like a method for maximizing waste. This would have made sense if the platters were bigger in diameter, making the pack more reliable as well.
More platters yields more heads. More components to fail. This will increase failure rates for these drives at a given capacity over a similar capacity 3.5" drive with a lower platter count.
Spinning media is still hard to beat on price. Desktop 7200 RPM drives are at $.03/GB. "Enterprise" 7200 RPM SATA at volume is between $.03/GB and $0.05GB. Cheap SSD is around $0.60/GB to $1.20/GB.
A lot of data is still cold. At volume, this price difference matters a lot.
I don't understand the logic of sacrificing storage capacity for seek time. In which case, you merely end up with an incompetent SSD, an defeat the whole purpose of having a HHD in the first place.
Wouldn't it make more sense to leverage the whole advantage of a HHD and go strictly for capacity, and use more intelligent caching or more hybrid technology to reduce seek time? You can already fit a lot of platters into the 3.5" format, and stuffing more hardware into a single enclosure will probably result in too many other trade-offs to be worth it, like reduced reliability. That's especially true if the new drives are made in such small quantities that economy of scale makes the 3.5" format a better choice in the end, anyway.
Seems like another attempt to force a new format on the market, whether it's needed or not.
Storage drum systems had many heads, arranged in a spiral around the drum so there was time at the end of a "ring" to select the next head. (Apparently nobody thought of making a straight line of heads and spiraling the data.) One of the later models of IBM multi-platter disk drives had 2 sets of head arms. All of these are mechanically complex, which is part of the reason for RAID (Redundant Array of Independent Disks) (redundancy of course being the other). Instead of trying to make one disk drive bigger and/or put more heads on it, make the disk SYSTEM bigger with modular increments of the then-current technology, and get more heads by spreading them across the modules.
There is a tradeoff between system complexity and speed. Highest performance has been attained on specialized systems with intelligent distribution of indices and data, particularly as differentiated by activity, across different levels of storage media; but there is benefit in maintaining a simple generic model of storage, particularly across replicated networked storage systems, so that multi-server and/or back systems need not be identical. One can envision managing active data in many gigabytes of RAM, backing it to SSD, further demoting it to HD as activity decreases; but this involves a lot of system complexity
Remember, though, that the 3330 10-platter drive held a whopping 200 MB, and a controller managing five of them could get all the way to 1 GB - when today I have a 64 GB microSD card in my camera that weighs nothing and runs at 50 MB/sec.
a proposal for something that decreases the reliability of mechanical disks even more. I don't want higher bit rates to deal with in addition to the stream of other unpredictable failure modes associated with these things.