Slashdot Mirror


Google Proposes New Hard Drive Format For Data Centers (thestack.com)

An anonymous reader writes: In a new research paper the VP of Infrastructure at Google argues for hard drive manufacturers and data center provisioners to consider revisions to the current 3.5" form-factor in favour of taller, multi-platter form factors — with the possibility of combining the new format with HDDs of smaller circumference which hold less data but have better seek times. Eric Brewer, also a professor at UC Berkeley, writes "The current 3.5" HDD geometry was adopted for historic reasons – its size inherited from the PC floppy disk. An alternative form factor should yield a better TCO overall. Changing the form factor is a long term process that requires a broad discussion, but we believe it should be considered."

35 of 202 comments (clear)

  1. Form Factor not "Format" by michaelmalak · · Score: 4, Insightful

    Also, I thought the world was going SSD anyway, which is thinner, not thicker?

    1. Re:Form Factor not "Format" by drinkypoo · · Score: 5, Interesting

      The world will probably keep using spinning rust until purchase price (not TCO) on SSDs is lower. I wouldn't be surprised if makers went back to 5.25 x half height, and low spindle speeds. It would still permit large throughput with high density, but seeks would be slower. Not a big deal with enough caching in front of them, and/or with enough disks in an array. As SSDs approach HDD price, they will take up more of the workloads that actually have to be fast anyway.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    2. Re:Form Factor not "Format" by p4ul13 · · Score: 4, Insightful

      SSD is the heir apparent, but platter based disk storage will likely provide higher capacity at denser, more affordable prices for quite some time to come. I suspect Google is proposing this altered platter HD design as something that could bridge the gap until SSD reaches an affordability / density point that can catch up / replace conventional platter HD designs.

      --
      Paul Lenhart writes words!
    3. Re:Form Factor not "Format" by gman003 · · Score: 4, Insightful

      That's odd, because my laptop's SSD is four years old and still has plenty of usable life left - and it's from a middle-line vendor, from the early SATA3 days, so it's not even a particularly good SSD. The hard drive in the same laptop (dual-bay) is actually reporting as closer to failure. Maybe that's because it's a laptop, so it suffers more vibration and temperature variation, which is harder on hard drives than solid-state.

      And the rest of your bitching seems to be based more on shoddy cloud hosts than SSDs, or on badly-configured servers. "SSDs are too fast, they bring down the entire system by filling up RAM"... wouldn't that be true of hard drives as well, IF they could transfer data that quickly?

    4. Re:Form Factor not "Format" by NeoMorphy · · Score: 2

      SSDs have been in data centers for years now. They are more reliable than hdd. That wasn't always true, but it is now. You don't use them for "low write/high-bandwidth", you use them for high iops/random access like database indexes, or the entire database if you can cost justify it. I have never seen a linux server lockup when copying files to/from and we have thousands of linux servers, you might need to tune something if that is happening. We have multi-tiered storage with 7200/10000/flash drives. The downsides to ssd are cost/data density, otherwise hdd would be dead, the noise/power/heat/performance difference is huge.

      Virtualization is awesome! Servers have been increasing in performance every year while server applications resource usage grows at a slower rate. It's ridiculous to buy a server for a webserver application and another for network filesystems and printers etc. You also have to manage heat dissipation and worry about power/network/fc cables to a bazillion servers. With virtualization you can downsize your server farm(and the network/power/fc cables) 20X+ and save on hardware/cooling/power/management. If a virtual server needs more resources you can vmotion it to another server with more available resources, you can even have the process automated. With san storage you can carve out servers anytime you want. It used to suck when a server died because of hardware failure and you were down until it was fixed. Now you can bring it up in another frame!

    5. Re:Form Factor not "Format" by lgw · · Score: 2

      Old SSDs died quickly under DB loads - not enough write count in their lifetime. New ones are better, but still won't last as long as HDD. This is only going to improve over time, though, and at the right price, who cares about 2 vs 4 year lifetime?

      The HDDs you have to use for high IOPS DB load are dammed expensive in the first place: it's the last domain of big-box storage (think 10x consumer drive cost, 100x with fancy replication software built-in).

      Google doesn't use that big-box crap of course, but I'm baffled why they want a faster HDD standard that won't come for years - in a few years, will anyone still care about HDD speed?

      --
      Socialism: a lie told by totalitarians and believed by fools.
    6. Re:Form Factor not "Format" by swb · · Score: 2

      I think you're right, but I think it only counts/matters for organizations operating at the spreadsheet analysis scale where the potential savings are really only realized across many thousands of disks in extremely customized environments.

      It also wouldn't surprise me if this was also being floated by Google to induce hard disk makers to leverage their existing manufacturing base to mass produce something that really only a very small number of customers are likely to have in any interest in. Hard disks are probably the one component Google can't just design and have made for themselves, unlike the rest of their rack hardware.

      I also wonder if the business math on Google's distributed storage system doesn't make as much sense with SSDs versus hard disks and they worry that their model is going to be upset by HDDs potentially getting more expensive as everyone else transitions to SSDs. It doesn't seem too outrageous to think that in 10 years that a lot of the manufacturing capacity of HDD makers will be economically nonviable as demand shrinks. They won't disappear, but as demand shrinks and manufacturing follows, price might go up.

    7. Re:Form Factor not "Format" by nanoflower · · Score: 2

      I agree with you that this only makes sense for very large customers of hard drives. If Google really thinks this is a good idea they should approach one of the vendors with a long term commitment to buy the drives or a large investment for them to develop the drives. That's really the only way I see a Seagate or WD spending their time/money to develop a product that has no retail purpose and may have no commercial customers. Let the HD companies know it won't be a wasted investment and they'll come up with something that will help Google cut down on space/power usage for their HDs.

    8. Re:Form Factor not "Format" by LWATCDR · · Score: 2

      This is more enterprise than consumer. File systems like ZFS can use SSDs as a cache for spinning platters. On a modern server you may have a system that uses RAM as a traditional disk cache followed by an SSD or array of SSDs as a second cache layer, and then disks as the mass storage.
      It can even be pretty smart and using and ageing system to move files in and out of the SSD cache based on when they were used last and you could even tag some files to always be in the SSD cache and others to never be in the SSD cache.
      My question is simple. Will the benefits of these tall multi-platter drives be worth the cost? Sticking with the tradition drive sizes gives you economy of scale.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    9. Re:Form Factor not "Format" by Anonymous Coward · · Score: 3, Interesting

      Old SSDs died quickly under DB loads - not enough write count in their lifetime. New ones are better, but still won't last as long as HDD.

      Let's take a decent 15k, even ignoring seek times the rotational latency limits us to about 500 IOPS.
      Saturating that with 4k writes 24/7 for a year... about 63.12 TB written.
      What's the write rating on a 200GB intel DC P3700? 3737.6TB.
      Do you honestly expect that HD to survive for nearly 60 years?

    10. Re:Form Factor not "Format" by Dutch+Gun · · Score: 3, Interesting

      You could very well be right. Speaking of oddball heights, the first 500 *MB* drive I bought (back when the main network drive was 120MB) cost $1000, and it was actually a 3 1/2" double-height size, meaning the bay next to it had to be clear before I could install it. It wasn't a problem since I was simply installing it in a workstation. This obviously wouldn't work for Google, since I'm certain they use computers with front-mounted hot-swappable 3 1/2" drive bays all neatly packed together - I've seen how nicely these work with my Synology 5-bay NAS. Unless a new form factor becomes standardized, you can't really hack in a solution... at least not on the scale Google is dealing with.

      I don't think Google is going to get its way here with a new standardized size, at least at mass adoption scales. Inertia is pretty damn hard to overcome, even if potentially superior solutions exist. I mean, the US is still using imperial measurements, for heaven's sake. The fact that we measure them as 3 1/2" inch drives should tell you something about how hard it is to change standards.

      --
      Irony: Agile development has too much intertia to be abandoned now.
    11. Re: Form Factor not "Format" by U2xhc2hkb3QgU3Vja3M · · Score: 2

      The data became blurry?

    12. Re:Form Factor not "Format" by kimvette · · Score: 2

      Funny.. I use an ssd for vms and swap and although it's thrashed heavily when I fire up all the VMs at once I've lost no capacity and no errors have been reported. Reliability has been so solid I'm thinking of replacing the spindles in my server with SSDs in a RAID5 or RAID6 array.

      There are occasional lemon SSD model runs... but that's true of all hard drive manufacturers as well.

      Furthermore empirical evidence arising from analyses from quite a few data centers indicate that SSDs are more durable than hard drives at this point. Continued avoidance of the latest-generation SSDs is based on superstition, not evidence.

      --
      The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
    13. Re:Form Factor not "Format" by Coren22 · · Score: 2

      So, if it isn't Iron anymore, what are the magnetic domains made from?

      --
      APK likes to ask for responses to the same things over and over. Maybe he just likes the responses?
    14. Re:Form Factor not "Format" by OrangeTide · · Score: 2

      Rust is iron oxide, which is chemically different than metallic iron. And cobalt is one of many ferromagnetic materials besides iron.

      --
      “Common sense is not so common.” — Voltaire
    15. Re:Form Factor not "Format" by drinkypoo · · Score: 2

      Um, it hasn't been "rust" in decades. That's as condescending as saying SSD's are lumps of sand...

      Poetic license. Besides, spinning is the really relevant part.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  2. Go for it! Bring back full height 5 1/4" drives by Anne+Thwacks · · Score: 2
    but keep the compatible mounting holes!

    Multi-platter was always a good idea, I assume it stopped in a desperate attempt to cut costs.

    8" hard drives often had 4 or even 8 double sided platters - and SCSI interfaces! Early 5.25" drives often had two, double sided platters. They desperately needed to access more data with less head movement because they had quite low areal bit density and used floppy-derived stepper motors for head positioning!

    --
    Sent from my ASR33 using ASCII
    1. Re:Go for it! Bring back full height 5 1/4" drives by mrchaotica · · Score: 4, Interesting

      Multi-platter was always a good idea, I assume it stopped in a desperate attempt to cut costs.

      Wait, what? Last time I opened up a dead 3.5" hard drive (which was only a few years ago) it had either three or four platters. Are you saying they typically only have one now?

      But yes, I agree that if they want taller drives, 5 1/4" full height would be a good form factor. Maybe even not with 5" platters! If they want quicker speeds, they could maybe put four separate spindles of the platters from 2.5" drives inside the same box.

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    2. Re:Go for it! Bring back full height 5 1/4" drives by jeffb+(2.718) · · Score: 3, Interesting

      It sounds like you think that manufacturers have stopped making multi-platter drives. That's not true. Seagate and WD both use seven platters in their highest-capacity (10TB, standard-height) drives. The linked article further states that they use seven platters "instead of the usual six".

      I don't know how prevalent single-platter drives are today, but multi-platter drives certainly haven't disappeared.

  3. Re:Too late? by JoeMerchant · · Score: 5, Interesting

    I just wonder if, by the time they agree on this (if they do) the price of SSDs will have dropped enough so that they can be used instead? Storage-wise they are already there, and then some.

    The point is to keep spinning platters cost-competitive with SSDs - a taller, smaller form factor would increase performance and reduce TCO... I'm thinking they're looking at something like lots of 1.8" platters stacked 4" high, they can spin faster, have faster seek times, and package multiple TB per unit, and I think the longer single bearing should be a more favorable geometry than the ultra-thin notebook compatible drives that have been developed for the last 10 years. It will be slower than SSD, but the power performance (which is the key to TCO) should remain competitive with SSDs for a long time to come. Also, presumably, if this takes off it would be datacenter focused, so longevity (again, TCO focus) should also be "baked into" the design in favor of lower retail price.

  4. Multiple heads by chriswaco · · Score: 3, Interesting

    Multiple heads on each side of the platter might be a better solution, one for the inner part and one for the outer.

    1. Re:Multiple heads by Andrew+Lindh · · Score: 4, Informative

      This has been done before.... Both outside/middle dual heads and dual independent actuators on each side. Multi heads can increase performance, but cost space, power, and money. Also more parts = lower MTBF. They don't increase storage density. If you want performance use SSD.

      http://www.tomshardware.com/ne...

    2. Re:Multiple heads by nojayuk · · Score: 4, Informative

      There were SCSI drives with four head actuators, one in each corner of the drive casing. They were treated as four separate drives logically and used to speed up reads on a "first to deliver the requested block" basis. They were horrendously expensive and it turned out to be very difficult to optimise the read process to gain the desired perfomance boost.

  5. Why no use existing form factors? by GuB-42 · · Score: 2

    There are other form factors other than the typical low profile 3.5".
    In particular there is the "half-size" thickness, witch is the thickness of 5.25" bays. It was a rather common form factor for 3.5" SCSI drives.

  6. 2.5" 4X drives by wren337 · · Score: 4, Insightful

    Surprised they haven't just gone with 2X or 4X height 2.5" drives. Same connectors, same platters, easy retrofit. You just need a different bracket.

    1. Re:2.5" 4X drives by 0100010001010011 · · Score: 2, Interesting

      I'm surprised that they haven't just done away with the 'hard drive' as is. SSDs are just a bunch of chips. I'm thinking of a 1U server that is just a board populated with chips, a fiber interface and a powersupply. Treat the 1U server as a single unit.

      When you start to add up hard drive casing, interface connectors, etc you end up wasting a lot of space for no reason. For the home user that only has 1-2 drives they make sense but for someone like Google that may have thousands of drives just jump up to the next standard unit and make that the 'storage device'.

  7. Re:Eric Brewer = Moron by yodleboy · · Score: 3, Insightful

    Could be that Google has some inside information that leads them to believe that prices on SSD will not be dropping to acceptable levels any time soon, despite what SSD boosters would have us all believe. If they are proposing something like this, they must have some inkling that spinning platters have a great deal of life left.

  8. Drums! by kschendel · · Score: 2

    Taller, more heads, smaller platter, less seek distance -- the logical end point is the drum! I'm sure we can do better than the FH-1782 today.

    Everything old is new again...

    1. Re:Drums! by vtcodger · · Score: 2

      From a programmer's POV, drums were wonderful. Select an address, then read or write. No cylinder/head/sector calculations. No variable transfer rates. You needed better "seek" time, install multiple sets of read/write heads. Unfortunately, they were bulky and cost a LOT.

      --
      You can't see ANYTHING from a car, You've got to get out of the goddamned contraption and walk...Edward Abbey
  9. Re:Eric Brewer = Moron by jofas · · Score: 2

    +1 for repeating yourself almost exactly. +5 For not being correct. There is more involved in managing data center performance than simple access times or temps. Even with multi-ton cooling, a data center is *still* looking at a lower operating cost to spin hot drives than to use super $$$ high-capacity SSDs. SSDs do not solve any problems for data centers, because individual drive access time is not interesting to a data center. Rather, the performance in $ per KIOPS for the entire array is the real measure of performance.

  10. Re:Eric Brewer = Moron by darthsilun · · Score: 2

    the cost of manufacturing an SSD is about 25% that of manufacturing a platter HDD

    Really? I think if that were anywhere near true it would be reflected in the cost of SSDs. Do tell, where can I buy a 4TB SSD for $30!

    The disk drive market is pretty competitive. I tend to think if SSDs cost 25% of an HDD to make, they'd be selling for a lot less than they are. And with Google's buying power probably even less for them.

  11. Horse sense by TheRealHocusLocus · · Score: 4, Insightful

    The current 3.5" HDD geometry was adopted for historic reasons --- its size inherited from the PC floppy disk.

    The form factor of 3.5" floppy drives was decided during the early planning stage of the Great Data Railroad. You can place exactly 16 3.54" (90mm) bare floppy discs side by side within the standard railroad gauge of 4 feet 8.5 inches. For the original 1982 HP single sided format of ~280kB this yields roughly ~4.3mB along every 3.5" of railroad track, or 137 rows along the floor of a a standard 40-foot railroad boxcar without the use of stacking. Thus ~600MB was the capacity of a original single density data railroad car, though it was only only ~1mm in height.

    While the floppy disc made data railroads possible, media stacking made them practical. A cylinder of bare floppy media ~10 feet high is roughly 3048 discs, so your standard railroad boxcar held ~1.8TB of floppy storage, in 1982! With an average rail speed of 18mph a single boxcar passes every ~1.5 seconds, which is ~1.2T terabytes or 9200 gigabits per second! By 1998 floppy media storage density had improved ~714-fold, yielding transfer rates of 6568800Gb/s or ~821 TB/s.

    So why was floppy data railroad ultimately limited to this 'arbitrary' ~821 TB/s? Northern rail gauge of the US railway based on the English rail system which were based on tramways which used the same jigs used to build wagons whose wheel base was determined by ancient ruts that were left by Roman chariots which were sized to accommodate the width of two horses' asses. As not-quite debunked here.

    So the short story is, any chain of decisions regarding technology leads back to some horse's ass.

    --
    <blink>down the rabbit hole</blink>
  12. file not found by Spaham · · Score: 2

    The research paper is not available. Any pointers ?

  13. What's old is new... by klubar · · Score: 2

    Multi r/w heads aren't a new concept. Some of the really old drives had them, and in fact the very original magnetic recording "disks" had a r/w head per track. I think in the trade off of more heads versus faster spinning, faster spinning won out.

    I seems that there should be a market for more platters, in a slightly different form factor.

  14. Re:barking up the wrong tree by DidgetMaster · · Score: 2

    Someone needs to invent a RW 'bar' that is long enough to go from the spindle to the outmost track. Any point on the bar would be capable of reading or writing. Instead of moving a narrow head on an actuator arm, you just 'activate' (electronically) the part of the bar that is currently over the track you want to access. If cheap enough, you could install a dozen of these bars all around the disk so that you would never need to rotate more than 30 degrees of a circle before a sector you want is under one of the bars. Rotational latency would be very low even on a 2000 rpm drive. Even better would be the ability to activate multiple points on the bar simultaneously so that you could read or write multiple tracks at the same time.