Slashdot Mirror


SSDs: The New King of the Data Center?

Nerval's Lobster writes "Flash storage is more common on mobile devices than data-center hardware, but that could soon change. The industry has seen increasing sales of solid-state drives (SSDs) as a replacement for traditional hard drives, according to IHS iSuppli Research. Nearly all of these have been sold for ultrabooks, laptops and other mobile devices that can benefit from a combination of low energy use and high-powered performance. Despite that, businesses have lagged the consumer market in adoption of SSDs, largely due to the format's comparatively small size, high cost and the concerns of datacenter managers about long-term stability and comparatively high failure rates. But that's changing quickly, according to market researchers IDC and Gartner: Datacenter- and enterprise-storage managers are buying SSDs in greater numbers for both server-attached storage and mainstream storage infrastructure, according to studies both research firms published in April. That doesn't mean SSDs will oust hard drives and replace them directly in existing systems, but it does raise a question: are SSDs mature enough (and cheap enough) to support business-sized workloads? Or are they still best suited for laptops and mobile devices?"

30 of 172 comments (clear)

  1. Great for some apps (see netflix blog) by ron_ivi · · Score: 5, Interesting
    This blog article's very relevant: http://techblog.netflix.com/2012/07/benchmarking-high-performance-io-with.html

    TL/DR: "The relative cost of the two configurations shows that over-all there are cost savings using the SSD instances"

    at least for their use-case (Cassandra).

    At work we also use SSDs for a couple terabyte Lucene index with great success (and far cheaper than getting a couple TB of DRAM spread across the servers instead)

    1. Re:Great for some apps (see netflix blog) by wvmarle · · Score: 4, Interesting

      So you're replacing RAM with SSD, not HD with SSD. Interesting.

      And would you even be able to do this with DRAM modules? Normal PC motherboards don't support that.

    2. Re:Great for some apps (see netflix blog) by SQL+Error · · Score: 5, Interesting

      You can build a 48-core Opteron server with 512GB of RAM for under $8000. Going over 512GB in a single server gets a lot more expensive (you either need expensive high-density modules or expensive 8-socket servers - or both) but if you can run some sort of cluster that's not a problem.

    3. Re:Great for some apps (see netflix blog) by ron_ivi · · Score: 5, Informative

      How does that make sense.

      As the link to Netflix pointed out -- they benchmarked the entire system with the same REST API in front.

      They configured one cluster of SSD-based servers; which another cluster of spinning-disk-with-large-RAM-based servers. It took a cluster of 15 SSD-backed servers to match the throughput of 84 RAM+Spinning servers. With throughput matched, the SSD-based cluster provided better latency and lower cost.

      TL/DR: "Same Throughput, Lower Latency, Half Cost".

  2. 20x faster by drabbih · · Score: 3, Informative

    By switching to SSD's on a data intensive web application, I got 20 times speed improvement - from 20 hits per second to 400. I trust SSDs more than physical spindles any day.

    1. Re:20x faster by Anonymous Coward · · Score: 2, Insightful

      TRIM isn't necessary if the SSD uses spare sectors to keep the write amplification low. You can also partition the SSD to have a swath of unused space for that purpose.

    2. Re:20x faster by donaldm · · Score: 2, Insightful

      By switching to SSD's on a data intensive web application, I got 20 times speed improvement - from 20 hits per second to 400. I trust SSDs more than physical spindles any day.

      When designing storage for any Business or Enterprise the disks (solid state or spinning) should always be in some sort of RAID configuration that supports disk redundancy. Failure to do this could result in loss of data when the disk eventually fails and it will. I am often asked "How long" and my answer is "How long is a peace of string".

      At the moment SSD's are excellent when you need high I/O from a few disks up to say a few TB however if you look at enterprise storage solutions of 10's or even 1000's of TBytes you are still looking at spinning media with large cache front ends (BTW I am talking about $20k up to many millions of dollars storage area networks). Of course for smaller scale computing SSD's are excellent for high performance but unless you don't really care about your data you still need disk redundancy or I hope your backup and recovery services are excellent, keeping in mind that an outage may cost a considerable amount of money for every hour or even minute you are down.

      It must be noted that when designing any computing system you really need to consider performance expectations as well as backup and recovery requirements. The choice of using SSD's, spinning media or even SAN's is normally made after Business or Enterprise expectations are made clear.

      --
      There ain't no such thing as proprietary standards only proprietary formats. Standards are by definition open.
    3. Re:20x faster by Twinbee · · Score: 4, Insightful

      and my answer is "How long is a piece of string".

      Sorry, that phrase always strikes a nerve with me. More useful answers would include an average, or even better, a graph detailing the death rate of SSDs (and how they tend to die early if they do die, but tend to last if they get past that initial phase).

      --
      Why OpalCalc is the best Windows calc
    4. Re:20x faster by Anonymous Coward · · Score: 2, Interesting

      No, but then I can read and understand that "once the device runs out of untouched sectors" is not an "if" but a "when". An untouched sector is not the same as a spare sector either, because sectors which are used for reducing the write amplification are touched. An SSD maintains available sectors, not untouched or free sectors.

    5. Re:20x faster by SuricouRaven · · Score: 4, Insightful

      That requires explaining the poisson distribution to a pointy-haired boss.

    6. Re:20x faster by Culture20 · · Score: 4, Funny

      "How long is a peace of string"

      I have never known string to break a cease-fire.

    7. Re:20x faster by QuantumBeep · · Score: 2

      Modern SSDs offer under-provisioning for just this reason.

    8. Re:20x faster by boggin4fun · · Score: 3, Insightful

      The length of a piece of string is twice the distance from the center to an end.

  3. Long-term, not short-term by Todd+Knarr · · Score: 4, Insightful

    The question is really going to be what kind of shape the drives will be in a year or so from now after 12+ months of constant heavy usage. The usage profile in consumer computers is a lot different from that in a server, and the server workload's going to stress more of the weakest areas of SSDs. And when it comes to manufacturer or lab test results, simple rule: "The absolute worst-case conditions achievable in the lab won't begin to approximate normal operating conditions in the field.". So, while SSDs are definitely worth looking at, I'll let someone else to do the 24-36 month real-workload stress testing on them. There's a reason they call it the bleeding edge after all.

    1. Re:Long-term, not short-term by SQL+Error · · Score: 5, Informative

      We've been using SSDs in our servers since late 2008, starting with Fusion-io ioDrives and Intel drives since then - X25-E and X25-M, then 320, 520 and 710, and now planning to deploy a stack of S3700 and S3500 drives. Our main cluster of 10 servers has 24 SSDs each, we have another 40 drives on a dedicated search server, and smaller numbers elsewhere.

      What we've found:

      * Read performance is consistently brilliant. There's simply no going back.
      * Random write performance on the 710 series is not great (compared to the SLC-based X25-E or ioDrives), and sustained random write performance on the mainstream drives isn't great either, but a single drive can still outperform a RAID-10 array of 15k rpm disks. The S3700 looks much better, but we haven't deployed them yet.
      * SSDs can and do die without warning. One moment 100% good, next moment completely non-functional. Always use RAID if you love your data. (1, 10, 5, or 6, depending on your application.)
      * Unlike disks, RAID-5 or 50 works pretty well for database workloads.
      * We have noted the leading edge of the bathtub curve (infant mortality), but so far, no trailing edge as older drives start to wear out. Once in place, they just keep humming along.
      * That said, we do match drives to workloads - SLC or enterprise MLC for random write loads (InnoDB, MongoDB) and MLC for sequential write/random read loads (TokuDB, CouchDB, Cassandra).

    2. Re:Long-term, not short-term by Anonymous Coward · · Score: 3, Informative

      If you do RAID5 or RAID6, you should match your RAID block exactly to the write block size of the SSD. If you do not, then you will generally need two writes to each SSD for every actual write performed. This will reduce the lifetime for the SSD and reduces the efficiency. Most RAID controllers have no way of doing this automatically and it is not easy to learn what the write block size is on an SSD (it is not generally part of the information on the drive).

  4. Silver Bullet by SQL+Error · · Score: 4, Informative

    We have hundreds of SSDs in production servers. We couldn't survive without them. For heavy database workloads, they are the silver bullet to I/O problems, so much so that running a database on regular disk has become almost unimaginable. Why would you even try to do that?

    1. Re:Silver Bullet by AK+Marc · · Score: 3, Insightful

      write wear is a read herring. Unless you are over-writing the entire drive multiple times a day, you'll last longer with an SSD than spinning disk. And even then, the current generation will last longer than the early ones, and early ones are lasting longer than predicted.

    2. Re:Silver Bullet by cffrost · · Score: 4, Funny

      write wear is a read herring.

      Are you sure it's not a reed salmon?

      --
      Thank you, Edward Snowden.

      "Arguments from authority are worthless." —Carl Sagan
    3. Re:Silver Bullet by Threni · · Score: 2

      It's actually a read/write herring, but most of the writing is cached and deferred for actual physical writing later on.

  5. Near-line storage only: Has been for some years. by MROD · · Score: 5, Informative

    You have to remember that enterprise level storage isn't a single set of drives holding the data, it's a hierarchy of different technologies depending upon the speed of data access required. Since SSDs arrived they've been used at the highest access rate end of the spectrum, essentially using their low latency for caching filesystem metadata. I can see that now they are starting to replace the small, high speed drives at the front end entirely. However, it's going to be some time before they can even begin to replace the storage in the second tier and certainly not in the third tier storage where access time isn't an issue but reliable, "cheap" and large drives are required. Of course, beyond this tier you generally get on to massive robotic tape libraries anyway, so SSDs will never in the foreseeable future trickle down to here.

    --

    Agrajag: "Oh no, not again!"
  6. enterprise class SSDs not the same by Anonymous Coward · · Score: 5, Interesting

    The enterprise class SSDs are not the same as the "consumer" ones: http://www.anandtech.com/print/6433/intel-ssd-dc-s3700-200gb-review

    Don't be surprised if you stick a "consumer" grade one to a heavily loaded DB server and it dies a few months later.

    Fine for random read-only loads.

    And some consumer grade SSDs aren't even consumer grade (I'm looking at you OCZ: http://www.behardware.com/articles/881-7/components-returns-rates-7.html ).

  7. Price by asmkm22 · · Score: 4, Interesting

    Pricing really needs to come down on these things. A single drive can easily cost as much as a server, and when you're talking about RAID setups, forget it. It's still much more effective to use magnetic drives and use aggressive memory caching for performance, if you really need that.

    Another 3 to 5 years this idea might have more traction for companies that aren't Facebook or Google, but right now, SSD costs too much.

  8. Single component failure not a big deal any more. by 12dec0de · · Score: 5, Informative

    I think that the wide range adoption of server SSDs also shows how far server installations have progressed toward eliminating all single points of failure.

    In the passt HA and 'five nines' was something only done by a few niches, like telephony provider switches or banking big iron. Today it is common in many cloud installations and most sizeable server setups. A single component failing will not stop your service.

    If your business can support the extra cost for the SSDs, a failing drive will not stop you and the performance of the service will see great improvements anyway. The power savings may even make the SSD not so costly after all.

  9. Virtualisation by drsmithy · · Score: 5, Interesting

    This is being driven primarily by increasing levels of virtualisation, which turns everything into a largely random-write disk load, pretty much the worst case scenario for regular old hard disks.

  10. Re:And beyond SSD, the future is PCIe Flash by wonkey_monkey · · Score: 4, Insightful

    Up to 2.5 times faster

    Ah, "up to." Marketing's best friend.

    --
    systemd is Roko's Basilisk.
  11. Re:SAS SSD by TheRaven64 · · Score: 2

    SAS doesn't really get you anything useful with an SSD. The extra chaining isn't that important, because it's easy to get enough SATA sockets to put one in each drive bay. There's no mSATA equivalent for denser storage, and if you really need the extra speed then why not go all the way and get something like FusionIO cards that hang directly off the PCIe bus?

    --
    I am TheRaven on Soylent News
  12. Specific applications now, everything later by crucial_hendo · · Score: 2

    I work for an Australian hosting company and we have deployed the SolidFire all-SSD SAN for our cloud-based hosting (shared, reseller, cloud/virtual server), the major benefits of an all-SSD storage solution speak for themselves: far lower I/O wait time, huge IOPS numbers - in SolidFire's case 250,000+ distributed IOPS in our current configuration. We've recently shifted from the HP SAS-based Lefthand SAN offering up to 15,000 IOPS to the new SolidFire all-SSD SAN and the team behind SolidFire are partly from the Lefthand operation from HP, so there's some good know-how there. The article is quite broad in its content, for big data applications SSD SAN storage is still incredibly more expensive ($/GB) than SATA or SAS based SANs - our SolidFire was a huge investment. Many hosting providers are now switching to all-SSD based servers for the performance benefits, however the drawback is primarily total storage capacity of course. For example a typical VPS node using local storage with 10 x SATA drives can get up to 4TB of usable RAID-protected storage. The numbers for an all-SSD node in RAID configuration would be much lower in capacity and suitable higher in cost. Its important to note that many people view SSDs as desktop only hardware, which is fundamentally incorrect, as there are many units out there that offer write longevity much longer than needed (5-10+ years). For many server based applications (not big-data purposes), SSDs are, and will become the predominant choice among many hosting companies. Not every provider can afford the investment of an enterprise grade SAN, however the speed of development from Intel and Samsung will mean the $/GB will drop steeply and disk sizes will increase exponentially (like what we've with SATA in the past 5 years).

  13. Re:And beyond SSD, the future is PCIe Flash by cdrudge · · Score: 4, Funny

    They have to say up to. Reads and writes towards the inside of the chip are slower then they are towards the outside of the chip. I don't think anyone makes a constant linear velocity SSD.

  14. More Common Than You Think... by Whatchamacallit · · Score: 4, Interesting

    SSD's might not be used as primary storage, yet. The cost of using a lot of SSD's in a SAN is still too high. However, that doesn't mean that SSD technology is not being used. Many systems started using SSD's as Read/Write caches or highspeed buffers, etc. The PCIe SSD cards are popular in highend servers. This is one way that Oracle manages to blow away the competition when benchmarks are compared. They put a PCIe SSD cards into their servers and use them to run their enterprise database at lightning speeds! ZFS can use SSD's as Read/Write caches although you had better battery backup the Write cache!.

    Depending on a particular solution, a limited number of SSD's in a smaller NAS/iSCSI RAID setup can make sense for something that needs some extra OOMF! But I don't yet see large scale replacement of traditional spinning rust drives with SSD's yet. In many cases, SSD's only make sense for highly active arrays where reads and writes are very heavy. Lots of storage sits idle and isn't being pounded that hard.