Slashdot Mirror


SSDs: The New King of the Data Center?

Nerval's Lobster writes "Flash storage is more common on mobile devices than data-center hardware, but that could soon change. The industry has seen increasing sales of solid-state drives (SSDs) as a replacement for traditional hard drives, according to IHS iSuppli Research. Nearly all of these have been sold for ultrabooks, laptops and other mobile devices that can benefit from a combination of low energy use and high-powered performance. Despite that, businesses have lagged the consumer market in adoption of SSDs, largely due to the format's comparatively small size, high cost and the concerns of datacenter managers about long-term stability and comparatively high failure rates. But that's changing quickly, according to market researchers IDC and Gartner: Datacenter- and enterprise-storage managers are buying SSDs in greater numbers for both server-attached storage and mainstream storage infrastructure, according to studies both research firms published in April. That doesn't mean SSDs will oust hard drives and replace them directly in existing systems, but it does raise a question: are SSDs mature enough (and cheap enough) to support business-sized workloads? Or are they still best suited for laptops and mobile devices?"

172 comments

  1. Great for some apps (see netflix blog) by ron_ivi · · Score: 5, Interesting
    This blog article's very relevant: http://techblog.netflix.com/2012/07/benchmarking-high-performance-io-with.html

    TL/DR: "The relative cost of the two configurations shows that over-all there are cost savings using the SSD instances"

    at least for their use-case (Cassandra).

    At work we also use SSDs for a couple terabyte Lucene index with great success (and far cheaper than getting a couple TB of DRAM spread across the servers instead)

    1. Re:Great for some apps (see netflix blog) by wvmarle · · Score: 4, Interesting

      So you're replacing RAM with SSD, not HD with SSD. Interesting.

      And would you even be able to do this with DRAM modules? Normal PC motherboards don't support that.

    2. Re:Great for some apps (see netflix blog) by SQL+Error · · Score: 5, Interesting

      You can build a 48-core Opteron server with 512GB of RAM for under $8000. Going over 512GB in a single server gets a lot more expensive (you either need expensive high-density modules or expensive 8-socket servers - or both) but if you can run some sort of cluster that's not a problem.

    3. Re:Great for some apps (see netflix blog) by wisnoskij · · Score: 1

      How does that make sense. Sure SSD is very similar to RAM physically, but it is still like a thousand times shower, is it not?

      --
      Troll is not a replacement for I disagree.
    4. Re:Great for some apps (see netflix blog) by Bill,+Shooter+of+Bul · · Score: 1

      I don't understand the confusion, maybe a car analogy will help.

      John smith is switching to a new Mustang from a mid 90's civic to reduce his merge time. This represents a huge savings over buying a Porsche. Make sense now?

      --
      Well.. maybe. Or Maybe not. But Definitely not sort of.
    5. Re:Great for some apps (see netflix blog) by wisnoskij · · Score: 1

      Maybe if he was using a golf cart instead of a porsche it would be a better analogy.

      --
      Troll is not a replacement for I disagree.
    6. Re:Great for some apps (see netflix blog) by ron_ivi · · Score: 1

      And would you even be able to do this with DRAM modules? Normal PC motherboards don't support that.

      Even low-end (dual-CPU 2-U) servers these days support either 192 or 256GB. It's not that hard or expensive to get 4 256GB or 6 192GB servers.

      But as that link to Netflix's' blog points out - SSDs can have better price/performance than DRAM at the moment if you need a lot.

    7. Re:Great for some apps (see netflix blog) by ron_ivi · · Score: 5, Informative

      How does that make sense.

      As the link to Netflix pointed out -- they benchmarked the entire system with the same REST API in front.

      They configured one cluster of SSD-based servers; which another cluster of spinning-disk-with-large-RAM-based servers. It took a cluster of 15 SSD-backed servers to match the throughput of 84 RAM+Spinning servers. With throughput matched, the SSD-based cluster provided better latency and lower cost.

      TL/DR: "Same Throughput, Lower Latency, Half Cost".

    8. Re:Great for some apps (see netflix blog) by ron_ivi · · Score: 1

      but it is still like a thousand times shower, is it not?

      Yes; but it's still like 5-500x faster than spinning disks too (obviously depending on if you're talking sequential I/O, or random-acces).

    9. Re:Great for some apps (see netflix blog) by Archangel+Michael · · Score: 0

      I just spec'ed out a 2U Dual 8 Core server with 386 GB RAM. The thing could hold 768 GB total If I didn't put in the GPU. The cost for doubling the RAM doubled the cost of the server. And at that point, having more CPU is more useful than RAM.

      As for SSD vs HD, you should really start looking at something like Nimble Storage, which tiers storage between onboard RAM, SDD and regular HDs, to provide huge IOPS advantages over regular SAN storage, with the same kind of drive type/counts. In the datacenter, its IOPS for your storage, followed by Size. Long term slow, and big storage is less useful than having high speed access to data you actually need at that moment. IOPS is key to getting data on the wire and to the processors that need it.

      --
      Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
    10. Re:Great for some apps (see netflix blog) by Anonymous Coward · · Score: 1

      And since the typical Netflix operation is write once, read many*, this would negate the SSD write wear issue. Customer data might still work better under a more traditional setup however.

      * for movie storage.

    11. Re:Great for some apps (see netflix blog) by xeio87 · · Score: 1

      Because RAM caching is just a band-aid and only speeds things up if your entire working set can live in memory (or close enough that disk IO becomes rare). Netflix has way too much data to do this, and they were bottlenecked by disk IO even with the complicated caching system they had in RAM.

      Essentially the SSD servers eliminated the disk IO bottleneck, so the RAM changing became unnecessary (which also reduces CPU and memory load, since caching is not free).

    12. Re:Great for some apps (see netflix blog) by wisnoskij · · Score: 1

      Ahh, I think I understand.
      Netflix has loads of data, and it is all being used all the time.
      Too much data for all of it to be stored in RAM. And too much being used concurrently to even just store in the currently streaming files.
      Also Netflix cannot just quickly steam a file completely using all of its bandwidth, since the recipient cannot receive it any faster than their connection.

      So Netflix needs to quickly stream little bits of files and switch between them rapidly. So random access. And data is only used once, it is not like it is loading in a game resource that will be used over and over again, it is loading a frame from a particular video file that is then steamed to a customer and them removed as RAM space is limited.

      So RAM in this context offers basically 0 benefit.
      But faster, random access, memory, that does not even have to be good at writing data, is really the only important feature.

      --
      Troll is not a replacement for I disagree.
    13. Re:Great for some apps (see netflix blog) by jedidiah · · Score: 1

      In the old days, SSD was nothing more than RAM with a battery backup.

      So the idea that you would replace memory with SSD rather than spinny disk with SSD is not terribly surprising.

      The same "working set" problem that applies to RAM also applies to SSD. Your solution actually has to be appropriate to the problem and SSD isn't necessarily a cure all.

      SSD is new and trended and a bit overhyped.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    14. Re:Great for some apps (see netflix blog) by swb · · Score: 1

      IOPS, blah blah blah.

      The more I think about IOPS the more I think it is a manufactured statistic designed to "prove" performance yet at the same time being something you can't compare to another environment.

      For example, every storage environment has a different I/O size and read/write mix, rendering IOP comparisons between storage devices moot.

    15. Re:Great for some apps (see netflix blog) by snadrus · · Score: 1

      Though modprobe bcache is probably cheaper.

      --
      Science & open-source build trust from peer review. Learn systems you can trust.
    16. Re:Great for some apps (see netflix blog) by demonbug · · Score: 1

      I guess, but he's only going from 7.5 seconds (1995 Civic Si) 0-60 to about 6.8 seconds (2013 Mustang V6 automatic), so only about a 10% improvement. I think the overall improvement from a HDD to a SSD is significantly more than that. Now if you said a mid-90s Civic LX to a new Mustang GT you might have a better point.

    17. Re:Great for some apps (see netflix blog) by lgw · · Score: 1

      IOPS is what matters for database/Exchange server load, and now for virtualized desktop load. Certain server uses have very well known workloads with lots of very small random I/Os. You can find software to simulate the I/O workload of an exchange server that you can easily tune to the details of your shop and use it to test-drive storage arrays.

      For other sorts of storage, you only care about cost/GB and reliability, and IOPS is pretty meaningless.

      --
      Socialism: a lie told by totalitarians and believed by fools.
    18. Re:Great for some apps (see netflix blog) by lgw · · Score: 1

      A V6 Mustang is like the Matrix sequels or Star Wars prequels - enthusiasts know they don't actually exist.

      --
      Socialism: a lie told by totalitarians and believed by fools.
    19. Re:Great for some apps (see netflix blog) by xeio87 · · Score: 1

      I've been using SSDs in my computers for 5 years now. If you think they're overhyped... I can only assume you haven't used one?

      Admittedly, they're more expensive than platter drives (I still use all spinning drives in my NAS, where storage is more important than speed), but I do not miss load screens or waiting for my computer to become responsive. I have to deal with that crap at work still, I feel the chug of the HDD every time I open a solution in Visual Studio.

    20. Re:Great for some apps (see netflix blog) by Bill,+Shooter+of+Bul · · Score: 1

      1995 Civics do not go 0-60 in 6.8 seconds. Maybe they did in 1995, but not now. At least *mine* did not. It was much closer to 15 seconds. I guess you had to be my passenger to get that reference, s/Civic/yugo/g

      --
      Well.. maybe. Or Maybe not. But Definitely not sort of.
    21. Re:Great for some apps (see netflix blog) by dublin · · Score: 1

      How does that make sense. Sure SSD is very similar to RAM physically, but it is still like a thousand times shower, is it not?

      Yep, that's why folks like John Ousterhout (originator of tcl) are now workgng on things like RAMcloud - to get the lightning speed of RAM scaled to big data sizes. Commodity 10/100G Ethernet will make this idea *really* interesting...

      --
      "The future's good and the present is nothing to sneeze at." - Roblimo's last ./ post
  2. 20x faster by drabbih · · Score: 3, Informative

    By switching to SSD's on a data intensive web application, I got 20 times speed improvement - from 20 hits per second to 400. I trust SSDs more than physical spindles any day.

    1. Re:20x faster by Anonymous Coward · · Score: 1

      Do you use TRIM? Otherwise that speed bump will go way down once the device runs out of untouched sectors. And TRIM over RAID is still a no-go in most environments.

    2. Re:20x faster by Anonymous Coward · · Score: 2, Insightful

      TRIM isn't necessary if the SSD uses spare sectors to keep the write amplification low. You can also partition the SSD to have a swath of unused space for that purpose.

    3. Re:20x faster by donaldm · · Score: 2, Insightful

      By switching to SSD's on a data intensive web application, I got 20 times speed improvement - from 20 hits per second to 400. I trust SSDs more than physical spindles any day.

      When designing storage for any Business or Enterprise the disks (solid state or spinning) should always be in some sort of RAID configuration that supports disk redundancy. Failure to do this could result in loss of data when the disk eventually fails and it will. I am often asked "How long" and my answer is "How long is a peace of string".

      At the moment SSD's are excellent when you need high I/O from a few disks up to say a few TB however if you look at enterprise storage solutions of 10's or even 1000's of TBytes you are still looking at spinning media with large cache front ends (BTW I am talking about $20k up to many millions of dollars storage area networks). Of course for smaller scale computing SSD's are excellent for high performance but unless you don't really care about your data you still need disk redundancy or I hope your backup and recovery services are excellent, keeping in mind that an outage may cost a considerable amount of money for every hour or even minute you are down.

      It must be noted that when designing any computing system you really need to consider performance expectations as well as backup and recovery requirements. The choice of using SSD's, spinning media or even SAN's is normally made after Business or Enterprise expectations are made clear.

      --
      There ain't no such thing as proprietary standards only proprietary formats. Standards are by definition open.
    4. Re:20x faster by wonkey_monkey · · Score: 0

      Otherwise that speed bump will go way down once the device runs out of untouched sectors.

      TRIM isn't necessary if the SSD uses spare sectors

      See where you went wrong there?

      --
      systemd is Roko's Basilisk.
    5. Re:20x faster by Big+Hairy+Ian · · Score: 1

      I know some hosting companies that have been all SSD for years this article is no surprise given how much data is flung around on the cloud.

      --

      Build a Man a Fire, and He'll Be Warm for a Day. Set a Man on Fire, and He'll Be Warm for the Rest of His Life.

    6. Re:20x faster by Twinbee · · Score: 4, Insightful

      and my answer is "How long is a piece of string".

      Sorry, that phrase always strikes a nerve with me. More useful answers would include an average, or even better, a graph detailing the death rate of SSDs (and how they tend to die early if they do die, but tend to last if they get past that initial phase).

      --
      Why OpalCalc is the best Windows calc
    7. Re:20x faster by Anonymous Coward · · Score: 2, Interesting

      No, but then I can read and understand that "once the device runs out of untouched sectors" is not an "if" but a "when". An untouched sector is not the same as a spare sector either, because sectors which are used for reducing the write amplification are touched. An SSD maintains available sectors, not untouched or free sectors.

    8. Re:20x faster by SuricouRaven · · Score: 4, Insightful

      That requires explaining the poisson distribution to a pointy-haired boss.

    9. Re:20x faster by Culture20 · · Score: 4, Funny

      "How long is a peace of string"

      I have never known string to break a cease-fire.

    10. Re:20x faster by drsmithy · · Score: 1

      At the moment SSD's are excellent when you need high I/O from a few disks up to say a few TB however if you look at enterprise storage solutions of 10's or even 1000's of TBytes you are still looking at spinning media with large cache front ends (BTW I am talking about $20k up to many millions of dollars storage area networks).
      Well, what you're usually looking at is a storage system with multiple types and speeds of disks that automatically moves data through the tiers depending on the frequency and type of access. SSDs will form one of these tiers. If the storage system is any good, it will also let you manually pin or hint specific subsets of your data so that they are always held on the fastest tier (ie: SSDs).
      Since the _active_ subset of data even in quite large organisations is generally relatively small, a few hundred GB or a few TB of flash will often give 90%+ of the real-life performance that a pure flash array would.

    11. Re:20x faster by JustOK · · Score: 0

      Perhaps, but it is often very knotty.

      --
      rewriting history since 2109
    12. Re:20x faster by Anonymous Coward · · Score: 1

      "How long is a peace of string"

      I have never known string to break a cease-fire.

      Throw a couple of const char*'s at it and all hell will break loose.

    13. Re:20x faster by Anonymous Coward · · Score: 0

      Someone failed Probability 101... Learn your distributions! I hate trying to teach this stuff to people over and over again. And here's the thing, I get that "this is not what you do" so you don't think you should have to spend any brain power on it (you know, like how you don't memorize the cast of your favorite TV show because that's not what you do) but then don't ask the question since you are just wasting everyone's time. You have already admitted that you don't get it and don't want to, so leave it to the people who do get it.

    14. Re:20x faster by QuantumBeep · · Score: 2

      Modern SSDs offer under-provisioning for just this reason.

    15. Re:20x faster by boggin4fun · · Score: 3, Insightful

      The length of a piece of string is twice the distance from the center to an end.

    16. Re:20x faster by drabbih · · Score: 1

      Let me rephrase this. The web application I use was on a read-only database which was recoverable. The uptime requirements were low. I used 3 SSD drives in a raid 0 configuration to attain 1800 MB/s transfer rate, which was constant whether or nto the read was sequential or random. That is faster than a 10GBe connection to any SAN configuration, much less expensive, and much more responsive. The machine has been running for one year in a 24x7 operation without issue. Obviously, RAID configurations let us mitigate speed and safety requirements on storage systems regardless of their underlying media. But for my real case example, the perfromance and reliability is incredible.

    17. Re:20x faster by Anonymous Coward · · Score: 0

      A piece of string walks into a bar and orders a beer. The bartender tells him, "We don't serve your kind here, get out!"

      So the string goes outside, gets mad, roughs up his ends and twists himself around, then walks back into the bar.

      The bartender asks, "Are you that same piece of string I just threw out of here a minute ago?"

      The string replies, "No, I'm a frayed knot!"

      Beware. Drunken strings can start wars. Just ask my ex-wife!

      Thanks! I'll be here all week!

    18. Re:20x faster by Archangel+Michael · · Score: 1

      Do you want to hear the joke about a piece of string?

      I'm a frayed knot.

      --
      Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
    19. Re:20x faster by dbIII · · Score: 1

      "How long is a peace of string"

      About the same as a "Concordat of Worms".

    20. Re:20x faster by Anonymous Coward · · Score: 0

      TRIM works fine on ZFS, which is arguably better than RAID for some applications

    21. Re:20x faster by citizenr · · Score: 1

      I trust SSDs more than physical spindles any day.

      for the 72TB of write, after that you are boned
      70TB is a month at max in a datacenter

      --
      Who logs in to gdm? Not I, said the duck.
  3. Long-term, not short-term by Todd+Knarr · · Score: 4, Insightful

    The question is really going to be what kind of shape the drives will be in a year or so from now after 12+ months of constant heavy usage. The usage profile in consumer computers is a lot different from that in a server, and the server workload's going to stress more of the weakest areas of SSDs. And when it comes to manufacturer or lab test results, simple rule: "The absolute worst-case conditions achievable in the lab won't begin to approximate normal operating conditions in the field.". So, while SSDs are definitely worth looking at, I'll let someone else to do the 24-36 month real-workload stress testing on them. There's a reason they call it the bleeding edge after all.

    1. Re:Long-term, not short-term by SQL+Error · · Score: 5, Informative

      We've been using SSDs in our servers since late 2008, starting with Fusion-io ioDrives and Intel drives since then - X25-E and X25-M, then 320, 520 and 710, and now planning to deploy a stack of S3700 and S3500 drives. Our main cluster of 10 servers has 24 SSDs each, we have another 40 drives on a dedicated search server, and smaller numbers elsewhere.

      What we've found:

      * Read performance is consistently brilliant. There's simply no going back.
      * Random write performance on the 710 series is not great (compared to the SLC-based X25-E or ioDrives), and sustained random write performance on the mainstream drives isn't great either, but a single drive can still outperform a RAID-10 array of 15k rpm disks. The S3700 looks much better, but we haven't deployed them yet.
      * SSDs can and do die without warning. One moment 100% good, next moment completely non-functional. Always use RAID if you love your data. (1, 10, 5, or 6, depending on your application.)
      * Unlike disks, RAID-5 or 50 works pretty well for database workloads.
      * We have noted the leading edge of the bathtub curve (infant mortality), but so far, no trailing edge as older drives start to wear out. Once in place, they just keep humming along.
      * That said, we do match drives to workloads - SLC or enterprise MLC for random write loads (InnoDB, MongoDB) and MLC for sequential write/random read loads (TokuDB, CouchDB, Cassandra).

    2. Re:Long-term, not short-term by wvmarle · · Score: 1

      Will also depend greatly on your specific use case: whether it's lookups from a huge, mostly read-only database, or for use in a mail server which is constantly writing data as well. By my understanding at least it's the writes that wear out the SSD, not the reads.

    3. Re:Long-term, not short-term by Anonymous Coward · · Score: 0

      The question is really going to be what kind of shape the drives will be in a year or so from now after 12+ months of constant heavy usage.

      Just fine. EFDs have been front ending SAN storage as a 'high tier' for years now, being hit far harder than any single server could manage.

      I've not seen anything like the attrition rate on them that I do on SAS / SATA.

    4. Re:Long-term, not short-term by 0ld_d0g · · Score: 1

      Do you happen to know the failure rate off hand? Also did you do any research into which manufacturer has the least failure rate before deciding on the brand?

    5. Re:Long-term, not short-term by Anonymous Coward · · Score: 3, Informative

      If you do RAID5 or RAID6, you should match your RAID block exactly to the write block size of the SSD. If you do not, then you will generally need two writes to each SSD for every actual write performed. This will reduce the lifetime for the SSD and reduces the efficiency. Most RAID controllers have no way of doing this automatically and it is not easy to learn what the write block size is on an SSD (it is not generally part of the information on the drive).

    6. Re:Long-term, not short-term by silas_moeckel · · Score: 1

      Enterprise SSD's have been out for half a decade in production. I have roughly 300 Ent SSD's and more than a thousand consumer ones in servers and no failures. Retired many of the early ent SSD's well before they were pushing there write limits as we aged out servers (3-5 years service life). The consumer ones are acting as read cache for local and iscsi disk does wonders.

      --
      No sir I dont like it.
    7. Re:Long-term, not short-term by greg1104 · · Score: 1

      I did my first write heavy deployment of PostgreSQL on Intel DC S3700 drives about a month ago, with each one of them replacing two Intel 710 drives. The write performance is at least doubled--the server is more than keeping up even with half the number of drives--and in some cases they easily look as much as 4X faster than the 710s. I've been able to get the 710 drives to degrade to pretty miserable read performance on mixed read/write workloads too, as low as 20MB/s, but the DC S7300 drives don't seem to fall down that way either. I'm replacing older Intel drives that are struggling with DC S7300 models now as fast as I can get them.

    8. Re:Long-term, not short-term by Spoke · · Score: 1

      now planning to deploy a stack of S3700 and S3500 drives.

      Yep, these are the only drives I'd recommend for enterprise use - or any other use where you want to be sure that losing power will not corrupt the data on the disk thanks to actual power-loss protection.

      Intel's pricing with the S3500 places it very competitively in the market - even for desktop/laptop use I would have a hard time not recommending it over other drives unless you don't care about reliability and really need maximum random write performance or really need the lowest cost.

    9. Re:Long-term, not short-term by Anonymous Coward · · Score: 0

      * That said, we do match drives to workloads - SLC or enterprise MLC for random write loads (InnoDB, MongoDB) and MLC for sequential write/random read loads (TokuDB, CouchDB, Cassandra).

      What are some models of SLC drives? They seem to be rare, and I have a hard time finding them.

    10. Re:Long-term, not short-term by AcquaCow · · Score: 1

      Have you looked at the price point of the ioScale cards?

      --

      up 12 days, 22:30, 2 users, load averages: 993.20, 994.21, 994.56
      *makes note to limit user processes...
    11. Re:Long-term, not short-term by SQL+Error · · Score: 1

      Not off hand, sorry. I haven't been the sysadmin for 18 months (moved back to programming), and I don't want to give a guess that might be off by a factor of two.

    12. Re:Long-term, not short-term by SQL+Error · · Score: 1

      Micron and Toshiba make them, but they're hard to find. You can also get SLC ioDrives. But the Intel S3700 looks to be nearly as good, and much, much cheaper.

    13. Re:Long-term, not short-term by jon3k · · Score: 1

      When you say it outperforms a RAID10 array of 15K RPM disks - how MANY disks? 4? 100?

    14. Re:Long-term, not short-term by jon3k · · Score: 1

      Also - thanks for the info, very interesting and honestly what I would have suspected. Nice to see it play out in the real world.

    15. Re:Long-term, not short-term by sl4shd0rk · · Score: 1

      The question is really going to be what kind of shape the drives will be in a year or so

      Thing is, you don't need SSD on every partition of a server. Just putting /var on a 10G of a 60G SSD RAID1 will get you some really awesome performance (Apache+mysql) with a fair amount of over-provisioning so you don't have to worry* about TRIM (which is panning out to be nothing more than benchmark magic anyhow) or killing the disks early. You get lots of IOPS to spare, especially on reads, so running mysqldump every hour or whatever may be within reason now without seeing a huge I/O bottleneck. Replace the SSDs every year. The performance gains will be worth the cost. smartctl will tell you the operating hours on the drive to make it easy to remember.

      [*] - http://www.spinics.net/lists/raid/msg40866.html

      --
      Join the Slashcott! Feb 10 thru Feb 17!
  4. Silver Bullet by SQL+Error · · Score: 4, Informative

    We have hundreds of SSDs in production servers. We couldn't survive without them. For heavy database workloads, they are the silver bullet to I/O problems, so much so that running a database on regular disk has become almost unimaginable. Why would you even try to do that?

    1. Re:Silver Bullet by Anonymous Coward · · Score: 0

      Are your DBs primarily read-only? I would think that they'd wear our quickly under heavy write usage.

    2. Re:Silver Bullet by SQL+Error · · Score: 1

      It's a mix. We use enterprise drives for the really heavy stuff, and mainstream drives for data that's either read-only, read-mostly, or is in a database that does sequential writes like TokuDB or Cassandra.

    3. Re:Silver Bullet by AK+Marc · · Score: 3, Insightful

      write wear is a read herring. Unless you are over-writing the entire drive multiple times a day, you'll last longer with an SSD than spinning disk. And even then, the current generation will last longer than the early ones, and early ones are lasting longer than predicted.

    4. Re:Silver Bullet by cffrost · · Score: 4, Funny

      write wear is a read herring.

      Are you sure it's not a reed salmon?

      --
      Thank you, Edward Snowden.

      "Arguments from authority are worthless." —Carl Sagan
    5. Re:Silver Bullet by Anonymous Coward · · Score: 0

      I call Bullshit*.

      We've gone back to spinning disk for heavy write applications as SSD just doesn't last.
      Its failure modes are also bad - you go from working -> fatally dead immediately. Spinning disk at least gives some warning.

      For low writes - eg web serving its fine. For anything where you do > moderate disk writes - forget it.
      We've had *far* too many failures on different SSD's to even consider them. Its still uncharted territory.

      Based off personal experience - +-60 SSD's of different brands tested. **100%** failure rate achieved, some within weeks, none lasted more than 6 months.

      I wouldn't touch SSD with a bargepole UNLESS its backed up elsewhere. For pure caching its fine, for data storage - forget it.

    6. Re:Silver Bullet by Threni · · Score: 2

      It's actually a read/write herring, but most of the writing is cached and deferred for actual physical writing later on.

    7. Re:Silver Bullet by war4peace · · Score: 1

      He reads data off a herring and writes it on what he wears.

      --
      ...gis sdrawkcab (usually not responding to ACs; don't bother posting as AC)
    8. Re:Silver Bullet by h4rr4r · · Score: 0

      Why does the failure mode matter?
      You toss another one in the array when it fails. Rebuild goes mighty quick with SSDs.

      You always have to have backups. SSDs do not change that.

    9. Re:Silver Bullet by SQL+Error · · Score: 1

      Because if write wear is the prime failure mode and you're running RAID, you're likely to lose multiple SSDs in a relatively short interval.

    10. Re:Silver Bullet by dbIII · · Score: 1

      I found write performance hit a huge wall once the things started filling up. Perfect to kb/s in an instant and then getting stuck at that speed, and of course since an erase is a write recovery from that state took ages. The answer I suppose is to not let them get anywhere near full - where that point is will undoubtedly vary by model based on their internal controllers. I can't recall where it fell over but I think it was still under 90% with one set of SSDs.
      I replaced them with spinning storage and people were happy, but that was something that didn't need a lot of file operations per second.

    11. Re:Silver Bullet by h4rr4r · · Score: 1

      So you replace them. I am still not seeing the issue. Surely you have hot spares.

    12. Re:Silver Bullet by SQL+Error · · Score: 1

      Depends a lot on the drive, but that can be a problem. The best solution is to either buy a drive with a significant amount of over-provisioning built in (like the Intel S3700 or Seagate 600 Pro) or over-provision it yourself. That means that when it fills up it still has plenty of spare area to remap blocks.

      Enterprise drives typically have at least 20% over-provisioning; consumer drives can be 5% or less. A 400GB Seagate 600 Pro is the same as a 480GB Seagate 600, except for that setting.

    13. Re:Silver Bullet by SQL+Error · · Score: 1

      Yes. And one of the reasons for having hot spares (and replicas and backups) is the chance of multiple drive failures close together. So it's not a problem if you've planned things properly, but it's something you need to consider to create a good plan in the first place.

    14. Re:Silver Bullet by Anonymous Coward · · Score: 0

      If they all fail at the same time the whole RAID goes down and there's no rebuilding, you're now recovering from backup.

    15. Re:Silver Bullet by h4rr4r · · Score: 1

      The odds of that happening are vanishingly small.

      It could even happen with spinning rust.

    16. Re:Silver Bullet by h4rr4r · · Score: 1

      SSDs don't change that either way.
      This is all stuff that gets done with spinning rust as well.

    17. Re:Silver Bullet by jon3k · · Score: 1

      Two possibilities:
      1. You're lying
      2. You were using consumer level MLC drives in high write workloads (I might believe this)

    18. Re:Silver Bullet by Anonymous Coward · · Score: 0

      Or a reed-solomon?

    19. Re:Silver Bullet by Hawke · · Score: 1

      You're assuming that the drive failures are independent. His point is that they might not be: the common cause may be write cycles.

      Let's say that a drive under your write patterns will last 9 months. (Bad wear leveling algo, combined with very re-write heavy data structures?). You put 5 of them in a raid 5 enclosure, all brand new drives. 9 months later, they all fail within minutes of each other. Whoops, lost your data.

      If they fail for different reasons, you're more likely to be safe. If they all fail from wearing out the ability to erase cells, you're more likely to be hosed, until you've swapped out enough to randomize the write count./p?

    20. Re:Silver Bullet by cffrost · · Score: 1

      Psst... Just between you and me? That was the joke... :o)

      --
      Thank you, Edward Snowden.

      "Arguments from authority are worthless." —Carl Sagan
    21. Re:Silver Bullet by AK+Marc · · Score: 1

      You also don't understand "failure mode". THe SSDs fail readable. spinning disk fails dead. You could get your data off an SSD, even if impractical for on disk in a RAID in a production environment. And if they fail so predictably, then you install them staggered by MTBF/(disks in array). Then the failures will be conveniently spaced. If as inconveniently consistent as you say (multiplying failure inconvenience), then you are either deliberate in trying to make them seem worse than they are, or terminally stupid for not trying the "fix" I came up with in 10 seconds and listed above. People with spinning disks would kill for that level of predictability in failure. That, and often spinning disks fail in clumps as well, especially if you buy a number in the same batch and they are less heavily used. A RAID rebuild is hard on a spinning disk, and I've had more than one rebuild fail for a drive failure during rebuild.

    22. Re:Silver Bullet by h4rr4r · · Score: 1

      Not going to happen.
      I have plenty in production right now proving that.

      The failure rates per drive are just not that close together. It is not like they all fail at exactly the same number of writes. Also friends don't let friends use RAID5.

      If losing a raid array loses your data, you should lose your job. Backups exist for a reason.

    23. Re:Silver Bullet by xeio87 · · Score: 1

      You also don't understand "failure mode". THe SSDs fail readable. spinning disk fails dead.

      This hasn't really appeared to be accurate in practice. In theory, they could do this, but from what I've read most SSDs on the market appear to hard fail. Probably due to the controller dying, as it's nearly impossible to wear the NAND into a read-only state anyway. They also tend to fail with no advance warning. Might be one of the only areas other than price where SSDs don't work out better than HDD, but time will tell.

      As usual, always keep a backup. Don't assume your drive will fail gracefully (that goes for HDDs too).

    24. Re:Silver Bullet by Anonymous Coward · · Score: 0

      That wasn't very nyquist of you

  5. Near-line storage only: Has been for some years. by MROD · · Score: 5, Informative

    You have to remember that enterprise level storage isn't a single set of drives holding the data, it's a hierarchy of different technologies depending upon the speed of data access required. Since SSDs arrived they've been used at the highest access rate end of the spectrum, essentially using their low latency for caching filesystem metadata. I can see that now they are starting to replace the small, high speed drives at the front end entirely. However, it's going to be some time before they can even begin to replace the storage in the second tier and certainly not in the third tier storage where access time isn't an issue but reliable, "cheap" and large drives are required. Of course, beyond this tier you generally get on to massive robotic tape libraries anyway, so SSDs will never in the foreseeable future trickle down to here.

    --

    Agrajag: "Oh no, not again!"
  6. I ran a Minecraft GSP off them for a year. by Anonymous Coward · · Score: 0

    I paid extra to have two in each machine so I could RAID 1 them in case one died. Minecraft is write-intensive and we also had map generation, although the maps were written to magnetic disks because they are so huge. Before that year, we were running everything on magnetic disks on hardware RAID 1.

    As soon as we switched, iowait went down to practically zero. System load followed. Map updates incurred high iowait on the magnetic disks, but had no impact on the SSDs or server performance, and they also finished a little faster.

    I don't think we ever had an issue with an SSD going bad. We did lose a magnetic hard drive once. Because the Minecraft servers were on SSDs, we just reinstalled the OS on a new drive and mounted the existing SSD RAID. It went alright.

    I would definitely use SSDs again if I was hosting something IO heavy, especially write heavy. They excel at it in a way that even a RAID10 of magnetic drives would be hard pressed to equal. You back up your data so if it fails, you're ready.

    1. Re:I ran a Minecraft GSP off them for a year. by Anonymous Coward · · Score: 0

      Minecraft isn't particularly write-heavy... it just likes to fsync(). A LOT.

  7. Perfect! by Anonymous Coward · · Score: 0

    Perfect! Now we just to sit back and watch the price come down even further.
    I am using Samsung (Pro) SSDs for all OS partitions and only use HDs for bulk storage nowadays.

  8. SAS SSD by Anonymous Coward · · Score: 0

    Ok great but where can you find an affordable SAS SSD?

    I really think that SSD have a great value but you must rethink you infra and application to work with them.

    1. Re:SAS SSD by TheRaven64 · · Score: 2

      SAS doesn't really get you anything useful with an SSD. The extra chaining isn't that important, because it's easy to get enough SATA sockets to put one in each drive bay. There's no mSATA equivalent for denser storage, and if you really need the extra speed then why not go all the way and get something like FusionIO cards that hang directly off the PCIe bus?

      --
      I am TheRaven on Soylent News
  9. enterprise class SSDs not the same by Anonymous Coward · · Score: 5, Interesting

    The enterprise class SSDs are not the same as the "consumer" ones: http://www.anandtech.com/print/6433/intel-ssd-dc-s3700-200gb-review

    Don't be surprised if you stick a "consumer" grade one to a heavily loaded DB server and it dies a few months later.

    Fine for random read-only loads.

    And some consumer grade SSDs aren't even consumer grade (I'm looking at you OCZ: http://www.behardware.com/articles/881-7/components-returns-rates-7.html ).

    1. Re:enterprise class SSDs not the same by CAIMLAS · · Score: 1

      I've been overwriting a consumer grade SSD as a ZIL on a daily basis, at home, for the past year and a half (Kingston). I know other people who have been doing the same with no problems. (Though, I have people who have had numerous issues with OCZ. OCZ isn't 'consumer' grade, it's 'gamer' grade, and shitty as a result.)

      --
      ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
  10. Price by asmkm22 · · Score: 4, Interesting

    Pricing really needs to come down on these things. A single drive can easily cost as much as a server, and when you're talking about RAID setups, forget it. It's still much more effective to use magnetic drives and use aggressive memory caching for performance, if you really need that.

    Another 3 to 5 years this idea might have more traction for companies that aren't Facebook or Google, but right now, SSD costs too much.

    1. Re:Price by Anonymous Coward · · Score: 1

      When you take a look at total cost of ownership, it's not bad (perhaps even cheaper) for many applications.

      An SSD is about two orders of magnitude lower latency than even the best high performance magnetic drive. Magnetic drives simply cannot compete with that, even in the most robust RAID setups. Magnetic media RAID setups can compete with single SSDs in sequential reads, but only by using many non-redundant drives.

      For any application where sequential read performance is the bottleneck (say, a media server), a RAID array of magnetic drives is likely the most cost effective. For any application in which random read/write performance is the bottleneck (almost all database driven applications), there simply is no competing on a performance or cost/performance measure with SSDs. You *cannot* achieve the same level of sustained IOPs (no matter how you configure the storage) with magnetic media as with SSDs.

      This, of course, is not to say there are no other concerns, such as amount of data, that may change the cost analysis.

    2. Re:Price by TheRaven64 · · Score: 1

      Even for sequential reads, SSDs can be an improvement. My laptop's SSD can easily handle 200MB/s sequential reads, and you'd need more than one spinning disk to handle that. And a lot of things that seem like sequential reads at a high level turn out not to be. Netflix's streaming boxes, for example, sound like a poster child for sequential reads, but once you factor in the number of clients connected to each one, you end up with a large number of 1MB random reads, which means your IOPS numbers translate directly to throughput.

      Spinning disks are still best where capacity is more important than access times. For example, hosting a lot of VMs where each one is typically accessing a small amount of live data (which can be cached in RAM or SSD) but has several GBs of inactive data.

      --
      I am TheRaven on Soylent News
    3. Re:Price by Anonymous Coward · · Score: 0

      But look at the article: they claim that one SSD-loaded server can functionally replace FIVE magnetic drive servers. So, even if the disks cost 4 times as much as the server, it's still cost effective (in their application) to make the switch. This is yet another demonstration of the actual power of modern processors: for any data-driven process, the CPU is primarily starved for data. Six or sixteen or sixty four cores in one shell, can just do more to more stuff faster than the iron particles can be moved under the read head.

    4. Re:Price by jedidiah · · Score: 1

      > Even for sequential reads, SSDs can be an improvement. My laptop's SSD can easily handle 200MB/s sequential reads, and you'd need more than one spinning disk to handle that.

      Except that's not true across the board. You may find that a more reliable brand doesn't have sequential performance nearly that good.

      For as much as SSD cost, you can easily double up on the spinning rust and still be way WAY ahead.

      You can get very noticable improvement even with spinny rust just by having more than one spindle and not pushing everything including the processes from 8 cores through a single physical bottleneck.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    5. Re:Price by jon3k · · Score: 1

      Depends on workload and performance requirements. If you have a very disk intensive workload that requires low latency and high throughput, but low storage capacity requirements, like DB workloads specifically (SQL, Mongo, etc) it can be DRAMATICALLY cheaper to use SSDs versus building massive disk based RAID arrays, and you'll never match the latency. A pair of Intel S3700 drives can outperform several shelves worth of 15K RPM SAS disks. Not to mention power, cooling, management, etc.

      Again, all depends on workload and requirements. Making blanket statements like you did are just silly.

    6. Re:Price by CAIMLAS · · Score: 1

      Well, it depends on what you're talking about. You can get a "system disk" SSD now for less than comparable rotating disks - we're talking in the 32GB range. If you're buying your storage directly from the hardware vendor, that's another story - Dell/HP/etc. still charge a premium.

      --
      ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
    7. Re:Price by asmkm22 · · Score: 1

      Since the article is about datacenter equipment, I wasn't talking about 32GB drives. Look into the 1TB range or even 512MB and prices start to hit several thousands each.

    8. Re:Price by WuphonsReach · · Score: 1

      $2.00-$2.50 per GB is "too much"? The Intel 200GB units are only $400-$450 each. The 15k RPM SAS 300GB units are down to $200 now ($0.67/GB).

      That's only a 3x-4x cost difference, vs what used to be 10x-30x cost difference. Plus instead of using RAID 0+1 setups on 15k SAS drives for write-heavy workloads, you can possibly use RAID-6 on SSDs instead and still get better performance.

      --
      Wolde you bothe eate your cake, and have your cake?
    9. Re:Price by asmkm22 · · Score: 1

      200GB drives aren't at all big enough for most datacenters. Look up the cost of the 500GB or even 1TB+ drives.

      But yeah, if you can get by with 200GB drives, then it's not so bad.

  11. Single component failure not a big deal any more. by 12dec0de · · Score: 5, Informative

    I think that the wide range adoption of server SSDs also shows how far server installations have progressed toward eliminating all single points of failure.

    In the passt HA and 'five nines' was something only done by a few niches, like telephony provider switches or banking big iron. Today it is common in many cloud installations and most sizeable server setups. A single component failing will not stop your service.

    If your business can support the extra cost for the SSDs, a failing drive will not stop you and the performance of the service will see great improvements anyway. The power savings may even make the SSD not so costly after all.

  12. And beyond SSD, the future is PCIe Flash by snowtigger · · Score: 1

    SSDs are slow in that they rely on old school disk protocols like sata. Sure, you'll get better performance than spinning disk. But if you want screaming fast performance, you should look at flash devices connected through the PCIe bus.

    Products from Fusion IO would be an example of this. Apple Mac Pro would be another: "Up to 2.5 times faster than the fastest SATA-based solid-state drive".

    1. Re:And beyond SSD, the future is PCIe Flash by Twinbee · · Score: 1

      How about SATA 3? Is nearly a GB per second not good enough? Unless you're talking about latency....

      --
      Why OpalCalc is the best Windows calc
    2. Re:And beyond SSD, the future is PCIe Flash by wonkey_monkey · · Score: 4, Insightful

      Up to 2.5 times faster

      Ah, "up to." Marketing's best friend.

      --
      systemd is Roko's Basilisk.
    3. Re:And beyond SSD, the future is PCIe Flash by Anonymous Coward · · Score: 0

      We saved our company quite literally millions of pounds (GB) using just a pair of off-the-shelf servers with FusionIO cards and DRBD. They took over from an existing NetApp which was completely saturated by the workload (which laughed in the face of flash caching).

    4. Re:And beyond SSD, the future is PCIe Flash by jones_supa · · Score: 1

      SATA 3.0 is only 600 MB/s.

    5. Re:And beyond SSD, the future is PCIe Flash by cdrudge · · Score: 4, Funny

      They have to say up to. Reads and writes towards the inside of the chip are slower then they are towards the outside of the chip. I don't think anyone makes a constant linear velocity SSD.

    6. Re:And beyond SSD, the future is PCIe Flash by rjstanford · · Score: 1

      Really though its linguistically equivalent to saying, "We promise that it won't be more than 2.5 times faster. Could even be slower - who knows - but it certainly isn't 3 times faster."

      --
      You're special forces then? That's great! I just love your olympics!
    7. Re:And beyond SSD, the future is PCIe Flash by shinzawai · · Score: 0

      What did you use as transport between the two servers for DRBD traffic? 1GB? 10 GB? Infiniband?

    8. Re:And beyond SSD, the future is PCIe Flash by silas_moeckel · · Score: 1

      PCIe based flash is nice have more than a few in production. The downside is hot swap pcie MB's are extremely expensive and getting more than 7 pcie slots is also nearly imposible. I can get 10 or more 2.5 hot swaps on a 1ru server. I can get hardware raid even redundancy with the right back planes. I can connect up external chassis via sas if I need more room (yea pcie expansion chassis exist as well they are funky to deal with at times). The use cases for needing extremely fast IO without redundancy exist but are s small subset.

      --
      No sir I dont like it.
    9. Re:And beyond SSD, the future is PCIe Flash by Rockoon · · Score: 1

      Yes, and thats peak.

      The year SATA 3 was put into production, SSD's designs were reconfigured to saturated it, and those fusion I/O drives saturate their PCI lane bandwidth....

      SATA 3 was and always will be shortsighted bullshit brought to you by a consortium of asshats intentionally trying to undercut feature demand in their desperate attempt to preserve the old guard.

      --
      "His name was James Damore."
    10. Re:And beyond SSD, the future is PCIe Flash by cdrudge · · Score: 1

      My favorite are ads that say "Save up to $X (or X%) and more!" So in other words the savings can be any amount or none at all. Whatadeal.

    11. Re:And beyond SSD, the future is PCIe Flash by jones_supa · · Score: 1

      I don't know really. The SATA 3 spec was released in July 2008, which was about the year when only the very first consumer SSDs started to appear. Maybe the spec was mostly designed for fast HDDs and they couldn't fully predict the need for the speed. And it was a natural thing to just double the data rate.

    12. Re:And beyond SSD, the future is PCIe Flash by Anonymous Coward · · Score: 0

      In a similar vein, people are more willing to buy something marked at $9.99 than something marked at $10.00 by an extremely large margin, even before taxes or other mark-up that they know about. Marketing, and its tricks, are more psychology than logic.

    13. Re:And beyond SSD, the future is PCIe Flash by jon3k · · Score: 1

      Throughput isn't necessarily the most important factor. In fact, with SATA 6 SSD, it hardly ever is. Most people are concerned with latency and IOPS. Even a single SATA SSD can easily do 500MB/s. Throw a few of them onto a PCIe SATA controller and you can easily get several GB/s of throughput, it's been proven time and time again. And usually at a fraction of the cost of the PCIe based SSDs (mostly because the controller tech is expensive, but MORE so that they typically use enterprise level SLC NAND which is far more expensive).

    14. Re:And beyond SSD, the future is PCIe Flash by Rockoon · · Score: 1

      The SATA 3 spec was released in July 2008, which was about the year when only the very first consumer SSDs started to appear. Maybe the spec was mostly designed for fast HDDs

      You mean the HDD's that still to this day struggle at 100MB/sec across the platter? Just like they did then?

      Sure, maybe they designed their 600MB/sec interface based on 100MB/sec technology that was running on a 300MB/sec interface ..... *rolls eyes*

      As far as July 2008, that was the DRAFT specification of SATA 3 that they released on that date, but a year prior to that companies like Fusion-IO were making PCIe SSD's that were already faster than SATA 3 could handle...

      When thinking back to the early days of performance SSD's, I am always reminded of The Battleship Mtron project, which was in 2007. Note how they, too, struggled with interface.

      --
      "His name was James Damore."
  13. Virtualisation by drsmithy · · Score: 5, Interesting

    This is being driven primarily by increasing levels of virtualisation, which turns everything into a largely random-write disk load, pretty much the worst case scenario for regular old hard disks.

    1. Re:Virtualisation by jones_supa · · Score: 1

      Good point.

    2. Re:Virtualisation by jon3k · · Score: 1

      Especially for very large virtual workstation installations. Those workloads just absolutely wreak havoc on disks.

    3. Re:Virtualisation by Anonymous Coward · · Score: 0

      What does virtualization have to do with how your data is stored? I hope people weren't storing databases on their servers local drives.

  14. Prices? by Savage-Rabbit · · Score: 1

    are SSDs mature enough (and cheap enough) to support business-sized workloads? Or are they still best suited for laptops and mobile devices?

    I don't see maturity as a problem. If there is money to be made drive manufacturers will throw enough engineering and computer science talent at the task of solving the teething troubles. What interests me is that if SSDs mount a major invasion of server-rooms and data-centers worldwide it also means that we will now finally start to see SSD pricing drop like rock. Cheap high capacity external SSD drives, I can't wait. If we are lucky this will also popularize Thunderbolt with PC motherboard makers since that's where you start seeing some real performance advantages, i.e. when the time it takes to make a backup of your laptop/desktop system to an external drive drops by half or more compared to USB 3.0.

    --
    Only to idiots, are orders laws.
    -- Henning von Tresckow
    1. Re:Prices? by PlusFiveTroll · · Score: 1

      > What interests me is that if SSDs mount a major invasion of server-rooms and data-centers worldwide it also means that we will now finally start to see SSD pricing drop like rock.

      I'd think the opposite may occur. SSD flash is currently limited by the amount we can produce at a reasonable price.

      http://searchstorage.techtarget.com/news/2240181971/NAND-shortage-could-slow-pace-of-flash-price-drops-squeeze-SSD-makers
      http://www.seagate.com/point-of-view/nand-flash-supply-market-master-pov/

  15. Any experiences on Hybrid RAID-1? by schweini · · Score: 1

    What a coincidence! I am getting ready to transition our main DB servers (couple of GB mysql data) to SSD, but I simply dont want to trust it that much yet. So my plan is to set up RAID-1, with an SSD and another conventional drive. There seems to be this "--write-mostly" option that tells linux to preferably read from the SSD. Anybody know if this is worth it? If it works? What kind of random access performance gains can i look forward to, running mysql on SSD? I found it surprisingly hard to find any good data on these subjects.

    1. Re:Any experiences on Hybrid RAID-1? by jaseuk · · Score: 1

      I'm using that setup. I'm using a cheap, but high Capacity OCZ drive (960GB), with a software raid 1 mirror to a SATA replacement. I'm running this on Windows, which crucially always uses the FIRST drive for reads. So reads are at SSD speeds, writes are at SAS speeds.

      It's working well enough. I've not benchmarked this. We have had 1 drive failure, I suggest keeping 1 cold-spare to hand. Delivery times on SSDs are pretty variable, you won't want your entire DB running on a SAS drive for too long.

      Jason.

    2. Re:Any experiences on Hybrid RAID-1? by drsmithy · · Score: 1

      Your writes will be limited to the speed of the conventional drive, so if your workload is mostly reads, then you will see a significant benefit.
      Though, if your workload is mostly reads, you'd probably see the same benefit for a lot less $$$ by putting more RAM in your server...

    3. Re:Any experiences on Hybrid RAID-1? by rjstanford · · Score: 1

      That's what we ended up doing with our databases - did a bunch of comparisons and ended up sticking to 15K disks and maxing out RAM instead. Even at Rackspace prices we came out ahead on price/performance.

      --
      You're special forces then? That's great! I just love your olympics!
    4. Re:Any experiences on Hybrid RAID-1? by silas_moeckel · · Score: 1

      With a couple GB of data just put in ram you can get to 128GB cost effectively and if your read heavy you will end up with everything cached. If your writing just go all SSD it's night and day a single SSD pair easily outperforms a whole shelf of 15k drives.

      --
      No sir I dont like it.
    5. Re:Any experiences on Hybrid RAID-1? by jon3k · · Score: 1

      Bingo. Reg/ECC RAM is about $10/GB retail. If we're talking about a few GB, just buy a bunch of RAM. Unless you plan on the dataset outgrowing it relatively quickly, then you can either shard/partition (depending on your platform) the DB or move to some type of disk based solution.

  16. As early adopters... by Anonymous Coward · · Score: 0

    We've been using SSDs in our data centers for quite a while now, specifically to store our high volume i/o databases. Its pretty much an indispensable technology when your system depends on speed. Think communications & financial transactions. Delays can result in decreased user experience in the case of communications, or arbitrage opportunities for those who do it faster in the case of financial transactions.

  17. Specific applications now, everything later by crucial_hendo · · Score: 2

    I work for an Australian hosting company and we have deployed the SolidFire all-SSD SAN for our cloud-based hosting (shared, reseller, cloud/virtual server), the major benefits of an all-SSD storage solution speak for themselves: far lower I/O wait time, huge IOPS numbers - in SolidFire's case 250,000+ distributed IOPS in our current configuration. We've recently shifted from the HP SAS-based Lefthand SAN offering up to 15,000 IOPS to the new SolidFire all-SSD SAN and the team behind SolidFire are partly from the Lefthand operation from HP, so there's some good know-how there. The article is quite broad in its content, for big data applications SSD SAN storage is still incredibly more expensive ($/GB) than SATA or SAS based SANs - our SolidFire was a huge investment. Many hosting providers are now switching to all-SSD based servers for the performance benefits, however the drawback is primarily total storage capacity of course. For example a typical VPS node using local storage with 10 x SATA drives can get up to 4TB of usable RAID-protected storage. The numbers for an all-SSD node in RAID configuration would be much lower in capacity and suitable higher in cost. Its important to note that many people view SSDs as desktop only hardware, which is fundamentally incorrect, as there are many units out there that offer write longevity much longer than needed (5-10+ years). For many server based applications (not big-data purposes), SSDs are, and will become the predominant choice among many hosting companies. Not every provider can afford the investment of an enterprise grade SAN, however the speed of development from Intel and Samsung will mean the $/GB will drop steeply and disk sizes will increase exponentially (like what we've with SATA in the past 5 years).

  18. Re:Single component failure not a big deal any mor by necro81 · · Score: 1

    A single component failing will not stop your service

    Correction: a single component failing should not stop your service, if you have done your job right (either in designing and building, or in finding a vendor to provide the service). But having a single component failing can and still does ruin somebody's day on a regular basis.

  19. My company's experience by necro81 · · Score: 1

    At my company, we have gradually been moving away from spinning disks in favor of SSDs. My company does a lot of R&D work, so we have a lot of people doing CAD, simulation, number crunching, etc. For those users, our IT department hasn't built a machine with spinning media in over two years: the performance boost from SSD is outstanding, and the local storage needs are pretty modest. On the back end, our backup solution (daily incremental backups of everyone's machine, hourly for the network storage) uses a cabinet of HDDs, with a RAID of SSDs that contain the backup database / index. (there are tape backups in the mix, too, with offsite storage, but I forget the details). I'm sure if they could afford to create a 100 TB array of SSDs and do away with the spinning discs entirely, they would.

    1. Re: My company's experience by Anonymous Coward · · Score: 0

      The entire article is about the cost savings of SSD's. How could you afford not to switch? I wonder if bcache is an efficient lowend solution. Of course, when absolute performance is required, customers pay, which means ssd will only be around long enough to transition from disk based infrastructure to everything in memory. - flash.
      what we really need is bigger batteries and more generators.

    2. Re: My company's experience by Anonymous Coward · · Score: 0

      If you want just raw storage with 'low' io then a normal HD is the way to go. If you need good io SSD is the way to go.

      At this point for raw storage (think 20-40TB+) with modest retrieval requirements normal HD's still curb stomps the cost of SSD.

      You will see an inflection point when SSD's of 1-1.5TB become common at a reasonable price. You will see huge swaths of HD's retired and replaced with SSD.

      I use a similar bcache sort of thing on my own laptop (the intel windows flavor, I have 1TB worth of software I want at hand quickly). It works 'ok'. But not like a real SSD.

      If I could get 1TB for under 250 bucks I would buy them in a heartbeat. Right now you are looking at the 1-4k range for 1TB. If you need 100TB of HD that is nearly 100k-500k depending on what you buy.

      I figure about 3 years from now you will be hard pressed to buy a 1TB normal HD.

      Also normal HD capacity is not slowing much. They are talking 10TB in one drive within 5 years. SSD will be hard pressed to keep up with that sort of density. The per cell write limit has consistently gone down every generation. It is not uncommon now to see cells with 3k re-writes when 10 years ago it was 100k.

      "sd will only be around long enough to transition from disk based infrastructure to everything in memory" that works for transactional data that is short lived. But if you want to keep it long term you must write it out somewhere. There will be a power failure. Plan on it.

  20. Does the NSA have mod points? by Anonymous Coward · · Score: 0

    NSA uses hard disk, bigger capacity, you don't store 2GB/yr of data per person in SSD's. They might have some SSD cache, but more likely it's RAM cache, since they have they're own 150 Megawatt power station, its easier to hold things in RAM during data mining and ensure the power will stay on.

    Disks for the bulk of the people, RAM cache for the influencers in the graph (people who originate ideas, are tagged by the 15000 cyber staff as potential targets).

    If General Keith Alexander took off the limits on surveillance so it could be applied to the USA, I have no doubt he also took off the limits on propaganda too so we could get our share of NSA propaganda on Slashdot.

    14000 Cyber soldiers, mean we have plenty here on Slashdot, a lot with mod points.
    http://www.wired.com/threatlevel/2013/06/general-keith-alexander-cyberwar/all/

    "Alexander’s agency has recruited thousands of computer experts, hackers, and engineering PhDs to expand US offensive capabilities in the digital realm. The Pentagon has requested $4.7 billion for “cyberspace operations,” even as the budget of the CIA and other intelligence agencies could fall by $4.4 billion."

    "The forces under his command were now truly formidable—his untold thousands of NSA spies, as well as 14,000 incoming Cyber Command personnel, including Navy, Army, and Air Force troops. Helping Alexander organize and dominate this new arena would be his fellow plebes from West Point’s class of 1974: David Petraeus, the CIA director; and Martin Dempsey, chair of the Joint Chiefs of Staff."

    "In May, work began on a $3.2 billion facility housed at Fort Meade in Maryland. Known as Site M, the 227-acre complex includes its own 150-megawatt power substation, 14 administrative buildings, 10 parking garages, and chiller and boiler plants. The server building will have 90,000 square feet of raised floor—handy for supercomputers—yet hold only 50 people. Meanwhile, the 531,000-square-foot operations center will house more than 1,300 people. In all, the buildings will have a footprint of 1.8 million square feet. Even more ambitious plans, known as Phase II and III, are on the drawing board. Stretching over the next 16 years, they would quadruple the footprint to 5.8 million square feet, enough for nearly 60 buildings and 40 parking garages, costing $5.2 billion and accommodating 11,000 more cyberwarriors."

  21. Of course business adoption is small by neokushan · · Score: 1

    I recently was given the task of upgrading my development machine. We're a small company but management is happy to spend money on hardware if we need it.

    I decided I'd prefer an SSD and yet when I looked at the big suppliers of office machines - Dell, HP, etc. none of them even offered SSD's as an option. SSD's only came into it when you started looking at the really high-end, £2,000+ workstations but there's no reason why this should be the case.

    In the end, I just custom built the machine as it was the only way to get the hardware I needed without having to fork out for workstation graphics (which I didn't need).

    --
    +1 IDisagreeSoHeMustBeATrollOrAnAstroturferOrAShill
    1. Re:Of course business adoption is small by h4rr4r · · Score: 1

      We just buy a normal dell and toss the drive out when it arrives. Installing a hard drive is not difficult and you get to keep the NBD warranty on the rest of the machine.

    2. Re:Of course business adoption is small by neokushan · · Score: 1

      I would agree with that, but the cost Dell was charging was higher than what I could pay for a custom built option with the same (or in fact, better) specs.

      --
      +1 IDisagreeSoHeMustBeATrollOrAnAstroturferOrAShill
    3. Re:Of course business adoption is small by h4rr4r · · Score: 1

      That makes sense.
      We also do not buy one off machines for devs or really anyone. We just upgrade one of the hundreds of desktops we buy at a time.

    4. Re:Of course business adoption is small by styrotech · · Score: 1

      I decided I'd prefer an SSD and yet when I looked at the big suppliers of office machines - Dell, HP, etc. none of them even offered SSD's as an option.

      I was looking at small business desktops too recently. Lenovo (in Aus/NZ at least) had SSD options for all their desktops. Even on the cheapest models - which were also a lot cheaper than anything Dell offered. All the base warranties were 3yrs on site too.

      I'd have to be extremely hard pressed to go back to Dell (or HP) now.

    5. Re:Of course business adoption is small by neokushan · · Score: 1

      Unfortunately, Lenovo don't operate like that in the UK. Their site is just a showroom and you have to go to a reseller to get Lenovo machines.

      --
      +1 IDisagreeSoHeMustBeATrollOrAnAstroturferOrAShill
  22. Million Dollar SSD's by Anonymous Coward · · Score: 0

    We have spent over a million on a SSD setup for a single host. I'm talking about IBM's V7000's behind IBM SVC. And this is considered Mid-Range, cheaper stuff.

    Thing is, when you move from several hundred spinning disks to several dozen SSD's, we it's just not that impressive.

    It does the job sure, but it's not Eureka!

  23. It depends... by Anonymous Coward · · Score: 0

    The main concern I see (from a storage *network* perspective) is the cost of flash along with write issues (limited write cycles and speed deteriorating on an erase/write), but there are a few companies attempting to lessen the impact of those problems. PureStorage and EMC's XtremeIO attempt to increase efficiency and minimize write-wearing by using inline deduplication with a layer of cache at the controller level. One or both of them also write the data down in a RAID-3 type fashion. Save an entire stripe in cache, then lay it down on the flash to ensure you're not having to go back and fill in spaces, possibly doing erase/writes. Now...I haven't gotten to fully test one, but I'd like to see what happens when you start deleting and fragmenting data. That one should be interesting.

    From an internal storage perspective, FusionIO is magnificent (there are others, but I'm not too concerned), though there are a few sticking points that mess with me. Transient data database I've seen thrown on it blaze through like nothing I've ever seen, but if you want any data replicated (to my knowledge, correct me if I'm wrong) you have to do operating system or database level replication. You have a blisteringly fast storage device that finally brings the storage up to speed with the new CPU, and, if you replicate, you start eating into the CPU cycles you'll need to push that storage to the limit.

  24. Re:Single component failure not a big deal any mor by antifoidulus · · Score: 1

    I was actually curious about the power consumption so I went poking around and found this(Sorry I couldn't find the original article. The power consumption is markedly different....not sure it's enough to COMPLETELY offset the cost, but certainly makes it easier to swallow.

  25. Re: What does the NSA use? by Anonymous Coward · · Score: 0

    i see you haven't met my teenage daughter.

  26. More Common Than You Think... by Whatchamacallit · · Score: 4, Interesting

    SSD's might not be used as primary storage, yet. The cost of using a lot of SSD's in a SAN is still too high. However, that doesn't mean that SSD technology is not being used. Many systems started using SSD's as Read/Write caches or highspeed buffers, etc. The PCIe SSD cards are popular in highend servers. This is one way that Oracle manages to blow away the competition when benchmarks are compared. They put a PCIe SSD cards into their servers and use them to run their enterprise database at lightning speeds! ZFS can use SSD's as Read/Write caches although you had better battery backup the Write cache!.

    Depending on a particular solution, a limited number of SSD's in a smaller NAS/iSCSI RAID setup can make sense for something that needs some extra OOMF! But I don't yet see large scale replacement of traditional spinning rust drives with SSD's yet. In many cases, SSD's only make sense for highly active arrays where reads and writes are very heavy. Lots of storage sits idle and isn't being pounded that hard.

    1. Re:More Common Than You Think... by jon3k · · Score: 1

      Most med-large SAN use tiered storage. We're seeing SSDs in Tier0/Tier1. We will continue to see mechanical HDD in Tier 2 and near-line for a while, I believe.

  27. "business-sized" by Anonymous Coward · · Score: 0

    it does raise a question: are SSDs mature enough (and cheap enough) to support business-sized workloads?

    Yes and no. It's kind of a stupid question, because "business-sized" is undefined. Different people are doing different things. Some of them need big storage, some need less.

  28. Reliability data? by sjbe · · Score: 1

    I trust SSDs more than physical spindles any day.

    Based on what evidence? Where is your data? Faster != More reliable. Spindle based hard drives are (usually) quite reliable and there is plenty of real world usage data documenting exactly how reliable they are. Companies with big data centers like Google have extremely detailed reliability performance figures. SSDs have a lot of advantages but they only recently have started receiving wide distribution and to date they have poor market penetration in data centers where it is easiest to measure their reliability in the real world. Manufacturers estimates of reliability don't mean much in the real world since they have an obvious conflict of interest.

    I have little doubt that SSDs will over time replace spinning platters in most places but claims regarding their reliability in relation to spinning platters is somewhat premature, especially in a data center environment. I wouldn't be the least bit shocked to find out they were more reliable (having no moving parts helps a lot) but just because they should be doesn't mean they will be.

    1. Re:Reliability data? by jedidiah · · Score: 1

      I have had hard disks last for 7 years. I have some now that are about 5 years old. When I can say that about an SSD, I will have more trust in them. Until then, trust is really unwarranted. Without some actual experiences (yours or something else), you are really just engaging in a leap of "faith".

      --
      A Pirate and a Puritan look the same on a balance sheet.
    2. Re:Reliability data? by jon3k · · Score: 1

      There are lots of very large installations using pure SSD (MySpace went all SSD in 2009 for example). However, no one seems to be making the data available. And one reason it wouldn't help is that the lifespan of an SSD is incredibly dependent on the work load, unlike traditional disks. If you're workload is 99% reads and 1% writes, your failure rate would be exceptionally low. But if my workload was 50/50 reads/writes my failure rate COULD BE substantially higher than yours.

    3. Re:Reliability data? by jon3k · · Score: 1

      That's probably too small of a sample to draw any reliable conclusion don't you think? Even if you had 1 SSD that lasted 20 years, does that really tell us anything, statistically?

      For what it's worth, I bought my first SSD, a 30GB OCZ Vertex SSD (original version) on 6/21/2009 (i just logged into newegg and checked) and it's still going strong without a single problem. It's since been "demoted" to my HTPC in the living room, which has been great because the bootup is very "appliance-like" and it's completely silent.

      Oh and if you were curious - I paid $139.00 for it in June 2009. For comparison, today you can get a 128GB OCZ Vertex 4 for $119. So 15% cheaper for 2.5x the performance and 4x the capacity in 4 years. That's what I call progress!

  29. Hot/Crazy by bill_mcgonigle · · Score: 1

    Two years on and this is still relevant: The Hot/Crazy Solid State Drive Scale.

    I love SSD's in servers and they don't burn me because I always expect them to fail. Sure, one MLC SSD is fine for a ZFS L2ARC, because if it fails reads just slow down, but for a ZFS ZIL, that gets a mirror of SLC drives, because a failure is going to be catastrophic.

    If I'm using Facebook's FlashCache, two drives get mirrored by linux md and treated as a cache device and smartd lets me know when one of them goes TU. Another advantage here is linux md is hot-replaceable while pure FlashCache isn't. I just got linux 3.9 on my first server this weekend (thanks, ElRepo) and haven't yet tried Redhat's dm-cache, but the same logic ought to apply; it's only the abstraction and syntax that differs.

    Yeah, there's a write penalty with mirrors, but SSD mirror writes are way faster than the best pure spinning-rust RAID (no rotational latency), so it's way better than other options.

    Clustering can help too. The other strategy is to make nothing redundant except for a massive cluster of servers. I haven't benchmarked the two strategies (I tend to work with smaller clusters in small businesses) but tech time isn't free either. I suspect at Megalocorp scales where there are several people whose job it is to replace failing disks all day (this is a real thing), going redundant on the compute node scale is a better option. My systems tend to be remote in far-away data centers and nobody wants to have to touch them more than every few months.

    --
    My God, it's Full of Source!
    OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
  30. Re:What does the NSA use? by bobbied · · Score: 1

    But the real *issue* here is being able to actually go though the data looking for information. Storage of this much data has been a fairly easy problem to solve if you have money, finding a way to organize and search though huge data sets to give timely results is not so easy even if you have money.

    Buying spindles and connecting them in huge RAID arrays is well understood. You just build what size you need and dump your data onto it. Yea, you will have to battle OS size limits on partitions and files, but that's not too bad or very expensive. As you point out, getting the hardware isn't that expensive, even at the apparent scale involved here. Buying enough power to turn it all on and keep it cool shouldn't be an issue either, but you need to include that in the $4 Billion budget. In short, if you have money, getting the hardware off the shelf is easy. Software for this is NOT off the shelf.

    The REAL money is going towards the software systems that mine the information being collected. There is no system configuration running MySQL that's going to be able to support ongoing data collection (inserts) and any kind of meaningful query results on a petabyte sized data base. I'm guessing that half their budget goes to research and development of software and systems used to collect, store and mine the data. I'm also guessing that they spend roughly 40% of their hardware budget on processing, 40% on storage and 20% on maintenance and operating costs. This puts their hardware budget ($4 Billion * 50%) * 40% or about $1 Billion, give or take.

    This means that your 2 Gig turns into about 1/4th that, not accounting for the space being thrown away because it is obsolete. I'm guessing there really isn't that much being kept around on folks who are not interesting, however that is defined.

    --
    "File to fit, pound to insert, paint to match" - Aircraft Maintenance 101
  31. Re:Single component failure not a big deal any mor by Anonymous Coward · · Score: 0

    Well, if that single component is the main circuit breaker to your server building, I think it will still take it down.

  32. SSD & RAM by unixisc · · Score: 1

    Yeah. When we're talking RAM, we are talking modern interfaces, such as DDR (now DDR2 or DDR3), whereas NAND flash, which is what is used here, uses page mode read & writes. Not to mention that internal writes, which are there on flash but not on RAM, would automatically slow down the process, even if the same interface were used (compare SRAM and NOR flash, as a comparison point).

    I think what's contributing to the confusion is SSDs being available not just in SATA interfaces, but now, in PCI-X interfaces as well. SATA doesn't enable SSDs to be much faster than HDDs, but PCI-X does. So having SSDs on multiple PCI-X slots makes faster storage available, and reduces the need for more DRAM, since the latencies in cases that data is not in RAM, would be much lower.

    1. Re:SSD & RAM by jon3k · · Score: 1

      I think you mean PCIe interface?

    2. Re:SSD & RAM by unixisc · · Score: 1

      Yeah!

  33. Re:Near-line storage only: Has been for some years by jon3k · · Score: 1

    This guy basically nails it. SSD will slowly move it's way from "Tier 0" out, and eventually in 20 or 30 years even our near-line will be SSD, quite possibly.

  34. How old is this post? by Anonymous Coward · · Score: 0

    As I read through this post I kept asking myself "How old is this? Am I in the archives?"

    We've been running SSD drives for years. There is no way our VDI infrastructure could scale as it does using old fashioned platter based disks. Our VDI SAN has a shelf of enterprise SSD, and our server SAN has automated shifting of blocks of data from platter based disks to SSD to the enormous amount of RAM located in the SAN.

    The answer is "Yes, SSD is ready for business-sized workloads. It has been for years."

  35. Slow writes, fast reads, low $$, application needs by billstewart · · Score: 1

    Flash writes are a lot slower than SRAM or DRAM writes, but the reads are still very fast, and both of them are a lot faster than rotating mechanical disk drives. Also, the price/performance means that you may be able to afford putting a lot more of it into your machine.

    Depending on your application, you may be getting a big performance win by moving from disk drive to some kind of RAM, and it doesn't matter if it's flash or DRAM because the important thing was eliminating rotation and seek latency. Or you might get better performance because you can put a terabyte of SSD onto your machine, while you don't have room (or budget) for that much flash.

    Back in the 80s, the price and capacity of RAM for Vaxen improved to the point that we could finally upgrade our machine to 16 MB and still fit in two cabinets. The run-time for our 12-MB simulation application suddenly went from a week down to an hour per run, and I was able to stop chasing mysterious virtual memory flakiness. The upgrade probably cost us about half a person-year's salary, and it would have probably made sense to get it a year earlier, but capital expenses and salaries came from different budgets, and big bureaucratic companies are that good at cost accounting.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  36. Sometimes you need a truck by billstewart · · Score: 1

    (insert "series of tubes" joke here.)

    You don't just care about the acceleration speed; that 1995 Civic had pretty good handling, and might be more use on the freeway than a muscle car that's made to go really fast in a straight line. Or maybe what you need is a truck, because carrying everything in one trip is a lot faster than 50 slightly faster trips.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  37. OS support for multi-tier caching? by billstewart · · Score: 1

    Operating systems have had good support for 2-tier storage for a while - disk drives cached in RAM. But Flash/SSD really offers an intermediate performance level, and I haven't seen much from either Linux or Windows to take advantage of it without lots of customization or niche applications (such as Readyboost, or mounting /usr/share on a flash stick or whatever.) Has anybody seen anything interesting happening to take advantage of flash?

    Enterprise storage solutions do a bit better job in hiding SSD or RAM caching into big expensive SANs, and for a long time they've had solutions for tape backup, etc. But what I've really been looking for is support for a moderate amount of fast disk drive and a large amount of slow bulk storage (e.g. use the expensive SAS drives for the database but archive the logfiles onto consumer-priced slow storage.)

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
    1. Re:OS support for multi-tier caching? by ron_ivi · · Score: 1

      But Flash/SSD really offers an intermediate performance level, and I haven't seen much from either Linux or Windows to take advantage of it without lots of customization or niche applications (such as Readyboost, or mounting /usr/share on a flash stick or whatever.) Has anybody seen anything interesting happening to take advantage of flash?

      Did you look hard?

      Try this: http://www.phoronix.com/scan.php?page=news_item&px=MTM2ODM

      Linux 3.10 Kernel Integrates BCache HDD/SSD Caching

      ...

      BCache comes down to being a Linux kernel block layer cache where one or more SSDs (or other fast storage devices) can act as a cache for slower rotating disk drives, in somewhat a similar manner to some of the "SSHD" hybrid drives now on the market. BCache is similar to the L2Arc feature exposed on Oracle's ZFS file-system, but with being at the block device level, it's file-system agnostic.

  38. Re:Single component failure not a big deal any mor by PlusFiveTroll · · Score: 1

    It depends how many disks it replaces to get the same throughput, and at other times an individual machine will use more power with an SSD because it will spend less time in an IO-WAIT state. In general SSDs reduce the amount of equipment in total needed on most loads because of higher potential processor usage.

  39. SAN Terminology by Rob_Bryerton · · Score: 1

    I see a lot of people referring to disk arrays as "SANs", which is incorrect. A SAN (storage area network) refers to the network itself, not the array(s). Calling an array a SAN is like calling a networked PC a LAN!

    Sorry, just a pet peeve of mine...

  40. That wasn't it... by Anonymous Coward · · Score: 0

    A string walks in to a bar. Bartender says 'No Strings!' ... goes out and gets all twisty in the lane, goes back in. Bartender says 'I said no strings!'. He says 'I'm a freyed knot'!