Slashdot Mirror


Enterprise Datacenter Hardware Assumptions May Be In For a Shakeup (acm.org)

For the entire careers of most practicing computer scientists, a fundamental observation has consistently held true: CPUs are significantly more performant and more expensive than I/O devices. The fact that CPUs can process data at extremely high rates, while simultaneously servicing multiple I/O devices, has had a sweeping impact on the design of both hardware and software for systems of all sizes, for pretty much as long as we've been building them. This assumption, however, is in the process of being completely invalidated.

100 comments

  1. Video warriors! by Anonymous Coward · · Score: 0, Offtopic

    Look at the I/O towers.
    Every tower is lighting up.

  2. Can this entry be any more click bait? by turbidostato · · Score: 4, Funny

    Yes!

    For the entire careers of most practicing computer scientists, a fundamental observation has consistently held true... and you won't believe what happens next!!!

    1. Re:Can this entry be any more click bait? by Anonymous Coward · · Score: 0

      Yes!

      For the entire careers of most practicing computer scientists, a fundamental observation has consistently held true... and you won't believe what happens next!!!

      You have to use "shocking" in there somewhere....

    2. Re:Can this entry be any more click bait? by gstoddart · · Score: 1

      LOL, yeah ... my thinking was "it was a true fact, not an assumption".

      Throw in some fundamentally new pieces to this (which as I gather is suddenly everything has it own damned CPU) ... and, yes, the rules will change.

      Hundreds of CPUs spread across devices will cumulatively have more CPU power than the single CPU which has always been at the top of the food chain. All that really means is everything now has a ton of embedded compute power which previously wasn't there.

      Things which used to be classed as supercomputers can now practically be had in a Cracker Jack box. This will change things.

      --
      Lost at C:>. Found at C.
    3. Re:Can this entry be any more click bait? by Penguinisto · · Score: 1, Informative

      Same here.

      Now if they found a way to practically combine RAM and disk into one unified whole, and made the two faster than frig (and able to reallocate on-the-fly w/ minimal disruption as the workload changed, maybe on a curve or as load >= n )? That would be news.

      TFA... TFA has a lot of stuff to sift through to get anything of note out of it at all, and it wasn't much.

      --
      Quo usque tandem abutere, Nimbus, patientia nostra?
    4. Re:Can this entry be any more click bait? by jellomizer · · Score: 2, Interesting

      Technology changes, so does how to better use them.

      Old technology long term storage was very slow. So we used to CPU calculate a lot of data. Think Mario and Luigi in the original NES They were 1 bitmap and you just swapped the pallets, as well many of the creator's same expensive bitmap image, and use the CPU to cheaply give them different color. As times goes on Storage is cheaper and faster. So we have independent bitmaps for Mario and Luigi so they are different in appearance, luigi being taller and thinner. Technology advanced to a point where it was good enough to have different images, and it was worth it vs trying to spend all your time with silly hacks.

      But lets get away from games and onto serious computing.

      1980's Mainframes: Computing was expensive, so you were better off having a centralized computer that would be accessed with dumb terminals. The speed of communicating data to the terminals was fast enough to balance any speed in processing the data, allowing many users on one system. A lot of the data were stored in active RAM. And were very small, being that most data retrieval would be needed via tape. These programs were very small and concise. And they wouldn't be used today because they were very buggy, and could be hacked into. People who used the computers often had the title Computer Operator, as they would know what to do and what not to that would cause a problem.

      1990's Desktops: The PC has reached a price point where except of having one mainframe people would have their own PC. This means faster displaying of data, and also allow people to do big calculations without worrying about slowing down everyone else. This allows applications to query the disk for the programs much more readily and page data. This allowed for more advanced UI in programs that would prevent people from doing stupid things as often. However program sizes got larger.

      2000's Networked Desktops: With servers now being build on Desktop Technology and 100mbs - 1000mbs networks being common in an office. It allowed PC's to talk to the server easily and quick enough for most data retrieval. Data files began to be stored on the network, freeing the desktop to have smaller drives, enough to run the application, and RAM requirements peaked at the 4gigs. While the improvements went toward the servers. Also with cheap Servers, we were allowed to install effective Databases on them. MySQL, PostGreSQL, MsSQL Server. All very affordable DB that were able to deal with most businesses level of work. (Causing the decline of systems such a FoxPro, Dbase and MS Access) which were meant for PC usages.

      2010's The "Cloud": Well web based applications, they may not be setup on a cloud platform. But with higher speed internet access available to most people and business, the web standards have matured to a point where most UI functionality is available. We were able to run our apps in a browser or thin client, so the performance of our personal device matters a lot less. Most things we do we can still do on a PC that is 10 years old. Where back in the 90's if your PC was 4 years old it was too out of date to do anything new. Servers have been designed to be more distributed and faster storage means we have more than enough processing power. The limitations are usually more limited to total bandwidth.

      As time changes how we use software changes, now these are trends not hard and fast rules. I still work on a mainframe daily for work, and it is handing a lot of work rather well. I still run across FoxPro and Access Applications that are in production use that needs to be maintained. There are still file share drives. And now there is a set of hosted apps to use. They all come with tradeoffs. But changing technology gives us more options.

      --
      If something is so important that you feel the need to post it on the internet... It probably isn't that important.
    5. Re:Can this entry be any more click bait? by jedidiah · · Score: 1

      Meh. This assumption wasn't even true before. Those of us that are actually in the trenches already know this. Some magical new technology change doesn't really alter things.

      You actually have to pay attention to your workload and how your application is handling it.

      It's nice that some academic or journalist has finally caught up.

      --
      A Pirate and a Puritan look the same on a balance sheet.
    6. Re:Can this entry be any more click bait? by i_ate_god · · Score: 1

      > But lets get away from games and onto serious computing.

      And games are not serious computing how? There are grand challenges that have been overcome in the technologies that power video games. Path finding, latency reduction, 3D computation, and so on.

      Not to mention budgets that exceed that of some "enterprise" firms.

      --
      I'm god, but it's a bit of a drag really...
    7. Re:Can this entry be any more click bait? by Anonymous Coward · · Score: 0

      This is pretty much what the article is about. It's talking about Storage Class Memory which is a fairly new development. See: http://www.computerworld.com/article/2990809/data-storage-solutions/hp-sandisk-partner-to-bring-storage-class-memory-to-market.html

    8. Re:Can this entry be any more click bait? by lactose99 · · Score: 1

      Towards the end of the article they specifically mention this scenario with NVDIMMs.

      --
      Fully licensed blockchain psychiatrist
    9. Re:Can this entry be any more click bait? by Anonymous Coward · · Score: 0

      I agree this is sponsored content. It's an elabourate way of talking about SSDs.

    10. Re:Can this entry be any more click bait? by MachineShedFred · · Score: 1

      or "this one weird trick"

      Don't forget about the one weird trick.

      --
      Slashdot still doesnâ(TM)t support Unicode after it was added to the HTML standard in 1997.
    11. Re:Can this entry be any more click bait? by MachineShedFred · · Score: 1

      You're not kidding about the supercomputer in a cracker jack box. The average iPhone now has as much or more compute power as a Cray Y-MP from the early 90s.

      --
      Slashdot still doesnâ(TM)t support Unicode after it was added to the HTML standard in 1997.
    12. Re:Can this entry be any more click bait? by gstoddart · · Score: 1

      LOL, maybe a little with the Cracker Jack box ... but, no, I really wasn't kidding.

      For those of us old enough to remember when a gigabyte was a theoretical number nobody would ever encounter ... you can buy what used to be astronomical amounts of storage as an afterthought in the express checkout at Wal Mart for a couple of bucks.

      I'm afraid these days to know how cheap, small, and ubiquitous a 1GHz chip is ... because there was a time that was considered munitions grade hardware which was covered under export controls.

      --
      Lost at C:>. Found at C.
    13. Re:Can this entry be any more click bait? by TheCarp · · Score: 1

      I sent this to a co-worker. We came to the conclusion that this "Shakeup" really means "We just noticed whats been happening already for like a decade now".

      Yes, actual storage being centralized and compute resources are consuming it and caching file access.... if you wash away the buzz this sounds a lot like existing enterprise visualization setups.

      --
      "I opened my eyes, and everything went dark again"
    14. Re:Can this entry be any more click bait? by gstoddart · · Score: 2

      I tried it, it put my back out for a week, and I nearly lost an eye.

      Wait, are we talking about the same thing?

      --
      Lost at C:>. Found at C.
    15. Re:Can this entry be any more click bait? by Darinbob · · Score: 1

      Actually that observation wasn't even true in the past. A lot of mainframes had very fast and complex I/O processors with central CPUs that weren't necessarily faster (depending upon what models you bought).

    16. Re:Can this entry be any more click bait? by MattGWU · · Score: 1

      Discovered by an infrastructure engineering mom in ${Geolocate.getCounty()} County!

      Brocade hates her!!

      --
      "These people look deep within my soul and assign me a number based on the order in which I joined" --Homer re:
    17. Re:Can this entry be any more click bait? by mikael · · Score: 1

      A "paradigm shift" from "disruptive technology" that goes beyond "24/7"

      --
      Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
    18. Re:Can this entry be any more click bait? by Tough+Love · · Score: 1

      if you wash away the buzz this sounds a lot like existing enterprise visualization setups.

      I wonder what a dog thinks when it watches television?

      --
      When all you have is a hammer, every problem starts to look like a thumb.
    19. Re:Can this entry be any more click bait? by Anonymous Coward · · Score: 0

      And he doesn't even realize that flash SSD's are dead, lol.

      http://www.nextplatform.com/2015/08/18/intel-reveals-plans-for-optane-3d-xpoint-memory/

    20. Re:Can this entry be any more click bait? by TheCarp · · Score: 1

      He probably doesn't think about washing away the buzz. Next article is going to declare THIS is now going to be the era of thin clients too. Get off my lawn kid.

      --
      "I opened my eyes, and everything went dark again"
    21. Re:Can this entry be any more click bait? by Anonymous Coward · · Score: 0

      Next time you should use a reef knot at the apex and always make sure the frame was constructed using 3 run welding joins, not 1 run.

  3. The slowest thing in the datacentre ... by petes_PoV · · Score: 3, Interesting
    ... would be the pundits.

    This piece is citing articles written in 2005 as "ye olde world" and saying "OMG! something amaaaazing has happened.

    Well, those 10 years represent 2 or 3 generations of datacentre hardware, depending on how you amortise your assets. So if the author has only just woken up to SSDs or SCMs then what have they been doing for the past decade?

    In practice, the biggest bottleneck in the datacentre has been the network for a longish time. And the biggest bottleneck in most systems is the user's think-time. It is that last aspect which lies at the heart of multi-user systems.

    However, the guy does have a point: the need for "olde worlde" performance management - designing the bottlenecks out of a system and diagnosing where the choke-points are (ans. the network) when things slow down has largely disappeared. But as for the rest of his stuff? Yes, we know all that.

    --
    politicians are like babies' nappies: they should both be changed regularly and for the same reasons
    1. Re:The slowest thing in the datacentre ... by luvirini · · Score: 1

      While mostly true that the answer is usually/often network, there are some high data loads where actual database performance might still be a bottleneck and others where the actual calculations or other manipulations(=cpu) are the slow link so there is still a need to look at the values.

    2. Re:The slowest thing in the datacentre ... by Anonymous Coward · · Score: 1

      In some cases the RAM speed may be the bottleneck, and that one sure is a hard one to analyze.

      First you run 2 tasks concurrently on a 4-core (real cores, not hyperthreaded) machine, and get 2 tasks per second done. Then you run 4 of the same tasks concurrently, and get 3 tasks per second done.

      All the CPUs show 100% utilization, yet you see only 50% gains instead of the expected 100%.

    3. Re:The slowest thing in the datacentre ... by Anonymous Coward · · Score: 0

      A lot of things can just be crap middleware as well. If your garbage API doesn't use such modern ideas as database indexing or connection pooling, you've got huge performance issues while everything appears to be idle at a system level.

  4. You say performant, I say performance... by __aaclcg7560 · · Score: 0, Offtopic

    Never seen the word "performant" until today. Must be an obscure five-dollar word that scientists love to toss around. Meanwhile, I'll stick with cheap performance as my word of choice.

    https://en.wiktionary.org/wiki/performant

    1. Re:You say performant, I say performance... by caffeinated_bunsen · · Score: 1, Flamebait

      "Performant" is an invaluable word. It instantly identifies those who use it seriously as people who may be safely ignored.

      --

      Bugrit! Millenium hand and shrimp!
    2. Re:You say performant, I say performance... by jonnythan · · Score: 3, Informative

      Performance is a noun. Performant is an adjective. I guess he could have said "faster"

    3. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      Never seen the word "performant" until today. Must be an obscure five-dollar word that scientists love to toss around. Meanwhile, I'll stick with cheap performance as my word of choice.

      https://en.wiktionary.org/wiki/performant

      Performance is a noun. Performant is an adjective, meaning "having (high) performance" (or performing well). If you stick with performance, be sure to reword your sentence so that it makes sense.

    4. Re:You say performant, I say performance... by Captain+Splendid · · Score: 4, Insightful

      Language is a tool. Just because you're not versed in its intricacies doesn't mean that someone who is is inferior to you.

      --
      Linux, you magnificent bastard, I read the fucking manual!
    5. Re:You say performant, I say performance... by QuietLagoon · · Score: 1
      I have a paper-based unabridged dictionary and performant ain't in it. (ain't is, btw)

      .
      So either the word is relatively new, or in niche use.

    6. Re:You say performant, I say performance... by Anonymous Coward · · Score: 1

      "Performant" is an invaluable word. It instantly identifies those who use it seriously as people who may be safely ignored.

      Speaking of invaluable, I have found that those who spew the most buzzwords in their vernacular also happen to control the budget.

      In other words, tread lightly. The "PHB" wasn't born from pure fiction...

    7. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      Performant is a noun. It means performer, one who performs something.

      I suppose it could be jargon, but it's basically bullshit.

    8. Re:You say performant, I say performance... by gstoddart · · Score: 1

      A system which has good performance is said to be performant.

      Your own link says this has been in used for at least since the 70s.

      It's hardly a new term. It may only come up in specific contexts related to computing performance, but it ain't new.

      --
      Lost at C:>. Found at C.
    9. Re:You say performant, I say performance... by thoromyr · · Score: 1

      sorry, but "unabridged" means nothing with respect to the comprehensiveness of a dictionary -- except that if there is an abridged edition of the same dictionary the abridged version will have fewer words (or smaller definitions, or something).

      Your conclusion is based on a flawed assumption. I've been using a small dictionary for around thirty five years now. I can't check it (the dictionary is at home), but based on frequency of word use over time and the quality of that dictionary I expect the word would be listed. Maybe your unabridged dictionary isn't as good as you think it is -- if you actually care about words it is worthwhile having more than one dictionary.

      People often don't really register words that they hear or see (especially if the words seem familiar) and tend to underestimate the frequency of usage or overestimate their recency of usage as a result.

      Lets compare "performant" (which has apparently been in use for over a hundred years and is still used with many people having a basic idea of what it means from its form and in context) with "tergiversator" (which comes from latin and is essentially unused in recent years though still listed in dictionaries -- and I'd wager most people have no idea what it means without consulting a dictionary).

    10. Re:You say performant, I say performance... by Anonymous Coward · · Score: 1

      Language is a tool. Just because you're not versed in its intricacies doesn't mean that someone who is is inferior to you.

      People who use buzzwords to hide the fact that they aren't really saying anything are tools. It's been a long time since I've read an academic article so full of bullshit.

    11. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      Funnily enough, whoever uses that word is also a tool.

    12. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      In other words, tread lightly. The "PHB" wasn't born from pure fiction...

      "Players Handbook"?
      Ya, it's pretty much all fiction...

    13. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      You just said "someone who is is"; might want to expand your vocabulary so you can avoid such awkward "intricacies" in the future.

    14. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      Speaking of invaluable, my years of experience at the coalface of providing excellence in the industry have shown me that those who enunciate the constantly changing realities of modern computing paradigms with a flair for elegance, are usually those who perform rigorous cost-benefit analyses to provide maximum stakeholder value

      FTFY ;)

    15. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      You just said "someone who is is"; might want to expand your vocabulary so you can avoid such awkward "intricacies" in the future.

      And he was correct in his usage of those words.

    16. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      I thought that was "performer".

    17. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      Just wait until you see a performentat at work.

    18. Re:You say performant, I say performance... by medv4380 · · Score: 1

      Sorry, but performant isn't in the Webster online dictionary, and even Chrome thinks it's a misspelled word. Googles Ngram views also shows that up until the last few decades it's a rarely used word. However, in the books the Ngram viewer references it's not being used to indicate performance in at least one case in 1812. Heck even the usage in the 70's and 80's seems to be referencing it as an actor in something, and having nothing to do with performance or efficiency as this article want it to be.

    19. Re:You say performant, I say performance... by tsqr · · Score: 1

      Performance is a noun. Performant is an adjective. I guess he could have said "faster"

      performant
      noun

      a performer

      Word Origin
      based on informant, etc.

    20. Re:You say performant, I say performance... by tsqr · · Score: 1

      I thought that was "performer".

      performant is to performer as informant is to informer.

    21. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      I think it is the conceptual structure, not the specific word, which is awkward. Given the "Noun1 is superlative to noun2" structure, and the choice to compare only vague "performance," then performant is a perfectly appropriate word. That part of the sentence fails to say anything. How do you compare the performance of a CPU and a storage device? Is a gigaFLOP better or worse than 100,000 hours MTBF? The other part, "CPUs are more expensive than I/O devices," does give the reader a clear understanding of the metric of comparison. The word is a fine word; the thought process that makes its use appropriate is flawed.

    22. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      It's clearly not a noun in this context. The only performer here is you, and you're a clown.

    23. Re:You say performant, I say performance... by Art3x · · Score: 1

      Never seen the word "performant" until today. Must be an obscure five-dollar word that scientists love to toss around. Meanwhile, I'll stick with cheap performance as my word of choice.

      https://en.wiktionary.org/wiki/performant

      Performance is a noun. Performant is an adjective, meaning "having (high) performance" (or performing well). If you stick with performance, be sure to reword your sentence so that it makes sense.

      You're right, performance is a noun while performant an adjective. Most of the time, though, you can just say fast.

    24. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      Don't blame other for your ignorance.

    25. Re:You say performant, I say performance... by Captain+Splendid · · Score: 1

      Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo

      (slashdot lameness filter prevention plz ignore)

      --
      Linux, you magnificent bastard, I read the fucking manual!
    26. Re:You say performant, I say performance... by vikingpower · · Score: 1

      Concur. I am the proud possessor of a paper copy of the 20-volume Oxford English Dictionary. "Performant" isn't in the OED. "Performancer", as in "he / she who performs", however, is....

      --
      Religous speak to God. Insane are spoken to by God. When all shut up, one can finally hear Shostakovich in peace
    27. Re:You say performant, I say performance... by vikingpower · · Score: 1

      Thank fucking Gawd. Finally, someone says something on slashdot based on fucking data. Wow.

      --
      Religous speak to God. Insane are spoken to by God. When all shut up, one can finally hear Shostakovich in peace
    28. Re:You say performant, I say performance... by __aaclcg7560 · · Score: 1

      Everyone has an obligation to speak clearly in a way that everyone else can understand. As a college graduate, I could write enough buzzwords to make a scientist feel dizzy. But I write at an eighth grade level to reach the widest possible audience. If being clear makes me ignorant, I don't have a problem with that.

    29. Re:You say performant, I say performance... by TheCarp · · Score: 1

      Well Looks to me like Oxford says webster can suck it: http://www.oxforddictionaries....

      --
      "I opened my eyes, and everything went dark again"
    30. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      Really? I've been using it since I was in college (decades) and everyone in my engineering firm knows what it means. And if you are using "performance" (noun) in place of "performant" (adjective) then you have a grammar error. Also, this word may not be popular, but it's been around for 150+ years according to Google book search: https://books.google.com/ngrams/graph?content=performant&year_start=1800&year_end=2000&corpus=15&smoothing=3&share=&direct_url=t1%3B%2Cperformant%3B%2Cc0

    31. Re:You say performant, I say performance... by __aaclcg7560 · · Score: 1

      According to the link you provided, popular usage peaked in 1990 at 0.000001551%. :/

    32. Re:You say performant, I say performance... by whit3 · · Score: 1

      ... performance is a noun while performant an adjective. Most of the time, though, you can just say fast.

      An important exception occurs when we're talking about MTBF. Fast doesn't mean performant, at all.

    33. Re:You say performant, I say performance... by Art3x · · Score: 1

      ... performance is a noun while performant an adjective. Most of the time, though, you can just say fast.

      An important exception occurs when we're talking about MTBF. Fast doesn't mean performant, at all.

      Right, then you can say something like reliable. We have now seen how useless the word performant really is. It's completely dependent on context. So it means nothing more than good yet takes three times the bandwidth.

    34. Re:You say performant, I say performance... by Anonymous Coward · · Score: 0

      Well Looks to me like Oxford says webster can suck it: http://www.oxforddictionaries....

      That's in French. Neither Oxford nor Merriam-Webster lists "performant" in English. It's certainly in use. Some people seem to prefer "is more performant" over "has higher performance" or "performs better".

  5. I'm not reading all that by Anonymous Coward · · Score: 0

    Can someone provide a more useful summary?

    1. Re: I'm not reading all that by Fwipp · · Score: 3, Funny

      "SSDs exist now"

    2. Re:I'm not reading all that by random+coward · · Score: 1

      I/O tends to be the system bottlenecks. Someone's trying to sell something.

    3. Re: I'm not reading all that by NatasRevol · · Score: 2

      And VMware made VSAN to go on top of them - RAID5 for compute/disk nodes w 10G network backplanes instead of SAS.

      --
      There are two types of people in the world: Those who crave closure
    4. Re: I'm not reading all that by ChrisMaple · · Score: 1

      PCIe SSDs exist now.

      --
      Contribute to civilization: ari.aynrand.org/donate
    5. Re: I'm not reading all that by Anonymous Coward · · Score: 0

      "wow, I just installed my first SSD like ever, and man these things are flipping fast!! why did nobody tell me?"

    6. Re: I'm not reading all that by Anonymous Coward · · Score: 0

      ..and soon PCIe NVMe 3D XPoint SSD's.

  6. Summary with no content by Anonymous Coward · · Score: 0

    That's a lot of description for telling absolutely nothing.

  7. Most data orignates from slow sources by Anonymous Coward · · Score: 0

    This is certainly interesting for processing data after it's been captured but in my experience the vast majority of data comes from sources that operate at nowhere near these speeds.

    I'll not be giving up on spindles as a cost effective cache for 'slow data' any time soon...

  8. Author doesn't know what he's talking about by guruevi · · Score: 2

    This issue has been known to anyone using SSD's. The CPU's are still fast enough but the bandwidth between clients and servers (10Gbps is the average these days within a datacenter) no longer uses the full capacity of the disk subsystem (which is now connected at >10Gbps to each drive). Even having multiple disks in a single subsystem you can no longer use the capacity, not because of CPU issues but because of bandwidth issues between the CPU and the PCIe bus. That's why we're going away from large disk arrays and using 1 or 2U servers with 4-12 SSD drives and hooking them together with 'object storage' or other distributed storage mechanisms. That way you don't have a single point of failure and resource contention slowing you down.

    But that's not the point, the point the article is making is that CPU's are getting too slow and that's not true. The CPU's are plenty fast and using any sort of off-loading mechanism would result in RAID controllers with CPU's that have to be just as powerful because if they aren't, you get the issues you have with current RAID controllers: they are slow and expensive (a single link to a 12Gbps chip is a bottleneck to an entire array of 12Gbps drives). Also you lose the scheduling, checksumming, hardware monitoring and all the other fancy things software-based solutions do these days.

    Using CPU's as glorified RAID controllers is just fine and I don't foresee another solution as long as your software is fast and concise (eg. ZFS). If you start handing off anything to dedicated CPU's then you're just losing the control and customization a software based solution allows you to have.

    --
    Custom electronics and digital signage for your business: www.evcircuits.com
    1. Re:Author doesn't know what he's talking about by Anonymous Coward · · Score: 0

      In your case are you using distributed or clustered filesystems as well? This seems like a perfect use case for a clustered filesystem.

  9. The Shakeup by requerdanos · · Score: 1
    The reason for the shakeup, according to TFA:

    The arrival of high-speed, non-volatile storage devices, typically referred to as Storage Class Memories (SCM), is likely the most significant architectural change that datacenter and software designers will face in the foreseeable future. SCMs are increasingly part of server systems, and they constitute a massive change: the cost of an SCM, at $3-5k, easily exceeds that of a many-core CPU ($1-2k), and the performance of an SCM (hundreds of thousands of I/O operations per second) is such that one or more entire many-core CPUs are required to saturate it.

  10. $4k/TB by Anonymous Coward · · Score: 0

    I found a single card that cost $14k to provide 5GB.
    Until the price drops by a factor of at least 10 I doubt this will go anywhere fast.

  11. summary of TFA...not really about the CPU at all.. by nimbius · · Score: 2

    storage is slower than processors until you consider caching things in ram and in which case its magically faster.
    Other points mentioned:
    Balanced Systems: you can have lots of ram but make sure you have the network to serve it. CPUs were unavaillable for comment.
    Contention-Free I/O-centric Scheduling: uh, has been around for nearly 15 years since the invention of the X86_64 architecture...at least...formally in the domain of commodity hardware. CPUS could not be reached for comment.
    Workload-aware Storage Tiering: remember all that crap we mentioned about memory caches for everything? well now we're drifting into the realm of object stores so sit tight. tiered storage has existed for 15 years as well...so we're a bit late to the party for this one.

    The Future: RAM + Acronym + expensive support contract = Storage Class Memory!. learn it, embrace it, and most importantly, make sure its on the fucking purchase order this year*


    *not applicable if youre using redis, memcached, ceph, couch, hadoop, hypercube, or any one of about 30 other different commodity hardware centric distributed data frameworks designed to purge the vendors from the budget as jesus purged the jews from the temple.

    --
    Good people go to bed earlier.
  12. SSD's Exist and are really fast.... by bobbied · · Score: 0

    Now tell me something I don't already know..

    OK, OK, so CPU speeds are not trending up at quite the same pace and nonvolatile storage. But it's not like this has gone unnoticed or we haven't been making hardware changes to take advantage of this over the last decade in the data center. Just like we've adjusted to new power, network and virtualization technologies in the data center.

    The real story is that CPU speeds are not trending up as steeply as they where 10 years ago, but we've been seeing huge leaps in storage speeds. How this changes the optimum hardware and software configuration is no real mystery, as system designers and integrators have been effectively dealing with this for years now...

    --
    "File to fit, pound to insert, paint to match" - Aircraft Maintenance 101
    1. Re:SSD's Exist and are really fast.... by jandrese · · Score: 1

      Really what we're seeing is storage finally starting to catch up with CPU after lagging behind for nearly 30 years. The author is freaking out that ZOMG the disk isn't always the slowest thing on the system anymore, but this is not really news at this point. The exciting part for me is that in some cases you may be able to eliminate one cache from the system. Caches are a necessary evil that introduce big headaches into system design, so being able to eliminate one can greatly simplify parts of your system.

      I did find it odd that the use case he kept going back to was someone buying some crazy expensive RAM storage array, and then sticking a single commodity server on the thing and being shocked that the server was CPU bound. The point about the Linux IO subsystem not being up to the task is interesting, but having not looked into it myself I can't help but to wonder if there isn't some kernel tuning or optional module support he could have enabled to improve the situation.

      --

      I read the internet for the articles.
    2. Re:SSD's Exist and are really fast.... by goarilla · · Score: 1

      I did find it odd that the use case he kept going back to was someone buying some crazy expensive RAM storage array, and then sticking a single commodity server on the thing and being shocked that the server was CPU bound. The point about the Linux IO subsystem not being up to the task is interesting, but having not looked into it myself I can't help but to wonder if there isn't some kernel tuning or optional module support he could have enabled to improve the situation.

      Well there is new kernel tech for it (https://www.thomas-krenn.com/en/wiki/Linux_Multi-Queue_Block_IO_Queueing_Mechanism_%28blk-mq%29).

    3. Re:SSD's Exist and are really fast.... by MrKaos · · Score: 1

      Well there is new kernel tech for it (https://www.thomas-krenn.com/en/wiki/Linux_Multi-Queue_Block_IO_Queueing_Mechanism_%28blk-mq%29).

      That is really interesting, thanks for pointing it out. I missed your question when you posted it:

      When do you decide to have a system managed service (for example apache) or a /etc/init.d initscript ?

      If it is a process I want to stay up, then I use inittab. Apache is a pretty good choice for an init service, another example is databases or messaging systems. However , if it is someone else's system I just do it how they do it to fit in.

      For example, apache could be be set up in inittab with a 'respawn' directive, so if the process is terminated it restarts automatically, if there is a problem with the service, and it won't start then, init will disable if it is respawning too rapidly.

      --
      My ism, it's full of beliefs.
  13. The first sentence doesn't even make sense by Anonymous Coward · · Score: 1

    What does it mean for a CPU to be "more performant" that an I/O device? They do totally separate things. You can't even measure them with the same units.

    Is a drill "more performant" than a hammer?

    1. Re:The first sentence doesn't even make sense by Anonymous Coward · · Score: 1

      Your autism is showing! I'm sure you know exactly what was meant, but you're just being a pedantic dickweed by choice.

      The measurements in this case are in terms of operations per unit of time, or if you prefer, the amount of time required per operation.

      Typically, a CPU can perform one of its operations (executing an instruction) much, much, much faster than a spinning platter hard drive could perform one of its operations (reading or writing a sector of data).

      So when both a CPU and a spinning platter hard drive are given an operation to perform, the CPU will finish much sooner, and is ready to perform further operations well before the spinning platter hard drive is.

      When it comes to construction tools, a drill can be very well be more performant than a hammer. The drill could be used to drive, say, 4 screws in the time it takes to hammer in 1 nail. Or the drill could be used to drill 3 holes in the time it takes to hammer in 1 nail.

      If the drill can perform more operations per unit of time than the hammer can, the drill is said to be more performant.

    2. Re:The first sentence doesn't even make sense by barbariccow · · Score: 1

      Is a drill "more performant" than a hammer?

      Obviously you missed the boat. Both the hammer and the drill have OBSOLETE! Introducing, the Hammer Drill: https://en.wikipedia.org/wiki/...

  14. Performant does not (necessarily) mean "faster" by SuperKendall · · Score: 4, Insightful

    Performant is actually a pretty useful word in place of "real" ones like "faster", because "performance" is a word that can change meaning depending on what you consider to be good (or desired) performance.

    Maybe good performance means that it's using all of the cores on a CPU well. maybe it means that it's not using much of the system at all, but is using the network very well, or work is spread out across a cluster in an extremely balanced fashion. "Faster" may be a by-product, but it may not, because people using the word "performant" often value stability over absolute speed.

    I guess the closest concept "performant" comes to is being well-balanced, or perhaps meeting some goal you had set during design.

    So don't be too dismissive of a new word, it can be the case a new word was made because old ones wouldn't really fit without a lot of verbosity.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
  15. Article is kinda pie-in-the-sky wrong by m.dillon · · Score: 3, Interesting

    At least, not totally correct. Memory bus non-volatile storage such as Intel's X-Point stuff still requires significant cache management by the operating system. Why? Because it doesn't have nearly enough durability to just be mapped as general purpose memory. A DRAM cell goes through trillions of cycles in its live time. Something like X-Point might be 1000x more durable than standard flash, but it is still 6 orders of magnitude LESS durable than DRAM. So you can't just let user programs write to it however they like.

    Secondly, in terms of data-center machines becoming obsolete. Also not correct. SSDs make a fine bridge between traditional HDD or networked storage and something like X-Point. For two reasons: First, all data center machines have multiple SATA busses running at 6GBits. Gang them all together and you have a few gigabytes/sec worth of standard storage bandwidth. Secondly, you can pop nVME flash (PCI-E based flash controllers) into a server and each one has in excess of 1 GByte/sec of bandwidth (and usually much more).

    Third, in terms of memory management, paging to/from SSD or nVME 'swap' space, or using it as a front-end cache for slower remote storage or spinny disks, already provides servers with a fresh new life that means they won't be obsolete for years to come.

    And finally there is the cost. These specialized memory-bus non-volatile memories are going to be expensive. VERY expensive. To the point where existing configurations still have a pretty solid niche to play in. Not all workloads are storage-intensive and these new memory-bus non-volatile memories don't have the density to be able to replace the storage required for large databases (or anywhere near it).

    So, the article is basically a bit too pie-in-the-sky and ignores a lot of issues.

    -Matt

    1. Re:Article is kinda pie-in-the-sky wrong by Anonymous Coward · · Score: 0

      Not to mention, the convergence of multi-core CPU, GPU, and SIMD extensions means that all the long-observed performance ratios of IO versus compute now apply to your typical DRAM as well. To actually use the throughput of your compute elements, you have to think about things like accessing DRAM from the CPU or GDRAM from the GPU as an IO task with significant latency and restricted bandwidth. We keep getting deeper hierarchies with more inflection points, not shallower dance-hall architectures where you could assume everything was uniform and good.

    2. Re:Article is kinda pie-in-the-sky wrong by sexconker · · Score: 1

      A DRAM cell goes through trillions of cycles in its live time.

      Typical DDR4 (2133) runs at a little over 1 GHz, or a billion cycles per second of operation.

      We're in the quadrillions scale, not trillions.

    3. Re:Article is kinda pie-in-the-sky wrong by Anonymous Coward · · Score: 0

      So I have 2 concerns about the article.

      1). The article could be read as, "there's this danger with high performance storage systems!" No, it's not a danger, it's an opportunity, one that the storage world has needed for decades. I realize the article doesn't go there (danger), but in focusing on the problems of optimizing the software stack for the hardware, it's a small leap to make, and that leap is entirely inaccurate and incorrect;

      2). The article is misleading that that Storage Class Memories (SCM) are the key expense in the data center. The key expense is people and that has been true for 30 or more years. Again, I realize the article aims to limit the discussion to the components of a server/cluster/rack/what have you. However this misleading focus has dramatic implications.

      People are the key expense! That includes Data Center personnel, programmers, database analysts and all the rest. The article spends all it's time talking about optimizing the use of the hardware, but at what cost in personnel costs? Is it even worth it to optimize in this way? The reality is, aside from niche cases, most hardware (and this includes nearly all Data Center hardware) is used suboptimally, and often in ways downright wasteful.

      The authors want to talk about how multi-core or even multi-processor CPU's are needed to saturate Storage Class Memories. What of it? Do we lack availability of such systems? This is so wrong it's Not Even Wrong. Four core CPU's are literally entry level hardware now! Eight core CPU's are commonplace in servers and boring. Hyperthreading is also commonplace. You can buy single CPU's with up to 72 cores. Suggesting that lack of CPU firepower to load a Storage Class Memory is a serious problem is way off base.

      I'm a bit old school and like to see efficient computer use. However I'm realistic enough to see where the wind blows. Modern business is concerned about overall expenses and is perfectly happy with 80% solutions. The result is that most business computing systems are a result of throwing hardware at performance problems, until the performance problems go away. That's the bar we aim for, not "gee, we need to get to 98% utilization or someone gets fired!"

      There's nothing in flash storage systems, or indeed any computing architectural changes, that alters this reality of modern computing life. Spending large amounts of analytical time for relatively small increases in hardware utilization, smacks of mainframe era thinking. That's what will get you fired, spending excessive resources on things the business doesn't care about.

    4. Re:Article is kinda pie-in-the-sky wrong by LDAPMAN · · Score: 1

      SATA, even 6 or 12GB SATA, will be going away for HPC application and it may go away for even more moderate performance systems. Direct PCI is even faster. Thats what the article is about. The speed of storage and memory are converging. That will change how we build systems.

    5. Re:Article is kinda pie-in-the-sky wrong by subk · · Score: 1

      A DRAM cell goes through trillions of cycles in its live time.

      Typical DDR4 (2133) runs at a little over 1 GHz, or a billion cycles per second of operation.

      We're in the quadrillions scale, not trillions.

      Maybe he meant "cycles during which its own state is changed". They get less wear coasting for a few quadrillion laps than they do altering the value a few trillion times.

      --
      Now, if you'll excuse me, I have backups to corrupt.
    6. Re:Article is kinda pie-in-the-sky wrong by Anonymous Coward · · Score: 0

      I'll argue the other way: the OS doesn't understand X-Point as well as dedicated applications do. So it should just map X-Point to application space, for those applications which are X-Point aware. Practically speaking, such applications would include databases and the like.

      A secondary benefit is that you save the system call overhead in this way. That overhead can be less than a microsecond, which didn't matter even for SATA SSD's. It only becomes relevant once you start to approach 1M IOPS.

  16. Hyper Convergence by Anonymous Coward · · Score: 0

    Hyper Convergence is possible now. VSAN is my favorite :)

  17. CPUs more expensive? by Anonymous Coward · · Score: 0

    a fundamental observation has consistently held true: CPUs are significantly more performant and more expensive than I/O devices

    If I sold my computer now, I'd say 95% of what I get would come from my I/O devices. And here I'm including motherboard, RAM, SSD & chassi to "CPU".
    A good display, keyboard, mouse & headphones have been more crucial for my performance than CPU upgrades for soon a decade.

  18. ceph is cool and just want NON raid cards by Joe_Dragon · · Score: 1

    ceph is cool and just want NON raid cards to link the back planes to the system board. Hardware raid was good in the past but now days multi node software is better with out the hardware raid lock in / losing 1-2 disks = data lost.

  19. What about capacity, durability, thoughput... by Anonymous Coward · · Score: 0

    Latency is just one dimension of the problem.

  20. Well duh. by Anonymous Coward · · Score: 0

    1. 64 bit addressing offers huge address ranges perfect for addressing high speed devices.
    2. density of high speed SSD storage has almost reached a point where it's cost effective.

    The idea of having offboard 'controllers' for things will reach an end at some point. We've already seen consolidation here, with thunderbolt (PCIE on a cable), USB 3.1 (which is basically thunderbolt).

  21. embedded FPGAs by Anonymous Coward · · Score: 0

    Ohhhh...

    So that's why Intel bought out the FPGA company, think in terms of a programmable data filter
    built into the drive, itself.

  22. Authors maybe know what they are talking about by WoOS · · Score: 1

    Well. I think the authors do have some points although at least some of them are existing in embedded systems (which execute directly out of Flash) for a long time:
    * CPU cycle hungry, most efficient disk caching algorithms are not that efficient anymore once "disk" (or rather Flash) access manages to catch up to the CPU. Less efficient but also less resource hungry algorithms might be advantageous then.
    * Issuing lots of read accesses in advance to keep your worker threads busy might only help in occupying RAM but not speed up processing anymore if data arrives long before workers finished their previous job.
    * Multi-core access to the Flash needs better (and less blocking) synchronization than with disk, where actual colliding accesses would have been more rare due to long time between them (being executed).
    * If serial and random accesses show only small difference in access times (as they do for Flash: a few clock cycles for the Flash to throw away its read-ahead cache and get new data instead of the huge wait for head positioning and sector arrival of spinning disks), caching strategies might have to change, e.g. maybe caching leaf inodes is then not efficient anymore (just guessing here).
    * And they seem to be talking about networking due to attaching disk clusters to servers via ethernet. But there I guess the authors are not radical enough: Why not connect the Flash devices directly to the server, they take much less space and power than spinning disks.

    But as I said, I would have expected many of this being explored with respect to embedded computing long ago and with respect to servers already since the advent of the first SSDs (by talking to the embedded guys). Now seems a bit late for an ACM magazine article about that (unless ACM is falling behind tech development).

  23. Article has a major error by thedanyes · · Score: 0

    Seems to me there's a simple and major error in this document. Figure 2 shows storage latency in nanoseconds for a spinning disk... I don't know what kind of spinning disks they're using... maybe alien technology?

  24. It is food for thought... by ndykman · · Score: 1

    The basic point of the article is dead on. The major assumption that I/O is extremely slow has driven the organization of computer architecture from the beginning. But, as the article noted, in the last few years, that equation can be changed drastically. The memory hierarchy is going to get more complicated: DRAM, NVDIMM, NVM, SSD, HD, Optical/Tape, and best using that hierarchy means that there are changes that need to be made.

    For one, I think there will be a lot of research in this area. Just like modern network cards do a lot of processing before involving the CPU, it may be necessary to have similar abilities. For example, allowing the network and I/O devices to work with each other with much less CPU intervention. Or making the I/O controllers smarter so that computation can be moved to the disks. How to best organize operating systems to use this new memory hierarchy well.