Slashdot Mirror


100 Million-Core Supercomputers Coming By 2018

CWmike writes "As amazing as today's supercomputing systems are, they remain primitive and current designs soak up too much power, space and money. And as big as they are today, supercomputers aren't big enough — a key topic for some of the estimated 11,000 people now gathering in Portland, Ore. for the 22nd annual supercomputing conference, SC09, will be the next performance goal: an exascale system. Today, supercomputers are well short of an exascale. The world's fastest system at Oak Ridge National Laboratory, according to the just released Top500 list, is a Cray XT5 system, which has 224,256 processing cores from six-core Opteron chips made by Advanced Micro Devices Inc. (AMD). The Jaguar is capable of a peak performance of 2.3 petaflops. But Jaguar's record is just a blip, a fleeting benchmark. The US Department of Energy has already begun holding workshops on building a system that's 1,000 times more powerful — an exascale system, said Buddy Bland, project director at the Oak Ridge Leadership Computing Facility that includes Jaguar. The exascale systems will be needed for high-resolution climate models, bio energy products and smart grid development as well as fusion energy design. The latter project is now under way in France: the International Thermonuclear Experimental Reactor, which the US is co-developing. They're expected to arrive in 2018 — in line with Moore's Law — which helps to explain the roughly 10-year development period. But the problems involved in reaching exaflop scale go well beyond Moore's Law."

45 of 286 comments (clear)

  1. 100 Million? by Itninja · · Score: 3, Funny

    Can't we just start calling this a 'supercore' or something? When the numbers get that high it kind of goes beyond what most people can visualize. Like describing how hot the Sun is....let's just says it's "exactly 1 Sun hot".

    --
    I judt got a nre Kinesis keybiartf so please excusr ant egregiou typos.
    1. Re:100 Million? by MozeeToby · · Score: 2, Insightful

      How about 1 million cores being a mega-core. So the proposed supercomputer would be a 100 mega-core computer.

    2. Re:100 Million? by Anonymous Coward · · Score: 2, Insightful

      Because even though the number's "effect" on you diminishes as it goes up, doesn't mean it is still significant. There's a reason Engineers use quantitative instead of qualitative.
      How do you tell the difference between hot and really hot or really really hot?

      Really.

      How about the difference between 10, 20 and 30?

      10

      Which gives you more information?

    3. Re:100 Million? by Yvan256 · · Score: 4, Funny

      Let's just make sure it's 1 000 000 cores and not 1 048 576 cores... let's not make that mistake again.

    4. Re:100 Million? by _KiTA_ · · Score: 3, Funny

      Let's just make sure it's 1 048 576 cores and not 1 000 000 cores... let's not make that mistake again.

    5. Re:100 Million? by Yvan256 · · Score: 4, Insightful

      Just because CS has been abusing a system for over four decades doesn't make it right.

    6. Re:100 Million? by aztracker1 · · Score: 2, Interesting

      Far more computer science types wind up working with money (base-10) than anything base-2 or base 16.

      --
      Michael J. Ryan - tracker1.info
    7. Re:100 Million? by Mikkeles · · Score: 2

      'When loggers measure in board feet, they call them board feet not yards....'

      Board-feet measure volume; yards measure length.

      --
      Great minds think alike; fools seldom differ.
    8. Re:100 Million? by Thud457 · · Score: 5, Funny

      Can you translate that in "Library of Congress's"?

      yes. It generates the same amount of heat as burning 37 Libraries of Congress.

      --

      the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff

    9. Re:100 Million? by JesseMcDonald · · Score: 3, Insightful

      You use SI prefixes with SI units. The 'byte' is not an SI unit; it's not even the most basic representation, being a group of eight bits. If you insist on using base-ten units in combination with bytes then you're essentially arguing for layering a base-ten system on top of a base-two one.

      So far as I know there is no designated SI unit for information. Following the pattern of the other SI units, however, the best choice would be the bit. If you want base-ten measurements, then, you should use "kilobit", "megabit", etc., which unambiguously use the SI prefixes, official unit or not. The non-SI term "megabyte" will never unambiguously mean "10^6 bytes", and trying to make it so just renders the term useless for any purpose requiring precision.

      --
      "The state is that great fiction by which everyone tries to live at the expense of everyone else." - Bastiat
    10. Re:100 Million? by sexconker · · Score: 5, Insightful

      CS abused nothing.
      KB means 1024 bytes, and it always will.

      KB is not K.
      KB is not stepping on the toes of any SI units.
      SI units are not sacred.
      SI units are not enforceable by law.
      SI units step on their own toes and are ambiguous themselves.

      Anytime you see a b or a B after a K, M, etc. scalar multiplier, you are talking about bits or bytes and are using 1024 instead of 1000. It is not confusing. It is not ambiguous.

      It's the fault of storage device marketers and idiot "engineers" who didn't check their work, made a mistake on some project, and refuse to admit it that the "confusion" exists.

      Furthermore, classical SI scalars are used for measuring - bits are discrete finite quanta - we COUNT them. Would you like a centibyte? TOO FUCKING BAD.

      The scalar of 1000 was chosen out of pure convenience. The scalar 1024 was chosen out of convenience, and was made a power of 2 because of the inherent nature of storage with respect to permutations (how many bits do I need to contain this space at this resolution? how much resolution and space can I get from this many bits?) and because of physical aspects relating to the manufacturing and design of the actual circuits.

      CS has a fucking REASON to use 1024.
      SI does not have a fucking reason to use 1000.

      There is more validity in claiming that all SI units should be switched to 1024 than there is in suggesting KB mean 1024 bytes.

      "But everything written before the change will be ambiguous!!!" yet you SI proponents tried to shove that ibi shit into CS (and failed miserably, thank you) despite the fact that it would cause the same fucking problem ("Does he mean KB or KiB?" "When was it published?" "Uh, Copyright 1999-2009" "Uh...").

      In short, 1024 is correct, 1000 is wrong.

    11. Re:100 Million? by Quantumstate · · Score: 2, Insightful

      SI has a very good reason. Try converting from mm to km. It is trivial. Now try your number *1024*1024, this clearly isn't as friendly.

      Therefore the reason we use 1000 in SI is because we work in base 10 and it makes the numbers easy.

    12. Re:100 Million? by Anonymous Coward · · Score: 3, Insightful

      Yes, numbers in computers go from 1, 2, 4, ..., 1024, 2048, 4096, ..., 1048576, ..., etc. Nobody is arguing against that.

      Nobody is debating wether or not computers work in base 2 for some areas (such as RAM, addressing, etc), so stop bringing that up, it's not a valid argument for making a kilo being equal to 1024. Not everything in a computer is base 2, such as the ethernet port being 10/100/1000.

      The problem is exactly that "1024 was chosen out of convenience". A kilo means 1000 and that's all there is to it, you cannot change facts.

      As for "causing the same fucking problem", we're already there. Fixing it now would mean that at least from this point forward we'd know for sure what people meant, that's why the new units were proposed. If I say "1 kibibyte" you know it's 1024. If I say "1 kilobyte" you can't be sure I mean 1000 or 1024.

      CS always have such stupid problems. If I give you the date "10/11/12", what the hell does it mean? First of all there's the Y2K problem with old data, second you have the MM/DD/YY vs DD/MM/YY vs YY/MM/DD problem (and ISO 8601 fixes this problem beautifully, YYYY-MM-DD).

      Just because there is old, set ways to do something doesn't mean it's the right way.

    13. Re:100 Million? by sFurbo · · Score: 2, Insightful

      Anytime you see a b or a B after a K, M, etc. scalar multiplier, you are talking about bits or bytes and are using 1024 instead of 1000. It is not confusing. It is not ambiguous.

      So, whenever you have a unit prefixed by M, it means 1000000, except in these two particular case. How is that not confusing and/or ambiguous?

      CS has a fucking REASON to use 1024. SI does not have a fucking reason to use 1000.

      CS have a reason for 1024, but none for wanting it to be called k. SI have a reason for wanting every k to mean 1000. It is fine if you want prefixes which fits the use of one particular field, but don't just take something welldefined and give it a conflicting meaning. That is just asking for trouble, which is exactly what you got.

  2. Who's President, Future-boy? by pete-classic · · Score: 4, Insightful

    As amazing as today's supercomputing systems are, they remain primitive

    Wait, what? You lost me. Are you from the future? How can you describe the state of the art as "primitive"?

    -Peter

    1. Re:Who's President, Future-boy? by Yvan256 · · Score: 2, Funny

      Forget the president, ask for the winning lottery numbers for the next 20 years!

    2. Re:Who's President, Future-boy? by mcgrew · · Score: 5, Interesting

      My cell phone is a supercomputer. At least, it would have been if I'd had it in 1972. Rather then being from the future, he, like me, is from the past and living in this science fiction future when all that fantasy stuff like doors that open by themselves, rockets to space, phones that need no wires and fit in your pocket, computers on your desk, ovens that bake a potato in three minutes without the oven getting hot, flat screen TVs that aren't round at the corners, eye implants that cure nearsightedness, farsightedness, astigmatism and cataracts all at once, etc.

      Back when I was young it didn't seem primitive at all. Looking back, GEES. When you went to the hospital they knocked you out with automotive starting fluid and left scars eight inches wide. These days they say "you're going to sleep now" and you blink and find yourself in the recovery room, feeling no pain or nausea with a tiny scar.

      We are indeed living in primitive times. Back in the 1870s a man quit the Patent office on the grounds that everything useful had already been invented. If you're young enough you're going to see things that you couldn't imagine, or at least couldn't believe possible.

      Sickness, pain, and death. And Star Trek.

    3. Re:Who's President, Future-boy? by David+Greene · · Score: 5, Insightful

      Wait, what? You lost me. Are you from the future? How can you describe the state of the art as "primitive"?

      Pretty easily, actually. There are lots of problems to solve, not the least of which is programming model. We're still basically using MPI to drive these machines. That will not cut it on a 100-million core machine where each socket has on the order of 100 cores. MPI can very easily be described as "primitive," as well as "clunky," "tedious" and "a pain in the ***."

      How do we checkpoint a million-core program? How do we debug a million-core program? We are in the infancy of computing.

      --

    4. Re:Who's President, Future-boy? by vtcodger · · Score: 2, Funny

      ***How do we debug a million-core program?***

      What is this "debugging" thing you speak of? If you are asking how we will test software for a million core system, we'll do it the same way we always have. We'll get a trivial test case to run once, then we'll ship.

      --
      You can't see ANYTHING from a car, You've got to get out of the goddamned contraption and walk...Edward Abbey
    5. Re:Who's President, Future-boy? by turgid · · Score: 2, Informative

      When a computer develops a mind of it's own in a logical manner it's starting to reach the human level and we can start to discuss if it's primitive or not. If it starts to reproduce on it's own it's time to be careful.

      That's not directly related to computing power per se. A computer 100 000 000 times as powerful as today's, running today's software will still not have developed a mind of its own. It'll just be very, very fast indeed.

  3. Limits on simulation. by 140Mandak262Jamuna · · Score: 4, Interesting
    The programming techniques and mathematical formulations needed to take advantage of such very large number of processors continue to be the main stumbling blocks. Some kind of simulations parallelize naturally. Time accurate fulid flow simulation for example is very easy to parallelize and technically you can devote a processor for each element and do time marching nicely. But not all physics problems are amenable to parallelization. Further even in the nice cases like fluid flow, if one tries to do solution adaptive meshing, no uniform grids etc, the time step slows down so much the simulation takes too long even on a 100 million processor machine.

    The CFL condition that limits the maximum time step one can take shows no sign of relenting. Score has been Courant (the C in CFL) 1, Moore 0 for the last three decades.

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    1. Re:Limits on simulation. by David+Greene · · Score: 2, Interesting

      Further even in the nice cases like fluid flow, if one tries to do solution adaptive meshing, no uniform grids etc, the time step slows down so much the simulation takes too long even on a 100 million processor machine.

      That's true in general. However, techniques like dynamic scheduling can help. Work stealing algorithms and other tricks will probably become part of the general programming model as we move forward. More and more of this has to be pushed to compilers, runtimes and libraries.

      --

  4. How many problems can these systems really solve? by wondi · · Score: 3, Insightful

    All this effort at creating parallel computing ends up solving very few problems. HPC has been struggling with parallelism for decades, and no easy solutions found yet. Note that these computers are aimed at solving a particular problem (e.g. modeling weather) and not at being a vehicle to quickly solve any problem. When the comparable multi-processing capacity is in your cell phone, what are you going to do with it?

    --
    2B|^2B
  5. Why 100 million processors? by 140Mandak262Jamuna · · Score: 4, Funny

    Technically, shouldn't 640K processors be enough for every one?

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    1. Re:Why 100 million processors? by Yvan256 · · Score: 3, Funny

      It is, if we're talking about cloud processors for running vaporware.

  6. Why build this monstrosity? by 140Mandak262Jamuna · · Score: 4, Funny

    We know what answer it is going to give. 42. Save the money.

    --
    sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
    1. Re:Why build this monstrosity? by thewils · · Score: 2, Funny

      That's the answer though. They're building this thing to find out what the question was.

      --
      Once I was a four stone apology. Now I am two separate gorillas.
  7. Re:Sorry - I can't help myself by sherpajohn · · Score: 3, Funny

    Only if they run Linux and can render Natalie Portman covered in hot grits faster than my imagination already does....woohoo!

    --

    Going on means going far
    Going far means returning
  8. The Jaguar? by Yvan256 · · Score: 2, Interesting

    The Jaguar is capable of a peak performance of 2.3 petaflops.

    The first Jaguar was a single megaflop.

    1. Re:The Jaguar? by threephaseboy · · Score: 2, Funny

      But did it leak oil?

      --
      .
    2. Re:The Jaguar? by Yvan256 · · Score: 2, Insightful

      Well, okay, the second Jaguar (cue reply about the Atari Jaguar not being the second commercial product called Jaguar).

      Also, the person who modded my post above "interesting" is either on crack or I was right (by luck) about the Jaguar having 1 megaflop of computing power.

      My post, however, was that the Atari Jaguar was a mega-flop, i.e. its sales were abysmal, support non-existent, etc.

  9. Windows 2018 by smitty777 · · Score: 2, Funny

    Maybe this thing will have enough power to run Windows by 2018??

    --
    "Before God we are all equally wise - and equally foolish"
    Albert Einstein
  10. Re:How many problems can these systems really solv by Dgtl_+_Phoenix · · Score: 2, Funny

    When the comparable multi-processing capacity is in your cell phone, what are you going to do with it?

    Stream high definition porn... duh.

  11. human brain by simoncpu+was+here · · Score: 4, Interesting

    How many cores do we need to simulate a human brain?

  12. Re:AMD vs Intel by Eharley · · Score: 4, Informative

    I believe AMD was the first mass market CPU to include an on-board memory controller.

  13. Speaking of heat by ArbitraryDescriptor · · Score: 5, Funny

    I am currently accepting investors to help build a one billion core supercomputer to create high resolution climate models that take into account the waste heat from a 100 million core supercomputer making a high resolution climate model.

    (Seriously, how much heat is that thing going to put out?)

  14. Re:How many problems can these systems really solv by David+Greene · · Score: 2, Informative

    Note that these computers are aimed at solving a particular problem (e.g. modeling weather) and not at being a vehicle to quickly solve any problem.

    That's not entirely accurate. HPC systems are designed to solve a class of problems. That's not the same thing as a "particular" problem. Jaguar has, in fact, solved many different problems, including fluid flow, weather, nuclear fusion and supernova modeling. It's not going to run Word any faster than your PC but that's not what you buy a supercomputer to do.

    --

  15. Re:Partly a software problem. Erlang? by Anonymous Coward · · Score: 2, Informative

    What on Earth? You're bringing PHP and "rendering PDF reports" into a discussion about HPC? And you propose Erlang as some kind of solution? Nobody doing HPC is using Erlang. As usual for Slashdot, you have absolutely no idea what you're talking about.

  16. Re:How many problems can these systems really solv by Again · · Score: 4, Funny

    That's not entirely accurate. HPC systems are designed to solve a class of problems. That's not the same thing as a "particular" problem. Jaguar has, in fact, solved many different problems, including fluid flow, weather, nuclear fusion and supernova modeling. It's not going to run Word any faster than your PC but that's not what you buy a supercomputer to do.

    So you're saying that OpenOffice would still take forever to start.

  17. Re:Oink, oink by David+Greene · · Score: 2, Insightful

    Sounds like a pork program. What are "bio energy products", anyway. Ethanol?

    I'm no expert on this, but I would guess the idea is to use the processing power to model different kinds of molecular manipulation to see what kind of energy density we can get out of manufactured biological goo. Combustion modeling is a common problem solved by HPC systems. Or maybe we can expore how to use bacteria created to process waste and give off energy as a byproduct. I don't know, the possibilities are endless.

    It's striking how few supercomputers are sold to commercial companies. Even the military doesn't use them much any more.

    Define "supercomputer." Sony uses them. So does Boeing. The auto industry uses clusters to model crashes, but I believe that's more limited by the design of the off-the-shelf software than anything. They could certainly run on supercomputer-class machines if the vendors ported them.

    And the military uses them a lot. Much of the DOE research done on these machines is probably defense-driven.

    --

  18. Re:AMD vs Intel by confused+one · · Score: 3, Informative

    I'd be guessing but here are three possible reasons AMD might be in that place:
    1.) Value, ie. lower cost per processor
    2.) Opteron has built in straight forward 4-way and 8-way multiprocessor connectivity, Xeon was limited to 2-way connectivity without extra bridge hardware, until recently.
    3.) Opteron has higher memory bandwidth than P4 or Core 2 arch.

  19. Re:AMD vs Intel by hattig · · Score: 2, Informative

    Easy CPU upgrades because the socket interface stay the same.

    Some of those supoercomputers might have gone from dual-core 2GHz Opteron K8s through quad-core Opteron K10s to these new sexa-core Opteron K10.5s with only the need to change the CPUs and the memory.

    Or possibly if the upgrades were done at a board level, HyperTransport has remained compatible, so your new board of 24 cores just slots into your expensive, custom, HyperTransport-based back-end. To switch to Intel would require designing a QPI-based back-end.

    Of course Magny-Cours and Bulldozer will use the G34 socket, so that's not a plug-in and go upgrade when they come out in 2010 and 2011 respectively. But it will be a stable platform for several years itself, and thus be attractive.

  20. BILL LIVES! by TiggertheMad · · Score: 2, Funny

    just had an ugly thought...."Windows17 for PC (Personal Cloud)"

    --

    HA! I just wasted some of your bandwidth with a frivolous sig!
  21. Re:How many problems can these systems really solv by shmlco · · Score: 2, Funny

    Stream it? With that much processing power it should be able to create it on the spot: "Computer, let's start today's scenario with Angelina Jolie surrounded by...."

    --
    Any sect, cult, or religion will legislate its creed into law if it acquires the political power to do so.
  22. I've often wondered... by petrus4 · · Score: 2, Interesting

    ...what might happen if we could run a copy of The Sims on a truly massive supercomputer. It would need to be somewhat customised for that particular machine/environment, of course, but I think it could be interesting.

    There were times when I did see something close to genuinely emergent behaviour in the Sims 2, or more specifically, emergent combinations of pre-existing routines. You need to set things up for them in a way which is somewhat out of the box, and definitely not in line with real world human architectural or aesthetic norms, but it can happen.

    Makes me think; if we could run the Sims, or the bots from some currently existing FPS, parallel on a sufficiently large scale, we might eventually start seeing some very interesting results come from it, at least within the contexts of said games.