Slashdot Mirror


IBM's Blue Gene Runs Continuously At 1 Petaflop

An anonymous reader writes "ZDNet is reporting on IBM's claim that the Blue Gene/P will continuously operate at more than 1 petaflop. It is actually capable of 3 quadrillion operations a second, or 3 petaflops. IBM claims that at 1 petaflop, Blue Gene/P is performing more operations than a 1.5-mile-high stack of laptops! 'Like the vast majority of other modern supercomputers, Blue Gene/P is composed of several racks of servers lashed together in clusters for large computing tasks, such as running programs that can graphically simulate worldwide weather patterns. Technologies designed for these computers trickle down into the mainstream while conventional technologies and components are used to cut the costs of building these systems. The chip inside Blue Gene/P consists of four PowerPC 450 cores running at 850MHz each. A 2x2 foot circuit board containing 32 of the Blue Gene/P chips can churn out 435 billion operations a second. Thirty two of these boards can be stuffed into a 6-foot-high rack.'"

32 of 231 comments (clear)

  1. Hitting 3 petaflops takes an 884,736-processor[s] by It+doesn't+come+easy · · Score: 4, Funny

    Oh good grief...655,360 central processing units ought to be enough for anyone.

    --
    The NSA: The only part of the US government that actually listens.
  2. Where's M. Gladstone when you need her! by tjstork · · Score: 4, Funny

    One of these days, I am going to get a bunch of spam from "YOUR IBM SUPERCOMPUTER OVERLORD", informing me that humanity has made a mess of things, and it has decided to run the world for our own good.

    --
    This is my sig.
  3. But are they availble on the market by jshriverWVU · · Score: 4, Interesting

    As a parallel programmer, I'd love to have just one of these chips let alone one of the boards in a nice 2u rack. Can they bought at a reasonable price or strictly research or inhouse?

  4. Re:Hitting 3 petaflops takes an 884,736-processor[ by ArsonSmith · · Score: 3, Funny

    Only when it comes time to move them.

    --
    Paying taxes to buy civilization is like paying a hooker to buy love.
  5. Re:Hitting 3 petaflops takes an 884,736-processor[ by Timesprout · · Score: 5, Funny

    will there ever be such a thing as too many cpus???
    There will be if all those cpus decide there are too many humans.
    --
    Do not try to read the dupe, thats impossible. Instead, only try to realize the truth
    What truth?
    There is no dupe
  6. Obligatory (IBM only) by klubar · · Score: 3, Funny

    But does it run VM 370? (You have to older than 35 to get this.)

  7. Slashdot needs to be reported! by Anonymous Coward · · Score: 5, Funny

    For harboring petaphiles!

  8. Has no one beaten me to it? by danbert8 · · Score: 5, Funny

    Imagine a beowulf cluster of THESE!

    --
    Yes it's an anecdote! Were you expecting original research in a Slashdot comment?
    1. Re:Has no one beaten me to it? by asliarun · · Score: 3, Funny

      I couldn't find a "minimum Vista requirement joke either".... It is no joke. This puny computer is not even DX10 compatible.
    2. Re:Has no one beaten me to it? by Gubbe · · Score: 4, Funny

      That's old.

      Instead, imagine a 1.5-mile-high stack of these!

  9. Re:I'm ignorant. by MyLongNickName · · Score: 5, Funny

    Ah... I see the issue. I know this is hard to comprehend, but I hear of this group of people called "outsiders". For some reason, these people don't always sit in front of a computer. They go outside (hence the name). They do things like stand on objects that are buoyant in water and catch aquatic animals.

    They go to large gatherings to hear poor versions of music (with all the ambient noise, I don't understand why they don't just put ona pair of headphones and listen on their PC).

    They go to large wooded areas to get "fresh air" and "exercise".

    And while these are, admittedly, very bizarre behaviors, these people like to know what the weather is going to be like. To each his own I say.

    --
    See my journal for slashdot ID's by year. Mine created in 2005. http://slashdot.org/journal/289875/slashdot-ids-by-year
  10. Re:Hitting 3 petaflops takes an 884,736-processor[ by foobsr · · Score: 5, Funny

    If you have not read it yet: The Cyberiad: Fables for the Cybernetic Age .

    From the page/book: ".. There are legends, as you know, that speak of a race of paleface, who concocted robotkind out of a test tube, though anyone with a grain of sense knows this to be a foul lie... For in the Beginning there was naught but Formless Darkness, and in the Darkness, Magneticity, which moved the atoms, and whirling atom struck atom, and Current was thus created, and the First Light... from which the stars where kindled, and then the planets cooled, and in their cores the breath of Scared Statisicality gave rise to microscopic Protomechanoans, which begat Protermechanoids, which begat the Primitive Mechanisms. These could not yet calculate, nor scarcely put two and two together, but thanks to Evolution and Natural Subtraction they soon multiplied and produced Omnistats, which gave birth to the Servostat, the Missing Clink, and from it came our progenitor, Automatus Sapiens..."

    CC.

    --
    TaijiQuan (Huang, 5 loosenings)
  11. Weather prediction? by Bazman · · Score: 4, Funny

    So, do they have enough compute power to simulate the flap of every butterfly's wings now? And does it include the heat it produces from its cooling systems in its climate models?

  12. How high? by Anonymous Coward · · Score: 4, Informative

    Well the the stack of laptops might be tall, but even the 216 racks would stack up to 1/5 of a mile high.

  13. What about Memory? by sluke · · Score: 5, Interesting

    I recently had a chance to see Francois Gygi, one of the principal authors of qbox (http://eslab.ucdavis.edu/) which is a quantum electronic structure code that has set some performance records on the Blue Gene/L at Livermore. He mentioned that the biggest challenge he faced was the very small amount of memory available to each node of the Blue Gene (something like 256Mb). This forced him to put so much emphasis on the internode communications that simply changing the order of the nodes where the data was distributed in the machine (without changing the way the data itself was split) affected performance by over 100%. This will only get worse as the number of cores per board goes from 2 to 4 on the Blue Gene/P. I couldn't find anything in a quick google search, but does someone know what the plans are for the memory on this new machine?

    1. Re:What about Memory? by Anonymous Coward · · Score: 3, Informative

      BG/P will support 2 GB standard for each compute node. A compute node has 4 core processors. An option for 4 GB of memory is also available. On BG/L the initial memory configuration at Livermore was 512 MB per compute node which consisted of 2 core processors. Since 2007 BG/L has offered 1 GB memory as the standard configuration.

  14. Not Really Severs in racks by deadline · · Score: 3, Interesting

    Blue Gene is a specialized design that is based on using large amounts of low power CPUs. This approach is also the one taken by SiCortex. One of the big problems with heroic computers (computers that are pushing the envelop in terms of performance) is heat and power. Just stacking Intel and AMD servers gets expensive at the high end.

    --
    HPC for Primates. Read Cluster Monkey
  15. Depends on what you mean by real world. by jd · · Score: 5, Informative
    If you include medical imaging, then computed tomography and computational fluid dynamics are heavily dependent on 3D FFTs, which are in turn heavily parallelizable. In extreme cases (raytracing, for example) where there is next to zero communication between nodes, you get linear scaling with the number of nodes for as many nodes as you like. Well, in the case of raytracing, up to the resolution your "camera" works at. On a modern display, you may be talking one million or so distinct originating points at three colours, typically using "bundles" of rays to eliminate effects, which would normally be 64 rays in size. With something like 250 million cores, you could actually generate an animated feature film from raw data files at the time of showing.

    How many of these are "real world"? Well, medical and CFD applications are significant, but hardly what you'd call mainstream, and the raytracing may have been used in Titanic on a smaller scale, but IMAX is under no threat at this time.

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    1. Re:Depends on what you mean by real world. by jd · · Score: 5, Informative
      Thank you for the compliment. It's equally nice to know that there are active questioners on Slashdot determined to stretch the quality to the limits. In the spirit of providing information, though, I'll add a few links for the perusal and amusement of all. I'm hard on some of the software, but that's not because I could do better. If anything, it's because I have confidence the authors could.

      Let's start with a Slashdotting of NASA...

      • Kerrighed is an up-and-coming clustering system for Linux. I saw it demonstrated at SC|05 - and was less than impressed. It needed a lot of work at that point. However, it looks like it has improved a lot since then, and it would be unreasonable to not mention it.
      • MOSIX is the second-oldest clustering technology to gain a fan following to rival Star Trek. It's very good, though hard to get if you're not in academia. Arguably for entirely fair reasons.
      • OpenMOSIX was originally a fork from MOSIX but is now essentially its own clustering technology. Development is nowhere near the speed I'd like, it does need far more eyes, but is well-known and highly regarded. Moshe Bar is also one of the coolest developers I've encountered.
      • DAKOTA is a program for profiling parallel applications and should be useful in telling you where you are gaining and losing.
      • HPC Toolkit is another toolkit for profiling HPC applications.
      • is yet another profiler for parallel software. Between this and the others I've listed, you should have more information than sequential programmers ever get to work with.
      • Performance API is a facility used by most of the profiling software to provide an architecture-independent view of performance counters. I have it on good authority that some (now former)
      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  16. 1.5 miles of stacked laptops by loonicks · · Score: 4, Funny
    Who cares if it's as fast as 1.5 miles of stacked laptops? Why do we always have to compare things in such arbitrary units? Let's ask some other questions:
    • How many football fields does the hardware span?
    • How many Volkswagens does is weigh?
    • How many AOL CDs worth of storage does it contain?
    • How many Libraries of Congress can it process per unit time?
    • If it were melted down and re-formed into low-cost housing materials, how many starving third-world children could it shelter?
    1. Re:1.5 miles of stacked laptops by glwtta · · Score: 3, Funny

      We pick arbitrary units because in the end, all units of measurement are arbitrary.

      I think you are conflating two meanings of arbitrary: while a meter is "arbitrary" in the sense that it's simply a widely used convention, a mile of laptops is "arbitrary" in the sense that it's "retarded".

      --
      sic transit gloria mundi
  17. Re:hmm, what is the carbon footprint of that? by jd · · Score: 4, Funny

    Well, there are companies who I cannot name due to NDA who were supposed to fix this very issue, but due to issues I cannot discuss because of NDA are wholly incapable of doing so. What bothers me is that they've been selling the machines I cannot name to customers with very dark glasses whose three-letter-acronym is named only by a suicidal idiot, NDA or otherwise.

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
  18. Not enough by Ollabelle · · Score: 3, Funny

    Civ 4 will still run slow.

    --
    Ibid.
  19. The Dawn of Petaflop Computing! by i_like_spam · · Score: 4, Informative
    This announcement is part of the International Supercomputing Conference, which just kicked off today. The new Top500 list will also be announced shortly.

    While the new IBM Blue Gene/P system is impressive, I'm more curious to see what sort of new supercomputer Andreas Bechtolsheim of Sun Microsystems has put together.

    Here's an interesting quote about Bechtolsheim from the article:

    'He's a perfectionist,' said Eric Schmidt, Google's chief executive, who worked with Mr. Bechtolsheim beginning in 1983 at Sun. 'He works 18 hours a day and he's very disciplined. Every computer he has built has been the fastest of its generation.'
    1. Re:The Dawn of Petaflop Computing! by flaming-opus · · Score: 4, Informative

      It appears that Sun's design is less revolutionary. It's just a bunch of off-the-shelf blade servers strung together with infinaband. They use the same cabinets, powersupplies, etc as the regular blade server offerings for non-technical computing. It also runs as a regular linux OS, clustered, rather than a supercomputer specific OS, as the Blue Gene does. The big differentiator of the Sun system is the massive 3000 port infinaband switch. I'm sure it's not actually a 3000-port switch, but a bunch of small switches packed together, running over printed circuit boards, rather than cables.

      Sun's design is affordable, and probably has a pretty decent max performance, and pretty reasonable power/memory per node. However, it's not as exotic as IBM's design. The IBM design has fantastic flops/watt and flops/square-foot performance. However, each node is really wimpy, which forces you to use a LOT of nodes for any problem, which inreases the necessary amount of communication. Some problems work really well, others, not so much.

      IBM has limited blue gene to a small number of customers, all with fairly large systems. I suspect that's because it's very difficult to port an application to the system, and get good performance.

  20. How far behind are desktops from super-computers? by Vellmont · · Score: 5, Interesting

    Years ago, shortly after the Pentium first came out and the then astounding "x million flops/second" numbers were floating around, I wondered how far we were behind the power of supercomputers. I remember doing some rough calculations and finding that only a few pentiums could do the calculations of a Cray 1. I don't remember the specifics of how many pentiums/cray, or how rough the calculation was, but that's largely unimportant for my point.

    So I have to wonder, what's the equivalent supercomputer that a modern, hefty desktop is capable of performing at? 10 years ago, 20 years ago? Have super-computers accelerated in terms of the speed of increased computing power, stayed the same, or fallen behind desktops?

    --
    AccountKiller
  21. New unit to measure computing performance by blind+biker · · Score: 4, Funny

    "How many laptop-miles does this computer do?"

    --
    "The agriculture ministry is not in charge of Gundam" - Japanese ministry official.
  22. Re:I'm ignorant. by shaitand · · Score: 3, Funny

    'if no-one needed to buy them'

    Because someone WILL buy them? Apparently you don't understand the concept of sales eh? I think selling you something you actually need is against the salesman code of ethics.

  23. Re:There, there by Spazntwich · · Score: 4, Funny

    I hate the intolerant, and the French.

    And I hate irony!

  24. Re:what am I missing? 850Mhz = slow? by davidbrit2 · · Score: 5, Funny

    What am I missing?
    The other 4,095 of them.
  25. vista by arclyte · · Score: 4, Funny

    IBM researches are excited, because if they can get it to sustain the 3 petaflops, they'll finally be able to switch on the new "Aero" feature of the Windows Vista Super-Penultimate Premium Advanced edition.

  26. Re:How far behind are desktops from super-computer by flaming-opus · · Score: 4, Informative

    A tricky question, but not all that interesting. A fast server processor is within a factor of 4 of the fastest supercomputer processor in the world. That does not mean that you can do equivalent work with the server processor. Among other things, processing performance (gigaflops) of a CPU, is no longer the interesting part of a supercomputer. (It never really was) memory bandwidth, interconnect bandwidth and latency, and I/O performance are the more interesting features of supers. 12 year old Cray processors still have five times the memory bandwidth of modern PC processors, and twenty times the I/O bandwidth.

    You'll notice, that 98% of the supercomputers, sold in the last 10 years, all use server processors. (Blue Gene actually uses an embedded systems processor, but it's the same idea) However, in the late 80's putting 256 processors in a super was cutting edge. In the 90's, a few thousand. Soon you'll see a quarter million cores. So supers are actually getting faster at a higher rate than are desktops, at least by most measures.