Slashdot Mirror


IBM Sets Supercomputer Speed Record

T.Hobbes writes "IBM's BlueGene/L has set a new speed record at 36.01 TFlops, beating the Earth Simulator's 35.86 TFlops, according to internal IBM testing. 'This is notable because of the fixation everyone has had on the Earth Simulator,' said Dave Turek, I.B.M.'s vice president for the high-performance computing division. The AP story is here; the NY Times' story is here."

56 of 308 comments (clear)

  1. Damn, by scorp1us · · Score: 3, Funny

    I wish I knew what a Tecord was...

    Maybe /. shoud be using automatic text-box spell checking found in KDE...

    --
    Slashdot's rate-of-post filter: Preventing you from posting too many great ideas at once.
    1. Re:Damn, by sgant · · Score: 4, Funny

      Imagine a Teowulf Tluster of these!

      --

      "Leo Fender was in a 'state of grace' when he designed the Stratocaster." -- Paul Reed Smith
    2. Re:Damn, by cybergrue · · Score: 3, Funny
      It gets better.
      Due to the slashdot bug in Firefox, the second line of the summary reads

      eating the Earth Simulator

      The b was hidden under the dark green of the sections field on the upper left.
      Now thats an impressive feat of computing power.

    3. Re:Damn, by jayayeem · · Score: 3, Funny

      Problem is, running a spell checker would have used .16 teraflops of the machine's capacity and cost it the record.

      --
      I metamoderate, therefore I am
  2. Tecord? by holzp · · Score: 5, Funny

    Is that how they measure Records in Teraflops?

    1. Re:Tecord? by hackstraw · · Score: 4, Informative
      I'm all for improving technology, but how do they verify their "tecords"?

      The top500 tecords are submited on an honor system. Most of the systems are thrown together with known processors and interconnects where the tesults should "make sense". Also, the systems teport their theoretical max performance and a measured tesult. It would be pretty hard to fudge a score for the top500 by much without many people questioning it. From this page the top500 people say:
      While we make every attempt to verify the results obtained from users and vendors, errors are bound to exist and should be brought to our attention.
      Its kinda like any tesearch field. Most people are honest, but anomolies can and do happen, and they are usually found out by others in the field. Two of the most tecent scientist scandles involved the guy from Bell labs, Hendrik Schön, who was found falsifying data, and he was fired, and I believe that he also lost his PhD. The other is from the US government funded tesearch on MDMA by George Ricaurte. Although I believe that nothing really happened in the Ricaurte case.
  3. It's a mew tecord! by JPelorat · · Score: 5, Funny

    Someone call Huinness' Nook pf Eorld Tecords!

    --
    Hokey statistics and ancient misconceptions are no match for a good thought in your head, kid!
  4. Can't Compare to my Windows by kai.chan · · Score: 5, Funny

    I'm sorry, but those Supercomputers have nothing on my machine running Windows. It has a record of AlwaysFlops.

  5. Re:Tecord == Record? by VistaBoy · · Score: 4, Funny

    I'd say rypo.

  6. But could it keep up with /.? by richy+freeway · · Score: 2, Funny

    It might be fast, but could it keep up with monitoring all the errors and dupes on /.? ;P

  7. Tecord? by el_benito · · Score: 5, Interesting

    A new tecord?!? That's timpossible! But more seriously, does anyone know if there's an impartial 3rd party that ever confirms these measurements? I'm all for improving technology, but how do they verify their "tecords"?

    --
    http://liquidben.com - Aspiring to an 'under construction' gif
  8. Full Text of Atticle by The-Bus · · Score: 5, Funny

    Hete's the full text in case of a massive slashdotting of theit setvets:

    IBM says Blue Gene bteaks speed tecotd
    9/29/2004, 7:27 a.m. ET
    By ELLEN SIMON
    The Associated Ptess

    NEW YOtK (AP) - IBM Cotp. claimed unofficial btagging tights Tuesday as ownet of the wotld's fastest supetcomputet.

    Fot thtee yeats tunning, the fastest supetcomputet has been NEC's Eatth Simulatot in Japan.

    "The fact that non-U.S. vendot like NEC had the fastest computet was seen as a big challenge fot U.S. computet industty," said Hotst Simon, ditectot of the supetcomputing centet at Lawtence Betkeley National Lab in Califotnia.

    "That an Ametican vendot and an Ametican application has won back the No. 1 spot -- that's the main significance of this."

    Eatth Simulatot can sustain speeds of 35.86 tetaflops.

    IBM said its still-unfinished BlueGene/L System, named fot its ability to model the folding of human ptoteins, can sustain speeds of 36 tetaflops. A tetaflop is 1 ttillion calculations pet second.

    Lawtence Livetmote National Labotatoty plans to install the Blue Gene/L system next yeat with 130,000 ptocessots and 64 tacks, half a tennis coutt in size. The labs will use it fot modeling the behaviot and aging of high explosives, asttophysics, cosmology and basic science, lab spokesman Bob Hitschfeld said.

    The ptototype fot which IBM claimed the speed tecotd is located in tochestet, Minn., has 16,250 ptocessots and takes up eight tacks of space.

    While IBM's speed sets a new benchmatk, the official list of the wotld's fastest supetcomputets will not be teleased until Novembet. A handful of scientists who audit the computets' tepotted speeds publish them on Top500.otg.

    Supetcomputing is significant because of its implications fot national secutity as well as such fields as global climate modeling, asttophysics and genetic teseatch.

    Supetcomputing technology IBM inttoduced a decade ago has evolved into a $3 billion to $4 billion business fot the company, said Simon.

    Unlike the mote specialized atchitectute of the Japanese supetcomputet, IBM's BlueGene/L uses a detivative of commetcially available off-the-shelf ptocessots. It also uses an unusually latge numbet of them.

    The tesulting computet is smallet and coolet than othet supetcomputets, teducing its tunning costs, said Hitschfeld. He did not have a dollat figute fot how much lowet Blue Gene's costs will be than othet supetcomputets.

    Howevet, othet supetcomputets can do things Blue Gene cannot, such as ptoduce 3-D simulations of nucleat explosions, Hitschfeld said.

    --

    Small potatoes make the steak look bigger.

  9. Place your bets by glpierce · · Score: 5, Insightful

    Place your bets, people!

    What percentage of posts in the first 15 minutes will be about the spelling of the last word in the title, and what percentage about the content?

    --
    G
  10. BlueGene? Deep Blue? by fib2004 · · Score: 3, Funny

    "I want to play chess against that one" - Kasparov

    --
    Would it not be easier in that case for the government to dissolve the people and elect another? - Bertold Brecht
  11. I read all three articles but couldn't find... by halivar · · Score: 2, Interesting

    ...what operating system it uses. Anybody know?

    1. Re:I read all three articles but couldn't find... by joib · · Score: 4, Informative

      ...what operating system it uses.


      It's a sort of two layer system. The compute nodes (2 cpus per compute node) run a IBM proprietary, very small and simple, kernel. 64 compute nodes are managed by an i/o node running Linux.

    2. Re:I read all three articles but couldn't find... by Harbinjer · · Score: 2, Interesting

      Actually I do think its linux. I live in Rochester and know some of the IBMers.

      I wonder if they let normal people see this thing? I'll ask.

    3. Re:I read all three articles but couldn't find... by BlurredOne · · Score: 5, Informative

      A quick Google search has netted the following: OS - Linux, HPK (High Performance Kernel) Complilers - Fortran95, C99, C++ Math Library - a subset of ESSL If you would like to read the article, it can be found at http://www.llnl.gov/asci/platforms/bluegene/talks/ gupta.pdf

    4. Re:I read all three articles but couldn't find... by bhima · · Score: 5, Informative
      I've been facinated with this thing ever since I discovered it was using processors I could actually write assembly (and C) for. Each node is running an embedded linux kernel.

      Here's a bit more: each node has 2 cpus and 4 fpus, custom non-preemptive kernel

      application program has full control of all timing issues kernel and application share same address space

      kernel is memory protected

      kernel provides: program load / start / debug / termination file access all via message passing to IO nodes

      I could go on and on but it's all on Blue Gene's site http://www.research.ibm.com/bluegene/index.html

      I can't resist adding that GCC won't use the second FPU on each die...

      --
      Nothing in the world is more dangerous than sincere ignorance and conscientious stupidity.
    5. Re:I read all three articles but couldn't find... by joib · · Score: 4, Insightful


      Each node is running an embedded linux kernel.


      No.


      each node has 2 cpus and 4 fpus, custom non-preemptive kernel


      I see a contradiction with your previous statement here... :) Luckily, you got it right this time.

      As I said in my comment above, the compute nodes run an IBM proprietary kernel (apparently the kernel you're describing), and every 64 compute nodes are managed by an i/o node running Linux.


      I can't resist adding that GCC won't use the second FPU on each die...


      So what's the problem? It's not like anybody who could afford a highly specialized and expensive machine like this one couldn't afford to shell out some $$$ for xlf.

      Anyways, I'm sure that if this modified PPC core gets popular outside multi-million dollar supercomputers, the gcc team will figure out how to utilize the second FPU.

    6. Re:I read all three articles but couldn't find... by bhima · · Score: 2
      OK now that I'm not a work lets try this again as I misspoke in my first post and then totally misunderstood your reply.

      The example Blue Gene/L implementation begins with dual PPC440 chips (each die with dual FPUs), A Compute Card contains 2 of these chips, some number of Link Nodes and 512 meg of DDR RAM. There are 16 Cards to a Compute Node. Each Compute Node contains a IO Node. There are 32 Compute Nodes to a cabinet.

      Each Compute Card runs "CNK", an IBM in house kernel written in C++ and is connected to an IO Node

      Each IO Node runs Linux, is networked to the outside via Ethernet and the inside network is in a tree configuration.

      The interesting network configuration are handeled by the link nodes

      And to top it all off there is a JTAG connection to each die.

      So Blue Gene /L is running both some home rolled thing of IBM's (CNK) and Linux simultaneously. They also use both XLC & GCC for the Compute Nodes and IO Nodes respectively.

      What I also find interesting is that IBM is pitching these things to the "hundreds of TF/s" folks but also to the "ten TF/s" folks (as in just one or two cabinets).

      --
      Nothing in the world is more dangerous than sincere ignorance and conscientious stupidity.
  12. 36 TFlops ? by MadX · · Score: 4, Interesting

    I wonder if that is sustained ??
    I know that when the Mac G5 Cluster was developed they claimed tremendous speed, but when the sustained rate was calculated, it turned out to be much lower ...

    1. Re:36 TFlops ? by Henriok · · Score: 4, Informative

      The peak of VTs System X cluster was about 17 Tflops, and the sustained rate was just over 10 (which rendered it the third place on the Top500 list). This peak/sustained ratio is significantly less that Earth Simulator's 90% efficiency, but compared to the cost it's extremely cost effective. ES cost 100 times more but have just 3 times higher sustained rate.

      --

      - Henrik

      - when the Shadows descend -
    2. Re:36 TFlops ? by kfg · · Score: 2, Informative

      I have no idea of what you mean by sustained.

      He is refering to the fact that horsepower has a time componant. It's only in rare conditions that you're interested in the instantaneous force a horse can apply. What you want to know is how much work you can get out of it per day.

      A cheetah may be able to sprint to 100 kph, but I'll out distance it in 10 minutes driving my car at only 80 kph.

      Human hunters on foot can only run about 15 kph, but can chase down large prey that can run 65 kph, because the human can run at 15 kph all day, day after day, but the prey animal can only run at 65 kph for a very limited time before becoming exhausted and needing to stop.

      The amount of time you are expected to perform some function in is a critical parameter.

      Running a 4 minute mile is one thing, but there's a word for someone who can run 26 of them back to back:

      Loser.

      KFG

    3. Re:36 TFlops ? by the_2nd_coming · · Score: 2

      they had to take down their system while they upgraded to Duel 2.3 GHz Xserves with ECC memory. the system is up and running now, and last month they were running simulations for the military.

      --



      I am the Alpha and the Omega-3
  13. With a 95% confidence level by gutterandthestars · · Score: 3, Funny

    98% of posts will be 0.4 standard deviations away from one of the following:

    0. "fist pr0st!!!!!111~"
    1. "Imagine a beowulf cluster of these!"
    2. "But does it run Linux?"
    3. "In Soviet Russia, SPEED RECORD SETS YUO!"
    4. "1. Earth Simulator: 38.56 TFlops. 2. BlueGene/L36.01 TFlops. 3. ... 4. PROFIT!!!
    5. "I for one, welcome our supercomputer overlords."
    6. "Do either of the supercomputers run BSD? BSD is dying."
    7. "I didn't have enough time to read the article, but..."

  14. Huh? by attam · · Score: 2, Interesting

    From TFA:
    the Blue Gene/L system next year with 130,000 processors and 64 racks, half a tennis court in size.

    The prototype for which IBM claimed the speed record is located in Rochester, Minn., has 16,250 processors and takes up eight racks of space.

    So does this mean the finished product, with almost 10x as many procs will be much faster still? Or am I reading this wrong?

    1. Re:Huh? by CriX · · Score: 4, Informative
      YUP.

      "About IBM's Blue Gene Supercomputing Project Blue Gene is an IBM supercomputing project dedicated to building a new family of supercomputers optimized for bandwidth, scalability and the ability to handle large amounts of data while consuming a fraction of the power and floor space required by today's fastest systems. The full Blue Gene/L machine is being built for the Lawrence Livermore National Laboratory in California, and will have a peak speed of 360 teraflops. When completed in 2005, IBM expects Blue Gene/L to lead the Top500 supercomputer list. A second Blue Gene/L machine is planned for ASTRON, a leading astronomy organization in the Netherlands. IBM and its partners are currently exploring a growing list of applications including hydrodynamics, quantum chemistry, molecular dynamics, climate modeling and financial modeling."

      -from the IBM website

      --
      Moderation: +1 pwnage
    2. Re:Huh? by AchilleTalon · · Score: 2, Funny
      Just mean next year you will see this kind of announce on eBay:

      *** NIB * BlueGene prototype ***

      --
      Achille Talon
      Hop!
  15. Off the shelf configuration by erick99 · · Score: 3, Interesting
    From the article:

    Unlike the more specialized architecture of the Japanese supercomputer, IBM's BlueGene/L uses a derivative of commercially available off-the-shelf processors. It also uses an unusually large number of them. The resulting computer is smaller and cooler than other supercomputers, reducing its running costs, said Hirschfeld. He did not have a dollar figure for how much lower Blue Gene's costs will be than other supercomputers.

    This is the most interesting part of the article to me. Makers of supercomputers are going to go back and forth for the speed record. However, holding the speed record with off the shelf components seems like a separate achievement in and of itself. The article did mention, however, that the IBM system is not as capable as other supercomputers.

    --
    http://www.busyweather.com/
    1. Re:Off the shelf configuration by TimeZone · · Score: 3, Informative

      Off the shelf is used loosely here. The BlueGene processors are indeed custom, but they happen to be based on the PowerPC 440 processors. That is, if you go buy a machine with a PowerPC 440 cpu, it's not exactly the same as what's in BlueGene. It is mostly the same though. What is pretty interesting is that each of the cpus is pretty paltry (the DD1 chips run at 500MHz, and the DD2 chips run at 700MHz), but the overall architecture seems to scale pretty well.
      TZ

  16. The most interesting part: by onetrueking · · Score: 5, Interesting

    From the NYTime article:

    "The new system is notable because it packs its computing power much more densely than other large-scale computing systems. BlueGene/L is one-hundredth the physical size of the Earth Simulator and consumes one twenty-eighth the power per computation, the company said."

    1/100th the size and 1/28th the power. Now if that isn't a beautiful thing, I don't know what is.

  17. More detail by erick99 · · Score: 3, Informative

    For a great deal of detail about this system surf over to this pdf

    --
    http://www.busyweather.com/
  18. What for? by RCulpepper · · Score: 3, Funny

    From the Washington Post article:

    "IBM's new system nudges past a nearly three-year-old computer speed record of 35.86 "teraflops," or trillions of calculations per second, with a working speed of 36.01 teraflops....The current record-holder, known as the Earth Simulator, is a supercomputer in Yokohama, Japan, designed to simulate earthquakes."

    Won't it be great when IBM announces that they built Blue Gene to simulate Japanese earthquakes? Neener neener.

    --
    Always a godfather; never a god. -Gore Vidal
  19. Smart machines by fionbio · · Score: 5, Interesting

    I've heard that the neural network of human brain has calculation speed of 4.4 TFLOPS. How soon these machines will start to THINK? Seems like what we need now is just more storage capacity and some well-written "thinking" software...

    1. Re:Smart machines by Doesn't_Comment_Code · · Score: 4, Interesting

      You're getting into some pretty deep issues now. Can a computer ever think? How would we know if it was thinking? At what point does the computer start thinking instead of just following instructions. No matter how complex it's instructions are or how fast it executes them, isn't it still just following instructions? What about us? Are we just following instructions?

      Timeout-- my head hurts.

      Which brings me to my next point. If computer ever could think, it would eventually start to think about how it thinks... And then it would overheat or explode.

      --

      Slashdot Syndrome: the sudden, extreme urge to correct someone in order to validate one's self.
  20. way to catch up guys. by flaming-opus · · Score: 4, Informative

    I'll be very interested in seeing how well this thing performs on benchmarks other than linpack.

    Blue Gene is a very interesting design in so much as it uses IBM's 32-bit powerpc cores, normally used for embeded applications. They put 2 cores on a die, and integrated a memory controller, as well as the 4 different interconnect networks. The cores are only clocked at about 800mhz, and are thus pretty wimpy individually. However, that can be good. Since the processor cores are quite modest, the ratio of memory bandwidth to CPU flops is quite high. Similarly the ratio of interconnect bandwidth to CPU flops is also very high. Thus the CPUs should run very efficiently on problems that will parallelize to thousands of cpus. Some problems, on the other hand, will perform terribly. I expect a lot of this system's performance depends on the scalability of the system software, and the compilers / libraries.

    That said, the earth simulator is also really good at some applications, and not so good at others. Instead of 16,000 small CPUs, it uses 5000 massive vector CPUs. Each is clocked at only 500mhz, but has 8 parallel execution pipes, and about 50GBytes/sec of memory bandwidth. Problems that don't vectorize run through the very modest 500mhz scalar unit.

    Earth simulator has realized a large percent of it's theoretical peak performance on real world simulations (often up to 50%) while most large systems approach (10%). I'm looking forward to see how well utilized Blue Gene is. Earth simulator was a direct descendant from NEC's sx-series supercomputers, which have a 20 year lineage. Blue Gene is a radical departure from IBM's regular HPC product offerings, and uses a new microkernel OS rather than clustered AIX nodes. I imagine there will be some stutter-steps in the early days of this new product, which will undoubtedly work themselves out over time.

    Great work IBM.

    1. Re:way to catch up guys. by joib · · Score: 2, Interesting


      I expect a lot of this system's performance depends on the scalability of the system software, and the compilers / libraries.


      The blue gene is an all out MPI machine. System software scalability is not that crucial, since every compute node kernel only controls 2 cpu:s. With this modest number of cpu:s per node, I'd guess it doesn't require any extreme trickery from the scalability point of view to achieve near hardware performance.

      Software-wise, all the scalability problems lie in the design of the applications.

    2. Re:way to catch up guys. by flaming-opus · · Score: 2, Interesting

      That's only sort-of true. Blue Gene, like asci-red, cri t3e, paragon, etc use a microkernel OS to control the compute nodes. This is basically a couple of network stacks that allow the application to use the interconnect network, and some hooks for the larger OS which runs on dedicated OS-nodes. The microkernel mostly just gets out of the way, and lets the application run balls-out on the compute nodes. Blue Gene was even designed so cleverly that MPI barriers and all-reduce are implemented as part of the interconnect network.

      But then the application does something like write(file, offset, &buffer). That can't be handled by microkernel, and must be handled by an OS node. The system call might even be handled by a different node from the I/O node connected to the disk drive. The system call is performed by a "server" on the OS node that may be part of that node's operating system, or might be a user-space daemon. Since there is only 1 thread on the compute node, it blocks until the i/o request is serviced.

      This is not a hard thing to do if there are 60 compute nodes, 2 OS nodes and 2 i/o nodes. But with 100,000 compute nodes, there would have to be hundreds or thousands of OS nodes. Far too many to run with a monolith kernel. Scalability within this pool of OS nodes is a tricky problem. Previous MPP designs have demonstated that it's really easy to get the common case working, but much much harder for a few corner cases, like concurrent un-structured writes to the same file. (which tends to happen at the beginning and end of many big MPI programs. - Remember that you don't solve a problem any faster if the machine runs 30Tflops for 2 days, and then spends 25 days putting the output data together)

      On a machine that large, check-point / restart is a big deal. Node failures are going to be common when that many components are involved. You end up with huge amounts of data, all of which needs to be written quickly, while the machine sits idle.

      These problems are well understood. MPP designers have been wrestling with them for almost 20 years now. But any new system will have some kinks and bugs to deal with. I'm sure IBM is working hard to get them solved. Thay may have it all working already, for all I know.

      You're right though, that the performance of the inner loops depends a lot on the application developers.

  21. 36.01 What ? by JohnHegarty · · Score: 2, Informative

    http://foldoc.doc.ic.ac.uk/foldoc/foldoc.cgi?query =teraflop

  22. Re:The question is... by The_Mr_Flibble · · Score: 2, Funny

    No microsoft will install a vanila version of redhat on it and compare it to a pc with windows server then write press articles about cost to run.

  23. Virginia Tech Supercomputer by Station · · Score: 2, Funny

    The Virginia Tech Supercomputer (take 2) is due to be clocked soon, and its also a huge off-the-shelf system. I'd like to see how they compare.

    Also, I'll be big money its already been used for gaming. What college studeny could resist?

    --
    "Risc is good..."
  24. Re:What interconnect technology are they using? by joib · · Score: 5, Informative


    Did they use infiniband? Or a proprietary interconnect, perhaps?


    Proprietary. Actually, it has 3 networks, one mesh network for point-to-point communication, one tree network for collective communication and a service network for disk i/o, control, health monitoring etc. The service network is ethernet IIRC, the other two are custom.

  25. KDE spell checker? by Anonymous Coward · · Score: 2, Funny

    Wouldn't help. If they had used a KDE spell checker, it would have been changed to 'rekord'.

  26. Seen partial towers by gaylenek · · Score: 2, Informative

    I've been down in the basement of the building, see a few of the towers 1/2 loaded (at that time), along with the massive cooling system that was added to the building to keep those racks workings. Lift up a raised floor panel and the 95 LBS of me will get lifted off the ground (or so it feels).

    Sadly, all 64 racks will never be in Roch, just not enough space.

    Actually, StarTribune has one (crappy) pic of some towers.

    --
    When in danger or in doubt, run in circles, scream and shout.
  27. OK, it sounds fast, but by $RANDOMLUSER · · Score: 2, Funny

    How long does it take it to run an infinite loop?

    --
    No folly is more costly than the folly of intolerant idealism. - Winston Churchill
  28. Aliasing by PingPongBoy · · Score: 2, Informative

    When you have a machine this fast the sampler cannot keep up. As a result what you see is a total distortion of reality. You may have seen this phenomenon with wagon wheel spokes rotating backward when the cart is really rolling. Thus when you read "speed record at 36.01 TFLOPS" your eyes view the letters going backwards and forwards.

    --
    Know your pads. One time pad: good for cryptography. Two timing pad: where to take your mistress.
  29. Re:Someone please explain this to me by cyngus · · Score: 2, Informative

    Problems run on clusters have to be able to be broken down into small pieces that don't need to interact with the other pieces very much. This is so because of communication latency in such a system. Someone mentioned SETI, which runs great on a cluster, because looking at the signal from one piece of sky requires no information about the surrounding sky. However, something like a nuclear explosion, broken into pieces, requires lots of information about the surrounding environment. What's worse is that as you make the pieces smaller when simulating a nuclear explosion, your need for knowledge about the surrounding increases! Such simulations require a much more tightly coupled system, a 'traditional' super computer.

  30. Va. Tech cluster not on current Top 500 list by Troy+Baer · · Score: 3, Interesting
    The peak of VTs System X cluster was about 17 Tflops, and the sustained rate was just over 10 (which rendered it the third place on the Top500 list).

    Except that it's not on the most recent Top 500 list anywhere.

    Remember how Va. Tech replaced all 1100 G5 nodes with G5 XServes a few months ago? Well, when you do something like that, you have to rerun and resubmit the benchmark. Va. Tech were not able to get the machine back together soon enough to rerun the benchmark in time to make the last list; there's even a big caveat about it on the Top 500 home page.

    (It's also not clear that the original version of the Va. Tech machine ever did anything other than run that benchmark, but that's another matter.)

    --Troy
    --
    "My life's work has been to prompt others... and be forgotten." --Cyrano de Bergerac
    1. Re:Va. Tech cluster not on current Top 500 list by Anonymous Coward · · Score: 2, Insightful
      I've run jobs on a supercomputer (one in the top ten on the top500 list...)

      Except during the first month or so of operation there was always a queue of jobs waiting to run. In fact, applying to use that computer is almost exactly like writing a grant proposal, except that the good proposals are awarded CPU hours instead of money.

      In scientific computation, researchers can always use faster computers. The accuracy and scope of the models will expand to fill available computational resources.

  31. Thats nice for IBM but real computing power.. by museumpeace · · Score: 4, Informative

    comes from building hardware for a specific task. Unfortunately most of you can't access this little bit of nerd heaven but some incredibly cool hardware architectures are being described at the High Performance Embedded Computing conference. Sky and Mercury have some of their hottest new designs here. How about a machine that can do a 256 mega-sample FFT in real time?, or a self configuring supercomputer on a chip? Of course most of these tricks will never escape the lab except for the speed-ups for rendering engines...one place where gamers and the DOD are driving technology in a dead heat race with lots of winners. Besides, in a few months, something will come along that will go even faster than blue gene.

    --
    SLASHDOT: news for people who can't concentrate on work or have no life at all and got tired of yelling back at the TV.
    1. Re:Thats nice for IBM but real computing power.. by museumpeace · · Score: 2, Interesting

      oh, if forgot to mention: I don't think you can fit Blue Gene into any UAV unless you count running a Boeing 747 on autopilot and stuffing it to the gills with computer and diesel generators. the computers proposed at HPEC are mostly very mobile.

      --
      SLASHDOT: news for people who can't concentrate on work or have no life at all and got tired of yelling back at the TV.
    2. Re:Thats nice for IBM but real computing power.. by cyberassasin · · Score: 2, Informative

      I think you mean Sky Computer. Not the news network..... Other than that, right on....

      --
      Who is the master of foxhounds, and who says the hunt has begun? -Pink Floyd
  32. Mac Rumor Sites by Chief+Typist · · Score: 2, Funny

    So how long will it take before a Mac rumor site predicts that this CPU will be in the next PowerBook?

    -ch

  33. Re:Processor Failure. by thpr · · Score: 2, Informative
    So what happens when some of the 130000 processors fail? ...

    You deserve some credit for using "when", not "if" (IMHO)

    The system is designed to work around failure. In the original protein folding simluation, the plan was (among other things) to checkpoint the system every hour in order to be able to restart if a failure occurred. In fact, the original expectation was that a processor would fail every few days (that presentation has since been taken down by IBM... was originally named "BlueGenePublic.pdf" ... I can't find it anywhere on the 'net anymore :( ). Failure detection is through a series of ECC bits attached to most of the data transfers and calculations. The software is also specifically written to check any points where conservation is true (meaning there is redundancy in the application and calculations are checked to ensure no errors). The mesh network that others have referred to also allows nodes which are not functioning to be worked around before they are replaced.

    The processors cannot be replaced individually, but the boards (with 2 chips and memory) could be. As far as burning, the processor will often fail in a detectable way (meaning produce incorrect results) long before the device goes (literally) up in smoke. So I would expect the system would be able to disable the failed device long before its a problem. This may be part of what the Linux OS (which oversees 64 of the small processors) is doing. On a wider scale, the designers are smart enough that there are likely temperature sensors in the system that will slow (or shut down) the system if a heat problem is encountered.

  34. IBM vs. SGI by nboscia · · Score: 2, Interesting

    I wonder how this compares to the one NASA is building, which is being collaborated with Intel and SGI. Since you can't base performance simply on the number of processors, it should be interesting.