SGI & NASA Build World's Fastest Supercomputer
GarethSwan writes "SGI and NASA have just rolled-out the new world number one fastest supercomputer. Its performance test (LINPACK) result of 42.7 teraflops easily outclasses the previous mark set by Japan's Earth Simulator of 35.86 teraflops AND that set by IBM's new BlueGene/L experiment of 36.01 teraflops. What's even more awesome is that each of the 20 512-processor systems run a single Linux image, AND Columbia was installed in only 15 weeks. Imagine having your own 20-machine cluster?"
Why does it take so long to build a super computer and why do they seem to be redesigned each time a new one is desired?
It's a little like how Canada's and France's nuclear power plant system are built around standardized power stations, cookie cutter if you will. The cost to reproduce a power plant is negligble compared to the initial design and implementation, so the reuse of designs makes the whole system really cheap. The drawback is that it stagnates the technology and the newest plants may not get the newest and best technology. Contrast this with the American system of designing each power plant with the latest and greatest technology. You get really great plants each time, of course, but the cost is astronomical and uneconomical.
So to, it seems with supercomputers. We never hear about how these things are thrown into mass production, only about how the latest one gets 10 more teraflops than the last and all the slashbots wonder how well Doom 3 runs on it or whether Longhorn will run at all in such an underpowered machine.
But each design of a supercomputer is a massive success of engineering skill. How much cheaper would it become if instead of redesigning the machines each time someone wants to feel more manly than the current speed champion, that the current design be rebuilt for a generation (in computer years)?
Really, given the fact that most popular computers have enough processing power to handle anything, and the fact that clustering technology has evolved and is usable in case they aren't...what is the point in the "super computer"?
The super computer is a cluster (10k+ processors in 20 nodes).
Not all applications/computations scale by just adding computers to the cluster.
An example would be solving for z: x=84+19, y=5*3, z=x+y
The ultimate solution z is limited by the speed x & y can be solved. You can have an individual computer solve for x and another for y in parallel. But no matter how many more computers you add, none of them can solve z until x&y are solved first, and none of them would speed up the computation of x&y.
After a certain scale, you do not get benifits of parellel processing, so the only way to speed things up is to make each individual computer faster.
D6 63 0D 70 89 81 BB 8E 7B 7C 5F 5D 54 EA AB 73
Except running SETI is akin to playing Minesweeper on Columbia. Really, who gives a flying fuck about analyzing the universe's background EM noise in some pointless massive-scale sci-fi masturbatory nerd fantasy?
Do something useful, like folding@home, for fuck's sake.
Inevitably someone will say "we can finally predict the weather..." and in true Futurama Farnsworth fashion I say PSHAW! We don't even know how to properly frame the QUESTION of how to predict the weather, much less get closer to an "Answer" like "The hurricane will hit EXACTLY here, at EXACTLY this time. Only the people on these specific streets are boned."
Still, I bet I could get like 1 billion FPS on UT2004 at 3600x4800!
Seriously though, I want to see small improvements. Better, easier to grasp programming languages. More critical thinking skills taught in schools. And a cluster like this dedicated to uber-porn. I'm talking full frame, Hi Def, ggg stuff. (did I type that last part out loud?)
...But I digress. TREMBLE PUNY HUMANS!ONE DAY MY SPECIES WILL DESTROY YOU ALL!
"NASA Secures Approval in 30 Days" Knowing how govermental processes normally go, this part really seems incredible. Normally even the "fluffy" pre-study would take that long(or way more), before anyone actually sits down to discuss actual details and such. Specially the way most everything with NASA seems to be over budget and way late. It is indeed good to see that there is still some hope, so lets hope they get the procurement prosesses in general more working.
Yes of course the answer is 42. This computer was built to find out what the question is.
Weather prediction, it turns out, is *not at all* like playing chess. Chess is a deterministic linear process operating on rigid, unchanging rules. There is always a "best move" for every board state, which a sufficiently fast and capacious database could search for. Weather is chaotic, a nonlinear process. It feeds back its state into its rules, in that some processes increase the sensitivity to change of other simultaneous processes. Chaos cannot be merely "solved", like a linear equation; it must be simulated and iterated through its successive states to identify more states.
Of course, we're just getting started with chaos dynamics. We might find chaotic mathematical shortcuts, just like we found algebra to master counting. And studying weather simulation is a great way to do so. Lorenz first formally specified chaos math by modeling weather. While we're improving our modeling techniques to better cope with the weather on which we depend, we'll be sharpening our math tools. Weather applications are therefore some of the most productive apps for these new machines, now that they're fast enough to model real systems, giving results predicting not only weather, but also the future of mathematics.
--
make install -not war
Wonder why they run open source instead of proprprietary operating system on this? Maybe the multitude of answers to that question can show you why it can be considered open source victory.
What new private space industry? Spaceship One, for example, reached space. That's a long way from being able to do anything useful in space. They were nowhere near orbital velocity, for example. We're still many years, if not decades, away from private industry being able to take over NASA's near-earth space role.
You don't live somewhere that gets hurricanes, do you? 'Cause scientists can already "potentially predict hurricane paths a full five days before the storms reach landfall." Hell, I can do that. A freakin' Magic 8 Ball can potentially do that.
Maybe they're trying to say something about doing it with a better degree of accuracy, or being right more of the time, or something like that, but it doesn't sound like it from that quote.
"Hey, guys, look at this life-sized computer-generated stripper I'm rendering in real-ti... oh, what? Um, tell the reporter we think it'd be good for hurricane prediction."
Village idiot in some extremely smart villages.
Good job NASA? Yeah I'd agree. But what about good job SGI? Why does SGI always seem to have bad marketing and not get the press/praise they deserve?
This is an SGI system. SGI has laid out plans for terascale computing (stupid marketing speak for huge ccNUMA systems) a while ago. I'm sure NASA and SGI worked together but this is essentialy an 'Extreme' version of an off-the-shelf SGI system.
Like what? Go out and look up SPEC results next time you're bored. I think you'll find that I2 is quite a bit more capable than you make out. IBM's dual-core POWER5 is just about the only thing out there that's even close to (a single-core) I2 in FP performance, and Opteron isn't even in the game at that level.
Is it a commercial failure? Probably, but so was Alpha - commercial success is not an indicator of actual performance.
ABSURDITY, n.: A statement or belief manifestly inconsistent with one's own opinion.
Umm, not true. Sun, can hold up to 106 processers in its Sunfire 15K product, or 72 dual-core processors in the E25K.
SGI's Origin systems are equally large I believe. And manufacturers like IBM also have large SMP machines.
There's a difference between SMP and NUMA used in the big iron. SMP is normally a shared bus or switch topology with the processors connected to each other with little or no arbitration logic. So if you get above 4 you normally max out the busses as the CPUs try to figure out who's doing what and what instruction comes next. NUMA architecture is somewhere between SMP and clustering. The SGI boxes use c-bricks of 4 CPU's and I think 8GB of RAM. Each c-brick is connected to one or more routers via craylink cables. Get enough of these together and you've got your 512 CPU monster. Sun uses the same idea, but is unfortunatly a LOT slower with their interconnect technology. I've seen 16x SMP boxes before, but they really didn't scale at all. Anything over the standard 4-8 SMP is a waste of CPU's and money.