SGI & NASA Plan 10240-Processor Altix Cluster
green pizza writes "NASA has announced plans to cluster twenty 512-processor Silicon Graphics Inc Altix supercomputers connected to a 500-terabyte SGI InfiniteStorage SAN. The Altix uses Itanium2 CPUs running Linux atop an Origin 3000-derrived architecture. NASA and SGI scaled Linux to 512 CPUs late last year. There are also strong hints that SGI plans to bring its clustered ATI graphics to Altix in the near future. Lots of neat big iron project on the horizon!"
Good luck SGI, the Valley is rooting for its former star, and so are a lot of stock speculators.
This is great news for intel. They will double the number of itanics shipped in a single deal!
yes, for sure. they bought a congressman to make this happen. (no joke, trust me.)
and as usual, real science at nasa is going to suffer for a waste money on unneeded computing capacity just so the US can prove we have a bigger dick than the japanese.
-pissed off nasa worker
In a wonderful book "Homo Zapiens" by Victor Pelevin, the leaders of the world are rendered on clusters of SGI machines by a secret organization. Makes you wonder when you hear about these clusters :)
The reason? The License. While BSD License really is the most free, it would allow IBM to put a lot of effort into it, and then have MS swope in, modify it, and sell with a sorts of closed APIs, etc.
In essence, the BSD license would allow the creation of another Unix model where the core is identical or just similar, but the APIs would be used to lock users in. How would that solve IBM's problem? Or for that matter any Hardware vendors problem? It would not.
Finally an answer that doesn't involve ranting and raving about GPL/freedom/blah blah blah. Thanks for the simple common-sense answer to this question I wondered myself.
Just curious. My guess is that Intel keeps pumping money into SGI to get Altix systems out and those who have them (LLNL and ...?) got them at practically no charge to run Linpack and look good on the Top500 list.
I'm glad to see that SGI has regained its legs and is back in the high-end computing market again. The gamble they made in embracing Linux has paid off. Other folks had counted them dead because they came to the WinNT game late and were, therefore, fated to be high-priced integrators. Their days were numbered by the low-end market forces like Dell and HP.
Now we see that there is a market for high-priced integrators as long as the underlying technology fits the market segment you target.
"Rocky Rococo, at your cervix!"
Agent Green:
Cheaper? Not likely, you'd have to buy the high-speed interconnect to make it worthwhile. And the Opterons perform fairly poorly in larger clusters, since they have the NUMA latency penalties locally on each node. Checking the Top500 list, a cluster of 256 Opteron 246 using Infiniband will perform worse than a cluster of 256 Xeon 2.8GHz using Infiniband. The scariest example is that a cluster of 256 P4's@3GHz using Gigabit Ethernet outperforms the Opteron cluster.....
Important to note is that the Linpack test doesn't stress the interconnect that much. The more a task stresses the interconnect, the more the Opteron cluster will be penalized. There's one exception though, and that's the Cray Octiga Bay systems.... And if you go that route, it costs _at_ _least_ as much as an Altix system
I don't know what NASA would do with this, but I know what our group would do with it.
We always need machines. You could give me 1024 machines and I'd still need more.
For example, I study fluids currently. I may simulate 4,000,000 particles and it may take 3 weeks for my simulation to finish. If I had 10240 nodes, it may only take a day. Or perhaps I could simulate MORE particles for longer. There are all sorts of advantages to having this many machines hooked up.
One thing I can tell you for sure is that there most likely will not be *1* job that uses all of these at once. There are probably several researchers that are using it simultaneously and have a slice of the machines. Press releases like this are often time misleading because usually the CPUs are split between several jobs and researchers and research groups and what not.
Not to steal NASA's thunder -- a cluster this big is impressive.
Mike.
Mmmm......sacrelicious.