Supercomputer Repossessed By State, May Be Sold In Pieces

← Back to Stories (view on slashdot.org)

Supercomputer Repossessed By State, May Be Sold In Pieces

Posted by samzenpus on Thursday January 3, 2013 @12:05PM from the super-auction dept.

1sockchuck writes "A supercomputer that was the third-fastest machine in the world in 2008 has been repossessed by the state of New Mexico and will likely be sold in pieces to three universities in the state. The state has been unable to find a buyer for the Encanto supercomputer, which was built and maintained with $20 million in state funding. The supercomputer had the enthusiastic backing of Gov. Bill Richardson, who saw the project as an economic development tool for New Mexico. But the commercial projects did not materialize, and Richardson's successor, Susana Martinez, says the supercomputer is a 'symbol of excess.'"

8 of 123 comments (clear)

Min score:

Reason:

Sort:

Imagine... by PaulBu · 2013-01-03 12:07 · Score: 4, Funny

A Beowolf clusted of these! :)
Paul B.
Fools! by kurt555gs · 2013-01-03 12:21 · Score: 5, Funny

Think of how many Bitcoins this thing could make. Someone should tell New Mexico.

--
* Carthago Delenda Est *
Re:Symbol of excess ?? by TheGratefulNet · 2013-01-03 12:46 · Score: 4, Interesting

and supercomputers often require recoding of the 'app' so that it runs better and uses the hardware better.
when I was at SGI (and cray was still part of them) I got some time on a cray machine to run some code that I was usually running on indys and octanes. I expected a HUGE increase in speed but I saw only about 2x. my app was not broken down to be cray-friendly and so I never got any real speed out of it.
unless you go to lengths to use the SC in 'its preferred way' its a wasted and expensive resource.

--

--
"It is now safe to switch off your computer."
Re:Symbol of excess ?? by scheme · 2013-01-03 14:12 · Score: 4, Informative

My experience is it would be better to provision a cluster of EC2 boxes to run the task than build a purpose-built super computer (with some exception). One disadvantage of clustered machines is longer communication latency, so tasks that require lots of process to process communication will run slower. Many problems can be tweaked with search spaces sliced so that this latency is not a big deal.
There are huge classes of problems were you can't tweak things like this. Basically any simulation where things are large distances interact or where there is a lot of communication can't really be shoved into a cluster. For example, computation fluid dynamics (e.g. anything looking at air or water moving over surfaces), weather simulations, molecular dynamics, simulating gravity, etc. All of these types of problems will run like crap if you try to use EC2 instances for them.
Also, have you really priced out what computation and data storage on EC2 costs? There's a few studies that show that EC2 on-demand instance will cost you 2-3 times more than purchasing a comparable server even with power, cooling, and maintenance/administration factored in. See, this or this for example. EC2 is great if you want to explore certain problems and need to temporarily scale up or want the ability to scale up on demand but if you have a base level of work that you'll be doing all the time, it's much more efficient to buy your own hardware. That is doubly true if your problems need any significant amount of storage space.

--
"When you sit with a nice girl for two hours, it seems like two minutes. When you sit on a hot stove for two minutes, it
Welcome to the new America by fufufang · 2013-01-03 14:17 · Score: 4, Funny

Where science and engineering is considered as excess,but litigation/lawsuit are considered as normal.
Re:Symbol of "retarded governor" by mikael · 2013-01-03 14:23 · Score: 4, Informative

It's not going to be entirely broken up and sold as scrap. As the system is superscalar, the universities and mining institutes want to split the system into three blocks : UNM wants 10 racks, New Mexico State University want 4 racks, and New Mexico Institute of Mining and Technology would take 2 racks. They are each going to have their own physical campus space and energy consumption budgets, so no one could afford the entire system.
Look at the statistics of the system:
Type of system: SGI Altix ICE 8200 cluster
Number of racks: 28
Number of processor cores per rack: 500
Total number of cores: 14000
Processing power: 172 Trillion calculations per second
Power consumption 32 Kilowatts per cabinet (not sure if racks == cabinets, but that would mean 896 Kilowatts/hour if it were the case)
Normally, when someone requests time on a supercomputer, they put forward a funding bid, get some grant money which pays for fixed amount of time and number of cores. The administration of the system, then book in the time and schedule it with the other tasks running. If there are just a few regular customers and they each have a fixed amount of funding, then it's going to be cheaper for each of them to have their own portion of the system.
I'd imagine Intel and SGI thought they could work together to build this system, house it somewhere locally, and lease it out to whoever needed it, and gain experience with parallel processing as well as make a healthy profit, slowly gaining number of customers. Prospective customers probably freaked out at the cost of doing their processing on an external system that wasn't under their control versus running on desktop PC's with Kepler/CUDA/OpenCL systems.

--
Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
Re:Parallelism obsoleted the supercomputer by scheme · 2013-01-03 14:35 · Score: 4, Informative

There are not many problems these days that cannot be parallelized and split up to be run on a large number of off the shelf hardware. It is much easier to grow a Beowulf Cluster to add performance than redesigning to eke out every bit of capability of top-of-the-line hardware. Much easier also, to redesign your problem so that it can take advantage of parallelism. I agree that this was probably a boondoggle by a politician wanting to get some publicity for himself.
You're mistaken. There's a large class of problems that are pleasantly parallel and can be split up like you say (e.g. einstein@home or seti@home type problems). However, any problem that requires a lot of internode communication such as computation fluid dynamics, gravity simulations, weather or climate simulations/forecasting, combustion/flame problems (e.g. modeling engines), molecular dynamics will require a system like this. A beowulf cluster using ethernet to connect nodes together will result in most of the cpus waiting for information from neighboring nodes to be sent to it so that it can go through an iteration. A lot of the cost in a system like this comes from having a very low latency, high speed network connections. Ideally, you'd want to have every cpu connected to every other cpu, but that is impossible so you end up trying to maximize the number of connections and bandwidth while minimizing the collisions with other cpu-cpu communications for a given amount of money. It's not cheap by any means.

--
"When you sit with a nice girl for two hours, it seems like two minutes. When you sit on a hot stove for two minutes, it
Re:Oh, boy! by Immerman · 2013-01-03 15:00 · Score: 4, Insightful

Actually, for learning how to do good supercomputer programming it might be quite viable. After all most beginner code is horribly inefficient, and most beginner projects are quite small. On anything resembling a "real" supercomputer even the most inefficient code will still finish within seconds - whereas on slow hardware with poor I/O a poorly coded implementation may take many minutes or even hours versus the seconds needed for a well-written program to do the same task. Technically speaking the difference between .1 seconds and 10 seconds is just as informative as the difference between 10 seconds and 17 minutes, but the latter carries far more psychological weight.
Besides which - how many entry-level tasks can you think of that could actually make use of even a few dozen clustered "real" systems, much less a thousand? Hands-on experience in how to effectively partition a task between numerous nodes shouldn't be underestimated, and it's a rare university that's going to want to turn beginning programmers loose on their big iron, other departments want to use it for real research. A $30-50k cluster on the other hand might be just what the CS department ordered.

--
--- Most topics have many sides worth arguing, allow me to take one opposite you.