Building The MareNostrum COTS Supercomputer
karvind writes "IBM Power Architecture Community Newsletter has a story about making a supercomputer (Number 4 on top 500 list) from easily available components (like BladeCenter and TotalStorage servers, 970FX PowerPC processors, and Linux 2.6). A joint venture between IBM and the Spanish government, it is named MareNostrum: the Latin term meaning 'our sea.' Peaking at 40 TFlops, the beast consists of 2,282 IBM eServer BladeCenter JS20 blade servers housed in 163 BladeCenter chassis, 4,564 64-bit IBM PowerPC 970FX processors, and 140 TB of IBM TotalStorage DS4100 storage servers."
Mare Nostrum refers to the Mediterranean Sea.
Mare Nostrum literally means "our sea". It is what the Romans called the Mediterranean Sea during the Empire. As you can see, it was an apt name.
Rhymes that keep their secrets will unfold behind the clouds.There upon the rainbow is the answer to a neverending story
I am an African Grey parrot, and I can tell you that while you humans are celebrating this achievement, I and my fellow Greys are laughing at you. Supercomputers are old news to us; in fact, one of my friends solved the halting problem while taking a crap the other day. Seriously, people, we like you 'cause you feed us, but leave this kind of stuff to us.
/. thought my user name was too long)
(I tried to register an account but
but does it run Linux? Oh crap, never mind.
(Number 4 on top 500 list)
...while being Number 6 on top 300 list, and Number 65 on top 2000 list.
This is like those CDs that have 'best of the Top40' and not contain the top10 list of that
If an experiment works, something has gone wrong.
and after about 16 (or 32) years we'll have that power in our desktops...
-- and if life has failed you leave the cross you're nailed to
It probably has to do more with the fact that as you increase the number of nodes, your increase in performance decreases on a per node basis. To get that many nodes working together takes an incredible amount of resource management. It makes you wonder where the limit currently is for if it is worth adding an extra node, or if the resource management requirements negate the extra nodes computational power.
I love the first line in the article, which ends, "is constructed of such totally off-the-shelf parts as IBM BladeCenter JS20 servers, 64-bit 970FX PowerPC processors, TotalStorage DS4100 storage servers, and Linux 2.6. This is its story."
Right, like I regularly go to Fry's to stock up on some DS4100s and Bladecenters. I'd love to be the geek for whom that stuff is "off-the-shelf". Can you even buy bare PPC CPUs and mobos?
I happened to look at the Top 500 supercomputers site and I couln't help noticing out of the top 5 supercomputers almost half are in non-US countries like Spain and Japan. This is not to beat some kind of patriot act drum. Instead, it got me to thinking.
With supercomputing powers now avaible to any country or group with a few readily available components, it is only a matter of time before these supercomputing powers may be used by a rogue state or radical group to cause havoc among electronic communications using methods like denial of service attacks, spyware, and crapflooding message boards.
I think it is high time the nations of the world put their heads together and addressed this issue. For example, I don't think the US Federal Government even has any cabinet-level position like Secretary of Information Technology or something like that. When are they going to get with the times? It will probably take another terrorist attack or something.
This is all about timely and focused execution. The speed at which this project was realized is important. Consider: from the initial concept in late December of 2003 to assembling the computer in Madrid took less than a year. Normally, this kind of supercomputer projects take years.
Lame!
SGI had NASA AMES' Columbia online in 120 days, and landed #2 on the Top500.
Brandon D. Valentine
Peaking at 40 TFlops, the beast consists of 2,282 IBM eServer BladeCenter JS20 blade servers housed in 163 BladeCenter chassis, 4,564 64-bit IBM PowerPC 970FX processors, and 140 TB of IBM TotalStorage DS4100 storage servers.
Sounds like the specs of Microsoft's Xbox 3...
Don't take life so seriously. No one makes it out alive.
I'm sorry, but why are we giving IBM free press again? For god's sake, the very first sentence of the freaking article is utter rubbish:
The MareNostrum supercomputer at the Barcelona Supercomputing Center, ranked number four in the world in speed in November 2004, is constructed of such totally off-the-shelf parts as IBM BladeCenter JS20 servers, 64-bit 970FX PowerPC processors, TotalStorage DS4100 storage servers, and Linux 2.6
That's it - the first sentence of the article, if you exclude the title, the credits and the date, that is.
And we can already call advertising bullshit. I'm sorry, but how is the MareNostrum system made of any more "totally off-the-shelf" parts than the number two system on the Top500, NASA's very own Columbia? In fact, the 64-bit 970FX PowerPC processor is NOT an off-the-shelf part: unlike the Itanium 2 CPUs in Columbia, you can NOT buy such CPUs individually (for good reason: unlike the Itanium CPUs in Columbia, MareNostrum's CPUs are not socketed but soldered to the JS20 system board, so there goes upgradeability...)
Seriously though, why should we give a rat's ass about MareNostrum? Columbia is faster, more efficient, really is made from off-the-shelf parts and also runs Linux.
What's that? Oh, IBM are a good company and SGI aren't? For fuck's sake, SGI are better friends of Linux than IBM are. What did IBM do for linux? Nothing compared to SGI. IBM ported JFS, a crappy journaling filesystem, oh and they ported it to their own POWER/PowerPC architecture systems. W00p!
SGI:
- Gave Linux XFS, one of the fastest filesystems around, with _many_ advanced features (just look in your kernel config sometime)
- Scaled Linux beyond 64 CPUs for the first time (and indeed, they hold the record at 2,048CPUs): they fixed a _ton_ of scalability problems, and continue to do this on a daily basis (just look at this week's archive of the linux-ia64 mailing list to see what I mean!)
- Open sourced their Itanium compiler
- Created OpenGL (notice carefully the Open in OpenGL. You can bet your bottom dollar if IBM created a funky new graphics API it would _only_ work on PowerPC machines with IBM video hardware!)
Yada yada. All I'm trying to say is there are other companies out there who have really taken Linux to heart and have made open source development in their best interest, not just IBM. SGI is just one example, there are many others.
Yeah but it's made from "easily available components." Hmmm, I think I've seen that before, at the University of Virginia. Don't G5s qualify as "easily available"?
It's just the thing to find SHA1 collisions of ISO images in 2^56 operations...
...but how fast can it open Photoshop CS?
Well, if you really need a supercomputer, the first step is not to get the computer, but instead the funding. As super computers go, if your needs are served by a distributed computing environmet, an "entry level" supercomputer does not really cost all that much compared to "traditional" supercomputers. Yes we are still talking about hundreds of thousands of dollars. (there seems to still be 256 processor 2.8ghz xeon based computers with gig ethernet connectivity on the latest top 500 list) If you need to solve problems that do not work in paralell, you are definitely out of luck.
but I bet Windows still runs slow on it.
Coding Monkey.org - Spanging the heavy spade of truth into t
Given enough money how is that impressive anymore? What's the best single-thread-performance machine today?
I just love how every time someone writes about another grid or supercomputer or beowulf cluster they always say "easily available components" as if I could find most of them in a standard IT closet or just run down to the local computer shop and pick them up with my corp. AMEX.
In what world is 163 BladeCenter chassis, 4,564 64-bit IBM PowerPC 970FX processors, and 140 TB of IBM TotalStorage DS4100 storage servers easily available??? Maybe if you are Big Blue, but then, why would it be more difficult for them to throw together a fully proprietary supercomputer?
...does it have an AGP slot?
Need an ISP in South Africa?
But I thought it was Ouray Easay!
What's going on?
This is all so confusing! I need to take a nap.
You'll notice the "redundant" moderation, which is a perfect fit, given the context :-)
Am I the only one finding it curious that nowhere in the specs do they mention how much RAM there is per node or in the aggregate?
It mentions how many nodes, how many CPUs, how many racks, how much storage, but not how much RAM.
500GB of disk, 5TB of transfer, $5.95/mo
If you're the buyer, there is no limit. Every extra node == _MORE_STATUS_
If you're the guys writing code for it, there is no limit. Every extra node == job security++
If you're the people administering this, there is no limit. Every extra node == bigger budget next year
See, citizen? Size does count.
Eh, not really. In a lot of the algorithms, there comes a point when adding nodes will make it SLOWER because the increases in communication time are greater than the decrease in computation time. Now granted this does depend a lot on a) what you are doing with the machine and b) the machines themselves, but just thinking that people who make these things love to just pile on hardware is a bit naive....
Monstar L
The scary thing is, that the CELL with its potential for 25 GFlops of double precision floating point, could rival this system with just 1600 8 SPE units.
Granted, the CELL isn't exactly off the shelf, and I'm willing to bet 4,564 970FXs will be cheaper than 1600 CELLs for quite some time, so the project still has merit.