Slashdot Mirror


Truly Off-The -Shelf PCs Make A Top-500 Cluster

SLiDERPiMP writes: "Yahoo! News is reporting that HP created an 'off-the-shelf' supercomputer, using 256 e-pc's (blech!). What they ended up with is the 'I-Cluster,' a Mandrake Linux-powered [Mandrake, baby ;) ] cluster of 225 PCs that has benchmarked its way into the list of the top 500 most powerful computers in the world. Go over there to check out the full article. It's a good read. Should I worry that practically anyone can now build a supercomputer? Speaking of which, anyone wanna loan me $210,000?" Clusters may be old hat nowadays, but the interesting thing about this one is the degreee of customization that HP and France's National Institute for Research in Computer Science did to each machine to make this cluster -- namely, none.

20 of 231 comments (clear)

  1. Can you imagine? (obligatory) by Andorion · · Score: 5, Funny

    Can you imagine a Beowulf cluster of these... erm... clusters?

    -Berj

  2. What was even cooler... by UserChrisCanter4 · · Score: 4, Funny

    Was when the HP-powered cluster started assimilating some of the Compaq multi-Alpha machines as it's own.

  3. $210,000 ?? by a.out · · Score: 5, Interesting

    How about $0 Baldric a student run beowulf at the University of Western Ontario built one on hardware dontations. It's not exactly top 500 but it still kicks ass.

  4. Practically anyone? I think not. by Pulzar · · Score: 3, Insightful

    Should I worry that practically anyone can now build a supercomputer?

    Unless "practically anyone" has the funds, the storage room, and the manpower to maintain this monstrosity, there is nothing to worry about.

    And even if anyone could build a supercomputer, what's there to worry about? We don't live in the "War Games" world where supercomputers play chess, tic-tac-toe, and start nuclear wars for fun.

    --
    Never underestimate the bandwidth of a 747 filled with CD-ROMs.
    1. Re:Practically anyone? I think not. by Pulzar · · Score: 3, Informative

      Actually, the worry about the PS2 machines was that their imaging capabilities are strong enough to be used in the missile guidance systems. I think he never actually attempted to get any of them, but US blocked shipments to Iraq just in case.

      --
      Never underestimate the bandwidth of a 747 filled with CD-ROMs.
    2. Re:Practically anyone? I think not. by KyleCordes · · Score: 3, Insightful

      My guess is that people who understand how to use computers to do modeling for nuclear weapons design would be somewhat harder to come up with than the appropriate degree of computing power.

      Knowing nothing about it, I would nonetheless guess that it's rather non-trivial.

      Keep in mind that nukes were invented without the aid of a Beowulf or a Cray.

  5. Putting them to work. by nairnr · · Score: 5, Informative

    Well, it seems like super clusters are becoming very easy to build hardware-wise. If you throw enough commodity at a problem, it becomes easier. I would think the biggest problem with supercomputers is no longer the hardware itself, but networking, and the programming to take advantage of the hardware. These computers still only really work for something that distributes easily. The biggest factors are now the ability to distribute, and schedule work for each node. The more nodes you engage, the more you hope your problem is CPU bound, so it will scale more.

    Data transfer and message passing are such a big issue I belive the most important developments are in the networking topologies and hardware for these environments.

    That said, I still want one in my basement :-)

  6. This just goes to show you by bstrahm · · Score: 4, Insightful
    How powerful standard desktop computers are. There is only two orders of magnitude between a normal desktop computer (I refuse to call a Pentium III 733 as outdated) and a mainframe computer.


    Now all we need are ways of getting local connections significantly faster (Did someone say Gig Ethernet) to allow faster communication between the nodes and we will be able to scale beyond several hundred and break the top 100. I hear 1gig NICs will be falling in price to under $100 US retail soon...


    How fast do you connect to your cluster ?

    1. Re:This just goes to show you by kindbud · · Score: 3, Insightful

      I hear 1gig NICs will be falling in price to under $100 US retail soon...

      I hear Gigabit switches won't be...

      --
      Edith Keeler Must Die
    2. Re:This just goes to show you by BrentN · · Score: 4, Informative
      The problem with Ethernet in clustering isn't bandwidth, its the latency.

      The real issue is how parallel-efficient your algorithms are. We do molecular dynamics (MD) on large clusters, and we can get away with slow networks because each node of the cluster has data that is relatively independent of all other nodes - only neighboring nodes must communicate. If you have a case (and most cases are like this) where every node must communicate to every other node, it becomes a more difficult problem to manage. To deal with this, you need a high-speed, low-latency switch like the interconnects in a Cray. The only real choice for that is a crossbar switch, like Myrinet.

      And Myrinet is tres expensive.

  7. worry? by motherhead · · Score: 3, Funny

    Should I worry that practically anyone can now build a supercomputer?

    Yes, you should probably worry that practically anyone can build a supercomputer. But you could mitigate all that fear with the fact that not practically anyone can whip up software that takes full advantage of it.

    Thank god there isn't any off the shelf "missile trajectory" software in the CDW catalog. you would hope that any society that can whip together motivated coders to write such code already has access to some pretty spiffy kit.

    (yeah i said "kit"... and I'm from Chicago... I feel like such a wanker.)

  8. Re:Imagine... by Bonker · · Score: 3

    A Beowulf cluster of E-machines?

    I dunno. It's kinda lacking when you compare it to all the other Beowulf clusters we've considered.

    --
    The next Slashdot story will be ready soon, but subscribers can beat the rush and slashdot the links early!
  9. Yahoo! news by Anonymous Coward · · Score: 3, Funny

    shouln't that be Yahoo! Serious News now?

  10. Okay, so I'm curious: by Saint+Aardvark · · Score: 3, Funny

    Sez the cost was $210k US w/o cabling...why the qualification? What *would* cabling for 225-odd boxen cost?

  11. It's at #385 in the list by CormacJ · · Score: 4, Informative
    The latest top 500 list is here : http://www.top500.org/list/2001/06/

    The cluster is at #385

  12. Yawn. Another One. by 4of12 · · Score: 3, Insightful

    You know this Beowulf business is getting to be pretty staid and routine by now.

    In fact, I'd almost say it would be newsworthy if there were any organization (university, company, govt lab) that had not yet built "a supercomputer from the COTS components".

    What I'd like to see now is more metrics (some of which the article does, admittedly, reveal).

    1. hardware cost per FLOP (everyone already tells you this)
    2. FLOPS per human time to build
    3. FLOPS per sysadmin time to maintain
    4. FLOPS per kilowatt of electricity
    5. FLOPS per cubic foot of rack space
    6. can it run smoothly if Bad Andy goes behind the rack and unplugs a few network connections, a few power cords to some nodes?
    Everyone knows that you can spend your own time scouring dumpsters for cast-off computers and coaxing them to life, bringing up an old 486 with an ISA 10bT card as a member of your cluster. Unless you're doing it for your own educational benefit, it's just not worth it.

    Don't get wrong. I love these clusters and want to use them. It's just that, in 2001, their mere existence is no longer as exciting as it was in the mid 1990s.

    Now days, I care more about ease of use and ease of maintenance, taking the low cost of a Beowulf cluster as a given.

    With the size of these clusters going up and the ratio of hardware cost to human time constantly decreasing, I'd be more impressed to see how a system with many hundreds of nodes was brought up in a short time, never rebooted for a year, even as 13 of the nodes developed variously problems and become unproductive members of the cluster.

    --
    "Provided by the management for your protection."
  13. For those wondering about nuclear testing... by segfaultcoredump · · Score: 5, Insightful

    No, a beowulf cluster is the last thing that one would use for nuclear simulation.

    While great at highly parallel tasks that require very little synchronization between threads (think code cracking), nuclear testing (and almost all other fluid dynamic problems) generally requires all of the cpu's to have high speed access to all of the memory. So one needs a huge shared memory system (think Cray or Sun StarCat).

    And for this reason, I find the top 500 list to be a bit misleading in these days of massively parallel systems. Its great as a test of how many flops the system can crank out, but it does not take into account the memory bandwidth between the cpu's, and that is often more important than raw cpu horsepower.

    1. Re:For those wondering about nuclear testing... by tolldog · · Score: 3, Informative

      Ahh... Somebody else who gets it...

      I find too that people assume that an "X" type of cluster will solve all problems, regardless of what they are. Each cluster type serves a purpose. Cray and then SGI spent time developing the Cray Link for a reason. Sun, IBM, HP and others have gotten into the game as well. Sometimes you need a ton of procs with access to the same memory, sometimes the task divides well.

      I see this from almost the opposite side of the spectrum with rendering. To render a shot, you can divide the task amongst ignorant machines. They just need to know what they are working on. The cleverness goes into the managment of these machines. A place where the massively parallel machines would be nice is rendering a single frame. After the renderers initial load of the scene file and preping for render, the task can be divided amongst many processors on the same machine. To divide it beowulf style would throttle the network with the memory sharing over the ethernet ports.

      So from my experience:

      big data, long time ... massivley parallel machinebig data, short time ... generic cluster with smart master
      little data, long time ... beowulf style cluster
      little data, short time ... generic cluster with smart master

      --
      -I just work here... how am I supposed to know?
  14. Re:Power consumption of old CPUs by jmv · · Score: 3, Informative

    Actually, the best MIPS/Wh is probably with the slower versions of the current laptop chips. Maybe portable G3/G4?

    Also, I don't think you'd get much useful stuff done with early Pentiums and 486. Consider that a P4 2 GHz has 20 times the clock speed and probably does twice as much per cycle, so it's ~40X faster. Now, if you connect 40 P100 together, unless your problem is completly parallel (like breaking keys, as opposed to most linear algebra), you're going to lose at least a factor of 2 there. This means that in order to equal 1 P4 @ 2 GHz, you'll need almost 100 Pentium 100 Mhz. This means that 10 P4 would be like a thousand Pentiums. At these numbers, it's going to cost so much in networking and power...

    I'd say (pure opinion here) the slower you'd want to have today is something like a Duron 1 GHz and the best MIPS/$ is probably with a bunch of dual Athlon 1.4 GHz (A dual is not cheaper that 2 single, but you get more done because of parallelism issues).

  15. Re:Power consumption of old CPUs by Paul+Komarek · · Score: 3, Informative

    My experience doesn't suggest that the P4 does twice as much per cycle. I'm seeing P4s do a fair bit less than the P3 per cycle, and the P3, P2, and PPro cores didn't seem *that* much faster per clock than the original Pentiums. My gut tells me that the P4 doesn't do any more than the original Pentiums per clock cycle, and the only thing they have going for them is Intel's ability to manufacture them at high clock speeds.

    If you really want a cpu that does a lot in a single cycle, look at the IBM POWER series. IIRC, on the floating point side, a 2xx MHz POWER III is darn not too far from an Alpha 21264 at 733 MHz. And now there are 1.1GHz and 1.@GHz POWER IV chips, in the new IBM p690 machines. I don't know how they compare to the POWER III per cycle, though, because the POWER IV opens a whole new (good) can of worms.

    -Paul Komarek