Slashdot Mirror


Linux Clusters Finally Break the TeraFLOP barrier

cworley submitted - several times - this well-linked submission about a slightly boring topic - fast computers. "Top500.org has just released its latest list of the world's fastest supercomputers (updated twice yearly). For the first time, Linux Beowulf clusters have joined the teraFLOP club, with six new clusters breaking the teraFLOP barrier. Two Linux clusters now rank in the Top 10: Lawrence Livermore's "MCR" (built by Linux NetworX ) ranks #5 achieving 5.694 teraFLOP/s, and Forecast Systems Laboratory's "Jet" (built by HPTi) ranks #8 reaching 3.337 TeraFLOP/s. Other Linux clusters surpassing the teraFLOP/s barrier include: LSU's "SuperMike" at #17 (from Atipa ), the University at Buffalo at #22 and Sandia National Lab at #32 (both from Dell ), an Itanium cluster for British Petroleum Houston at #42 (from HP ), and Argonne National Labs at #46 (from Linux NetworX ) reached just over the one teraFLOP/s mark with 361 processors. In the previous Top500 list compiled last June, the fastest Intel based Netfinity 1024 processor clusters from IBM were sub-teraFLOP/s and the University of Heidelberg's AMD based "HELICS" cluster (built by Megware ) held the top tux rank at #35 with 825 GFLOP/s."

14 of 212 comments (clear)

  1. Enough Links? by rob-fu · · Score: 5, Funny

    It's going to take me 4 hours to read all of this.

    1. Re:Enough Links? by jdkincad · · Score: 5, Funny

      Look on the bright side, it might just spread out the /. effect enough to keep all the linked sites on line.

      --
      The great advantage of having a reputation for being stupid: People are less suspicious of you.
    2. Re:Enough Links? by Jugalator · · Score: 5, Funny

      What he said! Holy crap! This is the main thing I don't like about slashdot, I can hardly ever tell what the main point of the post is if I have to figure out what link to click first.

      Just act like the average Slashdot member. Never click any links to read the articles and just post your thoughts regarding the subject. :-)

      Everything get so much easier that way!

      --
      Beware: In C++, your friends can see your privates!
  2. can you imagine... by Anonymous Coward · · Score: 5, Funny

    a single node from one of these clusters?

    (hey what else can I say, it's already a cluster)

  3. Beowulf clusters aren't so tricky... by caluml · · Score: 5, Interesting

    I built a small Beowulf cluster. It was actually very easy, apart from writing the MPI enabled code.

    Step 1: Install the lam packages on all the nodes
    Step 2: Create an account on all nodes, and use a passphrase-less ssh key to avoid prompting.
    Step 3: Compile your code with mpicc (rather than gcc)
    Step 4: Copy to all nodes.
    Step 5: mpirun C ./your-prog

    Admittedly it was only a 4 node cluster, but hey ;)

    Please, someone break it to me gently if this wasn't actually a Beowulf cluster ;))

    1. Re:Beowulf clusters aren't so tricky... by dsfd · · Score: 5, Interesting

      We built and mantain a Beowulf with about 70 nodes. We use Debian GNU/Linux.

      I agree with you, in principle, it *is* easy to do but the problems increase with the number of nodes. IMHO, the main problems are:

      -Administration effort per node has to be almost zero. Beyond a number of nodes you definitely need things like fully automatic instalation, automatic power control, automatic diagnostic tools, a batch system, etc. All these tools already exist but you need some know-how to put all them together.

      -You need a large enough room with a cooling system that gives at least 100 W per node, 7kW in our case. Room temperature has to be about 20oC.

      -Low cost PC hardware is not allways reliable enough for this application. If you have codes that run 24x7 for months in a large number of processors, the probability to have a hardware problem is very high.

      We have found that our hardware suppliers do not carry out extensive tests on the systems they sell. This is because "normal" users run low quality OSs and they assume that it is normal that the computers just hang from time to time. Therefore, they do not allways detect failures in critical components such as RAM.

      -Of course, your application has to be suitable for parallel computing, specially if your cluster uses a low cost 100Mb/s network. In this case, compared to a "conventional" parallel computer (eg Cray T3E), the processors are roughly equivalent but the network is about 10 times slower and is easily the bottleneck of the system.

      Having said that, despite all the problems, I love Beowulfs. They have totally changed high performance computing, and they are definitely here to stay.

      All this has been possible thanks to free software, so thanks Mr. Stallman/Torvalds and many others...

  4. Re:Question? by Guppy06 · · Score: 5, Funny

    "How long until computing powerful enough to render the probability thought patterns of a manager? That's what I want to know.."

    Good luck. Last I checked, that one falls under Heisenberg's Uncertainty Theorem.

  5. Re:EARTH-SIMULATOR by gl4ss · · Score: 5, Funny

    int simulate_earth()
    {
    sleep(years_to_ms(30000));
    r eturn 42
    }

    dunno what they need the computing power for..'
    oh yeah, to generate the program to call that.

    --
    world was created 5 seconds before this post as it is.
  6. Re:FLOPs by jelle · · Score: 5, Interesting

    Since nobody is answering your question: The Top500 supercomputers are ranked by the results of the LinPack benchmark.

    --
    --- Hindsight is 20/20, but walking backwards is not the answer.
  7. LinuxBIOS by bstadil · · Score: 5, Interesting

    This is not such a dumb question. The LinuxBIOS project was started by and for the Los Alamos National Lab. One of the nifty things this allows them to do is change Kernel without taking the machines down. You can then switch to a kernel compiled for different purposes.

    --
    Help fight continental drift.
  8. Re:Question? by Masa · · Score: 5, Funny
    How long until computing powerful enough to render the probability thought patterns of a manager?

    That shouldn't be too hard... I bet that my Palm Pilot has enough power to predict exactly, what my boss is going to say in the next meeting tomorrow.

    If it's about schedules, he'll say:

    Work...

    1. harder
    2. smarter
    3. cheaper
    4. faster
    In that order.

    If it's about project goals, he'll ask me to:

    Make...

    1. miracles

    If it's about specifications, he'll say: "I have no idea. You find out yourself." And for anything else it would be just blank. All blank.

    On the other hand... if a manager actually has any real thoughts... Well, that would be as easy as to predict patterns from a pure chaos.

  9. Re:It seems like the Apple Mac..... by ikekrull · · Score: 5, Insightful

    Ah, that would be because Apples 'supercomputer on the desktop' marketing drivel was just that.

    Hell, the Sony Playstation 2 was subject to export restrictions because it was 'too powerful', which was driven by/followed with the requisite marketing drivel, but you don't see and PS2 clusters in the 'Worlds fastest supercomputer' list either.

    It has been a long time since Apple PPC was competitive in terms of price/performance with x86s. Of course thats not the only reason to buy a computer, i don't want to get the apple-zealots panties in a bunch.

    It's just that Intel/AMD didn't make a song and dance about breaking the GFLOP barrier, since that happened way back with the P3/Athlon 600-800, hardly cutting edge chips.

    Hell, a 600Mhz Alpha had GFLOP performance years before either the G4 or the x86s.

    The PPC has a nice vector processing unit (Altivec), which could make it a good choice in some situations, but given the premium you pay for Beowulf nodes (Xserves?) from Apple, you will, in general, get a lot more bang for the buck from x86.

    --
    I gots ta ding a ding dang my dang a long ling long
  10. Flops is not everything by Bender_ · · Score: 5, Informative
    Measuring MFlops does not mean a lot - even if it is from a "real life" benchmark. The TOP500 might look much worse for linux-clusters, if more communication-latency dependent benchmarks were used. Linpack, which works mainly on very large matrices, shifts the benchmarks results a lot towards linux-cluster solutions.

    A real supercomputer supports much faster I/O, higher interconnection bandwidth and lower interconnection latency.

    And btw. the new Cray X1 delivers the performance of a all but the largest linux-clusters in a single cabinet (820 GFlops peak that is..). In terms of computing efficiency it makes even the Earth Simulator look pale. I am really looking forward to the next iteration of the TOP500, when the first X1 machines are included.

  11. Imagine.... by DarkHelmet · · Score: 5, Funny
    All those computers that meet Doom3's system requirements...

    ... And they're used for trivial things like finding aliens, weather prediction and unified theory.

    1. The weatherman is usually wrong.
    2. Aliens are abducting us. We need to send radio signals to Fife, Alabama, not out into space.
    3. Unified Theory is based on Heisenburg's stuff... You can have relativity and quantum mechanics... but not both at the same time. Damn, that guy was a genius. By the way, the unified theory is:

      e = 42; // always 42.

    Of course, I'm sure Doom3 has this somewhere in its source code, so ummm... go crunch 40 TFLOPS on that ;)

    </humor>

    --
    /^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.[A-Z]{2,4}$/i