Slashdot Mirror


Linux Clustering

An anonymous reader writes "Beowulf clustering turns 10 years old, and, in this interview, creator Donald Becker talks about how Beowulf can handle high-end computing on a par with supercomputers."

14 of 154 comments (clear)

  1. Can you imagine... by FortKnox · · Score: 5, Funny

    ... the amount of replies that will start with the same subject header as mine and not be funny at all?

    I sure can!

    --
    Good quote, too many chars. Seriously, the slashdot 120 char limit sucks!
  2. news? by dan2550 · · Score: 5, Insightful

    i dont mean to sound like a troll or anything, but is this really news. over the last year or so, (nearly) all of the articles on /. about fast computers have been clusters.

  3. Happy Anniversary... by crawdaddy · · Score: 5, Funny

    Happy Anniversary to the most over-used joke on Slashdot. I'll be wearing my tin-foil hat all day to commemorate it. (The 10th anniversary is the aluminum/tin anniversary)

  4. BlueGene by a3217055 · · Score: 4, Interesting

    All this sounds good and Interesting, and Becker did a tremendous ammount of development in this field. But I was just wondering, what about supercomputers like BlueGene/L which have very fast interconnects. Many supercomputers/distributed systems run MPI based programmes and such programmes need a high interprocess commmunication does anyone one know how good these are in a Bewoulf Cluster? thanks a3217055 They said that of all the kings upon the earth he was the man most gracious and fair-minded, kindest to his people and keenest to win fame. :-The Geats' tribute to Beowulf after his death.

    1. Re:BlueGene by jamesdood · · Score: 5, Insightful

      Since I administer a fairly large cluster, I can say that the answer is "It depends" (Of course that is ALWAYS the answer!). It depends on the codes being run, it depends upon the interconnect optimization.(yes myrinet is fast, but the real key is that it has much lower latency and this has to be engineered carefully if using more than one switch) My cluster runs both myrinet and Gig/E, some codes run well on the the ethernet interfaces (take codes like mpiblast for instance) while others (NAMD comes to mind) run faster on the myrinet. However this machine may be fast, but I have some large SMP boxes (IBM P-series) that cycle for cycle SMOKE the performance of the x86 boxes. But you have to remember that the cluster computers cost about $3000 /node while the SMP boxes with a similar config cost about $13,000 apiece, and even more if you want a box that supports more than 8 CPUs (think 1 million and up)
      So once again, it comes down to the types of jobs, and how much you are willing to pay to get those jobs done in a hurry! A Cluster is still great, I have just completed some jobs that consumed over 12 years of CPU time, in 1 week of wall-clock time!

      --
      *narf!*
  5. Beowulf seems older than that by Weaselmancer · · Score: 4, Funny

    ...doesn't it to you? I mean how long have you been sick of the "imagine a beowulf cluster of those" comments? Doesn't seem like only 10 years would make me that sick of it.

    --
    Weaselmancer
    rediculous.
  6. The choice between Beowulf and Big Iron... by corvair2k1 · · Score: 4, Insightful

    ...can be simple. The more complex a problem gets, the more likely you need one supercomputer as opposed to a cluster. It's not elitism, it's just that the problem will probably require a lot of communication between processors.

    Any kind of networking solution between computers will never be as fast as a hard-wired bus can be. If a lot of communication between nodes is required, you will spend more time waiting than computing, which shoots efficiency to hell.

    1. Re:The choice between Beowulf and Big Iron... by monoi · · Score: 5, Insightful

      The more complex a problem gets, the more likely you need one supercomputer as opposed to a cluster.

      I'm not sure it is that simple. For some problems (e.g. Monte Carlo simulations), a more complex problem means more individual nodes are required, with very little inter-node communication. For other kinds of problem (finite element methods, maybe?), you're probably right.

      In other words, the physical structure of the solution depends on the kinds of algorithms that you intend to run: there's not just one `correct' answer.

  7. imagine by justforaday · · Score: 5, Funny

    imagine a lone computer sitting by itself not connected to anything...

    --
    I'll turn into a supernova and burn up everything. Well I'll turn into a black little hole and you'll turn into string.
  8. On par? Yes and no by grape+jelly · · Score: 5, Informative
    Beowulf clusters have never been the fix-all solution to pricey supercomputer needs. Traditional mainframe supercomputers will forever have their niche in computing that can't just be muscled through sheer volume of vector processes (i.e., processes in which good latency is essential). Even the creator of the Beowulf cluster agrees:

    Quote from the article: *snip!*
    Supercomputer vendor Cray has created a new product that is designed to compete with some Linux clusters. Cray Canada CTO Paul Terry said that Linux clusters really can't compare to a supercomputer. What is your take on Cray's moves against Linux?

    Becker: They are simultaneously saying that Linux clusters are not high-performance computing systems while introducing a product to compete with Linux clusters. They clearly saw that a large part of their customer base was moving toward commodity clusters, Beowulf-class clusters, to do high-end computing.

    Clusters can't replace all of the workload being done by supercomputers today, but it can replace the bulk of the traditional vector supercomputers. There is always that 10% of the market that won't run well on clusters, and that is the market that Cray is in. We are happy to solve most of the problems of the world and run most of the applications and play in our part of the marketplace.
  9. It's not just about speed and massively parallel by Wizzy+Wig · · Score: 5, Informative

    processing...

    To be considered a "supercomputer," it also needs enough CONTIGUOUS MEMORY SPACE to hold the massive amounts of data associated with true "supercomputing." So far, no cluster has met that requirement.

  10. Imagine.... by drkich · · Score: 5, Funny

    Imagine there's no cluster,
    It's easy if you try,
    No adapter below us,
    Above us only loopback,
    Imagine all the computers
    computing for themselves...

    Imagine there's no internet,
    It isnt hard to do,
    Nothing to download or upload for,
    No porn too,
    Imagine all the computers
    computing pi in peace...

    Imagine no tokens,
    I wonder if you can,
    No need for ethernet or tcpip,
    A brotherhood of computer,
    Imagine all the computers
    Sharing nothing at all...

    You may say I'm a dreamer,
    but I'm the only one,
    I hope some day you'll leave us,
    And the computers will computer alone.

  11. A few Slashdot clichés: by Sean+Johnson · · Score: 5, Funny

    - Can you imagine a Beowulf cluster of these?
    - How long until the RIAA sues them into oblivion once they find out how may MP3's you can put on one?
    - "Can you put Linux on it?" or "Yes, but will it run Linux?"
    - "Yeah, but does it run Doom3?" or "And it still won't run Doom3."
    - Any comment regarding "Duke Nukem Forever" taking literally 'forever' and being termed 'vaporware'.
    - I am not buying one until they support ".ogg".
    - I for one welcome our new (insert name of company mentioned in post or story) overlords.
    - "George Lucas raped my childhood" or "Greedo shoots first" comments on any story incorporating the Star Wars franchise.
    - A comment including these 3 components in any order: Natalie Portman, naked and petrified, hot grits, one's pants.
    - Microsoft = Evil, MPAA = Evil, RIAA = Evil; with anything else incorporated to try and fit those equations into the topic at hand
    - Some type of reference to the size of one's ProN collection, the amount of ProN that can be stored on the gadget or technology in question, or the ProN industry itself being the first to make good sue of the new technology or gadget in question (ergo: the ProN industry drives technology)
    - The posted cliché being self-described as an "obligatory" post in the heading area if that particular cliché had not been addressed yet by previous slashdotters. (e.g. "obligatory Beowulf cluster comment")
    - Post revealing the fact that the story's homepage had been slashdotted already, culminating towards another post later on with the homepage story itself being copied & pasted verbatim (often with a subsequent post purporting that this is karma whoring, even though the poster admits it is indeed helpful anyways.)
    - Remark on the size of some new storage advancement about how many LOC's (Library of Congresses) can fit on it, or any other remark noting how this can be an actual valid unit of data storage measurement.
    - A variation of the Zero Wing video game intro dialogue regarding it's broken English translation: "Someone set up us the base....we have every ZIG, make your time".....blah, blah, blah.
    - Very soon lists such as this will be clichés as well.
    - Similarly noted and additional clichés may be found here: http://en.wikipedia.org/wiki/Slashdot_subculture

    --
    >>>>>> Chewie, take the professor in the back and plug him into the hyperdrive.
  12. FUD, FUD, and wrong. Crey stockholder eh? by Rhys · · Score: 4, Informative

    Sad to see this little knowledge about parallel computing on slashdot: blatantly wrong information marked as informative. +5 no less.

    Let's address this first: there are two common memory architectures, distributed memory (a cluster) and shared memory (a 'traditional' supercomputer). Each can emulate the other. Saying a cluster doesn't have enough memory, presumably at each node, is really saying: "I don't really understand message passing."

    This would be more important if datasets were actually large. Unfortunatly for your argument they aren't. A handfull of nodes and they'll hold the whole simulation easily in memory (albeit it'd take years to run because there's so few CPUs at work.)

    How would I know? Well, I work with the Center for Simulation of Advanced Rockets aka CSAR at UIUC, one of five DoE ACSI sites in the country. I manage their supercomputer, which is getting upgraded from 200 P3-class dual proc PCs to 640 dual proc Xserve G5s. Before that I was a grad student working with them, albeit not on the CSAR simulation but instead on a related grant, the CPSD.

    Now, there are computing problems which clusters aren't good at (or at least that's the traditional claim. My master's thesis and advisor would seem to dispute that this is actually the case.) However, most problems as the interview says, run just fine on clusters. Physical simulations (which covers CSAR's rockets to the national labs nuclear weapon research to hurricane/weather simulation, all the way down to protein folding and atomic and sub-atomic scale crystal formation simulation) need to know about what's in the area you're working on, and what's in nearby areas.

    Occasionally you'll find an oddball like galactic simulation (or molecular dynamics) that needs to compute gravity across the whole universe. Fortunatly we have multigrid methods and a friendly gravity equation to solve this problem: get real data from those near you. Average those far from you and use that instead.

    Then of course there's the idea that even "traditional" supercomputer problems that don't run well on clusters can be run efficiently on clusters IF you move beyond 1 process per CPU. Load up 10, 20, 100, 1000 little workers on a processor. Get fast context switching between them (not OS level!). Use message passing rather than shared memory (locking, ick!) to communicate. One worker blocked waiting for network data? Process the next one! If you've tuned things right you'll find you always have work to do.

    Sounds crazy? Supercomputing '02 didn't think so: http://charm.cs.uiuc.edu/research/moldyn/

    --
    Slashdot Patriotism: We Support our Dupes!