Slashdot Mirror


Linux Clustering

An anonymous reader writes "Beowulf clustering turns 10 years old, and, in this interview, creator Donald Becker talks about how Beowulf can handle high-end computing on a par with supercomputers."

154 comments

  1. Can you imagine... by FortKnox · · Score: 5, Funny

    ... the amount of replies that will start with the same subject header as mine and not be funny at all?

    I sure can!

    --
    Good quote, too many chars. Seriously, the slashdot 120 char limit sucks!
    1. Re:Can you imagine... by Anonymous Coward · · Score: 2, Funny

      Not really (well apart from your post of course), tell me, how would it go?

      "Hey, can you imagine a beowulf cluster of er... beowulf cluters?"

      Doesn't quite work, you see.

    2. re: can you imagine... by ed.han · · Score: 2, Funny

      i want you to know that i seriously thought about doing that and having had the same thought, decided against it.

      now, imagine a beowulf cluster of slashdotters having that same thought... :>

      ed

    3. Re:Can you imagine... by Anonymous Coward · · Score: 0

      Oh just give it a fuggin' rest already.

  2. news? by dan2550 · · Score: 5, Insightful

    i dont mean to sound like a troll or anything, but is this really news. over the last year or so, (nearly) all of the articles on /. about fast computers have been clusters.

    1. Re:news? by strictfoo · · Score: 3, Funny

      Great news too: creator says creation is really good!

      In other news: "Ford says their cars are just as good as BMW's and Emachine states their computers rival Apple's"

      --
      I've just signed legislation that'll outlaw Russia forever. We'll begin bombing in five minutes.
    2. Re:news? by fymidos · · Score: 2

      Clusters is power for the people.
      It's something that simple people, without a white form, a microscope, and 500 million budget, can work on, and make better.
      Undoubtfully many advances that have been made for clustering, will be used in many other aspects of computing. Even supercomputer will benefit from them.

      --
      Washington bullets will simply be known as the "Bulle
    3. Re:news? by Anonymous Coward · · Score: 0

      Interesting, so you are saying that one day soon we will see a Beowulf cluster of Cray's?

    4. Re:news? by EvilAlien · · Score: 0, Offtopic
      I'm sure Emachines do rival Apple, at least in terms of desktop penetration.

      I'm also sure that Ford kicks BMW's ass in terms of popularity, at least in the USA.

      I'm going to have to work something about the prevalence of bad taste into my Stupid People Theory.

      --
      perl -e 'print $i=pack(c5, (41*2), sqrt(7056), (unpack(c,H)-2), oct(115), 10)'
    5. Re:news? by Anonymous Coward · · Score: 0

      Heh. Desktop penetration. Heh heh. Err... heheh.

  3. imagine by Anonymous Coward · · Score: 3, Funny

    awww, fuggit

    1. Re:imagine by shfted! · · Score: 1

      Hey, don't go making fun of my Apple IIe!

      --
      He who laughs last is stuck in a time dilation bubble.
    2. Re:imagine by blindbat · · Score: 1

      ... and I'll show you a safe Windows computer

    3. Re:imagine by caluml · · Score: 1

      One of the funnier posts I've seen on here... :)

    4. Re:imagine by djdavetrouble · · Score: 1

      Hey, don't you go awakening my Apple ][e lust! Now I am doomed to spend an hour on ebay drooling over them.

      --
      music lover since 1969
    5. Re:imagine by UnknowingFool · · Score: 2, Funny
      imagine a lone computer sitting by itself not connected to anything...

      With a little editing that could describe most /. readers:

      imagine a lone geek sitting by himself not talking to anyone...

      ;)
      --
      Well, there's spam egg sausage and spam, that's not got much spam in it.
  4. Candle Cluster by KrackHouse · · Score: 3, Funny

    I'm picturing the ten candles on the Wolf-cake in close proximity with frosting interconnects and one big flame in the middle.

    --
    What if Digg added local news and a Slashdot inspired comment karma system? ---
    http://houndwire.com
  5. Happy Anniversary... by crawdaddy · · Score: 5, Funny

    Happy Anniversary to the most over-used joke on Slashdot. I'll be wearing my tin-foil hat all day to commemorate it. (The 10th anniversary is the aluminum/tin anniversary)

    1. Re:Happy Anniversary... by Stillman · · Score: 1

      o/~ And the word that you wanted was Aluminium.

      ;)

      Damn merkins.

      --
      Prisoner #655321
    2. Re:Happy Anniversary... by p0 · · Score: 1


      I, for one.. * click *

      goatse * click *

      * click *

      I own a Cray, you insensi... * more clicks *

      shit, I need to update my jokes!

      --
      This is my sig. There are thousands more, but this one is mine.
    3. Re:Happy Anniversary... by jepaton · · Score: 1

      There is two varients of the word. In British English your version is correct, however like many words the US chose something different. They also pronounce the word differently. See Wikipedia: http://en.wikipedia.org/wiki/Aluminum

    4. Re:Happy Anniversary... by Anonymous Coward · · Score: 0
      What's more the Aluminum spelling has the benefit of Primacy. It's the Brits who decided to spell it differently than the discoverer.

      FTA
      In 1808 Humphry Davy originally proposed alumium for the name of this then-undiscovered metal, but four years later decided to change the name to aluminum
      If they'd wanted to keep their precious -ium ending they should have gone with the original. The way they spell it and sound it is unneccesaryary.
  6. Re:Obligatory... by grape+jelly · · Score: 1, Funny

    Wouldn't a beowulf cluster of beowulf clusters in fact just be a beowulf cluster? Hmm....

    Oh yeah and *groan* with regard to that joke! =-P

  7. BlueGene by a3217055 · · Score: 4, Interesting

    All this sounds good and Interesting, and Becker did a tremendous ammount of development in this field. But I was just wondering, what about supercomputers like BlueGene/L which have very fast interconnects. Many supercomputers/distributed systems run MPI based programmes and such programmes need a high interprocess commmunication does anyone one know how good these are in a Bewoulf Cluster? thanks a3217055 They said that of all the kings upon the earth he was the man most gracious and fair-minded, kindest to his people and keenest to win fame. :-The Geats' tribute to Beowulf after his death.

    1. Re:BlueGene by jamesdood · · Score: 5, Insightful

      Since I administer a fairly large cluster, I can say that the answer is "It depends" (Of course that is ALWAYS the answer!). It depends on the codes being run, it depends upon the interconnect optimization.(yes myrinet is fast, but the real key is that it has much lower latency and this has to be engineered carefully if using more than one switch) My cluster runs both myrinet and Gig/E, some codes run well on the the ethernet interfaces (take codes like mpiblast for instance) while others (NAMD comes to mind) run faster on the myrinet. However this machine may be fast, but I have some large SMP boxes (IBM P-series) that cycle for cycle SMOKE the performance of the x86 boxes. But you have to remember that the cluster computers cost about $3000 /node while the SMP boxes with a similar config cost about $13,000 apiece, and even more if you want a box that supports more than 8 CPUs (think 1 million and up)
      So once again, it comes down to the types of jobs, and how much you are willing to pay to get those jobs done in a hurry! A Cluster is still great, I have just completed some jobs that consumed over 12 years of CPU time, in 1 week of wall-clock time!

      --
      *narf!*
    2. Re:BlueGene by Christopher+Thomas · · Score: 2, Insightful

      All this sounds good and Interesting, and Becker did a tremendous ammount of development in this field. But I was just wondering, what about supercomputers like BlueGene/L which have very fast interconnects. Many supercomputers/distributed systems run MPI based programmes and such programmes need a high interprocess commmunication does anyone one know how good these are in a Bewoulf Cluster?

      Anywhere from "terrible" to "almost not bad", depending on how much you're willing to pay for the interconnect network. The point of Beowulf-style clustering is low cost/node, allowing scientific computing to be done with commodity hardware (unheard-of at the time). While using something like Myrinet instead of Ethernet, and careful topology layout, can bring you to the "almost doesn't suck" stage, you'll still suffer heavily in communications-bound problems.

      Fortunately, there are many interesting problems with low enough communications load to make commodity technology based clusters very, very useful.

    3. Re:BlueGene by Anonymous Coward · · Score: 0

      12 years=52*12 weeks=624 weeks. That's 624 nodes, assuming 1:1 parallelisation (which is ridiculous). Using your quote of $3K apiece, that's almost two fragging million dollars! Probably closer to 1K nodes, which gives a nice round $3*10^6.

      Fairly large, indeed!

      mpiblast... NIH? Celera?

    4. Re:BlueGene by psavo · · Score: 1

      I doubt it's UP, most probably 2xSMP, so 312 nodes and about $1M. Which sounds reasonable for 12 years worth of computing in two weeks.

      --
      fucktard is a tenderhearted description
    5. Re:BlueGene by flaming-opus · · Score: 1

      Blue Gene is an extremely clever design in that it uses several interconnect networks all at once. The main memory-memory interconnect is a packetized load-store interconnect arranged in a 3D mesh. Each node also has an ethernet tap for the management network, and a very wide tree network for all-reduce calls. They built their networks with MPI in mind.

      The difference between a commodity cluster and something like blue-gene is only a half-step. The codes that run well on blue-gene are MORE like the codes run on clusters than those on a traditional vector super. The CPUs, memory controllers, etc, etc. are commodity parts from the microcontroller world, and they work on high compute, low memory-bandwidth tasks with moderate inter-node communication needs. Blue Gene will likely come in at costs somewhere between those of clusters and those of vector supers, or even traditional MPPs like the altix.

    6. Re:BlueGene by fitten · · Score: 1

      I'd mod the parent up if it weren't at 5 already :)

      He's pretty much dead on the money there. "Beowulf" in the strictest sense doesn't have Myrinet though, only commodity parts like 100 or 1000 Mb Ethernet. In these configurations, any latency bound application will be horrible (typically fine grained parallelism, lots of messages, typically small, being transferred). The latency of 1GbE vs 100MbE isn't that much different and both are an order of magnitude or more slower than Myrinet or any of the high performance interconnects.

      Having ported MPICH to the T3D and the T3E in the past, as well as ported MPICH to a number of other platforms and written an MPI (and MPI-2_ implementation from scratch and ported it to many different platforms, I can say that clusters do some things well (as the above poster) such as the somewhat trivial benchmarks of the Top500.org lists and some applications that are similar to that in communication needs. However, there are many problems that run much better on Myrinet and even much better on higher speed interconnects.

      You still gotta have some love for something like 6.5usec latency for small messages and 325MB/sec (yes, megaBYTES per sec) bandwidth, even on a supercomputer that is 10 years old (Cray T3E). And there are four such links off of each Cray T3E node. Source and shameless plug.

  8. Beowulf seems older than that by Weaselmancer · · Score: 4, Funny

    ...doesn't it to you? I mean how long have you been sick of the "imagine a beowulf cluster of those" comments? Doesn't seem like only 10 years would make me that sick of it.

    --
    Weaselmancer
    rediculous.
    1. Re:Beowulf seems older than that by lpangelrob2 · · Score: 2, Funny
      I mean how long have you been sick of the "imagine a beowulf cluster of those" comments? Doesn't seem like only 10 years would make me that sick of it.

      Depends... it only took 4 weeks for Floridians to get real sick of hurricanes. :-)

    2. Re:Beowulf seems older than that by Zerbey · · Score: 2, Interesting

      Depends... it only took 4 weeks for Floridians to get real sick of hurricanes. :-)

      About 24 hours, actually. The first night was ok because the power had only gone out a few hours previously and it was still really windy out. As a result, the house remained cool.

      By the following night, the winds where gone, the house had been without an air conditioner for 24 hours and it was really, really humid. After 7 days, there are no words to describe how thoroughly fed up I was with Hurricanes :)

      Then, two weeks later, Frances knocked my power out for 8 days. Again. Argh!!!

    3. Re:Beowulf seems older than that by Anonymous Coward · · Score: 0
      So the worst thing about the hurricane for you was that your aircon stopped working.

      Holy shit, send in the red cross.

  9. Re:Obligatory... by saudadelinux · · Score: 1, Redundant

    I don'wanna! Won't an infinite number of /. readers imagining an infinite number of variations on Beowulf cluster jokes topple all the turtles?

    --
    I didn't think the house band in Hell would play this badly.
  10. Passé? by Kurt+Wall · · Score: 2, Interesting

    Could it be that Beowulf clusters, however cost-effective and powerful they have become, are passé now that most universities and research institutions have some sort of COTS-based high-performance computing solutions? Not that Beowulf isn't cool - it is - it just doesn't seem as cool as it used to.

    1. Re:Passé? by maxwell+demon · · Score: 1

      You mean, Beowulf is dying?

      --
      The Tao of math: The numbers you can count are not the real numbers.
    2. Re:Passé? by Fallen+Andy · · Score: 1

      Cool. Hmm. Let me see , 200-300W per node * how many nodes. Hot . Very hot.

      It's possible (given how powerful GPU's in graphics cards have become) that one day we will get to see
      *smaller* clusters as all of that "wasted" power in the GPU gets reused for crunching.

      But, Don Becker didn't invent this stuff. If anything
      I'm more grateful that he was masochistic enough to practically be a one man code engine creating all of the ethernet support for linux...

      The first "cluster" I read about was one in Byte
      a long long time ago which ironically used four
      Apple Mac classics tied together with (I'd guess)
      AppleTalk (yucky huh?) and at the time I happened
      to be "in a position of power" (cough - well really second only to the CTO) and the BOFH on
      our Novell network (re read last bit (I was the B))

      I did think of hacking some TSR's to run on all of
      the client machines (c.a. 100) to turn our net into a cheapo cluster. Decided not to - 286's weren't exactly Crays (even if you had a boatload of them...).

  11. Imagine All... by grunt107 · · Score: 1, Funny

    In honor of the Beatles/Apple topic a while ago, I was inspired to alter the heading slightly (sorry).

    Imagine all the clusters
    It's easy if you try
    With Linux on them
    Computing Particles in Sky

    Imagine all Beowulf
    Crunching in harmony (AhIahahah...)

    1. Re:Imagine All... by jpmkm · · Score: 3, Interesting

      Though John Lennon was a Beatle, Imagine is not a Beatles song.

    2. Re:Imagine All... by Anonymous Coward · · Score: 0

      Try to Imagine that you have a sense of humor and could allow a little poetic license.

    3. Re:Imagine All... by Anonymous Coward · · Score: 0

      I am Yoko Ono, you insensitive crod!!!!!!!!!!

  12. The choice between Beowulf and Big Iron... by corvair2k1 · · Score: 4, Insightful

    ...can be simple. The more complex a problem gets, the more likely you need one supercomputer as opposed to a cluster. It's not elitism, it's just that the problem will probably require a lot of communication between processors.

    Any kind of networking solution between computers will never be as fast as a hard-wired bus can be. If a lot of communication between nodes is required, you will spend more time waiting than computing, which shoots efficiency to hell.

    1. Re:The choice between Beowulf and Big Iron... by monoi · · Score: 5, Insightful

      The more complex a problem gets, the more likely you need one supercomputer as opposed to a cluster.

      I'm not sure it is that simple. For some problems (e.g. Monte Carlo simulations), a more complex problem means more individual nodes are required, with very little inter-node communication. For other kinds of problem (finite element methods, maybe?), you're probably right.

      In other words, the physical structure of the solution depends on the kinds of algorithms that you intend to run: there's not just one `correct' answer.

    2. Re:The choice between Beowulf and Big Iron... by Psycho77 · · Score: 0, Redundant

      The more complex a problem gets, the more likely you need one supercomputer as opposed to a cluster.

      Yes, but imagine a Beowulf cluster of supercomputer ! =P

    3. Re:The choice between Beowulf and Big Iron... by Anonymous Coward · · Score: 0

      what makes you think MC requires little inter-node communication?
      that might be true for naive 'pure' MC, but
      no-one does pure MC any more, they do quasi-mc with heaps and heaps of variate control mechanisms and fancy sampling weights. some things that start out as an mc can get so fully controlled and use such simple low discrepancy sequences that often what people call MC is hardly an MC at all. they get closer and closer to a very complex, fancy-schmancy single path. at which point they are an analytic approx.

      beacsue, the more complex it gets, the more you need one supercomputer.

  13. imagine by justforaday · · Score: 5, Funny

    imagine a lone computer sitting by itself not connected to anything...

    --
    I'll turn into a supernova and burn up everything. Well I'll turn into a black little hole and you'll turn into string.
  14. Winterware by Eberlin · · Score: 2, Interesting

    If you could imagine a...ok, well maybe after ten years, we all could. The horse has been so beaten and tenderized that even takko vell wants a piece of the action.

    I've never seen a beowulf cluster personally. I've never run anything on one. However I do know that it made "supercomputing" more affordable. That in itself is a feat -- and a primary goal of most Open Source software. A proverbial "Hats off" to the open source hackeres out there. Thanks...and keep hacking.

    Now if I can gather enough old 486 machines, will I be able to run Doom III? With WINE? At least it'll keep me warm during the winter months.

    1. Re:Winterware by oberondarksoul · · Score: 1

      While with a lot of 486s you would have a fair amount of processing power, if a program attempts to use instructions introduced into later CPUs, chances are it'll not work.

      --
      And tomorrow the stock exchange will be the human race
    2. Re:Winterware by Anonymous Coward · · Score: 0

      smarten up both of ya'll. Run vmware over the beowulf cluster, now you just need to hack a driver to to use you second bc for the videocard!

      Tim

  15. In Soviet Russia... by halivar · · Score: 2, Funny

    In Soviet Rus-- oh damn. I pulled out the wrong dead horse.

  16. Yes I can! by shfted! · · Score: 0

    And I can already see the beowulf cluster of responses like yours ;)

    --
    He who laughs last is stuck in a time dilation bubble.
  17. On par? Yes and no by grape+jelly · · Score: 5, Informative
    Beowulf clusters have never been the fix-all solution to pricey supercomputer needs. Traditional mainframe supercomputers will forever have their niche in computing that can't just be muscled through sheer volume of vector processes (i.e., processes in which good latency is essential). Even the creator of the Beowulf cluster agrees:

    Quote from the article: *snip!*
    Supercomputer vendor Cray has created a new product that is designed to compete with some Linux clusters. Cray Canada CTO Paul Terry said that Linux clusters really can't compare to a supercomputer. What is your take on Cray's moves against Linux?

    Becker: They are simultaneously saying that Linux clusters are not high-performance computing systems while introducing a product to compete with Linux clusters. They clearly saw that a large part of their customer base was moving toward commodity clusters, Beowulf-class clusters, to do high-end computing.

    Clusters can't replace all of the workload being done by supercomputers today, but it can replace the bulk of the traditional vector supercomputers. There is always that 10% of the market that won't run well on clusters, and that is the market that Cray is in. We are happy to solve most of the problems of the world and run most of the applications and play in our part of the marketplace.
    1. Re:On par? Yes and no by EnderWiggnz · · Score: 2, Informative

      yeh - sometimes, you actually need the pure performace that a vector processor will give you, without the initial overhead of paralelizing a process to run loosely-coupled on a beowulf.

      --
      ... hi bingo ...
    2. Re:On par? Yes and no by Anonymous Coward · · Score: 0

      Anybody who says "mainframe supercomputers" is talking out of his ass. A mainframe is not a supercomputer. And no, it's not a Solaris box either.

    3. Re:On par? Yes and no by bobbozzo · · Score: 1

      Cray's new product IS a Linux cluster!

      Only advantage it (currently) has is a custom HyperTransportInfiniband bridge.

      And yes, Cray is trying to claim it's different than a "Linux cluster".

      --
      Nothing to see here; Move along.
  18. Hail Clusters! by olclops · · Score: 0, Redundant

    I for one welcome our new beowulf cluster of these linux-running overlords.

  19. Just out of curiosity... by Krypto420 · · Score: 2, Interesting

    I was wondering if it is possible to make some sort of cluster out of old computers I have lying around? Nothing spectacular, just hooking up 3-4 old P2's to make a game server or something of the sort. Is there software out there to do this?

    Has anyone had any experience with this?

    Just a thought...

    1. Re:Just out of curiosity... by Anonymous Coward · · Score: 0

      No. That's absolutely impossible. We'll need a major breakthrough in our understanding of physics and the universe before you're able to do what you are asking about.

    2. Re:Just out of curiosity... by Anonymous Coward · · Score: 1, Informative

      ClusterKnoppix may be just what you're looking for...

    3. Re:Just out of curiosity... by Krypto420 · · Score: 1
      ClusterKnoppix [bofh.be] may be just what you're looking for...

      Cool... thanks. Thats exactly the kind of thing I was looking for. Does anyone have any experience with this? I was wondering what kinds of applications would benefit from a small cluster of relatively slow processors? For example, what single processor would be equivalent to a cluster of 3 P2 300MHz? It sounds like it could be a fun cheap project and a cool way to see how this stuff works on a small scale.

    4. Re:Just out of curiosity... by kst · · Score: 2, Informative

      You might take a look at Rocks:
      http://www.rocksclusters.org/.

      Quite a few people have built Rocks clusters out of a bunch of old computers.

      Disclaimer: I work with the folks who created this.

    5. Re:Just out of curiosity... by CommieOverlord · · Score: 1

      What single processor would be equivalent to a cluster of 3 P2 300MHz?

      May a 500-600 MHz CPU. Tops. Seriously. Mainly because of the following - that old computer probably has a 10Mbit NIC, maybe 100Mbit. Can you say latency?

      If the game server you want to run is multi-threaded then you _might_ be able to run different threads on different nodes (using say OpenMosix). It'll probably be slow as hell because of the latency. Probably slower than running it on machine.

      Look, clusters are good for running parallel applications. Nothing else. If you don't have any parallel applications don't bother. It's not going to speed up your game server, or your game client, or your OS.

      And look at it this way too, it costs around $150-200 a year in electricity to run a computer. You could buy a single CPU machine that's faster than the combined speed of your three computers and still save money because of electricity.

  20. actually... by ed.han · · Score: 2, Interesting

    what is the most over-used joke on slashdot? options:

    1. step 1: [topic], step 2: ?, step 3: profit!
    2. natalie portman/hot grits
    3. in soviet russia, [inversion of topic] you!
    4. cowboy neal [action]
    5. beowulf cluster
    6. goatse guy
    7. [technology/entity] is dying!

    just curious...

    ed

    1. Re:actually... by XaviorPenguin · · Score: 3, Funny

      8. ???
      9. Profit!!

      --
      Friends help you move...
      REAL Friends help you move dead bodies... ^_^
  21. oh great... by Anonymous Coward · · Score: 0
    a story on how linux clustering is good.

    I'd predict this thread will degenerate, but where would it go from here? :)

  22. How do you mean, fast? by raehl · · Score: 2, Funny

    Clusters are the slowest computers available...

    If your metric is moving around data, as opposed to how many no-ops you can do a second while waiting for your data to get there.

    1. Re:How do you mean, fast? by Mark+of+THE+CITY · · Score: 1

      It's fast only if the problem being solved is, as they say on comp.parallel, embarassingly parallel. IBM mainframes are likely to still be the I/O champs.

      --
      The clearance system sounds logical. It is not. It is completely arbitrary. -- John Bolton
  23. It's not just about speed and massively parallel by Wizzy+Wig · · Score: 5, Informative

    processing...

    To be considered a "supercomputer," it also needs enough CONTIGUOUS MEMORY SPACE to hold the massive amounts of data associated with true "supercomputing." So far, no cluster has met that requirement.

  24. How fast will it run Doom3? by ARRRLovin · · Score: 2, Interesting

    That is what I want to know!

    In all seriousness though, what is the ratio of cluster to big iron in supercomputing nowadays? I know a clusters can scale out to a lot of FLOPS, but what is the highest FLOPS processor available?

    --
    -Randy
    1. Re:How fast will it run Doom3? by Anonymous Coward · · Score: 0

      I know a clusters can scale out to a lot of FLOPS, but what is the highest FLOPS processor available?

      I hear the Itanium processor is one massive FLOP. A cluster of them...

  25. New coolness by gowen · · Score: 1

    "Imagine a Beowulf cluster..." jokes are passe. The really hip respond to mentions of Donald Becker's name with oblique references to Steely Dan records.

    --
    Athletic Scholarships to universities make as much sense as academic scholarships to sports teams.
    1. Re:New coolness by Anonymous Coward · · Score: 0

      Actually, the two guys who are Steely Dan are DONALD Fagen and Walter BECKER....

      "...you crossed the diamond with the pearl, you turned it on the world, and then you turned the world around..." or something like that...

  26. Imagine.... by drkich · · Score: 5, Funny

    Imagine there's no cluster,
    It's easy if you try,
    No adapter below us,
    Above us only loopback,
    Imagine all the computers
    computing for themselves...

    Imagine there's no internet,
    It isnt hard to do,
    Nothing to download or upload for,
    No porn too,
    Imagine all the computers
    computing pi in peace...

    Imagine no tokens,
    I wonder if you can,
    No need for ethernet or tcpip,
    A brotherhood of computer,
    Imagine all the computers
    Sharing nothing at all...

    You may say I'm a dreamer,
    but I'm the only one,
    I hope some day you'll leave us,
    And the computers will computer alone.

  27. Cluster Schedulers by Anonymous Coward · · Score: 2, Informative

    And GridEngine is free and opensource:

    http://gridengine.sunsource.net/

  28. Otherwise known for... by jaylee7877 · · Score: 2, Informative

    Donald Becker also has done a large amount of work on Linux Network drivers. Grep through linux/drivers/networking and you'll find he's done work on Intel NICs, Realtek 8139s, even the ne2000 (I think he said he puked a few time while working on that one). Thanks for all your hard work Donald!

  29. newbie question re: ease of management by SABME · · Score: 2, Interesting
    A question for beowulf-savvy folks:

    At the end of the article, the comment is made that one reason for setting up a cluster is ease of management (for updates, applications, etc.). Can anyone with experience comment on whether this is true or not, with the way clustering exists today? I have no experience at all with cluster, and I'm wondering if this is something I should look into to ease administrative burdens?

  30. Re:Obligatory... by maxwell+demon · · Score: 3, Funny
    I know... I know... Imagine a Beowulf cluster of these!

    Ok ... but does it run Linux?
    --
    The Tao of math: The numbers you can count are not the real numbers.
  31. help me out by Anonymous Coward · · Score: 0

    Can't you just put a big mothertrucking hard disk on one of them and NFS mount it?

    1. Re:help me out by ghostlibrary · · Score: 1

      >Can't you just put a big mothertrucking hard disk on one of them and NFS mount it?

      Problem then is you're limited to a 'memory speed' equal to NFS. Clusters already want the fasted network connection possible. Making cache available only via internet speeds with cat5 speeds, ugh.

      That said, it's always useful to have a big mothertrucking hard disk mounted via NFS, mostly for when the run is done and you want to gather/centralize the data or post-data visualization products.

      Half of my cluster hassles are "okay, the run is done, now how the hell do I get my data to useful form?"

      --
      A.
  32. Lets imagine the oposite by genner · · Score: 1

    Imagine all the pathetic individual computers you'd have if you take apart a bewoulf cluster.
    I think they did this in soviet russia.

  33. openMosix by 241comp · · Score: 2, Informative

    Beowulf isn't the only game in town folks. A much easier to maintain and balance cluster can be built using openMosix. openMosix is a single-image-cluster extension for Linux.

    1. Re:openMosix by wertarbyte · · Score: 1

      OpenMosix is quite cool, I tried it once and it really impressed me. Although there are no paches for 2.6, does anyone know where the development of this great project is heading?

      --
      Life is just nature's way of keeping meat fresh.
    2. Re:openMosix by skelley · · Score: 1

      Don't forget OpenSSI as well. This is a single-system-image clustering product with a long lifespan and great support within HP.

    3. Re:openMosix by Anonymous Coward · · Score: 0

      openmosix is nice, but the shared-memory facilities are lacking (and still considered experimental patches? I should check but I'm so lazy).

      openssi is an contradictio in terminus: it claims to aim for single-system image clustering by adding even more bloat to the horrible linux kernel... go figure.

      what is seriously lacking in both beowulf and openmosix are serious queueing and batch control mechanisms, you only find those for mpi/pvm.

    4. Re:openMosix by Anonymous Coward · · Score: 0

      bproc also provides a SSI cluster similar to openMosix, etc... In fact, it's what Scyld is based on.

    5. Re:openMosix by drinkypoo · · Score: 1

      If you want queueing and batch control, I suggest that you check out DQS, the original clustering software for Unix. DQS is actually a job processing system, it certainly doesn't provide a single-image system, and it can only take batches and has no means to manage IPC. I suspect, however, that DQS would admirably provide your batching system, while Mosix did clustering.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    6. Re:openMosix by 241comp · · Score: 1

      1. This is true - does beowulf have shared memory capabilities I wasn't aware of?

      2. Not even worth...

      3. MPI/PVM and openMosix are like bread and butter - they compliment each other to use the available resources in the most efficient manner.

      Did I just get trolled?

    7. Re:openMosix by 241comp · · Score: 1

      If you're interested in the future development of oM then check out http://openmosix.sourceforge.net/plan.html. This shows that 2.6 patches are planned within the next 6-12 months. oMFS is coming out and oGFS, Lustre and PVFS are replacing it. SHM support is to be stabilized. Usability will really make leaps and bounds in the next year if this plan works out.

  34. Re:It's not just about speed and massively paralle by vsack · · Score: 2, Informative

    There are certain classes of problems that clusters don't map to well. Applications with a very high cost of inter-processor comminucation or that demand a huge piece of contiguous memory are probably always going to be outside the realm of clusters.

    However, problems that are embaressingly parallel can be handled by a cluster very adequately for a fraction of the cost of a traditional supercomputer. I don't know that you can ignore this class of problems and say that clusters aren't "true 'supercomputing'".

  35. Re:It's not just about speed and massively paralle by joib · · Score: 1


    To be considered a "supercomputer," it also needs enough CONTIGUOUS MEMORY SPACE to hold the massive amounts of data associated with true "supercomputing."


    Well, that's one way of seeing it I guess. A way not shared by most people in supercomputing, I might add.

  36. Thank you Donald by martser · · Score: 1

    He has been such a huge contributer to the Linux platform.

    Thanks,
    Eric

    1. Re:Thank you Donald by nblender · · Score: 1

      Well, maybe. Have you ever been paid to fix his write-only code? The man has never heard of #define.

  37. loaded question by flaming-opus · · Score: 1

    Cray has sold linux clusters before, and now have 2 products that use linux in some way. (Red Storm and the XD1) They even have done some experiments runnint linux on the X1 vector supercomputer. Cray certaintly isn't making moves against linux. They would just prefer you to run linux on their mpp box, rather than a rack full of Dells.

    Mr. Becker has an interest in you using a penguin computing setup, rather than either Dell, or Cray. I must, however, admire the way he didn't get sucked into the interviewers desceptive question.

  38. Why Beowulf? by Trogre · · Score: 2, Insightful

    If you maintain a group of networked but otherwise independent computers for example a student lab or office farm, consider deploying something like PVM or MPI. It's a great way to get some use out of those idle cycles.

    PVM at least scales incredibly well: 25 machines rendering a povray scene take just a fraction over 1/25 the time taken to render it on one machine. I haven't tested MPI yet.

    --
    "Nine times out of ten, starting a fire is not the best way to solve the problem." - my wife
    1. Re:Why Beowulf? by Anonymous Coward · · Score: 0

      dude, do you even know what mpi is? installing it isn't going to magically make a bunch of networked computers start crunching away at their tasks. mpi in a clustered environment is just a useless layer on top of your ip stack.

    2. Re:Why Beowulf? by pkhuong · · Score: 1

      Raytracing is one of those applications that are incredibly trivial to parallelize. I wouldn't dare hope that your average application will have performance close to that.

      --
      Try Corewar @ www.koth.org - rec.games.corewar
    3. Re:Why Beowulf? by Trogre · · Score: 1

      That's why you write your own cluster-able code to use the Message Passing Interface, as opposed to writing it to use PVM.

      --
      "Nine times out of ten, starting a fire is not the best way to solve the problem." - my wife
    4. Re:Why Beowulf? by drooling-dog · · Score: 1
      ...consider deploying something like PVM or MPI.

      Back in the mid-90s we were using PVM (on Solaris boxes) in sequence (DNA/protein) similarity search applications, among other things. It scaled very nicely, provided the target sequence database was distributed across the network also. Very easy to implement and not too difficult to administer, either.

    5. Re:Why Beowulf? by joib · · Score: 1

      Well, duh...

      Most clusters are used to run MPI applications.

      There's no hard line between "cluster" and "networked independent computers". If you wan't to make some distinction, it could be that cluster nodes are pretty homogenous, and the cluster has a dedicated network instead of just using the office LAN.

  39. Re:It's not just about speed and massively paralle by hackstraw · · Score: 1

    Huh? Contiguous in what sense? Attached to the same motherboard? In one DIMM? Addressable in one chunk by the OS?

    I've only been to one supercomputing conference, but when I was there most all of the people there ran clusters and the top500 site (although this list is produced by the same supercomputing conference people) lists many clusters there.

    In other words, where does this contiguous memory requirement come from?

  40. premature by commodoresloat · · Score: 1
    The beowulf cluster itself has the 10 year anniversary today. The over-used joke doesn't have its anniversary until ...

    (checking calendar)

    tomorrow.

  41. A few Slashdot clichés: by Sean+Johnson · · Score: 5, Funny

    - Can you imagine a Beowulf cluster of these?
    - How long until the RIAA sues them into oblivion once they find out how may MP3's you can put on one?
    - "Can you put Linux on it?" or "Yes, but will it run Linux?"
    - "Yeah, but does it run Doom3?" or "And it still won't run Doom3."
    - Any comment regarding "Duke Nukem Forever" taking literally 'forever' and being termed 'vaporware'.
    - I am not buying one until they support ".ogg".
    - I for one welcome our new (insert name of company mentioned in post or story) overlords.
    - "George Lucas raped my childhood" or "Greedo shoots first" comments on any story incorporating the Star Wars franchise.
    - A comment including these 3 components in any order: Natalie Portman, naked and petrified, hot grits, one's pants.
    - Microsoft = Evil, MPAA = Evil, RIAA = Evil; with anything else incorporated to try and fit those equations into the topic at hand
    - Some type of reference to the size of one's ProN collection, the amount of ProN that can be stored on the gadget or technology in question, or the ProN industry itself being the first to make good sue of the new technology or gadget in question (ergo: the ProN industry drives technology)
    - The posted cliché being self-described as an "obligatory" post in the heading area if that particular cliché had not been addressed yet by previous slashdotters. (e.g. "obligatory Beowulf cluster comment")
    - Post revealing the fact that the story's homepage had been slashdotted already, culminating towards another post later on with the homepage story itself being copied & pasted verbatim (often with a subsequent post purporting that this is karma whoring, even though the poster admits it is indeed helpful anyways.)
    - Remark on the size of some new storage advancement about how many LOC's (Library of Congresses) can fit on it, or any other remark noting how this can be an actual valid unit of data storage measurement.
    - A variation of the Zero Wing video game intro dialogue regarding it's broken English translation: "Someone set up us the base....we have every ZIG, make your time".....blah, blah, blah.
    - Very soon lists such as this will be clichés as well.
    - Similarly noted and additional clichés may be found here: http://en.wikipedia.org/wiki/Slashdot_subculture

    --
    >>>>>> Chewie, take the professor in the back and plug him into the hyperdrive.
    1. Re:A few Slashdot clichés: by ProfessionalCookie · · Score: 0

      - Can you imagine a Beowulf cluster of these?
      - How long until the RIAA sues them into oblivion once they find out how may MP3's you can put on one?
      - "Can you put Linux on it?" or "Yes, but will it run Linux?"
      - "Yeah, but does it run Doom3?" or "And it still won't run Doom3."
      - Any comment regarding "Duke Nukem Forever" taking literally 'forever' and being termed 'vaporware'.
      - I am not buying one until they support ".ogg".
      - I for one welcome our new (insert name of company mentioned in post or story) overlords.
      - "George Lucas raped my childhood" or "Greedo shoots first" comments on any story incorporating the Star Wars franchise.
      - A comment including these 3 components in any order: Natalie Portman, naked and petrified, hot grits, one's pants.
      - Microsoft = Evil, MPAA = Evil, RIAA = Evil; with anything else incorporated to try and fit those equations into the topic at hand
      - Some type of reference to the size of one's ProN collection, the amount of ProN that can be stored on the gadget or technology in question, or the ProN industry itself being the first to make good sue of the new technology or gadget in question (ergo: the ProN industry drives technology)
      - The posted cliché being self-described as an "obligatory" post in the heading area if that particular cliché had not been addressed yet by previous slashdotters. (e.g. "obligatory Beowulf cluster comment")
      - Post revealing the fact that the story's homepage had been slashdotted already, culminating towards another post later on with the homepage story itself being copied & pasted verbatim (often with a subsequent post purporting that this is karma whoring, even though the poster admits it is indeed helpful anyways.)
      - Remark on the size of some new storage advancement about how many LOC's (Library of Congresses) can fit on it, or any other remark noting how this can be an actual valid unit of data storage measurement.
      - A variation of the Zero Wing video game intro dialogue regarding it's broken English translation: "Someone set up us the base....we have every ZIG, make your time".....blah, blah, blah.
      - Very soon lists such as this will be clichés as well.
      - Similarly noted and additional clichés may be found here: http://en.wikipedia.org/wiki/Slashdot_subculture

    2. Re:A few Slashdot clichés: by MacJedi · · Score: 1
      - A comment including these 3 components in any order: Natalie Portman, naked and petrified, hot grits, one's pants.
      Actually, the Natalie Portman and Hot Grits memes were not originally connected (although they did seem to arise at about the same time...)
      --
      2^5
    3. Re:A few Slashdot clichés: by raffe · · Score: 1

      ...and dont forget:
      BSD is dying!!

    4. Re:A few Slashdot clichés: by lewp · · Score: 1

      It's p-r-ZERO-n.

      pr0n.

      --
      Game... blouses.
  42. On Par Or No, Vector CPUs Got Cluster-fucked by cmholm · · Score: 1
    For around about the last five years, it didn't matter whether clusters were the best or even a reasonable solution to a number of problems, which I'm not sure was only 10% of the then market.

    AFAIK, the cluster proponents sold the NSF and the DoD's HPC office on the idea that they would solve the limitations of "pile of PC" systems in software, the result being that both organizations basically mandated clusters for all new projects. Imagine the CIO of an aerospace firm requiring WinNT henceforth for any application of computing resources and you get the idea.

    The net result was that advances in vector computing stopped in the US. SGI sold a handful of existing Crays, but that was it. Otherwise, you needed to go to NEC, Fujitsu, or Siemens for iron. The only reason Cray has even a prayer is because there are one or two agencies of the US Federal government that need up to date vector processing, and ain't gonna buy from a foreign supplier that may either cut 'em off or screw with the systems for political reasons.

    So, while on one hand Becker and his buddies provided a great service to the difusion of parallel applications to a wider market, their politicking to be the high end solution end-all be-all moved the development and manufacture of high-end systems offshore. Becker is the Sam Walton of parallel computing. Good job, brother.

    --
    Luke, help me take this mask off ... Just for once, let me butterfly kiss you with my own eyes.
  43. copyright violation by commodoresloat · · Score: 1

    I'm pretty sure Steve Jobs owns that song.

    1. Re:copyright violation by Anonymous Coward · · Score: 0

      Doesn't matter... Parody is fair-use.

  44. FUD, FUD, and wrong. Crey stockholder eh? by Rhys · · Score: 4, Informative

    Sad to see this little knowledge about parallel computing on slashdot: blatantly wrong information marked as informative. +5 no less.

    Let's address this first: there are two common memory architectures, distributed memory (a cluster) and shared memory (a 'traditional' supercomputer). Each can emulate the other. Saying a cluster doesn't have enough memory, presumably at each node, is really saying: "I don't really understand message passing."

    This would be more important if datasets were actually large. Unfortunatly for your argument they aren't. A handfull of nodes and they'll hold the whole simulation easily in memory (albeit it'd take years to run because there's so few CPUs at work.)

    How would I know? Well, I work with the Center for Simulation of Advanced Rockets aka CSAR at UIUC, one of five DoE ACSI sites in the country. I manage their supercomputer, which is getting upgraded from 200 P3-class dual proc PCs to 640 dual proc Xserve G5s. Before that I was a grad student working with them, albeit not on the CSAR simulation but instead on a related grant, the CPSD.

    Now, there are computing problems which clusters aren't good at (or at least that's the traditional claim. My master's thesis and advisor would seem to dispute that this is actually the case.) However, most problems as the interview says, run just fine on clusters. Physical simulations (which covers CSAR's rockets to the national labs nuclear weapon research to hurricane/weather simulation, all the way down to protein folding and atomic and sub-atomic scale crystal formation simulation) need to know about what's in the area you're working on, and what's in nearby areas.

    Occasionally you'll find an oddball like galactic simulation (or molecular dynamics) that needs to compute gravity across the whole universe. Fortunatly we have multigrid methods and a friendly gravity equation to solve this problem: get real data from those near you. Average those far from you and use that instead.

    Then of course there's the idea that even "traditional" supercomputer problems that don't run well on clusters can be run efficiently on clusters IF you move beyond 1 process per CPU. Load up 10, 20, 100, 1000 little workers on a processor. Get fast context switching between them (not OS level!). Use message passing rather than shared memory (locking, ick!) to communicate. One worker blocked waiting for network data? Process the next one! If you've tuned things right you'll find you always have work to do.

    Sounds crazy? Supercomputing '02 didn't think so: http://charm.cs.uiuc.edu/research/moldyn/

    --
    Slashdot Patriotism: We Support our Dupes!
  45. You did not take this thought to the end... by PaulBu · · Score: 1

    The codes that run well on blue-gene are MORE like the codes run on clusters than those on a traditional vector super.

    And if you code your application for MPI you can debug/test/optimize it on a cheap cluster. THEN when you start running into communication latency and problems too large to be solved on commodity hardware you can recompile your code on big(ger) iron, like Blue Gene/L.

    Paul B.

  46. clusterknoppix by Yenhsrav_Keviv · · Score: 2, Informative

    ClusterKnoppix is pretty cool. Its got all the auto detect hardware features of regular knoppix, and also aautomatically adds itself to the cluster.

  47. And quality? by Gordonjcp · · Score: 0, Offtopic

    Although to be honest, any BMW made since the mid-90s has been an utter piece of crap. Not even BMW garages will touch them. The 735i with that fucking stupid iDrive thing just takes the Chocolate Homewheat, it really does. What a bag of shit.

    1. Re:And quality? by strictfoo · · Score: 0, Offtopic

      The 735i with that fucking stupid iDrive thing

      that's the 745i, not the 735i, I agree with you that it's a piece

      Although to be honest, any BMW made since the mid-90s has been an utter piece of crap.

      I know the new thing is to bash BMW and Mercedes (well, not really new, but gaining popularity). That's fine. I never was a big BMW fan either until I got to drive one a lot.

      And, having spent a lot of time driving both a 1999 and 2003 BMW 540i I would have to whole heartedly disagree with you.

      0-60 in 5.9 seconds. Quiet as can be. Neither one has had any problems.

      And the M5 - 0-60 in 4.6 seconds. Just beautiful.

      --
      I've just signed legislation that'll outlaw Russia forever. We'll begin bombing in five minutes.
    2. Re:And quality? by Anonymous Coward · · Score: 0

      Apparently, that's what happens when a computer company takes over a car company. All styling and stuff.

  48. Re:Obligatory... by gmf · · Score: 2, Funny

    In Soviet Russia, Beowulf Cluster imagines you!

  49. Why use beowulf? by axehind · · Score: 2, Interesting

    Why use Beowulf when you have openMosix? openMosix is all transparent to your application. You dont have to worry about remote execution, openmosix migrates your process automatically to the best node. Like I said.. it's all transparent and requires no additional programing in your application.

  50. Rocks by kst · · Score: 3, Interesting

    Rocks provides an easy way to build a Beowulf cluster. See http://www.rocksclusters.org/.

    You can build a working cluster, starting with the hardware and installation CD-ROMs, in minutes; see http://servers.linux.com/servers/04/08/27/1943227. shtml?tid=29&tid=94 for one account.

    Disclaimer: I work with the folks who created Rocks.

  51. (obligatory) by roesti · · Score: 1

    Is it running Linux?

  52. Windows NT error number 2 occurred. by Anonymous Coward · · Score: 0

    Error Occurred While Processing Request

    Error Diagnostic Information

    An error occurred while attempting to establish a connection to the service.

    The most likely cause of this problem is that the service is not currently running. You can use the 'Services' Control Panel to verify that the service is running and to restart it if necessary.

    Windows NT error number 2 occurred.

  53. ... right .... by Anonymous Coward · · Score: 0

    In Soviet Russia our linux loving Beowulf cluster overlords managing an extremely large collection of Natalie Portman pr0n, a copy of Doom 3 and my streaming MP3's (ya boo sucks to you RIAA) imagines you !!!

  54. This is disturbing... by Niet3sche · · Score: 1

    Because it looks like NONE of us remember Amoeba. Amoeba was a free* download, and could be used as an add-on to a *nix filesystem - I think, at the time, I was using Unixware7.0 or somesuch.

    As for me, I'd say that Donald's larger and more reaching contribution to *nix would be the network drivers that he wrote (3cX0Y cards, and the Tulip drivers come to mind).

    * "Free", here, as in beer definitely, and speech likely. I think that the source was provided, but cannot remember.

  55. Fast isn't everything. by Gordonjcp · · Score: 0, Offtopic

    The ride quality is abysmal. The least little bump or ripple in the road is transmitted through the rock-hard suspension direct to your arse, making driving at any speed above a walking pace feel like a trip on a malfunctioning rollercoaster. No sense having that kind of speed if you can't use it because the car won't stay on the road when you hit bumps.

    Maybe I'm just biased, because I've only really driven big old Citroens with the hydraulic suspension for the past five years. You can run over speed bumps at 40mph without even feeling a ripple, and they outhandle *anything* the same size. I do like the old E30 BMWs, though - I nearly bought an '89 320 Touring from a friend of mine, but was swayed by an '88 Citroen CX 22TRS at the last minute.

    1. Re:Fast isn't everything. by strictfoo · · Score: 1

      The ride quality is abysmal. The least little bump or ripple in the road is transmitted through the rock-hard suspension direct to your arse, making driving at any speed above a walking pace feel like a trip on a malfunctioning rollercoaster. No sense having that kind of speed if you can't use it because the car won't stay on the road when you hit bumps.

      Maybe this is a Euro/US thing. In Europe there are inexpensive BMWs, right? Or were you riding in a newer 323 or something? Maybe the quality has dropped off on the lower end models.

      I've done over 110mph (~180kmh (if my math is right)) in the 540i and it's been pretty damn smooth. Not quite as nice as the 3000GT I owned, but still pretty nice.

      --
      I've just signed legislation that'll outlaw Russia forever. We'll begin bombing in five minutes.
    2. Re:Fast isn't everything. by Gordonjcp · · Score: 1

      No, I've driven 5 and 7 series BMWs and they are the same. Put it this way, Rolls-Royce use a cheap knockoff of the Citroen hydraulic suspension system on their cars.

  56. SearchEnterpriseLinux by scoobrs · · Score: 2, Informative

    I noticed that this article by the PenguinComputing CTO appears to answer the article by the Cray CTO and contradicts it. All I want to know is this: how much did PenguinComputing and Cray spend on advertising banners on SearchEnterpriseLinux to have these articles made? Let's let vapor settle before it gets to our heads.

    --
    -Those who would give up essential liberty to purchase temporary safety deserve neither. -Ben Franklin
  57. That's a joke right? by Anonymous Coward · · Score: 0

    Please tell me this is an attempt at humour :

    Beowulf clusters......are passé now that most universities and research institutions have some sort of COTS-based high-performance computing solutions?

    COTS as in "Common off the shelf"? As in a load of COTS systems strung together to produce a high performance system? What's another name for that?

    I'll give you a clue - it starts with a "B" and rhymes with Airwolf.

  58. You forgot one! by Anonymous Coward · · Score: 0

    In Soviet Russia, cliche forgets you!