Slashdot Mirror


Flattening Out The Linux Cluster Learning Curve

editingwhiz writes "IT Manager's Journal has a good look at a forthcoming book, The Linux Enterprise Cluster, that explains in clear, concise language how to build a Linux enterprise cluster using open source tools. Writer Elizabeth Ferranini interviews author Karl Kopper in a Q&A. Is this more complicated than it appears to be? (IT Manager's Journal is part of OSTG.)"

12 of 89 comments (clear)

  1. OSTG? by ricotest · · Score: 4, Informative

    I must have missed this, and for anyone else who didn't know, OSTG is the new name for the Open Source Development Network (OSDN) Slashdot is a part of. They're now called the Open Source Technology Group.

  2. Re:This is the kind of book we need... by Anonymous Coward · · Score: 2, Informative

    Umm dude.. Enterprise cluster != beowulf cluster

  3. Very nice by a_hofmann · · Score: 4, Informative

    Installing and administering the various open source tools can be tedious work, especially without documentation of how to put things together.

    A quick Google search though reveals a lot of free papers and manuals on this very topic.

  4. Unless i read this incorrectly. by thegoogler · · Score: 2, Informative

    And he built a ten node cluster OF ten node clusters, then this is lame. and he is under-qualified to do the book(most likely) as most ACTUAL enterprise clusters are at least 20 nodes, possibly more if its clusters of blade servers.

  5. Editor needed by Bleeblah · · Score: 1, Informative

    Who edited that article before it went live? It is a mess!

  6. Mandrake CLIC by bolind · · Score: 4, Informative

    I will start by admitting that I am just a dumb university student talking out my ass. I have never set up an enterprise scale cluster.

    However, last january we set up a small (six node) cluster with the help of CLIC. Once we realized the link between a Mandrake and consective dead CD drives, we installed the cluster in little time.

    CLIC might focus a little too much on userfriendlyness and a little too little on flexibility, but for our purposes it was great. It sports ganglia, gexec, distcc and MPI (and probably more), and administration and deployment of nodes is a breeze.

    I heartily recommend CLIC for student/test/proof-of-concept projects.

  7. Re:VMS clusters by hachete · · Score: 2, Informative

    This seems fairly active:

    http://gnv.sourceforge.net/

    includes a port of bash to VMS. Not sure how good it is.

    Having used and programmed DCL, it's not that bad.

    h

    --
    Patriotism is a virtue of the vicious
  8. Re:Beowulf Newbie Question by photon317 · · Score: 3, Informative


    Inevitably high-performance clusters require software designed to run on high-performance clusters. It is better not to think of such a cluster as a single system, but rather as a network of individual machines with a tight network connection. Some of the clustering add-ons for linux approach and even achieve certain aspects of a "Single system image" type of configuration, but it's never completely like a single system.

    Back in 1997 or so I tried to get as close as I could to a true Single System Image by building off of the beowulf patchsets combined with patches for Distributed SysV IPC/SHM and a globally-shared root filesystem using CNFS (cluster-nfs, so that a few essential configfiles can have unique copies per cluster node). It was very daunting work to get those patches integrated together, and the end result was that without some kind of network-interconnect that was as high-speed and low-latency as a processor's FSB, there was always going to be a big performance hit doing things this way. Of course if an application happens to be perfect for simple HPC clusters (all cpu intensive, very little I/O, and the work is easily divisible without tons of IPC between the workers), then it runs fantastically on such a Single System Image cluster, but then again it would have run fantastically on a simple cluster that doesn't look like a Single System too. So what the Single System concept bought me really was a nice abstraction layer that made everything easy to deploy, configure and manage. But it came at a severe initial cost of human labour. It's not worth the trouble.

    --
    11*43+456^2
  9. Re:Beowulf Newbie Question by Junta · · Score: 2, Informative

    An openmosix cluster would behave more along the lines of what you are thinking, but ultimately for HPC applications at scale it is generally more efficient to not do openmosix and write the programs explicitly for parallelism mindful of the layout of processing elements (i.e. network topology or balance between SMP connected processing elements and network connections between nodes).

    --
    XML is like violence. If it doesn't solve the problem, use more.
  10. Please stop misusing the term 'learning curve' by double_h · · Score: 4, Informative

    A flat learning curve is a bad thing.

    The term "learning curve" was invented by the aerospace industry in the 1930s as a way to quantify improved efficiency from mass production (basically, the more you do a task, the easier it becomes). The term was later adopted by psychology and the social sciences, where most people first encounter it.

    In both cases, the horizontal axis of a learning curve represents time or effort, and the vertical axis represents amount learned or productivity. Therefore something that is intuitively obvious in fact has a steep learning curve.

    "Learning curve" was a technical term with a specific definition for decades before it was ever a (misused) marketing buzzword.

    Thank you for your time :)

    1. Re:Please stop misusing the term 'learning curve' by Anonymous Coward · · Score: 1, Informative

      Wrong - the American psychologist E.L. Thorndike was using learning curves in 1911 or before

  11. Re:little advice by dougnaka · · Score: 2, Informative
    The first part, act as one big SMP machine is what clustering does.

    The second part with shell acounts and home directories are all problems already solved by NIS/NFS. You could setup a pool of machines that all share the same NIS/NFS info so anywhere the user logged in they'd have the same files/passwords, and load balance it via ipvs or dns.

    AFAIK the current state of clustering works well for custom code situations, where you write your app to run on the cluster, but doesn't transparently make your 4 boxes act like 1 box with 4X the resources for just any program.

    I've used distcc with some luck on gentoo, but it only distributes compiling over your nodes.

    --
    My Linux Command of the Day site : LCOD