Slashdot Mirror


Where to Spend $1M on a Cluster?

Natchswing asks: "My university has been given a $757,825 NSF grant to build, 'A 256 node (128 pair) Beowulf parallel computing cluster ... to improve the realism of gravity-wave modeling by permitting treatment of the three dimensional problem and multiple wave interactions.' They want to pay a company to just show up and drop off a functional cluster rather than build it themselves. Since word has leaked out regarding the purchase intent, every computer manufacturer under the sun (including Apollo himself) has called up trying to sell their cluster. Since I'm no cluster expert, I'm writing Slashdot. If you had $0.7 mil to buy a pre-built cluster who would you go with and why?"

5 of 104 comments (clear)

  1. Penguin Computing by retostamm · · Score: 3, Informative

    Penguin Computing does this kind of stuff for a living. I think they are an all open source shop, too... There may be others, too.

    1. Re:Penguin Computing by DA-MAN · · Score: 3, Informative

      Penguin Computing does this kind of stuff for a living. I think they are an all open source shop, too... There may be others, too.

      As a Systems Engineer who has worked with a number of vendors, I would say that Penguin is the bottom of the barrel in service and quality control.

      We have five clusters at our facility, the slowest of which is on the top500 in the 150 range. We've tried big and small vendors.

      Penguin is the absolute worst. No two scsi hard disks had the same firmware version, the raid controller was DOA, etc. We buy/borrow a node from each vendor and evaluate them before buying clusters, and out of all the vendors the Penguin is the one that would crash or hang all the time. After months of trying, they were never able to get this going properly. Regardless of the fact that we shipped it back twice and were told each time that we'd get back a whole new machine (it wasn't).

      I would personally recommend Appro, IBM or Western Scientific in that order. Service and quality hardware are their game.

      --
      Can I get an eye poke?
      Dog House Forum
  2. Re:Do it with an apple by coldcup · · Score: 5, Informative

    A cluster of storage? Perhaps you mean the Xserve itself.

    They even have a page on clusters.

  3. Microway by brsmith4 · · Score: 5, Informative

    I run a 48 Node Microway beowulf and I must say that it is the most stable system available. Everything came assembled and ready to go (of course, I built the enclosure and did the networking, but they will do that for you if you'd like). If you're not very knowledgeable about beowulfs, how do you know you'll need so much power? Do you know how well the software you will be using will scale? Is it close to embarassingly parallel or does it lose efficiency over X nuber of nodes? What type of resources and consumption does the program use? Is it extremely processor hungry, or does it deal with dense matrices and require low-memory latency and high bandwidth or both? Do you know if you will need the power of Myranet or will you be able to get by on GigE?

    These are important questions you must ask your researchers and yourself before you purchase this cluster. But, to answer your question, I believe Microway is the best choice and I plan on having them build our next cluster in the next fiscal year.

    -brian

  4. Not Angstrom by Anonymous Coward · · Score: 4, Informative

    I currently maintain some Opteron based Angstrom Microsystems Linux clusters. We've had them for less than a year, and already 30% of our nodes have had to be replaced. Support has been a nightmare.

    Sadly, I was not around when the proposal was made, otherwise I would have rejected this cluster outright. There is no way to hook external storage up to this beast. There is no USB, Firewire, SCSI, external SATA, or fibre channel options. You can't even run an ATA cable out of the thing without drilling holes into the blade walls.

    Personally? I'm looking at an XServe or an IBM Bladecenter.. but maybe it's just because I'd like some real support.