Slashdot Mirror


Linux Clustering Hardware?

Kanagawa asks: "The last few years have seen a slew of new Linux clustering and blade-server hardware solutions; they're being offered by the likes of HP, IBM, and smaller companies like Penguin Computing. We've been using the HP gear for awhile with mixed results and have decided to re-evaluate other solutions. We can't help but notice that the Google gear in our co-lo appears to be off-the-shelf motherboards screwed to aluminum shelves. So, it's making us curious. What have Slashdot's famed readers found to be reliable and cost effective for clustering? Do you prefer blade server forms, white-box rack mount units, or high-end multi-CPU servers? And, most importantly, what do you look for when making a choice?"

201 comments

  1. Depends by Anonymous Coward · · Score: 3, Insightful
    Depends on

    (a) Your cost budget

    (b) Your work requirement: A Search engine is different from a weather forecast center.

    (c) Cost of ownership which includes maintenance etc

    1. Re:Depends by Anonymous Coward · · Score: 0

      That about sums it up.

      Throw in Cost of Electricity in cost of ownership, and it becomes a math problem.

      If they know if they need high cpu demands, memory demands or disk demands, they can design their system around those requirements.

    2. Re:Depends by RogerWiclo · · Score: 2, Insightful

      Congratulations! You get the honor the first post AND mod points for insightful, and you didn't have any useful information.

    3. Re:Depends by Donny+Smith · · Score: 1

      > Congratulations! You get the honor the first post AND mod points for insightful, and you didn't have any useful information.

      Congratulations!
      You get the honor the second post AND mod points for insightful, and you didn't have any useful information.

      The grandparent was correct (and insightful, for the smart reader).
      There's no correct answer to a generic question like that.

  2. Gates Will Not Be Happy by Master+of+Transhuman · · Score: 1, Offtopic


    when all his three-CPU XBoxes become Linux clusters!

    Bwahahahahaha!!!

    --
    Richard Steven Hack - This sig is TOO GODDAMN SHORT TO DO ANYTHING USEFUL WITH! MORONS!
    1. Re:Gates Will Not Be Happy by FuturePastNow · · Score: 1

      Actually, he probably will. Wouldn't you want your console to be known as a supercomputer? It's all publicity. And a guranteed free ad on Slashdot.

      --
      Give a man fire, and you warm him for the night. Set a man on fire, and you warm him for the rest of his life.
    2. Re:Gates Will Not Be Happy by Anonymous Coward · · Score: 0

      Not if its serving porn..

    3. Re:Gates Will Not Be Happy by rpozz · · Score: 1

      Not if they're making a loss on it (which I imagine they will be). If the XBox 360 does get used by a few companies as a Linux cluster - which isn't out of the question, then it'll be at MS's expense.

      And how was the grandparent post 'Offtopic'?

    4. Re:Gates Will Not Be Happy by Anonymous Coward · · Score: 0

      Can you say propritary CPU - that means no published information on its internal workings no publicly available systems level compiler. This may well be the first computer that will have no Linux for it. That may well be Bills reason for designing this xbox around their own CPUs. Do not be too supprised if in the next year or so MicroSoft starts building its own computers on this or simular propritary CPU as Mr. Bill is all for as much lockin as he can get.

    5. Re:Gates Will Not Be Happy by Charles+Jo · · Score: 1

      I believe that Gates will be happy if hackers repurposed Xbox360s for clustering for the following reasons: 1. Increase in unit sales for whatever reason will appear like Xbox360 is doing much better than otherwise. 2. MS can observe what all the interesting stuff that the FOSS community does with Xbox360 and then copy and replace the FOSS code with MS code, package it with expensive colorful labels and sell it. 3. As a previous poster noted, it's free advertisement for MS. CharlesJo360 http://www.charlesjo.com/newsletterissue?newslette rIssueEntityId=304

    6. Re:Gates Will Not Be Happy by Anonymous Coward · · Score: 0

      Can you say IBM PowerPC processor?

      Read up on http://en.wikipedia.org/wiki/PowerPC

    7. Re:Gates Will Not Be Happy by rpozz · · Score: 1

      While you make some very good points there, I'd imagine that the idea of Linux (the most likely OS for the job) running on it would be quite embarrassing for them. I would doubt that it would be doing anything different to normal clustering software too - it would merely be a few changes in the back-end to make it 'appear' as a normal computer to applications.

    8. Re:Gates Will Not Be Happy by Charles+Jo · · Score: 1

      Embarrassing? I think MS has evolved very thick skin on that front by now :) And I actually like some MS stuff.

      CharlesJo.com | Sarcastech
      OS Wars, Episode X.4
      http://www.charlesjo.com/newsletterissue?newslette rIssueEntityId=285

    9. Re:Gates Will Not Be Happy by Master+of+Transhuman · · Score: 1


      Excuse me, but how are they going to copy Linux code running on it, when Linux cluster code has been around for years?

      If they could do, they would have done it.

      And the increase in unit sales wouldn't be enough to pay for Gate's shoe shines. How many Linux buffs running clusters do you think there are?

      As for free advertisement, it's far MORE a free advertisement for how Linux can run on anything and turn it into something useful.

      --
      Richard Steven Hack - This sig is TOO GODDAMN SHORT TO DO ANYTHING USEFUL WITH! MORONS!
    10. Re:Gates Will Not Be Happy by Charles+Jo · · Score: 1

      Good questions. MS can copy Linux or any FOSS because the beauty of FOSS is that the source code is open for everyone,including MS, to study. They just need to study it and build a workalike. If there anyone in the world who is good at making near-workalikes, it's MS.

      Cluster usage: Slashdot crowd for starters. I'm sure that many more of us would have clusters in home offices if it meant going to Fry's to buy a 6 pack of Xboxes and download a simple program that would cluster these.

      Free ads for Linux: that goes without saying. Someone will install Linux on anything -- it's just a matter of time.

      CharlesJo.com
      Slashdotter

    11. Re:Gates Will Not Be Happy by Anonymous Coward · · Score: 0

      unit sales may increase sure...but thats the problem. microsoft (and sony) consoles are cheap because they rely on getting the money made up through the sale of software. although the number of clusters made will most likely not cause a big deal compared with the mass sales for the gaming community, it still means they lose money.

  3. Personally by EpsCylonB · · Score: 0

    I always choose the one with go faster stripes on the side.

    1. Re:Personally by cranos · · Score: 1

      Don't forget the fluffy dice and you might want to add some hydraulics to the case lid, so that it can bounce up and down when ever a "Hot Babe tm" walks past.

    2. Re:Personally by 01000011011101000111 · · Score: 1

      I prefer to paint my supercomputer's red... because as everyone knows, Red onez go fasta!

      --
      Programming is an Art. I am an Artist. Does that mean I get to wear a daft hat?
    3. Re:Personally by Anonymous Coward · · Score: 0
      I always choose the one with go faster stripes on the side.

      You joke, but Akamai, before a round of VC funding did apparently get custom plastic with blue LEDs to make their system look extra special to the VCs.

    4. Re:Personally by Too+many+errors,+bai · · Score: 1

      Holy hell, this was modded Informative?

  4. Dual Opteron 1U rack units.... by Fallen+Kell · · Score: 5, Interesting

    For the size and performance, they are hard to beat. A dual opteron setup in a 1U rack case is a very powerful setup in and of itself. The bonus of using off the shelf components with no need for proprietary hardware or software also make them very affordable. The added bonus is that you can simply get the parts from regular retailers for replacement.

    --
    We were all warned a long time ago that MS products sucked, remember the Magic 8 Ball said, "Outlook not so good"
    1. Re:Dual Opteron 1U rack units.... by FuturePastNow · · Score: 4, Informative

      It's no dual Opteron server, but this Dan's Data article reviews what are probably the cheapest 1U servers you can buy. Definitely something to consider of you're going for cheap.

      --
      Give a man fire, and you warm him for the night. Set a man on fire, and you warm him for the rest of his life.
    2. Re:Dual Opteron 1U rack units.... by imsabbel · · Score: 2, Informative

      Not to mention that if you need more power, you can drop in the new dual core opterons without need for anything but eventually a bios upgrade.
      (not to mention that the dual core opterons acutally consume less power than some of the early steppings of the single core ones)

      --
      HI O WISE PRINCE. WHT TOOK U SO DAM LONG?
    3. Re:Dual Opteron 1U rack units.... by Anonymous Coward · · Score: 0

      Price/performance I'll give you. But size/performance, no way.

      IBM has some blades that can fit 14 dual opteron servers with a gig ethernet switch, redundant power, kvm in a 7U footprint.

      A little pricy, but for easily deploying and managing large numbers of servers, it is hard to beat.

    4. Re:Dual Opteron 1U rack units.... by algae · · Score: 2, Informative

      Agreed, that's what my company is using. As an added bonus, their reliability is amazing as long as you go with tier 1 hardware manufacturers. We discovered after I'd installed lm_sensors on our cluster that one of the machines had a dead heatsink-fan and dead exhaust fans, and yet the CPU was only at 77 degrees C and was still running under full load. I'd like to see a Xeon do that.

      --
      Causation can cause correlation
    5. Re:Dual Opteron 1U rack units.... by Glonoinha · · Score: 2, Informative

      That's a cool looking rig, but it lists at about $850 (Australian, like $700 US) with no CPU, RAM, or drives.

      Here is a Dell PowerEdge 750 1U server with a P4 2.8GHz HyperThreaded, 256M and an 80G SATA (with room to add another drive and up to 4G of memory) for $499 shipped to your door. Yea, I'm a Dell fanboy, but even if I wasn't I would still see a pretty good price point in that box.

      Note that this specific box is pretty low end and could use some upgrades, but it is a complete machine ready to run, esp if you want to go on the cheap.

      --
      Glonoinha the MebiByte Slayer
    6. Re:Dual Opteron 1U rack units.... by Anonymous Coward · · Score: 0

      You can buy a Dell PE1850for $2500. Dual 3 GHz Xeon, 1 GB RAM, 2 73GB 10k RPM SCSI drives, RAID controller, CDROM, floppy and 3 year warranty. Why would you buy a barebones and the necessary parts for $3000.

    7. Re:Dual Opteron 1U rack units.... by FuturePastNow · · Score: 1

      The real US price is more like $460, at which the Dell is still a better deal. But that doesn't change the fact that it's a Dell ;)

      --
      Give a man fire, and you warm him for the night. Set a man on fire, and you warm him for the rest of his life.
    8. Re:Dual Opteron 1U rack units.... by Frumious+Wombat · · Score: 1

      I built my penultimate cluster from IBM 1U opteron systems, using Rocks (http://www.rocksclusters.org/ as the system software. That was a decent setup from a price-performance standpoint, ran well, and supported our legacy 32-bit linux apps rather well. Currently I'm using Apple XServe G5s, mostly because of the availability of XLF and the low power-draw. I also don't have the legacy binary issues in the new lab that I had when supporting a heterogeneous group.

      Having started with white-boxes on bread-racks, passed through IA-64 and SP2, and ended up with racked 1U, I would say that what I care about is reliability and support first, speed second, and convenient software environment third. Google can afford to have a system fail, due to their distributed structure and design for failover, but tightly-coupled scientific codes tend not to tolerate this as well.

      Therefore, my recommendation would be for scientific clustering, (2-way proc, 1U/2U rack enclosure from IBM, HP, Apple), G5/Opteron > Xeon. If I had the money, then swap G5/Opteron for Power4/IA-64(HP). Rack mounting saves more than just space; it's also easier to perform maintenance on than the bread-racked desktops had been.

      Note, my opinion only applies to my environment; high-performance/high-throughput computing. High availability clusters are another beast entirely.

      --
      the more accurate the calculations became, the more the concepts tended to vanish into thin air. R. S. Mulliken
    9. Re:Dual Opteron 1U rack units.... by Anonymous Coward · · Score: 0

      We have a large Duel opteron cluster 200+ and a large Xserve G5 cluster 100+ the apples can be cheaper the the right price point, but i would look at sun's new duel opteron. Its cheap and has LOM which the xserve does not have.
      The Xserves are fast (2 Ghz) the opterons not so fast (1.4Ghz first generation) I have been less than impressed with the xserve powerfull yes, but apples server tools work good for one mabe up to 10 xserves but not 100+ IE no video card only Seriel if the network is borked so you never know what it is doing. And the WORST the server setup asistant thing it whacks your ssh keys!!! bamn gone WTF!!

    10. Re:Dual Opteron 1U rack units.... by wwwillem · · Score: 1

      Here is a Dell PowerEdge 750 1U server with a P4 2.8GHz HyperThreaded, 256M and an 80G SATA [...] for $499 shipped to your door.

      Sounds nice, but to upgrade that 2.8 GHz even to 3.0 Ghz, Dell rips you off another $299, and so on. A 60% price increase for a 7% CPU speed increase sounds similar to what airlines are doing with their pricing. So, not a company I would trust with my business.

      --
      Browsers shouldn't have a back button!! It's all about going forward...
    11. Re:Dual Opteron 1U rack units.... by Glonoinha · · Score: 1

      Actually the 'real' price to jump from a 2.8 to a 3.0 is $100 - they are just having a promotion where you can get the upgrade from the baseline chip (a Celeron) to the 2.8 for free - Intel is blowing out the 2.8GHz P4's so they are taking advantage of price breaks to clear the channel of remaining stock.

      Getting the 3.0 is an option, but it's a dumb option - perhaps intended to influence the purchase decisions towards the 2.8, again to clear the remaining stock. I was comparing a specific box and configuration to the GP's post with a white box 1U barebones that had no CPU, memory, or drives and cost 50% more.

      Getting Tier I hardware at better than commodity pricing - sure Dell tech support sucks (compared to what it used to be like) but I buy the hardware from the perspective of someone that builds his own - buy the hardware like white-box and assume you will need to support it yourself. If you can get it as cheap (or cheaper, as we saw) than white-box, that's your justification.

      Looking back at my history, let me clarify my statement : I'm a fanboy of Dell hardware - not the company. I agree with you, I don't like their business ethics, morals, vision or leaders, nor would I trust them with my business - I trust me with my business though, and I trust the hardware they build (mainly because of my experiences with their hardware.) As a company they suck, but not as bad as Carly's new HP (yea, I know she's gone, but you know what I mean.)

      --
      Glonoinha the MebiByte Slayer
    12. Re:Dual Opteron 1U rack units.... by codeguy007 · · Score: 1

      Dual Core Dual Opterons can be had for the same or slightly less per cpu core. A fair bit less if you consider you can cut the number of interconnects needed in half. The interconnect gets expensive once you pass 48 nodes on a cluster.

  5. Re:XServe by Anonymous Coward · · Score: 0

    MacOS X Server is not UNIX, it's based on a UNIX-like OS.

  6. Ammasso by LordNimon · · Score: 3, Interesting

    Ammasso is a startup that makes iWarp-based RDMA hardware that runs over gigabit ethernet. Their technology is like Infiband, but much cheaper and almost as fast. Their drivers and libraries also provide MPI and DAPL support. The only support Linux (all 2.4 and 2.6 kernels) and they're way ahead of their competition in terms of performance, product availability, and support. Once you've decided on the servers, I strongly recommend you use Ammasso's hardware for the interconnects. Your hardware vendor may even bundle it with their systems - be sure to ask about that.

    --
    And the men who hold high places must be the ones who start
    To mold a new reality... closer to the heart
    1. Re:Ammasso by LordNimon · · Score: 1

      Oops - the URL for Ammasso is http://www.ammasso.com/.

      --
      And the men who hold high places must be the ones who start
      To mold a new reality... closer to the heart
    2. Re:Ammasso by Anonymous Coward · · Score: 0

      Ammasso's hardware IMHO is not yet ready for prime-time roll out of a large cluster from a maintenance perspective. In any case if you are willing to spend 400$ on a GigE Ammasso card why wouldn't you consider Infiniband.

      Again, the most important point is to consider what is your application before you rollout a Linux Cluster. Main Point to consider:

      * Requirement of HA
      * Fault Tolerance
      * Does your application require a large amount of IPC communication or is it an embarassingly parallel application
      * Ability to support a large cluster. If you are google and can support open motherboards and set up the labs to do that more power to you. But most organizations can't do that. If you are anyone else buying a blade server or a 1U rackmounted chasis from a major vendor will reduce your longterm TCO. Google doesn't use MPI and have custom applications which have FT/HA features due to which they are able to support low cost devices and deal with failure. Most MPI apps can't deal with failure. Hardware with redundancy is usually a good idea.

    3. Re:Ammasso by convolvatron · · Score: 1

      how is this iWarp-based? the iwarp did register-to-register 'systolic' style communication, not async bulk transfer.

    4. Re:Ammasso by LordNimon · · Score: 1

      Looks like there are two definitions of iWarp, and I linked to the wrong one. Sorry.

      --
      And the men who hold high places must be the ones who start
      To mold a new reality... closer to the heart
    5. Re:Ammasso by jabuzz · · Score: 1

      Is it rubbish nearly as fast. The top speed of Infiband is 30 times faster than gigabit ethernet, and the lowest speed is 2.5 times faster. By the time you have paid for the Ammasso cards you might as well have gone with one of the other higher speed low latency interconnects.

    6. Re:Ammasso by Anonymous Coward · · Score: 0

      I deployed 28 nodes of Ammasso.

      It installed quickly and it's giving me a 30-40% application boost over GbE.

      I haven't hit any maintenance issues whatsoever... If you want true RDMA failover they told me they have a dual port card coming out by next year.

      All in all I've been very happy with these cards, not dealing with IB cables and issues is enough of a save. These things are just straight Ethernet and TCP/IP, but on steroids. Run over standard GbE switches too...

      I paid mid-$400's per node but at 30% performance increases I'm saving $$ on node count and saving cash over all.

  7. Check out Xserve by Twid · · Score: 5, Informative

    At Apple we sell the Xserve Cluster node which has been used for clusters as large as the 1,566 node COLSA cluster. We also sell it in small turn-key configurations.

    Probably the most interesting news lately for OS X for HPC is the inclusion of Xgrid with Tiger. Xgrid is a low-end job manager that comes built-in to Tiger Client. Tiger Server can then control up to 128 nodes in a folding@home job management style. I've seen a lot of interest from customers in using this instead of tools like Sun Grid Engine for small clusters.

    You can find some good technical info on running clustered code on OS X here.

    The advantage of the Xserve is that it is cooler and uses less power than either Itanium or Xeon, and it's usually better than Opteron depending on the system. In my experience almost all C or Fortran code runs fine on OS X straight over from Linux with minimal tweaking. The disadvantage is that you only have one choice: a dual-CPU 1U box - no blades, no 8-CPU boxes, just the one server model. So if your clustered app needs lots of CPU power it might not be a good fit. For most sci-tech apps, though, it works fine.

    If you're against OSX but still like the Xserve, Yellow Dog makes an HPC-specific Linux distro for the Xserve.

    --
    - "When you want something with all your heart, the entire universe conspires to give it to you" -Paulo Coelho
    1. Re:Check out Xserve by Anonymous Coward · · Score: 0, Interesting

      Try getting parts for XServes after a couple of years. Want parts for a XServe G4 from yesteryear? Chances are you are gonna be fucked. We couldnt even source a drive caddy for one. Buy a proper server thats gonna be supported for longer than its trendyness.

    2. Re:Check out Xserve by Twid · · Score: 2, Informative

      That's simply not true. We stock parts for all Apple equipment for at least seven years. The drive caddys for the old Xserve G4 are exactly the same as what we ship in the current Xserve RAID, so not only is the drive caddy still available, it's also still in use.

      Actually, the cheapest way to go would be to buy the 250GB ADM, apple part number M9356G/A, and pull out the 250GB drive and use it somewhere else (or just use that drive). Service parts tend to be pricey (like everyone else).

      --
      - "When you want something with all your heart, the entire universe conspires to give it to you" -Paulo Coelho
    3. Re:Check out Xserve by Anonymous Coward · · Score: 0

      Wow, great service Apple!! You should be real freakin' proud. If Dell was doing this you'd be crying bloody murder.

    4. Re:Check out Xserve by IANAAC · · Score: 0, Offtopic
      Good information, however, this statement:

      We also sell it in small turn-key configurations.

      only makes it look like an ad.

      Granted, I don't begrudge you for trying to drum up Apple business in a Linux thread, but - and I supsect I'm not alone in thinking this - you're probably driving away some potential business using the aformentioned tactic.

    5. Re:Check out Xserve by lakeland · · Score: 2, Insightful

      *shrug* he's being honest about working for apple. I'd rather have that than him talking about it as if he was just a knowledgeable third party.

    6. Re:Check out Xserve by Che+Guevarra · · Score: 1

      Oh my, cough. All those hard hours of surfing Slashdot in search of a way to plug company hardware. First I read a post by a Dell guy and now a post by an Apple guy. Boy, it's going to be a big night in SF! - I hope I'm not being an a**, but it is worth commenting on.

    7. Re:Check out Xserve by Che+Guevarra · · Score: 1

      Lakeland is absolutely right. This guy was straight up and informative. He deserves his mods. What if Slashdot were always like that?????

    8. Re:Check out Xserve by Anonymous Coward · · Score: 0

      Except maybe in XGrid 1.14 that will change to 48 nodes.

    9. Re:Check out Xserve by Twid · · Score: 1

      Well, the Workgroup Cluster is really turn-key. It comes with everything, ethernet cables, ethernet switch, a small rolling rack, etc... and includes the Bioteam iNquiry software on disk images, so you just load the images and go.

      So, fair enough with the marketing comment. I do technical pre-sales for Apple, so I've really touched and used this stuff. If you do have any techie questions, fire away. :)

      --
      - "When you want something with all your heart, the entire universe conspires to give it to you" -Paulo Coelho
    10. Re:Check out Xserve by Twid · · Score: 1

      What the heck are you talking about?

      --
      - "When you want something with all your heart, the entire universe conspires to give it to you" -Paulo Coelho
    11. Re:Check out Xserve by Twid · · Score: 2, Informative

      Well, you can think what you want, but I'm not a paid Apple shill surfing Slashdot for posting opportunities. I saw the story go up, and since I do cluster pre-sales at work, I thought I had some topical comments. I even mentioned running Linux on the Xserves. :)

      --
      - "When you want something with all your heart, the entire universe conspires to give it to you" -Paulo Coelho
    12. Re:Check out Xserve by biglig2 · · Score: 1

      I agree, I remember the day when you could depend on Slashdot to give you nothing but uninformed opinion.

      Next thing you'll see some post saying "I am a lawyer, actually."

      --
      ~~~~~ BigLig2? You mean there's another one of me?
    13. Re:Check out Xserve by WebCrapper · · Score: 1

      Technically, this has already happened: Google

  8. Dual Core Opteron Blades by municio · · Score: 5, Insightful

    At the current time I would choose blades based on dual core Opterons form many reasons. Some of the main ones are:

    - Price
    - Software availability
    - Power consumption
    - Density

    Brand depends on what your company is confortable with. Some companies would want to have the backing of IBM, SUN or HP. Others will be quite satisfied with in house built blades. This days it's quite easy to build your own blade, some mother boards builders take care of almost all components and complexity (for example Tyan). But again, maybe the PHBs at your gig will run for the hills if you mention the word motherboard alone.

    1. Re:Dual Core Opteron Blades by PornMaster · · Score: 1

      Blades rarely make sense. Most blades available are not interconnected in any meaningful way, so you're really looking at independent systems, just of a small form factor. Why use blades? Well, they're space efficient. Great, unless you're actually using a lot of them, and don't have too much space. Because if you don't have too much space, you probably don't have too much cooling. If you don't have too much cooling, then you've totally killed your argument for blades based upon space. A rack full of HP BL40p blades puts out something on the order of 53k BTU. That's one rack. If you're hosting your stuff in a colo, then if the data center doesn't give a shit that you're pumping out that much heat from one rack, then they're not concerned about their other customers and the "commitment" they made to them. No data centers I know of are built to cool that much heat density... and if someone has the money to build space that can be cooled to that extent, then they can just make the damn room bigger and spread out the heat. Blades rarely make any sense.

    2. Re:Dual Core Opteron Blades by Anonymous Coward · · Score: 0

      The motherboards for dual core Opterons don't work yet, despite the vendors claims. The BIOS's aren't stable, and they're still trying to get the bugs out of their teeth from crossing the finish line first with no clothes on to reduce the weight on their vehicles.

      The results for numerous vendors of the crash and burn as their unprotected naked bodies landed on the pavement because they also left the brakes off their motorcycles to save weight is left to the reader's imagination. But the resulting body bits are also scattered all over the poor choices and integrations of components on the first round of dual-core motherboards. Then have some twit stuff it with 16 Gig of RAM sold them by their cousin Guido on a street corner to save a few bucks, and watch the systems crash.

      Technologies that new are basically using us consumers as beta testers: don't bother with dual core for at least six more months if you want your cluster to be stable.

    3. Re:Dual Core Opteron Blades by jcampbelly · · Score: 1

      Cost-effectiveness is often the biggest issue for a lot of companies who need the power but can't gain that TCO edge from buying in massive bulk. Many can't afford the convenient and complete offerings of an IBM/Intel or HP blade whose components alone can cost upwards of $10,000 with 5-10x that in software. A more practical solution would be an in-house project, but you're still paying off-the-shelf prices for something you may be buying in bulk (unless you've got some distributor channel to work with). In this case, the white-box manufacturer is what you need. There are cheap blade chassis that can easily be fitted with good hardware. I know of a few that won't get you the density of HP or Intel, but it's better than 1U's for cooling and for space. No matter how many systems you will be building, chances are (if you're not going the ol "massive bulk" route) you won't be buying enough hardware to get a decent price per component. That's where white-box companies come in, working with distributor channels to get profitable pricing competitive with solutions from larger vendors. At this point you're left with the warm-fuzzy of a smaller hardware OEM with kingly support (smaller companies often listen to customers more) and you're left to loose your own geeks on their new hardware. That is, of course, if you've got your own geeks. I'd recommend an OpenMosix Gentoo cluster for simpler job-processing work unless you've already got the software. Dual Opterons are the way to go as you can start out with 1 CPU, then in 3-4 years the same hardware should be able to run 8 cores -- that's real, present, available scalability with a side of cost-effectiveness. I work for a white-box custom integrator like this and we make such 1U systems which can also be put into a cheap blade form-factor. This is no advertisement so I won't give a name, but feel free to e-mail me if you think I've made a point or two.

    4. Re:Dual Core Opteron Blades by Stephen+Samuel · · Score: 1
      Reading the Google article, I figured that they'd like them too, for much the same reasons -- for them, dual core looks far nicer than hyper threading, which is better, in turn than deep pipelines (which are almost useless given the kind of software that google uses in their boxes). Combine that with the lower power consumption (which adds up when multiplied by 15,000 processors), and you've got a nice sales pitch.

      In terms of using boards on open trays -- why not? as long as you don't have airflow problems, it saves you the cost of all that (useless) metal. For me, the main value of a case is minimizing noise and the influx of dust -- neither of which really count in a colo facility.

      --
      Free Software: Like love, it grows best when given away.
    5. Re:Dual Core Opteron Blades by RedX · · Score: 2, Interesting

      Not sure I follow what you mean by "not interconnected in any meaningful way". The current line of HP blades not only offers an onboard Cisco switch that is certainly going to save ports on your main data center switches, but an onboard Brocade SAN switch has been announced as well that is going to save ports on the production SAN switches. Not to mention the cable savings of each option.

    6. Re:Dual Core Opteron Blades by sirsnork · · Score: 1

      What I would think he means is that they are still connected using GbE. Blades give the perfect oppurtunity to use a higher speed interconnect since you already have backplanes that the blades connect to.

      --

      Normal people worry me!
    7. Re:Dual Core Opteron Blades by notque · · Score: 1

      And the regular harddrive failures leave you secure in a job.

      --
      http://use.perl.org
  9. Pictures? by Anonymous Coward · · Score: 1, Interesting

    I would love to see a picture of the google hardware...

    1. Re:Pictures? by Anonymous Coward · · Score: 3, Funny

      Oh yeah, show me a picture of your rack baby!

    2. Re:Pictures? by sathia · · Score: 1

      afaik google doesn't use latest-top-marvellous hardware, but thousands of low level servers.

      look for pingeon + google

      and more i heard that there's a big casualty every day for this cheap hardware.

      --
      one bug, one crash
    3. Re:Pictures? by Anonymous Coward · · Score: 0

      Yeah, I read an article on Google's systems. The use the cheapest harware they can get their hands on, but have sophisticated software that the wrote specifically to distribute the data among all of there clusters. There isn't one byte of Google's data that isn't in at least three different physical locations, and it's mirrored at every location also.

  10. IBM zSeries by Anonymous Coward · · Score: 1, Interesting

    Definitely worth checking out. It's one bad-ass Linux server -- and probably the only one to offer instruction execution integrity. That's a fancy way of saying 2+2 will always equal 4 on zSeries -- because everything is executed twice and compared at the hardware level -- or it won't execute.

    If you need this, you need it bad.

    1. Re:IBM zSeries by Anonymous Coward · · Score: 0

      Yeah right. Square peg, round hole.

    2. Re:IBM zSeries by Wesley+Felter · · Score: 1

      If you need this, you need it bad.

      Conversely, if you don't need it (because you've already decided on a cluster), then you really don't need it.

    3. Re:IBM zSeries by Anonymous Coward · · Score: 0

      Contrary to popular belief, IBM's zSeries platform is *much* slower than that of its Intel or RISC counterparts.

    4. Re:IBM zSeries by tmasssey · · Score: 1
      Z-series and clusters are diametrically opposed to each other. A Z-series processor is not any faster than in Intel processor--maybe even slower. So if you've got a 32-way mainframe (which is definitely on the big side), you've got no more processing power than a 32-way PC.

      So what did you get for your $2 million? Two things: incredible reliability and amazing I/0 bandwidth. You can fully saturate all 32 processors for weeks at a time if you wish, with uptime measured in *years*, 24x7. And that's great if you're a bank doing account transactions. But that's not what you usually use a cluster for.

      Clusters are for CPU processing. For doing lots and lots of calculations over and over. For calculating movie CG effects or modelling the movemt of molecules or nuclear explosions. In that case, the mainframe sucks. It's a very, very expensive 32-way box. Big deal. You would have been better spending $1 million on 2,000 dual-CPU plain boxes and connecting them together with even GigE. You'd have had far better results.

      Use the right tool for the right job. Mainframes for critical I/o-based applications without huge CPU loads, and clusters for highly CPU-bound applications.

    5. Re:IBM zSeries by chez69 · · Score: 1

      very true. but it'll blast them away for IO bound tasks, such as transaction processing.

      --
      PHP is the solution of choice for relaying mysql errors to web users.
  11. Read the Google paper ! by devitto · · Score: 5, Insightful

    In the paper, it goes into tedious detail on the architecture and low-level operation of the application. Why do you think it does this? Because it is the application that *totally* depicts the solution, they chose lots of systems because of reliability, they made those systems "desktop class" because they didn't get much extra from using super-MP/MC systems.

    It's a great article, I strongely suggest you read properly, and do what they said they did - evaluate need against what's available.

    1. Re:Read the Google paper ! by Anonymous Coward · · Score: 0

      Read *what* google paper. There are many. There are even a few discussing hardware architecture.

    2. Re:Read the Google paper ! by devitto · · Score: 1

      The paper mentioned in the article!

      Doesn't anyone read before posting?

  12. well... by croddy · · Score: 5, Informative
    at the moment we have a rack with Dell PowerEdge 1750's. They're very nice for our OpenSSI cluster, with the exception of the disk controller. Despite assurances by Dell that the MegaRAID unit is "linux supported", we're now stuck with what's got to be the worst SCSI RAID controller in the history of computing.

    we're hoping that upgrading to OpenSSI 1.9 (which uses a 2.6 kernel instead of the 2.4 kernel in the current stable release) will show better disk performance... but... yeah.

    1. Re:well... by Anonymous Coward · · Score: 0

      I completely agree with you. We've had ongoing problems with a PERC 4/DI that Dell refuses to fix properly. It randomly drops disks out of the array causing complete data loss sometimes. They've replaced it with a PERC 4/DC, which has worked fine.

      Tech Support: We need to reboot the machine so we can determine if you have a failed hard disk or backplane.
      Me: No. The reason we have RAID in the machine is to maintain uptime, rebooting is a no go.
      Tech Support: Call us back when you can reboot the machine.

      Unfortunately, I don't see any other complaints like this on the Dell support forums. IMO, don't buy Dell if you are expecting a vendor to do what it takes to keep your servers running, and this was after we purchased about $100,000 of hardware.

  13. Dead Popes - sig? by Mynorrrr · · Score: 1

    Yeah but this is the first time you get to watch them decay "live"....

  14. nes beowulf cluster by FLAGGR · · Score: 1, Funny

    I run a 4096 node NES beowulf cluster. Works great!

    1. Re:nes beowulf cluster by FidelCatsro · · Score: 3, Funny

      Is that an SMB(super mario brothers) server

      --
      The only things certain in war are Propaganda and Death. You can never be sure which is which though
    2. Re:nes beowulf cluster by spectre_240sx · · Score: 1

      Damn, you took a good joke and just slaughtered the shit out of it.

    3. Re:nes beowulf cluster by Anonymous Coward · · Score: 0

      Well looks like the public thought it was the good joke after a crap one

    4. Re:nes beowulf cluster by Anonymous Coward · · Score: 0

      It would have been great if he didn't explain it as he was telling it.

  15. Hillbilly Supercomputer OWNZ U by Anonymous Coward · · Score: 0

    http://bhs.broo.k12.wv.us/homepage/staff/seti/stac k.htm

    Some plywood, a power supply powering multiple motherboards.. crunching seti.

    Life is good.

  16. Total Overkill by Anonymous Coward · · Score: 5, Funny
    We can't help but notice that the Google gear in our co-lo appears to be off-the-shelf motherboards screwed to aluminum shelves.

    That would be typical of a prima donna company like Google that's floating in cash from their IPO.

    Around here, we don't waste money on fancy designer metals like aluminum. Salvaged wooden shipping palettes work just fine for us; they're free. And screws!? No need to waste resources on high-end fasteners when you can pick up surplus baling wire for less than a penny per foot. A couple of loops of wire and a few twists are all you need to assemble a working server.

    The dotcom days are over. There's no reason to throw money around like there's no tomorrow.

    1. Re:Total Overkill by Sjobeck · · Score: 0

      Not to mention the fact that we are fiddling while the planet is about to burst in to flames! Seriously, people, wak up, stop entertaining yourselves to death, which is only a figure of speak, when there are actually people who are starvign to death, not a figure of speech. www.one.org www.data.org www.makepovertyhistory.org

  17. Orion Desktop Clusters by Anonymous Coward · · Score: 0

    How about Orion 's Desktop and Deskside clusters? Where else can you get a cluster or 12 or 48-96 CPU's in one machine that you can plug into a normal power socket?

  18. Depends on your CoLo's power & cooling. by Anonymous Coward · · Score: 0

    Note that racks full of blades (and even dual-cpu-1U systems) can easily exceed the power and cooling capacity of many colocation center's racks. I'm not convinced blade systems are cost effective anyway; but just wanted to point out that the answer may depend on where you store the systems.

  19. It depends what you are clustering for... by Imposter_of_myself · · Score: 0

    Are you clustering for performance? Are you clustering for high availability? Are you clustering for fault tolerance? There are differnent types of clusters for different applications. We can't tell you what is "best" for clustering without knowing what you are trying to accomplish.

  20. minitowers!!!! by Anonymous Coward · · Score: 0

    We have a farm of freebsd webservers in small tower cases on shelves. We have 10 of those. All servers were built from very cheap components that we got online. I think that each one is something like $300 - $400. The farm serves dynamic pages via php and apache and also does some image manipulation using netpbm tools. Servers talk to the outside via 2 redundant ballancers running freevrrpd. These have the same hardware as webservers. Our application stores a lot data so I have 2 (for redundancy) 2TB raid arrays served via nfs to the webservers. These are in a rack of course, as well as 2 (for redundancy again) Postgres servers. Rackmount servers are built from off the shelf components again. Works great.

  21. No one size fits all answer but here is mine :) by Anonymous Coward · · Score: 5, Informative

    My .02 cents worth ...

    I build Linux and Apple clusters for biotech, pharma and academic clients. I needed to announce this because clusters designed for lifesci work tend to have different architecture priorities than say clusters used for CFD or weather prediction :) Suffice it to say that bioclusters are rate limited by file I/O issues and are tuned for compute farm style batch computing rather than full on beowulf style parallel processing.

    I've used *many* different platforms to address different requirements, scale out plans and physical/environmental constraints.

    The best whitebox vendor that I have used is Rackable Systems (http://www.rackable.com/ . They truly understand cooling and airflow issues, have great 1U half-depth chassis that let you get near blade density with inexpensive mass market server mainboards and they have great DC power distribution kit for larger deployments.

    For general purpose 1U "pizza box" style rackmounts I tend to use the Sun V20z's when Opterons are called for but IBM and HP both have great dual-Xeon and dual-AMD 1U platforms. For me the Sun Opterons have tended to have the best price/performance numbers from a "big name" vendor.

    Two years ago I was building tons of clusters out of Dell hardware. Now nobody I know is even considering Dell. For me they are no longer on my radar -- their endless pretend games with "considering" AMD based solutions is getting tired and until they start shipping some Opteron based products they not going to be a player of any significant merit.

    The best blade systems I have seen are no longer made -- they were the systems from RLX.

    What you need to understand about blade servers is that the biggest real savings you get with the added price comes from the reduction in administrative burden and ease of operation. The physical form factor and environmental savings are nice but often not as important as the operational/admin/IT savings.

    Because of this, people evaluating blade systems should place a huge priority on the quality of the management, monitoring and provisioning software provided by the blade vendor. This is why RLX blades were better than any other vendor even big players like HP, IBM and Dell.

    That said though, the quality of whitebox blade systems is usually pretty bad -- especially concerning how they handle cooling and airflow. I've seen one bad deployment where the blade rack needed 12 inch ducting brought into the base just to force enough cool air into the rack to keep the mainboards from tripping their emergency temp shutdown probes. If forced to choose a blade solution I'd first grade on the quality of the management software and then on the quality of the vendor. I am very comfortable purchasing 1U rackmounts from whitebox vendors but I'd probably not purchase a blade system from one. Interestingly enough I just got a Penguin blade chasssis installed and will be playing with it next week to see how it does.

    If you don't have a datacenter, special air conditioning or a dedicated IT staff then I highly recommend checking out OrionMultisystems. They sell 12-node desktop and 96-node deskside clusters that ship from the factory fully integrated and best of all they run off a single 110v electrical. They may not win on pure performance when going head to head against dedicated 1U servers but Orion by far wins the prize for "most amount of compute power you can squeeze out of a single electrical outlet..."

    I've written a lot about clustering for bioinformatics and life science. All of my work can be seen online here: http://bioteam.net/dag/ -- apologies for the plug but I figure this is pretty darn on-topic.

    -chris

    1. Re:No one size fits all answer but here is mine :) by dumbfounder · · Score: 1

      You can get a dell sc1425 with 2x2.8ghz xeons and 1GB ram for $1200, but the cheapest you can get a dual opteron sun for is $2444. Do you really think the opterons are more than twice as fast?

    2. Re:No one size fits all answer but here is mine :) by chris.dag · · Score: 3, Informative


      Depends on the specific code to meet your criteria of "twice as fast"...some apps will be more than twice as fast; some will be slightly faster, equal or in some cases slower.

      For more general use cases (at least in my field) I can give a qualified answer of "dual Opteron currently represents the best price/performance ratio for small SMP (2-4 CPUs)".

      I've also seen cheaper pricing from Sun than what you mentioned. You are right though in that there is a price difference between xeon vs opteron - whenever I consider a more expensive alternative I tend to have fresh app benchmark data handy to back up the justification.

      -Chris

    3. Re:No one size fits all answer but here is mine :) by sysadmn · · Score: 1

      In our experience, the number of cpus per rack is limited by data center cooling. Blades are great to look at, and somewhat easier to provision, but having 60 - 80 cpus in 7U chassis isn't any better than having 60 - 80 dual cpu 1U pizza boxes. Our latest (commercial) cluster is 604 Dell boxes - PESC 1425 for the low-end jobs, 1850's for the heavy lifting (12 GB RAM).

      --
      Envy my 5 digit Slashdot User ID!
    4. Re:No one size fits all answer but here is mine :) by Bert64 · · Score: 2, Insightful

      There are cheaper vendors out there than sun. Sun are known for producing premium hardware whereas dell are known for producing bargain bucket hardware... A better comparison would be to look at a vendor who produces both xeon and opteron hardware, such as HP or IBM.

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
    5. Re:No one size fits all answer but here is mine :) by dumbfounder · · Score: 1

      $1995 for the base v20z is the cheapest I have found on Sun with a moderate amount of digging, then add the second cpu for $445. Is there another model that is cheaper?

      I am building a search engine and for my first big server purchase I will be spending about $60k. Right now I am vacillating between 40x Dell sc1425's with 2x2.8ghz Xeons(64bit), 1gb, and 2x160gb ($1557)or 30x Penguin Computing Altus 1300 with 2x244 Opterons, 1gb, and 2x120gb ($2035).

      I am assuming you would lean toward the Penguin side, but since I/O tends to be a major bottleneck with search engines I am leaning towards the Dell's. Also I can get a decent lease through Dell. But I do LOVE Opterons, I have a few already and several Athlon 64's that I run my site off of (dumbfind.com). I don't have any of the 64 bit Xeon's, but I sent a simple custom benchmark app to a friend that has access to some and I was pretty impressed with the results. The 32bit Xeon's pale in comparison to my AMD chips, but comparing all of the results on all the various chips I have lead me to the conclusion that the 64bit Xeon chips running the 64bit IBM jvm (all my stuff is in Java) made up a ton of ground, and were faster than the opterons in some tests. But the benchmarks are not true representations of how my app will behave so I am still a bit skeptical.

      So basically I am a scared and confused little boy about to dole out a good chunk of my initial investment. Any advice you could give would be AWESOME.

      Also, do you think that an idiot can figure out how to kickstart all those servers via the network, or will I need a floppy or cd-rom drive in those machines?

      thanks!

      - also Chris

    6. Re:No one size fits all answer but here is mine :) by sirsnork · · Score: 2, Informative

      Not sure why you think Xeon's have better I/O than Opterons but I would spend some serious time looking into the on-chip memory controllers on Opterons and also do the maths to establish the maximum amount of data a Xeon based systems front side bus can move

      --

      Normal people worry me!
    7. Re:No one size fits all answer but here is mine :) by dumbfounder · · Score: 1

      Oh, I don't know if Xeons are better head-to-head in I/O, but spreading an index out onto 80 disks (40 xeon servers) is inherently better than 60 (30 opteron servers). However, the search is cpu bound on some searches and then I/O bound (specifically, hd read transfer rate) on others, so that is part of the reason for my vacillation. I hope to be able to optimize the cpu bottlenecks out, but I don't know how much time there will be for that.

    8. Re:No one size fits all answer but here is mine :) by GeekTek · · Score: 1

      I'm also evaluating platforms, thanks for the post. Why do you choose Tyan over Supermicro?

    9. Re:No one size fits all answer but here is mine :) by jweage · · Score: 1

      I have a couple of clusters under my belt. I've used Appro and Dell hardware in the past.

      Although I've had issues with Dell RAID hardware, their PowerEdge servers have been very solid. When you buy a bunch of machines they will steeply discount your order above and beyond the standard business account discount. This made it extremely difficult to justify the expense of Opteron systems when I could get dual Xeons from Dell for consideribly less money. Add to that the discount on storage, tape backup, etc. and Dell is extremely attractive.

      Another niceity is Open Manage which is included at no additional cost with Dell hardware. There is a Linux client which provides all of the hardware monitoring most people need. Although the documentation isn't that great, it is a pretty useful monitoring platform.

  22. Cheap isnt always the way to go by oh_the_humanity · · Score: 3, Insightful

    When doing clkustering and super computer work. Cheap isnt always the best way to go , if you take into consideration that 5 - 10 % of nodes will either not be functioning correctly or will have some sort of hardware failure. The more you cluster the more man power it takes to repair these nodes. if you buy 1000 $499 colomachines , and 50 of them are failing at any given time, it becomes very time consuming and tedious to keep the cluster going. Spending the extra bucks on high quality hardware , will save you money and head ache in the end. I always use the analogy when talking to older folks who want to get started in computers. spend the extra bucks to get a new machine. The extra money you spend on buying new good equipment , will more than pay for itself in comparison , to the amount of frustration you get from buying old used slow computers. My $.2

    --
    "When they invent bitch slaps that can go through a monitor you better f'ing duck" --deft (253558)
    1. Re:Cheap isnt always the way to go by Anonymous Coward · · Score: 0


      I'll second that, but I've experienced up to over 30% failure rate.

      I will never do whitebox stuff again.

      FYI, the vendor was Microway.

    2. Re:Cheap isnt always the way to go by Anonymous Coward · · Score: 1, Informative

      You are wrong. In *most* cases, cheap is the the way to go (but soo few people realize it).

      Short argument: Google choosed the cheapest way, they are happy with it and don't plan to change their mind.

      Long argument: What you don't realize is that cheap hardware is so DAMN cheap, that you CAN afford employing the manpower required to maintain your cluster. In other words: instead of spending a lot of money on expensive hardware, buy the cheapest gear (where the price/perf ratio is optimal), and then spend a fraction of the money you saved to employ technicians that will maintain/replace defectuous hardware. Moreover with such cheap hardware, you don't even care about the warranty/support, you could trash it directly without even bothering contacting the manufacturer to get replacement parts ! Nowadays $300 is all you need to get a mobo + AMD Sempron 2800 + 512 MB RAM + 80 GB harddisk + NIC 100 Mbps + PSU. So why spend $3000 on a dual-opteron server when you could buy 10, TEN, fscking white boxes for the same price. The only reason would be that the software cannot be parallelized, or that you have constraints on the space occupied by your clusters (that's why I said "in *most* cases cheap is the way to go"). But then even in thoses cases you could maybe spend the money to rent more racks/space, and then you would save money by buying those white boxes.

    3. Re:Cheap isnt always the way to go by Anonymous Coward · · Score: 0

      I forgot to say that, of course, one of the key point to successfully run a white box cluster is: design the software so that hardware failure is the norm instead of the exception. A cluster with 50% working nodes should run as well as a cluster with 100% working nodes.

      One of the poster said "I will never do whitebox stuff again." I guess the reason is that the sofware he was running was not designed with hardware failure in mind...

    4. Re:Cheap isnt always the way to go by IANAAC · · Score: 1
      if you buy 1000 $499 colomachines , and 50 of them are failing at any given time, it becomes very time consuming and tedious to keep the cluster going.
      actually, the same monitoring and swapout/repair of hardware is the same whether it's cheap hardware or expensive hardware. I've had a fair bit of hardware failures on an IBM cluster. Once monitoring catches it and notifies me, I get the new hardware in place and use xcat and kickstart to push everything back out to the replaced node(s). The most time consuming part is waiting for the hardware to arrive. Xcat/kickstart provide a decent toolset to automate everything after the hardware arrives.
    5. Re:Cheap isnt always the way to go by Anonymous Coward · · Score: 0
      Cheap isnt always the best way to go , if you take into consideration that 5 - 10 % of nodes will either not be functioning correctly or will have some sort of hardware failure.

      With cheap hardware you can have 50% lower costs (or 200% the performace/dollar); so even if 10% of your nodes are down you still have a system performing 180% as well as your expensive version.

      My $.2

      Nice freudian slip! A factor of ten markup on that as well.

    6. Re:Cheap isnt always the way to go by toddestan · · Score: 1

      The extra money you spend on buying new good equipment , will more than pay for itself in comparison , to the amount of frustration you get from buying old used slow computers. My $.2

      I would agree that in many cases for clusters, you want high quality new computers. Why deal with a bunch of used P3 class computers, when a cluster of AMD64 computers could get more done with 1/4 the units.

      However, if someone wants an inexpensive computer to browse the internet and do email, I do recommend getting a used computer. The used computer to get is an ex-coporate workstation by a company like HP, Compaq, IBM, etc. There are many resellers of these systems out there, and they will guarantee them. These computers are reliable, easy to work on, and very cheap (High end P3 systems are less than $200). They will give less problems and are overall better systems than the low end $400 eMachines and Dells out there.

    7. Re:Cheap isnt always the way to go by jpc · · Score: 1


      It depends on 2 things, your application and how much software development you can do. Google trades off a lot of software development against the lower hardware costs, and their application is well suited to the piles of cheap stuff as they mostly just want lots of RAM. Other applications are not so suitable and some people are severely constrained in the amount of development time they have (and need the application to run now, not when a new redundant version has been tested in several months time).

      I do believe in doing a good amount of burn-in and testing of machines. I dont want to debug stuff that is failing because of memory errors for example so will always do memory tests. Clearly this is not "necessary" but I think 99% of people dont want memory errors affecting their applications. Also hardware drivers may have bugs - if the etherenet cards on your whitebox machines lock up every now and again with hundreds of them it will happen too often to be useful.

      But other than that, its really a question of whether your application is IO bound, CPU bound, memory bandwidth bound and so on.

    8. Re:Cheap isnt always the way to go by scum-o · · Score: 1

      Figure it out:
      50-node cluster w/myrinet (HPC): $150,000
      3 cheap sysadmins for mainenance: same price, but more than enough to maintain several hundred nodes.

      If you have 1000 nodes worth of clsuter nodes, you can expect 5-10% downtime due to overheating, power supply or harddrive problems. If you have 1000 nodes, your sysadmin staff is probably around 10 people. This is more than enough to maintain 50-100 broken nodes (handling RMA's for harddrives or power supplies) is totally do-able with 10 people. If you buy DELLs vs SuperMicro or another cheaper vendor, you'll pretty much double your hardware cost, but you won't be able to cut your admin staff in half based completely on hardware reliability. Sure, your HW will stay up and running longer, but sysadmin tasks are more than just hardware. You'll still need your staff and hardware will easily fail.

      My recommendation: Buy cheap hardware, buy several complete spare nodes and a bunch of undergrads to swap out hardware. You'll have a cheap, unstoppable cluster that will stay up and running for years to come.

    9. Re:Cheap isnt always the way to go by CommieOverlord · · Score: 1

      The difference with cheaper, slower hardware is that people need more of it to run than with more expensive equipment. More computers means more system administrators.

      I've seen people say "Hey, I can make a cluster out of my 16 old P3-400MHz computers, it's cheap". You can, sure, but a single 3GHz dual-xeon will both be faster and save you time, electricity, and space.

      An upfront cheap may lead to far higher support costs.

  23. Re:Try a Green Solution by Anonymous Coward · · Score: 0

    Damn, how many times have you tried to karma-whore and link that phrusa.org site now? Don't you have a job or something?

  24. Re:XServe by jschottm · · Score: 4, Interesting

    To those who say Apple isn't targeting the enterprise, look no further.

    Let me know when they stop trying to force their iPod updater (you know, the one that breaks Real's compatability DRM software) onto my servers. No matter how many times you put that update in the "Never update this" category, it shows back up the next time you run Software Update. Until they stop trying to play childish games on my production servers, I'll not consider them ready for the enterprise.

  25. Hardware? by woah · · Score: 1
    What about software?

    What are people's experiences with OpenSSI vs OpenMosix?

  26. We do nice house built clusters by nikkoslack · · Score: 1

    We typically build clusters that are NOT destined for a fancy, high end data centers, so 110 volts and a less dense configuration is essential. We build them into the skinniest mid towers we can, with 2 80mm fans in front and two fans in back. we use the baker's rack method and a hot isle/cold isle arrangement of racks. It's hard to find desktop cases without crappy fans in the sides, which are no good to us. Currently, the ASUS A7V400-MX and SEMPRON 3000+, 1GB DDR400 ram, and 120 gig ide hd is the choice. OnBoard NIC and video are essential to clusters, IMO. We'll be moving to SATA drives and gb ethernet onboard soon, but IDE's are dirt cheap right now. Coincidently, this is the same white-box web server we usually build.

    1. Re:We do nice house built clusters by grozzie2 · · Score: 1, Insightful

      I'm curious as to why video is even relavent, never mind essential, for a cluster node? Just another part generating heat, that is serving no purpose other than to possibly pacify a brain dead bios at boot time. Brain dead bios is fixable, heat takes real effort to get rid of it. Seems like a total waste to put video into a machine that's gonna sit in a rack, and likely never have a monitor plugged into it. Seems like an even bigger waste to actually plug some sort of video output device into it.

    2. Re:We do nice house built clusters by oneishy · · Score: 1

      "onboard video is essential" for exactly the reasons you mentioned. Because you don't have to toss an overpriced agp card in the box. Granted a linux box works fine without a video card, but it does have it's uses.

    3. Re:We do nice house built clusters by nikkoslack · · Score: 1

      Onboard, as in built into the MB, not taking up space in a PCI slot, and 0 configuration. I thought the requirements were obvious for commodity clusters, as bios POST screens don't go to serial ports, etc. You gotta set it up somehow, so why buy a card to stick in the machine? Just find a MB that has video and LAN onboard, and you're all set.

    4. Re:We do nice house built clusters by grozzie2 · · Score: 1
      that's one way, altho I prefer to use stuff that comes out of the box with pxe enabled. Just remote boot it, and away to the races.

      Guess it says something about legacy bios if you really need to have a video system in a machine that will never have a monitor/keyboard attached in normal operation. The other detail that really irritates when it comes to off the shelf pc equipment, cant count how many times I've seen a bios post screen sitting there saying 'keyboard error, keyboard not attached - hit F6 to continue'. Then you have to attach a keyboard, just to set the bios up to ignore keyboard errors during boot. I guess it would be just to logical for the folks making bios to think, and realize, if there's not keyboard, and no video, there is nothing to lose (and lots to gain) by directing i/o to a serial port.

  27. Re:XServe by Twid · · Score: 3, Interesting

    Tiger Server lets you run your own Software Update Server, which would solve this problem for you. You run a central update server, point all your servers and clients at that, and then you can approve or disapprove each update before it goes out.

    --
    - "When you want something with all your heart, the entire universe conspires to give it to you" -Paulo Coelho
  28. Obviously by iamdrscience · · Score: 4, Funny

    Isn't it obvious that the best technology is blade servers? I mean, c'mon fucking BLADE servers! It's far and away got the coolest name of any of them. The only way you could beat them would be if some company came out with something cooler like ninja star servers, now that would be awesome.

    1. Re:Obviously by PornMaster · · Score: 2, Funny

      Yeah.

      HP, instead of calling its big boxes Superdomes, should have used Thunderdrome!

    2. Re:Obviously by Anonymous Coward · · Score: 0

      and by awesome, you mean totally sweet.

    3. Re:Obviously by Anonymous Coward · · Score: 0

      Come on!

      Pizza-box servers are the way to go! When you get hungry, just go to the server room and there are 10s or 100s of pretty warm pizzas waiting for you!

    4. Re:Obviously by Anonymous Coward · · Score: 0

      No, the best technology isn't obviously blade servers. It totally depends on what you're using your clusters for. I manage 2 small HPC blade clusters (one has 26 nodes and the other one has 84 nodes), and we've found the following problems:
      1. heat - the buggers run very hot - we have limited space so we ear it, but others may be better off with cooler but larger boxes if they have the room.
      2. They are slower for some latency-sensitve MPI apps because of the switch architecture - rather than 1 huge switch, you have several layers of switches, which means that communication between nodes on different switches takes a small hit, that can be significant for highly latency sensitve MPI applications.
      We run a variety of codes, so once again, it's no big deal for us, but for some sites it could be significant if latency-sensitve apps are the primary use for the cluster.
      Some people may even be better off with 1 4 CPU opteron system with 32gig of ram then say an 8 node cluster for about $50,000 AUD because of the larger ram on 4-CPU system.
      It all depends on the model, and the options you have for improving the code vs the expense of getting and maintaining the hardware.

    5. Re:Obviously by iamdrscience · · Score: 1

      Way to miss the joke.

    6. Re:Obviously by mikeage · · Score: 3, Funny

      Hi, this post is all about blades, REAL BLADES. This post is awesome. My name is Mikeage and I can't stop thinking about blades. These machines are cool; and by cool, I mean totally sweet.

      Facts:

      1. Blades are servers.

      2. Blades fight ALL the time.

      3. The purpose of the blade is to flip out and kill people.

      Testimonial:

      Blades can kill anyone they want! Blades cut off heads ALL the time and don't even think twice about it. These guys are so crazy and awesome that they flip out ALL the time. I heard that there was this blade who was eating at a diner. And when some dude dropped a spoon the blade killed the whole town. My friend Mark said that he saw a blade totally uppercut some kid just because the kid opened a window.

      And that's what I call REAL Ultimate Power!!!!!!!!!!!!!!!!!!

      --
      -- Is "Sig" copyrighted by www.sig.com?
    7. Re:Obviously by Pathwalker · · Score: 1

      Personally, I think they should have stuck with the "Halfdome" name for the single cabinet boxes, and reserved "Superdome" for the multi-cabinet boxes.

      Put two (or more) halfdomes together, and you get a Superdome? Makes sense to me.

  29. It really depends on use. by the_mutha · · Score: 1
    I think it really depends on use and in your facility. For example:
    • Do you host your servers at your own facility or do you host them in a data-center where rack space is a premium?
    • Is your application CPU intensive, disk intensive, memory intensive (i.e. scientific application, web servers, database servers, etc.)


    We run a very large website. We have a 1U dual Athlon MP box for administration and log processing, and a 2U dual Xeon box with a 6 disk RAID 10 solution for apache + mysql. Its a great solution to us because we run things in a data center co-lo where space is a premium. Both servers use off-the-shelf components and are easy to upgrade and maintain.

    If we were running some scientific application and had our own location and good network connection (for example, a university), then I suppose a cluster of standard hardware components like how Google does it would be perfect. If you were doing the same CPU-intensive stuff, but at a co-lo facility, then perhaps a blade solution would be best... it really all depends - you have to look at what your main needs and constraints are, and look at what is available and what will best suit your needs. There is no way to give a standard answer.
    1. Re:It really depends on use. by Sjobeck · · Score: 0

      Decent comments, but ... I do not think that two servers (one admin & one node) really constitutes a cluster, and if it did, it certaintly is not what the poster asked about.

  30. Re:Thanks for the info by symbolic · · Score: 1


    If I had mod points, this is definitely where i'd put one of them.

  31. Teraflopping Bogus Science by Anonymous Coward · · Score: 0

    Star Wars redux: "Medeiros' team and RDECOM will also use the supercluster to improve the design of intercept missiles that defend against attack."

  32. Mobos on Ikea shelves by astrojetsonjr · · Score: 4, Interesting

    Currently 65 (1 master, 64 nodes) of AMD Mobos on Ikea shelves. Cheap, easy to swap out, good air flow around the hardware. The shelves are wood, so everything just sits on them. It would be nice to find power supplies with extra connections to power more than one system.

    1. Re:Mobos on Ikea shelves by Iamnoone · · Score: 1
      What model of shelves?
      I am looking for good alternatives.

      ZOMG, /. now has captchas:
      To confirm you're not a script,
      please type the text shown in this image: random letters - if you are visually impaired, please email us at pater@slashdot.org
    2. Re:Mobos on Ikea shelves by astrojetsonjr · · Score: 1

      Ivar. The shelves are solid wood, they will hold the weight of anything else. The mobo can be held down with wood screws an spacers. The shelves can be spaced 2" apart if you want. It's easy to drill holes in the shelves for cables, extra air flow, etc.

  33. Quick FYI re: Google gear by Anonymous Coward · · Score: 0

    off-the-shelf motherboards

    Mostly true, though increasingly Google has commodity vendors producing custom hardware runs (ie - "Google" branded RAM, variant ASUS motherboards with a custom BIOS and no built-in video or sound, etc.).

    screwed to aluminum shelves

    Nope, no screws. The motherboards are attached to the trays by plastic pop-on/pop-off standoffs and everything else is held together by velcro straps; no screwdrivers or other tools are necessary when swapping parts. The trays can also hang from the front of the racks while parts are being swapped.

    1. Re:Quick FYI re: Google gear by Anonymous Coward · · Score: 0
      Does anyone sell such systems?

      I've always thought it was silly to be paying for video chips and sound chips when the serial line and ethernet were the only way I interacted with the system.

    2. Re:Quick FYI re: Google gear by Anonymous Coward · · Score: 0

      Sure, just order them in lots of five to six thousand every couple of months. Most hardware vendors will bend over backwards to make you happy if you throw that kind of money at them.

  34. Re:XServe by NitsujTPU · · Score: 3, Interesting

    Are you running iTunes on your production servers? Can't you just uninstall iTunes and be fine?

    Why would a system configured as a fileserver have that software on it to begin with? Is Apple's apt tool so bad that it tries to patch software that hasn't been installed?

  35. What to look for? Thats simple: by Anonymous Coward · · Score: 0

    Look for the "ULTRAsparc Driven" logo on the front .. and you've found an ideal base for your cluster...

    -GenTimJS

    1. Re:What to look for? Thats simple: by Anonymous Coward · · Score: 0

      if by 'best' you mean 'expensive and slow compared to others'.

  36. Not everyone can be Google by Anthony+Liguori · · Score: 2, Insightful

    It's tempting to just go buy a bunch of motherboards on ebay and some bread racks to build your cluster. It's certainly the cheapest and most flexible approach.

    However, it takes a special type of people to manage that kind of hardware. You have to deal with a high amount of failure, you have to be extra careful to avoid static problems, you've got to really think through how your going to wire things.

    On the other hand, if you get something like a IBM BladeCenter, you have a very similar solution that may cost a little more but is significantly more reliable. More importantly, blades are just as robust as a normal server. You don't have to worry about your PHB not grounding himself properly when he goes to look at what you've setup.

    I expect blades are going to be the standard form factor in the future. It just makes sense to centralize all of the cooling, power, and IO devices.

  37. Re:Try a Green Solution by Anonymous Coward · · Score: 0

    CAN we just ban the parents IP??

    please...

  38. Re:XServe by WMD_88 · · Score: 1

    You mean how Windows 2003 updates Windows Media Player?

  39. SunFire Servers by PhunkySchtuff · · Score: 4, Informative

    SunFire v20z or v40z Servers.
    http://www.sun.com/servers/entry/v20z/index.jsp
    http://www.sun.com/servers/entry/v40z/index.jsp
    They're the entry-level servers from Sun, so they have great support. They're on the WHQL List, so Windows XP, 2003 Server and the forthcoming 64-bit versions all run fine.
    They also run Linux quite well, and as if that wasn't enough, they all scream along with Solaris installed.
    The v20z is a 1 or 2 way Opteron box, in a 1RU case. the v40z is a two or for CPU box that is available with single or dual core Opterons.
    Plus, they're one of the cheapest, if not the cheapest, Tier 1 Opteron servers on the market.

    1. Re:SunFire Servers by eryanv · · Score: 3, Informative

      The Sun 1U servers are great products for clusters. Last year our research group purchased around 72 v60x's (dual Xeons) and they've never given us a problem. Sun doesn't sell this model anymore as they've pretty much dropped Intel for AMD, but they run Linux with ClusterMatic just fine. Not only that, but Sun has the best techincal support I've ever encountered, I don't mind when we do need to call them up to fix something.
      Besides that, we also have several clusters of midsized towers from a local company, which have the benefit of being able to be used as desktop machines when the cluster gets replaced.

    2. Re:SunFire Servers by Anonymous Coward · · Score: 0

      I have no objection to Sun v20z/40z. However, I tested a demo unit v20z and found the noise generated by its fans in-tolerable. I had a HP 1U beside the unit and thus could do a simple comparison on the spot by switching things on and off.

      I understand it is no a problem if such machines are shipped to co-lo datacenter. But for such brand investment, I supposed Sun could use a better fan. No?

    3. Re:SunFire Servers by iggymanz · · Score: 1

      yeah, the V20z scream alright, after boot Sun has the fans in them pegged by the BIOS at 15K rpm. The high pitched shriek they make is annoying even in noisiest datacenter environment.

  40. It's not that the MegaRAID cards are bad... by Ayanami+Rei · · Score: 3, Insightful

    ...it's the firmware Dell shoves on them.
    It's only designed to hook up with Dell disc arrays and tape drives and everything else can shove it (from their point of view).
    Do yourself a favor and skip 'em and just by the cards straight from LSI.

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
    1. Re:It's not that the MegaRAID cards are bad... by croddy · · Score: 1

      we're even using their disk arrays. i wasn't here when that purchasing decision was made, but man oh man, these things are crap.

  41. None of the above... by Anonymous Coward · · Score: 0

    He, like so many others these days, is clustering for style. It's fashionable to do clusters right now and he doesn't want to be left out.

    He's a slave to fashion and that means having the most kick-ass cluster that ever served a Word document! w00t! w00t! w00t!

  42. Re:XServe by Sjobeck · · Score: 0

    two servers is not a cluster. it is not what the poster asked.

  43. Great story re w/ build your own IBM cluster by MilesParker · · Score: 4, Interesting

    We wanted to set up a small 4-8 node cluster mostly for testing and as a compute resource. For various political reasons we were looking at an IBM solution. At my uirging we went for dual Opterons in the 1U format. And the price seemed right. Here's where it gets wierd *after* the OBM sales people step in. Going thourgh it peice by piece I thought I could put a decent system together - with our substantial IBM discount -- for $14k. By the time we got the quote with all of the crap they thought we needed it was 34k! Just to give the flavor, the rack and assorted pieces was 4k. But thats not the funny part. We were like, "well for this much money, we assume you are putting it together for us." "Um no...didn't you see the services quote that went along with this?" We hadn't -- with the services/support quote came in at $60k! So at this point we asked, can't we just buy the individual pieces we need and put it together ourselves. "Well, yes, but then it won't be an IBM e1350 cluster 'solution'..." "Yea, well, we don't really care what its called, it'll be just as fast and 75% cheaper..." At that time they were getting rid of their 325 servers for way cheap and we actually put that system together for as cheap as a whitebox and probably as cheap as if we'd tried to put it together ourselves. The moral I guess is that if you have to deal with the big vendors, have a very sharp pencil handy!

  44. Clarification.. by MilesParker · · Score: 2, Funny

    ...after posting I realized that the 34 and 60k quotes were before discount (I think). SO the actual price descrepncy was "only" 62%.

  45. It Depends... by xanthan · · Score: 3, Informative

    I think most of my posts to Slashdot begin with "It Depends..." =)

    Answering this... It Depends...

    What is your cluster's tolerence to failure? If a node can fail, then you have the option of buying a lot of cheap hardware and replacing as necessary. This is the way that most big web farms work.

    What is your cluster machine requirements? Do you have heavy I/O? Does cache memory matter? Do you need a beefy FSB and 64G of RAM per node? You may find that spending $3000/node ends up being cheaper than buying three $1000 nodes because the $3000 node is capable of processing more per unit time than the three $1000 units are.

    What is your power/rack cost constraint? Google is an invalid comparison simply because of their size. They boomed when a lot of people were busting and co-lo's were hungry for business. I'd bet they have a song of a deal in terms of space, power, and pipe. You are not Google and I doubt you have a similar deal. Thus, you may find that there is a middle ground where it is better to get a more powerful machine to use less rack space/power.

    In the end, you have to optimize between these three variables. You'll probably find that the solution, for you specifically, is going to be unique. For example, you may find that: Node failure is an option since the software will recover, power/rack costs are sufficiently high that you have to limit yourself, and CPU power with a good cache is crucial, but I/O isn't. This means getting a cheaper Athlon based motherboard with so-so I/O and cheesy video is a good choice since it frees your budget for a fast CPU. Combine with the cheapest power suppy the systems can tolerate and PXE boot and you have your ideal system.

    Best of luck.

  46. Classic problem: Fast, Cheap, Good. Pick Two. by sirwired · · Score: 2, Interesting

    Obviously, there ain't no such thing as a free lunch. It all depends on what you want.

    For sheer processor density, if you need complete servers, the IBM BladeCenter servers offer the most "Bang" (Fast), and they are fairly reliable and compact (Good). They are not cheap. They do have better density than the HP Blades. WETA Digital (Peter Jackson's FX company) uses them.

    That will get you 2 server processors, two server-class IDE drives + 2 GigE ports + all peripherals (Power, KVM, CD, Mangement, GigE switches, SAN switches if you want, etc.) per one-half of a rack unit. This is well over twice the density of pizza box units when you count external peripherals like the networking switch, KVM, etc.

    Google's setup is Fast and Cheap, but their hardware reliability is quite lousy. However, their clustering setup is specifically designed around expected hardware failure.

    (As a side note, Google no longer uses bare boards for their basic nodes. They use fairly small and slow nodes with a LOT of RAM from some company I can't remember. They look kind of like over-sized hard drives.)

    If you need crap-loads of raw computing power, in a relatively compact power-efficient chassis (1024 processors/rack), IBM's Blue Gene simply cannot be beat. This is Captial-F Fast, and Capital-G Good, but you certainly can't afford one. (While it provides more cycles for the watt and dollar than any other setup, it isn't exactly as simple as a Beowulf cluster.) And you would still need to buy pesky things like large GigE switches and storage. Check out the current issue of the IBM Journal of Research and Development on IBM's website (or your local university library) for all sorts of juicy details.

    [Yes, I am an IBM shill]

    So realistically, you really need to look at your application. If it can tolerate failure of any individual node on a regular basis, get the cheapest stuff you can find that will fit in your space and CPU requirements. If node reliability is important, but space is not, 1U servers from any of the three major vendors (or Apple, if that is your thing) will do the job just fine. If you need reliability and space, then honestly IBM's BladeCenter boxen are the best, as long as they fit your application. (I am not just speaking as an IBM'er here... they really are the best blades out there.)

    SirWired

  47. Re:XServe by Anonymous Coward · · Score: 0

    Like most Apple zealots, you're completely missing the point here. Why should he have to run his own software update server to circumvent Apple's childishness? Quite a waste of resources, don't you think?

    Yes, but that also means he has to buy more pricy Apple hardware.

  48. Depends on your problem by Anonymous Coward · · Score: 0

    The vast bulk of the computing in the branch of the sciences that I build and manage clusters for are basic 1U Intel/AMD Dual servers combined with medium level networking and a growing number are managed by Rocks, the RHEL based rocks cluster distribution available at rocksclusters.org.

    If you are building a cluster and do not try out rocks you are doing yourself a disservice regardless of hardware.

  49. Re:XServe (Stay clear) by madmarcel · · Score: 1

    I would recommend against using Xserve's for clustering. We've been using one for the last 5 months, and we've had nothing but trouble with the damn things. The only thing good about them is their comparatively low price :(

    Not recommended.

  50. Good to know they are not useing cardboard... by Anonymous Coward · · Score: 0

    Hum... It's good to know they have upgraded... A little bird once told me that they had a fire at their colo once due to their rack just being stuffed with MB separated with bits of cardboard. Course I'm probably lying, so don't take my word for it ;-)

  51. Good Thread by eutychus_awakes · · Score: 1

    I posted an "Ask Slashdot" question almost identical to this one about 6-months ago - but it was rejected (go figure). Regardless, I got what I wanted from this one! Thanks folks.

    --
    This sig is a test. If this had been an actual sig, you would be reading something quite a bit wittier than this now.
  52. Re:Classic problem: Fast, Cheap, Good. Pick Two. by Anonymous Coward · · Score: 0

    On the blades, I would hold out just a bit for the LS20's (68W Opteron's) to mature. It will/is available as dual core as well.

  53. Good Cheap Servers by Anonymous Coward · · Score: 0

    The best way to build a cluster on a budget is www.cheap1u.com. For what our servers cost no one can touch the price and performance of a basic server without all the bells and whistles that you don't want or need. Built with off the shelf parts you cant even buy most 1U rack mounts for less than $1000, ours start at $399 and then if anything burns out on their server you are stuck with buying high priced replacement parts.

  54. Cluster + Infiband = Big NUMA by Jah-Wren+Ryel · · Score: 1

    Check out VirtualIron.

    These guys make a hypervisor that sits on top of infiniband between multiple blades and turns them into one big shared-memory linux NUMA system, up to around 32 cpus and probably even bigger eventually. Plus they can dynamically move cpus and memory in and out of each virtual machine, split the group into multiple smaller virutal machines, etc.

    I have no connection to them, just saw them at one of the east coast linux tradeshows. I think their hypervisor will eventually be superceded by Xen or one of the other virutalization systems, but for now what they've got looks like hot snot on a gold platter.

    --
    When information is power, privacy is freedom.
  55. XPC boxes - cheap and reliable by Anonymous Coward · · Score: 0

    We've been using Shuttle XPC systems - the older VIA chipset kind 333MHz/P4 based with a GigE card in the PCI slot. Very cheap, easy to self-assemble, so far stable. The same boxes are on the office desks with an AGP video card added. Every box is the same hardware. 8 of these plus switches and UPS fit in a 1/4 rack - OK its a bit crammed, but hey so is NY.

  56. Re:XServe by Guido+von+Guido · · Score: 1
    It sounds like he/she/them/it has uninstalled iTunes. I suspect it does some dependency checking, and iTunes just happens to fulfill some stupid dependency.

    Most likely this is some idiotic setting in the default update settings (although it take some digging to find it).

  57. Re:XServe by duffbeer703 · · Score: 1

    Even more disturbing than obnoxious practices like this are the other "updates" applied.

    For example, when you update iTunes, you must accept retroactive licensing changes to all of the music that you have purchased.

    --
    Conformity is the jailer of freedom and enemy of growth. -JFK
  58. Re:XServe by Twid · · Score: 1

    Well, he's complaining about something quite minor, which is just having to uncheck the updates you don't what when you run an update. By comparison, in the Linux world Novell forces you to purchase Red Carpet update service for the same feature, and I believe Red Hat does the same with RHEL.

    So, I'm not sure why all the hate. The software update server is bundled with Tiger Server, which you can get for as little as $500.

    --
    - "When you want something with all your heart, the entire universe conspires to give it to you" -Paulo Coelho
  59. Re:XServe by jschottm · · Score: 1

    Not iTunes, iPod updater. The thing that tries to put new firmware onto [my non-existant] iPod when you plug it in. And no, I do not have iTunes installed on my productions servers.

    And to answer Guido's question, it's not a matter of fixing dependencies for iTunes - on desktop systems [which do have iTunes installed] you can upgrade iTunes just fine if you tell it not to install the iPod Updater, but it will keep trying to install it again and again and again.

  60. Re:XServe by jschottm · · Score: 1

    I wouldn't know, having not used WK3. Trying to update the media player that frequently has security holes is different than trying to stuff your iPod firmware updater onto every system possible though. And I've never seen an instance of XP/W2K (which I use quite a bit) adding an update you told it not to apply back to the list of patches to download and install, except for the notable exception of things like SP2 for XP.

  61. Re:XServe by jschottm · · Score: 0, Flamebait

    Well, he's complaining about something quite minor

    It's not just a minor thing, it's symptomatic of what's wrong with Apple. You will use your computer the way Steve says you'll use your computer. You will use your iPod the way Steve says you will. Alt(Flower)-Tab will work the way Steve says it will (they intentionally broke the ability for third party software to change its functionality to match Windows/[many] Linux style window switching).

    When I say I don't want a package installed on my system, that's final. Any company with that attitude is not ready for the enterprise IMO.

    By comparison, in the Linux world Novell forces you to purchase Red Carpet update service for the same feature, and I believe Red Hat does the same with RHEL.

    Access to the Red Hat Network is included with every RHEL subscription. I've not used any of the Novell branded Linuxes, but in SuSE (owned by Novell) all I do is fire up YAST and point it at a mirror and download away. No separate purchase or Red Carpet involved. All of which is irrelevent to my point.

  62. Imagine..... by Stenson · · Score: 1

    Imagine a beowolf cluster of ask slashdot articles about beowolf clusters!!!!

  63. Re:XServe by MrResistor · · Score: 1

    By comparison, in the Linux world Novell forces you to purchase Red Carpet update service for the same feature, and I believe Red Hat does the same with RHEL.

    And if they then ignore the customers choice, they aren't ready for the enterprise either.

    But, for the record, you can uncheck updates you don't want in YOU just fine without Red Carpet.

    --
    Under capitalism man exploits man. Under communism it's the other way around.
  64. Stealing cycles from coworkers by cazzazullu · · Score: 1

    is what I do. We have a linux network, 50 identical pc's, ldap authentication and nfs-imported homedirs. This means I can ssh to each and every of these 50 pc's, and that all these pc's can communicate with each other by exchanging data through this common homedir.

    When I need to run a program (calculation, simulation, something that takes a LOT of power) I just make a list of parameter-values I would need to run, and feed them to a script that automatically distributes this over these 50 pc's. With renice 19 nobody notices (except for casefans that go 100% ;), and it goes FAST. Although this is not really parallel computing this method is very simple to implement (some bash-scripting, an expect-script for auto-login...) and I've gotten really good results with it.

    --
    int main(void) {while(1) fork(); return 0;}
  65. To be honest, I think *everyone* will be Google by Colin+Smith · · Score: 1

    Google are doing to I.T. what Arkwright did to textiles during the Industrial Revolution and Ford did to manufacturing at the turn of the century. The vast majority of those in I.T. who don't follow their basic model are going out of business in the near future. There will only be a few niche players and a few Google like companies.

    Anyway, the magic formula for the system to use is MIPS per Watt, or MFLOPS per Watt. The power requirements and heat produced by high density computers is a real problem to deal with, it's the real limiting factor for your data centre. The cost of the individual computers is relatively small in comparison.

    e.g.
    http://www.acmqueue.org/modules.php?name=Content&p a=showpage&pid=80&page=4

    Anyway, I'd be building a Transmeta based cluster even though the absolute per CPU performance is lower than the latest AMD or Intel offerings (I'm assuming you need to keep ix86 compatability). So, go look for Transmeta blades or Transmeta 1/2 U or 1 U based servers.

    --
    Deleted
  66. Dealing with the heat by Colin+Smith · · Score: 1

    And power requirements significantly increases the cost of your data centre. There's no point buying lots of cheap servers if you have to then spend a fortune on uprating your air conditioning and UPS.

    --
    Deleted
  67. Re:XServe by defaultXIX · · Score: 1
    So you probably don't update Media Player on your 2k servers either do you?

    Man, how do you get any work done?

  68. Here you go. by Colin+Smith · · Score: 1

    Comparison of various CPU in MIPS/Watt or MFLOPS/Watt.

    http://www.acmqueue.org/modules.php?name=Content&p a=showpage&pid=80&page=4

    And some suppliers of efficient servers.
    http://www.transmeta.com/success/server.html

    Having dealt with HP, I would be inclined to look at one of the other vendors.

    --
    Deleted
  69. I highly recommend Penguin Computing by jimshep · · Score: 3, Informative

    We recently puchased a 13 node dual Opteron cluster from Penguin Computing after evaluating clusters from them, Dell, IBM, HP and single memory image machines from SGI. Penguin provided a solution with the best price/performance as well as ease of use. They let us benchmark our codes on some of their test clusters to determine wheteher the Opteron or Xeon based clusters would be better suited to us. My favorite aspect of their system besides price was the plug-and-play setup. The cluster was shipped fully assembled, configured, and tested to our site. All we had to do was roll it out of the crate, plug in the power and network connections, configure the network settings in the OS, and start running our simulations. All of the solutions from the other vendors would have required significant setup time on our behalf unless we spent a large amount of money for the services. I also really like the Scyld operating system that was included in the cluster. It makes the cluster work almost like a single image memory machine. Scyld on the compute nodes is setup to download the kernel image and necessary libraries from the master node at boot-up, so any changes made to the master node automatically propagate to the compute nodes. After several months of running simulations, the cluster has not given us any problems. It has been very reliable (never needed a reboot). Their technical support has been very responsive about ansering questions we had with the initial startup.

  70. Re:XServe by Anonymous Coward · · Score: 1, Insightful

    Apple may be targeting the enterprise, but they are off target.

    Let us compare, shall we?

    Sun & HP & IBM all are perfectly willing to have response times under 4 hours. Not 4 business hours, not 365.24 days per year except.... 365.24 days per year period.

    Does Apple do that? No.

    Let's take sysadmin certification. Are they like Sun, HP, and IBM's AIX certs where they are good for the version of their OS you take it for? Or is it like Microsoft's, where it expires? Is it one test or set of tests...or is it a la cart? Hmmm, it is like Microsoft's.

    Nope, Apple isn't serious about the enterprise, at least not where Unix is concerned.

    By the way, for the original poster: Here's a nickle, sonny; get yourself a real computer. http://www.sun.com/servers/highend/>

  71. Re:XServe by bani · · Score: 1

    Agreed. This is the major problem with apple (other than apple's terminal case of NIH syndrome).

    OSX dictates to the user how they will perform functions. Users are given no choice in how to do things. OSX is a dictatorship, you're forced to use a computer with training wheels on and you can't take them off.

    Apple also keeps changing the API on developers. Whenever they manage to do something with OSX that Apple doesn't like, Apple insists on removing or breaking the API so you can't do that anymore. So some very useful utilities have slowly lost functionality over time and OSX revisions.

    Re: the updater -- I too have run into the problem where OSX software updater wants to install software you've removed. I've removed itunes from my Mac Mini and all that other trash, but software updater always wants to keep 'upgrading it', even though I told it not to.

  72. You've sold me. :) by MilesParker · · Score: 1

    From your description and what we've gone through, the advantages you list for Penguin -- esp. configured setup before shipping and decent support -- are key issues as you can spend a *lot* of time getting everything setup.

  73. Anyone tried RocketCalc or OrionMulti? by MilesParker · · Score: 1

    I ask because this is one other solution we looked at -- acutally, we dropped Orion almost immediatly -- of you add up the power of all of those Celerons, its still not a very good deal. :) But I liked the Rocketcalc very much for a personal cluster. 2-way, 8 nodes all in one package, single power-on, etc.

  74. You may find this interesting by deadline · · Score: 1
    --
    HPC for Primates. Read Cluster Monkey
  75. Re:XServe by claudius0425 · · Score: 1

    Last time I checked, FreeBSD was based on the 4.4BSD codebase, which is directly descended from the original ATT UNIX. So, while FreeBSD isn't Open Group certified as UNIX, and neither is Darwin, both are directly descended from the UNIX codebase, so yes, OS X is UNIX, it just isnt UNIX(tm).

    --
    Phus. Sysiphus.
  76. Re:XServe by Fancia · · Score: 1

    Re: the API bit, Apple has said that after Tiger, they won't be changing the APIs, just adding new ones.

    --

    Bít, zabít, jen proto, ze su liska!
  77. anybody use these ?? by johnn · · Score: 1
  78. Tiger! by Mechcozmo · · Score: 1

    Of course, if you are really cheap, you can gather any Mac running Tiger, get one that runs Tiger Server, and use Xgrid. (http://www.apple.com/macosx/features/xgrid/).

  79. Virtual alternatives? by Anonymous Coward · · Score: 0

    So for smaller shops...maintaining the hardware and this level of complexity involves quite a bit of time and effort. Time, I might add, that might be better spent doing business development for our ASP offering rather than investing in equipment and doing all the work setting up the boxes, then worrying about hardware failures and misconfigurations and so on. Any good virtual solutions out there yet?

  80. .02 cents = $0.0002 by Anonymous Coward · · Score: 0

    "My .02 cents worth ..."

    $0.0002

    ?

  81. ObSimpsons by sharkey · · Score: 1
    HP, instead of calling its big boxes Superdomes, should have used Thunderdrome!

    Bart: Don't you mean "ThunderDOME"?
    Willie: Quiet! You want to get sued?

    --

    --
    "Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.