Macintosh Clustering
HiredMan writes: "Wired is running an article comparing the set-up and admin of Linux Beowulf clusters versus Mac based clusters. Slant of the article is that the Macs are easier to set-up, maintain and are more flexible. They note that the Linux "how to" manual is 230 pages while the corresponding Apple document is a 1 page PDF file. Dauger Research of former Appleseed fame is mentioned as well, of course. MacSlash is also covering the article. Let the on-topic (for once) Beowulf comments fly..."
The Linux manual is a Beowulf cluster of Mac manuals.
Finally we may rejoice! For once Apple has surpassed the user-friendliness of Linux! Let the merriment begin!
"Ask me about Loom"
Having used the old Nextstep API (which I believe have been ported to OS X under the guise of CoCo) I can say that they are well suited for cluster computing.
I remember Richard Crandall and the mathematica guy (Wolfram) using Zilla (an old Next distributed computing program) to crack the world's largest prime in the mid nineties...
Anyone know if Zilla is back on OS X?
Also the Gigabit ethernet on motherboad and the large 2MB cache on the PowerPC chips will go a long way on making these machines a good cluster.
It's been a while since I've done distributed computing (hey, I am out of acedemia) but OS X will hopefully make the whole shebang easier...
I can't comment on whether or not a Mac cluster is easier to create or maintain (since I've never used a Mac cluster), but I'd prefer a Linux cluster running PC hardware, because:
-- Initial build costs are much lower (dual Athlon 2000+ right now without graphics hardware is way cheaper than a dual G4 1GHz).
-- Maintenance costs are much, much lower. Anything goes wrong with a PC node, just swap out that part with another commodity part. Mac repair or parts replacement costs will eat you, especially if you start to have many, many nodes.
Plus you can modify bits of Linux if you need to optimize the behavior of your cluster for the sort of computing you do, which you can't do with Mac OS.
My $0.02.
STOP . AMERICA . NOW
Wow, imagine one of these clustered machines running ALL BY THEMSELVES!
Strange, it doesn't seem to have the same comedic value this way...
I don't think this comment is insightful at all, but hey - I don't have moderator points today, so I'll argue.
The fact that the manual is shorter - VASTLY shorter in this case does in fact imply that accomplishing a task is easier.
Here's the skinny: Human factors. A one-page PDF is easier to scan and reference than a 200-page text file without references or pointers. If references an pointers are added along with a TOC, then scanning for specific instructions becomes much easier.
Comparing a 200-page document written by programmers to a one-page document made possible by a more graceful GUI and architecture, and written by professional tech writers is ludicrous. Less instructions to accomplish the same task = easier. Plain and simple.
yes, but according to the article they got a sixth-grader in Hawaii to set one up. Doesn't that say something about ease of use? --Triv
... is the ease of use. Tech Professionals can't make a living supporting the platform.
... However, he hasn't done any consulting yet because all of his clients have figured it out for themselves. All they need are a few G4 Macs, some Ethernet cables, a hub and the Pooch software. Getting it up and running is as simple as installing the software and configuring it through a couple of dialog boxes. ..."
Before someone accuses me of saying they never break, always work flawlessly, and the like: They do need support. It's just that the ideal career envoirment is when there is more work than workers. An underwhelmed support staffer soon finds the company wants him to help unload pallets in his spare time.
When all the IT staffers know one platform, what do you think they're going to recommend come upgrade time?
From the article:
"
Wouldn't it be great if "plug and play" clustering became a reality. Say your office mates are out to lunch, or there's no one scheduled to use the school computer lab for the next hour and you want to render the effects for you three-hour iMovie, or you want to perform batch despeckle on a few hundred inages in Photoshop...
Nothing against Linux (I use it myself for a router), but a three-day setup for Beowulf clustering isn't a great deterrent if your calculations will be going for a month or two.
The type of clustering we're talking about here is something that could potentially appeal to the average SOHO or school, where they have five to 500 general-use Macs that have processor cycles to spare.
My question is this:
What would it involve to make Mac OS X and every program that runs natively on it to be able to take advantage of clustering right out of the box? If they can natively use multiprocessing, how much of a leap is it to patch the OS to natively support clustering?
Not only would this be great for techies, but it seems that this would be a great incentive to volume sales from Apple, where they now generally only get one or two Macs per site and the rest are Wintel workstations.
Why don't you just make 10 louder and make 10 be the top number...and make that a little louder?
Cost of 10 good Intel machines to install Linux on... trivial (pobably about $15,000)...
Cost of 10 good Highend Macs, (about $30,000)...
Both are in the trivial range compared to the costs of time, energy, etc.
There is a more important question, which machine gives you the most bang for your buck?
We know that Photoshop runs better on the G4, what about your operation?
If the Mac gets a 2:1 performance advantage, then the costs are equal. If the Mac out-performs it regardless, you get an advantage.
For the moment, let's assume that you are getting real machines that are tested, not parts off of a sketchy vendor from pricewatch.com. If you are really trying to build a parallel computer, you want real systems, not junk that may or may not work.
This also rules out eMachines, or home computers. You are basically in the Compaq Workstation, Dell Workstation, HP Workstation, or IBM Workstation area. You aren't setting up a bunch of Presarios.
First things first: I'm a big Mac fan -- when it comes to using Macs as client machines.
But I really do have to agree here. Short documentation doesn't necessarily mean a simpler product -- it could just mean bad documentation. In the case of Apple, unfortunately, that's very likely to be true; I've always found that their products come with unbearably flimsy documentation addressed to the most newbie user. Of course, this works most of the time, since their market is largely made-up of newbie users, and since many of their products are, in fact, much easier to use than their Windows/Linux counterpart. But when you're doing something like clustering, well, you know, I'd rather have a big manual, thanks.
Do domain names matter?
And don't note that the manual (if it's the Beowulf book everyone cites) is mostly about how to PROGRAM it (e.g., includes an intro to MPI).
"The fact that a manual is shorter doesn't mean that it is a better or easier to install program."
While this is true, it's not even to the point. They didn't compare manuals. They took a book written on building a Linux cluster, and compared it to what is basically a step by step outline for for plugging together a G4 cluster. There are similar outlines out there for Linux clusters, too:
The SCL Cluster Cookbook by the folks at Ameslab is a bit longer than 1 page, but still shorter than 230. (http://www.scl.ameslab.gov/Projects/ClusterCookbHow to Build a Beowulf Cluster -- this is 10 pages long, but goes into such detail as processor, network, RAM, and disk speeds separately for both master and slave nodes. (http://www.mcsr.olemiss.edu/bookshelf/articles/h
But the point is, this article was written by pro-Mac people, so obviously they're going to take a pro-Mac stance. I mean, if these G4 clusters get to be useful, someone is going to write a 230 page book on how to build one of them. Right now, all the documentation that may be out there could be contained in this one page outline. The books come later, if the technology becomes accepted.
----This it patently absurd.
Apple's documentation was thin because it could be thin. PC makers shipping thin documentation in an attempt to be "like Apple" and failing to provide adequate information (because they couldn't get away with a thin manual due to PCs complexity and Windows' complexity) is solely the PC manufacturers fault.
There's plenty of things to hate Apple for: you don't have to go looking for stupid and pointless things.
Potato chips are a by-yourself food.
First things first: I'm a big Mac fan [...]
Mmmmmmm, Big Macs....
Step One: Plug them in.
Step Two: Turn them on.
Step Three.... there's no Step Three! There's no Step three...
MOSIX clusters are a one-liner to set up, for example. I challange Apple to beat that!
I'm not sure about Compaq's One-Stop Linux Clustering. I've never got it to compile. But, assuming it can be made to work, I bet it'd be pretty decent, too.
Last, but by no means least, clustering in the Real World tends to be through PVM or MPI, which are platform-independent. Hardly anyone uses OS-specific clustering, because hardly anyone but high-energy physicists ever develop large clusters in the first place!
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
So ever wonder why you get so little useful information with your computer? Blame Apple.
Nice flamebait, Reality Master, but I think cost savings had *way* more to do with PC manufacturers ditching the paper manuals than Apple ever did.
As far as that commercial goes, it was true for the average user (the commercial's target) -- the Mac didn't need a ton of manuals to do the equivalent tasks that you needed those manuals for on PCs.
It doesn't mean there shouldn't be better documentation nowadays, of course (apart from keeping David Pogue et al. in business). But let's try to keep our pre-conditioned biases as tenuously connected to reality as possible.
I watched C-beams glitter in the dark near the Tannhauser gate.
How is a Mac "easier to set up" in a Beowulf cluster than a group of identical PCs?
I can see where the author might make a point to say that the Mac is nice to use for a cluster because Mac hardware doesn't really change much from box to box, but the same could be said for a group of equal-built PCs. Infact, most real-world (re: not your bedroom.) Beowulf cluster nodes are NOT loosely conglomerated machines with wildly different capabilities from node to node. Most clusters are planned out well in advance, in where each node is precisely equal in terms of its hardware and horsepower.
"Its easy to set up because all of your nodes are the same with a Mac!!" ceases to be a valid "advantage", when the same can be said of a group of SGI O2 boxes, a group of Sun E10K boxes, or a group of lowly 386 PC boxes.
Besides, "its see-thru orange!!!" shouldn't top your list of reasons to purchase Macs for your cluster. You buy a pile of 1U rackmounts, because you normally don't have a whole room to dedicate to a cluster. (duh)..
Cheers,
Bowie J. Poag
In fact, my first inclination is to try to use the Beowulf stuff rather than Pooch simply because such a detailed work exists and is available for Beowulf clusters, but I don't know if any such information exists for Pooch.
I've just read the article, and added my grain of salt for bias, but most people here fail to realize that hardware costs are *very cheap* in relation to human costs. If what they say is true, it's worth the extra price on hardware.
You think the P4 price/performance is bad, G4's are insane
USC Macintosh Cluster Running the AltiVec Fractal Benchmark achieves over 1/5 TeraFlop on 152 G4's and demonstrates excellent scalability.
KLAT2's complete results are: Rmax=64.459 GFLOPS with 64 Athlon 700MHz with 128MB PC100 CAS2 SDRAM
So a 1 tflop apple machine would cost about $440,000 in hardware for 152 G4 1000mhz -vs- 270 Tbird 1400mhz at about $160,000.
The difference, $280,000 could certainly hire someone literate enough to read the long linux manual.
If voting were effective, it would be illegal by now.
"A good use for these [ancient] machines is to recycle them and one way to recycle is to create a bigger faster machine with them."
Not if your primary concern is getting the most FLOPS/$. Given that a brand-new $1000 computer will be something like 10 times as fast as your old ones, at the same power consumption, it doesn't take very long before your new computer pays for itself with the money you save in electricity not running 9 additional machines.
Consider:
150 Watts (low for a PC, probably average for a Mac) x $0.10/KWH x 24 Hr/day x 30 day/mo. x 10 machines = $108 per month. Your $1000 new machine will pay for itself in less than a year, from electrical savings alone.
Of course, this assumes dedicated compute servers running all the time. If you run the cluster software as a backgound task on desktop machines with many users, it's a different story.
They note that the Linux "how to" manual is 230 pages while the corresponding Apple document is a 1 page PDF file.
Meanwhile, documenters have been developing a "What to do with a linux beowulf cluster" list. That document has grown to 230 pages. The corresponding mac list has come up with one idea (And it fits on a 1 page PDF file): "Create a system that allows us to use Photoshop to edit super-high resolution pictures of Natalie Portman eating hot grits."
(j/k!, and, btw, I'm using a Mac right now. :-)
--You will rephrase your request for me to go to hell. Goto statements are not acceptable programming constructs
The point isn't flexibility: sure you can be more flexible with a Linux-based cluster. You can tweak and tune a Linux-based cluster to meet your specific needs. This is why Google uses such a cluster.
The point isn't about cost: the real difference between a decent name-brand PC and a Mac is negligible. In the case of these Mac-based clusters, since the clustering software is just another app, a Mac-cluster can be setup and torn down quite readily. You come into the lab on Wednesday to find your workstation has been appropriated for the cluster.
The point is accessibility! If you're a physicist in a small school looking to model some complex interaction, you can rent some computer time from somebody (expensive), build a cluster (very expensive, because you'll have to hire somebody to do it--physicists aren't likely to be Beowulf experts), or use the Mac clustering software (expensive, because you'll have to buy the machines if you don't already have it, but you can do it yourself, quickly, without much bother).
Accessibility! It's what keeps Apple in business. This is another example of it.
I'm pretty disappointed in the posters who knock it, because it strikes me that they are a bit put out that they won't remain the Technical Elite because they've got the spare time to read the 230-page Beowulf manual.
Potato chips are a by-yourself food.
I DL'd and read the manual. It really does seem just that easy. It costs $100+ per node, but you pays fer yer time & headaches, doncha?
The faster the machine and traffic the better of course, but you could do this with the cheapest iMac ($799 new, ~$400 used) or a bunch of cubes (banking finally on their close packing ability) if you want Altivec in the mix.
Gosh, a reason to make a headless iMac2 - that would be quite the aesthetic eh? Seventy six of those snuggling on a ping pong table...
Communication can be over Airport, too - so you can imagine ad hoc Mac Clustering begin setup during the first half of every Jobs keynote - you know, the part where he just says stuff - to go thru all possible iterations of the product to be intro'd in the second half of the keynote...
"Win treats sysadmins better than users. Mac treats users better than sysadmins. Linux treats everyone like sysadmins."
... for scientists like myself, this is a very nice thing. Not all of us in the sciences are tech-savvy... I'm probably the one in my 5-person research group who understands the most about *nix. For those of you who don't realize this, many research scientists have to work hard to get their grants and outside money.
So, what does all this mean to us? As an atmospheric scientist, having some serious number crunching power is mighty helpful. Weather modeling is quite the processor intensive task, and then interpreting the results can take years after all the computing is done, including further computations and visualization routines. To put it shortly, we can easily tax our computers.
So, now you know that we need computing power, but money is a premium for us in many cases, so why shouldn't we just get some cheap Intel boxes and *nix cluster them? Well, we could, but then we'd need to hire a systems admin. Someone who is tech-savvy enough to keep everything running decently well for us. That requires another person who REALLY understands what's going on in many cases, which is another salary on the payroll. For us, it all ends up balancing in the end. The $5-10K that we save in clustering our 8 Intel boxes over the Macs is eaten up in one year or less by the guy (or woman) who has to set up the whole thing. So, for us, the ease of setup and use is something that can translate into some good savings and we don't have to worry as much about having to rely on another person to save us if something goes wrong. That's the benefit of simplicity for us.
I agree that it is important to know, as one person said, "The nature of the beast", but that's something that takes time to do, and when you're not being paid to learn about how to cluster computers, but to figure out how the atmosphere works, then things like "The nature of the beast" are just further complications. I would rather have something that I can slap together, know that it works, and get back to my work, without the interference of others if I don't need it.
And that brings me to another rebuttal, about someone mentioning that if you buy the Macs, you're also going to pay for all the extra Superdrives and video cards and all that. I say to that, "Good." That way, if the cluster doesn't need to be used, then I don't have a bunch of mostly useless boxes sitting around... or if a collaborator comes around and needs a computer, I can just remove one of the computers from the cluster and let them use that for as long as they need. The point is that there are advantages and disadvantages to each setup. Now you've heard some advantages and why the scientific community might care about this. Remember, not everyone here can compile their own kernels and not everyone cares about being able to do that. Some of us, thank the deity of your choice, actually want to do something with this power and not care how it works in depth. To each their own.
-Jellisky
Photoshop is optimized for the G4. That was my point. We're not clustering Quake here. We're talking about special purpose applications that do scientific calculations.
If you application does better on the Intel, you are likely better off considering a Linux cluster. However, if it isn't much better, you might be better off with the Mac cluster by adding a few more machines to compensate... depends on the costs of time.
If you are running an application that, LIKE Photoshop, does better on the G4, you will see the price performance favor the Mac line. That's my point.
If this market was a decent size, I bet Apple could get some really competitive cluster systems. It would be nice to see an Apple dual or quad G4-1 GHz, with a CD-ROM, ATI Rage 128, and Gigabit Ethernet for the scientific community.
They could make the machine without PCI slots and fit in a 1U case for OS X processing goodness.
However, the reality is that the extras (better video card, Superdrive, etc.) don't add much to the Apple's price. However, the right form factor could make them tremendous cluster machines.
Alex
Obviously, you know very little about the Macintosh. You should learn a bit more before you go spouting off flames.
The software used to accomplish the clustering for AppleSeeds is Mac MPI, which is based upon the *standard* for parallel computing, MPI. The reason that the PDF doesn't talk about programming MPI is that there is no need for redundant documentation. Go find a book on MPI if you want to learn to prgram to that API.
And yes, I will get quite far telling you it's easier to upgrade Mac OS X to its latest version/. Thanks to Apple's Software Upgrade control panel program, this can all take place automatically according to any schedule you desire. Two clicks of a mouse is all it takes to set this up, as opposed to spending quite a lot of time figuring out how to use the incredubly arcane "apt". In fact, AFAIR, Software Update is now set to operate automatically by default.
Gee, I didn't realize that particle physics simulations involving millions of particles wasn't a *real* application...
The fact that your comment has been moderated up to four (so far) is simlply an empiric demonstration of the lack of knowledge of most Slashdot readers.
If you have a scientific cluster, you don't want to be swapping things out. You don't want to take nodes offline because a video card fried. You want a system that is going to work.
I just priced out some Compaq Workstations yesterday and compared them to Apple Powermacs (Apple's workstations) for doing some OpenGL game development.
Apple Powermac with dual monitors and the upgrades we'd want... $5k. Compaq Workstations... $5k.
In the price-conscious area, Apple's iMacs/iBooks offer a good solution at a reasonable price. You can't compare Apple's workstation line with your "look ma, I built it myself" machine.
Apple does QC. You don't. You and your screw driver does not equal scientific requirements for reliable and predictable. If a node fries, you likely need to start over again. You can't just try to fix the damage.
Linux is great, OS X is great. They are very different UNIXes in different markets.
Alex
Sigh. This is why WinTel continues to dominate.
Dumb businesses look at the quick and short-time costs, relegating longer-term costs to secondary status. Most semi-smart people know that longer-term (and usually recurring) costs tend to dominate over the long term. Even though there are many studies showing Macs having higher ROI, WinTel gets the vote. Just look at arguments here on Slashdot- "I can get (whatever) much more for the price Apple charges!" But it is proven that Macs need less maintainence and less configuration stuff.
With Linux, it's less clear because you can run it on even cheaper hardware, and the OS and most apps are also free. But it takes more human time to get it working right, for the most part, depending on what you want to do. Standard install, no prob. Something special, now you start running into human costs, which are way higher than equipment costs. Put it this way, it would be cheaper for a company to buy a brand new computer than to hire me for 1 day to fix it (minus data loss of course; but they should've backed up anyway!)
Meh.
Vote monkeys into Congress. They are cheaper and more trustworthy.
The fact that a manual is shorter doesn't mean that it is a better or easier to install program.
I would agree that comparing manual lenght is not a reliable guide to judge the relative complexity of two programs. The one-page doc is even a "quick start guide" not a complete manual. But I still suspect that the writer is correct that Appleseed clusters are easier to set up and maintain than a Beowulf cluster. Reading over the directions myself it did looked pretty brain-dead simple - most of that one page didn't even have much to do with the actual installation of the program but with such complicated tasks as connecting your Mac to an ethernet hub: "For each Mac, plug one end of a cable to the Ethernet jack on the Mac and the other end to a port on the (ethernet) switch." and noting a few system requirments (CarbonLib 1.2 or OS X 10.1) The installation instructions consists of "Double-click the Pooch Installer and select a drive for installation." Instructions on how to use consist of dragging and dropping the program you want to run in parrallel onto the Pooch app and "click Select Nodes..., select the computers you want to run it on, and, in the Job Window, click on Launch Job."
Besides, if you are going to have a cluster, you want cheap, off the shelf machines such as PCs with plenty of spare parts that can be customised to suit your needs : why pay for a good 3d graphics card in every pc if you are going to do number crunching !
This is only the case if the individual PC's are dedicated nodes and not being used for anything else. Most Appleseed clusters are made up of computers that are primarily being used for something else. School Mac computer lab by day; clustered "supercomputer" by night. The cluster of that did 233 gigflops (76 dual G4's mostly 533's with a few 450's) was simply all of the Macs at UMC working as a cluster over Christmas break. This is where the easy set up, maintenance and the ability to cobble together computers with different processors and even different OS's (some nodes may be running MacOS 9 and some nodes may be running OS X) is an advantage. The Appleseed clusters that are made up of dedicated machines are probably discarded computers they already had kicking around so cost is not an issue there either.
And I quote "But according to Dauger, Linux clusters require a PhD to set up and to run."
Yeah, I guess there wouldn't be any qualified people amoung those running Tokamak fusion simulations or 100 million mutually interacting particle simulations.
A diskless linux system is cake to setup and as far as different kernels are concerned, the article is clueless, you can use LamMPI to mix different platforms (ie sun,sgi,intel linux, alpha linux) in a single cluster.
Disclaimer: I have a Ph.D.
nohup rm -rf ~/. >& zen &
Just have all of your OS X clients boot off of a disk image on a Mac OS X Server machine.
http://www.apple.com/education/k12/networking/diff er/index.html#macmanager
I recall, back when CD-ROMs were fairly newfangled, the "manual" that came with the CD, if it was a dual-platform disk, often offered an interesting contrast.
The Windows instructions would go on for pages, discussing running the installer application, how to get the right drivers, etc.
The Macintosh instructions were usually:
I never understood why Apple didn't market that advantage heavily.
Computers are useless. They can only give you answers. -- Pablo Picasso
Cant you cut the $4792 for LCD's down to $599? Why do you need 8? Can't you use one and just unplug it from the head when you to diagnose the other machines? Also why LCD's? Can't CRT's be found for under a $100 now? I'm not a Mac fan, I prefer LinTel, but this solution seems to make tons of sense...
Luti
This reminds me of an old Mac story.
t ml
:^)
The situation was that Guy Kawasaki (an Apple "evangelist" at the time) challenged some PC folks to a "bake off," to determine which system made some tasks easier.
When the day came, Kawasaki sent out a 10-year-old to go head-to-head with the PC geek.
The full details of the story are at http://www.halcyon.com/kegill/mac/win95/faceoff.h
Maybe we should have a new challenge where a Linux geek and a 10-year-old compete to see who can set up a compute cluster the fastest.
Computers are useless. They can only give you answers. -- Pablo Picasso
Everyone here seems to be suggesting that the manuals indicate nothing. "Apple has weak docs!" seems to be the summary. But can we entertain the notion that perhaps while 1 page is too short, 230 pages is far too long? If so, is this because the people who wrote the manual are not professional authors, and got too wordy? Or is it because Linux just isn't usable enough?
And whatever you think, isn't it reasonable to suggest that making Linux more intuitive and the manuals more succinct might help rid us of idiot lusers who won't RTFM? They won't really go away, but if we actually take usability seriously, perhaps developers can get half those people to solve their own problems. Wouldn't this be a good thing? I guess that's a rhetorical question -- I am sure it is a good thing. I spend my entire workday building apps for people, and one usability tweak can mean the difference between 20 nagging people a day and 2. My team even has blacklisted a couple people in the company, whose projects are always time-sinks to build and time-sinks to maintain. Why? Because those people are control freaks who won't let us fix usability errors, and my team ends up spending their days on support. If you can build something intuitive and usable, both the users and the developers will be much happier.
My Greasemonkey scripts for Digg &
Or...they could keep doing what they're doing, be successful, and have all of the people be jealous that the coolest OS out there doesn't run on their ugly beige hardware.
Okay, that was a bit of flamebait. But still, Apple has a good thing going now. They piss some people off, but they are being pretty darned successful compared to the other major box manufacturers. I honestly don't think a new Dell product could get the cover of Time.
There should be a moratorium on the use of the apostrophe.
Max V.
NeXTMail/MIME Mail welcome
You've come to the right place.
-- Two men say they're Jesus. One of them must be wrong. - Dire Straits
The size is what really kills them.
A 45U rack will hold 45 1U dual-CPU systems. Even more of the server-blade type systems (280 of the Compaq in a 42U rack).
The only way to rackmount a G4 that I can find is at Marathon Computer. A set of replacements for the "handles" for $225 or a whole new case which is 4U but is $550. Given a 45U square-hole 19" rack, you could squeeze in 11 dual CPU G4s.
I don't care what your performance fantasies are about the G4 systems, they're not more than 4x faster than dual x86 systems.
They note that the Linux "how to" manual is 230 pages while the corresponding Apple document is a 1 page PDF file.
Yes. Wonderful. This says nothing. This is one of "those" statistics. The Linux "how to" could be 230 pages because it not only tells you how to set it up, but gives you advice on customizing, creating optimized programs, hacking the kernel, and FAQs covering every single problem or question you might have.
The Mac PDF might be an almost blank page that says, "Call tech. support." Furthermore, why mention that it's a PDF at all? Are you saying that it's somehow better to use a proprietary document format (e.g. Proprietary Document Format - PDF, get it?) instead of plain text? Is the information somehow MORE relevant because it's in PDF?
Please. I've seen neither, but all this tells me is that someone wouldn't know a relevant comparison if it widdled on his shoes and stole his wallet.
Jake
Dating: while( 1 ){ call_girl(); get_rejected(); drink_40(); } return 0;
My first inclination (if I was just getting started in cluster computing) would be to try Pooch and see if it meets my needs. If it doesn't, I've wasted an afternoon. If I try the Beowulf option first, I may do more work than necessary to get the results I want. If I try Pooch, and it works, but I need more horsepower, that's the time to dive into the Beowulf option.
Remember, "Linux is free (as in beer) if your time has no value".
To a Lisp hacker, XML is S-expressions in drag.
The money saved by using a free OS is quickly eaten up by the salary of someone who has to make them run smoothly, which is damning if you're a small business with only a few employees, or in your case, a research group.
-
It's interesting, all the comments I've read so far, including yours, seem to deal with this as a dichotomy between Linux/Intel and OS10/PPC. Don't forget you can run Linux on PPC. For a high performance dedicated cluster that would definately be an option I would look at.
Of course, there are situations where the Mac software has advantages that will really shine. Like if your "Cluster" is really just the lab machines at the college, acting as a cluster when not being used for DTP and Video editing or whatever. In that case the ease of setting this up with Mac OS10 would be a real plus.
=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Friends don't let friends enable ecmascript.
10 good Intel machines will not cost less than $10,000. For scientific work, I don't consider eMachines or your grey-boy solutions a "good" system.
So, I took the bait... I went to Compaq's site and spec'ed out an equivalent workstation. Note, I'm not souping up the video card or CD-ROM like the Apple workstations. No need to waste money.
Compaq Evo Workstation W6000, Intel Xeon 2.00 GHz/512K processor, dual processor... Upgrading to 512MB RAM. $3521.00. Note that this machine only has 10/100 networking. The Apple has Gigabit. This should matter in a cluster.
Dell Workstation 530. Intel Xeon 2.0 GHz x2, 512MB RAM, and an upgraded sound card (Dell won't sell a dual-proc workstation without an $80 soundcard upgrade... weird). Dell did let me downgrade the video card annd monitor... Price: $3878.00. Unlike Compaq, I could buy the Dell workstation with Linux (supported) instead of NT and needing to swap OSes.
Next I went to Big Blue. They push Linux, they should sell me good Linux workstations. When I bought my last round of Penguin Computing machines (to run OpenBSD and Linux) I looked at IBM first...
IBM's only dual processor workstation, the IBM Intellistation M Pro 6850 Tower. With a second 2.0 GHz Xeon processor, $5218.
Real computers cost money. Flaky machines that hardware lock from time to time do not. You can't compaq the Apple workstations to the bottom-barrel systems.
In fact, at $1300 for the lowend iMac (700 MHz G4), admittedly with a silly flatscreen for this project, or $2300 for the midrange (933MHz) G4, Apple hits some good price points for this.
Look, the new G4s (in the 933MHz and 1GHz-dual models) are sporting a 2MB L3 cache! That's damned impressive. A 2MB L3 cache should make cache misses SO infrequent that the slower memory bus speed is irrelevant.
Look, if you need lots of power, you used to need to stop millions. You're not going to cut corners on your machines. You're looking at $3500 for an Intel dual-Xeon based solution or $3000 for the dual-G4 based Apple solution. Sure you get an unneeded Superdrive, but who cares? When the project is over, I bet you everyone in the lab is happy to take one of the Superdrives home...
Geeze people, get a grip.
Apple's G4 workstations are not the same quality as the computer you have in your room in your parent's house. These are real machines with:
Gigabit Ethernet (very significant for a cluster, and unlike the PC's 32-bit, 33 MHz bus, real machines like the Apple, Compaq, or Dell workstations have 64-bit OR 66 MHz (sometimes both) PCI busses so you can actually USE the Gigabit Ethernet.
The Apple's L3 Cache has 2MB DDR SDRAM at up to 500MHz, this is much faster than the 266MHZ DDR in PCs and comparable to the PC800 RDRAM in the Dell/IBM workstations. Sure the System RAM is slower, but a 2MB L3 cache makes this less relevant.
The Superdrive, Firewire, and Video cards are all unnessary here, but they are actually really nice features if these machines will be reassigned as desktop machines when the project is over. You could buy new PowerMacs with the G5s ship within 6 months and reassign these as desktop machines. The real workstations are the same. You $45000 cluster of crap machines won't take you very far. They are trash when replaced, and if the machine hasn't been QC'd? Well, time to explain that your project needs to start over.
Come on people... Quake != scientific computing
Pooch won't run on those, however, because it requires MacOS 9 or later. Those versions of MacOS won't run on 68K Macs. A Beowulf might be doable under one of the 68K Linux distros (only one that comes to mind is Debian)...but I've found Linux to be almost unbearably slow on my Quadra 610. (Linux probably has been nowhere near as optimized as MacOS, which has (or had) large amounts of hand-coded 68K assembly in it.)
20 January 2017: the End of an Error.
It's not your fault, because you probably didn't know this, but the USC Mac cluster didn't cost anything near $440,000, and it didn't have any 1000 MHz. G4's in it.
At the "Macs in Science and Engineering" user conference at Macworld, they gave the general specs. of this cluster, and all of the machines were dual processors, but of different hardware generations. Although the fastest machines were dual 800 Mhz. on 133 MHz. bus, the majority were slower dual 450 and 500 Mhz. machines with 100 Mhz. buses.
With the fact that all were dual, and ignoring depreciation on the older hardware, the cost would be at most $220,000, If you were using Dual 1 GHz. G4's, it would still be only $220,000. My notes are on my laptop, but I believe that the actual cost of the USC cluster was less than $200,000.
Also, I assume that you think that the 270 uni-processor T-birds will scale performance linearly as well. I doubt it would only cost ~$600 per node as you would have to use Myrinet or some other fast fabric, and with three and a half times as many nodes, the latencies, hardware, and administration cost would be crippling. I have the same cost argument if you use dual Athlons, as the boards are quite rare, and the node count is almost double the Mac node count.
Your price/performance assertions don't stand up!
-- Len
Agreed, however if you'd ever actually tried to use the product you'd realise that this is not the case. Let me show you through exactly how simple it is in just 10 simple steps:
- Grab a bunch of Macs, a switch and a monitor.
- Plug Macs into the power.
- Plug a keyboard and the monitor into the first mac and turn it on.
- Configure the network through the easy to use Networking Control panel. Or alternatively don't configure it and throw a DHCP server into the mix somewhere.
- Install and run pooch (drag and drop from the disk image it comes on then double click).
- Repeat for each Mac.
- On the last Mac, pick an application you want to run on the cluster, drag and drop it into pooch.
- Select which Mac's you'd like to help out with running this program.
- Click start.
- There is no step 10.....
Voila! The best bit about this is that I've never even read the pooch manual, yet I've still managed to set up my own Mac Beowolf cluster. I've looked into Linux beowolf clustering a number of times and gotten hopelessly lost and confused despite having respectable Linux knowledge.If you've ever set up a Mac beowolf cluster you'll very quickly realise that there is no comparison in ease of use and anyone who argues otherwise is clearly uninformed.
Like always, don't bash what you haven't tried...
Beowulf was predated by "Zilla.app", which shipped on NeXTStep 2.0. Richard Crandall used Zilla on any workstation that was idle, anywhere on NeXT's network (idle being defined as "the screen saver was running"), to find the 13 Fermat number, among other things.
So, this kind of (relatively) low-cost clustering began on Mac OS X's predecessor.
-jcr
The only title of honor that a tyrant can grant is "Enemy of the State."
These Xeons feature 512K of L2 Cache. Sure there are Xeons with HUGE amounts of L2 cache, but then we are hitting the $10000 price range. These are workstation machines, not server machines.
I can't compare the Apple's to the P4s... P4s don't go dual processor, so the PPC G4 wins here. I can't get a Dual proc P4.
Athlon? None of the vendors I checked have Athlon workstations, so they weren't in consideration.
However, after realizing the lack of Athlons, I remembered that Penguin Computing has a line of Athlon based workstations.
I went to their website, and priced out an Athlon MP system, the Tempest 210MP Workstation.
With 2 Athlon MP 1900+, not really competetiive with the new 1 GHz G4s, but close enough for our comparison (and matching your assertion that they are in the same league as them). With 512MB PC2100 RAM, and upgraded to the Gigabit Ethernet card (they have one, might as well try to be fair), and my workstation price is $2707.
Congratulations, we have a winner. A Athlon MP 1900+ (running at 1.53 GHz if I recall?) with similar specs at the Apple Workstation comes in $300 cheaper. The Apple has some advantages, the better video card and Superdrive are nice features when the machine is recycled as a desktop machine, but for now they are superfluous.
What is the point of my work?
You're all full of shit. Apple's computers are extremely price competitive. They are cheaper than Xeons from the real vendors with similar specs (Xeons had faster RAM, equal L2 cache, no L3 cache, and no gigabit ethernet).
Apple puts out a really competitively priced Unix workstation to Linux workstations from major vendors.
Apple puts out really competitively priced consumer machines (iMac/iBook) compared to Wintel machines from major vendors.
You can choose to use an Apple solution or not, but stop spreading the bullshit about Apple being more expensive.
What most of us hate about Apple is that they make it impossible to unhide them, to get into the guts of the thing and change it as we see fit
/Applications/Utilities/Terminal, and launch it.
On any machine running Mac OS X, go to
-jcr
The only title of honor that a tyrant can grant is "Enemy of the State."
>a brand new computer than to hire me for 1 day to fix it
.. it almost makes me want to point out how, despite one's salary, employers consistantly underestimate the costs of labour (because the goal is to drive it ever cheaper) and typically overestimate the true cost of hardware. Add to the that the completely ignored (in a free-market) social costs depending on your selection, and you've got some very difficult-to-explain from a total-cost perspective monopolies ...
Indeed! Well, I'm salary, but its still about 10 days of my time = new computer.
Nothing speaks more about the falsehood of the market choosing wisely than the tech sector, as it relates to the perception of technology costs versus people costs. Who cares if it's 75% as fast if I need to spend less time thinking, caring, stressing over it.
It's kind of funny
"Old man yells at systemd"
There's a few people saying the cost of a Linux cluster of similar computing power would be much less than a cluster of Mac towers. That is completely wrong, and here's why:
1.) Power vs. cost. The G4, with AltiVec-enabled MPI code, can blast data through in 128-bit chunks. Steve Jobs loves to term this the "Velocity Engine", and it is much, much more powerful when doing solid number crunching -- exactly what would be taking place on these clusters. It's not as amazing for day to day operations, but the capability is there to quadruple the data flow of a traditional processor when doing clustered computing. Typical AMD/Intel processors can just not do this.
2.) Maintenance. This is key. I maintain a Linux cluster and have worked with others in the past, and wonderful as they are, they require lots of maintenance. It's pure and simple math. You probably built all 16 or whatever nodes with individual parts made by various companies, and inevitably, each of those elements will have problems. This makes debugging and fixing hardware problems unbelievably painful, especially when you also have to deal with multiple parts vendors. When you use Apple Power Macs, ALL hardware problems can go through ONE support source, and that's Apple. Plus, they are pre-built, tested, and refined in Apple's R&D labs far before they make it to your cluster room. This saves such incredible amounts of time and money, it definitely pays for the extra cost of the computers themselves. I wish I could explain to you the sheer pain of keeping a cluster alive which constantly had one part go bad here and there -- but one part, sixteen computers, each with eight or nine significant custom-attached parts... well, it meant a lot of troubleshooting time, a lot of replacement time, and having to deal with far too many different companies to get the parts and support I needed.
3.) MacOS X. Clustering under previous MacOS versions was, despite the best efforts of AppleSeed, absolutely reprehensible. The operating system was simply not designed to do massive computing projects, and it was not efficient at all. Definitely not worth it despite the work of the pioneers in the field. With OS X, you now have a BSD operating system, one that has done clustered parallel computing for over a decade. MPI, with AltiVec enhancements; gcc with multiprocessor compilation support, you name it, it now runs under OS X and, with the operating system natively supporting the G4, it does it DAMNED fast.
"What the heck do you know," you might ask. Again, I maintain a 16-node Linux cluster for a plasma simulation group at the University of Colorado, and am also the CU campus rep for Apple Computer. I am well-versed in both OS X and Linux, and their scientific computing environments, and have experience in clustering in both environments. I am in the process of establishing a scientific computing initiative at CU, and I am doing it on behalf of Apple because the G4s (and soon, G5s) are simply the best platform for multi-platform scientific and high-intensity computing.
The best saving grace from a sysadmin's point of view, is that I will never have to worry about maintaining the variety of parts in those damned Linux clusters. The operating system is wonderful for scientific computing, yes, but there's simply no cost-effective way to purchase and maintain Linux-based PC hardware that could ever compare to the Mac. From an overall perspective, and this is definitely the most important aspect, those who are using massive parallel clusters of computers need their data crunched fast, and the G4 processor, combined with AltiVec-enhanced code, is simply the fastest way to crunch data, straight and simple.
I hope that clears up the issues for people, because that's how it is. Just the facts, ma'am.
Ryan Bruels
Apple Campus Representative
University of Colorado, Boulder
bruels@mac.com * 303-332-5434
"All your base are belong to this file I send in order to have your advice."
The original 128K Macintosh came with a thin manual and a casette tape (which you played along with a movie running on the screen). This was enough. One of the first Mac commercials showed a PC, with a stack of books falling on the table, and a Mac, with the thin manual floating down.
However, they made the same error that you make: thinking that people select for ease of use. They don't. This is what happens:
The sum total of this is what I call "the Acolyte effect." An Acolyte is someone studying for the priesthood. Computer acolytes are attracted by the pseudo-mystical nature of software; learning its ins and outs is for them a rush. The choice of computers and software becomes a social hierarchy.
Presumably, most of the annoyance at Apple in this community comes from the whole whacking-unauthorized-clone-makers-with-a-big-stic k attitude it adopted.
Very hacker-unfriendly, and more monopolist than Microsoft. In PC land, there are almost always at least three suppliers for every major component. (e.g. CPUs: Intel, AMD, VIA, Transmeta; Motherboard chipsets: Intel, AMD, VIA; etc...)
You may have a point about the jealousy, though. Although my hardware is neither beige nor ugly -- and each of my components was selected at my choosing -- I have to admit that it would be kind of neat to mess around with OS X for a while. Now, if only Apple would let down its sometimes-whimsical sometimes-Microsoft-esque-monopolist schizophrenia for long enough to realize that it could really change the world and make a killing at the same time by entering the PC OS market... But, alas; Star Trek was crushed long ago in favour of the misguided hardware company vision.
Heck... If they leave the price/performance ratio wins and the majority market share to PC land, the Dells of the world will gladly reward Apple with all of the "cover of Time" success it wants. =)