Windows Compute Cluster Server 2003 Released
grammar fascist writes "According to an Information Week article, on Friday Microsoft released Windows Compute Cluster Server 2003." From the article: "The software is Microsoft's first to run parallel HPC applications aimed at users working on complex computations... 'High-performance computing technology holds great potential for expanding opportunities... but until now it has been too expensive and too difficult for many people to use effectively,' said Bob Muglia, senior vice president of [Microsoft's] Server and Tools Business unit, in a statement."
Hasn't running "parallel HPC applications aimed at users working on complex computations" traditionally been done under Unix, and Linux as well. Seeing how Linux is free it's hard to see how "it has been too expensive", or "too difficult" (since unlike your home user the people running these systems are rocket scientists, I am sure a little command line use doesn't stump them).
Philosophy.
Considering I work at one of the companies listed in the article MS is working with, and all our HPC clustering is working with linux, and will be for the coming future, I think this is primarily just a PR attempt. I don't even think we're remotely considered running anything on windows for our tasks.
I did manage to flummox the professor with the following question (I'm being serious here!):
Why does a 256 or 512-node computational cluster need a copy of Windows Media Player on each machine?
It was then that I realized that MS's approach to operating systems targeted to different applications is not to strip anything out, but to add layer upon layer of extra functionality to their basic "home computing" OS.
Yes, friends, their Windows 2000 computational cluster software does indeed ship with Media Player.
I can't speak of SQL Server or other variants: does anyone have a clue?
"Microsoft Message Passing Interface (MS-MPI) implementation is fully compatible with the reference MPICH2"
I guess given the fact that Microsoft is pathetically behind Linux when it comes to high performance computing, they may actually play by the rules here.
Anyone has an insight on this one? Do they have a API lock-in strategy here as well?
Microsoft's MPI implementation is, if I understand their materials correctly, based on MPICH (a BSD-licensed Open Source product) with some in-house fine tuning. MPICH is a good reference implementation but is not terribly fast and is getting to be long in the tooth. Far as I know, it doesn't have much in the way of fault tolerance in it, either. LAMPI and OpenMPI are built for speed (although I've found OpenMPI has room for substantial improvement) and have some fault tolerance support. So, they don't seem to be using an amazing architecture.
Last, but by no means least, Microsoft's freebies were limited to an Opteron-specific Windows 2003 Cluster Edition beta and a cookie. By comparison, many others had booklets on what their products did, papers on the theoretical work being done, working demos (the molecular modeler with forced feedback was amazing) and some highly knowledgeable geeks to answer detailed technical questions.
Microsoft may - someday - be an interesting player in the cluster market. Right now, though, they really don't seem to get what it is all about. I'm not trying to bash Microsoft here, they really don't have a product that is useful for the high-performance market, and seem to have the wrong libraries and interfaces for using the servers in a load-balancing, fail-over or distributed storage environment. This isn't to say the other vendors were perfect - I saw many areas that were horribly inefficient and poorly implemented - but rather that Microsoft would have done better to have come back from the show and re-thought what it was that they wanted the Cluster Edition to do.
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Does it also need a hard disk in every machine? Imagine all the extra heat produced by 500 superfluous hard drives. All the clusters i've built have been diskless, and booted from the network (or from a floppy, i did produce such a ghetto-cluster from old hardware once as an example)
http://spamdecoy.net - free throwaway anonymous email - avoid spam!
Only sort of.
The front end web servers are IIS. The business logic is all Java, Solaris and Oracle.
Peter