Cray CTO: Linux clusters don't play in HPC

Posted by CmdrTaco on Tuesday April 13, 2004 @03:42AM from the you'd-think-he-should-know dept.

jagger writes "Linux clustering was touted as the next big thing by many vendors last week at ClusterWorld Conference & Expo 2004. But supercomputer vendor Cray Inc. scoffed at the notion of putting Linux clusters in the high-performance computing (HPC) category. "Despite assertions made by Linux vendors, a Linux cluster is not a high performance computer," said Dr. Paul Terry, CTO of Cray Canada."

15 of 435 comments

Marketing by Allen+Zadr · 2004-04-13 03:42 · Score: 5, Insightful

Paul Terry makes some good points in his statements, including the partial quote from the post: "Despite assertions made by Linux vendors, a Linux cluster is not a high performance computer," said Dr. Paul Terry, CTO of Cray Canada. "At best, clusters are a loose collection of unmanaged, individual, microprocessor-based computers."

Remember to take this with a grain of salt. The inflammatory nature of the comment is nothing more than a marketing ploy to increase visibility of, and sell, the new Cray XD1.

--
Kinetic stupidity has a new brand leader: Allen Zadr.
1. Re:Marketing by Total_Wimp · 2004-04-13 03:54 · Score: 5, Funny
  
  "At best, clusters are a loose collection of unmanaged, individual, microprocessor-based computers."
  
  I'm sure Paul Terry is nothing more than a loose collection of unmanaged, individual human cells too. But I'm sure, with hard work and love, he can become a _real_ boy! Let's all have a hug.
Seymour Cray by JargonScott · 2004-04-13 03:43 · Score: 5, Funny

A quote I've seen before:

"If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?"

Maybe he meant penguins?

--
Nuke Gay Whales for Jesus.
1. Re:Seymour Cray by Waffle+Iron · 2004-04-13 04:38 · Score: 5, Funny
  
  "If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?"
  When Seymour Cray made that statement, he was probably pointing out the difference between his he-man vector processors vs. clusters of the wimpy microprocessors of old.
  After reading the article, it seems that this new Cray is powered by a bunch of the exact same AMD microprocessors that a cluster of Linux boxes would use. So what they have now is more like an ox-shaped sack stuffed with chickens.
2. Re:Seymour Cray by epiphani · 2004-04-13 05:05 · Score: 5, Funny
  
  Wow. I've never seen someone fail so miserably when trying to start a flamewar over what kind of tractor is better. Man, I thought they woulda been all over that here on Slashdot.
  
  --
  .
And in other news... by heironymouscoward · 2004-04-13 03:45 · Score: 5, Insightful

Oracle dismisses MySQL and PostgreSQL as "toy databases", Microsoft claims that "Apache cannot be used for real web serving", and Sun announces that "Intel and Linux simply cannot be used for enterprise computing".

So all those supercomputing labs that use Linux clustering (that invented Linux clustering, even) have been wasting their time?

--
Ceci n'est pas une signature
Are too by Anonymous Coward · 2004-04-13 03:45 · Score: 5, Interesting

"Most cluster [experts] know now that users are fortunate to get more than 8% of the peak performance in sustained performance."
Tell that to PIXAR. I don't believe it either.
I guess that the simple problem is just that the algorithm applied is usually not suitable for massively parallel computing.
1. Re:Are too by dead+sun · 2004-04-13 04:06 · Score: 5, Insightful
  
  Pixar doesn't need telling; their problem breaks up so miraculously well that they'll see the best performance you could possibly expect from a cluster. The big problem, rendering a movie, decomposes into thousands of small problems, rendering a frame. Each machine in their cluster can handle a group of frames at a time with zero need to communicate or, worse, share computation with other machines in the cluster. It's the best-case scenario.
  Many other computing problems don't decompose nearly so nicely. So there are certainly problems that probably won't see more than 8% of peak performance. If you were particularly inclined you could probably invent a problem that had to be done entirely serially, leaving the percentage of peak performance equal to the fraction of your cluster that one box represents. Cray is right to that extent, and if you're solving a problem that falls into the category of not easily parallelized then perhaps one of their machines is the better tool for the job. But, like you mention, there are instances where the cluster is a great tool and cost effective to boot.
  Heck, ever check out some of the faster interconnects like Myrinet? They're insane and exist because fast ethernet just doesn't cut it in some places. Just using a slow interconnect is enough to bring real performance down below theoretical peak. Luckily for Pixar off the shelf fast or gigabit ethernet is likely enough.
  Anyway, use the best tool available. If your problem falls into the category of trivially parallelizable like rendering a movie is then don't bother wasting your money on a Cray. If your problem isn't suited to a cluster, however, then maybe a cluster isn't the right answer. If you have a big problem that needs serious computation take the time to figure out what you need before taking a marketing drone's spiel for gospel in your situation.
  
  --
  If not now, when?
Problem by rawgod0122 · 2004-04-13 03:51 · Score: 5, Interesting

It all depends on the problem you are trying to solve. I have been doing some work of late that would not complete in my lifetime on the 108 node cluster that we have. But when programmed for and run on two Cray X1s it should complete inside of a week.

Granted, there are many codes (and more every day) that will run on clusters, but the big iron will never die.
Just because we love Linux.... by foooo · 2004-04-13 03:52 · Score: 5, Insightful

Just because we love Linux doesn't mean that clusters are HPCs.

There are real issues that differentiate mainframes/supercomputers from large, powerful clusters.

Of course this all depends on your definition of an HPC. But I believe that it's reasonable to say that if parts of your computer are connected with low bandwidth connections (10/100, gigabit) they just can't handle the same kinds of transactions that a computer with parts connected by 10 gigabit or 1000 gigabit connections (or whatever it is nowadays) can.

As far as I know, if you're deploying a large database it's still advisable to have a big huge IBM mainframe or a Unisys box or a Sun 10k instead of 4, 8, or 16 clustered 8-proc machines.
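
To put rough numbers on that bandwidth gap, here is a minimal sketch that counts raw link speed only. The 1 GB payload is an arbitrary assumption, and latency and protocol overhead are ignored, so real transfers would be slower than this.

```python
# Idealized transfer time: payload bits divided by link rate.
# The payload size is an assumption; latency and overhead are ignored.
LINKS_BITS_PER_SEC = {
    "10 Mbit": 10e6,
    "100 Mbit": 100e6,
    "gigabit": 1e9,
    "10 gigabit": 10e9,
}


def transfer_seconds(payload_bytes, bits_per_sec):
    """Seconds needed to push payload_bytes over a link of the given rate."""
    return payload_bytes * 8 / bits_per_sec


PAYLOAD_BYTES = 1e9  # hypothetical 1 GB exchanged between nodes

for name, rate in LINKS_BITS_PER_SEC.items():
    print(f"{name:>10}: {transfer_seconds(PAYLOAD_BYTES, rate):8.1f} s")
```

Each step up in link speed cuts the idealized time by the same factor, which is why a workload that shuffles data between nodes constantly feels the interconnect long before it feels the processors.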

My point is there are valid arguments for not including clusters of commodity hardware in the HPC category.

In my mind they aren't High Performance Computers... they are High Performance Clusters of Commodity Computers.

~foooo
Can you multithread your application? by LostCluster · 2004-04-13 03:55 · Score: 5, Insightful

Clusters can rival a supercomputer when they are assigned a task that's suitable for distributed computing. That is, work units can be divided up and worked on in any sequence... the result of segment 45 doesn't depend on knowing the result of 44 and such. Effectively, you can have the sum of all of the processors minus just a little overhead for the clustering.

What Cray's rightfully pointing out is that for most business applications, however, distributed computing is not a viable option. When processing on a transaction basis, the transactions often need to be posted in the exact order they were received, which means they must be taken serially. In those situations, the programs can't multithread work out to the other processors so well, and the cluster will end up running at roughly the speed of just one processor while the others waste clock cycles waiting for something to do.
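
The trade-off described here is usually summarized by Amdahl's law, which the comment itself does not name. A minimal sketch follows; the 64-node cluster and the parallel fractions are illustrative assumptions, not figures from the article.

```python
def amdahl_speedup(parallel_fraction, processors):
    """Ideal speedup when only parallel_fraction of the work can be spread out."""
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / processors)


NODES = 64  # hypothetical cluster size

for p in (0.0, 0.5, 0.95, 1.0):
    s = amdahl_speedup(p, NODES)
    print(f"parallel fraction {p:.2f}: speedup {s:6.2f}x, "
          f"{100 * s / NODES:5.1f}% of peak")
```

With a parallel fraction of zero (strictly ordered transactions) the speedup is 1, so the cluster runs at the speed of one processor. Only when nearly all of the work is independent does the speedup approach the node count.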

The cluster isn't the solution to everything. Nor is the supercomputer. You've gotta think about the job, then figure out which tool is right for the task.
Well... he is sort of correct... by nacks1 · 2004-04-13 04:05 · Score: 5, Interesting

I happen to work in a facility that has had both large supercomputers (cray t3e, j90, sgi) and linux and *nix based clusters (beowulf/linux, compaq/Tru64). The Cray CTO is correct that you can't just call every linux cluster out there HPC. Just about anyone with networking and linux knowledge can build a linux cluster.

What really makes a difference between an HPC cluster and your normal every day cluster is the hardware interconnects used. There is a comment in the article that refers to not using I/O for memory and message passing. I am not quite sure what he means by that, but I am guessing that he is saying that the network is not used for shared memory/message passing (MPI/OpenMP/SHMEM).

If a cluster can limit the impact of latency between nodes either through smarter software or faster interconnects then I can't see any reason not to consider a linux cluster as HPC.

Clusters without smarter software tend to be really difficult coding platforms. Some developments with things like globally shared memory might make the difference, but there will still be the problem of latency between nodes.
Re:Well.. by s00p41337h4x0r · 2004-04-13 04:16 · Score: 5, Insightful

How could Cray be wrong. I mean just because linux is running some of the top 500 computers there is no reason to consider HPC right. What a self serving statement Cray makes....they still don't get it .... their way is a dead-end...
That's right. Dataflow vector processing has been shown to be a dead end. The fact that the fastest computer in the world is a dataflow machine is a statistical anomaly, right?
Oh, here's the TOP500 list, btw.
Re:Help me here... by krlynch · 2004-04-13 04:17 · Score: 5, Interesting

So depending on the task at hand, the cluster might perform very well, or perhaps a little less well.

Surely what you meant to say is that, depending on the task at hand, a cluster might perform very well, or perhaps perform atrociously. :-)

Clusters tend to work well when the various nodes don't need to communicate very often but you need lots of cycles for the subtasks, while dedicated supercomputers tend to perform very well in tasks requiring vast amounts of internode communications bandwidth along with large numbers of cycles. If you need vast bandwidth and relatively low numbers of cycles, your pricepoint is likely a mainframe. And if you don't need either, you get a cheap desktop machine.

Certain problems parallelize well on a cluster ... others don't. Some don't parallelize at all, and a cluster won't do you a darn bit of good. The different machines are designed for different uses ... and one should be careful not to push a "one size fits all" solution. The Cray guy clearly got it wrong on that point, and likely knows it, but he was marketing, not teaching a course in choosing hardware for the task at hand.
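
The four cases in that paragraph can be written down as a tiny lookup. This is only a sketch of the rule of thumb as stated; the function name and the yes/no framing of "needs bandwidth" and "needs cycles" are assumptions made for illustration, and real sizing decisions are not this binary.

```python
def suggested_machine(needs_bandwidth, needs_cycles):
    """Map the two needs from the paragraph onto its four machine classes."""
    if needs_bandwidth and needs_cycles:
        return "dedicated supercomputer"
    if needs_bandwidth:
        return "mainframe"
    if needs_cycles:
        return "cluster"
    return "cheap desktop"


for bandwidth in (True, False):
    for cycles in (True, False):
        print(f"bandwidth={bandwidth!s:5} cycles={cycles!s:5} -> "
              f"{suggested_machine(bandwidth, cycles)}")
```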
Re:If it walks like a duck, and talks like a duck. by flaming-opus · 2004-04-13 04:34 · Score: 5, Informative

Cray could easily be at or close to the top of the top500 list; their X1 architecture will extend that far. However, for a lot of really important supercomputing codes, it's no contest: the Cray will trounce the clusters (linux or otherwise). Those #19 Crays are only 256 processors. To get similar performance a stack of xeons requires thousands of processors. Some tasks just can't be split apart that easily.

A Cray processor has eight floating-point units running at 800 MHz. The Big Mac cluster (for example) uses G5 processors which have 2 FPUs at 2000 MHz. Thus the Cray has a ~60% advantage in raw FPU throughput (8 x 800 vs. 2 x 2000). However, the G5 processor has ~4GB/s memory bandwidth. The Cray has ~50GB/s memory bandwidth. If you have a problem that needs to do a HUGE amount of math on a tiny amount of data, the G5 will rock. If you have a problem that needs to do a HUGE amount of math on a GINORMOUS amount of data, buy the Cray. (for a GINORMOUS amount of money too)

Similarly, InfiniBand (a la the Big Mac) is really hot in the cluster interconnect space because it gives 2.5GB/s per node. The Cray gives you 51GB/s.
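
Taking the figures in the paragraph above at face value, here is a minimal sketch of the math-versus-memory balance. Treating each FPU as retiring one result per clock is a simplifying assumption, not a vendor specification, and the roofline-style `min` is a standard approximation rather than anything the comment describes.

```python
# Figures quoted in the comment; one result per FPU per clock is an assumption.
MACHINES = {
    "Cray": {"fpus": 8, "clock_hz": 800e6, "mem_bytes_per_sec": 50e9},
    "G5": {"fpus": 2, "clock_hz": 2000e6, "mem_bytes_per_sec": 4e9},
}


def peak_ops_per_sec(m):
    """Peak floating-point results per second under the one-per-clock assumption."""
    return m["fpus"] * m["clock_hz"]


def bytes_per_op(m):
    """Memory bytes available per floating-point result at peak rate."""
    return m["mem_bytes_per_sec"] / peak_ops_per_sec(m)


def attainable_ops_per_sec(m, ops_per_byte):
    """Simple roofline: limited by either the FPUs or the memory system."""
    return min(peak_ops_per_sec(m), m["mem_bytes_per_sec"] * ops_per_byte)


for name, m in MACHINES.items():
    streaming = attainable_ops_per_sec(m, 0.125)  # one op per 8 bytes read
    dense = attainable_ops_per_sec(m, 10.0)       # ten ops per byte read
    print(f"{name:>4}: {bytes_per_op(m):6.2f} bytes/op, "
          f"streaming {streaming:.2e} ops/s, dense {dense:.2e} ops/s")
```

When the work per byte is high, both machines run at their FPU peak and the gap is modest. When the code streams through data, the memory system sets the limit and the gap widens to roughly the ratio of the two bandwidths.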
You need to move a little data, buy a cluster. You need to move a lot of data, buy the Cray.

There's no one solution for all problems.