Cray CTO Says Cray Computers Are Great

Theyyyyyy'rrrrrre Great! by ackthpt · 2004-08-20 03:07 · Score: 3, Funny

I thought that was Tony the Tiger.

I wonder how Cray computers are in milk...

--

A feeling of having made the same mistake before: Deja Foobar

Re:Theyyyyyy'rrrrrre Great! by Jerry+Smith · 2004-08-20 07:58 · Score: 1

When I was young, a Cray was synonymous for supercomputer. Whenever Asimov wrote about Multivac, a SF supercomputer, I couldn't help imagining a Cray. That was lightyears away, before I joined the IT. Now, married with children, having worked for Apple for years and actually seen dozens of XServes humming quietly and content all together in one room, covering walls, I still thought: nice, but not a Cray. I hope I'll never see one in real life, I'm quite sure I would be disappointed.

Happiness is a belt-fed automatic weapon

--
All those moments will be lost in time, like tears in rain. Time to die.
Re:Theyyyyyy'rrrrrre Great! by Hannes+Eriksson · 2004-08-20 13:12 · Score: 1

I don't think you would be disappointed. At least not if you would se one of the models with built-in sofa.

--
Geek rants since like... 2000 or something.

Imagine... by Rosco+P.+Coltrane · 2004-08-20 03:07 · Score: 5, Funny

no nevermind.

--
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash

Re:Imagine... by Nakkel · 2004-08-20 03:42 · Score: 1, Funny

Ok, I went and imagined a beowulf cluster of these. Now thats weird because I have absolutely no clue WTF a beowulf cluster is.
Re:Imagine... by scotch · 2004-08-20 04:22 · Score: 1

Imagine 15,239,425 beowulf clusters of beowulf cluster jokes!

--
XML causes global warming.
Re:Imagine... by menkhaura · 2004-08-20 05:26 · Score: 1

You imagined it first

--
Stupidity is an equal opportunity striker.
Fellow slashdotter Bill Dog
Re:Imagine... by Anonymous Coward · 2004-08-20 07:29 · Score: 1, Funny

A beowolf cluster is a group of Scandinavian warriors. Not just your everyday Scandinavian warriors, but super smart Scandinavian warriors. Working together, they can solve all sorts of problems very quickly.

The issues are progress and long-term usefulness by Space+cowboy · 2004-08-20 03:07 · Score: 5, Informative

Given the difference in rate-of-evolution in the two camps, it can't be long before PC clusters, probably running Linux / with PVM or BSP (that's bulk-synchronous parallel rather than 3D graphics :-) are perfectly capable of doing what supercomputers do today. Of course, there'll be new really-super computers then, but that's a different story :-)

It's all very well to mock the I/O of PCI, but that's why we're all imminently moving to PCI Express, at a rather more respectable (current) maximum of 8+GBps rather than 133Mbps... Run a few gigabit ethernets in a hypercube formation and you have some rapid data transfer...

I notice he hasn't quoted the data-transfer rate on these new super-duper chips. The whole article does rather look like a piece of advertising on the cheap, speaking of which, the cluster solution is (relatively) CHEAP. Did I mention that ITS CHEAP...

Simon.

--
Physicists get Hadrons!

NO WAY! by FortKnox · 2004-08-20 03:07 · Score: 5, Funny

The CTO from Cray said Crays are great machines and are priced competitively!

Next you'll tell me the CEO of SCO thinks the lawsuit is completely valid and fair!

--
Good quote, too many chars. Seriously, the slashdot 120 char limit sucks!

Re:NO WAY! by Rosco+P.+Coltrane · 2004-08-20 03:09 · Score: 2, Funny

Apparently, since Cray uses Linux in clusters now, I'm sure SCO thinks Cray machines would be even greater if they costed $699 more per node...

--
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash
Re:NO WAY! by gcaseye6677 · 2004-08-20 03:19 · Score: 5, Insightful

In other news, a Porsche performs better than a Ford Focus. Nevermind the 'slight' price difference.
Re:NO WAY! by Flashpot · 2004-08-20 04:00 · Score: 1

Nope, the porsche cannot carry 3 passengers to lunch.

--
That which does not kill her only prolongs my agony.
Re:NO WAY! by Megaslow · 2004-08-20 04:36 · Score: 1

Nope, the porsche cannot carry 3 passengers to lunch.

Sure it can. Looks like 4 actually...
Re:NO WAY! by Flashpot · 2004-08-20 05:03 · Score: 1

Not on MY income!

--
That which does not kill her only prolongs my agony.
Re:NO WAY! by dasmegabyte · 2004-08-20 07:31 · Score: 1

A porsche will get you and your girlfriend there faster, but two ford focuses will get your whole family there in less than the time it takes for you to drive back and pick them up.

--
Hey freaks: now you're ju

How about... by OxygenPenguin · 2004-08-20 03:09 · Score: 3, Funny

a Linux cluster of Cray's?

--
Read the only personal Runyon page out there.

Re:How about... by einhverfr · 2004-08-20 04:45 · Score: 1

You mean like Red Storm?

--

LedgerSMB: Open source Accounting/ERP
Re:How about... by dasmegabyte · 2004-08-20 07:34 · Score: 1

I could imagine that.

--
Hey freaks: now you're ju

Re:*Shock* by Nos. · 2004-08-20 03:09 · Score: 5, Insightful

The thing is makers of big supercomputers are scared of clustering technology. Look at google. A large cluster, and if one of the machines dies, you don't worry about it. Every once in a while you go and replace those that died. If only a small portion die, you haven't seriously impacted your production. However, if your supercomputer goes down... well, your screwed. 1000 machines are more reliable then 1 big machine.

Linux vs. linux by Anonymous Coward · 2004-08-20 03:10 · Score: 5, Funny

Is MS somehow involved? Who am I supposed to hate? Editors?

Re:Linux vs. linux by sharkey · 2004-08-20 05:05 · Score: 1

Who am I supposed to hate? Editors?
Yes. After all, they picked this color scheme.

--

--
"Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
Re:Linux vs. linux by bhima · 2004-08-20 05:45 · Score: 1

For the love of christ is there an easy way to turn this colour scheme thing off?
Or at least set it to anything else? Why can't we choose the colour ourselves?

--
Nothing in the world is more dangerous than sincere ignorance and conscientious stupidity.
Re:Linux vs. linux by sharkey · 2004-08-20 08:28 · Score: 1

Change "it.slashdot.org" to "shit.slashdot.org". The colors are based on which hostname you use, so anything goes. Change to "games.slashdot.org" to get the purple, for instance.

--

--
"Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
Re:Linux vs. linux by Geoff-with-a-G · 2004-08-20 13:38 · Score: 1

Is MS somehow involved? Who am I supposed to hate? Editors?

Normally that would seem the way to go, and lots of people around here even bitch about michael specifically, but damn if this isn't one of the best headlines I've seen on slashdot in weeks.

Put a big grin on my face.

Re:*Shock* by Anonymous Coward · 2004-08-20 03:10 · Score: 5, Informative

No, no, you misunderstand.
He's saying that linux-based *supercomputers* are faster then linux-based *clusters*.
(although, you can probably cluster those supercomputers...)

This is going to be interesting... by Anonymous Coward · 2004-08-20 03:10 · Score: 1, Funny

Most /.'ers are probably dying to mod this whole article down to troll.

I guess they can take their anger out on this anonymous post instead.

Re:*Shock* by Crazy_MYKL · 2004-08-20 03:11 · Score: 2, Informative

No, he's saying you should buy their Linux-based supercomputer instead of a Linux cluster. If you don't RTFA, at least skim the summary.

--

<jedi> There is something funny here. You laugh. </jedi>

The difference by rwven · 2004-08-20 03:11 · Score: 4, Insightful

The difference is that linux clusters aren't really designed for supercomputing... more of distributed computing. Cray specializes in it. Of course they're going to come out on top....

Re:The difference by Junta · 2004-08-20 03:14 · Score: 1

Huh? Umm, no, clusters are heavily used for supercomputing. Take a glance at the top500 and see for yourself. With high-speed interconnects (i.e. infiniband/myrinet), it is very feasible.

--
XML is like violence. If it doesn't solve the problem, use more.
Re:The difference by trifakir · 2004-08-20 03:27 · Score: 2, Informative

With high-speed interconnects (i.e. infiniband/myrinet), it is very feasible.
Hm, I haven't played with infiniband, but I have access to a small Myrinet cluster and it takes hell lot of efforts to write your application in such a way as to overcome the big disparity CPU power/network thoroughput and to have some normal speed-up.
Paul Terry is right - if they remove the PCI bottleneck it will be much easier to write scalable high-performance applications and then the costs will decrease.

This just in ... by Anonymous Coward · 2004-08-20 03:11 · Score: 1

Bear found defacating in a wooded region.
Pope reveals his membership of the Catholic Church.

Re:This just in ... by Anonymous Coward · 2004-08-20 03:14 · Score: 1, Funny

Bear found defacating in a wooded region.

I resent that remark.
-- Signed: a constipated polar bear...

editor training by Knights+who+say+'INT · 2004-08-20 03:11 · Score: 2, Interesting

You really shouldnt place commentary on a story title, unless it's an "its funny, laugh" one.

Oh, by the way, everyone who has a slashdot account should go to their preferences and set the "light" layout. You wont suffer with the bad color schemes anymore, and the results are more printer-friendly too.

Re:editor training by ggy · 2004-08-20 03:34 · Score: 1

...and the results are more printer-friendly too. I just have to ask, why would you want to print as /. discussion? I'm not trying to make fun of you, but I'm just baffled by the mere concept.
Re:editor training by drooling-dog · 2004-08-20 03:41 · Score: 2, Funny

I just have to ask, why would you want to print a /. discussion?
Well, if I make a particularly witty comment, of course I'd like to frame it and hang it on the wall behind my desk...
Re:editor training by Geoffreyerffoeg · 2004-08-20 09:53 · Score: 1

Why's everyone complaining about the colors? They're not harsh or ill-chosen (like brown-on-black or something). Is it just that they're different and you're not used to them? That kind of goes counter to the whole open-source and Linux trend around here....
Re:editor training by Geoff-with-a-G · 2004-08-20 13:42 · Score: 1

Personally, I think they're pretty ugly, and that the link text is difficult to read. Dark beige on light beige bordered with darker beige and white isn't exactly straight out of readability studies.

PS - nice user ID.

Slashdot Poster Says Comment Is Funny by UncleBiggims · 2004-08-20 03:12 · Score: 1, Funny

And it is, too.

Re:Slashdot Poster Says Comment Is Funny by justkarl · 2004-08-20 03:18 · Score: 1

To be totally honest...I'm a little dyslexic.

I first read the headline as "Cray Says Computers Are Great", which brings forth two obvious questions: Who is Cray, and why didn't he know already that computers were great?
Re:Slashdot Poster Says Comment Is Funny by Narchie+Troll · 2004-08-20 03:38 · Score: 1

The name Cray should be synonymous with supercomputing for anyone even vaguely aware of the field. I mean, Seymour Cray? Does the name ring a bell?
Re:Slashdot Poster Says Comment Is Funny by justkarl · 2004-08-20 04:00 · Score: 1

Troll indeed...Of course I know who Cray is...You missed the point.

Re:*Shock* by ohad_l · 2004-08-20 03:13 · Score: 5, Informative

Uhh, no, he's not dissing Linux at all. He's saying that one big supercomputer (running Linux, perhaps) will get you more price-performance (bang per buck, I guess) than a Linux cluster.

--
If it weren't for fog, the world would run at a really crappy framerate.

Supercomputers have advantages by Anonymous Coward · 2004-08-20 03:13 · Score: 1, Funny

I bet the cray can maintain higher fps in doom 3 than a cluster ever could.

Re:Supercomputers have advantages by TheRaven64 · 2004-08-20 04:56 · Score: 1

Maybe, although Chromium probably lets you get a very respectable frame rate. For those of you unfamiliar with Chromium, it's an implementation of OpenGL designed to run on a cluster, spreading the load between the nodes. On the other hand, it does suffer from a few frames latency, which is less than ideal for a FPS...

--
I am TheRaven on Soylent News
Re:Supercomputers have advantages by ftzdomino · 2004-08-20 05:21 · Score: 1

Yes, you may be able to achieve 10 frames per second in the highest quality setting.

Re:*Shock* by krog · 2004-08-20 03:14 · Score: 5, Insightful

Dude, the makers of "big supercomputers" invented clustering. I don't think they're afraid of it.

There are tasks that a cluster of Linux shitboxen will do well, and tasks where the cluster will not hold up so well against a real supercomputer. Google is an example of a perfect application for networked Linux servers. If you're simulating cloud physics one molecule at a time, though, you are a lot better off using the right tool for the job instead of 1,024 wrong ones.

--
Cretin - a powerful and flexible CD reencoder

A better angle would have been... by Linker3000 · 2004-08-20 03:14 · Score: 4, Funny

...Your square boxes will never look as sexy as our 'Love Seat'

--
AT&ROFLMAO

Deja vu? by X-ite · 2004-08-20 03:14 · Score: 1

Compare this article with http://it.slashdot.org/article.pl?sid=04/04/13/145 2255&tid=126&tid=106

Re:*Shock* by Anonymous Coward · 2004-08-20 03:15 · Score: 5, Insightful

FUD = Fear, Uncertainty, Doubt. Provide examples in his statements of any of those three?

P.S. You are so l33t for using TT.

Re:The issues are progress and long-term usefulnes by Marx_Mrvelous · 2004-08-20 03:15 · Score: 4, Interesting

There are some limitations to clusters that "supercomputers" don't have. Even if your network were exactly as fast as the internal bus of one of the Cray supercomputers (which I highly doubt it is), you still have a logical layer on top of it (TCP/IP/UDP etc). This slows it down.

For some applications, a cluster of slow PCs is ok. Bu if you want to do real time-intensive computation, you really can't beat a good internal bus.

--

Moderation: Put your hand inside the puppet head!

He's right!! by Hugonz · 2004-08-20 03:15 · Score: 1

"Linux clusters can not offer the same price-performance as supercomputers"

He's completely right, just not in the way he intended. You'd have a hard time making the cluster as expensive as the supercomputer....

Re:He's right!! by rusty0101 · 2004-08-20 03:41 · Score: 1

No, I can easily make a cluster as expensive as a super computer. Well, assuming I can spend all the money on the hardware necessary. Of course for a couple of million dollars, you can expect a rather impressive cluster.

Let me see, we'll take a quarter mill and use that to purchase the switches an cabling needed to interconnect everything. Might have to spend a bit to upgrade the power to our facilities, and speaking of facilities, we will probably need a warehouse some place to keep all the systems we are putting together into this cluster.

If we dedicate no more than a million to the actual hardware, cases, motherboards, cpu's, memory, (network card built into the motherboard) at perhaps a $100 per set (volume discounts) we are talking about a 10,000 node cluster. If we are talking about a $20M supercomputer, then we might need closer to a million each for networking and space, and we would get a 180,000 node cluster. (or better.)

Of course at that point, we might get better performance with quad, or 8way smp motherboards and processors, which might cost a bit more, but perhaps less than $400 or $800 per package. It could get interesting. Of course by the time the cpu lot got out of Intel, AMD, or IBM it would no longer be their fastest set of processors, but I think that shouldn't be a significant problem for the effectiveness of the volume of processors we would be working with.

We may not have an optimized bit of clustering software to run on it yet, but that's something to be worked out along the way as we are adding blocks of processors to the cluster.

The other problem is that we will probably need to put together a IPV6 infrastructure with over 65,000 nodes in the cluster.

--
You never know...
Re:He's right!! by mr_z_beeblebrox · 2004-08-20 04:40 · Score: 2, Insightful

He's completely right, just not in the way he intended. You'd have a hard time making the cluster as expensive as the supercomputer....

No, he's right in the way he intended.
He just leaves out a lot of information. The business environment determines what is or is not expensive. The computational environment determines what will or will not run fast, the two make a measure of how expensive something is.
If you are crunching a big continuous stream of numbers with multiple small results which are then looped in and crunched more (think major statistics, math, language interpretation etc...) These might be quicker on a single machine. If you are in an environment where (time=money)^2 (think casinos, trading floors, JIT manufacturing etc...) the lag of shared resources becomes MORE expensive than the single Cray. However, that statement is actually under hard analysis a no brainer and he is hoping that no one will question his statement enough to notice that for the other 85% (best guess statistic, blast it if you want to) of applications the cluster will offer considerable savings.
He's also hoping that no one will go down the road of the obvious that someone who has worked on the major 85% apps will do. Which I am guessing (no offence) is what happened with you. If you read it to critically his statements don't seem to hold any water. However they do, it's just the amount of water that is questionable.

Dupe! by Xpilot · 2004-08-20 03:16 · Score: 4, Informative

Yeah, no wonder this post looked familiar. Yup, it's a dupe, folks.

--
"Backups are for wimps. Real men upload their data to an FTP site and have everyone else mirror it." -- Linus Torvalds

Re:Dupe! by glenrm · 2004-08-20 03:29 · Score: 4, Funny

Cary most have bought the two posting, ad package.

--
Onward to the Aether Sphere!
Re:Dupe! by AbbyNormal · 2004-08-20 05:20 · Score: 1

No man, its a Cluster! heheh

--
Sig it.

Maybe "APPLE" will buy another Cray! by callipygian-showsyst · 2004-08-20 03:17 · Score: 2, Interesting

Remember when Applebought a Cray? It was mostly for show, so their R&D group can have the blinkenlights.

However it spawned a popular story about how "Cray designs on Apple and Apple designs on Cray" (see link.)

And now for the REST of the story:

Did you know that Macintoshes are designed on PCs!? That's right--PCs running WINDOWS. You see, nobody makes software to burn eproms or design printed circuit boards that runs on MacOS, so the hardware group has a bunch of Windows PCs!.

So now you know the *rest* of the story!

--
Best Buy can have you arrested

Re:Maybe "APPLE" will buy another Cray! by NoMoreNicksLeft · 2004-08-20 03:48 · Score: 1

Bullshit. Alot of the high end auto-routing stuff for PCB design runs on HPUX, AIX, even some Solaris, if I remember. Windows is an also-ran in this category, but mostly for the small developer. I doubt that Macs are designed on windows....
Re:Maybe "APPLE" will buy another Cray! by Thagg · 2004-08-20 04:00 · Score: 4, Interesting

As usual, there is more to the story. Apple brought my company in on a project back in the mid 80's when they bought the Cray. While we had to sign an NDA in blood, I doubt anybody will mind me talking about it now, almost 20 years later.

Apple was trying to design a new cpu chip. It would have had vector processing capabilities not all that different from the Cray, so they bought the Cray both to do circuit simulations on the chip and as a model for their own design.

The chip was going to be a 100 MHz chip (an astonishing speed for the time) with a four-pipleline vector processing unit.

They considered (but eventually declined to) hire us to develop some kind of 3D desktop for the Mac. The idea was this would distinguish the Mac further from other computing systems, but they wouldn't be able to emulate the interface because they didn't have the horsepower.

Anyway, that's the Apple-Cray story as I understand it. I'm sure that there is a lot more to the story than I know, of course.

Thad Beier

--
I love Mondays. On a Monday, anything is possible.
Re:Maybe "APPLE" will buy another Cray! by Billly+Gates · 2004-08-20 10:59 · Score: 1

SO the intel commercials featuring the Pentium are done on macs.

--
http://saveie6.com/

Re:The issues are progress and long-term usefulnes by vondo · 2004-08-20 03:18 · Score: 4, Informative

The latency on Ethernet is too high for many tightly coupled applications (lattice QCD for example). This is why people who need better networking use something like Myrinet. I would assume that these Cray machines have very high band-width, low-latency communications. This is where super-computers distinguish themselves from clusters.

heh by NanoGator · 2004-08-20 03:20 · Score: 1

I saw this MST3k blooper once where Tom called out "Cray" instead of "Crow". Still in character, And with false modesty, Crow replied with "Well that's very nice of you, Tom. I'm really more of a PC though."

(Not a verbatim quote.)

--
"Derp de derp."

Re:Unfuglify by AndroSyn · 2004-08-20 03:20 · Score: 2, Informative

Viola, un-fuglied version.

Not to nitpick but a Viola is a string instrument in the violin family, the word you want is voilà.

Or for an alternative press release by bfree · 2004-08-20 03:20 · Score: 1

You could look to SGI. Their Altix range is up to 1024 Itanium 2 processors in a single supercomputer, and they are putting 20 512 * processor nodes together in a cluster of linux supercomputers for NASA while also working on doubling up the maximum single machine cpu count to 2048.

--

Never underestimate the dark side of the Source

Re:The issues are progress and long-term usefulnes by PythonCodr · 2004-08-20 03:22 · Score: 5, Informative

It's not just the speed of the data transfer, it's also the latency of the interconnect. A lot of scientific codes will pass around a lot of little messages, and GigE is fast for bulk transfer, but it's not so good for that. That's why there are companies like Quadrics, Myricom, etc... Infiniband should fix this, but you'll want a big infiniband switch.

His point is building fast machines is hard, and the fastest machines are really hard. Too many folks think all you have to do is throw enough PCs and GigE nics at the problem. You can build a machine that way, but the codes don't scale well. Some scientific code will quickly show negative scaling in fact (where the more processes you add, the *slower* you code will run.) MPI codes do that all the time, which is one of the reasons you'll see people running their code at sizes smaller than the whole machine, and different sizes on different machines.

Yeah, you can build a Linux based world-class supercomputer as a cluster, but you better be willing to sweat the details is all. Or buy a Cray, I guess. ;-)

Cray CTO Says Cray Computers Are Great by md81544 · 2004-08-20 03:22 · Score: 1, Troll

In other news, Bill Gates says Windows is secure...

Re:Cray CTO Says Cray Computers Are Great by jack_csk · 2004-08-20 03:50 · Score: 1

In other news, Bill Gates says Windows is secure...

When you installed a firewall...

No ... by gstoddart · 2004-08-20 03:22 · Score: 5, Informative

There are entire classes of computational problems which are calssed as Embarassingly Parallel.

It means it is so trivial to parallelize the problem and get gains from it (think SETI@Home) that it's a no-brainer.

Other computational problems don't just simply fan out to the bazillions of nodes with tiny independant pieces of data.

Your assertion that the Cray CTO is talking FUD when he uses the actual term is just plain wrong and unfair to him. He actually knows what he's talking about.

--
Lost at C:>. Found at C.

Re:No ... by iamdrscience · 2004-08-20 03:47 · Score: 1

Well, I think you're right that he's not just talking FUD and he knows what he's talking about, but still though, tell me it isn't a little unfair that he's very directly implying Cray supercomputers are, across the board, a better solution. There are some problems with which Cray's computers are a better solution (not just those that don't parallelize, but plenty that don't parallelize very well), but there are also problems where a Cray is rather unnecessary.
Re:No ... by gstoddart · 2004-08-20 04:07 · Score: 1

Well, the man is expected to be something of an advocate after all.

He is saying that "yes, you could build a Linux cluster cheaper, and in many problems it works. But there are other factors where we feel we offer an improvement which people will find valuable".

I don't see any unfairness in what he said at all. I think the people buying supercomputers/clusters would legitimately need to assess if the build-time of a cluster is worth it to them.

Unfair would be saying "the results from a Linux cluster are morally inferior to those from a Cray since we're on cooler hardware." =)

Me, I'm just happy that we still have cool companies like Cray around and that they're using Linux on their machines.

--
Lost at C:>. Found at C.

Clusters don't scale, huh? by FyRE666 · 2004-08-20 03:22 · Score: 2, Informative

Scaling or upgrading these systems requires much more than simply ordering more parts; it opens up the whole integration exercise. From an application perspective, clusters limit application scaling. Bandwidth and latency restrictions significantly constrain performance as more processors are applied to a problem.

Has this guy ever heard of Google? I can see his point to an extent; in fact his whole q&a session/blatant advert really boiled down to a single point: If you need to move a lot of data between processors, then a cluster will faire worse than one of Cray's supercomputers which have (obviously) more bandwidth between the CPUs and shared memory. It really does depend on the application, but for him to suggest an HPC is always a more economic, or even better option than a cluster of cheap x86 boxes is demonstrably false...

--
Code, Hardware, stuff like that.

Re:Clusters don't scale, huh? by argent · 2004-08-20 03:33 · Score: 5, Insightful

for him to suggest an HPC is always a more economic, or even better option than a cluster of cheap x86 boxes is demonstrably false

It would be if he'd said it, so it's a good thing he didn't. He even commented that there are applications (emabarassingly parallel algorithms) that clusters do very well at. And Google is a perfect example of that.
Re:Clusters don't scale, huh? by simon_hibbs2 · 2004-08-20 03:58 · Score: 1

>It really does depend on the application, but for him to suggest an HPC is always a more >economic, or even better option than a cluster of cheap x86 boxes is demonstrably false... But he doesn't say that. He spends quite a bit of time explicitly saying that clusters are great for some problems, particularly embarrasingly parallel computational problems. I thought his comments were much fairer and overall more informative than most of the score 4 and 5 ones here. Simon Hibbs
Re:Clusters don't scale, huh? by CommieOverlord · 2004-08-20 05:15 · Score: 1

Clusters don't scale, huh?

No, they don't (not really anyway). Google isn't a HPC cluster by the way.

Take the big 1000+ CPU clusters build by labs and universities. They aren't running the same application parallelized across 1000 nodes. What happens is UserA gets 16 CPUs, UserB gets 32, maybe 10 for UserC.

Most calculations don't scale well beyond a certain number of processors, and often adding more CPUs will _slow_ down performance. Depending on the problem, a large supercomputer can scale better than a cluster.

Some applications are parallelized to run on 500, 1000, more nodes but those are very rare and very hard to write.
Re:Clusters don't scale, huh? by dasmegabyte · 2004-08-20 07:42 · Score: 1

It's my understanding that Google works because they have split the results system up so granularly and spread the load so accurately that once a machine receives its request, it doesn't have to contact any other machines to make a response. So in essence, each member of the cluster receives a small input task from the controller and does the rest of the work by itself. Google doesn't necessarily need to share information between two artibtrary nodes on its system, which further decreases the complexity of the software.

Very, very few supercomputing tasks can be so easily parallelized. And to be honest, you don't even really need Linux or clustering to do it. I used to do something similar using a regular expression and a BigIP firewall device...servers were numbered 0-9 and pages were broke down to a single number then forwarded to that specific reverse cache server.

--
Hey freaks: now you're ju

Re:Yes he's talking FUD by CommieOverlord · 2004-08-20 03:23 · Score: 3, Informative

Are you being funny or serious?

There's an entire branch of parallel application which are labeled "embarrassingly parallel". This description simply means that such programs are trivially parallelized and achieve as close to linear as possible when scaled across many nodes. This is because of the low inter-node communications.

For "embarrassingly parallel" applications, a cluster is a really good tool. For programs that parallelize as nicely a nice big vector or smp will do nicely. Some code will run better on small 20CPU SMP machine than on a 1000 node cluster.

Re:The issues are progress and long-term usefulnes by ctr2sprt · 2004-08-20 03:23 · Score: 5, Insightful

You're right, the key is "cheap." Clusters don't offer the same level of performance as supercomputers. I don't think you'd disagree with that statement. What they do is offer a similar level of performance - once unattainable by desktops or even high-end servers, and here I mean real high-end servers instead of just quad Opterons or the like - for probably a tenth the cost.

But even then, there are legitimate needs for supercomputers. A traditional PC-based server solution will address probably 99% of all problems. An inexpensive cluster will get you 99.9%. But there's that remaining 0.1%, and that's the target audience for whom Cray and similar companies exist.

The fact that PCs can be used almost unmodified to create supercomputers and high-speed clusters is remarkable, and says tremendously good things about the flexibility and power of the architecture as a whole. But there are just places it can't go, not yet. For example, you know how you never get 99% efficiency with 100 megabit ethernet? You're lucky to get 70% with gigabit, and 50% is a pretty common figure. PCI-X, at least at the speeds we're talking about here, is so rare now that it's hardly cheaper than custom supercomputer-style solutions - effectively because it is a custom supercomputer-style solution. I don't think we'll ever see common systems, even midrange servers, with more than one 16X PCI-X slot.

I really think this is what Cray mean here. Not that Linux-based clusters have no use, but that there is still a significant market for which they are suboptimal. And, in all probability, will always remain suboptimal. However fast PCs get, however popular PCI-X and similar high-speed buses become, supercomputers will just get faster to match... and computational problems will get harder to go along with them. I just don't see the need for supercomputers, at some level, ever going away.

(I hope people find my comment useful in some way. I elected to post it rather than mod down the idiot posting flamebait about Macs in reply to you. And here's hoping people don't interpret this as karma whoring, since usually if you say "This will get modded down" it doesn't. But... oh, hell. I don't even know which Slashdot rule of thumb applies to my post at this point.)

Geez by iamdrscience · 2004-08-20 03:23 · Score: 4, Informative

Being the CTO of Cray, can you expect him to say anything less? Now while his points are often valid, I think his conclusion, that supercomputers outshine linux clusters is a little inaccurate. Rather, I think the real conclusion is that linux clusters and supercomputers are both good, but at slightly different things. Which one you need to solve your problem depends ultimately, on the specific details of your problem. Again, though, being the CTO of the company, can really expect him to give a balanced opinion like that, rather than the skewed opinion that his company is always on top?

Cray is a great company, but I really hate that they have to come out with things like this every now and then. Most people in need of a lot of computing power already know the difference between your products and linux clusters and really, they're going to choose whichever's most appropriate for their problem regardless of what your CTO says.

Re:Geez by argent · 2004-08-20 03:38 · Score: 4, Informative

I think the real conclusion is that linux clusters and supercomputers are both good, but at slightly different things. Which one you need to solve your problem depends ultimately, on the specific details of your problem

Indeed. He actually made that point himself: "There are some applications where a well-designed Linux cluster can deliver good price/performance on a particular application; those embarrassingly parallel applications where processors spend little time exchanging data."
Re:Geez by lscotte · 2004-08-20 04:15 · Score: 1

Being the CTO of Cray, can you expect him to say anything less?

Exactly.. Next thing you know, Bill Gates will be telling us Windows is better too!

--
This post is licensed under the Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License.

Re:-1 Informative by callipygian-showsyst · 2004-08-20 03:25 · Score: 1

What does this have to do with...

Who cares? This is /.! Nobody reads the article, and I got modded up instantly! All it takes is a few lines of text with a few links in it. Why bother doing any more?

--
Best Buy can have you arrested

Re:Yes he's talking FUD by compupc1 · 2004-08-20 03:25 · Score: 1

"embarassingly parallel" is just a phrase used to describe certain types of problems. Ok, I'll give you the fact that the phrase is a bit biased, but it's still just a phrase. Clusters work well for problems that can be broken up into smaller chunks that can be independantly solved individually and then combined to produce a final result, but for problems that require signifigant amounts of communication and data transfer between processors, clusters just don't cut it. Crays and other supercomputers use specially designed communication networks between processors and memory and such, and that's why they're so much more useful for those types of applications.

--
-James

Correction by Leomania · 2004-08-20 03:26 · Score: 2, Funny

Cray CTO Says Cray Computers Are Great

Actually, I think he said that "Cray computers rock, eh?" or perhaps it was "Cray computers kick ass, eh?" or something like that.

- Leo

--
You don't use science to show that you're right, you use science to become right.

Re:Correction by drphuck · 2004-08-20 05:57 · Score: 1

funny, cuz americans are very similar to canadians in how they talk, there are just 2 differences... Canadian: "How's it going, eh?" American: "Eh, Hows it going?"

--
"Software is like sex... it's better when it's free"
Re:Correction by Leomania · 2004-08-20 10:27 · Score: 1

I dunno... I bet we Americans manage to interject a lot more "Ummm"s and "Uhhh"s into a given sentence than our neighbors to the north.

Come to think of it, does this make Bugs Bunny a Canadian-American? Think about it... "Ummm, eh; What's up doc?" Looks like a combination of both sides of the border, eh?

- Leo

--
You don't use science to show that you're right, you use science to become right.

Not quite so simple really is it? by Anonymous Coward · 2004-08-20 03:27 · Score: 5, Informative

I don't think the Cray assertion is that crazy.

For a 12 CPU opteron unit the academic pricing (admittedly lower than commercial but where most of their sales will go) is about 45K. That's not too shabby. Before you bounce up and down and say I can build four times the cluster for that price, it should be noted that the XD1 gives you a single systems image, which simplifies programming and makes shared memory applications (increasingly important for areas such as bioinformatics).

We have a cluster with dolphinics wulfkit, using distributed shared memory slows us down. It's not the end of the world type slow down but it's a factor. Our cluster is a sixteen node, dual xeon 2.2GHz with wulfkit 3d torus interconnects. It cost us, at academic prices, $50K. Admittedly more CPU power than the 12 Opterons but we find ourselves using distributed shared memory alot, wulfkit is great here, and that would probably be much better on the XD1. Had the XD1 been available a year ago we may have bought one instead.

It really depends on your application. Are Crays cheaper than clusters in terms of harnessable compute power per dollar? Maybe. Depends on your application. Surely that's the correct answer.

Also, buying Cray is about getting access to their software technology too.

R-S

Re:Not quite so simple really is it? by jaywee · 2004-08-20 04:04 · Score: 1

Also, don't forget one XD1 unit has 6 Virtex II Pro FPGAs. Which should speed up certain applications a *lot*.
Re:Not quite so simple really is it? by Wesley+Felter · 2004-08-20 04:13 · Score: 1

the XD1 gives you a single systems image, which simplifies programming and makes shared memory applications

Is this really true? The XD1 Web site just talks about MPI.
Re:Not quite so simple really is it? by night · 2004-08-20 06:11 · Score: 1

ER, the 12 CPU opteron is actually 6 dual opterons, and does NOT provide single system image. Shared memory is only available between each pair.
Re:Not quite so simple really is it? by Wesley+Felter · 2004-08-20 07:02 · Score: 1

The XD1 uses Mellanox Infiniband switches.

How would a cluster of commodity hardware behave, if using a low-latency interconnect?

#2 in the world, if you consider Itanium to be commodity. The fastest Xeon cluster is #5 in the world.
Re:Not quite so simple really is it? by Shinobi · 2004-08-20 20:40 · Score: 1

Otoh, those figures are for a task that is very trivial and easily parallellizable, i.e they fit well on clusters.

In other news... by Anonymous Coward · 2004-08-20 03:27 · Score: 1

Linus Torvalds claims Linux is "better than Microsoft."

Re:*Shock* by beh · 2004-08-20 03:28 · Score: 3, Insightful

Well, supercomputing can be either of two issues

a) (google-like) jobs well suited to a high degree of parallel processing.

b) complicated problems that can't easily be broken down to make use of a large number of CPUs, but require a lot of operations to be completed in the proper sequence.

On the first, a cluster is a great idea.
On the second, a reaaaaaallly fast CPU is a great idea.

Re:*Shock* by networkBoy · 2004-08-20 03:28 · Score: 4, Insightful

If you're simulating cloud physics one molecule at a time, though, you are a lot better off using the right tool for the job instead of 1,024 wrong ones.

In this case the right tool is a vector based supercomputer like the SV1 (8 vector processors at 2Gflops each . . . MMmmmmmmm). A cluster based approach will waste more processing time with the message passing than anything else. Cheaper maybe, but grosely ineffecent.
-nB

--
whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump

Re:Yes he's talking FUD - NO!!! by cintyram · 2004-08-20 03:28 · Score: 1

"Emberassingly parallel" is a term referring to parallel computations involving minimal or no communication across the computing nodes.
it is not to be interpreted as "parallelism is emberassing to him" :)) ;
oh and by the way, what he said is not wrong !!
-ram

Re:Yes he's talking FUD by Jerbiton · 2004-08-20 03:28 · Score: 1

> why is parallel "embarassing"? oh I see, facts aren't good for your bottom line.

Actually that is a common expression, and not something he pulled out of his heinie - peek here

Re:Yes he's talking FUD by argent · 2004-08-20 03:29 · Score: 1

I think you misunderstand what he said. The term "embarasingly parallel" has been in common use for many years to describe problems that require so little communication between processors that they can be scaled up more or less indefinitely just by adding more computers. The ultimate examples of "embarassingly parallelizable problems" are things like the human genome project or SETI-at-home, where it's practical to farm it out to completely disconnected computers to do bits of the work in isolation.

Re:*Shock* by lukewarmfusion · 2004-08-20 03:29 · Score: 1, Interesting

No, the inventors of big supercomputers (couple million dollars a pop) are definitely scared of clustering.

If you want a Cray supercomputer, you have to buy it from Cray. If you want a Linux cluster, you can buy it (or build it) from anyone.

I'm sure there are applications for a supercomputer, but I see universities, production studios (Pixar!), and research labs moving toward clusters. The supercomputer companies will do anything it takes to either stop that from happeneing or to gain in that market.

Re:*Shock* by SocietyoftheFist · 2004-08-20 03:29 · Score: 1

Didn't read the article did you? He's pushing one of their machines running Linux and even talking about when a supercomputer isn't the best solution.

Re:*Shock* by stratjakt · 2004-08-20 03:30 · Score: 1

Clusters are really only good when what you're doing is massively parallelizable, like 3D rendering or folding@home types of applications.

For stuff thats not, algorithms that only work sequentially, nothing beats a crazy-insane-fast CPU, memory and system BUS.

You'd think that'd be a no-brainer to the "computer experts" here at slashdot, but you'd be wrong to assume that /.ers are actually "computer experts" when they're merely crusaders for their favorite OS.

Cray isn't "scared" of linux clusters. They just can't do what a Cray does.

--
I don't need no instructions to know how to rock!!!!

No, they're SUPER by Anonymous Coward · 2004-08-20 03:30 · Score: 1, Funny

I think it was Big Gay Al that said they were SUPER.

Re:No, they're SUPER by decepty · 2004-08-20 03:57 · Score: 2, Funny

Well, hey, thanks for asking.

--
Be careful! Bears shouldn't consume large furry dogs.
Re:No, they're SUPER by NanoGator · 2004-08-20 04:35 · Score: 1

"Well, hey, thanks for asking."

Don't you just look great in that dress?

--
"Derp de derp."

Re:Unfuglify by vrt3 · 2004-08-20 03:30 · Score: 1

Actually, I like http://developers.slashdot.org/article.pl?sid=04/0 8/20/1337224&tid=137&tid=163 better.

--
This sig under construction. Please check back later.

Re:Whee! by the_mad_poster · 2004-08-20 03:34 · Score: 2, Informative

Hi, clueless Slashbot. This is a quick rundown of why your post was stupid, and why Cray supercomputers do, in fact, do some things better than a PC cluster regardless of price.

If you have a supercomputer, you have a very, very, very fast internal bus handling all necessary data transfer. Even with the advent of PCI Express, a cluster of PCs must run in a network model. Therefore, any data crunching that occurs must pass through a network layer, the bus, the physical medium, and back through those limiters once more on the next system. Therefore, if you are doing number cruncing that truly cannot afford the delays caused by the data transfer limitation of a PC cluster, a self-contained supercomputer is far and away the best option, even if it's more expensive.

Therefore, contrary to the idiotic drivel you just spouted, Cray does, in fact, have something to offer that no PC cluster currently can.

We now return you to your regular informed diatribe in the name of the self-gratifying masturbatory stupidity that is Slashdot.

--
Alito: A vote for Alito is a punch in the eye to put that bitch back in her place!

Re:The issues are progress and long-term usefulnes by Wesley+Felter · 2004-08-20 03:35 · Score: 5, Informative

Good clusters don't use IP; they use Infiniband, Myrinet, or Quadrics, which all have OS bypass and trasport offload features so that the app can talk directly to the NIC. In fact, Cray's XD1 "supercomputer" uses the same Infiniband interconnect as some "clusters"; Cray just has better NICs.

Re:*Shock* by ranrub · 2004-08-20 03:35 · Score: 5, Informative

Have you ever worked with supercomputers?

However, if your supercomputer goes down... well, your screwed

Cray supercomputers have built-in redundancies. All the subsystems are separate from the processors and memory, which are actually "clustered" (depends on model). Even the OS has build-in means to survive the harshest hardware catastrophe by checkpointing the running jobs regularly, to off-site disks.

1000 machines are more reliable then 1 big machine

Wrong again. With 1000 lousy cheap machines, you need an on-site team of technitians to keep the all up. Supercomputers (with built-in redundancy etc.) have equal or less maintenance requirements.

Re:Cool, competition by Mistah+Blue · 2004-08-20 03:37 · Score: 1

If you had RTA you would have learned that these new Cray systems run Linux as their OS.

Re:Unfuglify by Mr.Ned · 2004-08-20 03:37 · Score: 1

http://www.electricstate.com/articles/defuglify-sl ashdot/

Found this a while back, and now have it in my Firefox Toolbar - works great.

The argument by manavendra · 2004-08-20 03:40 · Score: 4, Informative

is based on :

Heritage and resultant architecture: Linux clusters are typically processors are connected through I/O links, whereas supercomputing machines where processors exchange data and instructions through shared memory.
PCI bottlenecks: This the key argument made - the bottlenecks introduced by PCI communication and the bottlenecks therein. He goes on to say that performance problems in any given such cluster tend to remain with any other such cluster. I agree with that.
High Availability: He then goes on to talk about the reliability, availability and manageability of the supercomputers against typical clusters. I think there is where the FUD creeps in, along with marketing BS.

In all fairness, he does raise a critical point, however, overall, I think considering the relative ease and popularity of building, administering and growing a cluster these days, I think cost-effectiveness of a single monolithic machine is a moot point

--
http://efil.blogspot.com/

He basically said faster communications needed by VernonNemitz · 2004-08-20 03:40 · Score: 2, Insightful

That is, for a Linux cluster to keep up with a supercomputer, the cluster needs faster communications between processors. The bottleneck of going from processor to South Bridge to PCI Bus to Ethernet card, and back again at another processor, is the problem.

So, the answer is to recognize that in a cluster most of the machines don't need video cards. That means Somebody can design a fiber-optic communications card that plugs into the AGP slot (or maybe a PCI Express slot). Then, Cray, look out!

Re:He basically said faster communications needed by questionlp · 2004-08-20 03:52 · Score: 1

AGP is pretty much an over-glorified PCI bus anyway, it just runs faster than your standard PCI bus. PCI-X will get you up to 1GBps and PCI-X 2.0 increases that to around 2GBps. Also, any server that is worth it's salt will have multiple PCI-X busses (instead of one or two plus AGP, which is more common in workstation boards) and relegates video to serial or a basic PCI video chip.

PCI Express will definitely help bump up performance, but it will take some time for cards to be released in x1, x2, x4 or x8 and hopefully without using PCI-to-PCI Express or PCI-X to PCI Express bridging.
Re:He basically said faster communications needed by imroy · 2004-08-20 04:14 · Score: 1

No, AGP has pretty much the same problems as PCI. What supercomputers generally use is much more direct - special memory management in what is essentially the "north bridge". Supercomputer nodes have no need for peripherals anyway, so a peripheral bus is a waste. Each node is just a CPU, memory, and some communications hardware. The latter is the important part.
Re:He basically said faster communications needed by vidarh · 2004-08-20 04:14 · Score: 2, Interesting

And in doing so you are essentially building a super computer. However you'd have to keep in mind that it isn't all about total bandwidth - latency also needs to be extremely low. That said, HP is working on an open source Single System Image clustering support for Linux on "normal" hardware
Re:He basically said faster communications needed by VernonNemitz · 2004-08-20 05:11 · Score: 1

Thanks for the replies!

OK, then what about a communications card that plugs into, of all the mad ideas, a DIMM slot!!! All it has to do is fool the BIOS into looking like memory during Boot, and maybe it would actually contain some RAM, but whenever the CPU writes to a certain address range, it is being commanded to communicate. (Not to mention that the I/O buffer is about as handily available as can be imagined!)
Re:He basically said faster communications needed by dasmegabyte · 2004-08-20 07:53 · Score: 1

Then, Cray, look out!

For whom? You seem to be of the opinion that complicated networking hardware will just spring up out of a vaccuum. Such is not the case. If and when said technology is invented, it will be BY Cray -- or by another player in the HPC field.

The cost will be high (because of high performance electronics, redundancy etc). The availability will be low. The cabling and control unit will be very expensive.

And at the end of the day, Linux based low latency distributed computing will approach or exceed the cost of Linux based mainframe computing. You're paying a lot for the hardware in either direction. You're paying a company to build and support your solution. You're relying on a single vendor for your processors and interconnects. Where's the benefit?

--
Hey freaks: now you're ju

2 word summary of article by chill · 2004-08-20 03:40 · Score: 1

PCI sucks

--
Learning HOW to think is more important than learning WHAT to think.

Re:Cool, competition by Progman3K · 2004-08-20 03:41 · Score: 1

So the claim is "Our supercomputer performs better than a PC?" No kidding. Even for geeks, this isn't really news.

--
I don't know the meaning of the word 'don't' - J

Re:The issues are progress and long-term usefulnes by Performer+Guy · 2004-08-20 03:42 · Score: 4, Insightful

Clusters are nice for some problems but message passing and memory copying over a network is not ideal even when you have what *you* think is a lot of bandwidth. Latency and cache coherency and having a single image system can be critical factors in some classes of supercomputing problem, not to mention ease of use and specialized fp vector instructions that are often supported. The topology in large systems is often built (flexibly) into the memory controller hardware, the CPU writes to memory and it finds the right node, page migration and process affinity along with other advanced features like hardware level cache coherency helps these systems outperform clusters with ease given the right problems.

The coolest thing about this IMHO is that Cray are using Linux for their single image systems.

Yep the performance of computers is always on the increase but there will always be demand for more compute, the question is where do you want to be on the performance curve, not the absolute performance. People solve increasingly difficult problems with increasing detail and there looks to be no slowdown. They buy what suits their budget and solve as rigorously as they can for their hardware, and as hardware improves they redefine the types of problem they want to solve.

Yup clusters are cheap and they're on the top 500 but nobody actually buys a supercomputer to run LINPACK. They use them to solve real problems, the list is just for bragging rights.

Re:*Shock* by Rei · 2004-08-20 03:42 · Score: 1

It depends. If you're simulating cloud physics across a wide range of starting conditions, for example, a cluster will be your best bet.

There are very few situations in which there isn't a single parallizable task, and if there is one, a cluster is probably your best bet.

--
No matter how kind you are, German children are kinder.

Re:The issues are progress and long-term usefulnes by NoMoreNicksLeft · 2004-08-20 03:43 · Score: 2, Funny

Don't suppose anyone has an old YMP or whatever that they'd be willing to give to a good home in Virginia?

Or for that matter, a warezed copy of Unicos....

I for one ... by cascadingstylesheet · 2004-08-20 03:43 · Score: 2, Funny

I, for one, welcome our new story-duplicating, supercomputer-mocking, Slashdot editor overlords ...

Re:I for one ... by pandrijeczko · 2004-08-20 03:57 · Score: 1

story-duplicating
Perhaps it was "The Cray Twins"!
(For the benefit of our non-English readers, Ronnie & Reggie Kray were two infamous East-End (of London) criminal twin brothers during the 1960s.)

--
Gentoo Linux - another day, another USE flag.

Why are Linux clusters' interconnects slow? by Louis+Savain · 2004-08-20 03:44 · Score: 1

On the other hand, supercomputers are purpose-built to handle HPC applications, which place enormous demands on both processing power and inter-processor communication. Their design includes high performance interconnects that provide high bandwidth, low-latency communications across the entire system, regardless of the number of processors required.

Why can't Linux clusters use the same high performance interconnects? Is it because of cable overhead (length, signal travel, insulation, etc...) or is it because of slow electronic switching? Why can't optical linkage provide the same low-latency interconnect performance as that of supercomputers. Somebody tell me, please. I need to know.

Re:Why are Linux clusters' interconnects slow? by argent · 2004-08-20 03:53 · Score: 1

Why can't Linux clusters use the same high performance interconnects

They can. It's just a matter of how much you want to spend, and the result wouldn't necessarily be a "cluster" any more. It's distance, bus overhead, network overhead, chipset architecture, everything you listed and more.
Re:Why are Linux clusters' interconnects slow? by stratjakt · 2004-08-20 03:58 · Score: 1

If you're talking about commodity PC hardware, ultimately whatever interconnect you use is going to be hampered by the PCI bus.

There's interconnect stuff more efficient, protocol wise, than giga ethernet, you can lower latencies and do compression and stuff, but in the end, it's the PCI bus that creates the bottleneck.

Now, come up with a BUS that operates closer to CPU speeds, and now we're talking. That's (a part) of what a Cray does that a pile of G5 X-servers cant.

The zealots can calm down. This would be like a Ford rep saying "cars are good for some things, but nothing beats a F150 if you want to move a whole lot of dirt cost-effectively".

--
I don't need no instructions to know how to rock!!!!
Re:Why are Linux clusters' interconnects slow? by mr_z_beeblebrox · 2004-08-20 05:43 · Score: 1

Why can't Linux clusters use the same high performance interconnects?

Linux can use them and does on the new Linux/Cray XD1. It is the PC Achitecture that he is saying slows down Linux clusters (and I find it humorous that Windows clusters are not mentioned...they have the same problem plus there is no interest in them) When PCI is replaced (I have heard of PCI express but not seen it yet) these numbers will start to even out.
Re:Why are Linux clusters' interconnects slow? by argent · 2004-08-20 05:52 · Score: 1

Uhmm.. No.. they can't.. We are talking proprietary hardware.

Right, like I said, it's just a matter of how much you want to spend. You can build a Linux cluster that's got the performance of a supercomputer if you remove enough bottlenecks. It just takes money, and building custom motherboards is just the start of what you'd have to spend.

And as I implied... by the time you finished, it wouldn't really be something you'd call a cluster any more.

And you'd be Cray.

ALERT!!! BREAKING NEWS!!! by still+cynical · 2004-08-20 03:46 · Score: 1

This just in! Company exec. says their products are great!!!

Seriously, this is news?

--
Ignorance is the root of all evil.

cost???? by positroniumman · 2004-08-20 03:46 · Score: 1

now, perhaps i missed the point, but i can afford the beowulf cluster in my basement. But, i don't think i can afford even a used cray:p

Re:*Shock* by Rei · 2004-08-20 03:46 · Score: 1

As I've mentioned elsewhere, all you need is a single parallelizable task for a cluster to be worth it. For example, if you need a simulation to be run over a range of starting conditions, a cluster is probably your best bet.

Unless you have a single monolithic entangled run, you don't need a supercomputer - hence, the surging popularity of clusters. Yes, not everything is suited for clusters... but most things are, because most have parallelizable components at least *somewhere* in the process.

--
No matter how kind you are, German children are kinder.

Re:*Shock* by krog · 2004-08-20 03:47 · Score: 4, Insightful

Networked clusters are useful only when the task is parallelizable, and each subtask is computable on a single node. Cloud physics is not like that. Cracking RC5, for instance, is.

--
Cretin - a powerful and flexible CD reencoder

A little inaccurate... by dfj225 · 2004-08-20 03:47 · Score: 1

While many things that the Cray CTO said are true, I think the issue (obviously) has be skewed some. It really depends on the problem you are solving. Some problems will need to have data shared between all of the the nodes, but others will require that each node only has access to the data that is important to the small part of the problem that it solves. Also, the CTO mentioned that clusters don't scale very well. I don't really know what made him think this, but it seems to me that clusters do scale pretty well. For instance, ILM supposedly uses all of its employee's workstations at night to help do the daily renders. This way all of the cpus sitting on desks don't go unused during off hours.

--
SIGFAULT

Re:A little inaccurate... by stratjakt · 2004-08-20 03:53 · Score: 3, Insightful

They don't scale for applications that require shared memory access.

Something like SETI@home could scale almost infinitely. The data elements are completely unrelated.

But if every node needed access to the same chunk of data, then the more nodes you add, the more they "fight" over that chunk of data.

Ultimately, with a PC cluster solution, only one node at a time can be accessing any given section of "shared" memory.

That's what he means, and he's right. ..offtopic..

Look at the slashbots who can't understand the article throwing a fit because of a percieved "diss" against linux. This place really makes me laugh sometimes. Hell, Cray's new gear is using linux. Cray is a card-carrying linux loving company, and have been for quite awhile.

And Cray's got some friggin crazy tech. I can't wait to see what they have to kick back into the kernel.

--
I don't need no instructions to know how to rock!!!!

Re:The issues are progress and long-term usefulnes by red+floyd · 2004-08-20 03:48 · Score: 1

Infiniband uses a variant of IPv6 for addressing, and I believe the protocol is IPv6 based (It's been a few years since I looked at IB).

--
The only reason we have the rights we have is that people just like us died to gain those rights. -- Cheerio Boy

In other news... by mrjb · 2004-08-20 03:48 · Score: 4, Insightful

MS says their operating system is great. McDonald's says their food is great *and* cheap.

--
Visit http://ringbreak.dnd.utwente.nl/~mrjb/growingbettersoftware to download your free copy of the book

Re:*Shock* by Rei · 2004-08-20 03:48 · Score: 1

Please reread my post. I mentioned "a range of starting conditions" as being one possible (of many) mitigating case. Do you disagree? If there is a range of starting conditions, each condition's simulation can be assigned to a node.

That's the thing about clustering - you only need *one* parallelizable cpu-intensive task, and a cluster becomes worth it.

--
No matter how kind you are, German children are kinder.

I dont have enough money.. by essreenim · 2004-08-20 03:50 · Score: 4, Funny

for a Cray, you insensitive clod.

It ain't religion. by Performer+Guy · 2004-08-20 03:51 · Score: 5, Insightful

It's a but depressing to watch everyone jump on Cray here despite having no clue about the key differences between supercomputers and clusters are. All this cheerleading for clusters in various posts here illustrates how thoughtless some of these posts are. Why the heck should you care if someone makes a supercomputer or a cluster. Both clusters and supercomputers lose value fast over time.

Yes clusters are good for some stuff but we should be rooting for Cray if they're creating interesting products that fill a need, and that's exactly what they do.

It is a fact that supercomputers have an architecture that clusters cannot compete with for some classes of problem. Get over it, live with it and enjoy the fact that supercomputers are running Linux too.

It's pretty darned cool that Cray survived until now and that they still have a market for large single image systems.

Re:It ain't religion. by stratjakt · 2004-08-20 04:02 · Score: 1

This is slashdot. The submission was worded so that it sounds like the guy was "dissing" linux, hence all the hate and stupidity from the zealots.

If he said "Crays are faster than dual G5 PowerMacs", he'd be completely right, but the Apple astroturfers would be all over the comments talking about megahertz myths and other moronic tripe.

I've always thought of "supercomputer" and "cluster" as two completely separate tools for largely seperate applications.

Myself, I can't wait to see what kind of magic juju Cray has that could be included into the linux kernels of tomorrow.

--
I don't need no instructions to know how to rock!!!!
Re:It ain't religion. by BCW2 · 2004-08-20 04:27 · Score: 1

In actuality he just claimed clusters were inferior to super computers. A matter of opinion more than fact. Each can do certain things better than the other. Use that as the criteria on which one to buy. Most companies are smart enough to do that.

--
Professional Politicians are not the solution, they ARE the problem.
Re:It ain't religion. by ksheff · 2004-08-20 05:13 · Score: 1

Here is the benchmark set that the guy was referring to. I think it has a cluster or two on the results page along with some Crays.

--
the good ground has been paved over by suicidal maniacs
Re:It ain't religion. by Performer+Guy · 2004-08-20 05:15 · Score: 1

It's pretty difficult to argue clusters over supercomputers on anything except price so it's pretty reasonable to say supercomputers are better as a matter of informed opinion, even on the stuff clusters are built for supercomputers win, but they're just not cost effective. Cray seems to claim this new system is more cost effective but they would say that. There are maybe a couple of things like modularity but even supercomputers allow this now, creating multiple virtual systems or even physically splitting them.
Re:It ain't religion. by Ilgaz · 2004-08-21 00:11 · Score: 1

There are some guys comparing SGI top workstations to Dual Xeon Dells...

Give it up, I gave up long time ago :)

Re:*Shock* by jedidiah · 2004-08-20 03:51 · Score: 1

You really don't have any business making any comments regarding what a "computer expert" should or shouldn't know.

Depending on a particular Cray, the tech may or may not be significantly different than a Beowulf cluster. Let's take NUMA as an example. NUMA started at Cray, was acquired by SGI and then sold to Sun.

In those examples, the "supercomputer" is nothing more than what amounts to a fancy cluster. The interconnects are faster. However, you are still just tying together a bunch of big bricks that look remarkably familiar (IOW, like a fat PC).

Don't automatically assume that what Cray is pushing is qualitatitively different than a Beowulf or an Altix.

--
A Pirate and a Puritan look the same on a balance sheet.

Re:Cool, competition by Mistah+Blue · 2004-08-20 03:52 · Score: 1

No, the claim is a supercomputer generally outperforms a cluster. The fact that Linux is running on either/both/neither has no relevancy.

Latency by khrtt · 2004-08-20 03:52 · Score: 3, Informative

It's all very well to mock the I/O of PCI, but that's why we're all imminently moving to PCI Express, at a rather more respectable (current) maximum of 8+GBps rather than 133Mbps... Run a few gigabit ethernets in a hypercube formation and you have some rapid data transfer...

The main reason for supercomputers to exist is not the high bandwidth, it's the latency of the switch. The network hardware that is used in clusters as the interconnect medium (switch) can provide very high bandwidth, but the latency is high simply because you can not have low latency over large distance, and the network hardware is designed to connect over large distances. Even if you put your nodes in the same rack, the 1000000 gigabit ethernet or whatnot stock solution you use to interconnect them, will still take milliseconds ping time.

The supercomputers run on a custom, specially designed switch instead. This design includes a lot of cost and complexity just to get the latency down. This may not make any difference for your typical web-server application, but that's not what the supercomputers are designed for.

Some scientific computations have very low dependency between parts of the dataset. For example, pretty much any simulation or search application does fine on a cluster. Anything that allows you to split the work into a large number of independent tasks runs fine on a cluster. Some scientific applications do not allow the work to be split into independent pieces. Sometimes you just need random access all over your distributed data space, and for such applications the speed of computation is determined mostly by network latency. This is where you need a supecomputer, and no cheap cluster would help.

Re:Latency by Wesley+Felter · 2004-08-20 04:04 · Score: 3, Insightful

That's a nice theory, but Cray's XD1 "supercomputer" uses the same Mellanox switch chips as some "clusters". Cray is splitting hairs to justify their product.

(BTW, I get 100 us ping time on my GigE network, but you're right that that's still 100x too slow for HPC.)

...and in yet more news... by pandrijeczko · 2004-08-20 03:54 · Score: 4, Funny

...drain cover manufacturer says their product is grate...

--
Gentoo Linux - another day, another USE flag.

However by 2names · 2004-08-20 03:54 · Score: 1

if you can get a big enough cluster that will get the work done faster than the supercomputer and still be cheaper, doesn't this override the inefficiency factor?

--
"I'm just here to regulate funkiness."

Re:However by lpp · 2004-08-20 04:07 · Score: 2, Informative

In short, if clustering provides a better/cheaper solution, go with it.

Um, yes. The grandparent and ggp were (I think) inferring though that for that particular application you actually won't be able to be both better and cheaper with a clustering solution.

i.e. if you throw enough Linux boxes into the cluster to be able to achieve the "better (faster)" solution, you will no longer be cheaper.

But I don't think anyone was arguing that even if a cluster is cheaper and faster you should still go with a supercomputer instead.
Re:However by jeff4747 · 2004-08-20 05:23 · Score: 1

if you can get a big enough cluster that will get the work done faster than the supercomputer and still be cheaper, doesn't this override the inefficiency factor?
No. There's a diminishing return for each new box added to the cluster.
Yes, another box can do more work, but it also requires more messaging to do it. Theoretically, you could end up with so many nodes in the cluster that all your cluster can't do anything because of all the messaging.
Re:However by Tony-A · 2004-08-20 17:31 · Score: 1

if you can get a big enough cluster that will get the work done faster than the supercomputer and still be cheaper

The problem is that for the kinds of problems where you need a supercomputer, the bigger the cluster, the slower the cluster.

Clusters work well where, given a system of n equations in n unknowns, you can solve the n equations "one equation at a time".

Re:The issues are progress and long-term usefulnes by Wesley+Felter · 2004-08-20 03:59 · Score: 3, Informative

Only in routed Infiniband networks, which no one uses. The normal Infiniband protocol is very lean and totally different from TCP/IP.

Re:Unfuglify by data64 · 2004-08-20 03:59 · Score: 1

Here's my Privoxy filter that handles this automatically.

FILTER: slashdot turn off top banner s|http(.*)shit.slashdot|$1shit.slashdot|si gU

then I just assign this filter to URLs

{+filter{slashdot} \ } .slashdot.

Re:Two words: by stratjakt · 2004-08-20 04:04 · Score: 1

You dork, Crays are running linux. And a cluster is not a supercomputer.

And you've obviously never worked with either.

As for your explanations, this is exaclty the place for it. Tell us how you can magically share memory over a PCI bus as fast as a Cray does.

--
I don't need no instructions to know how to rock!!!!

My understanding of what Cray is actually saying by einhverfr · 2004-08-20 04:06 · Score: 2, Insightful

Cray makes at least two types of supercomputers according to their SEC forms. These include massively parrallel clusters and vector-based supercomputers. In general massively parallel clusters are less expensive for the number of calculations per sec than the vector-based supercomputers. However, for many applications, the vector-based supercomputers will massively outperform the clusters.

Cray's competitors in the cluster markets include IBM, and their main competitor in the vector-based market is NEC.

I remember reading an article about how the US is losing the supercomputer technology war. But this criticism is best directed at companies other than Cray who are pushing cluster-based solutions to the exclusion of others. It is true, however, that the only company I am aware of in the US which markets these supercomputers is Cray.

--

LedgerSMB: Open source Accounting/ERP

Re:The issues are progress and long-term usefulnes by drinkypoo · 2004-08-20 04:06 · Score: 2, Informative

While your comment is largely informative you are still confusing PCI-Express with PCI-X. They are different things. I know that it's inherently confusing, but still...

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"

Re:agreed by jedidiah · 2004-08-20 04:07 · Score: 2, Interesting

I'm not sure you do either.

A NUMA machine is just a cluster where the wire is in the form of a bus rather than copper or fibre cabling. The communications protocol for the bus may be better optimized for "supercomputing". However, you can do the same thing for a MPP optimized network protocol.

It's all ultimately just wires and protocols.

The total lack of process migration between nodes in a cluster might actually give clusters and edge over some NUMA implementations.

Watching a single process dance around a number of bricks in a Sun 15K can be rather entertaining.

--
A Pirate and a Puritan look the same on a balance sheet.

Re:Cool, competition by Progman3K · 2004-08-20 04:08 · Score: 1

Funny claim to make: The supercomputer would have to be structured internally just like a cluster, with multiple processors overseeing different parts of the calculation, just like any cluster would do. Also, I don't believe a supercomputer can REALLY beat a really big cluster. Of course it depends on how big the cluster is and the latency between nodes.

--
I don't know the meaning of the word 'don't' - J

Re:Cool, competition by Ilgaz · 2004-08-20 04:09 · Score: 1

I saw guys claiming their Dell PCs are better than SGI (as its 600 mhz) nothing would surprise me.

Re:*Shock* by lpp · 2004-08-20 04:09 · Score: 1

Just a thought... if the original experiment (particle level modelling of cloud physics) was better done on a supercomputer instead of a cluster, why would having to run multiples of that problem (differing starting conditions) run any better on a cluster? Mind, in this case I'm thinking of a cluster of lower powered machines. A cluster of supercomputers would obviously work out. :)

Re:Two words: by vidarh · 2004-08-20 04:09 · Score: 3, Insightful

Cray supports Linux. In fact, the supercomputer platform Cray's CTO was pushing in the interview is running Linux. What he's saying is really that supercomputers can handle classes of problems that clusters have problems with.

If your goal is to run simulations where each piece of the simulation depend on large subset of the other pieces, then you will need ridiculous interconnect speeds, and you're likely to end up with something you could have bought from Cray or SGI or some of the other remaining supercomputer manufacturers for a fraction of the price.

Luckily for you and the rest of us many problems can be split into relatively independent pieces, in which case a Beowulf cluster or similar is more than adequate.

If you seriously believe that clusters can compete with supercomputers for every type of problem, you need to think again.

only 1/2 correct, which means... by zogger · 2004-08-20 04:12 · Score: 1

...1/2 wrong. Supercomputers, and before they were called that, just whopper mainframes, used to do "all of the above" computing tasks because that's all there was. Now, stand alone PCs and clusters of them can probably do at least 1/2 the jobs out there, and maybe even a higher percentage. So, of course they (Cray and the others) are concerned for their market, losing half your market has to hurt, and daily the ways you can use smallish commodity hardware computers increases, both in complexity of job and in numbers of jobs. It's also a function of cost. If I can mangle some negatives here, PCs are not getting worse, nor more expensive, nor able to handle less tasks of less complexity. If that was true, so called "super" conmputers would not be in any threat, but they are, because it's happening. PCs and clusters are getting much mo bettah, at a fantastic rate. There is not a 100% replacement for "super" computers yet, but just in the last ten years I would bet a LOT of jobs previously only possible on "super" computers are now being handled easier/better/cheaper on "normal" PCs and clusters of them, and that trend will only continue, and there's only one place that market can come from, and that is the mainframe-stand alone "super" comnputer of yesteryear.

Re:The issues are progress and long-term usefulnes by AnonymuosCovvard · 2004-08-20 04:15 · Score: 1

Yeah, you can build a Linux based world-class supercomputer as a cluster

But to do this, wouldn't Linux first have to be a "world-class" operating system? And we all know that there isn't much chance of that ever happening, right?

Re:*Shock* by ThosLives · 2004-08-20 04:16 · Score: 3, Insightful

Uh, I think he understands, but do you know how long a single current-day [Linux] node would take to compute a cloud simulation? The reason you use supercomputers for this is because it's a really huge set of simultaneous (possibly nonlinear) floating-point equations. Most (if not all) desktop / server type computers are not designed for that type of computation; they're better at nice sequential stuff (like RC5). For example, trying to compute one car crash on my desktop would probably take it on the order of weeks (if not months). A cray will typically do that type of computation in about 12-24 hours. So, do you have 15 computers each taking 2 weeks to crunch 15 different simulations and not get any result for 2 weeks, or do you run 1 simulation a day for 15 consecutive days and make decisions based on the current result? The latter makes much more sense for most of the applications.

Of course, it really does depend on the problem you're facing. Most people who pay for results, though, want results as fast as possible, and that's why supercomputers win for problems that aren't "embarassingly parallel".

--
"There are a dozen opinions on a matter until you know the truth. Then there is only one." - CS Lewis (paraprhase)

Amdahl's law, baby by mekkab · 2004-08-20 04:20 · Score: 1

Why you will always be wrong. Basically, clusters make use of existing networking technologies (higher overhead) and supercomputers (SIMD, MIMD, whatever) are designed to make that overhead as low as possible.

Thus, by definition, for HPC applications that can't be parallelized enough to overcome the communications overhead, super computers will always beat out clusters and have a place.

--
In the future, I would want to not be isolated from my friends in the Space Station.

linux? by bazooka_foo · 2004-08-20 04:23 · Score: 1

Does this mean I can run linux on my Cray?

Re:linux? by blargster · 2004-08-20 11:07 · Score: 1

In a recent meeting with Cray that I attended, it was revealed that the XD1 does indeed run SUSE Enterprise 8 on each dual Opteron node.

Re:Two words: by CommieOverlord · 2004-08-20 04:24 · Score: 2

Hmmm...

1. Cray is definitely pro-linux. It's what their XD1 runs. Though not their bigger computers.

2. There are some problems for which that a cluster can not even come close to achieving the performance of a supercomputer. For a lot of problems yes, for some maybe if you spend a fortune on fancy interconnects, and for some no.

3. If you're commercially building clusters let me know company it is. I'm in the market for a 128CPU cluster and I want to know who not to buy from.

Re:The issues are progress and long-term usefulnes by einhverfr · 2004-08-20 04:24 · Score: 3, Interesting

You might want to read the latest 10-K form from CRAY.

http://www.sec.gov/Archives/edgar/data/949158/0000 89102004000325/v96761e10vk.htm

Here they discuss the limitations of clusters and vector-based supercomputing.

Basically, they offer three types of supercomputers aimed at different markets: vector, massively parallel, and multithreaded. Not really sure why multithreaded means in this context (Microkernel capable of threading itself across many processors i.e. UNICOS/mk?) but they do a decent job of explaining the whole thing:

Cray Research pioneered the use of vector systems, from the Cray-1 to the Cray C90 and T90 systems. These systems typically use a moderate number (one to 32) of very fast custom processors in connection with a shared memory. Vector processing has proven to be highly effective for many scientific and engineering application programs which over the years have been written to maximize the number of long vectors. Traditional vector systems do not scale effectively (that is, increase performance by increasing the number of processors) past a limited number of processors. We currently market one classic vector supercomputer, the Cray SX-6 system.

Massively parallel processing architectures typically link tens, hundreds or thousands of standard or commodity processors to act either on multiple tasks at the same time or together in concert on a single computationally-intensive task. Type T systems connect each processor directly to its own private memory and the programmer must manage the movement of data among memory units and processors. Consequently these systems can be difficult to program. Type C massively parallel systems, unlike low bandwidth clusters, have high bandwidth and low latency interconnect systems and are said to be "tightly coupled" -- the Cray T3E, Red Storm and the OctigaBay product are examples of balanced high bandwidth purpose built systems that employ standard microprocessors.

The Cray X1 system is revolutionary in that it is the first supercomputer that combines the attributes of both vector and high bandwidth massively parallel systems. The Cray X1 system has up to 64 processors per cabinet and a shared memory. The Cray X1 system can run small problems as a vector processor would or, by focusing many processors on a task, the Cray X1 system operates as a massively parallel system with a system-wide shared memory and a single-system image. The Cray X1 system is designed to provide efficient scalability and high bandwidth to run complex applications at high sustained speeds. The Cray X1E system furthers this architectural design with increased processor speed and capability.

Our MTA-2 project for NRL is designed to have sustainable high speed, be broadly applicable and easy to program, provide scalability as systems increase in size and have balanced I/O capability. The multithreading processors make the MTA-2 system latency tolerant and, with the system's flat shared memory, able to address data anywhere in the system.

--

LedgerSMB: Open source Accounting/ERP

Re:The issues are progress and long-term usefulnes by LurkerXXX · 2004-08-20 04:24 · Score: 2, Informative

Wow. 8+ GB/s. Nice.

Unless I'm now out of date, the last figures I saw said the CrayLink Interconnect can do 102 GB/sec. That's Just a tad bit more, don't you think? No messing with masses of gig ethernet to crossconnect them. It's just done.

Let's do some bandwidth math... by JBMcB · 2004-08-20 04:26 · Score: 3, Interesting

From Cray (From XD1 page):
"A 96 GB per second, nonblocking, crossbar switching fabric in each chassis provides four 2 GB per second links to each two-way SMP and twenty-four 2 GB per second interchassis links."

-So for a dual-opteron XD1 processor unit, there is 8GB total bandwidth available.

Total aggregate PCI bandwidths (Accepted standards):

PCI32 33MHz = 133MB/s
PCI32 66MHz = 266MB/s
PCI64 33MHz = 266MB/s
PCI64 66MHz = 533MB/s
PCI-X 133MHz = 1066MB/s
PCI Express = 200MB/s (Per slot)
PCI Express x16 = 3000MB/s (Usable bandwidth)

-So for PCI Express x16 we're talking 3GB/second

SMP Opteron with two PCI Express x16 slots can do 6GB/second aggregate bandwidth. A couple of Infiniband links can easily saturate that. I'm sure this all costs quite a bit less than Cray's propriatary stuff.

--
My Other Computer Is A Data General Nova III.

Re:Let's do some bandwidth math... by Anonymous Coward · 2004-08-20 08:54 · Score: 2, Insightful

That's all very nice, but have you ever heard of latency?

Re:Cool, competition by vidarh · 2004-08-20 04:27 · Score: 1

No it's not a funny claim to make. One of the defining features of a supercomputer vs. a cluster is tightly intergrated low latency high bandwidth interconnects, whereas clusters typically rely on software support (networking etc.) to shuffle data over relatively low speed, high latency communications channels.

Not the only player in that game by CarrionBird · 2004-08-20 04:27 · Score: 1

SGI may have something to say about those ideas.

--
Free Mac Mini Yeah, it's

Taken a little out of context by UnknowingFool · 2004-08-20 04:28 · Score: 3, Insightful

In a way he's right. Reading the whole article, it seems apparent that he's talking about certain high performance applications. Clusters are not always the best way to solve a problem. For problems that can broken down into small independent tasks like SETI, clusters are a good solution. Clusters do have their optimization challenges with latency, bottlenecks, etc. For simulations where the tasks are dependent on each, these bottlenecks add up. The individual nodes spend as much time communicating with each other as they do computing. There are also problems that cannot be distributed. In these cases clusters are not the right solution and it may not be cost effective to use a cluster.

--
Well, there's spam egg sausage and spam, that's not got much spam in it.

Re:Whee! by superdan2k · 2004-08-20 04:37 · Score: 1

Hi flametroll. Please re-examine my statement. Note that I did not say Linux clusters were superior to Crays in all aspects. I merely mentioned that Cray's marketshare was slipping and that, par for the IT course, the CTO came out and stroked his ego in public. In the future, please do not read into the statements of others.

--
blog |

Re:The issues are progress and long-term usefulnes by Coolmoe · 2004-08-20 04:38 · Score: 1

While your statements may be true and valid I don't think that I would want to base a business off selling a machine that costs seven figures and would only be marketable to that remaining 0.1% and expect to be around in a few years. I think that the flexability of the PC in clusters is good enough for most applications and they can throw student labor and people with phd's at the rest on the software end. Just like the PC market itself windows is good enough to do the job but nobody would say that it is perfect.

--
Got hosting

not just Cray using Linux for SSI systems, SGI too by grey1 · 2004-08-20 04:38 · Score: 1

don't forget the SGI Altix range - scaling to 256 processors in a single image

and there was the recent PR about the contract with Nasa for a 10,240 processor server (though that's not 10,240 in a single image...)

--
"we demand rigidly defined areas of doubt and uncertainty!"

Re:*Shock* by Rei · 2004-08-20 04:43 · Score: 1

Because clusters are cheaper, per raw unit power. Because they're made of commodity mass-produced parts.

--
No matter how kind you are, German children are kinder.

Re:*Shock* by Rei · 2004-08-20 04:46 · Score: 1

If you're going to run over a range of inputs, you're going to need the total CPU power anyway. The only advantage you'd get with a supercomputer is that you'd get your first results out sooner than the later results, instead of all results coming out at the same time from a cluster.

And forget not the reason *Why* you'd want to use a cluster. Namely, commodity hardware is *cheap*. Incredibly bloody cheap, compared to supercomputers, for the same amount of total cpu processing time.

--
No matter how kind you are, German children are kinder.

Re:Two words: by iPaul · 2004-08-20 04:52 · Score: 4, Interesting

Not quite true. First off, you get much higher bandwidth between processors using proprietary (NUMA) based interconnects than you can with commodity hardware. Why? Because you can optimize for your situation. Second you can exploit things like cache-coherency between processors (even if they're in different "nodes") and therefore true shared memory. So, a 1024 processor SGI Altrix, or a 256 processor Cray is one computer as far as the OS and user-land stuff is concerned.

There's another advantage Cray has on the SV and X series and that's a vector unit on the processor. That allows you to conduct operations on arrays of numbers at once instead of having to cycle through the numbers in a loop. For example, the dot_product between two small arrays might be accomplished with one or two instructions, as opposed to a loop. Apple's AltiVec is also a vector unit.

If you took money out of the picture it would be easier to deal with a big-honkin' super computer like an SGI or Cray rather than a cluster. One computer is easier to manage and you could always use threads and plain old heap memory (which is much faster than message passing over a network).

Add money back in and 500,000 goes a lot farther in raw compute power when you're buying racks of DELLs and infiniband interconnects. However, depending on the application, you may be faster, slower, or even dog-slow compared to the cray. If you need the answer today, and the $ is not a factor, go to Cray or SGI with a blank check. If you have to balance cost and time, then a cluster might be better.

Essentially, it boils down to how much communication you do between nodes. Cray does it orders of magnitude faster than off-the-shelf stuff. If you hardly ever pass messages between nodes, clusters are fast. If you have to pass a lot of messages between nodes, one big computer will trounce lots of little ones.

--
Leave the gun, take the cannoli -- Clemenza, The Godfather

Re:Unfuglify by data64 · 2004-08-20 04:52 · Score: 1

that should be

s|http(.*)it.slashdot|$1shit.slashdot|sigU

I forgot to turn off the filter before posting :(

Re:The issues are progress and long-term usefulnes by stevelinton · 2004-08-20 04:53 · Score: 1

Of course the cost of this kind of networking technology does eat up quite a lot of the cheapness factor. In many clusters the interconnect costs more than, sometimes several times more than, the processors,memory, etc.

As always, it depends on the application by Orp · 2004-08-20 04:53 · Score: 4, Insightful

Both clusters and big iron have their place. I am a meteorology professor and my current research involves high-resolution numerical modeling of thunderstorms. For a problem where the domain decomposition is straightforward and internode communication isn't your bottleneck, clusters are great. One huge advantage of clusters is that they are cheap and it isn't too big of a deal to get a grant together to buy the hardware, and it's YOURS and nobody else's. A huge disadvantage to big iron is that you have to share it with about a hundred other researchers. Waiting in a queue for three days only to find you goofed up in your startup script (and the model exits immediately) is NO FUN (cf the Regatta at NCSA).

I am currently running a model using legacy FORTRAN 90 code which was written before there were clusters. It does use OMP but OMP sucks and is no substitute for code which is written with MPI in mind. The model as it currently stands requires big iron to do big runs, and it is inefficient, but it works and sometimes I just need to do science and not model development. I am working on MPI-izing the code; no small feat, but the rewards would be quite worth the effort.

In summary, both clusters and big iron have their place. Folks have a habit of making a false dichotomy with regards to these two options. I wouldn't trade my cluster for the world (currently doing parallel POV-Ray rendering of my 3D thunderstorm data, see my web link and an upcoming [not sure what month] Linux Journal article if interested) as it is perfect for much of what I am doing right now and I don't have to share it with anyone. But I will also use big iron when necessary.

--
A squid eating dough in a polyethylene bag is fast and bulbous, got me?

Re:As always, it depends on the application by jonbelson · 2004-08-20 21:21 · Score: 1

>I am currently running a model using legacy FORTRAN 90 code which was written before there were clusters.

Clustering was around for at least 10 years before Fortran90.

--Jon
Re:As always, it depends on the application by Orp · 2004-08-21 04:16 · Score: 1

The code began as a simpler model written in FORTRAN 70... or maybe even earlier... in the late 1970's. It has since been converted to F90.

--
A squid eating dough in a polyethylene bag is fast and bulbous, got me?

electricity by wobblie · 2004-08-20 04:55 · Score: 1

does anyone stop to think about what that 200 PC cluster costs in power? quite a bit i think ...

Re:The issues are progress and long-term usefulnes by Tired+and+Emotional · 2004-08-20 04:58 · Score: 1

SGI has some real interesting Linux based "supercomputers" that they were showing at LinuxWorld. I don't know if the Cray units are the same hardware - probably they are.

There is a very fast NUMA (non-uniform memory access) interconnect in each case (about the size of a washing machine). So you can access memory on another board only slightly slower than on your local memory.

You can have up to 4 processors per board. Then you can connect together multiple washing machines with (I think) Infiniband.

You still want to access local memory if you can as that gives lowest latency. Work is going on in the kernel to better support this kind of architecture. Linux (or at least open source) is really important to these machines because you do need to be able to modify the kernels.

A bunch of PCs work real well when your problem can be partitioned. What kills you is high levels of synchronisation activity - whether signalling or updating because that's when latency kills you. For some apps you may have as much or even more compute horsepower in your PCs than the supercomputer but it spends all its time twiddling thumbs.

So for many hard applications these machines really are the bee's rollerskates.

--
Squirrel!

Not only that... by 59Bassman · 2004-08-20 05:00 · Score: 1

I sat through an Altix demo a couple of months (SGI's technology demonstrator truck is cool...). I believe they were saying that the current Altix cluster can run up to 128 CPU's as a single Linux image - pretty much the world's largest SMP-kernel.

Current development looked to be 512 CPU's before too terribly long. I like the system, but it's EXPENSIVE when you think about the fact that you're still buying x86 architecture - $6 - $8K per CPU.

To the tune of "Every sperm is sacred" by zarthrag · 2004-08-20 05:03 · Score: 1

Every Cray is sa - cred.
Every Cray is greaaat...

If a Cray is was-ted,
Paul gets quite i-raaate!

(Thank you, thank you very much.)

--
Why can't all fpga/microcontroller manufacturers just release free optimizing compilers???

Re:The issues are progress and long-term usefulnes by caseih · 2004-08-20 05:03 · Score: 1

The point is, though, that the Cray supercomputers are vector supercomputers, whereas Linux clusters and other similar machines are not. Currently it seems to my that most clusters are very remeniscant of computers of old days where you run programs in batch mode. Cray is pointing out that running stuff in batch mode, even massively parallel, often cannot match the flexibility of the cray vector system.

Re:The issues are progress and long-term usefulnes by ananke · 2004-08-20 05:03 · Score: 1

on a side note, have you worked much with myrinet? personally, i find it to be the most buggy thing i've ever seen. their hardware seems to fail more than an antique chevette. i just wonder if anybody else has a similar experience with myricom's products, since at this moment i doubt i'll ever again invest in their hardware.

--
--- d'oh

Re:*Shock* RTFA by Tandoori+Haggis · 2004-08-20 05:05 · Score: 1

No! What he is trying to explain here is that a Linux cluster using "standard" hardware, (eg x86 based), suffers from the usual PCI related bottlenecks that standard hardware has. Therefore it cannot be as efficient as a system specifically designed for supercomputing which has no PCI bottlenecks.

If you read the article you would note that Cray is promoting their new LINUX BASED SUPERCOMPUTER....

--
My hyperlinks aren't worth the paper they're printed on.

Re:Yes he's talking FUD by Jeremy+Erwin · 2004-08-20 05:12 · Score: 1

In linear algebra, there are often many algorithms that could be used to solve a problem, but the obvious algorithms require many more calculations than the clever algorithms. For instance, you don't solve A*x=b by calculating inv(A)*b.

Just as it would be embarrassing for the mathematician to recommend calculating inv(A) for a one-off solution of A*x=b, it would be embarrassing for a computer scientist to recommend a freon cooled million dollar supercomputer when, with a slight optimization of the algorithm, the solution could be calculated
with a cheap cluster of PCs interconnected with 100baseT.

Queueing issues by ehiris · 2004-08-20 05:16 · Score: 1

I'm not a subject matter expert but it seems like the Cray is a M/m/X (X>=8) system while Linux clusters are multiple M/m/x (x=4) systems.

It seems to me that the mathemetical limitation of how much workload a Cray can handle is a lot worse then a Linux cluster.

Can it be that the price/performance issue that he is talking about is just for specific applications?

Re:Queueing issues by argent · 2004-08-20 05:34 · Score: 1

Can it be that the price/performance issue that he is talking about is just for specific applications?

I think that's pretty much a restatement of his comment that clusters give you better p/p for highly parallelizable applications. He can point to those applications and say "except for these apps, we're the better deal". The cluster folks can point to the stuff they're not good at and say "maybe Cray is better for those apps, but we're the better deal otherwise". You're both right, it's just a matter of the spin you're applying.

Doom III by Yousef · 2004-08-20 05:22 · Score: 3, Funny

Finally, a machine capable of running Doom 3!

--
-- "To ask a question is to show ignorance; Not to ask a question means you'll remain ignorant."

Re:The issues are progress and long-term usefulnes by Tiosman · 2004-08-20 05:25 · Score: 2, Interesting

I work a lot with it, like ~3000 customers, almost half of them are industry (non academic or gvt).

You found bugs ? Care to share them ? Hardware failed ? Did you get it replaced ?

Can you give me the tech support ticket numbers so I can see if your complaints are reasonable (and have been addresses) or are just plain FUD ?

Re:*Shock* by mattyrobinson69 · 2004-08-20 05:28 · Score: 1

beowolf (is that spelt right) is not the only type of cluster. there are clusters such as openmosix that will cluster applications that aren't even written specially - its a kernel module. if i wanted a stupidly fast kde desktop, for example, i could get a room full of cheap x86's and cluster them using linux and openmosix.

Re:*Shock* by CommieOverlord · 2004-08-20 05:32 · Score: 1

Right and no. If engineers/scientists need to wait for results, then two things happen:

1. Run a cluster and wait two weeks. Twiddle thumbs in mean time.

2. Run on supercomputer. Start processing next day.

For a lot of companies, it is justifiable to spend more money to acheive #2.

Not too mention that a lot of large programs that can't be parallelized have high memory requirements. If your program needs 50GB of memory and can't be parallelized to run across a cluster then you have no choice but run on a supercomputer, since no cheap system can handle 50GB.

Deadly serious. by NoMoreNicksLeft · 2004-08-20 05:34 · Score: 1

There was a guy about 90 miles away, offering one on ebay for $7000, it never sells (he tries every 12 months or so). If I suddenly landed a job for $60k a year, I'd almost certainly buy it from him. Rent a Uhaul or something, go pick it up. I've heard of universities practically junking them.

Yes, Cray's are one of my saved ebay searches...

Re:agreed by JonAnderson · 2004-08-20 05:38 · Score: 1

Watching a single process dance around a number of bricks in a Sun 15K can be rather entertaining.

Ever heard of priocntl, psrset, pbind? If you don't like the way your resources are being used then change it.

Re:*Shock* by CommieOverlord · 2004-08-20 05:39 · Score: 2, Insightful

Because clusters are cheaper, per raw unit power.

But if the supercomputer is more efficient per raw unit of power, then the price per unit doesn't matter.

I work for living with HPC, buth with clusters and with large SMP machines. The cluster is nice, but there are some things than can _only_ be run a large SMP machine or are much, much faster on a SMP.

Re:This is FUD by argent · 2004-08-20 05:42 · Score: 1

I don't see that your rant says anything different, other than you're giving more emphasis to problems that are more parallelisable, and he's giving more emphasis to the ones that aren't.

Oh, and you're implying Cray's product's vaporware, and he's implying clusters are less reliable, so I'll grant you both one FUD point. Happy?

Re:The issues are progress and long-term usefulnes by JonAnderson · 2004-08-20 05:43 · Score: 1

Hmmm. I would say also that first byte latency is also very important in a lot of/most workloads. Clusters can mask this on some workloads through parallelization. Introduce interdependencies and it loses some of it's advantages. I know I will get flamed for this but I think Sun understands this quite well hence the philosophy behind throughput computing and their next gen core designs (Niagara, Niagara2, Rock)

Re:Whee! by the_mad_poster · 2004-08-20 05:44 · Score: 1

Actually, it was just your regular run-of-the-mill "I hate slashbots, so I'm going to point out the obvious to you like you're a small child because you displayed the cognitive ability of a small child" troll.

And, actually, truth-be-told, you said this as the follow up to the stroking-ego thing:

But I guess that's the best they can do...

Which is wrong. For the reasons I outlined. Now, please go back to your regular DMCA-RIAA-MPAA-Microsoft bashing while I slink back into the woodwork.

k thx!

--
Alito: A vote for Alito is a punch in the eye to put that bitch back in her place!

Re:The issues are progress and long-term usefulnes by JonAnderson · 2004-08-20 05:50 · Score: 1

It depends what your TCO estimates for an installation are. Typically, your buying costs aren't the signifcant part of fielding a solution. You need to look at how much it is going to cost to run (power/heat/real estate/maintenance) and how it costs you when it ain't running. Also, just throwing this out there as I don't have any specs to hand, does anyone know if commodity hardware is accurate enough (i.e IEEE FP precision etc.) to be used in all cases a 'super computer' (sic) is used?.

Re:*Shock* by jpmahala · 2004-08-20 05:59 · Score: 1

In other news:

Microsoft CEO Bill Gates announces today that their Windows Operating System is *really* great!

Re:This is FUD by deadline · 2004-08-20 06:03 · Score: 1

Exactly. Cray would love to sell you a system to do CFD, but guess what, a cluster works just as well is available now and costs less.

Read carefully. I did not say it was vaporware. I saw a real system. What I said was, trying to keep up with the commodity curve is battle that has been lost by many an HPC vendor. The key is time to market. By the time you get your latest and greatest to market, the commodity market has passed you by.

--
HPC for Primates. Read Cluster Monkey

Re:The issues are progress and long-term usefulnes by DustMagnet · 2004-08-20 06:11 · Score: 1

I don't think that I would want to base a business off selling a machine that costs seven figures and would only be marketable to that remaining 0.1% and expect to be around in a few years.

I would. That 0.1% has little choice but to pay big for these computers. If you are the only one making them, it gets even better. Cray has been selling to a niche market for decades.

--
'SBEMAIL!' is better than a goat!!

So why did you say it was FUD? by argent · 2004-08-20 06:12 · Score: 2, Insightful

You and him, you're saying the same thing, you're spinning it your own way, but the actual content is the same. So why are you describing his as FUD?

Re:So why did you say it was FUD? by deadline · 2004-08-20 06:56 · Score: 1

My response was a rant, an opinionated response. I'll argue verifiable facts to support my case if need be. But crafty marketing things like this interview are FUD. I did not use a phony interview to get my point across.
For example the question which used the word some:

Is it true that some applications don't run well on cluster systems? Why not?
Gets turned into many in the response:
Terry: Yes, that's true for many HPC applications.
Clever. If the guy owned it as a vendor biased opinion, then I would give it a bit more respect. But he can't because the marketing department probably orchestrated this FUD campaign.

--
HPC for Primates. Read Cluster Monkey
Re:So why did you say it was FUD? by argent · 2004-08-20 08:34 · Score: 1

Dude, you have to learn to relax. Everyone does this, all the time, and most people can figure it out. In the catalog of marketing evil, this barely qualifies as a venial sin... it's not worth a rant. Keep rants about FUD for stuff that really warrants it. You'll feel better, honest.

Re:*Shock* by Shinobi · 2004-08-20 06:17 · Score: 1

They're not really scared of clustering for reliability reasons. You can do big machines that are far more reliable than the cluster made of cheap components. In fact, it's being done all the time.

First of all, the Google setup isn't the type of cluster that is being compared to Cray supercomputers, and it's also a simple form, with little communication between nodes. Secondly, in a HPC cluster, the reliability of each node becomes far more important, since a failure for a node can mean hours or even days of having to performm calculations again. If you're unlucky, and you have a project where each node needs a lot of data from the other nodes, and a node goes down, you've stalled the entire project for quite a while. If a CPU dies in a supercomputer, process state etc can be migrated over to other CPU's.

Big Iron by totallygeek · 2004-08-20 06:39 · Score: 1

I once heard this about big iron mainframes, but seems fitting here too: You cannot replace a bull with 10,000 chickens.

It will always be the right tool for the job. If a company wants many machines and has a service agreement with someone to monitor, replace (or not), etc., then that works for them. If a company wants all that in one power draw in a large box, they get a supercomputer. There are also many processing models that clusters of smaller machines can not provide solutions to.

That having been said, of course Cray is going to say that smaller boxen clusters = bad.

--
Click here or here.

Re:*Shock* by undercanopy · 2004-08-20 06:46 · Score: 1

no, because in a cloud simulation, for each raw unit of power that you use there's a vast amount of communication that has to go on between nodes before they can keep working (checking for collisions after recent movement of local particles that were computed on another node, for example) that is either orders of magnitude faster on a supercomputer or perhaps just not necessary due to a large shared memory in a supercomputer. so yes, for each raw CPU cycle the cluster price is cheaper, but in simulations where the outcome of a computation on one processor affects the computation on other processors, many many many of those cycles are wasted in commodity clusters because they just can't talk fast enough between nodes to work efficiently and will therefore take far longer. sure, the overall problem may be parallelizable, but when the underlying details are so bloody complex, you

--
-- D-23994, Muff#2613

Re:The issues are progress and long-term usefulnes by gardyloo · 2004-08-20 06:51 · Score: 1

Run a few gigabit ethernets in a hypercube formation and you have some rapid data transfer...

Serious question here (yes, wrong place to ask one of those): "Hypercube formation"? Is this just a cubic lattice where the nodes are relatively densely packed so each can communicate with several others over not-too-far distances? What makes it "hyper"? Someone with an expertise in communications/computing theory help me out here!

Re:*Shock* by Retric · 2004-08-20 06:53 · Score: 1

Not if your task needs 10+ GB of ram.

Think of this your simulating 100 plain crashes for a new design.

The plain takes 40GB of memory (courent position and stresses on each person and component)

Let's say each "frame" of the crash takes 10 Gflops of CPU time to finish and you need 5,000 fraims to run the simulation. Now you could run this on a cluster of 100 systems BUT if they don't have 50 GB of ram it's going to take 1000x as long because you need to fetch the whole simulation from disk 5,000 times vs a cray which whould load it into ram once and be done with it.

Re:The issues are progress and long-term usefulnes by Wesley+Felter · 2004-08-20 06:56 · Score: 1

A Cray XD1 is quite different from an SGI Altix.

Re:The issues are progress and long-term usefulnes by ultranova · 2004-08-20 07:08 · Score: 1

It's not just the speed of the data transfer, it's also the latency of the interconnect. A lot of scientific codes will pass around a lot of little messages, and GigE is fast for bulk transfer, but it's not so good for that. That's why there are companies like Quadrics, Myricom, etc... Infiniband should fix this, but you'll want a big infiniband switch.

A nasty thing called physics says that latency will always be higher when moving messages over the cable than over the machines internal bus because the cable is longer.

That said, isn't the obvious solution to this problem a "smart" clustering software that puts the processes that exchange the most messages with each other into the same computer ? A bit like NUMA, but replace "memory" with "message".

Of course, if someone absolutely must write code that passes around a zillion messages, then it's going to be slow no matter what... So our smart clustering software should be really smart and arrange the threads so that a single machine contains threads that are likely to block on messaging at different times so you can run one as the other waits.

And of course, if you get enough bandwith and low enough latency, you can treat a cluster as a big NUMA machine (syncing shared memory areas over wire as neccessary).

--

Forget magic. Any technology distinguishable from divine power is insufficiently advanced.

Target audience... by umshaggy · 2004-08-20 07:12 · Score: 4, Insightful

Many posts have pointed out the true fact that supercomputers are better for certain jobs that are not suited to clustered solutions (and visa versa).

Most slashdotters are technical enough to realise this...but...we are not the target audience of the original article. Such articles are meant for high level executives and relatively non-specialist managers who don't always hear all sides of the story. Every day these people are seeing articles and news blurbs stating how the latest linux cluster is as good or better than a supercomputer, and gee isn't that swell! While such press is good, and important, not everyone hearing that implicitly understands that such reports only apply to SOME applications.

So what the original article is, is a message from one executive to other executives trying to clarify the situation. Basically saying "hey, just because Wired ran a story that says linux clusters are the next best thing since sliced bread, doesn't mean that this is the best solution for you. Now, let us talk about what you need."

I see nothing wrong with this. I read the article, and found nothing in it that was false.
It is good because sometimes an exec will listen to a fellow exec when they won't listed to the advice of their own techs because of something said exec read in Scientific American.

Welcome to corporate america boys and girls.

(Disclaimer: Wired and American Scientific were random examples. I know of know articles in either publication about linux clusters. Both are fine publications.)

--
Did you buy a Neuros today?

Re:*Shock* by dasmegabyte · 2004-08-20 07:17 · Score: 1

There are also tasks which are quite paralellizable and quite sequential...but that are so heavily interrelated that they require a fast interconnected bus.

The major benefit of Linux clustering kind of falls apart when the bulk of the cost of your project becomes the interconnect. Mainframes are designed to communicate large data between multiple processors wheras standard servers are designed to communicate over a relatively slow I/O bus. If you're paying a few hundred grand for a top of the line control system and interconnects and that's STILL your major bottleneck...well, you should have had a Cray.

Of course, the Cray also has limited expandability...I guess the point is, do your fucking research, analyze the needs from a total cost perspective and don't go with Linux just because you can imagine a Beowulf cluster.

--
Hey freaks: now you're ju

Re:*Shock* by PenguiN42 · 2004-08-20 07:18 · Score: 1

"Networked clusters are useful only when the task is parallelizable, and each subtask is computable on a single node. Cloud physics is not like that. Cracking RC5, for instance, is."

The issue isn't parallelizability. *All* supercomputers are designed to exploit large amounts of parallelization (and *all* supercomputers will do poorly if the task isn't parallelizable).

The issue is internode communication. If your simulation requires a *lot* of transmitting data between each processing stage (like "cloud physics" probably would) then a more tightly-integrated parallel machine (like a cray) would be better. If you can do more computation before having to communicate (like cracking RC5), then a cluster will be more cost effective.

--
The following sentence is true. The preceding sentence was false.

Re:The issues are progress and long-term usefulnes by Coolmoe · 2004-08-20 07:20 · Score: 1

Thats true there arent a lot of choices but IBM and Sun are also making big iron as well. Cray is not the only way to go nowdays. Even if you land an order for one machine the only income as a company you will have is a support contract. How long are you going to have a company if you only sell 2 units or less a year? Don't get me wrong I would love for the kind of research that these machines excel in to pick up in the US but it doesent seem to be happening. Worse than that I would hate to see Cray computing to become another government subsidy!

--
Got hosting

Re:*Shock* by Rei · 2004-08-20 07:21 · Score: 1

Once again, a person misses the fact that I *Did Not Suggest Running A Single Cloud Simulation On Multiple Computers.*. I suggested, given a number of starting inputs for cloud simulations that need to be run, a separate simulation on each element of the cluster. Networking bandwidth is essentially negligable.

This is just one example of how having a single parallelizable task makes a cluster the cost-efficient choice.

--
No matter how kind you are, German children are kinder.

Re:*Shock* by Rei · 2004-08-20 07:24 · Score: 1

Then get twice as many commodity PCs and run every simulation on two separate machines in case on edies. Get 5 times as many. Get 10 times as many. It doesn't matter, it'll still be cheaper than the supercomputer.

--
No matter how kind you are, German children are kinder.

An analogy by lildogie · 2004-08-20 07:25 · Score: 1

A train does not have the same price-performance ratio as a ship.

But you don't use a ship when a train would do, and vice versa.

As the parent poster said: clusters and supercomputers are not the same.

Re:An analogy by Performer+Guy · 2004-08-20 09:57 · Score: 1

That has to be the most flawed analogy ever.

Re:*Shock* by BinxBolling · 2004-08-20 07:36 · Score: 1

As I've mentioned elsewhere, all you need is a single parallelizable task for a cluster to be worth it.

This makes little sense. Parallelizability is not a simple yes or no question as you seem to think. Rather, it's a matter of degree; You express parallelizability of a computation as an integer indicating the maximum number of parallel threads of execution it can be split into.

A computation whose dependencies permit you to benefit from running two seperate threads of execution is technically parallelizable, but it's not going to benefit from a cluster unless your "cluster" consists of two machines and you're comparing the cluster to a uniprocessor with the same performance as one node of the "cluster".

Re:*Shock* by undercanopy · 2004-08-20 07:43 · Score: 1

so what's the advantage to having it on a cluster? why not just run 8 simulations on 8 completely separate PCs?

do you have any idea how monstrously complex these simulations are and how long they take to process? the point of using a supercomputer is that these problems take too damned long (read: years) to run on a PC. no, sorry.. commodity hardware just does not cut it in these types of situations.

--
-- D-23994, Muff#2613

Re:The issues are progress and long-term usefulnes by fatcatman · 2004-08-20 08:20 · Score: 1

In fact, Cray's XD1 "supercomputer" uses the same Infiniband interconnect as some "clusters"; Cray just has better NICs.

No, Cray doesn't have "better NICs." In fact it doesn't really have "NICs" at all, not in the sense that we think of them. Your typical Infiniband card hangs off a PCI bus. PCI bus = major bottleneck, especially when you're talking a couple dozen Infiniband connections.

The XD1 is cool because the Infiniband is right on the hypertransport bus of the Opteron CPUs. It's damn fast.

Re:Whee! by scoobrs · 2004-08-20 08:26 · Score: 1

Whee! Could we mod this funniest flame war? I want to watch the flames burn some more. Haha, funny slashbots. And remember folks, the easiest way to win the Turing test is to pick dumber test subjects.

--
-Those who would give up essential liberty to purchase temporary safety deserve neither. -Ben Franklin

Hypercubes defined by billstewart · 2004-08-20 08:33 · Score: 1

An n-dimensional hypercube has 2**n vertices (x1, x2, x3, ... xn) where xi is 0 or 1. Your familiar 3-dimensional cube has (x1,x2,x3). Each vertex is adjacent to n other vertices - each one of them has one bit different from your starting vertex. Each vertex is at most n hops away from each other vertex. (Looking at it from the other side,

In a hypercube computer architecture, your put a node at each vertex, and a communication channel to each of the n adjacent vertices. That way, you don't need a huge number of communication channels per processor, i.e. log2(number of processors), at a cost of sometimes having to pass data N hops.

There are other popular architectures out there. Simple 2-D grids match a lot of applications, and require 4 comm channels per processor no matter how many processors you have. The old Transputers were built specifically for this. A minor extension to the 2-D grid is the torus, which you make by connecting the top and bottom of your grid together and the left and right ends together. (It basically doesn't cost any extra, since you had the spare channels at the processors at the edge, plus you get to say "ooohhh, donuts!"). And there are a bunch of applications with dense clusters of processors (for instance, N-way shared-memory nodes) with the clusters connected in hypercubes. Butterfly networks are another shape that was popular for a while - they look sort of FFT-like, and they basically keep the log-n number of communication channels while reducing the bottlenecks.

--

Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks

Re:electricity and Blade Servers by billstewart · 2004-08-20 09:08 · Score: 1

A popular PC cluster architecture the last couple of years has been blade servers - boxes of N blades, each with some RAM and a laptop disk, interconnected by GigE or equivalent. They're fun things to run in your own data center, but they're really annoying to put into conventional internet colocation data centers, because they use a lot more electricity per square foo of floorspace than the "rack full of 3-U PCs and Cisco routers" that most of them designed for, or the "rack full of 1U PCs" that we all scrambled to support when those came out.

There are some blade servers that use low-power CPUs like Transmeta to get the tradeoff of more MIPS per watt. E.g. 50 watts per processor gives you 10KW, as opposed to 300 watts-> 60KW. At 10 cents per KWH, a 10KW cluster is about $1/hour, which is cheaper than the grad student you've got managing the thing. (In practice, you often need to double or triple the power costs, because you also need cooling to get rid of all the heat from the CPUs.)

Obviously a supercomputer is a bit different, because you don't need all the disk drives, but CPU and RAM are using an increasing amount of power compared to disk drives. (So does high-end video, which obviously you don't need unless you're playing games like using the video processor for number-crunching instead of the main CPU.) But the power problems are still just as annoying. If you're doing anything custom-built for supercomputing, you'd obviously build boards with multiple CPUs and faster interconnects and skip all or most of the disk drive stuff, so that lets you fit more CPU per 1U or 3-4U of rack space. And you might build a system with lots of DSPs instead of general-purpose CPUs, which would probably get you more MIPS per watt.

Database supercomputers, on the other hand, look surprisingly like blade servers. The old Teradata machines had something like 488 CPU+disk units connected by a fancy back-end switched network, plus a front-end set of CPUs for managing work and communicating to the outside, with algorithms designed to split up queries intelligently across the processors. And of course there were the same kinds of arguments about database machine clusters vs. big iron mainframes vs. loosely-coupled clusters.

--

Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks

Re:*Shock* by Shinobi · 2004-08-20 09:26 · Score: 1

But, the bigger the cluster becomes, the more nodes and interconnect fabric you need just to keep up with the degradation of performance in the cluster for tasks that are not easily made parallell. As it is, for non-trivial tasks, a supercomputer is considered to have a good performance if they get 60-70% of the theoretical peak performance. With clusters, you're lucky if you come anywhere near close to that. And note that I said non-trivial tasks. The task of simulating gas flow or particle physics are non-trivial problems for example, while tasks such as SETI@Home and RC5-72 are quite trivial, they just require CPU power but can easily be split into separate chunks, and the only communication needed is between the individual notes and the master node handing out and receiving the chunks.

I'll use an example from my experience again: I sometimes freelance doing architectural visualizations, air flow simulations in buildings due to heating etc. I had the opportunity to try out my own code on both a Athlon cluster running on Myrinet and on a SGI Origin 3800. On both, I used 16 CPU's. Despite the superior FP performance of the Athlon CPU's, the Origin finished the work in less than half the time the clustered Athlons needed, despite me making specific changes that would allow the code to work better on the cluster.

Exploiting parallelism vs. efficient computation by billstewart · 2004-08-20 09:39 · Score: 2, Interesting

If you're trying to run 1024 cases with different starting conditions, then a 1024-processor cluster lets you run them all at once. A supercomputer with the same price as the cluster probably has only 1/10th the raw GFLOPS as the cluster, because supercomputer designs are much more complex and commodity cluster hardware is dirt cheap.

So if each cluster CPU can run a single instance the problem efficiently, it's 10 times as cost-effective to use the cluster.
On the other hand, if a single instance of the problem doesn't really fit in a cluster CPU, it might be 1/10th as efficient as the supercomputer CPU, because you're spending more time doing swapping or communications to get the numbers to crunch than you are crunching them, in which case it's a tie with the supercomputer.
But on yet another tentacle, if it's 1/100th as efficient to use the cluster CPU as the supercomputer CPU, because you have to spend a LOT more time swapping, then the supercomputer is a big win, 10 times as cost-effective as the cluster.

Back in the mid-80s, my department had a huge VAX 780 with 4 MB of RAM (16KB chips, I think), and we were working on a network simulation system that needed 12-14 MB RAM to run. I spent a while playing with different versions of 4.1BSD and Unix System VR2, but fundamentally the machine spent all its time swapping data in and out of disk, and the main performance with was helping the physics jocks who wrote the application get better algorithms and better localization and good checkpointing because the computer didn't always stay running for the full week it took to finish a simulation run. A year or two later, we got the budget to buy another 4MB of RAM (in 64KB chips, about $50K IIRC), which helped a bit, and a year or two after that, we got enough budget to buy another 8MB of RAM (maybe 256KB chips? not sure. Also about $50K), and suddenly the application could complete in under an hour instead of a week, because RAM really is a couple orders of magnitude faster than disk drives with a couple more orders of magnitude less latency, so our problem changed from being disk-bound to being CPU-bound.

That speedup not only improved the utilization of the equipment, it made a qualitative difference in the kinds of problems we could address because of the way we could interact with it. That's why people buy supercomputers if they need them - it really can be orders of magnitude faster for some problems. The first year or so, we really had all the RAM that could fit in the double-refrigerator-sized VAX cabinet. Once the denser RAM chips became available, we probably should have spent a bit more manager time beating up on the accounting department, because an extra $50K for hardware could have more than doubled the efficiency of 3-4 physicists, but of course the accounting droids don't think in terms of efficient use of physicists unless it lets you buy half as many of them, which was _not_ the objective here...

--

Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks

$4500 on EBay by billstewart · 2004-08-20 09:49 · Score: 1

Sure you've got enough money for a Cray! Cray J932SE supercomputer (dual IOS, 3 cabinet) for $4500, not including disk drives.

--

Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks

"Imagine what you could do with a Cray!" by billstewart · 2004-08-20 09:55 · Score: 1

C'mon now, don't you remember Cray's advertising slogan from the 80s? They used to make nice posters with it.

Back in the 80s I was doing a telecomm project for a large research lab that had a number of Cray supercomputers on one side of campus, and their campus backbone was a 30 Mbps baseband cable system feeding a bunch of 10 Mbps Ethernets, and a few of their buildings were starting to get brand-new 100 Mbps FDDI. They were getting very worried about what would happen if too many people _did_ imagine what they could do with a Cray, and wanted to do it from the other side of campus... Fortunately, the number of people who had access to the Cray was small enough that a variant on "sneakernet" worked fine - not using the sneakers to carry floppy disks around, but using the sneakers to carry the users to the building where the Cray lived :-)

--

Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks

well, smarty pants... by zogger · 2004-08-20 10:20 · Score: 1

... I do SO know. Supercomputers are mainframes that have radioactive spiders building webs inside them, and the cooties slop over and give them SUPAH POWAHZ.

neener neener

Re:*Shock* by sootman · 2004-08-20 13:06 · Score: 1

Sorry, but I can't believe you got modded as 'insightful'. (well, actually, this is /., so I guess I can.)

Look at google. A large cluster, and if one of the machines dies, you don't worry about it.

The most important thing to remember about Google is that they deal with NON-CRITICAL INFORMATION. Who cares if you get different results when looking for Janet Jackson Superbowl pics from one day to the next, or even one hour to the next? You're getting scads of results, most are probably good, so you're happy and think Google is perfect. Google's *actual* reliability and consistancy would get them NOWHERE in banking or airline reservations, let along weather prediction, nuclear blast modeling, etc etc etc. Just because a computer can do one job well does not mean it can do another, same way that Indy cars make really lousy drag racers and Top Fuel cars don't turn laps at the brickyard.

--
Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.

Re:The issues are progress and long-term usefulnes by ananke · 2004-08-20 13:09 · Score: 1

I'll have to dig through our ticketing system and my desk to find the ticket numbers, so you may have to wait until Monday. None of them were 'bugs', simply hardware failing. Both myrinet cards in computational nodes, and switch cards. Hardware did get replaced, and I'm about to call in for a replacement of 10+ cards plus probably half of the cards in the switch. For a two year old cluster to have 20% of the myrinet hardware replaced, I find that not acceptable.

--
--- d'oh

NOT SURE WHAT "FAST" MEANS by hipparchus · 2004-08-20 13:29 · Score: 1

I seriously doubt that Cray can put faster circuits down on silicon that Intel. part of the nature of the silicon foundry is that stuff doesn't start getting good and fast unless you make A LOT of it. It also gets cheap at this point.

I see no architectural difference between a "cluster" and a "supercomputer". The links between different CPUs are just conventionally made using different technology.

There's a lot of rubbishing of PCI (hey it's 10 years old now, and there are MUCH faster new versions happening), and what is the point of saying unquantified/unsubstantiated crap like "CRAYS HAVE VERY FAST SHARED MEMORY BUS".

Yeah - HOW FAST THEN? I'd be surprised if they are 128 bit running at 2 GHz.

Shared memory can mean one of a number of things, also:
You can have one CPU sharing say a 4 meg block with each of 25 other CPUs. The first CPU acts as the hub for communication between the other CPUs.
You could have 27 CPUS in a 3 x 3 x 3 cube, each CPU sharing memory with up to 6 neighbours.
You could have 5 processors in a line with each one sharing memory with (up to 2) neighbours.

Or you could have a bunch of core memory that 4 processors share (they might have their own memory too).

The same thing goes for a cluster - you could have PCs with up to 6 network cards (or with unidirectional custom ethernet protocol, even 12 network cards linking to neighbours in a 27 CPU cube, and so on.

The topology will affect how the program is written for maximum speed, but also which tasks the computer is suited for. I think you could make very very fast links between ordinary PCs with say full duplex gigabit running a custom protocol (TCP has latency by the way, UDP has none since it doesn't wait to assemble packets in buffers in the kernel).

It's hard to imagine a task that is so i/o bound (in my mind this is the opposite of embarrasingly parallel problems) as to require more than 100 megabytes/second between each node, when each CPU node has a memory bandwidth of 12 gigabytes per second (based on 32 bit core of Pentium 4 at 3 GHz, assuming roughly 1 transfer per clock cycle, which in itself is unlikely).

In other words, a cluster using off the shelf gigabit ethernet hardware could transfer 1% as much data as the CPU could do with RAM.
Note if the CPU is in a 27 CPU cube the combined 6 gigabit ether cards would be transferring 6% as much as the CPU could. I guess it is possible to get motherboards with larger numbers of PCI slots, say 12 in which case you could run two streams of gigabit ethernet between each CPU giving you 12% as much data being transferred over ethernet as the CPU can transfer in and out of memory (not including cache flushing from CPU to RAM).

Once again, what problems require such a huge amount of communication with other nodes that say 12% as much bandwidth between nodes versus CPU-memory is not sufficient?
Say 12% isn't high enough: what CPUs, data bus widths, and shared memory speeds are used then?

Arguments people have made so far are so light on detail, and using terms like "much faster" instead of giving a figure, it sounds like FUD.

Remember parallel links between devices on chips can exhibit data skew, lowering data rate compared with a fast serial link. In fact there is talk (and I personally suggested a long time on a newsgroup) using light to get signals from one chip to another. (probably mainly serial, but not necessarily exclusively).

Hmmm... by Jozer99 · 2004-08-20 14:02 · Score: 1

I am detecting a slight conflict of interest here.

Re:*Shock* by Tony-A · 2004-08-20 14:48 · Score: 1

1000 machines are more reliable then 1 big machine.

Depends.
If any machine(s) are up, the system works. This is Google's advantage.
If any machine hiccups, the system fails. This is supercomputer turf.

A cluster is a fast efficient way to solve a few problems. A very small portion of the problems would be my guess. To the extent that the next computation depends on something non-local, which in turn depends in part on the results of your last computation, supercomputers have an advantage somewhat like the difference between cache and disk access speeds.

Re:The issues are progress and long-term usefulnes by Wesley+Felter · 2004-08-20 15:58 · Score: 1

The point is, though, that the Cray supercomputers are vector supercomputers, whereas Linux clusters and other similar machines are not.

The article is about the Cray XD1, which is not a vector system. In fact, the XD1 is remarkably similar to an Opteron/Infiniband/Linux cluster...

Re:The issues are progress and long-term usefulnes by chthon · 2004-08-20 16:07 · Score: 1

It is called a hypercube because it is a mapping of a four-dimensional (and higher) cube into three-dimensional space.

It has nothing to do with communications or computing, but with topology.

Re:The issues are progress and long-term usefulnes by gardyloo · 2004-08-21 05:52 · Score: 1

Right, right. But if it's a mapping (1-1, as it seems) then there's an isomorphism, and the need to do anything in hyperspace isn't there. Just call it an arrangement with a certain number of lines from each node and be done with it. If it can be done in meat space there's no need to hyper- it up.

Crays have 1024 bit busses.. by rofthorax · 2004-08-21 06:35 · Score: 1

Until PC's have as wide busses, it probably
won't matter how well the chips do.. Cray's probably require less maintenance than a PC cluster.. Its like saying a bunch of fast motorcycles are better than a stock-car. Sure, but if you get in a wreck, I'd rather be in the stock car..

--
Just say no to license servers!!

Re:The issues are progress and long-term usefulnes by Folkboat25 · 2004-08-21 08:24 · Score: 1

did I hear right cray's are using a type of linux kernal....

--
When you've got you install disks you know you haven't been screwed

Re:The issues are progress and long-term usefulnes by chthon · 2004-08-26 02:43 · Score: 1

I think that this configuration gives the best proce performance. On one end of the spectrum you have a completely interconnected mesh, on the other hand you have all systems on a single bus, in between you have a whole lot of possible topographies.

In the hypercube system, you need in N dimensions N communication channels per processor, and the maximum distance that any packet has two travel is N hops.

In a complete interconnected mesh with P processors, you need P communication channels per processor. While your maximum hop is in that case only 1, you need (P - 1)*P communication channels, which is quadratic.

Re:agreed by jedidiah · 2004-08-27 09:32 · Score: 1

The default behaivor should be sensible.

Migrating a single process on a mostly idle NUMA box across system boards is NOT sensible.

The whole "locality of reference" concept is CompArch 101 stuff.

--
A Pirate and a Puritan look the same on a balance sheet.

Slashdot Mirror

Cray CTO Says Cray Computers Are Great

254 of 338 comments (clear)