Slashdot Mirror


Cray CTO: Linux clusters don't play in HPC

jagger writes "Linux clustering was touted as the next big thing by many vendors last week at ClusterWorld Conference & Expo 2004. But supercomputer vendor Cray Inc. scoffed at the notion of putting Linux clusters in the high-performance computing (HPC) category. "Despite assertions made by Linux vendors, a Linux cluster is not a high performance computer," said Dr. Paul Terry, CTO of Cray Canada."

110 of 435 comments (clear)

  1. Marketing by Allen+Zadr · · Score: 5, Insightful

    While Paul Terry makes some good points, in his statements, including the partial quote from the post, "Despite assertions made by Linux vendors, a Linux cluster is not a high performance computer, said Dr. Paul Terry, CTO of Cray Canada. "At best, clusters are a loose collection of unmanaged, individual, microprocessor-based computers."

    Remember to take this with a grain of salt. The inflammatory nature of the comment is nothing more than a marketing ploy to increase visibility of, and sell, the new Cray XD1

    --
    Kinetic stupidity has a new brand leader: Allen Zadr.
    1. Re:Marketing by Total_Wimp · · Score: 5, Funny

      "At best, clusters are a loose collection of unmanaged, individual, microprocessor-based computers."

      I'm sure Paul Terry is nothing more han a loose collection fo unmanaged, individual human cells too. But I'm sure, with hard work and love, he can become a _real_ boy! Lets all have a hug.

    2. Re:Marketing by Shinobi · · Score: 2

      As I noted in another post: Those Cray's only have 256 CPU's. Larger installations are planned.

    3. Re:Marketing by Hoser+McMoose · · Score: 4, Insightful

      The Top500 list uses Linpack exclusively for it's test. Linpack can be split to run on clusters VERY easily, it could even fall under the catagory of "embarassingly parallel" problems. These sorts of tasks do exist in reality, but they definitely aren't the only kinds of problems you'll encounter.

      If you need to access remote memory in a super cluster, such as the ones mentioned above, you take a BIG hit in terms of performance. Think about running from swap space vs. running an application out of memory and you'll be on the right track. In these sorts of situations a system like that Cray down in slot 19 could easily beat out nearly anything above it on that list (almost all of which are superclusters except for Earth Simulator at #1).

      As others have mentioned, the guy was clearly talking from a marketing standpoint rather than a "chose the best solution for the job" standpoint, however what he said isn't entirely without value. There are a lot of tasks out there where that Big Mac supercluster that people keep touting would suck-ass. Even with their high-bandwidth, low-latency infiniband interconnect you're still looking at a good 3 orders of magnitude lower performance for remote memory vs. local memory.

    4. Re:Marketing by Shinobi · · Score: 4, Informative

      Actually, that crossbar memory bus is just the local bus for each cabinet, and they do have low-latency interconnects that allow globally shared memory and single system imaging. Otherwise they wouldn't be working on a 1024 CPU installation. A clue for you: The technology used in the Origin machines was originally developed by Cray, and it runs 1024 CPU installations as global shared memory and single system image.

      As for research, it's more a case of researchers doing the old "Damn, I'll have to make do with this". And Origin and Altix systems are still selling well in the research market.

      And don't forget, Cray is backed by US government departments such as the NSA. The X1 received a lot of such support, which Cray even admits themselves: http://www.cray.com/products/systems/x1/

  2. Yeah, but imagine... by Anonymous Coward · · Score: 3, Funny

    ...a Beowulf... cluster... thingy... doesn't that count?

    1. Re:Yeah, but imagine... by operagost · · Score: 2, Funny

      It would be ironic if someone were to port Linux to a Cray - then build a cluster. I wonder what he would say about that.

      --

      Gamingmuseum.com: Give your 3D accelerator a rest.
  3. Seymour Cray by JargonScott · · Score: 5, Funny

    A quote I've seen before:

    "If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?"

    Maybe he meant penguins?

    --
    Nuke Gay Whales for Jesus.
    1. Re:Seymour Cray by Anonymous Coward · · Score: 3, Insightful

      You could just as well ask though

      "If you were building an ants nest, which would you rather use? 1024 Ants or a Bulldozer?"

      Perhaps he shouldn't be comparing plowing fields to high performance computing.

    2. Re:Seymour Cray by theatre_freak · · Score: 4, Funny

      Would that be a kilochicken?

    3. Re:Seymour Cray by wowbagger · · Score: 4, Insightful
      The analogy USED to be valid, however the times have changed as microprocessors are now much more powerful.

      The analogy now would be more like:

      Which would you rather use to plow a field - one big tractor or a 1024 little tractors.


    4. Re:Seymour Cray by lukewarmfusion · · Score: 4, Funny

      Chickens, for the same reasons that you would use 1024 Linux boxen instead of his Cray.

      And when you're done plowing, you can fry 'em up all tasty.

    5. Re:Seymour Cray by turgid · · Score: 4, Funny
      "If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?"

      Personally, I'd prefer a John Deere 6003 Series.

    6. Re:Seymour Cray by kitzilla · · Score: 4, Funny

      "If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?" How big are the chickens?

      --
      This is my post. There are many others like it. If you don't like what you read here, go try one of the others.
    7. Re:Seymour Cray by thedillybar · · Score: 2, Insightful

      How about a real tractor? Like an International.

    8. Re:Seymour Cray by xdroop · · Score: 3, Insightful
      Use the right tool for the job.

      If you are plowing fields, use the bull.

      If you are making eggs, use the chickens.

      This isn't a one-size-fits-all world any more. Only those deluded enough to think that Windows should be the world's standard desktop think otherwise.

      --
      you should read everything on the internet as if it had "but I'm probably talking out of my ass" appended to it.
    9. Re:Seymour Cray by rthille · · Score: 2, Insightful

      That's funny, but since plowing a field is very parallelizable it doesn't make a very good analogy. Especially since in that analogy the Cray isn't two strong oxen, it's more like a machine that can plow all/many of the rows at once, and the linux cluster is a machine that can plow one row at a time, but you can afford to buy a bunch and plow as many at a time as you have $$s to spend.

      --
      Awesome furniture, accessories and cabinetry in Santa Rosa, CA: http://humanity-home.com/
    10. Re:Seymour Cray by Skjellifetti · · Score: 4, Funny

      Depends. Does HPC stand for High Performance Cow or High Performance Chicken?

    11. Re:Seymour Cray by turgid · · Score: 3, Funny

      No silly, it's High Performance Cow, Highly Parallel Chicken

    12. Re:Seymour Cray by Anonymous Coward · · Score: 3, Funny

      2 Oxen sized Chicken cabinets with 512 Chickens each, or a cluster of 1024 chickens.

    13. Re:Seymour Cray by Anonymous Coward · · Score: 2, Insightful

      To hell with the analogy.
      Further clarification:
      One big computer with 1024 processors that allows every processors to access all of the system memory at full speed, and has been designed from the ground up to work as one system, or a cluster of 1024 processors spread across some number of computers with either 2 or more procesors per box, memory is only quickly local to the computer box it's plugged into and has been built to work as one system.

    14. Re:Seymour Cray by Anonymous Coward · · Score: 2, Funny

      No, a kilo of chicken would be approximately 2.2 pounds.

    15. Re:Seymour Cray by Waffle+Iron · · Score: 5, Funny
      "If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?"

      When Seymour Cray made that statement, he was probably pointing out the difference between his he-man vector processors vs. clusters of the wimpy microprocessors of old.

      After reading the article, it seems that this new Cray is powered by a bunch of the exact same AMD microprocessors that a cluster of Linux boxes would use. So what they have now is more like an ox-shaped sack stuffed with chickens.

    16. Re:Seymour Cray by epiphani · · Score: 5, Funny

      wow. I've never seen someone fail so miserably when trying to start a flamewar over why kind of Tractor is better. Man, I thought they woulda been all over that here on slashdot.

      --
      .
    17. Re:Seymour Cray by krlynch · · Score: 2, Funny

      But only if they're GNU/Chickens....

    18. Re:Seymour Cray by jtev · · Score: 2, Insightful

      Um, John Deere cheap? are you looking at the same stores I'm looking at? I do know farmers that swear by both. I know a lot of people who own Green Tractors, and a lot of people who own Red tractors, not a few who own White tractors, of course the realy fancy tractors are the Yellow ones. All these colored paints seem to be expensive though.

      Never mind me, I forgot the point of this rant.

      --
      That which is done from love exists beyond good and evil
  4. Business or science? by grub · · Score: 2, Interesting

    Dr. Terry's assertions remind me of a Seymour Cray quote I had as my /. sig a while back:
    "If you were plowing a field, which
    would you rather use? Two strong oxen or 1024 chickens?"

    I'm not picking a side, it just seems interesting that the Cray CTO would echo Seymour's thoughts. I guess it's for business and marketting reasons though, sadly.
    --
    Trolling is a art,
    1. Re:Business or science? by Billy+the+Mountain · · Score: 2, Funny

      Even though the chickens might be 80% more efficient, there are other considerations: Can you imagine the ridicule you'd get when you went into town?

      "Here he comes, get ready boys! Cluck cluck cluck cluck cluck, here chickey chikcey, Haw Haw Haw!", etc.

      BTM

      --
      That was the turning point of my life--I went from negative zero to positive zero.
  5. CTO of Cray? by shachart · · Score: 2, Interesting

    You did notice he is the CTO of Cray... Canada??

    --
    Those who can, do. Those who can't, consult.
  6. Todays headlines... by stevens · · Score: 4, Funny

    Company officer claims competitor isn't as good as his product. Film at 11.

  7. And in other news... by heironymouscoward · · Score: 5, Insightful

    Oracle disclaim MySQL and PostgreSQL as "toy databases", Microsoft claims that "Apache cannot be used for real web serving", and Sun announces that "Intel and Linux simply cannot be used for enterprise computing".

    So all those supercomputing labs that use Linux clustering (that invented Linux clustering, even) have been wasting their time?

    --
    Ceci n'est pas une signature
    1. Re:And in other news... by dasmegabyte · · Score: 4, Insightful

      All of those statements are true. And a cluster is not a mainframe, and the products sold by Oracle, Microsoft and Sun *DO* go far beyond their Open Source competitors in terms of functionality.

      The problem for these guys is that, in terms of real world enterprise usage, not everybody needs the features they offer. My business doesn't need the easy management and clustering features in IIS, heck the website hasn't been updated in months and this time kast year nobody even knew which machine it ran on. We don't need the task scheduling, file striping, data transformation, replication or XML features of Orcale. In fact, we only need a tiny sliver of the possible functionality of these great products...but we're unable to pay a sliver of the price. With OSS ramping up its feature set daily, for a lot of companies with our needs it makes more sense to train a guy on Linux than to drop five digits on Windows Server 2003 and SQL Server.

      As for supercomputing...well, a cluster is NOT a mainframe. They're two similar, but different things, with the main difference being the databus. If your task is to perform a lot of calculations on a trivial dataset, clustering is the way to go. If your task is to perform a few calculations on a massive dataset, you want a mainframe. The mainframe is simply more efficient at processing massive inputs and providing massive outputs because it was designed to efficiently pass data between processors -- give the same dataset to a cluster and most of your time is wasted negociating the network.

      Of course, these days networking is so fast that a cluster will probably do for most of the things people used to do on mainframes...but a cluster is still best for tasks which are easy to split apart and process in pieces.

      --
      Hey freaks: now you're ju
    2. Re:And in other news... by mritunjai · · Score: 2, Insightful

      Oracle disclaim MySQL and PostgreSQL as "toy databases"

      Yup kid and for very good reasons. Take sometime off from paying with your toys and regiter at OTN to learn about what Oracle 8i and 9i database are capable of. You'd literally blow your head off if you see what their Apps are capable of.

      For a start, consider 4 page long nested views that pull data from more than 40 tables where some contain upwards of million rows. And these views are accesed thousands of times a day in their apps.

      Here is an example of what you can do with 9i

      declare
      type t1 is table of number;
      type t2 is table of varchar2(30);

      my_tbl_1 t1;
      my_tbl_2 t2;
      my_tbl_3 t1;
      my_tbl_4 t2;

      begin

      SELECT
      a.col1, b.col2, c.col3, decode(select foo from sometbl where colx = (select bar from yay where t=t1), 'Y', X, Y)
      MASS COLLECT INTO
      my_tbl_1, my_tbl_2, my_tbl_3, my_tbl_4
      FROM
      tbl1 a, tbl2 b, tbl3 c, tbl4 d
      WHERE
      decode(nvl(select n from y where col = a.col3), 'Y'), 'A', 'B', 'C', select colx from tabyz where col = 'F', 'Hello') = 'Hello'.
      Extra points from figuring out why this query is great! (*hint* : Its subqueries and 'decode' [think it like a C 'switch' statement inside a SQL query!] on steroids... if you can't see what you can accomplish using these features, you're not a database developer.!
      --
      - mritunjai
  8. Are too by Anonymous Coward · · Score: 5, Interesting
    "Most cluster [experts] know now that users are fortunate to get more than 8% of the peak performance in sustained performance."
    Tell that to PIXAR. I don't believe it either.

    I guess that the simple problem is just that the algorithm applied is usually not suitable for massively parallel computing.

    1. Re:Are too by Technician · · Score: 2, Insightful

      Tell that to PIXAR. I don't believe it either.

      Ya beat me to that one. I won't post it because it would be modded redundant, but I would have mentioned Google also.

      --
      The truth shall set you free!
    2. Re:Are too by dead+sun · · Score: 5, Insightful
      Pixar doesn't need telling, their problem breaks up so miraculously well that they'll see the best performance you could possibly expect from a cluster. The big problem, rendering a movie, decomposes into thousands of small problems, rendering a frame. Each machine in their cluster can handle a group of frames at a time with zero need to communicate or worse, share computation, with other machines in the cluster. It's the best case scenario.

      Many other computing problems don't decompose nearly so nicely. So there are certainly problems that probably won't see more than 8% of peak performance. If you were particularly inclined you could probably invent a problem that had to be done serially, leaving percent of peak performance equal to what percent of your cluster one box was. Cray is right to that extent and if you're solving a problem that falls into the category of not easily parallelized then perhaps one of their machines is the better tool for the job. But, like you mention there are instances where the cluster is a great tool and cost effective to boot.

      Heck, ever check out some of the faster interconnects like Myrinet? They're insane and exist because fast ethernet just doesn't cut it in some places. Just using a slow interconnect is enough to bring real performance down below theoretical peak. Luckily for Pixar off the shelf fast or gigabit ethernet is likely enough.

      Anyway, use the best tool available. If your problem falls into the category of trivially parallelizable like rendering a movie is then don't bother wasting your money on a Cray. If your problem isn't suited to a cluster, however, then maybe a cluster isn't the right answer. If you have a big problem that needs serious computation take the time to figure out what you need before taking a marketing drone's spiel for gospel in your situation.

      --
      If not now, when?
    3. Re:Are too by bugnuts · · Score: 3, Interesting

      All tests for the top 500 supercomputers are done solving a problem using Linpack, not some trivially parallel code such as raytracing 100,000 frames of a movie.

      Message passing is the biggest issue with such solvers, and in a way, cray was absolutely right about Linux, although misleading. There are some tests going on now with a modified Linux kernel for doing true HPC, and it's been done in the past (I know, I've used it). Things like disk swapping pretty much immediately disqualifies you for high performance computing. It has its place of course, such as trivially parallelizable codes is one example (Pixar).

      Myrinet was out before Gbit ethernet was really available, and also has some nifty routing capabilities. And since the bottleneck for HPC is usually message passing, high performance computing will better realize its theoretical performance as the communication speed catches up to the processor speed.

      But, to Cray's discredit, making a blanket statement that Linux can't do HPC is like saying Macintoshes can't do HPC.

    4. Re:Are too by GauteL · · Score: 3, Informative

      In fact even rendering a single frame decomposes easily into lots of seperate task, because it involves raytracing backwards from every single pixel trying to calculate it's colour. And furthermore, each of these tracings are completely seperate and just begs to be parallelised.

      Raytracing is sometimes referred to as "embarrasingly parallel", because of this.

      Mathematical dependencies is the real destroyer of parallelism. Any situation where the next calculation depends on the result on the previous is a typical serial calculation that would do badly on any super-computer and might as well be run on a single single scalar processor like the Athlon or P4.

    5. Re:Are too by bm_luethke · · Score: 2, Informative

      You are basically quoting Amdahl's law (you may know that, but it should be pointed out in case anyone who doesn't know wants to look it up). Though his machines run into the same problem, if the program can not be broken into little concurrent chunks then having 1024 processors isn't going to help either.

      When I worked at Oak Ridge National Labs there were several applications that people ran on our clusters that were serious computations. Very few of the people there really cared one way or another if it was on the IBM SP-2 or on the intel clusters, just run on the hardware that has the shortest runtime.

      We generally got well over 8% utliization, if that was all you were getting then you were not managing the cluster well. Basically both machines had similar problems, if one piece of software only utilized 10% of the machine (and that is possible, even probable, in either world) then you ran more than one person - they did it and so did we. It was rare a single person got exclusive use of the machines (they either shared on each individual node or the over all machine was split into smaller clusters/supercomputers). The lines between the two are very blurry, but of course Cray wants you to think differently.

      This article is just like one of the researchers there that ran the Big Iron stuff. When I was still an intern I overheard him telling the new director about how clusters sucked because they cost so much more in salaries to maintain. While true, he overlooked that thier service contract with IBM cost more than triple what it would cost us to replace the whole cluster per year and hire four full time people to manage them, and they never got any hardware upgrades for it.

      Each has thier strong points and weaknesses, and never trust someone who is trying to sell you something to give the full story.

      --
      ------- Sorry about the spelling, I suffer from two problems. Dyslexia makes it difficult to spell well, lazy makes it
  9. VA Cluster yet to be used by Anonymous Coward · · Score: 4, Insightful

    Regardless of whether I agree with the article or not I feel compelled to point out that:

    The 1100 node Apple G5 cluster in virginia has yet to run any real scientific code. So far it has only ran benchmarks.

    1. Re:VA Cluster yet to be used by The+Placid+Casual · · Score: 2, Interesting
      The VA cluster is no more. It is being sold as individual G5s by Macmall, each with a certificate proving that it was part of the cluster!

      There were issues with unbuffered RAM, so they have decided to make a new cluster with the new 2ghz X-Serves which use EEC RAM ( and new IBM 970fx chips).

      This has resulted in massive shipping delays for the dual X-Serves, but should mean that a very, very good machine is created at VT...

    2. Re:VA Cluster yet to be used by Slowtreme · · Score: 4, Informative

      It was annonced that VA tech actually purchased the G5 X-serves before production was in place, but were instead delivered the G5 towers as loaners to have the cluster built in time for ranking.

      The cluster remains, they have not shut it down and were swapping out individual racks for the upgrade.(something like one rack of X-serves is three racks of towers.

      I don't think it's been published that they have or haven't ran any data besides benchmarks.

      --
      Post: Sigged, for your pleasure.
  10. What do you expect him to say? by WarlockD · · Score: 3, Insightful

    "We are dropping our line of Cray supercomputers and replacing them with rack mounted Beowulf cluster of 486's!"

    I am not saying Cray isn't worth it, but there is something to be said on replacing/fixing your supercomputer with over the counter parts.

    1. Re:What do you expect him to say? by Ironsides · · Score: 2, Insightful

      yeah, like how to up your power bill by a factor of 10 or more. Seriously, think how much condensed power a cray computer has that dubs as a desk or bench. How many top of the line G5 macs would it take to equal that, and compare the power requirements and size. I'm not saying that the Linux supers aren't week, but that in some cases it is easier to use Crays because you can pack more of them into a smaller area with less power, and get better results per cubic foot and kilowatt.

      Also remember, these guys can customize their supers for specific applications to improve performance. You can't customize a linux super's harware very easily. Not unless you could make your own chips.

      I hate to say it, but he's essentially right on this one. It takes a massive amount of general purepose computers to equal one cray. And if you have to worry about space, or if you need a couple of supercomputers to do various tasks at once, you can't very well afford to keep a couple of college size gyms on hand to keep them in, much less the cooling systems.

      How much power does the Virginia Tech super consume? How much does it's cooling system consume?

      --
      Fly me to the moon Let me sing among those stars Let me see what spring is like On jupiter and mars
  11. So what DOES play in HPC? by huhmz · · Score: 2, Interesting

    REading the article it's fairly obvious that Cray's CTO has an agenda, however, assuming he's right, what does play in HPC? Cray Prorpritary Cluser OS (TM) or what?

    1. Re:So what DOES play in HPC? by glop · · Score: 2, Informative

      Hello,

      actually, if you read the datasheet, the XD1 runs Linux 2.4.21 with some modifications (see xd1_datasheet.pdf

      So does the SGI Itanium machine. What sets these computers apart is that they offer better interconnections between the processors than clusters do.
      The Bigmac has 1.2GB/s between two nodes through Infiniband whereas an SGI machine has 6.4GB/s.

      As a summary, in a cluster you use slower links with higher latency and your processors communicate through messages.
      In a SGI or Cray machine, you use fast and expensive links (think more wires, more expensive controllers) and your processors can work as though they all shared the same memory.

      SGI sells systems with 128 processors where there is only ONE Linux kernel (as opposed to 128 in a Linux cluster).

  12. Sure... by avalys · · Score: 4, Insightful

    In other news...

    "Despite assertions made by Toyota salesmen, a Lexus sedan is not a luxury car," said Bill Taylor, CEO of Mercedes-Benz.

    --
    This space intentionally left blank.
    1. Re:Sure... by boisepunk · · Score: 2, Funny

      "Windows(TM) has a lower TCO than Linux"

      -Microsoft ad campaign

      (mods: don't hurt me. I mean nothing but to contribute to good discussion.)

      --
      main(0)
  13. He's got a point by PissingInTheWind · · Score: 4, Insightful

    Clusters can get high performance on some types of tasks. But sometimes, you need fine-grained parallelism that just isn't available on a cluster.

    On the other hand, high performance usually comes through special hardware. And on that hardware, I think Linux could be the right thing (modulo some patches).

    --

    A message from the system administrator: 'I've upped my priority. Now up yours.'
    1. Re:He's got a point by timeOday · · Score: 2, Insightful
      No, the Crays, at least the traditional Cray vector machines, were not very fast at running sequential code - not even as fast as a normal, general-purpose CPU (especially one of comparable cost). That's why they were never used for things like running payroll. They were specialized machines for scientific applications.

      Nobody is disputing that clusters have relatively high communications penalties for tightly coupled computatoins. The issue is whether that justifies the generalization that clusters are not "high performance", which it doesn't. People have successfully used clusters for rendering, many types of physics simulations, big-time webserving (e.g. google)... all of these are high performance applications.

  14. Flame Bait? by Natchswing · · Score: 2, Funny
    This is news? This is the equivalent of posting, "My AMD processor is better than your Intel processor!" It's a quote designed to ignite a fact-less argument on who has bigger ones.

    Now, if the CTO of Cray Canada started talking about your mother than I think you're morally entitled and required to respond.

  15. It's not the vendors... by Anonymous Coward · · Score: 2, Informative

    It's not the vendors who are claiming that Linux clusters are real supercomputers, it's the people who are using them to do real supercomputer work. They sell themselves based on actual price and performance.

    Methinks Cray is feeling a little threatened...

  16. Maybe so by pair-a-noyd · · Score: 2, Insightful

    "Despite assertions made by Linux vendors, a Linux cluster is not a high performance computer,"

    Maybe so but not everyone can pull a Cray out of his ass when they need horsepower. A Linux cluster is affordable, a Cray is the thing of wet dreams..

  17. If it walks like a duck, and talks like a duck.... by Ridgelift · · Score: 2, Interesting

    Despite assertions made by Linux vendors, a Linux cluster is not a high performance computer, said Dr. Paul Terry, CTO of Cray Canada. "At best, clusters are a loose collection of unmanaged, individual, microprocessor-based computers."

    I guess they're not happy about being only #19 on the Top 500 Supercomputer List. Linux is considered faster than they are according to the list.

    The 'ol ad-hominem attack of "if you can't beat them ligitimately, attack them personally" just doesn't cut it Paul. Build a better computer.

  18. Efficiency and cost argument by yppiz · · Score: 3, Interesting

    The Cray CTO makes the point that Linux clusters get, at best, just under 10% peak as sustained performance and uses this as a justification that Linux clusters are not HPCs. This is a reasonable criticism. Let's take the percentage he cites as real for a moment. Now what is the cost difference between a Linux cluster and a Cray (not some future offering, but today) and how much more of a Linux cluster could you afford? Would that offset the quoted inefficiency? Would the flexibility of being able to use commodity components further offset any advantage Cray might have? What about 24hr or same-day parts replacement without a hyper-expensive service contract? At the end of the day, I suspect the Linux cluster wins out even given the sub-10% efficiency figure Cray cites. --Pat / zippy@cs.brandeis.edu

    1. Re:Efficiency and cost argument by vijayiyer · · Score: 2, Informative

      Given a problem that doesn't scale well in a cluster environment, throwing more nodes at it will not help significantly. In that case, the cluster will run slower than the Cray at an equal cost. When you're paying several researchers $100k/year each, the Cray is probably the better solution for problems which are not easily parallelizable.

  19. the list by hakr89 · · Score: 3, Funny

    well i guess now we'll have to add them to the list...after SCO and Microsoft of course

  20. Funny... by Greyfox · · Score: 2, Interesting
    Works for Google...

    Wish I'd been there so I could have slapped him after about 3 seconds of stunned silence.

    --

    I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

  21. Problem by rawgod0122 · · Score: 5, Interesting

    It all depends on the problem you are trying to solve. I have been doing some work of late that would not complete in my life time on the 108 node cluster that we have. But when programmed for and run on two Cray X1s I should complete inside of a week.

    Granted there are many codes (and more every day) that will run on clusters, the big iron will never die.

  22. Just because we love Linux.... by foooo · · Score: 5, Insightful

    Just because we love Lunux doesn't mean that clusters are HPCs.

    There are real issues that differentiate mainframe/supercomputers from large, powerful, clusters.

    Of course this all depends on your definition of an HPC. But I believe that it's reasonable to say that if parts of your computer are connected with low bandwidth connections (10/100,gigabit) they just can't handle the same kinds of transactions that a computer with parts that are connected by 10 gigabit or 1000 gigabit connections or whatever it is nowadays.

    As far as I know if you're deploying a large database it's still advisable to have a big huge IBM mainframe or a Unisys box or a Sun 10k instead of 4,8 or 16 clustered 8 proc machines.

    My point is there are valid arguments for not including clusters of commodity hardware in the HPC category.

    In my mind they aren't High Performance Computers... they are High Performance Clusters of Commodity Computers.

    ~foooo

  23. Linux has failed you by Meor · · Score: 3, Funny

    Where's your God now hippies? Where's your God now?

  24. Partly right, partly wrong.... by ERJ · · Score: 4, Insightful

    How well a cluster will do depends on the application that it is performing. Some problems can be divided into several small problems with little reliance on other parts of the problem (SETI / Encryption breaking). These things can be easily distributed to hundreds or thousands of "small" boxes for processing and are what a beowulf cluster would be good at.

    Other applications require the breakneck interconnect speeds that large Cray / Sun / etc.. build on. When the data being calculated on one CPU requires data from CPU2 to continue its calculations you don't want to have it wait for 100mbit or even 1gbit ethernet speeds. Even quicker interconnects such as SCALI are going to be slowed by PC bus speeds.

    Cray fills an important niche for those who can afford it.

  25. Different tools by BoneFlower · · Score: 4, Insightful

    The comment was stupid, yes, but not all jobs that you'd use supercomputers for can be broken down into many threads as others can. A linux cluster will do well for some jobs, a cray box will do well for others. There *will* be times when a Cray system is so far superior to anything you could do with Linux that it becomes the only real option.

    However, dismissing linux cluster technology automatically is dumb. In many cases, it provides more than enough cpu power and I/O bandwith to support your reason for getting a supercomputer, and probably at less cost than the other options.

    Its all a matter of determining what you need the computer to do, determining your budget, and get the best system in your budget for the uses you have for it. Sometimes that will be a Cray, sometimes a Linux cluster.

  26. Says who? by dagnabit · · Score: 4, Funny

    Who is this guy and what does a company like Cray know about... oh... never mind.

  27. Can you multithread your application? by LostCluster · · Score: 5, Insightful

    Clusters can rival a supercomupter when they are assigned is a task that's suitable for distributed computing. That is, work units can be divided up and worked on in any sequence... the result of segment 45 doesn't depend on knowing the result of 44 and such. Effectively, you can have the sum of all of the processors minus just a little overhead for the clustering.

    What Cray's rightfully pointing out is that for most business applications, however, distributed computing is not a viable option. When processing on a transaction basis, the transactions often need to posted in the exact order they were recieved, which means they must be taken serially. In those situations, the programs can't multithread work out to the other processors so well, and the cluster will end up running at roughly the speed of just one processor while the others waste clock cycles waiting for something to do.

    The cluster isn't the solution to everything. Nor is the supercomputer. You've gotta think about the job, then figure out which tool is right for the task.

    1. Re:Can you multithread your application? by radish · · Score: 2, Informative

      No supercomputer (cluster or traditional) is going to work well if your app can't multi-thread as none of them derive their power from a small number of super powerful CPUs. For that you want something more like a traditional mainframe (and guess what, many banks still use them). The real difference between the Cray model and the cluster model is shared vs seperate memory. The question becomes "can your application be broken down into small chunks which are entirely self-contained". So rendering a movie works well because each frame, or even portions of frames, can be rendered entirely apart from others. However, doing analysis over massive data sets (e.g. data mining) will benefit from multiple threads being able to share one huge memory pool. So Pixar use a cluster and the NSA use a Cray. Right tools for the job.

      --

      ---- Den ene knappen er powerknapp, den andre er Bender voice knapp "Bite My Shiny Metal Ass"

  28. These are not the Droids you're looking for.... by RedLeg · · Score: 2, Insightful
    Buy MY droids instead..... Move along.....


    His rhetoric is quite predictable, actually. He talks at some length about how and why clusters of PCs can't get the job done, and how clustering is inherently inferior to a REAL SuperComputer, then goes on to describe how their new product (which sounds suprisingly like a cluster of propreitary machines) can work. Repeat the above as it applies to the management software.


    If clustering doesn't work, and Supers are better / cheaper, explain why large companies (Pixar, NVidia, ...) Government Labs (Los Alamos National Labs, Sandia National Labs, ...) have invested, and are continuing to invest in and support their clusters.


    Note that this does NOT mean that clusters are suitable for ALL traditional SuperComputing tasks. It really depends on the problem. If the problem is better solved with a vector processor, then a vector machine (like a Cray) is what you want. If the problem is solvable in parallel, then a cluster might be the right answer.

  29. Re:SGI by rawgod0122 · · Score: 2, Insightful

    I assume you are refering to the Altix systems. They are Single System Image systems (SSI), when means they are built like a supercomputer, and not like a cluster. They are big and expensive.

    SSI is where all CPUs can see all memory as if it was local. They are also Non-uniform memory access which means all the memory it sees is not as fast as all other memory, but really ALL single systems are like this. For example each CPU can address the entire TB of memory that is in the system, but reading from one memory location might take 100 cycles, and from another might be closter to 1000 cycles.

  30. Help me here... by ScottGant · · Score: 2, Insightful

    I'm a layman...I have no idea what I talk about, but of course that doesn't stop me.

    I know I keep coming back to Virginia Tech, but isn't all those G5's linked together to make the 3rd fastest supercomputer itself a cluster? Or is it considered something else?

    And if it IS considered a cluster, then why wouldn't a Linux based (along with the *BSD based G5s) be able to make a fast supercomputer?

    If so, then what Paul Terry is spouting is just FUD and marketing to help sell his product, yes?

    Just wondering.

    --

    "Music is everybody's possession. It's only publishers who think that people own it." - John Lennon.
    1. Re:Help me here... by maan · · Score: 4, Insightful

      You're right in saying that the Virgina Tech cluster is the 3rd fastest supercomputer (LINPACK tests). I think that for some other tasks however, it would be slower. Sure, they use infiniband as an interconnect (very fast & low latency), but that doesn't change the fact that it's many separate nodes, each with its own memory. So if one processor were to access some memory on a different node, it would slow down things a little.

      So depending on the task at hand, the cluster might perform very well, or perhaps a little less well. Cray supercomputers are a big number of processors all in the same machine, and more importantly all sharing the same memory. Each processor has the same delay to access any memory content.

      The argument in favor of clusters, however, is that it's still cheaper to throw more computers in than to buy a Cray that would perform the same task in less time.

      In the end, there's a lot of marketing involved in all of this...

      Hope this helps (and that I'm not completely wrong!),

      Maan

    2. Re:Help me here... by krlynch · · Score: 5, Interesting

      So depending on the task at hand, the cluster might perform very well, or perhaps a little less well.

      Surely what you meant to say is that, depending on the task at hand, a cluster might perform very well, or perhaps perform attrociously. :-)

      Clusters tend to work well when the various nodes don't need to communicate very often but you need lots of cycles for the subtasks, while dedicated supercomputers tend to perform very well in tasks requiring vast amounts of internode communications bandwidth along with large numbers of cycles. If you need vast bandwidth and relatively low numbers of cycles, your pricepoint is likely a mainframe. And if you don't need either, you get a cheap desktop machine.

      Certain problems parallelize well on a cluster ... others don't. Some don't parallelize at all, and a cluster won't do you a darn bit of good. The different machines are designed for different uses ... and one should be careful not to push a "one size fits all" solution. The Cray guy clearly got it wrong on that point, and likely knows it, but he was marketting, not teaching a course in choosing hardware for the task at hand.

    3. Re:Help me here... by SquadBoy · · Score: 4, Informative

      The how to from way back in the day.

      http://www.ibiblio.org/pub/Linux/docs/HOWTO/othe r- formats/html_single/Beowulf-HOWTO.html

      has a great explanation using a grocery story analogy that makes it really easy to understand what kind of tasks will work well and what kind will suck. And unlike the cheerleaders that have been showing up since clusters became a big business is very balanced about it.

      Still worth reading.

      --

      Cypherpunks: Civil Liberty Through Complex Mathematics. Those who live by the sword die by the arrow.
    4. Re:Help me here... by RalphBNumbers · · Score: 2, Informative

      There isn't a Cray system that can touch the brute parallel power of a big cluster like Virginia Tech's G5s. But depending on the kind of problem you're working on, there are Cray systems that would walk all over that G5 cluster.

      With problems that can be split up into hundreds or thousands of more-or-less independent subtasks, a cluster is the way to go. But for problems that can't be divided up like that, a smaller system with a few very tightly coupled extremely fast vector processors, like what Cray specializes in, is what you need.

      There are certainly plenty of HPC problems that aren't well suited for large clusters, but it sounds like the Cray guy might have been significantly overstating his point.

      --
      "The worst tyrannies were the ones where a governance required its own logic on every embedded node." - Vernor Vinge
    5. Re:Help me here... by eric2hill · · Score: 4, Informative

      For the love of Christ people, it's a simple thing.

      Format links like this: <a href="http://somelink">link text</a>

      It takes virtually no extra time and we don't have to trim the fucking slashcode spaces.

      Oh, and here's the link.

      --
      LOAD "SIG",8,1
      LOADING...
      READY.
      RUN
    6. Re:Help me here... by starm_ · · Score: 2, Interesting

      I'n not that familiar with HPC either but I'll try to explain what I know in laymen terms. A cluster is nothing more than a bunch of computer networked together intellegibly with an OS that is capable of seperating tasks between these computer. Crays on the other acts more like one big computer. Like the cluster, It also has hundred of CPUs but they are all on the same "motherboard" ( if you can call it that). Some of them share memory. The memory is very high speed. (somethimes in configuration equivalent to gigabites of L1 cash) And it often comes with a huge liquid cooling system so that it can be run at high speed. I've seen crays with very cool cooling systems with a running liquid fall on the front of it. Its uses an inert liquid called something like florinert. This liquid flows on the whole "motherboards" and since it is inert it doesn't conduct electricity or react with anything. Clusters a much cheaper to build, that's why people tend to use clusters, but they can't do everything Crays can.

    7. Re:Help me here... by CatOne · · Score: 2, Interesting

      It slows things down a little, yes, but it's not a huge difference. Infiniband can do DMA across machines -- so the memory on machine 2 *can* be directly accessed by the CPU on machine 1 (i.e. the CPU on machine 1 doesn't need to be consulted).

      Sure, this reduces peak efficiency. I think on the VT cluster it was in the 50-60% range (I could Google search but I'm lazy... shoot me)... that is, the total performance is about .5 or .6 times (2200 CPUs). This is pretty good, overall, compared to other systems.

      But the Cray guy is full of hot air. Of course you're going to sing the praises of massive SMP when that's what you have to sell. The fact is, if 1100 dual CPU machines clustered together can significantly outperfom the Cray, for less money, and they're easy to manage (they are...), then why not go that route?

      So Cray sells FUD, because it's their last option.

  31. Cray has some points. by Saeed+al-Sahaf · · Score: 4, Insightful
    While Dr. Paul Terry's comments are obviously self-serving, especially since in a way, with the Cray XD1 based on multiple AMD processors rather than proprietary Cray processors, he does have a point about the overhead of running the OS on each machine in a cluster, and the statement "The Cray XD1 is not a traditional cluster; it does not use I/O interfaces for memory and message passing semantics."

    In truth, such machine will always have a certain performance advantage over traditional clusters. The question is, will the price point be low enough to invalidate the idea of just adding more boxes to the traditional cluster.

    --
    "Who are in control, they are not in control of anything - they don't even control themselves!" - Glen Beck
  32. to sum it up by Revek · · Score: 2, Insightful

    Cray is losing market share and want people to believe that their expensive to maintain and operate machines are better and 'cheaper' than a clusteer of regualar pc's running together. My question is Does anyone with experiance with both systems back him up.

  33. He's wrong, but he's also right. by Richard+Mills · · Score: 4, Informative

    While I certainly disagree that you can't build a very high performance computer out a cluster of computers (Linux or otherwise), there is a lot of merit to the fact that clusters just don't scale well for certain classes of applications. Hence the renaissance of the vector supercomputer (ala the Earth Simulator ).

    Obviously, this guy is plugging the new Cray X1 architecture, which really is quite promising. For instance, check out this paper by some folks at Oak Ridge National Lab that appeared in Supercomputing 2003.

    Of course, since this is Slashdot, I expect that there will be a deluge of posts decrying everything about the new Cray machine because it commits the cardinal sin of NOT USING LINUX. Oh, the horror!

  34. Re:Linux not usable for HPC? by chammack · · Score: 2, Informative

    Octigabay does (did) in fact make linux solutions. However, it is not a cluster. It's a more traditional supercomputer although it does use low cost AMD processors.

    Cray isn't anti-linux per se, just anti-cluster.

    Somehow I wouldn't be surprised, the next step seems to be cray-marketed cluster nodes with a proprietary high speed interconnect. (If you can't beat them, join them).

  35. Marketing BS, but he has a point by Anonymous Coward · · Score: 3, Insightful

    The "interconnect" latency (especially) and bandwidth in a cluster, even using very high-end network hardware, is much worse than that of a Cray-style supercomputer. This does make certain applications run slower, especially if not specifically tailored to clustered architecture. Some applications are very difficult to break down into small pieces and require extensive memory sharing between nodes, which clusters just can't do well.

  36. Well... he is sort of correct... by nacks1 · · Score: 5, Interesting

    I happen to work in a facility that has large had both large supercomputers (cray t3e, j90, sgi) and linux and *nix based clusters (beowulf/linux, compaq/Tru64). The Cray CTO is correct that you can't just call every linux cluster out there HPC. Just about anyone with networking and linux knowledge can build a linux cluster.

    What really makes a difference between an HPC cluster and your normal every day cluster is the hardware interconnects used. There is a comment in the artical that refers to not using I/O for memory and message passing. I am not quite sure what he means by that, but I am guessing that he is saying that the network is not used for shared memory/message passing (MPI/openMP/SHMEM).

    If a cluster can limit the impact of latency between nodes either through smarter software or faster interconnects then I can't see any reason not to concider a linux cluster as HPC.

    Clusters without smarter software tend to be a real difficult coding platforms. Some developments with things like globally shared memory might make the difference, but there will still be the problem of latency between nodes.

  37. That's like saying... by infinite9 · · Score: 2, Insightful

    "Despite assertions made by Linux vendors, a Linux cluster is not a high performance computer,"

    That's like saying that the automobile is not a high performance team of clydesdales. That's true, but it may be irrelevant. If it can get you there faster or better, I guess it doesn't matter.

    --
    Disconnect your television. Do your own research. Draw your own conclusions. They're probably lying. Don't be a sheep.
  38. not a real supercomputer? by AngstAndGuitar · · Score: 2, Interesting

    My friends and I at the university create a beowulf cluster as a project for our Linux class, (just installing software, nothing impressive really, basic system administration) and benchmarked it (24 P200s 32MB ram each 10baseT network) against a reasonably fast computer running the non-parallel version of the same code... (HP C station) and ours finished faster, with the overhead of networking! as to being "a loose collection of unmanaged, individual" We managed it pretty tightly, using nfs to provide the binaries that would be used, thought we did not use NFS for the root partition, as this would put to much stress on our poor fileserving node. If I remember properly, we used dist for password files, and aside from pushing the power switches, we could manage everything in the room without leaving our seats (24 computers running 'xlock -mode matrix" hehe).

    Basicly, I disagree with Dr. Terry.
    our project writeup is here.
    (Please forgive any mistakes or stupiness therein, we were 15, 15, and a 30something non-geek at the time.)

    --
    Less look fast, more go fast.
  39. Re:Well.. by s00p41337h4x0r · · Score: 5, Insightful
    How could Cray be wrong. I mean just becuase linuxis running some of the top 500 computers there is no reason to consider HPC right. What a self serving statement Cray makes....they still dont get it .... there way is a dead-end...
    That's right. Dataflow vector processing has been shown to be a dead end. The fact that fastest computer in the world is a dataflow machine is a statistical anomaly, right?

    Oh, here's the TOP500 list, btw.

  40. The Cray will scale up by Richard+Mills · · Score: 4, Informative

    The reason that Cray only holds 19th right now is because they have only deployed X1 systems using up to 256 nodes. When the number of nodes is increased, you will certainly see the Cray moving up the top 500 list -- the architecture is VERY scalable.

  41. Re:Checking with the TOP 500 Supercomputers I find by LostCluster · · Score: 2, Informative

    Cray wants all forms of clusters disqualifed from that list for not being true supercomputers since they can only consider tasks that support multithreading at their full speed. If given a logical task that must be processed serially, the cluster will end up dropping to the speed of its fastest processor. Sure, the rest of the cluster is available to consider other questions... but the point is, it's going to waste some cycles while a true supercomputer would be able to dedicate its entire resources to the task.

    Every task has a maximum number of threads it can be broken into where adding another parallel process threads just won't make it any faster. For some, that number is in the stratosphere and doesn't have to be worried about. However, for others, that number is in the single digits. Those tasks aren't going to be helped much by a cluster that exceeds that number of processors.

  42. Yawn. by adrianbaugh · · Score: 2, Insightful

    For some tasks distributed clusters are better, for others ultra-high-bandwidth Cray-type monsters are better. So what's new?

    --
    "'I pass the test,' she said. 'I will diminish, and go into the West, and remain Galadriel.'"
    - JRR Tolkien.
  43. Strictly He's Correct by 4of12 · · Score: 2

    but practically, the performanc is "high enough" and certainly a helluva lot cheaper than buying a custom system.

    It's just like the old days, except more so:

    Performance = log(Price)
    and you can end up paying a lot of money to squeeze out that extra performance.

    Given that Linux clusters can achieve speeds in excess of a teraflop, that available dollars for computer purchases are finite, and that per processor performance and price performance is increasing, the market size for the world's highest performing machine is rapidly vanishing to a set of measure zero.

    --
    "Provided by the management for your protection."
  44. Still Linux by colores · · Score: 2, Informative

    The Cray XD1 System operating system is Linux

  45. Re:If it walks like a duck, and talks like a duck. by flaming-opus · · Score: 5, Informative

    Cray could easily be at or close to the top of the top500 list, their X1 architecture will extend that far. However, for a lot of really important supercomputing codes, it's no contest: The cray will trounce the clusters (linux or otherwise). Those #19 crays are only 256 processors. To get similar performance a stack of xeons requires thousands of processors. Some tasks just can be split appart that easily.

    A cray processor has eight floating-point units running at 800Mhz. The big Mac cluster (for example) uses G5 processors which have 2 FPUs at 2000Mhz. Thus the cray has a ~40% advantage. However, the G5 processor has ~4GB/s memory bandwidth. The Cray has ~50GB/s memory bandwidth. If you have a problem that needs to do a HUGE amount of math on a tiny amount of data, the G5 will rock. If you have a problem that needs to do a HUGE amount math on a GINORMOUS amount of data, buy the cray. (for a GINORMOUS amount of money too)

    Similaraly infiniband (ala the big mac) is really hot in the cluster interconnect space because it gives 2.5GB/s per node. The Cray gives you 51GB/s.
    You need to move a little data, buy a cluster. You need to move a lot of data, buy the Cray.

    There's no one solution for all problems.

  46. Isn't this obvious? by Slinky+Saves+the+Wor · · Score: 2, Insightful

    Cray is dying. The days of fat government sales have been over for a long time. It's only logical to discredit your competitors, especially if you stand to lose a lot because of them.

    This is nothing new, nor anything special. For instance, if you've looked at the latest computer magazines, Microsoft is doing the same kind of "it sucks" argument to anything related to Linux in a wide front. For example, Apache lost to IIS in a review, and IIS became the Editor's Choice in one magazine. In the next issue of the magazine there will be somekind of "debunking Linux myths" article. (This certain computer magazine is nothing special, even though it has become a nothing short of an unfunny joke paper written by people who don't have a clue. Some of their readers do have a clue, that's why they cancel their subscription.)

    So, to sum it up: it doesn't matter what the reality is, the people who decide only see the image which is created for them. Even if that image is wrong, that's the only thing they decision makers are going to see. They don't have the time or the energy to investigate things thoroughly on their own. This is why Microsoft pays the magazines to write garbage. This is why a Cray executive talks garbage.

    Lobbying is pretty powerful stuff.

    --
    I do not moderate.
  47. Good points but no cigar by Performer+Guy · · Score: 2, Insightful

    A lot of what he said isn't much of a surprise, however definitive statements about clusters not being supercomputers and being unmanaged loose collections of machines are a bit overblown. Management software exist for clusters and they are rather easy to program for with available popular and industrial strength libraries.

    Moreover many HPC applications actually scale quite well on clusters of Linux systems. Affordable interconnect infrastructure is increasing in bandwidth and reducing in latency, further broadening the scope of the problems these clusters can tackle. In addition each node can now comfortably have 2 or four processors giving even better bandwidth between CPUs sharing a node. With 64 bit processors and operating systems now available the final barriers to very impressive easy to use HPC Linux clusters have been removed which is exactly why Cray now sees them as a threat. Now is probably the worst time to talk of how a cluster is not a supercomputer. Clusters form a class of supercomputer that can now handle most supercomputer tasks. True there are classes of problems that the dedicated supercomputer systems CRAY sells will excell at, however clusters are useful workhorses in the supercomputer world and hold their own.

    Todays supercomputer problems are tomorrows computer problems and Cray must continue to find new classes of problems to solve as they always have, rather than attacking competing technologies, people will use clusters where the clusters meet their needs.

  48. Parallel Programming. by jellomizer · · Score: 2, Informative

    True Parallele Programming with computer with over 16 or so CPUs is a slightly different mindset then the way most people program. In PP you can write a sort routiene that runs in O(log(x)) While with one processor system you can only do it in O(x). Most programs today that are threaded tend run a buch of code on one processor and its own memory. That is much the way that linux clusters work, by writting programs that minimalize the amount of comunications needed so then they provide high performace. But crays and the like super computer allows all the processors to comunicate with each other and the shared memory a lot faster. Thus making some algorithms run in Maginatudes faser.
    An example is when I took a course in Parrallel processing we used a MassPar system which had 1024 processors in a grid formation. Now woring on that system I was able to sort a list of a million random numbers way faster then my Duel Processor PC could.
    But on the flip side when I ran a program on the MassPar that wasn't designed parallel processing (emacs) it took upwards of 3 minutes to load it due to the age of the computer. While my PC could open up emacs in a split second. So on the clusters even the fastest in the world a Cray that may not be the fastest could actually beat it on many applications because of the faster bus comunication.

    --
    If something is so important that you feel the need to post it on the internet... It probably isn't that important.
  49. ...the same ole FUD? by heironymouscoward · · Score: 2, Interesting

    You know Goodwin's Law? Well, here is Heironymous' Corollary to Goodwin's Law:

    "Anyone using the terms 'zealot' or 'FUD' in a Slashdot discussion is immediately declared the loser of the thread and discussion stops at that point".

    Of course I'm force to break my own corollary to make this point.

    But to call me a "Linux zealot spouting FUD" (and excuse me for paraphrasing your lucid comment) because I mock a commercial vendor who says that the free alternative is no competition... WTF?

    As it happens: I have 20+ years of experience in IT and I've used every one of those packages (except the Cray). Oracle, MySQL, IIS, Apache, Sun, Solaris, Linux. And hundreds of other platforms, as well.

    My opinions are not those of a zealot, but pretty impartial and generally very accurate. There is a good reason, for instance, why the most critical servers in my business all run Debian Linux, why the desktops use Xandros, why our applications use MySQL, and why we're phasing our out Microsoft/COM+/IIS/SQLServer platforms. Zealotry has little to do with it, but good sense does.

    The facts are these: open source, free, commodity IT has become good and cheap enough to exceed the capabilities (at any price) of many commercial systems. Most specifically, Cray, Oracle, Microsoft, and Sun find themselves spot center of the area that has been commoditized.

    --
    Ceci n'est pas une signature
    1. Re:...the same ole FUD? by heironymouscoward · · Score: 2, Interesting

      Hey, will the real Anonymous Coward please stand up!?

      True. But I spout what are called "stealth opinions", being understated (or even unstated) makes them harder to criticize, and I have the advantage of being able to change opinion in mid-spout to dodge the zealots.

      --
      Ceci n'est pas une signature
  50. Obligatory by Rhesus+Piece · · Score: 4, Funny

    "No, a Kilochicken is a 1000 chickens. You're thinking of a kibichicken. Check it out at http://www.nist.gov" Somebody had to, right? Right?

    1. Re:Obligatory by fatphil · · Score: 3, Funny

      Nope, a kilochicken in such a setup does require 1024 chickens. Only 1000 are active nodes, but 24 are available as hot-swappable replacements.

      --
      Also FatPhil on SoylentNews, id 863
  51. Whatever by hackstraw · · Score: 2, Interesting

    I'd like to see Paul Terry say this in front of everybody at the Super Computing conference where they announce the Top 500 Computers. Its worth noting that he is not bashing Linux per se, but "Linux Clusters", which is pretty arbitrary, because he should be saying "all clusters", because the OS really doesn't have too much to do with it. Supercomputing apps run in userspace, not kernel space, and the hardware, including interconnects or some kind of interprocessor communication drive the performance.

    The Cray XD1 looks like a nice system, but there are only theoretical performance values given, and noone can go out and buy one of these things yet. I also don't know how much these guys cost.

    I love this statement:

    Linux clusters do have a place. "For applications that require low performance, they are a cheaper solution," said Terry.

    Yeah, when we spend a million+ dollars on a supercomputer, we are thinking of low performance, because our applications require it. Thanks.

    I'm guessing this guy is a wannabe marketer who got stuck in a CTO position. There are plenty of HPC vendors out there, and trust me if this XD1 has a good price/performance and they work (this is key), then people will buy them with little questions asked. Otherwise, this whole article is just an advertisement that makes many statements without any evidence that the XD1 is any better than 4 Xboxes connected together over a serial connection. Next....

  52. Re:HPC by Disoculated · · Score: 2
    Well, of course, it's part of what they're selling. It doesn't do them any good to present their supercomputer as "pedestrian". They want to maintain an air of exclusivity to owning one, it lets them pad the bottom line. Lots of organizations like to say "We can do it on our Cray". It sounds more prestegious.

    I bet if you asked a lot of computer dilletantes out there what the fastest computer in the world was, they'd still say "Cray 2".

    Savages.

  53. doesn't this CTO of cray remind u of someone? by MoFoQ · · Score: 4, Funny

    doesn't this CTO of cray remind you of someone?
    "There IS no Linux in high-performance clusters."

    "There IS no Americans in Iraq."

    OMG! It's the former Iraqi mis-Informed-ation minister!

    Especially when 2004 has been dubbed the year of the penguin, it's wreckless to claim that Linux can't be used in HPC's.
    Hell, just look at the current top500 list. There's no Cray in the top 10 but there are two Linux based clusters there (and one based on OSX [FreeBSB based]).

    Here's a few:
    NCSA's IA32 Linux cluster
    NCSA's IA32 Linux cluster
    Space Simulator Clust at Los Alamos (SS51G based; makes me proud as I have a SS51G too)
    Beowulf - used in many Linux clustering projects
    Linux clusters at Los Alamos (they seem to have more than one)
    Virginia Tech's Supercomputer X

  54. How is the XD1, as advertised, any different? by borwells · · Score: 2, Interesting

    I have a rack right here housing a linux cluster. 36 1U dual-Xeon servers. On the Cray XD1 site it details the "Exceptional Performance" of the XD1 system. It details the performance of a system with 12 AMD Opteron processors, and the performance of a rack of systems with 12 AMD Opteron processors. I understand that the underlying architecture of those servers may be vastly different than the servers in the rack next to me, but fundementally aren't they both multi-processor PC servers operating in a cluster? If so why does their rack full of multi-processor systems qualify as an HPCbut mine does not?

    --
    "We can't solve problems by using the same kind of thinking we used when we created them."
  55. FUD and Thunder-Mongering by pragma_x · · Score: 3, Interesting

    Despite assertions made by Linux vendors, a Linux cluster is not a high performance computer, said Dr. Paul Terry, CTO of Cray Canada. "At best, clusters are a loose collection of unmanaged, individual, microprocessor-based computers."

    Although this statement reeks of FUD, he's right about one thing: a cluster is not an HPC... that's why its called a cluster. But to say that a cluster is 'unmanaged' is one hell of a stretch IMO. All in all, he's just arguing semantics: nothing to see here, put down your flamethrowers, move along folks.

    Since this is slashdot, I'll add that the rest of the article is full of choice quotes all of which point squarely at basic FUD + marketing spin for their new cluster-cost-like product.

    It seems to me that Cray is just plain bitter that Linux (through all the cluster solution providers) has managed to steal Cray's thunder at a mere fraction of the cost. Cray's probably even more bitter that folks are willing to sacrifice performance (at least from Cray's perspective) just to save a buck.

    Okay, this is Cray we're talking about here: people are saving millions of bucks all over the place by using clusters instead of big expensive machines.

    And guess who wants 'their' slice of the pie back.

  56. Vector based computing by Listen+Up · · Score: 2, Insightful

    There are certain types of computing which simply cannot be done with microprocessor based platforms including clustering. One of these calculation types is vector processing. A Cray supercomputer is a vector processing based unit. When comparing a cluster of PC systems being used to calculate what a single Cray is designed to calculate, the Cray CTO is perfectly correct in his statement.

  57. You guys are giving cray too much credit by shaitand · · Score: 2, Informative

    I'm seeing alot of single threaded versus multi-threaded arguments.

    That's great and all, but for a single threaded application a cray isn't even going to smash your modern top of the line home pc by too terribly much.

    crays are massive smp systems, they need a multi-threaded app to take advantage just as much as a cluster does. The difference is in the bus speed. A cray has a much faster bus, and with equivelent processing and memory it will excel with a number of small quickly terminated threads, whereas a cluster will as well or better with larger more processor consuming threads.

    Why would a cluster ever do better? Simple, although a cluster has a drastically slower bus, there is memory local to the processor in question so there is much less congestion on the bus, and since if your shelling out for a cluster you will be switching rather than hub style whatever you do there will be almost without collisions and bus contention. Each node has it's own ram so there isn't much of an issue with contention for the bus and much greater memory throughput.

    So like I said, it's all about how fast threads spawn and terminate, because if your rapid firing threads then you will doing alot of communicating between nodes over the slow bus (network), if your sending good sized chunks of data do something and keeping your nodes busy they will spend more time working and less time communicating results and your cluster will tromp all over that cray.

  58. Valid Question, then by Allen+Zadr · · Score: 2, Interesting

    At what price point does the Cray XD1 come in? While huge clusters are (supposedly) cheap individual computers -- I would argue that G5s are not inherantly cheap -- how many G5s that make up the Virginia Tech cluster would you have to get to before you've paid for a Cray XD1?

    I mention this because the article implies that Cray is planning on selling the XD1s at a price point cheaper than equivelant clusters. If they succeed at making the XD1 cheap enough, then it may be more cost effective to [[ effectively, cluster ]] a couple of these Crays, with less power consumption, heat dissipation and plain old real-estate.

    It seems to me that TCO would be cheaper for the Cray, especially considering that the best clusters expect 5% of the member computers to be broken at any given time.

    So, does anybody have Cray XD1 pricing? That, seems to me, to be the only way to rationally decide on the 'better' solution.

    --
    Kinetic stupidity has a new brand leader: Allen Zadr.
  59. Re:Help me here...(OT) by Halfbaked+Plan · · Score: 2

    And it's worth noting, for the uninformed, that the reason Slash puts in the spaces is that there used to be a problem with page-widening fucks screwing up the comments sections with deliberately wide strings.

    It's not a 'bug' in the slashcode.

    --
    resigned
  60. They're all clusters now anyway by Animats · · Score: 2, Insightful
    Supercomputer CPUs are dead. It's been a long time since Cray made CPUs. Current-generation supercomputers are composed of large numbers of commercial microprocessors. The only major exception is the current fastest supercomputer, the Earth Simulator in Japan. It uses custom vector processors. They seem vaguely similar in architecture to Playstation 2 vector units, but they probably are unrelated.

    Every other machine in the top 10 is built from standard processors. The old DEC Alpha, PowerPCs, and IA-32 predominate, with a few Itanium machines.

    Because supercomputers today have several thousand processors, they can't even be big shared-memory multiprocessors. Speed of light lag in the interconnects would slow everything down. It just takes too long for the signals to make it across the room.

    So all supercomputers today are clusters of one kind or another, fast machines with slower interconnects between them. The hardware architecture revolves around interconnect schemes. The software architecture revolves around working around the limitations of the interconnect schemes. Tightly coupled problems don't map well to such machines.

    Bear in mind that we're talking about clusters of uniform machines located near each other with gigabit or better interconnects. We're not talking about "clusters" consisting of spare-time programs out at the end of Internet connections. Those are useful only for problems with almost no coupling between parts. Such problems are usually low hit rate search problems, like cryptanalysis, SETI@HOME, and such.

    Yes, there's the Cray X1, the last of the liquid-cooled monsters, but it looks like the only customers who bought one were Government agencies with old Cray machines.

  61. Wrong Source by Ra5pu7in · · Score: 2, Insightful

    Most readers have the right idea - you don't listen to a competitor's opinion when judging whether something is viable or not. It is very easy to twist the words to be "true" while misleading.

    A cluster isn't a supercomputer, by definition, but for many jobs can be equal or better. In other words: Those 2 oxen cost more, consume more resources, are only useful for the one job (pulling a plow) and only benefit a single owner. Those 1024 chickens cost less, consume less resource, are useful for many jobs besides the one (including laying eggs) and benefit their many owners.

    --
    I was taking one day at a time, but then several days got together and ambushed me. (from a Rhymes with Orange comic)
  62. Provably non-Parallelizable? by mosel-saar-ruwer · · Score: 2, Interesting

    I asked this earlier in the thread: Provably non-Parallelizable?

    Allow me to ask it again: What's the state of the art of proofs of parallelizability [and non-parallelizability]?

    Is there a standard list of problems that have been proven to be non-parallelizable? Are there any problems that have been proved to be parallelizable, but for which no parallelizing algorithm has yet been discovered? Is there anything analogous to the NP-completeness conjecture in this field?