Slashdot Mirror


Time For A Cray Comeback?

Boone^ writes "The New York Times has an article (free reg. req.) talking about Cray Inc.'s recent resurgence in the realm of supercomputing. It discusses a bit of Cray's decline when the Cold War ended, "the occupation" under SGI, and the rebirth of the company after the Tera (now Cray Inc.) purchase. Recently Cray Inc. has been shipping their vector-based Cray X1 machine, designing ASCI Red Storm, and recently was one of 3 (also Sun, IBM) to win a large DARPA contract (PDF link) to design and develop a PetaFlops machine by 2010. Could Cray Inc. be poised for a comeback? Wall Street seems to think so."

75 of 266 comments (clear)

  1. Registration not required by Anonymous Coward · · Score: 5, Informative

    Partner Link

    Posting as Anonymous Coward, please award my Karma to starving children in the world.

  2. Definitely coming back by Hayzeus · · Score: 2, Funny

    Naturally. We have another Bush in the Whitehouse, and I even hear the Wang Chung is making a comeback -- so why not Cray?

    1. Re:Definitely coming back by Frymaster · · Score: 4, Funny
      waitaminnit. cray - the computer of the defense industry during the colde war - is releasing a machine called the "red storm"?

      is there a secret message here? should tom ridge be called?

  3. Ha! Wall Street has more confidence in SCO by Anonymous Coward · · Score: 4, Interesting
  4. Icon is back by aspelling · · Score: 2, Insightful

    Many scientists are very concern about state of supercomputing in US. Hopefully new generation of supercomputers improve this situation.

    1. Re:Icon is back by Smallpond · · Score: 2, Funny

      Oh yeah, real concerned. The top US supercomputer can only do 20 TFLOPS or so. That will never do.

      Imagine a beo...

    2. Re:Icon is back by mOdQuArK! · · Score: 2, Interesting

      Maybe, maybe not. I don't really think even the NSA is _that_ far ahead of commercial process technology. It's more likely that they do custom designs for whatever applications they need, which allows them to process their data much faster than any general-purpose setup.

    3. Re:Icon is back by CausticWindow · · Score: 5, Interesting

      I remember a story from a NSA contract worker.

      In the early days of Cray, he and many others were wondering how they could keep things running, considering that their official budgets only showed ten or so sales per year.

      Until he got the tour of the NSA computer plant, where they had a hall the size of two football fields, filled with Crays.

      --
      How small a thought it takes to fill a whole life
  5. Petaflops by 2010? by Pope+Raymond+Lama · · Score: 5, Funny

    Of course I expect that...in my Playstation IV,
    equipped with an opto-quantic Emotion Engine VI
    and a couple petabytes of holographic storage.

    --
    -><- no .sig is good sig.
  6. Definately by Anonymous Coward · · Score: 4, Informative

    There are still MANY applications for supercomputers. A lot of people think that linux/beo-clusters are going to be replacing supercomputers of the Cray/NEC/IBM variant. Not true. There are still many research, scientific, and military applications that require machines developed not for "slow" distributed number crunching, but require ultra high speed processor and memory architechtures.

    So definately, time for Cray to come back and retake the supercomputer industry crown.

  7. Re:explain by Alien+Being · · Score: 3, Informative

    memory bandwidth

  8. Re:explain by Tumbleweed · · Score: 3, Funny

    It's SUPER! Off-the-shelf components are just kind of "Meh."

  9. 2010? by stratjakt · · Score: 5, Funny

    There's a whole bunch of PETAFlops outside of McDonalds right now having a sit in and screaming about how fur is murder.

    I had to literally step on their faces to get a Big Mac.

    --
    I don't need no instructions to know how to rock!!!!
  10. Correct me if I'm wrong ... by SuperDuG · · Score: 5, Insightful
    ... but wouldn't the fact the market for supercomputers isn't exactly that large. I mean you've got governmental contracts (research, educational, who knows what) that have to take up 95% of all the purchases made, and then a small private market. I mean how many companies are striving for a petaflop machine to run their database server?

    If you look at the list of top 100 supercomputers, there are systems that are almost 15 years old or even older (not sure on a few). I know these take years to build and are multibillion dollar projects, but between time has got to be a killer.

    Then there's the question of ... what do you need a supercomputer for? The applications are pretty limited for a need for a petaflop computer, unless your doing mass storage, cryptography (cracking), or simulations.

    Don't get me wrong I'm all about nuclear testing being done in 1's and 0's instead of in the ocean or in the desert, but how big of a bomb do you really need when it's estimated theres enough nukes to blast the entire land surface of the earth 3 times over.

    --
    Ignore the "p2p is theft" trolls, they're just uninformed
    1. Re:Correct me if I'm wrong ... by MxTxL · · Score: 4, Insightful

      Then there's the question of ... what do you need a supercomputer for?

      To advance the state of the art. And not just in the field of computers, but also in any field that ends up benefitting from this. Which is potentially very many. Aerospace, geology, meterology... there are BUNCHES of fields that greatly benefit having more and more massively powerful computers. Sure, most projects can't afford to have the latest and greatest of the state of the art in supercomputing, but the fact that the state of the art progresses will push prices down on the older technologies that most labs CAN afford. This is a benefit for science as a whole.

    2. Re:Correct me if I'm wrong ... by anzha · · Score: 4, Interesting

      There are other uses too. Consider: the weather guys that are working on the global warming and other climate modeling want a 500 petaflop sustained speed, massive memory machine to get the granularity that they want.

      BTW, what's the 15 YO machine? I can't think of any...certainly not ones that are still in the Top 500. Hell, the ones I worked on 10 years ago, you can nearly buy the floppage on the desktop now...

      As an interesting aside, the DARPA contract is out in part because they think the traditional drivers in computing speed are going to peter out around 2010...the implications of that are definitely interesting, no?

      --
      Do you know why the road less traveled by is littered with the bones of the unwary?
    3. Re:Correct me if I'm wrong ... by agurkan · · Score: 4, Interesting

      Nuclear simulations are used to see if the warheads are still effective after not being used for long times, not to see if they'll wipe out a city right after they are produced.

      --
      ato
    4. Re:Correct me if I'm wrong ... by Doesn't_Comment_Code · · Score: 5, Funny

      Then there's the question of ... what do you need a supercomputer for? The applications are pretty limited for a need for a petaflop computer, unless your doing mass storage, cryptography (cracking), or simulations.


      You're missing the big picture...

      Massive multiplayer Quake on a 614,400 x 819,200 screen.

      Thank you Cray.

      --

      Slashdot Syndrome: the sudden, extreme urge to correct someone in order to validate one's self.
    5. Re:Correct me if I'm wrong ... by Pharmboy · · Score: 4, Interesting

      Don't get me wrong I'm all about nuclear testing being done in 1's and 0's instead of in the ocean or in the desert, but how big of a bomb do you really need when it's estimated theres enough nukes to blast the entire land surface of the earth 3 times over.

      Well, the earth is over 2/3rds covered with water, and now we have the technology to reach the moon, mars, venus and beyond. Remember the spectical when a comet hit Jupiter? Just imagine a Beowulf of those, but really big nukes instead :D

      On a more serious and less morbid note, I bet some other uses exist in physics, medicine and even cosmology. I even hear where they compare 'potential' cures for diseases using computer modeling to design drugs that we don't yet know how to make, good old biotech. You are correct that yes, this IS a very very limited market, but when you sell them for a billion bucks each, you don't need to match Dell's volume to make a profit. I wouldn't be suprised if the technology leads to some advancements in our pitiful micro world as well.

      --
      Tequila: It's not just for breakfast anymore!
    6. Re:Correct me if I'm wrong ... by morcheeba · · Score: 5, Informative
      Yep, you are a bit wrong... (you didn't think a challenge to the slashdot community would go unnoticed?!)

      From this site, you can see the breakdown by organization:
      Usage..... Count Share Rmax Rpeak Procs
      Industry... 202 40.4 % 82398 182964 62869
      Research... 131 26.2 % 187689 278030 120046
      Academic... 115 23 % 77143 133564 45216
      Classified.. 27 5.4 % 14167 20691 12892
      Vendor...... 22 4.4 % 11033 15545 5230
      Government... 3 0.6 % 1317 2256 528
      Total...... 500 100 % 373749 633052 246781
      There are a lot of companies that use supercomputers, although maybe not the type you're thinking of. Of course, there are the number-crunchers: oil companies are big users (to crunch data & find new oil), and car companies (BMW). But there are also the transaction-processors, like SprintPCS and Ebay (used to be in the top 500), that make the list just by the sheer number of connected processors.

      Here's the latest list
    7. Re:Correct me if I'm wrong ... by Pharmboy · · Score: 2, Interesting

      You're dramatically overestimating the size of the market. Cray's own website puts it at about $1.1bn worldwide, and it's not like cray will get 100% market share. The ongoing R&D costs are a staggering percentage of their revenue, to the point that if the NSA wasn't subsidizing them it's unlikely they'd be alive today. The same goes for other pure supercomputing ventures--without huge amounts of government largess they're sunk.

      Well, exaggerating to make a point perhaps. I checked out their website as well, looking for more technical info on their servers, with no luck. Personally, I can see more of a market in the future than even the past. Their systems do certain things faster than beowulfs, and frankly, for some govt. agencies/companies faster is more important than cheaper. It would not shock me to see the pendulum swing in the other direction, to at least a degree. Obviously Tera thinks so, too, since they purchased them in 2000, and profits/sales ARE up...

      --
      Tequila: It's not just for breakfast anymore!
    8. Re:Correct me if I'm wrong ... by adam872 · · Score: 2, Informative

      I work in the Oil&Gas business and we use Linux clusters (and in the past bloody large Sun, IBM and SGI systems) for seismic processing and reservoir simulation. These particular problems are DSP and FP intensive and also can require a fairly large amount of memory to run. They are exactly the kind of commercial workload either a supercomputer or cluster can chew on.

      Some of our customers (I work for a company that writes the software, amongst other things) have upwards of 100TB of 3D Seismic they want to process. These jobs can take weeks or months to run. The simulation jobs can take days as well. Obviously having a big computer or tight cluster of lots of small ones will help decisions get made faster and/or more accurately.

      There are other examples too: I met a gentleman who works for the lab that does crash simulation for Porsche, Audi and VW. Another example would be an ex-boss of mine who went to work for an engine manufacturer, who used a couple of SGIs to simulate the bore and stroke in a cylinder. The simulation took several weeks to run. They need large computers to do this too. So there is a market for these machines.

  11. Re:explain by Moeses · · Score: 2, Informative

    Bandwidth.

  12. Re:explain by Doesn't_Comment_Code · · Score: 5, Informative

    Well, a well engineered supercomputer has much less overhead than a cluster. One superfast processor doesn't have to deal with interprocessor communcations like a cluster does.

    And if your supercomputer has multiple processors, they are generally made to cooperate nicely to speed efficiency. Whereas a cluster has to go through ethernet and hardware layers to communicate between nodes. Granted that is fast, but on-board communication is faster.

    It seems strange, but a multiple processor computer can actually perform a task slower than just one processor working on the problem if the program and os aren't designed well. So a lot of the value of a supercomputer comes in its design, and the reputation of the manufacturer. And Cray is pretty reliable in my book.


    But the REAL key to the potential comeback of the Cray computer will be whether or not it still has cool bubbles! Wow!!! Cray computing... the inventor of case mods.


    --

    Slashdot Syndrome: the sudden, extreme urge to correct someone in order to validate one's self.
  13. Resurrection, not come back by Anonymous Coward · · Score: 2, Insightful

    Cray died. Anything else is just bartering on his name.

  14. Re:explain by Anonymous Coward · · Score: 5, Funny

    can someone explain to me what the benefit of a moving van is compared to buying a fleet of pintos?

  15. Re:explain by anzha · · Score: 5, Informative

    Memory to processor feeding: std ots processors are often idle because the memory subsystem cannot feed the processor fast enough. This is bad now. It will be getting a lot worse.

    Interconnections between processors: this goes beyond merely processors on a board, but between boxes. The bus architectures out there for the std ots hardware get saturated very quickly. This gets worse between boxes. In addition the latency on Myranet and Quadrics (compared to what Cray et al do) is horrible even if it is excellent compared to ethernet.

    Problem set vs architecture: Not all problems map out well to clusters, or even SMP boxen. Some map best to vector machines. Some map best to tightly integrated MPPs. Some map out to moderately tight clusters. Some are just plain 'embarassingly parallel'. Others are highly threaded and don't work well on vector or scalar machines. etc, etc. The architecture ought to match the problem set.

    MTBF: Mean time between failures. Commodity hardware goes kaputt much more often. A cluster capable of teraflop performance of custom hardware tends to need constant and evil levels of care and feeding: ie you better have a grad student on roller blades.

    Those are just off the top of my head. I am sure that others will Tell you others before I can post again. ;)

    Summarized: bandwidth, latency, problem set, and failure rate.

    HTH.

    --
    Do you know why the road less traveled by is littered with the bones of the unwary?
  16. Sun Enterprise 10000 by DNS-and-BIND · · Score: 2, Informative

    Didn't Sun basically buy out or hire away a bunch of Cray, Inc.? I always heard the E10000 was actually a Cray product. Oh, and just to brag, I have a blue jacket with a picture of a Y-MP-90 on the back with the words, "CRAY - WORLD'S FASTEST SUPERCOMPUTERS". Too cool for words. Ebay rules.

    --
    Shutting down free speech with violence isn't fighting fascism. It IS fascism!
    1. Re:Sun Enterprise 10000 by DNS-and-BIND · · Score: 2, Funny

      Sorry, I don't know where to get more of them. You're out of luck.

      --
      Shutting down free speech with violence isn't fighting fascism. It IS fascism!
    2. Re:Sun Enterprise 10000 by putaro · · Score: 4, Informative

      The E10000 is a Celerity product. Celerity was an independent Unix box maker back in the 80's with their own processor architecture. Celerity went bust trying to bring a "minisupercomputer" version of the architecture to market in about 1987 (33 MHz, whoo hoo!). The assets and technology of Celerity along with the design team in San Diego were acquired by Floating Point Systems (FPS). FPS brought the system to market and made the transition to a SPARC based architecture (66 MHz) before going bust. The assets and technology of FPS along with the design team in San Diego and now the manufacturing team in Beaverton were acquired by Cray. Cray did a couple of turns of the crank on the FPS product and sold it as a "business supercomputer". When Cray was acquired by SGI, SGI wanted no part of the SPARC business and sold (yes, again) the San Diego design team (and I think the Beaverton group) to Sun who finally brought a SUCCESSFUL product to market with the E10000.

      But it's still the same core team down in San Diego, so I like to think of the E10000 as being a Celerity product.

    3. Re:Sun Enterprise 10000 by laird · · Score: 2, Informative

      Quite a few of the people working on the E10K were from Thinking Machines Corporation. TMC was Danny Hillis' company that introduced massively parallel supercomputing. The first generation machine was a Symbolics workstation coordinating up to 65,536 single-bit CPU's connected by a hypercube network. Each CPU was fairly slow, but there were tons of CPU's and CPU performance was balanced nicely with network throughput (whereas most MPP machines have fast CPU's starved for data). Weird, but also astoundingly fast. Anyway, more relevant to Sun, the last generation machine from TMC was based on UltraSPARC's with custom FPU's (128 MFLOPS per compute node, which was cool at the time). I don't think that there was an upper limit on the number of CPU's, but the biggest I saw (I worked there for a few years) was 4,096 compute notes, and a few hundred storage nodes. Anyway, TMC ended up getting out of the hardware business (check out think.com), and Sun hired quite a few of the engineers (who knew how to build an MPP SPARC-based machine, with compilers, etc.) which rolled into the E10K nicely.

  17. Re:Ha! Wall Street has more confidence in SCO by Anonymous Coward · · Score: 4, Funny

    SCO vs Nike

    Look at me, I'm a stock analyst!

  18. Cascade Link: Karma Whoring by anzha · · Score: 2, Informative

    The home page at Cray for the Cascade project.

    There are some interesting PDFs there. Chew, mull, and consider.

    Also consider what Horst Simon, head of NERSC said here too.

    --
    Do you know why the road less traveled by is littered with the bones of the unwary?
  19. That's good by Anonymous Coward · · Score: 2, Funny

    Maybe now that there's once again a major player in the computer market with machine casing designs even SILLIER than Apple's, the rest of the Geek Community will give us a little slack..

  20. Re:explain by fgodfrey · · Score: 5, Informative
    As other replies have posted, bandwidth is the big issue. And by bandwidth, we are talking bandwidth of the processor to memory. Cache is great and all, but if you are stepping through gigabytes of data (or in some cases terabytes of data), your problem isn't going to fit in cache. The speed of your processor will then be dominated by the speed at which it can get to main memory. On a PC, that's slow. What's even slower is when you have to exchange data to a remote node in the cluster. Current massively parallel supercomputers (which is pretty much all of them) have phenomenal bandwidth between processors and memory and between nodes.


    Second, (yes, I work for Cray so now I'm going to put in a sales pitch :) our processors are vector processors. As such, you can hide a lot of the latency of getting to memory by queueing up 64 loads at once. Short length vectors are what is used by MMX and Altivec to accelerate graphics. With sufficient vector operation chains, you can keep the processor busy all the time. You can't do that on a PC. I've heard (no, I don't have actual links to articles) that 10% of peak performance on a cluster is considered really good. Our customers wouldn't consider that anywhere near "really good".


    Finally, there's memory. Lots of it. A single system image supercomputer can have terabytes of memory in one kernel image. You're simply not going to get that in a single PC cabinet.


    Finally, in case anyone doubts that vectors, big memory, and large bandwidth can make a good system, the fastest machine in the world right now is the Japanese "Earth Simulator" machine which is an NEC SX machine. That is somewhat similar in architecture to a Cray in that it has large bandwidth and vectors.

    --
    Go Badgers! -- #include "std/disclaimer.h"
  21. Re:explain by virtual_mps · · Score: 5, Interesting
    MTBF: Mean time between failures. Commodity hardware goes kaputt much more often. A cluster capable of teraflop performance of custom hardware tends to need constant and evil levels of care and feeding: ie you better have a grad student on roller blades.

    Hahahaha. Have you ever actually run a supercomputer? They tend to have much higher failure rates then normal servers. Couple of reasons: first, they push the envelope of a given technology. The sweet spot for stability is not the leading edge. Second, they're not nearly as well tested as mainstream hardware. On a platform with thousands of installations you're much less likely to run into a problem nobody has seen before than you are on a platform with only dozens of installations.
  22. I'm confused... by Tumbleweed · · Score: 2, Funny

    Why do people buy those really expensive supercomputers, when they could just buy an Apple one instead? They're much cheaper!

    1. Re:I'm confused... by Mikey-San · · Score: 3, Funny

      Because there are more games for the Cray than the Mac.

      (I'm a Mac user. I get to make this joke.)

      --
      Mikey-San
      Karma: +Eleventy billion (mostly affected by watching Celebrity Jeopardy)
    2. Re:I'm confused... by commodoresloat · · Score: 3, Funny

      I don't get it either. I've been sitting in front of this Cray supercomputer trying to copy a 17M file from one hard drive to another and it's taking forever. BBEdit has slowed to a halt....

  23. Re:explain by Arker · · Score: 3, Insightful

    Other posters have already pointed out the bandwidth issues over and over, so I'll skip that obvious difference.

    The fact is that not all problems are suitable to parallel processing. Sometimes you really need to know the outcome of one operation before you can go on to the next.

    Beowulf clusters really suck on problems where that applies. Cray style supercomputers shine on them.

    --
    =-=-=-=-=-=-=-=-=-=-=-=-=-=-
    Friends don't let friends enable ecmascript.
  24. It's also about better (not just faster) computing by binaryDigit · · Score: 3, Insightful

    Don't just think about solving a static problem faster, it's also about solving a problem better through the use of more variables. Take weather simulation. If having too many variables stretches todays forcast into next week, then it's useless. So you limit the amount of variables to come up with a "close enough" forcast in a more timely manner. With a faster computer, you can get a more accurate simulation in a more reasonable time period. This increase in accuracy/complexity is then useful in many fields.

  25. Before we all get sentimental... by taradfong · · Score: 4, Interesting

    ...isn't 'Cray' today about as 'Cray' as the company that now owns 'Atari'? What's left besides the name of the original company?

    --
    Does it hurt to hear them lying? Was this the only world you had?
  26. Re:explain by anzha · · Score: 2, Interesting

    Have you ever actually run a supercomputer?

    You know, that's kinda funny, since it's my current job. ;) I'm a NERSC employee. :P

    You're right, until the the system hits maturity. Our T3E before being retired had a lot less hardware problems than our linux cluster does. Or the SP3 we have for that matter.

    BTW, since it's rather hard to find a job these days for some people in the computing realm, we're hiring.

    --
    Do you know why the road less traveled by is littered with the bones of the unwary?
  27. Comparison of supercomputers to desktops by baryon351 · · Score: 2, Interesting

    OK this is about as much a kiddy thing as how many VWs fit inside a football stadium or something, but... ...anyone know of a site with info on how current and past supercomputers compare to current desktops? Where are we at now with 2GHz G5s and 3.3GHz P4s, relatively?

    One of the comparisons made when I was at university was of a 30-something MHz 386, with a supercomputer from 1973, showing how they do about the same amount of processing/data transfer but in completely different ways. I found that fascinating

  28. Re:explain by Pieroxy · · Score: 4, Interesting

    Well, almost. Let's say I have a plane that can accomodate 100 people and does NY->London in 6 hours.

    My problem is that I have to move 1000 people from NY to London

    Now I can either:

    1. I can buy a plane that is 20 time faster, 20 times more expensive. That's the supercomputer
    2. I can buy 9 other planes (same as mine) and accomodate the same results as in 1 for less than half the price (I'll let you do the math). That's the cluster.
    3. I can buy a plane that has a capacity of 1000 people. That's the parallel supercomputer. But if that one can do the deal for my specific problem, it proves to be not that flexible if my problem changes (ie: 500 people NY->London and 500 people from NY->LA).

    That's the power of the bewolf cluster!!!

  29. Comeback? by virtual_mps · · Score: 5, Insightful

    Probably not. Cray made some money back when a supercomputer was something that an ordinary company might need. The capabilities of "normal" computers was much more limited then today, so there was a much higher percentage of the buying public likely to want something more. These days the vast majority of users are happy with something mainstream

    But, you ask, isn't there a lunatic fringe who wants more power at any price? Well, the lunatic fringe ain't what it used to be. During the heyday of cray you got a damn fine box and nothing else. Cray didn't want to worry about your software--or even an OS. A person who needed the speed would plunk down the money for the box and then pay a couple of guys to code everything from scratch. Those days are gone--software is the driving factor these days, and people are far less willing to buy something that's going to force a total code rewrite. Especially if that thing is only going to buy them a couple of years of edge before they need to recode for the next best thing.

    Then there's the question of whether cray can afford to be bigger. The answer is "probably not". If you sell to a lot of customers you need a huge support infrastructure. Cray doesn't have much of one anymore, so they'd need to buy one. (Most of the old support guys left one way or another when SGI came in, or stayed with SGI.) If you have a lot of customers you can spread the costs around, but in the case of a company like cray a support infrastructure means having a people sitting around most of the time in every region you sell a machine. Maybe two to four guys per system (24x7, right?) plus some sorta warehouse facility if you enter a new geographical market. That's expensive. You can bill a lot of that cost back to the customers, but that just makes your systems less competetive.

    I think the long term answer is that cray will be a very small niche player, selling to a very select group of (U.S.) government agencies, with the occasional pro forma business customer thrown in so the company can issue press releases. Even most government facilities aren't in a position to buy a cray anymore. (Research money is fairly tight, recoding costs are prohibative, MTBF's are more of an issue then they used to be, etc.)

    1. Re:Comeback? by VoidEngineer · · Score: 3, Interesting

      Hmm... I'm not entirely convinced by your arguments. However, I do agree with you that "during the heyday of cray, you got a damn fine box and nothing else."

      My thinking, however, is that the same is true today and for all of the top 100 supercomputers in the world. That is to say, each one of those machines is a custom hardware installation, and my educated guess is that software still isn't the driving force in the supercomputing market. Rather, algorithms are the driving force. The supercomputer market is geared towards people who want to very specific tasks, very acurately, and very fast. Example applications might be calculating fourier transforms (spectroscopic analysis), mendelbrot sets (weather simulations), prime numbers (cryptography), and statistical derivatives (markets). Any of these types of applications could feasibly require only a few thousand lines of code... At the same time, however, any of these applications are fully capable of utilizing as much hardware resources as you have available...

      The problem is the magnitude at which these few lines of code need to be repeated. Furthermore, each of these types of algorithms can give qualitatively different and more robust results at each order of magnitude increase in speed... thereby creating a driving market force for upgrades.... We have a computer that can predict the weather 48 hours from now? Well, give us a computer that's 10 times as powerful, and we'll predict it 56 hours from now... Give us one 100 times more powerfull, and we'll predict the weather 62 hours from now, and so on, and so on... The point I'm trying to make is that the software isn't the driving force behind these supercomputers... the algorithms are... and the optimized hardware is what the organizations are paying hard cash for, in order to calculate those algorithms fastest.

      Remember, we're talking about supercomputers here... we're certainly not talking about super-electronic-typewriters, super-spreadsheet-applications, super-databases, super-webservers, super-videoeditors, etc. etc. Nor are we necessarily talking about super-von-neuman machines, super-turring-machines, or super-mainframes. We're talking about supercomputing and the Cray corporation... the company historically responsible for building the machines which simluated the weather and nuclear explosions for many years... I suspect that there are not many end users of such machines and that user interface software is kept at a minimum... ;-) Furthermore, I also suspect that if Cray Inc. built a zettaflop or yottaflop abacus and provided instructions on how to simulate the weather, people around the world would abandon their computers and begin taking abacus lessons... Remember, it's all about the hardware and algorithms in supercomputing...

      But, I'm not a physics or computer science major, so what do I know... That, and I'm beginning to ramble... just my $0.02 worth...

    2. Re:Comeback? by Rasta+Prefect · · Score: 4, Insightful
      Probably not. Cray made some money back when a supercomputer was something that an ordinary company might need. The capabilities of "normal" computers was much more limited then today, so there was a much higher percentage of the buying public likely to want something more. These days the vast majority of users are happy with something mainstream

      Cray has never sold computers that are anything like a normal company would need. Cray machines are made for heavy number crunching - Vector processors are made for simulation tasks. They're very good at them. However they perform abyssmally at most other tasks - buying one for use as say, a database or application server would be stupid.

      But, you ask, isn't there a lunatic fringe who wants more power at any price? Well, the lunatic fringe ain't what it used to be. During the heyday of cray you got a damn fine box and nothing else. Cray didn't want to worry about your software--or even an OS.

      Last time I checked Cray shipped UNICOS with their machines. It's a fairly BSDish UNIX variant. It's a bit of an oddball, but not all that much more of a PITA than say, IRIX or AIX. Want to port your beowulf apps? No problem! When I spent a summer working on a T3E all of our multi processor apps used MPI. Vectorization of C and FORTRAN apps is largely taken care of by the compiler. So wheres all this programmer investment you're talking about? Most of the kinds of apps that you're going to run on a Cray (Weather models, crash simulations, Gaussian for chemical sims, etc) already run on a Cray, and you're probably going to be modifying them anyway.

      I think the long term answer is that cray will be a very small niche player, selling to a very select group of (U.S.) government agencies, with the occasional pro forma business customer thrown in so the company can issue press releases. Even most government facilities aren't in a position to buy a cray anymore. (Research money is fairly tight, recoding costs are prohibative, MTBF's are more of an issue then they used to be, etc.)

      Cray isn't in the selling large business systems. Cray is, always has been, and likely always will be a competitor in the scientific computing market. Yeah, this means they're not going to be a Sun or IBM that sell to business customers for business needs, but that's not the sort of company they're trying to be so the comparison is pointless. They're selling machines to people who need to do heavy duty number crunching. This means Universities, government agencies and large companies doing lots of product research. Typically the cost of using these sorts of machines is spread around - frequently instead of buying the machine, you'll go to a company like Network Computing Services and buy time on a machine. It works out well. There will always be a certain number of organizations that need this sort of heavy duty computing power, and Cray will be there to serve them.

      --
      Why?
    3. Re:Comeback? by virtual_mps · · Score: 3, Insightful
      Cray has never sold computers that are anything like a normal company would need. Cray machines are made for heavy number crunching - Vector processors are made for simulation tasks. They're very good at them. However they perform abyssmally at most other tasks - buying one for use as say, a database or application server would be stupid.

      I don't recall saying that cray was trying to sell general business machines. But even for scientific applications, the number of customers who need a cray as opposed to being able to use a commodity cluster is much lower then the number who needed a cray instead of an IBM 360. There are businesses out there who use computers for more then spreadsheets and web servers. By "ordinary company" I meant to draw attention to that part of the market whose budget isn't classified.

      Last time I checked Cray shipped UNICOS with their machines. It's a fairly BSDish UNIX variant. It's a bit of an oddball, but not all that much more of a PITA than say, IRIX or AIX.


      I guess you didn't do much porting of mainstream applications to a cray. The lack of virtual memory, the funny type sizes in C, and other things that application writers make assumptions about (things that aren't technically guaranteed to work in ANSI C but do work on every other system in the world) could make porting a real problem. Things have gotten a lot better, but I can assure you that a unicos port of, say, perl or gcc was not in the same league as an irix port of the same app. One of the things cray is finally bowing to is the demand for virtual memory. Seymour never wanted it (didn't want the performance hit) but it's real hard to sell that in today's marketplace. The question is how much cray can back off of its old "speed is king" philosophy when their whole business is making fast computers.

      Want to port your beowulf apps? No problem! When I spent a summer working on a T3E all of our multi processor apps used MPI.

      You've kinda missed the boat. The point of the cutting-edge cray supercomputers isn't to run mpi apps--those do quite nicely on commodity clusters. The T3E is a MPP super--not a vector super. It's where cray was 10+ years ago, not where they want to be tomorrow. The point of cutting edge is to create new paradigms. That definately helps your performance, but it kills your compatibility.

      Vectorization of C and FORTRAN apps is largely taken care of by the compiler.

      Wow. Let's just say that when you're on the kind of project that can command the state of the art you don't depend on compiler autoparallelization.

      So wheres all this programmer investment you're talking about? Most of the kinds of apps that you're going to run on a Cray (Weather models, crash simulations, Gaussian for chemical sims, etc) already run on a Cray,

      Please, read up on the tera system, for example, and try to understand how it's different from a T3E.
    4. Re:Comeback? by virtual_mps · · Score: 4, Insightful
      My thinking, however, is that the same is true today and for all of the top 100 supercomputers in the world. That is to say, each one of those machines is a custom hardware installation,

      Yes and no. The problem is that a cray box has to cover the whole R&D cost for an entire system. When IBM sells you an SP2 most of the R&D is spread across their much higher volume business lines. Same with an intel based cluster--the technology specific to the HPC market is basically the interconnect, and the rest is subsidized by video game players. There's also the compiler cost (you don't sell many fortran compilers outside the scientific market) but the salaries for a few compiler writers is much lower than the cost of desiging a cutting-edge cpu from scratch.

      At the same time, however, any of these applications are fully capable of utilizing as much hardware resources as you have available.

      That's always true. The question is whether they can use the resources efficiently, and whether the cost/op is competetive. You're right about the algorithms being the driving force, but I'd argue that it is unusual for an algorithm that's optimized for one architecture to run optimally if you move it a radically different architecture. People can spend years trying to squeeze a couple more percent out of their code, and they don't want to start from scratch unless there's a very good reason. Then there's the problem that researchers tend to not work in a bubble. Even if you can afford to buy the most expensive machine on the block you might end up shooting yourself in the foot if nobody else in your field can collaborate with you.

      user interface software is kept at a minimum

      You've got that right--most of the examples I've seen are pretty...spartan.
  30. Re:explain by taradfong · · Score: 4, Funny

    You can't haul the A-Team around in a Pinto.

    --
    Does it hurt to hear them lying? Was this the only world you had?
  31. It's been a while since Robert Cray had a CD! by Newer+Guy · · Score: 2, Funny
    He's a great musician! It's been a long time since he had a CD released. Probably due to the RIAA cutting back on CD releases. Well, it's long over - wait....

    Oh!.... that Cray!

    Never mind!
  32. Re:explain by virtual_mps · · Score: 3, Interesting

    Our T3E was having problems well past the point where it was getting long in the tooth. Cray started adding functionality to make it more supportable a few years back, but when it was actually a cutting edge system it was pretty unstable. They probably couldn't widely sell a system today that had the problems of the earlier T3E's (one hardware problem and you need to reboot the whole thing) but that just increases the development costs and time to market in a market where delay means that the peasents will be nipping at your heels. Remember, by the time a super hits maturity, it's obsolete.

  33. Gimme by Cyno · · Score: 4, Funny

    My next couch should be a Cray..

  34. Classified? Re:Correct me if I'm wrong ... by SpikeSpiff · · Score: 4, Insightful
    To me, the 5.4% classified is improbable. The same defense establishment that kept the $100s of millions stealth fighter secret for five years can certainly keep multi-million dollar computers secret.

    Especially because it's so much easier to hide a computer than an airplane. No sightings in area 51....

    We have to assume that the state of the art is way past the public data. Cray has a "lousy" $150 MM in yearly revenue. They could be spending 10X that on heavy computing for national security. The government is spending $25BB on intelligence and another $400 BB on defense every year. Cray could be a drop in the bucket, even a red herring. I'd love to know what is going on in the basements at Fort Meade.

    --
    "All that is required for evil to triumph is for good men to do nothing." - Edmund Burke
  35. Economics of Scale by dprice · · Score: 4, Informative

    In the 1970's and 1980's, Cray and other supercomputer companies fit in the niche of "fastest computing at any cost". The design cycles were long for the specialized hardware that pushed the boundaries of the available technology. Companies and government agencies were willing to pay the high price since there was enough processing speed difference between the supercomputers and the "vanilla" computers.

    By the early 1990's, the "attack of the killer microprocessors" came. The PC class processors were still weak, but the higher dollar RISC processors used in workstations, like Sun, were reaching performance levels close to what the supercomputers were able to deliver. Since they were based on higher volume and more standardized processors, the price/performance of the RISC workstations started eating into the mainframe and supercomputer market. Many of the supercomputer companies died off, and some started to incorporate RISC processors into their designs. By the mid 1990's I believe that Tera and Cray were the last remaining old-school supercomputer companies left. The rest either died or were absorbed into other companies.

    Today, the investment required to produce the fastest processor chips is so high that it requires large unit volumes to pay for the cost of development and production. The PC class processors, with their high volumes, are putting pressure on the old style workstation market, where each company makes their own processor (SPARC/Sun, PA-RISC/HP, Alpha/DEC). We see Sun struggling as the PC's eat their market. Even some large scale supercomputers are based on the PC processors. The majority of the computer spectrum from low to high end is based on the same families of processors (Intel, AMD, PowerPC).

    So that brings us to Cray/Tera. Cray seems to go against the economics of scale that drive the rest of the computing industry. What keeps them running is a small niche that the government is willing to keep funded. It is similar to the funding of exotic bombers and fighter jets. We probably won't see Cray grow much larger than they currently are. They be kept running since they form a critical part of the national security, at least that is what the government believes.

    1. Re:Economics of Scale by tesmako · · Score: 2, Insightful
      What you are missing is that Cray really does have a niche that PC processors cannot at this time touch, vector processors. Having insane performance at vector tasks with a somewhat specialised vector processor is a lot easier than with a general purpose mips-descendant. It is not an all that highly competitive niche but is highly profitable (if you have a vector-heavy task a modern Cray vector-processor is not only extremely fast, it is even price efficient at that speed). Lets not forget either that Cray holds a lot of neat patents (most interesting are their compilation technique patents) for vector-processing problems.

      Other things interesting to note is that old Cray is not only keeping the company "Cray" afloat, to some part it is a division from Cray that is making Sun the most money these days too. The extreme SMP machines from Sun (think 106 processor Fire 15K) is created by a division of the company that Sun bought from SGI when SGI bought Cray, Cray toyed with Sparc SMP's back in that day and SGI felt a bit uncomfortable dealing with sparcs so they sold it off cheap. The best purchase Sun has made in the last decade.

      All in all I am sure that Cray has a lot to offer, they have shown off their technical skills many times in the past and the technology has aged quite well for this business.

  36. The trick is keeping ahead of the commodity guys by putaro · · Score: 5, Interesting

    Supercomputing per se died because Intel, DEC, IBM/Motorola had a lot more money to throw at speeding things up than the supercomputing community.

    In the 70's up until the early 90's it was possible to build a custom CPU out of discrete logic that ran significantly faster than the available microprocessors. Cray was able to push their clock cycle down into the nanosecond range through clever design. However, a 1ns clock rate == 1GHz. You can go buy that multi-million dollar CPU for a couple of hundred bucks in today's market.

    In order for superocmputing to be viable you have to be able to provide quantum leap performance above the commodity hardware AND keep your cost/performance ratio in line as well.

    The CRAY-1 came out with a clock speed of about 80 MHz and vector processing and high memory bandwidth at a time when mainstream systems like the PDP 11/70 were running at about 7MHz with a 1MB/s memory bus. Microprocessors weren't even't a joke compared with the Cray.

    The new Japanese NEC supercomputer came with a price tag of about $160 million if I remember correctly (some estimates say that it took $1G in research funding) and hits 35 TFlops (sustained). #3 on the Top 500 supercomputers list is a Beowulf cluster with 2304 processors coming in at 7.6 TFlops (sustained). Even figuring $2000/processor + interconnect, that puts the Beowulf cluster at around $5 million or 1/32 of the cost for 1/5th of the performance (roughly speaking).

    There are other factors, of course, but the key is that for the supercomputer to stay ahead of the microprocessor a boatload of funding is needed for the supercomputer and the payoff just isn't really there. If it was a lot more supercomputer companies would still be in business.

  37. Re:explain by terrab0t · · Score: 4, Funny

    "Well, a well engineered supercomputer has much less overhead than a cluster. One superfast processor doesn't have to deal with interprocessor communications like a cluster does."

    I like the way Cray put it:

    "If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?"
    - Seymour Cray (1925-1996), father of supercomputing


    And how about a few more Cray quotes?

    "#3 pencils and quadrille pads."
    - Seymoure Cray (1925-1996) when asked what CAD tools he used to design the Cray I supercomputer; he also recommended using the back side of the pages so that the lines were not so dominant.

    "I just bought a Mac to help me design the next Cray."
    - Seymoure Cray (1925-1996) when was informed that Apple Inc. had recently bought a Cray supercomputer to help them design the next Mac.

    I wonder what he's using now? a Palmpilot?

  38. looks like Cray is going with the Opteron by Kargan · · Score: 5, Informative

    The Sandia National Labs supercomputer (code name: Red Storm), currently being built by Cray, is going to be powered by 10,000 Opteron processors. A 40 Teraflop theoretical peak will put it at the top of the supercomputer list, being approximately 4 Teraflops faster than the NEC Earth Simulator, the current champ.

    --
    Palaces, barricades, threats, meet promises
  39. More elegant than the macs, back in the day by BelugaParty · · Score: 2, Insightful

    I really want to see cray come out with more waterfall computers. I thought that was the greatest thing in the world when I saw it on Beyond2000! way back in the day. The contemporary "elegant mac" isn't even in the same aesthetic/functional dimension as that cray machine.

    Ah, glory days.

    1. Re:More elegant than the macs, back in the day by commodoresloat · · Score: 2, Funny
      The contemporary "elegant mac" isn't even in the same aesthetic/functional dimension as that cray machine.

      Are you saying the Cray has an extra mouse button?

  40. Re:explain by imnoteddy · · Score: 4, Insightful
    I've heard (no, I don't have actual links to articles) that 10% of peak performance on a cluster is considered really good.

    Sounds like Cray marketing articles. For example, Daniel Katz at JPL wrote in 1997:

    it is possible to construct a 16-node machine with a theoretical peak performance of 3.2 GFlop/s and a typical sustained performance of 1.2 GFlop/s
    which is > 35% of peak. Or consider this from the Universiry of Liverpool:

    The current Beowulf cluster can deliver a theoretical peak performance of about 100 Gigaflops (billions of floating point operations per second) and has been observed to deliver about 60 Gigaflops.

    The observed performance was based on LU decomposition.

    For sustained/peak of about 60%.

    I have no doubt that one could find problems where a Beowulf cluster has 10% efficiency, but there are real many problems that are good to go on a cluster. And even if you only got 10% it would be worth it if the cluster cost 5% of what a vector computer costs. Not to mention that performance/$ on commodity hardware increases by a factor of 2 every 12-24 months. It takes years to develop a supercomputer, and they are stuck at their level of technology for several years since they are so expensive to redesign.

    --
    No electrons were harmed creating this post, though some may have been subjected to electrical and/or magnetic fields.
  41. Re:Didn't Sun buy all the Cray technology? by javiercero · · Score: 2, Interesting

    They bought part of CRAY, the one that made the CS6400 server, which was a really neat SMP system based on supersparcs.

    The rest of the company went to SGI.

    So basically the server/sparc division went to SUN and then they got the technology for their Enterprise systems.

    The rest of the supercomputer (the Alpha based and the Vector based units) units went to SGI, which did.... nothing with them. Oh, yeah they named some interconnections as CRAYlink or something, but they had 0 CRAY technology on them, they just wanted the name.

    Same with TERA, they wanted the name and a way of ditching their crappy TM technology.

  42. Re:explain by funbobby · · Score: 3, Insightful

    Moving people in planes is not a good analogy because it is perfectly parallel. Each person getting to the destination is not in any way dependant on the other people's journey, so splitting up the work has no overhead.

    The Cray design philosophy is for solving problems that can't be split up easily. If all of the parts of the problem depend heavily on one another, you pay a large price for communication when you split it up. That's the situation where the cluster doesn't do as well as the Cray. So each design has its strengths, and it really depends on the problem.

  43. Re:The trick is keeping ahead of the commodity guy by Rasta+Prefect · · Score: 2, Insightful
    The new Japanese NEC supercomputer came with a price tag of about $160 million if I remember correctly (some estimates say that it took $1G in research funding) and hits 35 TFlops (sustained). #3 on the Top 500 supercomputers list is a Beowulf cluster with 2304 processors coming in at 7.6 TFlops (sustained). Even figuring $2000/processor + interconnect, that puts the Beowulf cluster at around $5 million or 1/32 of the cost for 1/5th of the performance (roughly speaking).

    Number of TFLOPS isn't everything. The move back to vector style processors in super computing has been largely inspired by the fact that beowulf clusters work really well for some problems - and very, very poorly for others. If you've got a problem that divides nicely into discrete chunks that don't require a lot of interprocessor communication, then yeah, sure go with beowulf. But complex simulation problems have a tendancy to leave most of the processors idling while the cluster talks to itself due to network speed issues.

    --
    Why?
  44. Re:explain by Pieroxy · · Score: 2, Interesting

    Every solution has to be chosen corresponding to any specific need. My point was just to show that in most cases the cluster makes sense. Of course some special cases might be better suited by option 1 or 3.

    you couldn't surgically separate them

    How do you stuff them in the plane then? ;-)

    A good constraint for option 1 would be that you need to have them ASAP and the overall transfer could be interrupted anytime (before the 6th hour) and at that at that time you still want as much people as possible. Let's say 3 hours. Option 1 will have brought half the people there while option 2 leave all the planes above iceland at hour 3 with noone in England.

  45. Re:Or maybe.... by virtual_mps · · Score: 2, Insightful

    Yeah, your point? You said nothing about the reliability of one system versus another. There's a lot more that goes into designing a reliable system then spouting off some made-up statistics about cpu failures.

  46. Seymour Cray's Legacy by mre5565 · · Score: 3, Interesting
    If you could ask Mr. Cray, he'd might say that
    SRC Computers is his legacy, not Cray Computer Corp.
    He co-founded this company (with several other
    ex-Cray employees) and died while still an employee/owner.

    Interestingly, SRC is still around without any evidence on their website
    of shipping a product. My guess is that their customers and/or investors
    prefer to stay out of the limelight.

    1. Re:Seymour Cray's Legacy by C.+E.+Sum · · Score: 3, Interesting

      Take a look at this link.

      The paper claims in its conclusion a speedup of ~800 (for DES encrpytion) and ~1600 times (for DES breaking) over C code for the P4.

      I wonder who would be interested in that?!

      --
      -- Have you ever imagined a world with no hypothetical situations?
  47. Cray Comeback? Desktop Cray! by Styx · · Score: 3, Informative

    I've been using Desktop Cray for a while now. It took me some time to weak the settings to perfection, but now it's just running along. Check it out!

    --
    /Styx
  48. Re:explain by Pieroxy · · Score: 2, Interesting

    Well, it doesn't have to be. We could say that a company wants to send 250 people to London and want to use the 6 hours flight to have a corporate meeting in the plane... You're kind of screwed with 10 planes containing 100 people...

    In this case option 3 makes sense.

    You could say that the 6 hours is a reasonnable limit but sometimes (not predictable) you need as many people as you can in England before (amound of time not predictable either). In this case, option 1 make sense because both options 2 and 3 doesn't deliver anything before the 6 hour delay.

  49. disregard story, its more markoff fodder by Indy1 · · Score: 3, Interesting

    John Markoff, the same jerkoff that wrote the less then factual articles and book about kevin mitnick, and happens to belong to one of the less reputable media outles (aka the plagarized and false stories coming from the ny times).

    --
    Lawyers, MBA's, RIAA? A jedi fears not these things!
  50. Gallium Arsenide's Day Will Come by Baldrson · · Score: 2, Interesting
    It can be argued that Cray died an early death as a result of attempting to revolutionize the semiconductor industry from the chemistry up -- but the question is "Was Seymour Cray right about Gallium Arsenide?" He made the fatal error of attempting to run an organization much larger than his historically successful organizations -- organizations that were no bigger than an extended family -- 50 or so.

    However it happens, it is unlikely Cray was wrong about Gallium Arsenide -- he was not stupid. The question is when will a bureaucratic organization be able to throw marching morons at the problem and make it happen -- since that appears to be the only way technology is funded anymore.

    It's unfortunate Seymour allowed Cray, Inc. to keep his name after he left to found CCC. Even though Cray himself was capitulating to massively parallel silicon in his final days -- he did die almost immediately thereafter.

    PS: It seems creepy he died in a "jeeping accident" -- because that's exactly the way I had portrayed him dying in an April fools joke faxed to all members of congress a few years before -- an "accident" following shortly on the heels of CCC being taken over by Craig Fields of DARPA. I was sending out the joke because of the horrifying way DARPA had spent money on silly favorites within the academic community while guys who were really pushing the envelope like Seymour were going begging for customers -- having acquired private investments.

  51. HEAT is the reason CRAY can come back by evilviper · · Score: 2, Interesting

    I think there is one single reason that the market is poised for a Cray comeback... HEAT!

    Commodity PCs managed to push the speed envelope by pushing the heat envelope... That's the main reason AMD took the speed advantage, because they were willing to operate their processors at higher temperatures than Intel would at the time.

    Now, I would say it's quite a different story. First off, processors are getting closer and closer to the end of the line for heat increases.. Pretty soon, no known metal will be able to conduct heat away fast enough to allow computers to operate at room-temperatures. Even now, dumb little personal computers need serious cooling solutions... Either that, or they need to be some place that has serious air conditioning.

    So, what are companies going to do, even with the current line of processors? Should they invest loads of money in dispersing waste heat, powerful air conditioners, system cooling fans, and software and/or hardware to closely monitor temperatures? OR Should they invest in a higher-end system that doesn't put off so much heat, doesn't use up so much electricity, etc?

    In fact, I think we are even nearing the point where home users are going to get seriously pissed off and start demanding lower-power systems... It's interesting that C3 processors have become so popular despite their lowsy perfomance... (Maybe AMD/Intel will learn something from that)

    So, I do think that either commodity processors will hit the heat ceiling, and stagnate like the rotational speeds of current IDE hard drives, OR the electrical and major cooling requirements of commodity processors will become too much to justify the small price savings. Either way, that will leave the market wide open for serious computing companies once again. The only question really is how much longer will it be until one of those two things happens? Well, in the Southern California Desert, electricty prices are still very high, and the temperatures are so very high that running a modern computer 24 hours a day requires your home cooling to also be running 24 hours a day, just to operate within the heat tolerances. I don't think it will be much longer before more of the country, and the world, will reach the same point.

    --
    Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant