Slashdot Mirror


U.S. Helps Finance New Cray Development

Durinia writes "SGI has announced a few details on their next Cray vector supercomputer. The press release is mostly about them getting government support for the R&D. It does, however, mention that it will be combining the powerful Cray vector processors with SGI's ccNUMA architecture for big-time scalability."

61 comments

  1. Re:Is this the same Cray SGI just Dumped???? by TurkishGeek · · Score: 1

    Now that Belluzzo is on the Dark Side of the Force, I believe SGI management may be planning to revive Cray.

    --
    Zigbee Central: A Zigbee weblog
  2. You know not of what you speak... by Troy+Baer · · Score: 3

    A supercomputer doesn't have what you'd consider an operating system. It's a front-end computer that does all I/O, provides the usual operating system services, and controls the supercomputer. Linux is perfectly practical for the front-end. It would be nice to see a Linux in there.

    Bruce, what are you talking about? I suspect you haven't seen a Cray machine for quite some time.

    Most of the current Cray vector machines (like our T90) have their own OS (UNICOS) and IO subsystem. The IO's in a physicially separate box, like in a classical mainframe, but it's still part of the machine. There are generally a couple workstations (usually Suns) attached directly to the machine, but those are system consoles and monitoring stations. A Linux box might be appropriate for one of these monitoring stations, but that's about it. And if you think a Linux machine could handle the I/O that a Cray's capable of, you're insane. We're talking multiple GB/s.

    The idea for the SV2 (as it was explained to us at a Cray User Group workshop last fall) is that it piggybacks on an MIPS-based SN1 (next generation SGI Origin ccNUMA machine). That implies IRIX (with features ported from UNICOS), not Linux. I doubt Linux on Intel or MIPS will be ready for the kind of prime-time SGI's going to be selling the SV2 for by the time they ship.

    (Before you flame me for putting down Linux in this particular context, consider the following: I've been using Linux as my primary OS at home since '93, and I'm one of the guys working on the Beowulf cluster of SGI 1400Ls at OSC. I'm rooting for Linux too, but it's not always the right answer.)

    --Troy
    --
    "My life's work has been to prompt others... and be forgotten." --Cyrano de Bergerac
    1. Re:You know not of what you speak... by Bruce+Perens · · Score: 2

      OK, I'm willing to accept that it's not done that way any longer. I did write a lot of microcode for a baby supercomputer, a 4-datum SIMD called the Pixar Image Computer that was driven by a Sun or SGI, but that was 10 years ago. My surmise is that the vector processor is really a separate device from a general-purpose CPU, even if it's in the same box, but I'd be willing to be enlightened on that detail as well. Bruce

    2. Re:You know not of what you speak... by Anonymous Coward · · Score: 0
      > My surmise is that the vector processor is really a separate device from a general-purpose CPU...

      Not at all...a vector processor is quite simply capable of handling more than one scalar quantity at a time, i.e., a vector. "Conventional" CPUs have scalar integer and floating point units, that take as their operands scalars; for example, add 1 and 2. A vector unit would simultaneously operate upon a a set of scalar values; for example, add (1, 3, 5, 7) and (2, 4, 6, 8).

      This doesn't necessarily make a vector processor a "separate device" from the CPU, though someone can certainly choose to put a vector unit on a separate chip, or on separate silicon in the same package. For example, in the new Motorola G4, the vector unit (AltiVec, aka Velocity Engine) is a peer to the integer & floating point units, sharing the clock, bus, cache, fetch & decode, and other parts of the chip. Just as the integer & FP units have their own register files, AltiVec has its own set of registers.

      Since vector units are extremely good at certain kinds of operations, which are becoming increasingly important as applications like streaming audio & video, sophisticated encoding, decoding, and compression, encryption, etc. become pervasive and mainstream, vector processors are likely to become a part of "normal" CPUs and not arouse any more attention, than, say, the existence of floating point units does in processors today. Remember the days when chips didn't have FPUs?

      Being able to do vector math is increasingly as "general purpose" as being able to do integer or floating point math.

  3. SGI Cray?? oh please, linux has it beat right now by Anonymous Coward · · Score: 0

    what a joke. like sgi has ever produced any innovation or worthwhile advancements in the digital age. a beowulf cluster will smoke this piece of garbage. when's the last time anything from sgi could even touch a pentium??

    maybe if it ran linux it would be something noteworthy, but i somehow doubt it. linux is too good for pathetic sgi hardware.

    besides, sgi and irix wont exist in a year, even sgi has said so. they need to just get on the linux train or get out.

    Linux Man

  4. Re:So what lenght keys do you have then? by loudici · · Score: 2

    you need to remember that all these assumptions on the hardness of cracking keys depend on the fact that NP-Hard problems are not tractable in polynomial time...

    this is only a conjecture as for now.

    laurent

    ---

    --
    Dev elpizw tipota, dev phoboumai tipota eimai lephteros http://euclidian.org
  5. Re:NSA... by Detritus · · Score: 1

    The NSA is probably the #1 supercomputer customer in the world. There used to be a Cray field office in Laurel, Maryland (near Fort Meade). I'm not sure if it is still there. The NSA has been a major supporter of high-end computing since the 1950s.

    --
    Mea navis aericumbens anguillis abundat
  6. Re:SGI Cray?? oh please, linux has it beat right n by flatrbbt · · Score: 1

    Well, you are half right. Linux beowulf is really good for high latency, high bandwidth processing, such as in a render farm. In that environment, a cluster will perform equally as well or better than a cray. However, in low latency processes like floating point calcs spread about 1024 processors, the cray will leave a cluster sitting in the dust.

    Steve Ruyle

    --
    Ex Libris Veritas
  7. Alaska Supercomputer is a Cray by snowbike · · Score: 1

    The Arctic Region Supercomputing Center does use Cray/SGI systems (currently on are a Cray T3E and J932). One previous machine was a YMP. I don't remember any non-Cray supercomputer ever being powered on over at ARSC.

    You can read more about the current hardware at http://www.arsc.edu/resources/Hardware. html

    1. Re:Alaska Supercomputer is a Cray by Anonymous Coward · · Score: 0

      The 400% tariff was for a procurement at NCAR, NEC won the procurement technically, Cray using it's political influence cried foul, and got Senators from Wisconson and Minni-hope-less to push the D.O.C. to declare an anti-dumping suit. NEC refused to play and was found guilty by omission. Meanwhile NCAR bought an IBM-SP2 because SGI/CRAY still sucks.......

  8. Re:You want it, you pay for it ... by cweber · · Score: 1

    Right on! Price-performance is of course an interesting factor, but at the very high-end it becomes very much secondary.

    I have just sat through a Beowulf-type cluster presentation yesterday and the upshot is that they simply cannot touch some problems. If there is significant communication going on between nodes, especially if it comes in bursts, present-day Beowulf does not cut it.

    We are seeing 95%+ efficiency on 64 or even 128 CPUs on a Cray T3E, while a Intel cluster connected through switched 100 BT shows something like 75% at only 16 nodes. Going to Gigabit ethernet, this figure does not improve much, mostly due to inefficient Gigabit drivers. One is then forced to look into much faster, but much costlier interconnects.
    Then there are also the CPU-to-memory bandwidth and the latency of disk I/O to consider.

    Simply stated, Beowulf-type clusters today cannot touch 'real' supercomputers for many types of applications. Those applications do not decompose into embarassingly parallel tasks like rendering frames or brute-force attacks on encryption do.

    Now, if only SGI/Cray would develop a T3F based on the 21264 Alpha CPU.
    We'll keep dreaming and hoping ...

  9. well, so much for a "free market" by cthonious · · Score: 1

    no money for education but plenty of hand outs for corporations in their death throes.

    --

    support gun control: take guns from cops
  10. Re:So what lenght keys do you have then? by Anonymous Coward · · Score: 0


    4096 bit keys won't be enough when quantum supercomputers are available (successful trials already out there).

  11. Re:So what the heck is a vector based computer any by Anonymous Coward · · Score: 3
    Think of a vector processor as being super, super pipelined with dedicated hardware for jamming stuff into the pipe as quickly as possible. Similar to, but not exactly like an array processor. Both can be classified as examples of the Single Instruction-Multiple Data stream (SIMD) architecture, although they're implemented very differently.

    Vector processing has one big advantage over massively parallel processing. Essentially, if you have a uniprocessor vector machine, and any appreciable amount of your code is vectorizable, you reduce total CPU time as well as wall-clock time. With massively parallel systems, you always pay for reduced wall-clock time with increased CPU time to synchronization overhead. A search on Amdahl's Law should turn up some interesting reading.

    Looking back at some of my old documentation (~1990), I have these stats for a Cray Y-MP with a memory cycle time of 6 ns, and 2 FPUs/CPU:

    1 CPU : peak throughput 333 MFLOPS
    8 CPUs: peak throughput 2667 MFLOPS

    These throughputs assume 100% vectorizable code. For the single CPU Y-MP running LINPACK with a vector size of 300 gets you 187 MFLOPS. However, with a LINPACK vector size of 1000, the throughput is 308 MFLOPS, which is approaching the peak throughput pretty closely.

    I wish I had something that stated just how deep the vector registers are on a Y-MP. I suspect it's somewhere close to 100 (i.e., 100 stage pipeline!).

    Anyway, this post is too long. Bye now.

    Bill Wilson
    The Keeper of Cheese

  12. You want it, you pay for it ... by LL · · Score: 5

    I'm a little amused by people asking for a comparison of the costs of a Cray SV2 compared with a Beowulf. For a certain class of applications, they remain dominant and some groups are willing to pay the extra premium for that niche, regardless of the absolute costs (and don't forget the cooling/storage/manpower multipliers). Much like a train (vector computers) is suited for different terrain than buses (shared-mem) or trucks (SMP), vector computers provide very cost effective REAL computing power, often obtaining 50-60% of peak performance whereas you'd be lucky to see beyond single digits for MMPs (and before I get roasted, I'd qualify that by noting decent compilers and reworking algorithms often overcome initial technical limitations).

    As for the US support of Cray, well, jaded veterns of comp.sys.supercomputer and HPCC practitioners are well aware of the historical situation with federal funding, technical advantages and bang-for-buck comparisons with Fujitsu and NEC vector computers. For people interested in what the Japanese are doing, I believe NEC are planning on introducing a 1 Teraflop machine with the goal of hitting 5 Teraflops for their Whole Earth Simulator project . Some scientists' idea of heaven is a dedicated vector box and for their purpose and types of code, it is a valid desire.

    The SV2 is a curious beast, effectively the first stage in the merging of the Origin cc-NUMA memory subsystem and vector chips. You can think of it as a hybrid box allowing various combination of graphics pipes, MIPS/Merced nodes and vector nodes. The gripe of some people is that they are looking for a successor to the top-end T90 and they are impatient. However, developing at the high end is always trickier than people expect (witness Merced) as you need to increase capabilities along a multitude of dimensions (memory latency, I/O subsystem, heat dissipation, networking) rather than relying from the automatic boost from Moore's Law. Unfortunately there are very few applications which demand absolute performance regardless of actual cost.

    To paraphrase crass consumerism, if you have to ask about the price, you can't afford it :-).

    LL

  13. Supercomputer >> Computer by pb · · Score: 2

    For *some* problems, Cray is not the answer. For others, it is the *only* answer. Any *big* problem using lots of "vector operations" (see Bruce's post, it's kind of like the extra instructions that are just now getting added to the PC-style computers for multimedia, but *much* better) and lots of RAM at once and stuff is made for a Supercomputer. Oh, and anything that scales well to multiple processors, they have techniques of doing this too.

    Supercomputer != Beowulf! Network latency sucks in comparison, for tasks like this. Only readily paralellizable tasks that *don't* need lots and lots of RAM at once, and *don't* benefit greatly from the vector operations are better for Beowulf. In fact, some cryptography problems have been mostly solved on regular computers and then finished on a Cray, *because* the Cray did that stage of the problem so much better.

    Of course, it won't be Cray without Seymour. :|

    It must have been nice to build a supercomputing couch, though, back in the day. :)

    --
    pb Reply or e-mail; don't vaguely moderate.
    1. Re:Supercomputer >> Computer by Performer+Guy · · Score: 1

      You're thinking of the OTHER Cray. Sure Seymour founded Cray but when he passed away it was after leaving and starting up ANOTHER Cray company, not the one SGI bought.

  14. Re:I seems like a good thing. by meadowsp · · Score: 1

    Cray were the first what?

  15. Re:NSA... by loudici · · Score: 1

    unless someone comes up with new algorithms, big vector computers won't crack all keys. The algorithms we know of are exponential. when you add 1 digit to the key it takes a computer that is K times faster to crack it.

    some other things would profit much more of new supercomputers. what they are really good at is solving Partial Differential Equations either on really big domains (weather forecast) or with a high precision ( nuclear weapons simulation)..and i can understand why the US government might be interested in that.

    Laurent

    ---

    --
    Dev elpizw tipota, dev phoboumai tipota eimai lephteros http://euclidian.org
  16. Re:So what lenght keys do you have then? by Anonymous Coward · · Score: 0

    Remember that, for encryption, the asymmetric key is being used to protect the bulk encryption key. If the bulk encryption key length/algorithm is not sufficiently strong then the public key length is irrelevant.

  17. Is this the same Cray SGI just Dumped???? by Anonymous Coward · · Score: 0


    Or is this some other Cray that SGI loves????

    True story: When Cubicle SGI purchased Office Cray, they went around and ripped off all the doors.

    I have really enjoyed watching SGI die a slow cruel death.

    True discussion: So we need a high end Linux server should we go with Dell or SGI? Well SGI has one server and doesn't offer any external storage. Lets try dell. Ring Ring. Dell we'd like to buy a linux server with a TB of Raid. What your high end RAID card is not supported w. linux? Ok, guess we'll have to go with NT.

    You think Cubicle's are great?

    Then you'll love the coming age of the Capsules!

    1. Re:Is this the same Cray SGI just Dumped???? by Anonymous Coward · · Score: 0

      Hmmmm.... As far as I can tell, everyone here in Eagan still has a fully functional door to their office.

    2. Re:Is this the same Cray SGI just Dumped???? by Maledictus · · Score: 1

      Although I might agree that this post was a *bit* of a troll, I do wonder...SGI made noises about Cray as a separate unit, selling Cray or some such. Have those "noises" changed?

      For once in my measley life I am not being sarcastic, I think its a question that bears asking. Here are some good things happening to a market that others (and, by the way SGI) have said was "dead." (Do not flame me for that, SGI states in their Q3 1999 report that "The Company believes that the decline in the UNIX workstation and vector supercomputer markets are
      long-term trends...")

      But that was under Belluzzo. To quote "Hopper;" "Oh I see, under new management."

      Things have been quiet at SGI since they changed CEOs, at least from an outsider's perspective. Maybe they'll get noisy again in a good way.

      --
      Consigned to flames of woe.
    3. Re:Is this the same Cray SGI just Dumped???? by Anonymous Coward · · Score: 0

      SGI has to pump-up CRAY to sell it at some it off at a resonable "break-even", otherwise we will see new upper management in SGI soon ;-)

  18. Of course =) by Anonymous+Shepherd · · Score: 2

    I'm one of the other posters who was wondering on the price/performance comparison.

    One could very well chalk up the price difference towards reducing communications/messaging latency between computing units, cooling solutions, memory architectures, etc.

    And hasn't it already been shown, for at least brute force algorithm cracking techniques, that massively parallel computing does work? So that a Beowulf is okay for such a system?

    Of course this is different than using a more elegant and efficient algorithm to handle encryption/decryption attacks, but that is out of my/our ken as well.


    -AS

    --

    -AS
    *Pikachu*
    1. Re:Of course =) by Detritus · · Score: 1
      I doubt that the NSA uses Crays for brute force attacks on keys. There are cheaper and more efficient machines for that sort of problem. The NSA has an in-house chip fabrication facility. They could crank out custom chips to do brute force key searchs.

      From the unclassified bits of information available on the NSA, they are very interested in doing sophisticated statistical analysis on encrypted data. I suspect that is the main task of the vector supercomputers.

      The NSA is the biggest employer of mathematicians in the USA. They have probably developed techniques of attacking ciphers that are considerably better than brute force attacks.

      --
      Mea navis aericumbens anguillis abundat
  19. The market for vector supers is very limited by Anonymous Coward · · Score: 0

    So the goverment have to push money into it, SGI would never spend millions of dollar on some project if they dont get the money back. Goverment wants supers, its not market-economical benifitting to develop new vector supers so the goverment have to pay for all the R&D and the production (by buying the machines from SGI then they are ready).

    1. Re:The market for vector supers is very limited by Anonymous Coward · · Score: 0

      Didn't the previous Cray Vector processors cost about 150-200 million to design/develope ? With SGI giving 10 million and NSA 10 million, who is going to supply the other +130 million ???

  20. Re:actually... by Anonymous Coward · · Score: 0

    I think he was thinking of 'CrayLink' the high bandwidth buss they got from Cray then they bought them and inplemented in the Origin 2000 and OnyxII line fullrack systems.

  21. Re:So what the heck is a vector based computer any by kinkie · · Score: 2

    It's also called "SIMD" (Single Instruction Multiple Data), and yes, it's been around forever.
    It was discovered by some chip makers a couple of years ago (*grin*).

    --
    /kinkie
  22. Beowulfs can't do everything. by Apuleius · · Score: 1

    Some programs cannot be structured well enough to be easily parallelized, and for them the time lost resynching the Beowulf's machines makes it a net loss.

    For example, some data structures are so irregularly organized (unlike the average matrix inversion) and use algorithms that alter their structure in mid run. All you can do is parallelize some of the for(;;) loops and then resynch constantly.

    For those you have no choice but better big iron.

  23. Re:Linux involvement is possible for the front-end by dtj@sgi.com · · Score: 1
    Not true. Cray's have had Unix-based systems (called UNICOS) for well over a decade. The Cray 2, released around 1986, had a full SysV system as its sole OS. It also had its own I/O facilities with disks, tapes, and all the typical stuff. UNICOS had everything you would expect from a Unix system, such as X, Motif, emacs, and more.


    The confusion may center around three points.

    • The T3D, which wasn't a self-hosted machine and required a Cray front-end. The T3E, however, was self-hosted.
    • More recent Crays use a coupla workstations as operator and maintainence workstations.
    • Older Crays, and their OS called COS, were more dependent on frontend systems.

    I hope this clears up some of the confusion.

    -Dean
  24. Re:only 2,500 nodes by Anonymous Coward · · Score: 0

    Super-hype uses peak, not sustained ops, which is 4GFlops for the G4.

  25. Re:actually... by Anonymous Coward · · Score: 0

    'CrayLink' was actually designed and implemented at SGI well before the acquisition of Cray. The mickey mouse marketeers just renamed it.

  26. By now we all know the G4 wins... by oliverk · · Score: 2
    ...based on what I've heard from Steve Jobs we don't even NEED a Cray anymore.



    "Thank you Seargent...you may put away your tank."

    --
    ---- Please be nice in case my Slashdot karma ~= my real life karma.
  27. Re:Vector architecture - it's all in the memory... by Anonymous Coward · · Score: 0

    Correction the NEC SX-5 is 8000 Mflops the Alpha is 600 Mflops (peak).

  28. I'd like to see the revival of... by Anonymous Coward · · Score: 0

    Thinking Machines.

    http://www.connmach.com
    Looks like they're dead now, but that Connection Machine 5 looks so badass.

  29. It has to be said... by Anonymous Coward · · Score: 1

    When are we going to see a beowolf cluster of these babies? I'm sorry... I couldn't help it.

  30. So what the heck is a vector based computer anyway by Anonymous Coward · · Score: 0

    I assume they don't mean vector graphics, i.e. classic battlezone and Star Wars in the arcade.. what does it mean?

  31. How much is that in Beowulfs? by Seth+Finkelstein · · Score: 1
    Hmm
    Plans call for the system to scale to peak performances of multiple tens of teraflops, many times faster than any supercomputer in existence today. One teraflop is equivalent to a trillion calculations per second.
    So how many Linux'ed Pentiums networked with Beowulf would be needed to give it a run for the money?
  32. Is there someone out there who knows something about encryption (I don't particularly) who can tell me if the thought of the NSA posessing say, oh, 50 of these machines means that I need to use a longer key?

    Or will it take 20 billion years for even these to crack something?

    Please enlighten me, I lack knowings.

  33. Linux involvement is possible for the front-end by Bruce+Perens · · Score: 3
    A supercomputer doesn't have what you'd consider an operating system. It's a front-end computer that does all I/O, provides the usual operating system services, and controls the supercomputer. Linux is perfectly practical for the front-end. It would be nice to see a Linux in there.

    Thanks

    Bruce

    1. Re:Linux involvement is possible for the front-end by RallyDriver · · Score: 2

      Linux advocates - please refrain from posting things which are utter nonsense. You just make the rest of us look bad.

      The original Cray 1 and 1/S were like this, being parasitic on an IBM or VAX host. Modern supercomputers (Cray, Fujitsu, NEC, ...) all run a Unix OS natively. Cray's Unicos is in version *10*

      Similarly, the first Cray parallel machine, the T3D, parasited - off the back of a Cray vector machine, J90 or C90. The T3E however devotes some of its 21264's to being I/O processors.

  34. G4 Beowulfs by Anonymous+Shepherd · · Score: 3

    Well how about a Beowulf cluster of G4s? Say using the publicized 1 gflop performance of the Velocity Engine, you'd need 1000 of them to get 1 teraflop and 10,000 to hit the 10s of teraflops, and multiples of 10,000 to hit a comparable multiples of 10s of teraflops.

    That's $1600 each, or $16,000,000 per tens of teraflops.

    It may be cheaper on PIIIs, but it would also take more PIIIs as well.

    I'm assuming it's a headless network at $1600 each, btw.


    -AS

    --

    -AS
    *Pikachu*
  35. NSA... by j+a+w+a+d · · Score: 1

    Oh, the government is helping? What for? Oh, "support ... from several U.S. government agencies, including the National Security Agency (NSA)." The NSA, eh?

    "In addition to critical government applications..."... this doesn't sound good.


    echelon: FBI CIA NSA IRS ATF BATF Malcolm X
    Militia Gun Handgun MILGOV Assault Rifle Terrorism Bomb

    --
    i dont display scores, and my threshhold is -1. post accordingly.
    Discuss /. policies
  36. So what lenght keys do you have then? by The+Creator · · Score: 1

    512bit keys can be broken easy(been done). 768 no way in hell(now). 1024 no way in hell(for a long time). >=2048 bit keys no way in hell(ever!). This is for RSA, DH seems to even safer, elliptic-curve isn't analysed enough yet(anyone?). You should really be more worried about the RNG used to create the key, having a good passphrase, making sure nobody sees you type your _, making sure not to type on a monitored terminal etc.
    Me? oh i use a 4096bit DH-key(the world can always be invaded by aliens that have a ray-gun that can turn entire galaxies into large computron axelerators that connect in near infinite number of dimensions |) ).

    LINUX stands for: Linux Inux Nux Ux X

    --

    FRA: STFU GTFO
  37. Click me, read me by Zico · · Score: 3
    See this NASA link for more information about vector processing, and how it relates to Crays. Here are a few paragraphs from it:

    Cray Supercomputers perform arithmetic and boolean operations in segmented functional units that divide the operation into a set of substeps or segments, with each segment being performed in one CPU cycle. In a nonvector (scalar) operation, only one segment is performed at any given time because only one set of values is available to the functional unit. The time needed to complete a scalar operation is equal to the number of segments in the functional unit times the CPU clock period of the computer.

    The CRAY C90's functional units in the CPUs are dual units. That is, they are capable of producing two results per CPU cycle during vector operations.

    Vector processing produces high computational speeds by applying pipeline techniques to arithmetic operations. In vector operations, the ability to access sets of data items allows the system to place new values into the functional unit as soon as the previous values have cleared the first segment. This allows every segment in the functional unit to operate simultaneously. The system thus produces two results every clock period (4.2 nanoseconds) once the first set of values in the vectors has moved through all segments in the functional units. The use of vector functional units also reduces the number of instructions that the system must interpret because multiple sets of values are processed by a single instruction.

    A vector can be defined as a set of floating-point data items that the computer accesses as a unit, with the same operations being performed for each value.

    Cheers,
    ZicoKnows@hotmail.com

  38. actually... by Anonymous Coward · · Score: 0

    In reality that should read "combining Cray's vector computers with CRAY's ccNUMA architecture"

    1. Re:actually... by substrate · · Score: 1

      No, vector processors are CRAY, ccNUMA is SGI's forte'.

  39. Re:So what the heck is a vector based computer any by Bruce+Perens · · Score: 2
    It's what is otherwise called an "array processor". You give it a big array (a vector) and it performs the same operation on every element of the entire array, very quickly. I've seen them used for image processing, but they are usable for many other sorts of problems. The concept has been around forever, of course they keep getting faster.

    Bruce

  40. Well, I fell off my chair, but it makes sense... by teraflop+user · · Score: 4

    The way the market has been going recently, it was beginning to look as though US vector supercomputers were dead, and only the Japanese were still advancing them. The T90 was the last major vector machine, but had memory synchronisation problems with more CPUs. The SV1 had some interesting specs, but I don't know of any site which actually installed one - certainly press releases were thin on the ground.

    Parallel machines, such as the Cray T3E, IBM SP2+, and to a lesser extent Beowulf clusters just give so many more Gflops/$. But as has been pointed out they are completely unsuited for some problems, for which you simply need all you power concentrated in a small number of CPU's.

    My guess is that there is not enough market for a new US vector supercomputer, and the US government are stepping in so not to become dependent on imported hardware. If most SV1 installations are government, it might explain why we've heard so little about them.

    BTW, many older supercomputers were single-user machines, which required a front end running a mutli-user operating system to schedule jobs. However all recent machines, including the C90, T90, T3E and I suspect the SV1 and SV2 run their own operating system (they are self hosting). In this case it is UNICOS, a Cray Unix which is gradually being merged with SGI IRIX.

  41. I seems like a good thing. by dant3 · · Score: 1

    I cannot pretend understanding all the technology so I wont. But I still have an opinion from an historical point of view. Just before Mr. Cray died a university in alaska procured some funding for a Cray. After the funding was given they decided that an alternative super computer would be more appropriate (Japanese I believe ( I apologize for the lack of information)). At the point of learning this I was indignant. For I believe that it is a matter of heritage and technical ground breaking on the information frontier. It may be just a bit of national pride but damnit they were the first. I personally felt a loss the day that he died even though I literally had no ties to Cray. I assumed that day the comapany would die. If they can find a way to be that pioneer that Mr. Cray once was I am very happy for for them and every one, as we will all benefit. That is, if they truly are pioneering and not just giving their last dyeing gasp.

    1. Re:I seems like a good thing. by substrate · · Score: 1

      Certain Japanese companies were attempting to flood the market with their vector processors by selling them in the US at below cost. This prompted a number of companies to feel that the Japanese presented a better solution. They were taken to court over it and lost, resulting on a 400% or so tarrif on their vector processing supercomputers. I'm wondering if this Alaska case was one such case and what the end result was?

  42. Vector architecture - it's all in the memory... by RallyDriver · · Score: 3

    Actually, I think the vector pipes on a Y/MP are something like 8 or 10 units deep.

    These computers are designed for one type of calculation - matrix algebra, which requires simple processing (multiply, add, invert) on enormous 2D arrays of numbers. The important point: the working set of these problems is often the size of the matrices, so caches are ineffective. Cray vector machines do not have ANY data cache between the vector units and main memory.

    The feature of most vector machines that no-one has really pointed out yet is the way the memory system keeps those vector units fed. Unlike a microporcessor which relies on 2 or more layers of cache, the vector machine is quite capable of streaming data from memory fast enough to keep the processor 100% busy.

    The vector instruction results in a number of vector fetches being issued to the memory controller, which is told to fetch a strided vector from memory (start at address x, every ith word, n words in total). The memory controller issues requests to individual banks in an overlapping fashion. Like the vector FPU, after it gets over its latency it starts banging the words out once per clock cycle.

    The way this is done is to have a huge interleave in the memory - Cray T90's use either 1024 or 2048 banks. So long as any one bank is not hit more than one per 100ns or so (the cycle time of the memory) the memory controller is capable of delivering multiple streams of 64 bit words at full clock speed (T90's can have up to 32 CPU's). Typically, to ensure this, arrays in programs are structured to stride along the first dimension (FORTRAN remember) or where this isn't possible, the array dimensions are chosen to be prime numbers.

    The thing that sets these machines apart is not processor speed - the peak speed in MFlops on an NEC SX-5 is only about 2-3 times that of an Alpha AXP. The thing that makes them special is the memory bandwidth to sustain that performance.

    To use my favourite automotive analogy, if a PC is a small hatchback, then a supercomputer is an 18-wheeler, not a Ferrari.

  43. Cost of a Cray SV2? by Anonymous+Shepherd · · Score: 2

    Just wondering, because another post wanted to compare one to, say, a Beowulf cluster.

    If you use a G4 PowerMac and their highly advertised 1 gflop rating as a base, at $1,600 each, to reach a 10 tflop rating you would need 10,000 networked machines, so for each 10 tflops would be spending $16,000,000. I don't know how the comparable PIII or Celeron would perform though, or at what price.

    16 mil is a lot to spend. But for government purposes it may just be a drop in the bucket.


    -AS

    --

    -AS
    *Pikachu*
  44. 386sx-16 Beowulfs by Anonymous Coward · · Score: 1

    According to my calculations, it would take 1,700,000 386sx-16's to be competitive with that supercomputer. The system would have 27 megawatts of waste heat, enough to vaporize lead.

  45. It's a national security issue... by RallyDriver · · Score: 2

    It was always said in the supercomputing community that the US government would never let Cray die. Buying in supercomputers from Japan, the only other source of big iron, just wouldn't wash with the likes of the NSA.

    First it was blocking the NCAR deal, and now this.

  46. Offtopic: Cray as a personality by RallyDriver · · Score: 1

    For those who take a passing interest in the history of computing, or who find people like Richard Stallman entertaining, the story of Seymour Cray the man is quite interesting. His methods of designing hardware (consulting the elves) were definitely unique.

    I don't think we'll see anyone quite like that again.

  47. Cray has been dead for a few years. by Anonymous Coward · · Score: 0

    He died in an automobile accident which is really unfortunate because his work kicked so much ass even if it is a little high priced ;)

  48. Argh. Ignore and don't keep responding! by Anonymous+Shepherd · · Score: 1

    It just encourages them.

    Sigh.

    *breath*
    *breath*
    *breath*


    -AS

    --

    -AS
    *Pikachu*
  49. Wouldn't that... by Anonymous Coward · · Score: 0

    Wouldn't about 75 of those make a cool beauwulf cluster!!!