Slashdot Mirror


SGI Installs First Itanium Cluster At OSC

Troy Baer writes: "SGI and the Ohio Supercomputer Center (OSC) have announced the installation of the first cluster using Itanium processors. The system consists of 73 SGI 750 nodes, each with two Itanium 733MHz procs and 4GB of memory, connected by Myrinet 2000 and Ethernet. Software includes Linux/ia64, SGI's ia64 compiler suite, MPICH/ch_gm, OpenPBS, and Maui Scheduler."

198 comments

  1. How long until... by FyRE666 · · Score: 0

    ... we get the article about some d00d gutting one of these things and filling the space with a 466mhz celeron based setup, so he can 0wN the next lan party with the cool looking case?

  2. Re:Slashdot by it's+a+culture+thing · · Score: 3, Interesting

    The story has been out for months and people obviously haven't been paying attention or at least have failed to see just how concerted this effort is. This technology isn't new, but it's just a small part of the whole if you know where to look.

    The new systems sound great, but they're tiny compared to what it's going to be like when the GRID is up and running.

    What the story fails to mention is that this system is likely to be connected to the other GRID environments in the States and the new ones in Europe at which stage you wont be talking about just 4 super computer centres but nearly a hundred, each with several Tflops of processing power and a few petabytes (10**15) of storage.

    I would suggest that to put this in the proper perspective you also look at IBM's contract to do the same to 4 sites in the Netherlands, the UK GRID which has 9 sites, the German one which I dont know much about but is fairly advanced and the CERN DataGrid. These are all interconnected with the same people working on several at a time.

    Or you could have a look at the top500, find all the supercomputers in Europe & the US which aren't classified or commercial and then figure out what their combined processing power is. You should then have a fair idea just how much processing power there will be in a couple of years time 8)

    Now back to the Particle Physics experiments ...

  3. I'll swap you by green+pizza · · Score: 2

    He gets to play with Linux/SGI clusters now, I'm stuck with Alphas & an O2000 in a back room somewhere.

    You can come admin these clusters and I'll go work with your Alphas and Origin 2000.

  4. Intel fighting the "megahurtz myth"? by sjonke · · Score: 1
    Yes, that is spelled as desired you fools. Apple has struggled with this issue - what chance is there that Intel/PC manufacturers will do any better?

    Of course Intel has the big advantage that if person X doesn't go for 733 Mhz Itanium or even y * 733 Mhz Itaniums, they'll still go for 1.8 GHz Intel P4. I.e. they don't have to sell the Itanium to consumers yet, whereas Apple has to sell the PowerPC. I.e. Intel, like Microsoft, can win by stagnation.

    --
    --- What?
  5. Re:SGI Sucks ( read on ) by garren_bagley · · Score: 1

    IRIX is no longer a first tier operating system. New versions of Oracle (just for example) come out for Solaris and (shudder) HP long before (if ever) they come out for IRIX. IRIX is even starting to lose focus on GNU projects. I don't think they'll ever recover that status. It is a shame. The only bright spot (for me) is that I can pick up an O2 on EBAY pretty cheap.

  6. Re:fascinating by sharkey · · Score: 2

    Hmm. Doesn't say WHICH company.

    --

    --
    "Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
  7. Re:5.86 Gflops per processor by psoendergaard · · Score: 0

    Sure, they have a great product: The P4! It has a huge memory bandwidth, better than the Itanium, and with a good compiler it can easily do 1.5 flop per clockcycle in double precision arithmetic (3.0 for single precision). This will give you 2.7 gflops in double precision for a 1.8 ghz P4.

    Intel has a fantastic product, but compiler support is currently lacking bit.

    The P4 is actually faster than the Athlon for scientific computing, except for the compiler fact, and this IS very important. The P4 looses to the Athlon simply by the reason that the compilers can not use the vector instructions properly. The chip itself is faster.

    Why anyone would be an Itanium instead of a dual P4/Athlon beats me, it is supposed to only be a research chip and not really a production chip. It has less on-chip cache than a Celeron (128kb total)!! Sure it's packaged with a lot of sram, but still.

    Intel tells everyone to wait for the McKinley, but just look where the P4 and Athlon will be at the time that chip rolls out in full production. It will loose out just as the Itanium is doing now.

  8. More information by pdiaz · · Score: 1

    http://planetcluster.org/article.php?sid=41&mode=t hread&order=0 :Information about the cluster itself and the nodes

    --
    Make It Secret . Free JavaScript implementation of AES for your browser
    1. Re:More information by philipm · · Score: 0

      so basically these things are slower than 500 MHz G4s. Maybe they should have saved the money....

  9. Re:Wrong logo, Wrong idea by DiLLeMaN · · Score: 0
    Personally I think the best thing for them would be to be bought by someone with big pockets who can market a product properly.

    That would be M$... =]

    --
    /var/run/twitter.sock is a twitter socket puppet.
  10. OT: Music Box for IRIX? by BigBlockMopar · · Score: 2

    Hey everybody,

    I've had exactly two epiphanies in my life. One was the first time I saw an Amiga, back in 1985. I knew I would have one.

    The other time, it was 1993 or 1994. I was installing a video projection system into a hotel meeting room. The computer to which I had to connect the video projector was a funky little purplish pizza-box with the name "Indy".

    I fell in love.

    I still don't have an old Indy or Indigo - or even any UNIX workstations - so far, I've kept my cravings in check with Solaris 8 x86, FreeBSD and Linux.

    There was something running on that Indigo that day. I don't remember it very well, but I remember the name: Music Box. In the window, there was a picture of a woman, and cartoon musical notes would fall out of her mouth as eerie but beautiful music played in the background. For several hours while I aligned the video projectors in the room, it was on the screens, 10.5x17 foot image of an IRIX X session, with that eerie and hypnotic image and sound. It freaked out my boss, too. :)

    It looked like something you'd just leave on the screen of your computer to amuse passers-by. Anyone have any idea what it was, or whether it's available for anything but MIPS/IRIX? A Google search didn't prove too helpful.

    --
    Fire and Meat. Yummy.
    1. Re:OT: Music Box for IRIX? by Shinobi · · Score: 1

      There's one big error in this post: The Indy is not a purplish box, it's blue. The Indigo's and Indigo 2 Impact-series were purplish

    2. Re:OT: Music Box for IRIX? by irix · · Score: 3, Informative
      The Indigo's and Indigo 2 Impact-series were purplish.

      Well, not exactly. The Indy is a blue pizza box. The Indigo2 was a much larger turquoise/grren box, and the Indigo2 Impact (which had tons of problems, BTW) was the same box in purple.

      I should know - I have an Indy and an Indigo2 ;)

      --

      Do you even know anything about perl? -- AC Replying to Tom Christiansen post.
  11. Re:SGI Sucks ( read on ) by Anonymous Coward · · Score: 2, Interesting
    Impressive but overpriced big iron.

    Sorry, but there's nothing overpriced about the Origin 3000 family. I saw a quote for a 16-processor O3400 with 16 GB of RAM; the bottom line at list price was around $500,000.

    That seems totally competitive to me.

    And since I'm posting anonymously anyway, you might be interested to know that SGI is planning to release a new product any minute now. (It was announced to the developer and integrator channels this week). It's a four-processor (MIPS, natch) server in a 2 RU package. They're calling it the Origin 300.

    But the cool part is going to be an interconnect product codenamed Sprouter. It lets you take 2 to 8 Origin 300 systems with 4 procs each and connect them using NumaLink (formerly CrayLink) into a single system image of up to 32 processors.

    At, it's projected, half the price of a 32-processor Origin 3000 system. And for my kind of programming anyway, single-system-image beats the pants off that Myrinet stuff.

    The O300 has 2 66/64 PCI slots, so that's enough expansion to let you attach your basic I/O devices like fibre channel RAIDs and high-speed networking and stuff. Each server comes with USCSI3 built-in, if anybody still uses that stuff. ;-)

    Not everybody needs a medium-scale single-system-image IRIX machine, but I personally do a lot of ImageVision library programming. And ImageVision, being multithreaded at the core, loves big CPU counts. So for me, and people with needs like mine, it's going to be a very cool fall.

  12. Re:Wrong logo, Wrong idea by Anonymous Coward · · Score: 0

    oh yeah, that would be fucking great wouldn't it? why not just appoint Bill G as King of the whole fucking USA and rename MS "Computer Inc.". American's are supposed to be in favour of freedom aren't they? where's the freedom in a monopoly?

  13. Seti@home by goonies · · Score: 2, Funny

    I bet they only want to beat the sith out of the top10 seti@home users by sending in packages every 10 minutes.... ;-)

    --
    .sigh
  14. sif you'd use ia64(tm), or even post about it. by Anonymous Coward · · Score: 0

    AMD's-x86 64Bit.... ... Need I say more?

    1. Re:sif you'd use ia64(tm), or even post about it. by Anonymous Coward · · Score: 0

      Check the SPEC scores for floating point on the IA-64 and shut the fuck up junior. Some of us have real work to do with computers..

  15. two words: by green+pizza · · Score: 2

    imprecise exceptions

    IA64 has pretty clean and easy asm, but debugging is a complete nightmare. It's certainly better than x86, but give me MIPS64 or MIPS R1X000 any day.

    1. Re:two words: by stripes · · Score: 2
      imprecise exceptions

      That's kind of a pain (and at least the IA64 can be forced to do percise exceptions). Debugging something that the compiler has software pipelined will be a giant pain. Your source code will look like a simple loop, but the machine state will show you in the several times (i=7, 8, 9, 10 all at once...but not quite as far into the loop for some values).

  16. Interesting by SilentChris · · Score: 1, Funny
    "Software includes Linux/ia64, SGI's ia64 compiler suite, MPICH/ch_gm, OpenPBS, and Maui Scheduler."

    One only wonders if this would be posted on Slashdot if the OS was Microsoft Windows 2000 Datacenter. :) Just a thought...

    1. Re:Interesting by Anonymous Coward · · Score: 0

      well, it wouldn't, but that makes sense; as a rule of thumb we /.ers tend not to be interested in toy OSes.

      actually yeah the story would have been posted due to the cool hardware++ factor. but the 2k flames would have been flying.

    2. Re:Interesting by Anonymous Coward · · Score: 0

      This's not the same kind of clustering, moron.

    3. Re:Interesting by Guppy06 · · Score: 3, Informative
      "One only wonders if this would be posted on Slashdot if the OS was Microsoft Windows 2000 Datacenter."

      It wouldn't be worth mentioning, since you can't cluster more than four nodes with W2k Datacenter. When you compare that to this cluster of 70+...

    4. Re:Interesting by Anonymous Coward · · Score: 0

      Software "should" be free? Why don't we make hardware free as well. Hell, let's ditch the whole world's economy.

      I agree!

    5. Re:Interesting by OverDrive33 · · Score: 1

      Something tells me, given the chance M$ would take BIG advantage of a chance like this, they'd give most of the licenses away for free, and then make it into a huge publicity stunt. I know I would if I were Big Bad Bill. And I have seen Win2k running 32 procs clusters...

    6. Re:Interesting by Peter+Harris · · Score: 1

      Yeah it would -- on April the 1st :)

      --

      -- What do you need?
      -- Gnus. Lots of Gnus.
  17. Re:SGI Sucks ( read on ) by YeeHarr · · Score: 1

    SGI can not be judged on Oracle support or other database support (although from personal experience Red Brick (a data warehouse database) kicks ass on an Origin 3000)). SGI is specifically targeting the scientific, creative, analytical and technical users. These users use floating point, or need very large single system image (NOT clusters) or need the graphics that SGI provides. V6/8/10/12 graphics provides 12 bits per component with 10bit D-A's in an Octane. This is VERY unusual in the graphics industry. This has enabled human vision research not possible on any other machine. SGI workstations (Octane2), servers and Onyxes have HDIO (Hidefinition TV I/O cards). You need a LOT of bandwidth to handle uncompressed 1020p I/O. Onyxes have at the moment 256MB of texture memory per pipe. This enable volume visualisation impossible on any other graphics card. This is very important for some forms of medical imaging and for Oil/Gas exploration (BUT NOT ALASKA PLEASE!) SGI has a 1024 processor single system image (NOT a cluster) running at NASA AMES and has shipped multiple 256/128/512 proc SINGLE SYSTEM IMAGE Origin 3000s. Scientists like single system image machines. The quote from one of the nasa guys is it is just like using a desktop workstation, only a bit faster :-) These are not ordinary machines and are not targeted at people running Oracle.

  18. Shut the fuck up by HeWhoWalksALot · · Score: 1

    A guy asks a question, and all a f'n dork like yourself can do is nitpick about blue vs. purple? Get a f'n life you loser

    1. Re:Shut the fuck up by BigBlockMopar · · Score: 2

      IT'S INDIGO FOR FSCK'S SAKE

      Well, of course it is. I was building up to it, and revealing the name in the color of the box was rather self-defeating to the melodrama inherent in later naming it an Indy.

      --
      Fire and Meat. Yummy.
    2. Re:Shut the fuck up by Anonymous Coward · · Score: 0

      blue vs. purple? IT'S INDIGO FOR FUCK'S SAKE

  19. Re:fascinating by Anonymous Coward · · Score: 0

    hahah that's too funny... He didn't increase SGI's profits, but he sure as hell increased those of the place he now works at ;)

  20. Re:IA64s are kickass... by VAXman · · Score: 2

    Are you new to computers? The memory you get from malloc isn't actually allocated until you access it. There is no translation for the pages at first, so the first access gets a page fault at which point the OS maps a PTE to it. And, no, you could not achieve that with your first 5GB hard drive because most likely you were running IA-32 which supports only 32 bit linear addresses. On Linux a process has a maximum of 3GB of memory so that's the maximum you would be able to allocate (subtracting the tiny overhead from malloc, as well as the other code and data the process has).

  21. a plumber? like this guy...? by green+pizza · · Score: 2

    http://www.overclockershideout.com/customer/MTP/

    Overclocker kiddies will do anything for an extra 8 frames-per-second.

    Now here's a scary thought... how about the overclockers team up with cluster builders??

  22. Re:Wither XIO? by Anonymous Coward · · Score: 0
    Hmm, PCI and no XIO? Would be nice to at least have a couple of each.

    There's not much you need XIO for these days. GSN is only available in an XIO card right now, but the PCI-64 is supposedly coming soon. SGI's digial video options are only available in XIO, but you wouldn't use those with an O300 anyway. Hell, even OC-12 is available in a PCI card now.

    Wonder which version of IRIX it'll run? 6.5.13 + patches? 6.5.14?

    Given the September release timeframe, I'd guess 6.5.13. O300 has basically the same guts that O3000 has-- R14000 processors, Bedrock memory controller, and so on. I'd be willing to bet money that it'll run the same binary kernel that boots the O3000s.

    Patches are an absolute no-no these days. Unless you've got a support issue that has to be resolved before the next maintenance release, general-release patches to the OS are expressly forbidden.

  23. How Much Power and A/C for this Itanium Cluster? by Anonymous Coward · · Score: 1, Interesting

    Troy, How much power and A/C is required to run this 146 Itanium processor cluster? I have heard that 1 itanium node uses two 800 Watt power supplies and generates 6174 BTUs/Hr. I am guessing that you required 116.8 KiloWatts of power and 37.5 Tons of A/C. Please correct me if I am wrong. What did you have to do at your site to support this cluster?

  24. Re:Compiler? by Anonymous Coward · · Score: 0

    The difference is that pro64 is a modern compiler using very advanced compiler techniques, put together by some top notch compiler gurus. gcc on the other hand is an archaic piece of shit that couldn't optimize its way out of a paper bag.

  25. Re:in related news by Anonymous Coward · · Score: 0

    actually, he IS the guy in charge of those definitions.

  26. Re:IA64s are kickass... by statusbar · · Score: 1

    I haven't done work on the IA64 yet, but I have done work with a software pipelined DSP, the TI TMS320C6701. I found out the following things about compilers for such beasts:

    1) The compilers are MUCH more complex and therefore have complex bugs when compiling complex loops
    2) C/C++ are not good enough languages to express enough hints to the compiler to allow it to optimize properly.
    3) Therefore, C/C++ require non-portable language extensions for these hints.
    4) Advanced usage of C++ inline templates CAN help, but are still not as flexible as you want in terms of allowing the best re-ordering of code possible.

    For example, you HAVE to use the new 'restrict' keyword to give the compiler hints about un-aliased pointers. pragmas or similiar hacks are required to give the compiler hints about how many times a loop will be executed - for example, telling the compiler that although a loop is repeated based on a separate run-time variable or parameter, it is guaranteed to never be 0 and will always be a multiple of 8 loops. That kind of thing allows the compiler to really do a good job on the loops - but is not part of the C/C++ standards.

    In my opinion, C and C++ are too 'low level' and restrictive in terms of allowing the compiler to re-order the code to be really effective on non-typical architectures like these. Unfortunately there doesn't seem to be any other options. I would expect that a functional language like OCaml would allow the compiler much more freedom to do a better job - Once there is real support in OCaml for the ia64....

    Also, I highly doubt that GCC is going to be the best or even a very good compiler for ia64 for some time. I'll be happy if i'm wrong though!

    My opinion is that you did the right thing by diving into the assembly.

    --jeff

    --
    ipv6 is my vpn
  27. Re:Deus Ex!!! by EpsCylonB · · Score: 1

    the nsf are on our side, let unatco suffer some ultimate hacking from them.

  28. Re:IA64s are kickass... by Espen+Skoglund · · Score: 2, Interesting
    It's not only a matter of optimizing for using fewer registers. If you're coding loops for instance, you would most likely want to use the register rotation stuff to do software pipelining of your loop: register r32 becomes r33 from one iteration to the next, register r33 becomes r34, and so on (same thing for predicate and floating point registers). Using this technique you can do some really kick ass optimization of your inner loops. Only problem is that coding the thing and debugging it will do your head in. Now, you would want the compiler to do this stuff for you, but gcc does not support it yet (the SGI Pro64 compiler does, though).

    This was one example of low level optimizations, another one is giving hints to different branches (both target and outcome of branch conditions). This is also best done by the compiler (at least the branch target hints), and works even better if you can supply the compiler with profiling information. You can also give data prefetch hints and specify which cache level different prefetch data should go into.

    Another example of when you might need to do asm is when you do SMP. The reason being that different load and store instructions are given semantics of how the are to behave in a multiprocessor environment: you want acuire semantics on this load, release semantics on this store, fence semantics here, undefined semantics there, etc. I can't see how the compiler would be able to generate correct assembly in this case (unless it is modified so that you can attach some new attributes to your variables and types).

    Then there is this whole plethora of floating point stuff that I won't mention because I don't know shit about it.

    Hmm, reading your post again I see that I didn't really answer your question, and most of my ranting about doing asm coding ended up with the conclusion that having the compiler do all the nasty stuff is probably better anyway. I guess I'd better shut up now.

  29. bah by green+pizza · · Score: 2

    give me a 4D/35, IRIX 4.05 on tape, and leave me alone

  30. Re:wiggy wiggy what?? by Anonymous Coward · · Score: 0

    "Were they bought, and has their line continued??" Yes, they were bought by..... SGI (huh?) and their line continued in the form of the "SGI" ZX10. Insane, isn't it?

  31. OpenPBS? by Guppy06 · · Score: 0, Offtopic

    How the heck do you close-source public television?

  32. Re:IA64s are kickass... by AndrewHowe · · Score: 1

    Why is the parameter to malloc only 32 bit? This is a 64 bit architecture, you should be able to allocate more than 4GB.

  33. Re:Slashdot by Kewlhand`tek · · Score: 0, Offtopic

    i like the mindless trolling and the moronic comments

    --
    The Arkie Libertarian
  34. Wrong logo, Wrong idea by green+pizza · · Score: 3, Interesting

    Slashdot is *still* using the old cube logo, rather than the new "sgi" logo. Sure the new logo sucks and the old logo is quite cool, but it's time to move on. The old days are long gone. Like the rest of the 'new' SGI, there is nothing special about the SGI 750 Itanium box, it's the same box with the same Intel reference board that HP, Dell, and others are selling.

    SiliconGraphics has left the building. The "hip new" SGI is here. Quit using the old logo, it reflects a much cooler company that no longer exists.

    1. Re:Wrong logo, Wrong idea by BitwizeGHC · · Score: 3, Informative

      Hey, never give up hope! Remember how Borland changed their name to the laughable "Inprise" some time ago? They're back as Borland now. Companies want to please their customers, and if enough want some part of the old SGI back they'll get it.

      --
      N4st0r, trixx0r h0bb1tz0rz! Th3y st0l3 0ur pr3c10uzz!
    2. Re:Wrong logo, Wrong idea by e · · Score: 1

      Has anyone else noticed a little bit of a turn in SGI's marketing of late? For a while all I saw advertised was the new Intel cruft, but recently I've been seeing a lot more ads for Octane2s and O2s. Not only that, but they call them Silicon Graphics workstations.

      I think SGI has realized the folly of its full-scale leap into Intel and is finally starting to get back to their roots again.

      e;

    3. Re:Wrong logo, Wrong idea by fgodfrey · · Score: 2

      Err, actually, you won't need a P brick unless you need more than 5 PCI cards. One of the features we are kicking around (read "maybe someday we'll do this but don't go telling anyone that we're committed to it") for system partitioning is the ability to boot a partition I/O less from another partition. You would then be able to NFS mount a root file system over the cross partition communication layer (network interface "cl0").

      --
      Go Badgers! -- #include "std/disclaimer.h"
    4. Re:Wrong logo, Wrong idea by Doppelgaenger · · Score: 1
      I think you're making partitioning sound easier than it actually is. IIRC, for each partition, you need a seperate I-brick. They're _not_ cheap. It's much more cost effective to go with a large, single-image system.

      dopp

      --
      -- If a god of love and life ever did exist, he's long since dead. Someone, something, rules in his place
    5. Re:Wrong logo, Wrong idea by Gill+Bates · · Score: 1
      Personally I think the best thing for them would be to be bought by someone with big pockets who can market a product properly.

      Microsoft? ;-)

    6. Re:Wrong logo, Wrong idea by InsaneGeek · · Score: 2

      Whoops got my bricks mixed... was thinking of the pdu brick.

    7. Re:Wrong logo, Wrong idea by InsaneGeek · · Score: 4, Interesting

      Would you consider the following link new & cool???

      http://oss.sgi.com/projects/linux-scalability/do wn load/mips128.out

      It's a link to a running *single image* (i.e. not a beowulf cluster) 128 proc Linux system on a mips box. When was the last time you saw Dell, HP, Gateway, do that with a Linux system? This little cluster in the post is not where SGI is going with their systems (as you said anybody can do that) but are moving with Intel numa with high speed numalink interconnects that are much faster than standard Myranet (their cross bar is in the gigaBYTES).

      The real interesting part that I see (if they can live long enough for Intel to release Mercede) is the system partitioning and it's modularization. Need 2 more procs for your database but all your CPU slots are full, well plug in another "C brick", you won't have to worry about running out of CPU slots in your frame, everything is a component, you won't have to do another fork lift upgrade. Also with their partitioning I can purchase a 100 proc system, partition it into twenty 5 proc systems all within the same frame I don't have to pay for all of the overhead for 20 different frames and space for expansion in 20 of those frames, because all I need to do is plug in another "C brick" and give two of the boxes two more CPU's without ever having to have bought the headroom to begin with.

      Whoa getting a bit long, SGI really has some cool stuff going on right now. If they could only market themselves out of a paper bag they wouldn't be in the bled-dry situation they are in. Personally I think the best thing for them would be to be bought by someone with big pockets who can market a product properly.

    8. Re:Wrong logo, Wrong idea by InsaneGeek · · Score: 2

      In my example you have already purchased the I-brick, possibly the R-brick (depending upon how large you are), if you are using XIO the X-brick, etc. All you are doing is adding some more CPU's to existing systems in the 3000. Which before if I had 32 procs in a Sun 6500, you were done you had to do essentially throw our your 6400 and do a forklift upgrade to an E10k; with SGI's 3000 you are at CPU's you don't run out of headroom, you never have to get rid of your original investment. I can start with a 4 proc system and scale it up to a 1000 proc system and never have to throw out hardware to get there because you don't run out of headroom (there is a theoretical max I will admit but it's a limitation in interconnect speed).

      Anyway yeah, you have to buy the I-brick, P-brick & C-brick to get a system running and each system will need to have at least an I & P brick; but to add to an existing system no problems.

  35. Re:SGI Sucks ( read on ) by motzkie · · Score: 1

    Great comment!

    Now that SGI is about to be lost would *someone* please contact Dr. James H. Clark?!? I would hate to see this company going down just like Symbolics and Thinking Machines before.

  36. 5.86 Gflops per processor by Perdo · · Score: 5, Informative
    5.86 Gflops per processor (We can assume marketing fud value) at $3,300 Compared to the Athlon 1.4 with a peak of 1.37 Gflops (benched value) for $145. Keep in mind the nifty PCI card that has 4 G4 processors on it. That would make for 92 processors at 1Gflop each in just one box.

    Unfortunately, this seems to mark Intel's latest attempt to push an overpriced, substandard product at us. The P4 was crippled from the begining and is only just now begining to show any promise. The PIII at 1.13 and 1.2 Ghz is finally available 8 months after the recall of their failed 1.13 processor. Even their purchase of Alpha from compaq seems to be just stock propping because the original creators of the alpha are now working for AMD. The reason Compaq was willing to sell in the first place is the second generation alpha has been subjected to over three years of delays because they simply did not have the engineering talent to improve a ten year old design.

    The talented engineeers are working for AMD, built the athlon and are working on the sledgehammer.

    Before anyone jumps to Intel's defence, like they need defending as long as they are the 800 pound gorilla, keep this in mind:

    Craig Barret warned "This was a year of record annual revenue and earnings; yet, slowing economic conditions impacted fourth quarter growth and are causing near-termuncertainty,". He was faced with AMD going from 10% market share to 34% market share in a year. Wall street took barret's word as gospel that the entire market was in decline and not just Intel's market share. Intel is a market bellwether so we all got laid off. Just so Intel would not have to admit that AMD had a better product. Nasty business. Intel does not have a great product and they are reckless with their power.

    --

    If voting were effective, it would be illegal by now.

    1. Re:5.86 Gflops per processor by Anonymous Coward · · Score: 3, Interesting

      think again, Apple current top-end dual 800Mhz PowerMac G4 runs AltiVec code at over 5GFLOPS. My older dual 450Mhz can run up to 3.2GFLOPS... And don't forget, the G4 has a successor due pretty soon now. I think these guys will find the pleasure of being the first will be tempered by eventually having the worst price/performance 64bit cluster in academia.

    2. Re:5.86 Gflops per processor by BlowChunx · · Score: 1

      Single precision only and with code that "knows" how to use the short vector units. Although writing for altivec can't be harder than MPI ;)

    3. Re:5.86 Gflops per processor by Anonymous Coward · · Score: 0

      um... I'm not leaping to the defence of Intel, but this is their first step in 64 bitness, and having actual registers (like any other processor.) That's good stuff. Say you've got to write a simulation, and your model grid is 4 GB x 4GB (16 GB total) Do that in IA32, oh... and use double precision. On IA32, you're back to those nasty "memory models" of yore, overlays, segment offset, etc... yuck! Then compare the timings of such a monstrosity against an IA64 version, which would "just work." Sure, it's no different than any other 64 bit RISC machine, but it's bound to bring costs down.

    4. Re:5.86 Gflops per processor by David+Roundy · · Score: 1
      Apple current top-end dual 800Mhz PowerMac G4 runs AltiVec code at over 5GFLOPS. My older dual 450Mhz can run up to 3.2GFLOPS...

      I'm sorry, but altivec is useless for any sort of scientific computation. It can only deal with single precision floats, and then it is only able to do four adds simultaneously. There are good reasons to use powerpc processors (or better, POWER processors) for scientific work, but altivec is not one of them.

  37. Not the first! by Anonymous Coward · · Score: 0

    On an anonymous, cowardly, note :
    SGI was not the first to have an Itanium cluster.
    HP has (internally) a cluster of 16 4-cpu Itanum servers hooked up with gigabit ethernet.

  38. wiggy wiggy what?? by Anonymous Coward · · Score: 0

    When did Intergraph go under? My video editor friend just got one last year. Were they bought, and has their line continued??

  39. the difference between the elite and the l33t by Anonymous Coward · · Score: 1, Funny

    True HPC Coder:
    Myrinet interfaced via PCI? Argh! SN-IA won't be here soon enough!

    l33t d00d with an overclocked athlon:
    SCHWEEEET, a BEOWULF CLUSTER! With an IDE RAID on each node I could have years of DiVX movies on that!!!

  40. woohoo by popeyethesailor · · Score: 1

    Can anyone imagine a Beowulf cluster of "Where is the old SGI " laments ?

  41. Re:15 years for MicroSoft to 64 bits? by denshi · · Score: 1

    c'mon. It took MS that long to get some competent OS engineers working for them. Hopefully this time the process will be faster. 64-bit VMS!! Go, Cutler, go!!

  42. Re:SGI Sucks ( read on ) by daviddennis · · Score: 2

    About three weeks ago, I got one of those SGI Insider newsletters in my email, with an unbeatable offer for one of their 1600SW monitors. I really wanted an Apple Cinema Display, of course, but for $550 plus tax and shipping, it sure was tough to beat; they're still about $900 on eBay.

    I placed my order straight away, and it still has not arrived. The reason? During this crisis period for the company, where everything is falling apart, they are apparently changing their sales systems to use the latest version of Oracle. During this transfer, their ordering system is entirely manual. They haven't lost my order (yet), but they did take many more orders than they had available machines for, and they claim to still be straightening out the mess.

    I lost even more faith in SGI when I noticed their "configure your SGI system" web page was Windows-based. For shame, SGI!

    Part of this is, of course, the kind of weird degradation that occurs at any big company. But I can't say I have any faith in the company that makes that kind of blunder. Changing your ordering systems to the extent that operations are severely impacted is just plain stupid; I've never managed a major business, but if I were, the minute I had problems of that severity, I'd go straight back to the old system and send developers of the new one back to the drawing board. Elementary, surely?

    Oh, and even though I am surely a beneficiary of a great monitor deal because of this, they should have dumped all the monitors on eBay, ten or so at a time. They would have made, maybe $ 800 per monitor instead of $600. That's a pretty respectable chunk of change right there.

    D

  43. Re:SGI Sucks ( read on ) by Anonymous Coward · · Score: 0

    Look inside the o2000 Or even to a certain extent the o200 (haven't looked at teh o3000 series). THes have bandwidth in abundance for shunting data around. 5 GB/s bus rates. The problem is that todays $50000 SGI is outclassed by a desktop machine at best buy inside of three years. You need a hell of a lot of ROI to justify that price tag. They do make some nice stuff though. XFS has been world class, and the boxes teh stuff comes in are amazing (remember the SGI fridge?) ..d

  44. These numbers are meaningless by RelliK · · Score: 2
    P4 Xeons have been available for several weeks now; we've had a dual P4 machine on site for about a month.

    OK, I was not aware of that, but tell me where I can actually buy them. I looked at Dell's site before I posted. If Dell doesn't have them, nobody else does.

    The NAS Parallel Benchmarks would seem to indicate you are wrong.

    You pointed to the single (type of) benchmark that makes P4 look good, though not because of P4's virtues. P4 has two channels of RDRAM giving it 1600 * 2 = 3200 MB/s of memory bandwidth. The Athlon machine in the benchmark you pointed to had a single channel of DDR DRAM, giving it 2128MB/s of memory bandwidth. Fluid Dynamic simulation is one of the _few_ things that can really take advantage of greater memory bandwidth. Therefore P4 wins this round by riding on Rambus's coat-tails. Prove me wrong. Point me to the same benchmark where both P4 and Athlon have the same type of memory (both support SDRAM now, you know).

    --
    ___
    If you think big enough, you'll never have to do it.
  45. codeing for ia64 in asm? by nick-less · · Score: 1

    Hm, I don't think ia64 was ever designed to be programed in plain asm. Isn't it very painful to care about that epic stuff by hand?

    1. Re:codeing for ia64 in asm? by Innominandum · · Score: 1

      I have to say that I have never really seen a highly optimizing HLL compiler ever beat an intermediate Assembly programmer. It is true for an architecture like Alpha, Assembly becomes less important.

      But for IA-64... I remember reading through some of the IA-64 specs and it looks like low-level programming becomes even more important. This is coming from memory (and I'm running on 4 kb), but I recall Intel even saying on their pages that it is difficult to optimize for this platform. There are even more rules, special cases, and so forth.

      It's a pretty unusual architecture and I don't think any compilers would be able to output really tight code without a total back-end rewrite. A compiler, even optimized for IA-64, would have a harder time beating an Assembly programmer because of this. But once you get "into the groove" looks like it could be pretty nice.

      Like I said, this is all coming from memory so I can't cite specific documents. But if anyone else would like to prove me otherwise (and I *hope* somebody does), please post a reply. I really look forward to working with 128 general purpose registers one day!

      In all honestly, I think Intel should have scrapped IA-64 completely and adopted Alpha as its next platform. Backwards compatibility is killing them!

  46. I want one !!! by TrollMaster5000 · · Score: 0

    I want one !!!

  47. Re:SGI Sucks ( read on ) by LinuxParanoid · · Score: 5, Insightful
    What went wrong?

    I know this is a rhetorical question, but having once spent a lot of time thinking about how to advise them before and during their fall, I'll give you my analysis. Some of this I saw at the time, some aspects I only saw too late. Learn from their mistakes.

    Here's what went wrong:

    • SGI succumbed to the Innovator's Dilemna. Unwilling to canibalize their high-end graphics, they refused to enter the PC 3D graphics space and left it open for 40+ hungry competitors. Many of their engineers left for said competitors when it was clear SGI was going nowhere. (Jim Clark recognized the dilemna in 1992-1994 by the way and jumped ship himself, a harbinger of things to come.) In fairness, its hard to canabalize a 1-2 billion dollar workstation graphics market in hopes of winning a 50-200 million dollar 3D PC graphics one.
    • SGI refused to go to NT early on when they would have had leverage in making the move that could have forced/encouraged MS to adopt OpenGL exclusively for 3D. Instead, they said no to NT for too long, and when they said yes, it was a me-too decision that was later partially reversed in favor of Linux. Rather than recognizing and admitting they'd lost the war and pursuing the best possible terms, they chose, either conciously or through inaction, the "go down fighting, maybe we can still win" route.
    • SGI's bread and butter midrange workstation 3D graphics was prone to "good enough" copying by competitors, Sun and HP. SGI's engineers spent a lot of time focusing on developing unique high-performance texture mapping serving 2% of their market (the entertainment sector) rather than on improving geometry engine performance further for the benefit of the biggest 40+% market (CAD).
    • SGI's choice of a strategic response to PCs was poor: "We'll have highly differentiated systems" (the O2). Unfortunately, the differentiation (UMA, texture mapping, imaging, system bus architecture) was largely in areas that didn't add much value to their largest segments of customers. They built not what most of their customers wanted, but what the "cool" customers wanted. What most of their customers wanted was lower prices- that's what most of them ended up going to when dumping SGI.
    • SGI engineers were late. Whether through lack of focus/discipline, resistance to "impossible" marketing schedules (that turned out to be necessary), choice of agressive cutting-edge/bleeding-edge component technologies that proved hard to debug, whatever. Your pick. Key products in the timeframe you mentioned were late, late, late. The midrange IMPACT graphics were announced June 95 as shipping but in reality didn't really ship for another 6 months, more or less. (In the meantime, Wall Street lost faith in the company as a momentum stock and SGI stock price dropped from its alltime high of 45 down to mid-20s.) Subsequent products also had a tendency of being late (O2, a year later than needed, Visual PC was late, etc.) SGI engineering exhibited a lack of discipline when instead they needed increased focus to adjust from product design cycles of 4 years (traditional workstation graphics) to 6 months (90s PC graphics). In their defense, this wouldn't have been easy. But at least some there knew about this. Which brings us to the last problem I'll go into.
    • Arrogance. SGI was arrogant. No PC could beat us. We'll always stay ahead. Sun? HP? Ha! Yeah, we can do low-price graphics, look at Nintendo-64; see a PC beat that! etc, etc. Not everyone at SGI there was, but a heck of a lot were.
    --LP

    P.S. I didn't even get into their server strategy, Cray, and later events. Another time perhaps.

  48. Well half the hardware actually is free. by Anonymous Coward · · Score: 0

    Figure out how much their saving by using gpl-ware. Well gee... Win2K data centre is for large SMP's, and clusters thereof. I would think that each node here would run Adv. Server Instead. So... take your advanced server license cost, multiply by number of nodes, then add a license for a development environment for each user (probably a few thousand of those) and Platform's LSF. Wow! cool! now the software costs more than the hardware. OK we'll have to get a system only half as big. That'll make the economy run real GOOD....

  49. SGI has been faithful at least by HerrGlock · · Score: 4, Interesting

    It seems the ones who have been faithful to their commitment to Linux are SGI and IBM. The others have tried it and then decided it was not worth the effort to reach such a small segment of the population.

    I'm glad there are still big players in the Linux field, though, it helps forward the cause and the OS and lets people know there IS an alternative. By all means, SUN and other, keep your propriatary stuff available and have that as the default, but allow people the option to choose another OS if they so desire.

    DanH

    --
    Cav Pilot's Reference Page
    UNIX - Not just for Vestal Virgins anymore
  50. No, 11.8 really. by hotsauce · · Score: 1

    And this is different from SGI or Intel maketing data how?

    Actually, better G4 benchmarks are at Black Lab Linux.

    1. Re:No, 11.8 really. by Anonymous Coward · · Score: 0

      "And this is different from SGI or Intel maketing data how?" Not at all. Do you believe SGI and Intel marketing spin as though it were fact - if you do, you need to wise up

  51. Re:15 years for MicroSoft to 64 bits? by cr0sh · · Score: 2

    Correct me if I am wrong...

    But hasn't there been a 64 bit version of Linux for a VERY long time - ie, for the Alpha?

    What I find interesting (and it is just a personal conspiracy theory of mine, and probably holds zero water) is that it took "this much" time for Intel to release their 64 bit chip - how long have we been hearing about it now? At least 2 years...

    Is it just a coincidence that Microsoft finally has a running platform for it just at the time this chip comes to market?

    ???

    --
    Reason is the Path to God - Anon
  52. Re:How Much Power and A/C for this Itanium Cluster by Troy+Baer · · Score: 1

    OSC's Itanium cluster is physically located at the State of Ohio Computing Center (SOCC), which provides all the power and cooling. The power for the cluster is nineteen 30A/220V circuits, which go to a set of Sentry power controllers. The total heat load is about 195k BTU/hr, but the SOCC building is (over-)engineered such that this much of a heat load did not require additional AC capacity.

    --Troy
    --
    "My life's work has been to prompt others... and be forgotten." --Cyrano de Bergerac
  53. Slashdot by Anonymous Coward · · Score: 2, Insightful

    This story:

    http://dailynews.yahoo.com/h/nm/20010809/tc/tech _s upercomputer_dc_1.html

    Has been out for all most *five* hours. The story talks about how Linux is going to be used as the OS for the *biggest* *cluster* *of* *super* *computers* *in* *history*

    It is the greatest news I have heard in months and it "matters" if you ask me. The Super computer(s) will be funded by the National Science Foundation(NSF) and it is reported that the super computer(s) will be able to calculate in one second, what it would take a hand calculator ten millions years to calculate. In addition the total disk space will be enough to hold all most one and a half million full-length novels.

    In other words, the Linux OS is going to be used for the largest computing grid in the history of the world.

    This story has been availavle on Yahoo!(TM) LinuxToday, Newsforge etc for hours. I submitted the story 3 hours ago and nothing...

    I used to read slashdot for the news and told myself I could ignore the mindless trolling and moronic comments, now I realize the news service is garbage and I have no reason to read /. anymore.

    Looks like it is newsforge or LinuxToday for me
    :-)

    1. Re:Slashdot by kietscia · · Score: 2, Insightful
      A whole 5 hours before it showed up on Slashdot. Oh the humanity . If I'd only known about this a few hours ago when that extra $10M laying around on my boss' floor I could of had one. Impatient twirp went and spent the whole wad on 1 month of 24x7 support for our lone Win2K server.
      </sarcasm>

      Reading /. is not done for timeliness (although I usually see things here first) its for the breadth and variety of articles. If you want a place where things show up picoseconds after they hit the ether then you've shown up for the wrong party. And yes, there are trolls and moronic comments (scroll up a bit and you might see one) but the good stuff far outweighs the negatives. Since I gave up coffee (I'm still in mourning over that one), /. is the first thing I do in the morning.
      --
      -- If it isn't broken, you haven't let my users have a crack at it yet --
    2. Re:Slashdot by Anonymous Coward · · Score: 0

      Hmm... You seem to think that Slashdot is a news site. It isn't really, it's a web log. I gave up a long time ago expecting the level of journalism that I would get from other sites. It's not about that, I doubt they are even watching the wire. It's just a a few dorky kids posting stuff they like. Get over it!

  54. woah by EpsCylonB · · Score: 0, Troll

    thats gotta make one seriously fast game of quake 3...

  55. Re:Pirst Fost? by niko9 · · Score: 0, Offtopic

    Pirst Fosting is sooo like 30 seconds ago....

  56. IA64s are kickass... by Telek · · Score: 5, Interesting

    I had an opportunity to work on them about a year ago (the first one we received was a doorstop, literally... The sucker weighed 73 pounds in it's shipping package (I'm NOT KIDDING... They reeled the box in on a trolly, and I laughed at the guy cuz it looked small enough to carry, but then I tried to pick it up...) and didn't even boot, but intel shiped them with 2GB of ram and a kickass SCSI system, so let's just say that my desktop became a SWEEEEEET machine.), but once we got ones that did work, they were sweet machines. I was porting bigint libraries for encryption that I had hand-coded in assembly for the x86 platform, and going from 8 general purpose registers with 1 predicate register (i.e. only 1 carry flag) to having 128 general purpose registers, and 63 predicate registers was a GODSENT.. AMAZING... For anyone who's coded math routines in assembly, you know how much of a PITA it is having only one carry register. This was simply amazing. I could do 1024-bit RSA purely in registers, no memory access outside of the initial read of the data and the final write. Needless to say it flew. It was interesting because literally you wouldn't need a hardware crypto card if you have an Itanium system. So basically Intel really put in a lot of good effort into designing this new platform to avoid the pitfalls of the problems that they experienced with their x86 architechture.

    The machines also had 4GB of ram, so it was fun to do:

    char * myStr = (char *)malloc(-1);

    and have it succeed! (that's a 4GB memory allocation)

    --

    If God gave us curiosity
    1. Re:IA64s are kickass... by blair1q · · Score: 2

      Which obviates your point.

      A 32-bit CPU can address 2^32 bytes of virtual memory just fine.

      --Blair
      0xffffffffffffffff

    2. Re:IA64s are kickass... by statusbar · · Score: 1

      YES! As long as it can interface well with C. The DSP algorithms usually have to be rewritten from scratch anyways!

      For example in C and C++ I end up rewriting the standard biquad filter not just for each DSP architecture but for each way that the data would be organized. And the math is the same (and pretty simple) in each way. I feel that this is exactly the case where the compiler could do the re-organization for me. But the compiler needs to know a higher level concept of what I am trying to do - So it could not only run multiple iterations of a loop in parallel - it should also be able to interlace the instructions that comprise two or more function calls with each other (that are allowed to be done in parallel) in order to fill up the pipeline.

      --jeff

      --
      ipv6 is my vpn
    3. Re:IA64s are kickass... by schwap · · Score: 2, Interesting

      With that many registers, wouldn't it be better to let the compiler do the work since it should be able to optimize it quite well? Being an assembler programmer at heart, I know the necessity of getting down to the bare metal, but my experience was that I only needed to do it because the x86 arch had so few registers that I could, indeed, do it better than any compiler.

    4. Re:IA64s are kickass... by Anonymous Coward · · Score: 0

      You have to have 64bit cpu though.

      Which brings me to another point: if you're 64bit, wouldn't -1 translate to 2^64-1?

    5. Re:IA64s are kickass... by Telek · · Score: 1

      because I was using base standard c calls at the time, which use 32-bit params. I was just futzing around trying to see how fast I could access things, etc, and porting some code that already used malloc, so I said "hmm, I wonder" and tried it with malloc(-1). Fun!

      --

      If God gave us curiosity
    6. Re:IA64s are kickass... by Anonymous Coward · · Score: 0

      Please don't propagate bad programming style. It's cute but many of the wannabe and young programmers here are now going to try to use some variation on this this cute trick.

      char * myStr = (char *)malloc(0xffffffff); works just as well and you won't get screwed when you move to a machine that has 64 bit unsigned integers. Or one that treats all numbers as signed ints by default.

    7. Re:IA64s are kickass... by blair1q · · Score: 2

      The machines also had 4GB of ram, so it was fun to do: char * myStr = (char *)malloc(-1);

      You're going back to school, m'laddo.

      malloc(3) has overhead. You should have expected that to fail if you were thinking only of how much RAM you had. Fortunately, modren cowmpooterz has virtual memory. So you could have got away with that when you got your first 5GB hard drive.

      --Blair

    8. Re:IA64s are kickass... by blair1q · · Score: 2

      "The memory you get from malloc isn't actually allocated until you access it."

      malloc(3) has a job to do, and that's to keep you from using memory you can't use. If it doesn't "allocate" it until you access it, then whatever system you're using is incontrovertibly broken.

      malloc(3) is at least two layers above the actual memory. Plenty of room for the MMS and MMU to translate it.

      Don't go assuming what I was running.

      And if Linux limits memory to 3GB (which I did not know, as I don't often dig around the internals of toy computers) then what good is it?

      --Blair

    9. Re:IA64s are kickass... by Telek · · Score: 1

      ***EXACTLY*** my point....

      C is very very limited... For example, the single most important thing when coding math is using the carry flag, and there is NO WAY to use the carry flag in C... For example, to add 2 32-bit values together in C:

      boolean add32 ( UINT32 a, UINT32 b, UINT32 &res)
      {
      *res = a + b;
      return ( *res > a );
      }

      whereas in assembly, you can just use the flag... This results in **at least** twice the speed of your application, just for adds... C is way too inspecific to be used effectively for math routines. That's why I had to resort to assembler. Even on the kickass IA64 with wacky crazy assembler that's very very wierd to use, and we are told by intel that "you should never need to use assembler, we have spent millions of dollars optimizing our compilers for it!", they specifically instructed us to NOT USE ASSEMBLER in our ports of our crypto code, citing that it'd be faster to use C... Well I did that, and then later on ported it to assembler myself, by hand, and showed them that the asm code was 3x faster than the C code... Why? Just because of the things that I could access in ASM that I couldn't access in C. The people at intel were not too happy about this, and couldn't understand why it'd be faster. (sigh). Oh well... (end rant)

      --

      If God gave us curiosity
    10. Re:IA64s are kickass... by typedef · · Score: 1

      Speaking of which, its considered bad programming style to cast the return value of malloc().

    11. Re:IA64s are kickass... by darkphyber · · Score: 1

      Err.. hasn't all that been possible with the Alpha platform since it's creation?

    12. Re:IA64s are kickass... by Anonymous Coward · · Score: 0

      You don't even need a 5GB hard drive. You can do that on a diskless workstation. Most unixes allow you to overcommit memory. All 4GB worth of your pages are all pointing at the same zero'd page. Not until you write to them will the be COW'd into real memory/swap.

    13. Re:IA64s are kickass... by Anonymous Coward · · Score: 0

      For DSP type stuff do you think people would go for using a different language easily? Even if it is not similiar to C/C++? You are quite right that trying to get a compiler to optimize C/C++ is difficult, especially for complex architectures. (I'm a compiler researcher btw)

    14. Re:IA64s are kickass... by Anonymous Coward · · Score: 0

      You get warnings otherwise and you won't get this to compile with strict warnings. You are using warnings as errors aren't you?

    15. Re:IA64s are kickass... by Anonymous Coward · · Score: 0

      Do many high-level language compilers use AI-search techniques to find the best register-swapping algorithm to break the program down into ? Perhaps they do already -- I'd imagine it'd be necessary to ensure that the registers on this platform are used efficiently by a program. Compiling a program using this method would take considerably more time than what it takes to compile general programs today, and the advantages of downloading a pre-compiled binary version of a program over downloading the source-code and compiling it on your home computer would probably be a good idea. .. sfda;lfdsa;lk .. I think.

  57. Re:obvious by ThatComputerGuy · · Score: 2, Funny

    he said no imagining... from now on, all beowulf clusters must be fathomed.

    --
    XML is like violence. If it doesn't solve the problem, use more.
  58. Humm by Aqua+OS+X · · Score: 1, Funny

    Intel... you're painting yourself into a corner with this mhz battle against AMD, Moto, and IBM.

    --
    "Things are more moderner than before- bigger, and yet smaller- it's computers-- San Dimas High School football RULES!"
  59. in related news by Matthew+Luckie · · Score: 3, Informative

    there is a much larger cluster of linux machines going to be created care of the NSF. press release here. good day for linux!

    1. Re:in related news by Perdo · · Score: 2
      "The DTF will be the largest, most comprehensive infrastructure ever deployed for scientific research"

      "creating the 13.6-teraflops system--the most powerful distributed computing system ever"

      OK, what exactly does that make seti@home? 16.98 TeraFLOPs/sec as I write this, distributed computing system, scientific research... What am I missing?

      --

      If voting were effective, it would be illegal by now.

    2. Re:in related news by Perdo · · Score: 1

      Sort of like the guy just pretending he has courage, throwing himself on a grenade to save his friends, seti@home (currently at 22.13 TeraFLOPs/sec) is a great model for getting the job done. Distributed/centralized doesn't matter. That's more work being done than any other known computer ever. When the goal is a powerful computer, semantics are trivial. Perhaps linking all current research computers into a client/server relationship would be superior to a distributed system spanning the continent. Cheaper. Not as sexy. No big budget or bureaucratic opportunities. Never happen. Never mind

      --

      If voting were effective, it would be illegal by now.

    3. Re:in related news by novarese · · Score: 1

      Seti is not a true distributed system; the nodes dont communicate with each other, they only get data from a central server and submit results to that server.

    4. Re:in related news by AndrewHowe · · Score: 1

      Oh, are you the guy in charge of defining what phrases mean?
      Seriously, a distributed system doesn't have to be fully connected to be "true".

  60. computing power by revoid · · Score: 1

    this reminds me of the cluster here at my university.
    a team of about 4-5 guys trying to keep the network up and running because of stupid users.

    let's just hope this megacluster will be put to good use and not be the target of some bored hacker who installs a counter-strike server or whatever.

    *believes in the good of humanity*

  61. They DO exist! by reverius · · Score: 0

    Wow... apparently these things are real. I thought it was just marketing, but...

    they're real! :P

    Wow. After all this time, i've never heard of one being used. Even within Intel. :)

  62. daaaaamn.. by Anonymous Coward · · Score: 0

    Someone hook all this shit up to dnetc! Talk about a kickass keyrate..

  63. geez.. sounds like your describing Deep Thought by Pengo · · Score: 2, Funny


    We can ask it for the answer to the great question of the universe?

  64. a dialogue by the_tsi · · Score: 4, Funny

    Slashdotter: I wish to complain about
    this company what I read about not half an
    hour ago on this very website.
    Me: Oh yes, the, uh, the Workstation manufacturer...What's,uh...What's
    wrong with it?
    S: I'll tell you what's wrong with it, my lad. it's dead,
    that's what's wrong with it!
    M: No, no, it's uh,...it's resting.
    S: Look, matey, I know a dead company when I see one, and
    I'm looking at one right now.
    M: No no it's not dead, it's, it's restin'! Remarkable company,
    the SGI, idn'it, ay? Powerfull CPUs!
    S: The CPUs don't enter into it. It's stone dead.
    M: Nononono, no, no! It's resting!
    S: All right then, if he's restin', I'll wake him up!
    (shouting)
    'Ello, Mister Bob Bishop! I've got a lovely fresh government
    contract for you if you show...
    M: There, it moved!
    S: No, it didn't, that was you faking a press release!
    M: I never!!
    S: Yes, you did!
    M: I never, never did anything...
    S: (yelling and hitting the cage repeatedly) 'ELLO SGI!!!!!
    Testing! Testing! Testing! Testing! This is your nine
    o'clock alarm call!

    See, guys, I told you they still had life left! :)

    -Chris

  65. Just imagine.. by nr · · Score: 0

    ..a beowulf made of these pouring hot gritz down Natalie Portman pants.

  66. Re:Why should I _care_ about having the same memor by Troy+Baer · · Score: 1

    But you cannot use your benchmarks to claim that P4 is faster than Athlon. Your benchmarks show that P4 with RDRAM is faster than Athlon with DDR DRAM. This does not imply that P4 is faster than Athlon, which is what you were trying to claim.

    Actually, I was providing a counter-example to a specific comment of yours:

    If software was optimized for P4 and Athlon, P4 would still be pathetic.
    I can get Athlon-specific optimizations if I use the Portland Group compilers (-tp athlon -Mvect=prefetch), so claiming that "AMD has never had software optimized for their CPUs" is not entirely true. Certainly they're not well developed as the P4 optimizations in Intel's compilers, but they do exist.

    Also, you cannot make such a claim based on only one benchmark.

    I have a hard time viewing the seven different codes in the NAS Parallel Benchmarks as "one benchmark"; the algorithms in the CG (conjugate gradient) benchmark for instance are very different from those in FT (Fourier transform) or IS (integer sort), and they stress different parts of the architecture. They do all favor high memory bandwidth systems, but that's the nature of the beast in HPC -- almost all scientific applications are memory bandwidth hungry.

    --Troy
    --
    "My life's work has been to prompt others... and be forgotten." --Cyrano de Bergerac
  67. It is worth noting... by uncleFester · · Score: 3, Interesting

    The person who submitted the story (Troy Baer) is also the admin of the beast. Troy had an interesting article on the current (previous?) cluster setup at OSC in one of the recent Linux mags (Linux Journal, 2001 July). To call Troy a proud father of this setup might not be too far off. ;)

    I knew Troy from school, admin-ed with him in the Ohio State engineering labs. Ask him what he's doing with that Aero Eng. diploma nowadays.. ;) Overall, a pretty damn sharp guy. He gets to play with Linux/SGI clusters now, I'm stuck with Alphas & an O2000 in a back room somewhere.

    -'fester

    --
    -'fester
  68. Not a bad deal by wirefarm · · Score: 4, Funny

    "... The old system will be divided into smaller clusters and cascaded to faculty ...

    "Damn. I asked for an iMac, but got this stupid Linux cluster instead!"

    ;^)

    Cheers,
    Jim in Tokyo

    --
    -- My Weblog.
  69. How about... by DiLLeMaN · · Score: 0
    running Seti@home/dnet/whatever on this baby?

    --
    /var/run/twitter.sock is a twitter socket puppet.
  70. bahh ... this isn't supercomputing ... by Anonymous Coward · · Score: 0
    ... you young whipper snappers don't know what supercomputing is ...

    ... I'll tell you what supercomputering is ... back when I was a young lad, in days when we had to compute uphill, both ways, with punch-card boxes strapped to our feet for shoes ...

    .... we had to hire a plumber to install the cooling system in our Cray 2, it was so hot ...

    ... don't get me started ...

  71. Re:Itanium? by NDPTAL85 · · Score: 1
    "Presume there is something that cannot be labeled. Call it 'X'."

    Hrm.... Malcolm X or Mac OS X?? Kinda messes with your head doesn't it?

    --
    Mac OS X and Windows XP working side by side to fight back the night.
  72. You want numbers, here're some numbers... by Troy+Baer · · Score: 1

    I was hoping to avoid this pissing match, but these claims are too ridiculous to let pass.

    If software was optimized for P4 and Athlon, P4 would still be pathetic.

    The NAS Parallel Benchmarks would seem to indicate you are wrong.

    A Xeon is based on P3 core. Xeon is just an overpriced P3 with a larger L2 cache. My claim stands: there is no such thing as dual P4. If and when Xeons with P4 core are relased, then we'll talk.

    P4 Xeons have been available for several weeks now; we've had a dual P4 machine on site for about a month. Here's the /proc/cpuinfo from it:
    processor : 0
    vendor_id : GenuineIntel
    cpu family : 15
    model : 0
    model name : Intel(R) Xeon(TM) CPU 1700MHz
    stepping : 10
    cpu MHz : 1695.874
    cache size : 256 KB
    fdiv_bug : no
    hlt_bug : no
    f00f_bug : no
    coma_bug : no
    fpu : yes
    fpu_exception : yes
    cpuid level : 2
    wp : yes
    flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss tm
    bogomips : 3381.65


    processor : 1
    vendor_id : GenuineIntel
    cpu family : 15
    model : 0
    model name : Intel(R) Xeon(TM) CPU 1700MHz
    stepping : 10
    cpu MHz : 1695.874
    cache size : 256 KB
    fdiv_bug : no
    hlt_bug : no
    f00f_bug : no
    coma_bug : no
    fpu : yes
    fpu_exception : yes
    cpuid level : 2
    wp : yes
    flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss tm
    bogomips : 3381.65

    You really ought to check your facts...

    --Troy
    --
    "My life's work has been to prompt others... and be forgotten." --Cyrano de Bergerac
  73. There is *nothing* wrong with Atari ST's by Anonymous Coward · · Score: 0

    ...& just Imagine a Beowolf Cluster of THEM!!!

  74. Compiler? by joeytsai · · Score: 1

    What's the difference between SGI's ia64 compiler suite (Pro64) and the gcc compiler for ia64? I know the Pro64 is GPL, so why are there two different compilers?

    Thanks.

    --
    http://www.talknerdy.org
    1. Re:Compiler? by BarfooTheSecond · · Score: 1

      Try to compile a program with gcc and with SGI's compiler, both with full optimisation.
      Preferably use multi-treaded/multiprocessors sources.
      Run them both and see how the SGI compiled one is much more faster and efficient...

  75. not what Linux was intended for! by Anonymous Coward · · Score: 0

    Remember ppl, Linus only wanted an operating system for 386s ;-)

    1. Re:not what Linux was intended for! by OverDrive33 · · Score: 1

      I think its great when something can be just recomiled to work on a completly diffrent hardware setup! (beat that Bill!) I don't think Linus would be all that upset to know something he wrote is being used for projects that big.

    2. Re:not what Linux was intended for! by Guppy06 · · Score: 1

      In case you haven't noticed, the code was re-compiled for the Itanium...

  76. ItaniuMMM--good!!! by Anonymous Coward · · Score: 0

    Imagine a Beowolf Cluster of THESE!!!

    1. Re:ItaniuMMM--good!!! by pressman · · Score: 1

      Imagine if Beowulf had a cluster of these! Oh, wait he's a Medieval Welsh hero. Sorry.

      --
      Pooty tweet
  77. Re:Why should I _care_ about having the same memor by RelliK · · Score: 2
    Oh man, fro your last post I got the impression that you saw the light. Apparently I was mistaken.
    Actually, I was providing a counter-example to a specific comment of yours: If software was optimized for P4 and Athlon, P4 would still be pathetic.
    OK, let me repeat what I said in my last post. Your benchmarks show that P4 with RDRAM is faster than Athlon with DDR DRAM. This does *not* imply that P4 is faster than Athlon. To give you a mathematical analogy, if you know that A + B > C + D, you cannot claim that A > C : you do not have enough information for that. (This is especially true when you know that B > D). In the case of your benchmarks, you cannot claim that P4 is faster than Athlon, especially since you know that two channels of RDRAM have a greater bandwidth than once channel of DDR DRAM.

    They do all favor high memory bandwidth systems, but that's the nature of the beast in HPC -- almost all scientific applications are memory bandwidth hungry.

    Who cares if the benchmarks use different algorithms? They all test exactly the same thing: memory bandwidth. This statement of yours actually makes my point above stronger. You admit that all of these benchmarks are highly dependent on memory bandwidth. It appears that you agree with my supposition that these benchmarks depend more on the memory bandwidth than the CPU speed (integer sort for example would be 100% memory I/O). Yet you are still trying to claim that P4 is faster than Athlon. These benchmarks do not prove that. They prove that the bandwidth of two channels of RDRAM is greater than that of one channel of DDR DRAM, but we kind of knew that already, didn't we?

    So, your counterexample is invalid and my claim stands. I really think that we are starting to go around in circles.

    --
    ___
    If you think big enough, you'll never have to do it.
  78. Re:Why Itanium? by Anonymous Coward · · Score: 0

    sgi sold their MIPS division, I believe.

  79. Re:Bullshit alert by Anonymous Coward · · Score: 0

    "Uhhh, perhaps because there is no such thing as dual P4?" what the fuck do you think a Xeon IS you stupid stupid cunt

  80. Re:SGI Sucks ( read on ) by Anonymous Coward · · Score: 0

    and we have a similar situation in the UK with Quantel. The root cause of both of these miseries was actually Intergraph. Intergraph spent a ton of cash creating the first really powerful NY boxes, sold them at a loss to tempt people away from Octanes and Henrys, and then folded leaving their primary market (TV and film post production) in a total dilemma. Lack of sales having stalled R&D at SGI and Quantel both, and the myth of NT performance having been widely accepted, suddenly there weren't any powerful NT boxes that would run the apps that the clients needed, and SGI and Quantel were both busy doing crazy things to try and hold on to their stockholders. Java on Quantel? NT on SGI? both ridiculous and insane. So we now have IBM intellistations and other such feeble hardware substituting for Intergraph kit, the only ray of hope is Apple's adoption of a professional OS core that might allow scalability into the SGI domain. But Apple may decide not to do that, rather expend all their energy trying to fight with box shfters at the consumer end. :-[

  81. Sir, you're full of shit by Anonymous Coward · · Score: 0

    "A Xeon is based on P3 core. Xeon is just an overpriced P3 with a larger L2 cache."

    Are you a retard? Xeon is a P4 core!!!

  82. Deus Ex!!! by Anonymous Coward · · Score: 0

    What?!! The NSF builds a supercluster?

    They must be up to something!
    Somebody inform UNATCO now!

  83. Where it all went to hell by Anonymous Coward · · Score: 1, Interesting

    If I remember correctly, in about 1996/97 this guy went to work for SGI. He'd already had similar high-up jobs at two other companies that lasted nearly 6 months, just like his stay at SGI was going to. What he did was to go around all of these companies saying that they needed a "Windows NT strategy"
    The strategy was to go NT4.0 on commodity (intel) hardware. So, SGI announced that it was reducing Irix development, halting development of the MIPS processors (which were an order of magnitude faster than the Pentium of the day and 64-bit to boot).
    Very quickly, he was looking for a new job, SGI had penium machines that no-one wanted running NT (and Linux for the better informed customer), had restarted MIPS development and continued with Irix. However, by that time they had lost their lead.

    I hope he lost all his money in the dot com mania.

    1. Re:Where it all went to hell by BarfooTheSecond · · Score: 1

      >However, by that time they had lost their lead. And also 50 of their graphics hardware gurus who nearby all (after being fired) went to Nvidia, to work on Geforce...

  84. Wither XIO? by green+pizza · · Score: 2

    Hmm, PCI and no XIO? Would be nice to at least have a couple of each. Though 2 rack units isn't much room to work with. Wonder which version of IRIX it'll run? 6.5.13 + patches? 6.5.14?

  85. Bullshit alert by RelliK · · Score: 3, Insightful
    Oh man, technical errors abound. The P4 is actually faster than the Athlon for scientific computing, except for the compiler fact

    Pentium 4 has an absolutely pathetic floating point performance. Even Pentium 3 at 1000MHz outperforms Pentium 4 at 1500MHz on floating point. See here for example. Your claim that Pentium 4 can do 3 floating point operations per clock cycle is nothing more than pulling numbers out of your ass. (unless you can somehow substantiate your ridiculous claim).

    The P4 looses to the Athlon simply by the reason that the compilers can not use the vector instructions properly.

    AMD has never had code optimized for their CPUs. They have always fought an uphill battle. Yet they managed to beat the crap out of intel in absolute performance (price/performance they had for a long time). The whole compiler crap is a strawman's argument. AMD has 3Dnow instructions which nobody uses. If current software was optimized for AMD, P4 would look even more pathetic.

    Why anyone would be an Itanium instead of a dual P4/Athlon beats me.

    Uhhh, perhaps because there is no such thing as dual P4?

    It has less on-chip cache than a Celeron (128kb total)!! Sure it's packaged with a lot of sram, but still.

    I don't know how to break it to you, but 1) Celeron has exactly 128KB L2 cache, and 2) SRAM stands for Static RAM, which is used for cache (as opposed to Dynamic RAM, which is used for the main memory).

    --
    ___
    If you think big enough, you'll never have to do it.
    1. Re:Bullshit alert by Glonk · · Score: 1
      Pentium 4 has an absolutely pathetic floating point performance. Even Pentium 3 at 1000MHz outperforms Pentium 4 at 1500MHz on floating point.
      x87 FPU, sure. But that was only on for backwards compatability. Once SSE2 turned on (which, unlike the Athlon, is contained in a seperate unit on the processor and does not run in the FPU), the Pentium 4's FPU becomes a force to be reckoned with (surely you saw the SPEC scores?). As the original poster noted, the problem is most current compilers can't make proper SSE2 code if at all (it'd be like using a compiler optimized for 386 on your Pentium).

      If current software was optimized for AMD, P4 would look even more pathetic.
      Completely off-based. The Athlon 4 uses SSE, and rumor has it the upcoming chips will use SSE2. If people optimized for the new Athlons, the Pentium 3/4 also stand to gain a great deal.

      Uhhh, perhaps because there is no such thing as dual P4?
      They're called Intel Xeons. While it's true that it's not technically a "Pentium 4" by name, it is a P4.

    2. Re:Bullshit alert by Glonk · · Score: 1
      P4's FPU really stinks and now all of a sudden intel wants everybody to switch to SSE.
      So does AMD, apparently. Maybe that should tell you something? Like people are sick of the stack-based x87 FPU?

      ...high-precision floating point arithmetic is impossible with SSE, so x87 FPU is not going to go away. SSE2 can deal with two 64-bit numbers at a time, or 4 32-bit numbers. x87 deals with 1 80-bit number. The difference between 80-bit and 64-bit is not that great for the vast majority of applications, but the speed difference in having simultaneous data being worked on is MUCH more beneificial for way more apps than the little bit more precision.

      If software was optimized for P4 and Athlon, P4 would still be pathetic.
      Go to Tom's Hardware and read up on the FlasK fiasco, your claims are way off base.

      A Xeon is based on P3 core. Xeon is just an overpriced P3 with a larger L2 cache. My claim stands: there is no such thing as dual P4. If and when Xeons with P4 core are relased, then we'll talk.
      A Pentium III Xeon is based on the P3 core. A 'Xeon' (full name), is based on the Pentium 4. I think this little argument right here invalidates your whole damn argument. ;)

    3. Re:Bullshit alert by RelliK · · Score: 2

      x87 FPU, sure. But that was only on for backwards compatability. Once SSE2 turned on (which, unlike the Athlon, is contained in a seperate unit on the processor and does not run in the FPU), the Pentium 4's FPU becomes a force to be reckoned with (surely you saw the SPEC scores?)

      That's like saying Windows supports win32 binaries only for backwards compatibility. P4's FPU really stinks and now all of a sudden intel wants everybody to switch to SSE. Well, sure it will help (and, as you noted, it will also help Athlon which currently supports SSE, and future versions will support SSE2 as well), but high-precision floating point arithmetic is impossible with SSE, so x87 FPU is not going to go away. As for having x87 FPU and SSE in separate ALUs, well, that won't really matter if, as you claim, everybody will up and switch to SSE, will it? So my claim stands: P4's FPU is pathetic. If software was optimized for P4 and Athlon, P4 would still be pathetic. If future software uses SSE2, Athlon's performance would improve just as much as that of P4. And, furthermore: using the "compiler is not able to take advantage of some wiz-bang feature" is a strawman's argument. I repeat: AMD has always fought an uphill battle and has never had software optimized specifically for their CPUs, yet they still managed to beat the crap out of intel.

      They're called Intel Xeons. While it's true that it's not technically a "Pentium 4" by name, it is a P4

      A Xeon is based on P3 core. Xeon is just an overpriced P3 with a larger L2 cache. My claim stands: there is no such thing as dual P4. If and when Xeons with P4 core are relased, then we'll talk.

      --
      ___
      If you think big enough, you'll never have to do it.
  86. 15 years is pretty good by Proud+Geek · · Score: 2
    Just be glad that Apple isn't the company doing it. It took them 16 years to get a 32-bit OS after their computers started using 32-bit chips, and they already had 23 years of OS writing experience. Contrast that to Microsoft, which took 6 years with only 14 years of OS experience.

    Want innovation? Who do you think has it? Also, one company uses a UNIX based OS; the other at least tried to fix the mistakes of the past, although they did make more in the process of doing so.

    --

    Even Slashdot wants to hide some things

  87. MIPS is at a deadend by jc2436 · · Score: 2, Informative

    The MIPS architecture has run out of gas. All it has left in it is a few more speed bumps. SGI has no money to invest in something that would succeed the MIPS chip. Another note: Why did Compaq dump the Alpha chip ? Because it can't run lock-step, so it cannot be used for Compaq NonStop (formerly Tandem) processors. McKinley chips apparently can be run lock-step, so they will be used for the new NonStop systems.

    1. Re:MIPS is at a deadend by Midnight+Thunder · · Score: 1
      In an ideal world companies would release chip architecture plans to the world when they are killing off that division, or not selling it on. Of course we don't live in an ideal world, so no one can learn some of secrets of the chip that made a difference, except in the form of people.


      I would certainly appreciate open chip design plans.

      --
      Jumpstart the tartan drive.
    2. Re:MIPS is at a deadend by Anonymous Coward · · Score: 1, Informative

      Compaq was working on a version of Alpha which could be used in the Tandem lock-step configuration. I believe it was the EV7.

  88. Not exactly... (Re:It is worth noting...) by Troy+Baer · · Score: 2, Insightful

    The person who submitted the story (Troy Baer) is also the admin of the beast.

    To give credit where credit is due, the admin of that system is Doug Johnson, who has done an enormous amount of work to get this thing working. I'm just a user support guy who writes lots of documentation and happens to dabble in systems stuff like Maui and PVFS in my Copious Spare Time[tm].

    --Troy
    --
    "My life's work has been to prompt others... and be forgotten." --Cyrano de Bergerac
  89. Re:Why should I _care_ about having the same memor by RelliK · · Score: 2
    And I suppose the fact that width of the P4's memory bus interface (256 bits wide) is twice as wide as that of the Athlon (128 bits wide) has no influence on the memory bandwidth available?

    That's just plain not true. The P4 memory bus is 32 bit wide (16 bits for each RDRAM channel). Athlon memory bus is 64 bit wide (one DDR DRAM channel). You are probably thinking of the width of the L2 cache datapath, which is 256 bit for P3 and P4 (I don't recall what it is for Athlon).

    Or that I can't get an optimizing compiler for the Athlon that's comparable to the Intel Fortran compiler?

    I think I said it 3 times already, but I'll repeat it again for posterity. AMD has never had software optimized for their CPUs. They always fought an uphill battle. Yet they still managed to beat the shit out of intel.

    Look, I don't have the luxury of caring which processor is better in a "fair" test with the same memory, etc. -- my job is to figure out which system (processor/memory/IO/etc.) is fastest for our users' applications

    That's fine and that's a reasonable thing to do. But you cannot use your benchmarks to claim that P4 is faster than Athlon. You benchmarks show that P4 with RDRAM is faster than Athlon with DDR DRAM. This does not imply that P4 is faster than Athlon, which is what you were trying to claim. Also, you cannot make such a claim based on only one benchmark.
    Once again, I have no problem with people trying to find the best system for their needs. I have a problem with unsubstantiated claims.

    BTW, I suspect that for your particular application, memory bandwidth matters more than the CPU speed. So you may actually be benchmarking memory instead of CPU.

    --
    ___
    If you think big enough, you'll never have to do it.
  90. Re:Why should I _care_ about having the same memor by RelliK · · Score: 2

    oops I screed up html. pressed post too soon.

    --
    ___
    If you think big enough, you'll never have to do it.
  91. Available to anybody with an account there? by Troy+Baer · · Score: 1

    OSC had beta Itanium hardware last year as well. The difference is that this machine doesn't require signing a bunch of NDAs to use it.

    NCSA has a slightly larger system of (more or less) the same design on order from IBM, and I think the installation is ongoing right now.

    --Troy

    --
    "My life's work has been to prompt others... and be forgotten." --Cyrano de Bergerac
  92. Re:SGI Sucks ( read on ) by uweber · · Score: 2, Insightful

    Well sadly enough i can only agree with you.

    I actually do not have any hope left for SGI. Here is why:

    -Their workstations do not cover enough markets to sustain themseves let alone generate enough revenue to set them back ahead of the pack.

    -Their single image clusters are not cost effective enough for the scientific comunity and their use for Visualisation, where people would be willing to pay the insane price, isn't large enough.

    -Anybody can build UNIX servers.

    -Anybody can build MPI clusters.

    So I think they are reduced to niche markets which will not cover their cost in R&D which means they will have to license technology or buy components from others (CPUs, bussystems etc.). However they might run out of breath in the time they redesign their gear to take advantage of the new tech, and reduce the production cost.

    So my guess is they will be consumed slowly by their competitors - lets only hope some of their tech remains.

    --
    --Ulrich
    On no accounts allow a Vogon to read poetry at you
  93. Literally by SuperGrut · · Score: 1

    You may want to look up the word "Literally" in the dictionary.

    --
    The city is being overrun by a herd of Lucy Liu's.
  94. Mod Timothy redundant! It's been done folks.. by Myself · · Score: 2

    IBM has a story about this, the second cluster "to be installed this summer" already exists.
    Also read The story at NCSA if that's not enough for you.

  95. SGI Sucks ( read on ) by green+pizza · · Score: 5, Insightful

    Prepare to lose all karma...

    SGI sucks.

    Most of their hardware is great, as is most of their software. But their head is completely up their ass these days.

    Stagnant desktop machines. Impressive but overpriced big iron. OEM PCs. And a terrible logo. What went wrong? Where to begin??

    Once upon a time there was a company called Silicon Graphics. They got their start by making wickedly powerful terminals to provide 2D and 3D graphical front end to massive minicomputers and supercomputers. Mind you this was two years before Apple introduced the Macintosh and Xerox was still playing with the underpowered Star. Shortly there after they began selling a line of large rackmount, standalone graphical computers that used multiple large boards covered with cpus, fast ram, and other goodies to churn out decent primitive 3D in real time using the GL framework (later called IRISgl, which eventually became OpenGL). This was about the time your dad upgraded from a C64 to a IBM XT.

    Fast forward to 1995. You and I were probably playing with a Pentium 100 and looking forward to the rumored 3Dfx Voodoo card. In that same year, SGI upgraded their Onxy graphical supercomputers to InfinteReality graphics... providing performance on par with a Geforce 256. Except the IR could handle 64 MB of dedicated texture ram and 320 MB of frame buffer. Three IR "pipes" could be installed in a single system, and each pipe could even be broken down to multiple channels. IR allowed the world of graphical simulation to finally approach photorealistic quality with multiple projectors / monitors providing a wrap-around display (keep in mind that much of this was available on a limited scale 1991 with SGI's RealityEngine pipes). Both the Onyx and SGI's non graphical server, the Challenge, received a CPU upgrade. Up to 24 MIPS R10000 CPUs running at 195 MHz (each providing 390 MFLOPS + 390 MIPS) could be installed in the Onyx. The Challenge could take up to 36. SGI's flagship desktop machine, the Indigo2, received upgrades as well. The top of the line model had an R10K/195 CPU, up to 640 MB of interleaved ram, two channels of SCSI, and Maximum Impact graphics (4 MB of dedicated texture ram, 27 MB of framebuffer, and performance somewhere around that of the TNT2).

    SGI's machines continued to get better. Indigo2 was replaced with the Octane. Onyx and Challenge were replaced with the Onyx2 and Origin, and later with the Onyx 3000 and Origin 3000.

    Here we are in the middle of 2001. SiliconGraphics has become "sgi" with a NYSE stock price below $1. Their O2 desktop machine hasn't changed much since 1996, and aside from the new gfx card and faster CPUs, the Octane2 isn't a whole lot different than the original Octane in 1997. Onyx 3000 uses updated graphics based on the original IR from 1995. Perhaps the only noteworthy change has been the architecture of the new Onyx and Origin. Both can scale as a single machine to 512 CPUs with 1 Terabyte of RAM. Many of these massive machines can be clustered together for even more power... at an insane cost.

    The company that brought us 3D on the desktop has pretty much come to a halt. Their desktop machines haven't change much in almost 5 years. Their big iron is impressive, but expensive as all hell. And their PCs... where to begin on the PCs... They tried making what could have been the coolest pair of PCs of all time. But due to delays and driver issues, the machines ended up being overpriced, nonupgradable ho-hum boxes. Pretty soon they hit the other end of the spectrum with generic OEM PCs. And now this, the "SGI 750" Itanium. A box that is identical to that which is being sold by HP and Dell. The only thing SGI about it is the logo. We're not even dealing with the same SGI. This new "sgi" couldn't have possibly come from the same roots as the old, grand, SiliconGraphics.

    I can't help but wonder what the old SiliconGraphics would be doing today. Like another poster pointed out, the Octane would probably have an ever faster architecture, better graphics, and probably 4x the CPU power. This new linux cluster would probably be based on much better machines and using something better than Myrinet (which is limited by the 66MHz/64bit PCI bus the card sits in). The old SGI would have made a complete fire breather, not some OEM stack that anyone could build themselves. The old SGI would have the cube logo *and* rightfully wear it.

    When I look inside my old, used Indigo2 from 1995 what do I see? I see its 750 watt power supply. I see not a graphics card, but *three* massive cards working together and connected to the power supply via a thick jumper cable. I see engineering at its best. I see a product that pushed the limits of silicon and interconnects. I see something that was worth its $50,000 pricetag. I see something that was indeed an order of magnitude more powerful than anything else on the desktop.

    When I look at the current SGI desktop machines, I see something I can buy for less at Best Buy.

    I recently saw a demonstration of the Onyx 3000. One of the demos was a visualization app used by an automobile maker. The app showed a few different cars in full detail across three screens (each 1280x1024) in a panoramic configuration at a sustained, locked 75 Hz + 75 FPS. The cars had complete reflection features that interacted right down to the metallic flecks in the paint. The detail was right down to the 3D textures that made up the subtle surface of the dash plastics and the seat leather. It was truly photorealistic. I've seen the Geforce 3 demos, they were nowhere near as impressive as the car demo.

    Another demonstration showed the Onyx's power at loading textures. The machine they had was connected to several RAIDs containing over 500 GB of satellite and aerial photos. On the same three screens and in the same 75 Hz + 75 FPS were able to zoom down to a national park, pan across to another state, and zoom back out to planet Earth floating in space. All in real time. The RAIDs were clattering so loud I could hardly hear the man giving the demonstration. The Onyx never missed a beat.

    If the old SGI was here today, we'd have that kind of power on the desktop. And it would cost $50,000 and consume 750 watts. Not $500,000 and 9,000 watts.

    And we wouldn't have a Myrinet connected stack of Itanium PCs. We'd have something a whole hellofa lot better.

    [end rant]

  96. it's worse than that... by green+pizza · · Score: 4, Informative

    You're thinking of "Rocket" Rick Belluzzo, former CEO of SGI. He was responsible for putting MIPS/IRIX on hold, courting the Wintel crowd, and the "sgi" logo. He successfully put SGI in a steep nosedive they'll probably never recover from.

    Where is Mr. Belluzzo today?

    Hold on to your hat...

    http://www.microsoft.com/presspass/exec/belluzzo/d efault.asp

  97. 15 years for MicroSoft to 64 bits? by peter303 · · Score: 4, Insightful

    It took MS 15 years to have a full 32-bit OS
    after those chips came out. Hope they are faster
    this time. 32 bit NT on an Itanium would be a waste.

    SGI and SUN have had full 64 bit OS for 7 & 5
    years. Yes, there are bugs to shake out in the
    beginning. OF course Bill & Steve will announce
    they are "just about to ship" for years until
    they do.

    1. Re:15 years for MicroSoft to 64 bits? by Espen+Skoglund · · Score: 1

      Eh... I don't know about the SGIs, but the HP Itanium boxes ship with Linux, HP-UX and Windows. Win XP has, as far as I know, been available for the IA-64 for ages. I can't comment on how stable or efficiently it runs, though, but I guess that Linux for the Itanium still has a few quirks in it too.

  98. NSCA had the first Itanium cluster a year ago by Anonymous Coward · · Score: 0

    Sorry, folks, NCSA had an Itanium cluster last year. Saw a demo of it at supercomputing 2000 in November.

  99. Re:11.8 actually. by Anonymous Coward · · Score: 0

    best to test these things yourself. Apple's data is what's called "marketing", it's when some person or company tries persuade you to buy their product or service. A better place to find out about G4 GIGAFLOPS is at www.barefeats.com

  100. Re:sigh by Anonymous Coward · · Score: 0

    The 330 is an OEM board based around the VIA Apollo Pro133A. The 550 is an OEM i840 board. Both have been slightly customized with the SGI logo and whatnot. Aside from some slight diferences with the monitor port configurations and ram used, the graphics boards used in SGI's x86 PCs are pretty much stock nVidia reference designs. If you read the developer docs, the "tweaks" are nothing more than a few performance features enabled by default when used with an SGI PC (as SGI has been able to test the combination).

  101. Why should I _care_ about having the same memory? by Troy+Baer · · Score: 1

    OK, I was not aware of that, but tell me where I can actually buy them. I looked at Dell's site before I posted. If Dell doesn't have them, nobody else does.

    Had you looked a little further at the URL I posted, it had a link to the company we bought both systems from, Ace Computers. Dell is dragging their feet on dual P4s for some reason; we tried to buy a dual P4 system from them, but they wouldn't ship us one until October and couldn't explain the delay. Smaller vendors like Ace were much more helpful.

    You pointed to the single (type of) benchmark that makes P4 look good, though not because of P4's virtues. P4 has two channels of RDRAM giving it 1600 * 2 = 3200 MB/s of memory bandwidth. The Athlon machine in the benchmark you pointed to had a single channel of DDR DRAM, giving it 2128MB/s of memory bandwidth. Fluid Dynamic simulation is one of the _few_ things that can really take advantage of greater memory bandwidth. Therefore P4 wins this round by riding on Rambus's coat-tails.

    And I suppose the fact that width of the P4's memory bus interface (256 bits wide) is twice as wide as that of the Athlon (128 bits wide) has no influence on the memory bandwidth available? Or that I can't get an optimizing compiler for the Athlon that's comparable to the Intel Fortran compiler?

    Prove me wrong. Point me to the same benchmark where both P4 and Athlon have the same type of memory (both support SDRAM now, you know).

    Look, I don't have the luxury of caring which processor is better in a "fair" test with the same memory, etc. -- my job is to figure out which system (processor/memory/IO/etc.) is fastest for our users' applications, which happen to look a lot like NPB. We got one each of the "current best of breed" in dual P4s and dual Athlons, ran the same codes on each, and assembled the results. For single-processor stuff, the P4 wins, sometimes by a lot. For I/O bound stuff, the Athlon wins right now, mainly because Intel couldn't design a decent PCI bridge to save their lives. (This is a matter of the sorta-broken [Athlon] beating the completely-broken [P4]; I fully expect a dual P4 board with a ServerWorks chipset to hand the Athlon its butt.)

    I'll be the first to admit that my benchmarks comparison page needs work -- I haven't had time to run Gaussian or GAMESS (computational chemistry codes) yet, or BLAST (a bioinformatics code) for that matter. I know of a similar study that had completely different results, I think largely because they couldn't use the Intel compilers due to some language extensions used by their codes (eg. Cray-style pointers) that the Intel compilers don't support.

    --Troy
    --
    "My life's work has been to prompt others... and be forgotten." --Cyrano de Bergerac
  102. Imbecile! by Anonymous Coward · · Score: 0

    What's the point of running a P4 or and Athlon with SDRAM when you can use DDR or RDRAM? Are you stupid or what?

  103. Competitors Know the Itanium Sucks by Orbitalb · · Score: 2, Funny
    Apple knows that at present, the Itanium (which is based on a RISC-like design scheme called VLIW, or EPIC) is stuck at 800MHz even in its prelease form -- slower even than the shipping PPC 7450 (G4) which is running at 867MHz and expected to cross the GHz barrier by year's end. The Itanium, while offering many important features lacking from previous Intel chips, has also been criticized for its poor performance - particularly when running existing x86 applications and operating systems. Apple's next-generation processor is the PPC 7500 or G5, which is also 64-bit and employs the first complete architectural revamp since the first PowerMacs shipped in the mid-90's...but the G5, unlike Itanium, can process existing 32-bit PowerPC instructions at full performance and is expected to continue to scale upward in clock rates; the initial G5s slated to ship in early or mid-2002 are currently projected to run at 1.2GHz with per-clock performance targeted for roughly 65% beyond Itanium.

    Also of note, IBM has stated that current plans for the G5's SIMD/AltiVec engine specify a 256-bit system, rather than a 128-bit one in the G4. This will be one kickass CPU.

  104. Re:sigh by Anonymous Coward · · Score: 0
    Hell, none of their Pentium or Itanium machines are even custom designed. They're all OEM

    Actually that's not quite correct. Both the sgi 330 and 550 x86 workstations are custom designs with among other things nVidia Quatro Pro graphics cards that have been tweaked and improved by sgi. The 330 and 550 are actually some of the nicest x86 boxes I've seen.
    Although you're right that the sgi 750 is the same ref. design that HP, Dell and everybody else sells.

  105. whats so special? by gavlil · · Score: 0, Offtopic

    huh my pocket palm has a better spec and runs a custom kernel BSD with 128 bits!

    --

    Do Unto Others As You Would Have Others Do Unto You - ONLY HARDER!
  106. Re:The *first* Itanium cluster?!? by Troy+Baer · · Score: 1

    If CSAR had it before June of this year, it was prerelease, NDA-ridden, not-ready-for-prime-time hardware that was not supposed to be available for public use. (OSC has had a few Itanium boxes in house since September as well, but we didn't publicly advertise that fact since they were test systems.) OSC's machine is the first cluster of Itaniums that you don't have to sign a ton of NDAs with Intel to use.

    I did read the PDF in that URL, BTW, and I'd love to know how you got SGI and Intel to extend the NDAs to CSAR's entire userbase -- we asked SGI about this at one point, and they just about had kittens.

    --Troy
    --
    "My life's work has been to prompt others... and be forgotten." --Cyrano de Bergerac
  107. Why Itanium? by Midnight+Thunder · · Score: 1

    The only thing that gets me is why they are using the Itanium instead of the excellent MIPS chip? Given that Linux has been compiled for so many different archectures, I am sure the MIPS chip is on the list. Maybe this is the case of SGI doing a Motorola, ie using a competitors chip, instead of realising that they too have a CPU division.

    Oh well, I'll go back to smoking my joint, at least that way I can start seeing stuff from the marketing perspective ;-)

    --
    Jumpstart the tartan drive.
  108. Itanium? by Bobo+the+Space+Chimp · · Score: 3, Funny

    Dr. Adams: "Welcome to the Silicon Graphics Tantalus V Itanium research campus."

    Stan: "Why did you call it 'Itanium'?"

    Dr. Adams: "I have a rare marketing disease that prevents me from pronouncing the first 'T' in 'itanium.'"

    --
    I am for the complete Trantorization of Earth.
  109. SGI are *really* nasty employers by The_Jazzman · · Score: 0, Troll

    My friend works for SGI, and they are a company you definatley don't want to work for.

    He says that they're working hours are so long that their impossible - you have to work from 6 in the morning right through to 6 at night, sometimes longer, or else your pay will be docked.

    They also have really nasty contracts where the small print states you're notice period is five years !

    This is why so many stay and work for them. They have one nice office they show people when they start working there, but pretty soon (like, 1 week) later they ship you off to mexico where you have to work on Atari ST's (not even SGI's) and lightbulbs are not alowed because their to expensive.

    My friend has even been whipped to make him code faster and has the scars to proove it.

    So, Don't ever work at SGI - it'll be five years of hell, and my friend is still waiting to be payed.

    1. Re:SGI are *really* nasty employers by Anonymous Coward · · Score: 0

      Your friend there might want to talk to a lawyer. Just an idea.

  110. fascinating by Anonymous Coward · · Score: 2, Funny
    Here's a snippet from his little bio thing:

    Belluzzo was formerly chief executive officer of Silicon Graphics Inc. (SGI), where he was responsible for defining and executing a return to growth and profitability for the company

  111. The *first* Itanium cluster?!? by Anonymous Coward · · Score: 0
    Interesting ... given that we've been running a SGI-supplied Itanium-based cluster since last September.

    See page 5 of the CSAR Focus newsletter (PDF).

  112. 11.8 actually. by hotsauce · · Score: 1

    The dual processor G4e manages a sustained 11.8 gigaflops. The peak is much higher. More info here.

    The PowerPC G4 is a Very Nice Processor. It's too bad enough people don't recognise that. Maybe now with OS X and G4 processor boards for other architectures, development will start.

  113. Re:SGI Sucks ( read on ) by Anonymous Coward · · Score: 2, Interesting
    Well said - I agree with most of what you've written, and think that you've put it all very well.

    However, I think the reason that SGI is not producing a desktop version of Onyx 3000 is obvious - SGI tried to do battle with Sun, etc. and failed. SGI tried to do battle with Dell, etc. and failed. They're not about to do the same thing to NVidia...

    The original SGI targetted what was at the time a niche market - 3D graphics. It looks like the new SGI will also retreat into niche markets - very high-end graphics and compute servers.

    You're also right about the Itanium server - there's nothing very interesting about it. I believe this machine is only intended as an interim solution to allow developers onto the platform until SN-IA is available (whenever that might be).

    Then we'll see something a little more impressive than OEM Itanium boxes with low-bandwidth Myrinet interconnects!

  114. blah blah by jaxon6 · · Score: 1

    blah blah, where are the benchmarks. some definitive examples of ia64 speeds, etc..

    --
    Do you see the sig? Do you have it in your sights? Why yes, Miss Moneypenny...
  115. IA64 & Myrinet by ShavenGoat · · Score: 2, Informative

    So you guys know, Myrinet is a 2Gb peak interconnect with a 7us minimum latency. FULL CROSSBAR SWITCH :-) Fiber or serial.

    A interesting fact is that up until a few days ago, Myrinet only supported 1 GIG systems. I ran into this while setting up the University of Nevada beowulf named cortex.

    I must admit, IA64 with Myrinet 2000 is gonna kick some serious computational ass.

    The article says that Myrinet will run MPI, but it will also run PVM and TCP/IP stuff too.

    Check it out at their site

  116. sigh by Anonymous Coward · · Score: 0

    I hear you. The linux stuff is cool and all, but where is my "Octane3" with quad MIPS R14000 CPUs? SGI sure ain't the company it once was. Hell, none of their Pentium or Itanium machines are even custom designed. They're all OEM. Even the "Silicon Graphics 750" is the OEM Intel Itanium box with an ATI video card. Are they trying to kill themselves?