Slashdot Mirror


Sun Super Computer May Hit 2 Petaflops

Fletcher writes to tell us that Sun Microsystems has revealed their "Constellation System", a new supercomputing platform that the company hopes will put them back in the running for top dog in the supercomputer race. "The linchpin in the system is the switch, the piece of hardware that conducts traffic between the servers, memory and data storage. Code-named Magnum, the switch comes with 3,456 ports, a larger-than-normal number that frees up data pathways inside these powerful computers. 'We are looking at a factor-of-three improvement over the current best system at an equal number of nodes," said Andy Bechtolsheim, chief architect and senior vice president of the systems group at Sun. "We have been somewhat absent in the supercomputer market in the last few years.'"

36 of 134 comments (clear)

  1. Good to see! by Anonymous Coward · · Score: 2, Interesting

    Great job Sun!

    Once again you have shown us the power of talent, determination, and skill.

    Rock Rock On!

  2. Obligatory by CompMD · · Score: 2, Funny

    ...but will it run linux?

    1. Re:Obligatory by thommym · · Score: 3, Funny

      ...but will it run linux? Yes, of course. Although not as speedy as Solaris...
      --
      Don't feed the penguins
    2. Re:Obligatory by joe_bruin · · Score: 5, Informative

      ...but will it run linux?

      The first one of these being built, at the Texas Advanced Computing Center, is in fact running Linux, not Solaris (See this Register article). Sun will support both.

  3. Zoolander by Anonymous Coward · · Score: 4, Funny

    I dearly, dearly hope that the followups to Magnum are codenamed LaTigra and Blue Steel :)

  4. "We have been somewhat absent..." by PCM2 · · Score: 2, Interesting

    So Bechtolsheim says Sun has been "somewhat absent" from the supercomputer market in the last few years. OK, I'll bite. Exactly what markets has Sun been going gangbusters in since about 1999?

    Still, kudos to Sun, for real. Investors may get mad that Sun is full of terrific technology and solid R&D but can't seem to build the business model that will let Sun capitalize on it all. But from my perspective... God, that sounds almost refreshing, doesn't it?

    --
    Breakfast served all day!
    1. Re:"We have been somewhat absent..." by iggymanz · · Score: 2, Informative

      Exactly what markets has Sun been going gangbusters in since about 1999?

      why the Java market, it's everywhere. of course, Sun hasn't made a thin dime off of it, but the market sure went for it.

    2. Re:"We have been somewhat absent..." by PCM2 · · Score: 4, Interesting

      I think when Sun talks about supercomputing it's really talking about HPC/grid-type systems.

      FWIW, that's where Sun sees its future. Which makes sense. There's no point trying to compete with Linux for low-end applications (and by "low-end" I mean everything from desktops to simple Web-app servers). Sun has always been good at crafting products for that top 2% of customers who really, really need that high-availability or high-performance component that isn't going to make a difference for the other 98%. And Sun can charge for them.

      --
      Breakfast served all day!
    3. Re:"We have been somewhat absent..." by TheRaven64 · · Score: 3, Interesting

      Exactly what markets has Sun been going gangbusters in since about 1999? Web app servers. For a lot of web app type workloads, the T1 blows everything away in terms of power per watt and power per dollar.
      --
      I am TheRaven on Soylent News
    4. Re:"We have been somewhat absent..." by Anonymous+McCartneyf · · Score: 3, Funny

      Sun is responsible for OpenOffice. I expect OpenOffice to become the premiere office suite on PCs of all sorts any day now.

      --
      There is a fine line between recklessness and courage... -- Paul McCartney
    5. Re:"We have been somewhat absent..." by billcopc · · Score: 3, Interesting

      I second that. Sun's product line is very aggressive for SMB gear. They don't stray much into the supercomputing arena because it's a whole other ballgame, but for high-end "common" servers and workstations, they offer some pretty serious bang for the buck. In that light, they compete in the same segments as Dell's Poweredge line, or HP's Proliant. Medium iron as opposed to big iron. Server gear for the rest of us who aren't on the Fortune 50 :)

      I fell in love with Sun when I first laid my hands on a Sun Fire V40z, 8 cores of AMD goodness in a small box, but half the price of a competing Dell system.

      --
      -Billco, Fnarg.com
    6. Re:"We have been somewhat absent..." by PCM2 · · Score: 2, Interesting

      Except that, in the long run, the platform of choice for the HPC/high-availability/high-high-peformance market is turning out to be Linux, thanks to IBM and it's HPC business.

      Ha, well, yes, there is that. Only I wouldn't lie it completely at IBM's feet. After all, IBM is pretty much the only mainframe vendor still around. They have a vested interest in selling that kind of supercomputer, even though they've obviously seen the writing on the wall for their mainframe business.

      Outside the commercial sector, though, I think a lot of the HPC stuff is happening on Linux because a lot of the breakthroughs in this area are coming from universities and government-sponsored research, rather than private companies. A closed, proprietary OS isn't going to cut it here. (Interesting how Sun recently open sourced Solaris, isn't it?)

      --
      Breakfast served all day!
    7. Re:"We have been somewhat absent..." by afidel · · Score: 3, Informative

      Yep. We are implementing a datawarehouse app and after seeing the poor performance we are getting out of Oracle on Windows for our ERP product we are researching different solutions. Right now Sun is in the lead. They are about 20% less than our Windows solution for the overall solution and we expect it to be more scalable as we will only be partially populating the DB server out of the gate. They claim they can do with 4 middle tier boxes what our Windows solution provider has speced 13 Windows boxes for. I can't wait to see the results of the bakeoff. I'm primarily a Windows/Citrix guy but I've admin'd Solaris and Linux in past and I won't ming keeping my Solaris skills up to date if it's the best solution =)

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  5. IBM Blue Gene/P by xzvf · · Score: 3, Informative

    IBM Blue Gene/P update slated to run at 3 petaflops.

    1. Re:IBM Blue Gene/P by flaming-opus · · Score: 2, Interesting

      yeah, there are a lot of systems out there with theoretical limits of several petaflops. Cray, IBM, NEC, even SGI have systems that could theoretically hit several PFlops if you had enough money.

      I'm waiting to see a customer actually purchase one, and for it to be installed, and actually running customer code, before I really care.

    2. Re:IBM Blue Gene/P by Life2Short · · Score: 4, Funny

      Meh... In 15 years the thing will wind up as baby furniture with kid puke on it anyway.

  6. And yet... by Anonymous Coward · · Score: 5, Funny

    Java apps still take 3 minutes to start up on it.

    1. Re:And yet... by glwtta · · Score: 4, Funny

      1996 has been looking for you - they really want their stupid jokes back.

      --
      sic transit gloria mundi
  7. Couldn't resist... by Bobfrankly1 · · Score: 5, Funny

    Well if they just move the petaflops out of the way before it gets there, they won't have to worry about it hitting them.
    -
    In Soviet Russia, TFA reads you!

  8. Constellation class system by CompMD · · Score: 4, Funny

    Please tell me the first production system will be named "Enterprise." There have to be enough people that will work on it that will be proponents of this.

  9. Great units by Life700MB · · Score: 5, Funny

    We are looking at a factor-of-three improvement over the current best system at an equal number of nodes

    Whoa, slowdown boy, just tell us how many laptop-miles of power this machine has!


    --
    Great hosting 200GB Storage, 2_TB_ bandwidth, php, mysql, ssh, $7.95
  10. Whooshhh... by Idbar · · Score: 5, Funny

    You missed the official rules.

  11. Slow improvements finally paying off by drspliff · · Score: 2, Interesting

    With Sun's slow improvements in the multi-core arena (the T1 & T2 systems) and their low power requirements their probably in for a good run at the top 10-20 spots on the Top 500 list.

    Consider 500 top-end T2 systems hooked up to some very fast switching hardware and you're performance per wattage ratio are going to be very persuasive to those running big data centers, although with the T1 systems the only thing which stopped us adopting them was the shared FPU (telephony codec transcoding sucks on them).

    Could we see suns equivalent of IBM's BlueGene system appearing next year? I definitely think so :)

    1. Re:Slow improvements finally paying off by Kristoph · · Score: 4, Informative

      If you RTFA you will note that, actually, this particular system is built around the Barcelona architecture (from AMD). It remains to be seen if T2 and later on Rock will really be competative against AMD and Intel.

      ]{

  12. Throughput: the race is on by athloi · · Score: 4, Insightful

    Back in the 1980s, what distinguished the Amiga (and later Steve Job's excellent NeXT) was the ability to split data among multiple co-processors and pipe it quickly around the motherboard, eliminating bottleneck and liberating the processor. Now in the PC world we're finally seeing this architecture recognized as new Intel chips tout their front-side bus and cache more than sheer increase in speed.

    This SUN machine is a bigger-scale example of the same. It uses AMD Barcelona chips, and derives its power from internally routing data more efficiently than (most of) its competitors. It seems that in the Moore's-law endgame, what makes the chip a star performer is the surrounding components and their engineering for efficiency.

    This will be better for geeks, as it makes the skill of efficient design come back into play after years of "bigger is better." Now if it just extends to software as well, we'll all benefit...

    1. Re:Throughput: the race is on by kpharmer · · Score: 3, Informative

      > in the PC world we're finally seeing this architecture recognized as new Intel chips tout their front-side bus and cache
      > more than sheer increase in speed.
      > This SUN machine is a bigger-scale example of the same.

      No, not really - parallelism has of course been around forever. But its application for high performance computing has been constantly demonstrated on large servers for the past 15 years. This is back when parallelism on intel hardware might at most have meant two cpus. And that was rare.

      MPP (Massively Parallel Processors) systems like Teradata and IBM's SP2 (aka DeepBlue - that defeated Kasparov at chess) successfully demonstrated great performance for the dollar back around 1994-1995 or so. These were originally designed around data mining and math computations - but found most of their sales in data warehousing. Meanwhile, CRAY was complaining that not all problems were good candidates for this kind of more cost-efficient hardware.

      By 1998 you could put db2 or informix on a hundred-node SP2, each node consisting of an eight-way SMP, each with its own dedicated storage. Queries on that old system were lightning fast compared to most other options. I worked directly on SP2s and worked with a team that has a 128-node one. Oracle & Sun eventually ecliped these solutions with massive SMPs. But much of that was more due to Informix's financial issues than technical merit - since Informix and DB2 (and of course Teradata) on MPPSs easily out-scaled oracle on SMPs. The SMPs were easier to adapt to application design changes, but the MPPs were easier to grow indefinitely large.

      These newer solutions are just more of the same thing - you've still got the same challenges in:
          - tons of OS and application images that must be consistent
          - node communication bandwidth (major selling feature of all these solutions are proprietary internal networks)
          - failover (how do nodes failover, especially if they have any dedicated resources)
          - scheduling (how do jobs get assigned to nodes)

      So, they're much bigger and faster than 10-15 years ago - and I'm sure there's got to be some cool innovation going on under the hood. But nothing looks fundamentally different from then. And nothing here has been inherited from the pc world.

    2. Re:Throughput: the race is on by fm6 · · Score: 2, Informative

      It uses AMD Barcelona chips...
      AMD chips are an option. Blades come in AMD, SPARC, and Intel flavors.

  13. 3,456 by Nom+du+Keyboard · · Score: 4, Funny

    3,456 ports. Now there's a non-computer number if I've ever seen one. It looks like someone asked, "And just how many ports do we need to be competitive," and someone else just started hitting the number keys in sequence across the top of the keyboard, starting at "3", until either Marketing was happy, or the engineer in charge fainted.

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
    1. Re:3,456 by jd · · Score: 2, Funny

      Or they needed 640 ports for internal connectivity (or 640 ports, channel-bonded, for upstream connectivity). Personally, I think it's the manager's password, though.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    2. Re:3,456 by flaming-opus · · Score: 5, Informative

      Actually 3456 is 12 X 12 X 12 X 2. It's not actually a 3456 port router, it's a fat tree of 24-port router modules. Each rank 1 & 2 module has 12 ports down and 12 ports up. The rank 3 modules have 12 ports down, and 12 sidelink ports to one another. Thus you end up with a 3456 port, rank 3.5 fat tree all in one box.

    3. Re:3,456 by flaming-opus · · Score: 4, Informative

      I'll add:
      This is not an unusual arrangement for existing infinaband networks. The distinction is that they have all of these 864 switch modules in a cabinet, and the wiring is probably traces on a backplane, rather than flexible cables. This improves the reliability, reduces the cost, and makes it a whole lot easier to install. That may sound silly, but you're talking about 10,000 cables, each with endpoint connectors on each end. Even buying in bulk, that's a lot of money in cables.

  14. Do you feel lucky? by TheWoozle · · Score: 5, Funny

    The linchpin in the system is the switch, the piece of hardware that conducts traffic between the servers, memory and data storage. Code-named Magnum, the switch... I know what you're thinking. Did I forward 65,535 packets or 65,536 packets? Well, to tell you the truth, in all the excitement I kinda lost track myself. But being as this is a Sun Microsystems Magnum, the most powerful switch in the world, and would blow your IP clean off, you've got to ask yourself a question: Do I feel lucky? Well, do ya, punk? Go ahead. Make my day.
    --
    Insisting on "correct" English is like saying that there is only one, definitive recipe for chili.
  15. Sun Super Computer by phalse+phace · · Score: 4, Funny

    Imagine a beowulf cluster of these!

    There ya go

  16. Blue Gene Vs. Constellation by flaming-opus · · Score: 4, Interesting

    Bechtolsheim compares 131,000 cores of Blue Gene/L to 131,000 cores of constellation, with the sun system offering 3 times the performance.

    This is hardly a fair comparison. IBM installed a 131,000 core BG/L 2 years ago, and it's been running customer code for more than a year. The sun system won't be built until late this year, and probably won't be running real customer code until this time next year. Furthermore, the BG/L machine is designed with a low-power node, assuming that a larger number of cores would be used. In IBM's older BG/L design, there are 2048 cores in a rack. Sun is packing 768 opteron cores in a rack. So a per square-meter measure gives IBM's 3 year old design only a 20% disadvantage to Sun's not-yet-released machine.

    All of that is moot, of course, as theoretical peak performance is a crappy way to measure supercomputer performance anyway. The opteron is a great processor, and infinaband is a decent, though not remarkable interconnect. I'd be a little concerned, were I to buy the sun solution, that the infinaband bandwidth is being shared by 16 processor cores. That's quite a bit less interconnect performance per processor than IBM's Blue Gene, power5, Cray's XT, or SGI's altix. There's certainly plenty of memory on each of these constellation blades. That said, there are a list of applications that perform very well on Blue Gene, and Sun has a lot of ground to make up in terms of OS, software, and establishing a relationship with the HPC customers.

    It's nice to have more options, however.

  17. Overkill? by Red+Mage+13 · · Score: 2, Funny

    640k should be enough.

  18. Code named Magnum? by sonoronos · · Score: 3, Funny

    Have you ever wondered if there was more to life, other than being really, really, ridiculously high-performance?