Slashdot Mirror


Sun To Release 8-Core Niagara 2 Processor

An anonymous reader writes "Sun Microsystems is set to announce its eight-core Niagara 2 processor next week. Each core supports eight threads, so the chip handles 64 simultaneous threads, making it the centerpiece of Sun's "Throughput Computing" effort. Along with having more cores than the quads from Intel and AMD, the Niagara 2 have dual, on-chip 10G Ethernet ports with cryptographic capability. Sun doesn't get much processor press, because the chips are used only in its own CoolThreads servers, but Niagara 2 will probably be the fastest processor out there when it's released, other than perhaps the also little-known 4-GHz IBM Power 6."

214 comments

  1. Trust me... by jd · · Score: 4, Insightful

    ...If they put THESE under the GPL, along with the T1, they'd be getting more press than they could imagine. If they used these a bit more aggressively - such as using them as a graphics processor on a PC - they'd be getting some amazing press. If they keep them locked in a server closet, it's only then that nobody will care.

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    1. Re:Trust me... by LarsWestergren · · Score: 4, Informative

      ...If they put THESE under the GPL, along with the T1, they'd be getting more press than they could imagine.

      http://www.opensparc.net/

      They are openly discussing making the Niagara 2 available as open source as well, but note that there are some roadblocks such as the US government's restrictions on crypto technology.

      --

      Being bitter is drinking poison and hoping someone else will die

    2. Re:Trust me... by BuR4N · · Score: 1

      It would be interesting to know if there are any actual hardware out there generated from Opensparc.

      --
      http://www.intellipool.se/ - Intellipool Network Monitor
    3. Re:Trust me... by wild_berry · · Score: 2, Informative

      using them as a graphics processor on a PC

      Good enough for raster graphics, not so good for vector graphics or 3D due to there being only 8 FPUs on the die, with only twice the floating point throughput of the terrible-at-floating-point T1. Unless you do swap some of the throughput for soft-floating-point.

    4. Re:Trust me... by TheRaven64 · · Score: 5, Informative

      If they used these a bit more aggressively - such as using them as a graphics processor on a PC - they'd be getting some amazing press A modern GPU is fairly similar in design to the T2, but there are a few key differences:
      • The T2 is mainly focussed on integer ops with only one floating point pipeline per core. A GPU typically is close to 100% floating point pipelines, and doesn't bother with integer arithmetic.
      • The T2 uses multiple contexts to hide memory latency, mostly caused by incorrectly predicted branches. A GPU typically doesn't bother much with branch prediction, since it runs code that is very light on conditional branches (on average, branches happen every 7 ops in general purpose code. In GPU code, they happen every few hundred).
      • GPUs usually focus on 4-way vector instructions, since most of their data is of this form (RGBA colours, XYZW vertexes). The T2 only has scalar instructions.
      I posted in my journal recently suggesting that it would be easier to produce a modern GPU than an older card, since modern GPUs have much less application-specific logic and do more in software, relying on just having lots of cores / pipelines to give speed.
      --
      I am TheRaven on Soylent News
    5. Re:Trust me... by rbanffy · · Score: 1

      I would love to see Sun going back to the "distinctive look" "macintosh-beautiful" workstation business. Their current lineup looks like dull (if serious) PCs. I miss the Frog Design look.

      Of course, I know that generic x86 boxes (running Linux or, gulp, NT) killed the workstation market and that it would be hard to justify any development in this direction.

      It seems the Niagara 2 is more fit for desktop workloads than the first one. Maybe they can do it again. I would love to see.

    6. Re:Trust me... by MoxFulder · · Score: 2, Interesting

      ...If they put THESE under the GPL, along with the T1, they'd be getting more press than they could imagine. If they used these a bit more aggressively - such as using them as a graphics processor on a PC - they'd be getting some amazing press. If they keep them locked in a server closet, it's only then that nobody will care. I for one wish that they'd slap the UltraSPARC Niagara and its chipset on a standard ATX motherboard with PCI and PCI-Express support.

      There'd be a Linux port in practically no time, and I know a bunch of us Linux power users would adopt that setup in no time... cheap commodity hardware coupled with a high-throughput RISC processor would be great for desktop multitasking, software development, file serving, etc.
    7. Re:Trust me... by afidel · · Score: 2, Informative

      There are tons of research chips made from the OpenSparc designs and Simply RISC claims to have an embedded processor made from a single core T1 design.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    8. Re:Trust me... by gig · · Score: 1

      If their boxes were distinctive people would link to their porno shots and blog about how drool worthy Sun's stuff is. Generate buzz. If the guts are unique then you can explain that with unique external features. Every bit of investment in this pays off as free advertising. Watch the buzz on the next iMac.

      The white box PC look is like a disguise.

    9. Re:Trust me... by morgan_greywolf · · Score: 2, Interesting

      I posted in my journal recently suggesting that it would be easier to produce a modern GPU than an older card, since modern GPUs have much less application-specific logic and do more in software, relying on just having lots of cores / pipelines to give speed.


      Which makes me wonder ... if most of the work of a video card these days is done in software, and one of the biggest complaints about Linux is the lack of good free/open source drivers for high-end NVIDIA/ATI graphics cards, then why, exactly aren't FOSS developers working on one? Get some chip fab to produce some cards based on an open GPU design, write our own drivers and -- bam -- the LinuXtreme3D Graphics Accelerator! Screw NVIDIA and ATI.

    10. Re:Trust me... by Bin+Naden · · Score: 2

      Which makes me wonder ... if most of the work of a video card these days is done in software, and one of the biggest complaints about Linux is the lack of good free/open source drivers for high-end NVIDIA/ATI graphics cards, then why, exactly aren't FOSS developers working on one? Get some chip fab to produce some cards based on an open GPU design, write our own drivers and -- bam -- the LinuXtreme3D Graphics Accelerator! Screw NVIDIA and ATI.

      Because the sunk cost to get involved in graphics cards is huge. You need a few billion dollars and like 10 years of research and development in order to simply be competitive with NVIDIA and ATI. Wouldn't it be much more convenient instead to outright buy NVIDIA or ATI and open it up? Maybe if us geeks each buy shares in the company.
      --
      There should be a "-1:Groupthink"
    11. Re:Trust me... by rbanffy · · Score: 1

      Yet, the guts of a current Mac are more or less exactly like the guts of my HP notebook, which is not ugly, but, certainly, is far less pretty than any MacBook.

    12. Re:Trust me... by CryoPenguin · · Score: 3, Informative

      You're looking for the Open Graphics Project. But hardware is hard to design and expensive to fab, you're not going to get an Xtreme3D Graphics Accelerator competitive with the latest from NVIDIA or ATI.

    13. Re:Trust me... by allenw · · Score: 2, Informative

      Linux is already running and certified for Niagara.

    14. Re:Trust me... by mhall119 · · Score: 2, Informative

      I believe Motorola makes Sparc-compatible processors, not sure if they're based on Opensparc of if they licensed it from Sun.

      --
      http://www.mhall119.com
    15. Re:Trust me... by MoxFulder · · Score: 1

      Indeed! I'm saying it'd be nice if I could take that chip and slap it in my commodity PC, using my existing drives, PSU, wireless card, etc. Because right now, you can only get a Niagara system by buying it complete from Sun, which means paying a lotta markup on all the other components.

      It'd be great if you could buy drop-in ATX boards with SPARC/MIPS/PowerPC/whatever processors. That might lead to the rapid demise of x86 if they performed well!

    16. Re:Trust me... by Anonymous Coward · · Score: 1, Interesting

      A modern GPU is fairly similar in design to the T2, but there are a few key differences

      Also the extensive pipelining (OGL2/DX9/DX10) which murders the result latencies (irrelevant for graphics rendering) and the huge amount of execution units (ATI/AMD's R600 consists of 320 FP32 ALUs -- the Niagara 2 has 8 FP64 ALUs total).

      I posted in my journal recently suggesting that it would be easier to produce a modern GPU than an older card, since modern GPUs have much less application-specific logic and do more in software, relying on just having lots of cores / pipelines to give speed.

      This is just plain silly. Many stages of the old fixed-function pipeline are still necessarily there in today's GPUs -- triangle setup, rasteriser, render output -- and about the rest, the mere management hardware needed to run DX9 shader programs (not counting the actual ALUs) is more complex than the simple vertex transform engines and pixel register combiners of DX7 vintage.

      (And the actual ALUs, taking up most of the real estate together with the huge register files, aren't a walk in the park to design either. Although you can replicate functionality a lot, it's tough to get the whole enchilada efficient -- for example R600 has a complex hierarchy of four banks of sixteen units of a vector + scalar engine pair, with multi-level caching, and multiple shared texture samplers.)

      They don't do more in software, they do vastly more complex things in hardware. Although you have a point that they have been somewhat stepping toward general purpose CPUs in their complexity if not in their workflow and role and strengths.

    17. Re:Trust me... by mikeabbott420 · · Score: 1

      I wish I could use water in my gas tank! Exxon would be fucked!

      --
      This program was made possible by a grant from the Ultra-Humanite, and viewers like you.
    18. Re:Trust me... by MoxFulder · · Score: 1

      I wish I could use water in my gas tank!
      Exxon would be fucked! Putting a Niagara on an ATX board is nothing like using water in your gas tank :-P

      It would just require making a Niagara board of the appropriate size with the right power connectors, and the PCI/PCIe slots in the positions that PC users expect. When the PowerPC first came out, IBM envisioned that people would build computers with PC-type peripherals but PowerPC processors, so they developed the PowerPC Reference Platform! It never took off commercially, unfortunately, but there's no technical reason why something similar couldn't be done with SPARC/MIPS/PPC/ARM-based boards today.
    19. Re:Trust me... by somersault · · Score: 1

      Where are you, Google... in our time of deepest need? Geeks around the world are on their knees, crying out to play Grand Theft Auto without any WINE or Virtualised Politically Corrects,;to have the freedom to accelerate in threedee that which can otherwise only be seen through the Windows.. please help us to feel lucky today!

      --
      which is totally what she said
    20. Re:Trust me... by Anonymous Coward · · Score: 0

      The T2 uses multiple contexts to hide memory latency, mostly caused by incorrectly predicted branches. A GPU typically doesn't bother much with branch prediction, since it runs code that is very light on conditional branches (on average, branches happen every 7 ops in general purpose code. In GPU code, they happen every few hundred). While it's true that GPU code doesn't have very many conditional branches, GPUs like nVidia's G80 series do very similar things to hide memory latency. Since there are so many small "ALU"s, or whatever you want to call them, the G80s work well by starting on a thread, and when they need something from memory, they issue the instruction to fetch the data. Then rather than waiting for the data to come back from memory, the ALU can get to work on another thread. I would say this is one area where the T2 and modern GPUs are similar.
    21. Re:Trust me... by thanatos_x · · Score: 1

      Good enough for raster graphics, not so good for vector graphics or 3D due to there being only 8 FPUs on the die, with only twice the floating point throughput of the terrible-at-floating-point T1. Unless you do swap some of the throughput for soft-floating-point.

      From TFA:
      "8 Sparc cores;
      4-MB L2 cache;
      Integrated floating-point units into each core pipeline
      Double the performance per watt of the predecessor Niagara 1;
      Order of magnitude (10 times, for you non-math types) improvement in floating-point performance;
      Two on-chip 10G Ethernet ports:
      Eight cryptographic units, to support running both Ethernet ports encrypted"

      I'm not debating your point (don't know enough to), just pointing that out.

      --
      I am not an expert. If I am misled in something, please correct me.
    22. Re:Trust me... by C-Shalom · · Score: 1

      Polaris Micro makes a chip based on OpenSparc.
      They're also considering using the OpenSparc architecture for building a version that plugs into the "Torrenza" chip socket used by Advanced Micro Devices.

    23. Re:Trust me... by Spy+Hunter · · Score: 2, Informative

      I'm sure it would be difficult to mold this chip into a GPU, but I'd like to point out that NVidia's GeForce 8 series actually does bother (a bit) with integer arithmetic, and is actually a scalar architecture, with no 4-way vector instructions (though it does pair multiple functional units with one instruction decoder, each functional unit executes a different thread).

      The biggest differences between Sun's chip and the 8 series are probably in the memory architecture. The 8 series has a ginormous memory bandwidth and many specialized ways to access it, all wired up in a highly optimized pattern: z-buffering hardware, framebuffer blending hardware, many read-only texture sampling units (which are powerful processors in their own right with dedicated caches), and local programmer-managed read-only and read-write memories for each core.

      --
      main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
    24. Re:Trust me... by Hucko · · Score: 1

      Still, one needs to question why IBM couldn't get it to take off commercially.

      --
      Semi-automatic amateur armchair Australian philosopher; conjecture ready at any moment...
    25. Re:Trust me... by level_headed_midwest · · Score: 1

      They couldn't get it to take off commercially as there were a lot of legacy x86 DOS and Windows applications out there. There would also be a lot of older devices that would lack drivers for the new-arch OS. Running x86 apps would require slow and computationally expensive emulation for a PPC-powered computer. Apple managed to switch arches when going from MC680x0 CPUs to the PowerPC units, but that was helped by the fact that the PPC CPUs were much faster than the m68k units they replaced. The PPC chips at the time that the CHRP was around were much closer in speed to the Pentiums, K5s, K6s, and Pentium IIs that they were to replace.

      Switching architectures is a pretty non-trivial thing to do even when the new one is backwards-compatible with the old one, such as x86_64 is with x86. The drivers still need to be updated and some old applications still won't be 100% compatible unless recompiled or otherwise built against the new system. Proprietary binary OSes like Windows running proprietary binary applications make this pretty tough to do, so that is why the open-source OSes running open-source applications are really the only setups that are able to change architectures in a reasonable amount of time and with a moderate cost.

      --
      Just "gittin-r-done," day after day.
    26. Re:Trust me... by Anonymous Coward · · Score: 0

      Though you refer to sunk costs when perhaps you mean barriers to entry, your analysis is spot on, and essentially why AMD bought ATI.

    27. Re:Trust me... by _damnit_ · · Score: 1

      Maybe it would be easier to request Sun to make the T2 compatible with the AM2 socket. I know Sun is very familiar with Hypertransport, they could design this lowend proc to work in AMD compatible mobos with a compatible EFI BIOS. Maybe allow us to stick a T2 in a terranza socket for giggles. Why not?

      I think that's what the OP was wishing for.

      --


      _damnit_

      It's my job to freeze you. -- Logan's Run
    28. Re:Trust me... by MoxFulder · · Score: 2, Interesting

      Maybe it would be easier to request Sun to make the T2 compatible with the AM2 socket. I know Sun is very familiar with Hypertransport, they could design this lowend proc to work in AMD compatible mobos with a compatible EFI BIOS. Maybe allow us to stick a T2 in a terranza socket for giggles. Why not? You know, I hadn't thought of that, but it's a great point! The Hypertransport bus (AMD's design) was intended to be a *standardized* front-side bus, allowing different types of processors and coprocessors to work together. And Sun already builds systems around Opterons. If they made HyperTransport-enabled SPARC processors to fit Socket 940/Socket AM2, that'd be awesome... I could drop one in PC and not even change the RAM or motherboard, though the retarded PC BIOS would have to be replaced with some SPARC-aware firmware.

      On the other hand, I have heard that the non-x86 processor families (SPARC, PPC, MIPS, etc.) use some innovative chipsets, so it might be advantageous to be able to use a SPARC processor on a motherboard with a completely different chipset as well.

      What I was getting at, basically, is that there's no reason why the PC platform has to be x86-only! The only thing that WAS holding it back is the closed-source nature of the Windows operating system and most applications... case in point: 5 years after x86-64 was introduced, nearly all PCs use 64-bit processors, and yet 64-bit Windows is hopelessly lacking in native applications and drivers. On the other hand, Linux and FreeBSD both had x86-64 ports available before the processors were even available for purchase. I run Ubuntu 64-bit and it works great.

      If I could buy a SPARC or MIPS or PPC board that could be physically and electrically integrated into my current PC, I'd definitely give it a try! There's actually not a lot to it... produce a motherboard that conforms to the ATX physical specifications with the right power connector, and include some of the on-board peripherals that PC users have come to expect: ethernet, audio, IDE and SATA controllers.

      Back in the late 90s/early 2000s, there were similar things, though unfortunately never popular: PowerPC-based PC platforms and Alpha-based PC platforms. I believe that with the increased prevalence of open-source, this is an idea whose time has come!

    29. Re:Trust me... by owlstead · · Score: 1

      ...If they put THESE under the GPL, along with the T1, they'd be getting more press than they could imagine.

      http://www.opensparc.net/

      They are openly discussing making the Niagara 2 available as open source as well, but note that there are some roadblocks such as the US government's restrictions on crypto technology. Yeah, those government restrictions are getting really idiotic. Everybody in the world can do crypto nowadays guys. Wake up, you are destroying US based companies. Note that one of the bigger problems of open sourcing Java are those same restrictions. Sun really seems to be on the receiving end of those restrictions.
    30. Re:Trust me... by Apocros · · Score: 1

      ... since modern GPUs have much less application-specific logic...
      this may be the trend on a relative scale, in that the percentage of logic gates that are used for programmable hardware is increasing. however, an an absolute scale, the fixed-function logic (things like triangle setup, MSAA resolve, etc) is certainly using more logic gates with every new generation of parts. i'm not really disagreeing with you, just noting that it all depends on how you look at it.
      --
      "onward!" cried the copper man, little knowing brass corrupts...
  2. Good floating point too by imroy · · Score: 4, Interesting

    This processor will also have a floating-point unit for each core, unlike the UltraSPARC T1 (Niagara) which only had one shared amongst all 8 cores. This should make it much more suitable than the T1 for a wide variety of applications. The T1 did great on multithreaded server-type tasks (e.g web, email, database) but would have been pretty hopeless for anything doing more than a bare minimum of FP work.

    1. Re:Good floating point too by dread · · Score: 3, Informative

      Correct. At my last employer we found this out the hard way. Most servers were getting great performance but the one that actually did some (and it wasn't much really) FP work was horrible. This should really remedy that problem.

      On the other hand, SUN still suffers from the fact that ETCA is getting more and more mindshare in the telco arena which has been one of their major cash cows. It will be real interesting to see how that pans out in the end.

      --
      I've had a wonderful time, but this wasn't it -- Groucho Marx
    2. Re:Good floating point too by sortius_nod · · Score: 1

      Here in Australia, all the exchange work in our largest telco seems to be done from Sun boxes. I think that's merely showing the company's age, rather than what they prefer.

      Still, would be nice to experience one of these fine boxes in action.

    3. Re:Good floating point too by Anonymous Coward · · Score: 2, Informative

      It has a cryptographic unit per core too. The PDF prezo linked by the page below says that bandwidth of the 8 crypto units is enough to run the on-chip 10 GbE ports encrypted. Sounds like an opportunity for some interesting applications -- VPN, SSL, SAN/NAS encryption, anyone?

      All that and the 64 threads run at 84 watts maximum (not TDP).

      http://sun.systemnews.com/articles/108/3/hw/17688

  3. Yes, but.. by aerthling · · Score: 2, Funny

    Yes, but will it run Vista?

    1. Re:Yes, but.. by Yetihehe · · Score: 0, Redundant

      Nope, but it will blend.

      --
      Extreme Programming - Redundant Array of Inexpensive Developers
    2. Re:Yes, but.. by dbIII · · Score: 4, Funny

      Yes, but will it run Vista?

      It has a Vista emulation mode - move the power switch to OFF and you get something just as useful but more stable.

    3. Re:Yes, but.. by Anonymous Coward · · Score: 4, Funny
      Yes, but will it run Vista?


      No, Vista requires 640 cores, which ought to be enough for anybody.

    4. Re:Yes, but.. by rbanffy · · Score: 2, Funny

      No. Not only a Solaris/SPARC box is 100% Microsoft-free, it's 100% Microsoft-proof.

    5. Re:Yes, but.. by michael+path · · Score: 1

      it's 100% Microsoft-proof.


      Not quite...., but it doesn't make it any less awesome.
  4. Smokin'... by Firefalcon · · Score: 1

    ...Quite literally I suspect if the cooling system ever breaks!

    1. Re:Smokin'... by Anonymous Coward · · Score: 1, Informative

      Actually, it doesn't dissipate that much heat. One of the purposes of the CoolThreads idea and the Niagara chip family is actually to reduce power demands and heat dissipation thus reducing the need for cooling and therefore saving even more power.

      The Niagara 1 (T1) for example only consumes around 70W with 8 cores running 4 threads each. This is comparable to single and dual core chips used in desktop computers today.

    2. Re:Smokin'... by Anonymous Coward · · Score: 1, Interesting

      Spot on about power consumption!

      But how can a core run 4 threads with only 2 integer units? (Hint: it doesn't. "Running" doesn't mean the same as "managing"... where Niagara shines is the lightning-quick context switches between the 4 threads -- hiding memory latency like no other design except certainly Niagara 2 -- but no it doesn't run them in parallel. Nevermind what "32 simultaneous threads" goodness Sun has implied and clueless journos repeated.)

  5. Interesting by ShakaUVM · · Score: 4, Interesting

    I like it. In my work with high performance computers, a significant limiting factor in a lot of our tasks was the interprocessor bandwidth. The Niagra2 has a crossbar, with a huge amount of bandwidth available between the different cores and their L2 caches.

    I'd like to see some benchmarks, and more technical specs, on these babies.

    1. Re:Interesting by Anonymous Coward · · Score: 1, Interesting

      IBM Power5+ and later also had inter-core connections.
      The Niagra line of processors is impressive, but you had to worry about workload AND what would happen in 18 months when the server was "reused" for some alternate need by folks that didn't know what floating point was?

      I've tried to find a fit for the T1 servers in my work as a technical arch for the last year ... it always came down to the customer not wanting to risk it so a V490 at 4x the price would be used instead. http://store.sun.com/ Oh well. The IBM P5+ line was also used a bunch with great success - for home use, check out the P505Q http://www-03.ibm.com/systems/p/hardware/express_e ntry.html - for around $5k, you get a server equiv to a loaded V490 and you can run 40 OS instances of either AIX or Linux. That isn't a typo. Ok, 1 of those has to be AIX 5.3, but that hardly sucks. It's almost as nice as Solaris or Linux. Heck, at least it isn't HP-sUX!

      I do about 25 projects a year - from start to finish, so I see a fair number of needs. The Niagra is an easy fit for a web farm. I just wish I had more of those projects that weren't wintel.

    2. Re:Interesting by TheRaven64 · · Score: 2, Informative
      The T2 has one huge advantage over anything in the POWER line, which is that SPARC is the only non-x86 instruction set supported by HiPE (the High Performance Erlang runtime). This is significantly faster than the runtime used on other platforms.

      Probably not applicable to any of the projects you're working on, but anyone writing Erlang code should check out the benchmarks from R11 running on the T1.

      --
      I am TheRaven on Soylent News
    3. Re:Interesting by rbanffy · · Score: 1

      Not to flamebait you, but how many people write Erlang code?

      It's a real question. I am curious.

    4. Re:Interesting by Anonymous Coward · · Score: 0
      The T1 used 72 Watts maximum for 8 cores x 4 threads/core = 32 threads.
      I wonder what the T2 will use for 8 x 8 = 64 threads per processor ....

      Actually, found my answer in the PDF on the page below --
              84 W @ 1.1 V, worst case (not bogus Intel TDP!).

      http://sun.systemnews.com/articles/108/3/hw/17688


      -Niagara 2 Key Features -

          Second generation chip multi-threading processor optimized
          for Space, Power and Performance (SWaP)

          8 Sparc Cores, 4MB shared L2 cache; Supports concurrent
          execution of 64 threads

          Twice UltraSparc T1's throughput performance and
          performance/Watt

          Ten times improvement in Floating Point throughput
          performance

          Integrates important SOC components on chip: Two 10G
          Ethernet (XAUI) ports on chip and Advanced Cryptographic
          support at wire speed

          On-chip PCI-Express, Ethernet, and FBDIMM memory interfaces
          are SerDes based

      -

      Niagara 2 is the first 64-bit 64-thread SPARC "System on a chip"
      from Sun based on the power-efficient CMT architecture optimized
      for Space, Power and Performance (SWaP). It is the successor to
      Niagara 1, which is known in the market as UltraSPARCR T1. It
      doubles Niagara 1's throughput performance, significantly
      improves Floating point throughput performance, has advanced
      cryptography support and two 10G ethernet ports on chip.


    5. Re:Interesting by Jasin+Natael · · Score: 2, Interesting

      If anybody is planning to benchmark this running common apps, I'd also be very interested to see how the approach to hiding memory latency works on more pedestrian applications like video encoding and pattern recognition (and maybe even thread-heavy GUI's).

      IIRC (I researched this proc years ago for a University paper), it tries to hide latency by switching thread contexts whenever there is a cache miss or branch misprediction. The crossbar should help a little with cache-related stalls, but the core would already have switched to another thread in any case. So, if there are complex paths of execution, you'd only run them a fraction of the time, on cores that are pretty bare-bones to start with. HPC is probably still better off with single-processor systems, even with the addition of per-core FPU's in Niagara 2, but the Niagara architecture could be really great as a coordinating hub and reporting center for a number of networked number-crunching machines.

      --
      True science means that when you re-evaluate the evidence, you re-evaluate your faith.
    6. Re:Interesting by TheRaven64 · · Score: 1

      I'm not sure, to be honest. Apparently it's popular in telecoms (which is why it was invented). I wrote some for my PhD, since it's the only sane way of writing code that targets a 64-processor machine but still runs okay on my laptop.

      --
      I am TheRaven on Soylent News
    7. Re:Interesting by markhahn · · Score: 1

      it's not HPC if you care about a measley 8 cores ;)

      seriously, the magic of HPC is in how you do the interconnect fabric, since that's the only way you can scale. 8 cores is nice, SMT is nice, but what if your program needs 4TB of ram to run at all and 1K cores in order to finish this year, and each thread communicates every 100 us with unpredictable, nonlocal other threads?

      it's simply false that more-slower-threads are a panacea, and it's not the case that end-users primarily care about flops/watt. but there's a substantial market which is not performance limited (web hosting, desktops) for which both multithreading and slow-but-cool are wins.

    8. Re:Interesting by jarich · · Score: 1
      I've heard that Meebo.com uses Erlang for some of their server bits.

      And the new Pragmatic Programmers book on Erlang will broaden it's exposure. http://www.pragmaticprogrammer.com/titles/jaerlang /index.html/

      It's not a commodity language yet, it seems to be on the upswing. It solves some very interesting multi-core/multi-machine problems with almost not effort on the part of the developer.

    9. Re:Interesting by dfn_deux · · Score: 1

      Okay, I'll bite... Lets see these bench marks.

      --
      -*The above statement is printed entirely on recycled electrons*-
    10. Re:Interesting by Anonymous Coward · · Score: 0

      http://spec.org/ Without knowing your workload type, which benchmark are you hoping for? As a generic benchmark I've used SPEC INT RATE 2000 and TPM - both are flawed since almost no real work looks like them.

    11. Re:Interesting by ShakaUVM · · Score: 1

      Because a lot of high performance apps are structured in a multi-tier fashion to take advantage of cores, or multiple CPUs on the same board, because the latency is better than going across an interconnect. For example, if you're iterating over an array, you put cores near each other, so they can communicate their work with no more effort than pulling data from a different cache.

  6. Re:low...... by dread · · Score: 1, Insightful

    Well, we tend to have jobs that are somewhat interesting and potentially even what is commonly known as "a life". This may be an unfamiliar concept but it includes things that are more important than the processing capability of the latest SUN processor (though not by much) but a lot of the added value comes from the fact that conversations with a two year old are generally more interesting than debates here.

    --
    I've had a wonderful time, but this wasn't it -- Groucho Marx
  7. Will it be water-cooled? (nt) by Rob+Simpson · · Score: 2, Funny

    (nt)

    1. Re:Will it be water-cooled? (nt) by lobStar · · Score: 1

      None of the current Niagara servers is water-cooled, so most likely no.

    2. Re:Will it be water-cooled? (nt) by morie · · Score: 2, Funny

      Then why call then Niagara?

      --
      Sig (appended to the end of comments I post, 54 chars)
    3. Re:Will it be water-cooled? (nt) by Anonymous Coward · · Score: 0

      Should be called Nigra 2 instead.

  8. Regurgitating "Quad" market speak by Eukariote · · Score: 5, Informative

    Along with having more cores than the quads from Intel and AMD...
    What quad from Intel/AMD? Intel is selling two dual cores on a cracker. The "quad" bit is just marketing, the actual silicon chips are pure dual core designs that have to talk across the front side bus just as in a two-socket server. And AMD has so far only been previewing their quads, you can't buy them yet.
    1. Re:Regurgitating "Quad" market speak by LWATCDR · · Score: 1

      I like AMD.
      But you are wrong from the consumer point of view.
      I can get an inexpensive Intel system with four cores in a socket. That is a selling point.
      I can get a two socket system with eight cores for not that much money as well.

      What you will not see are people pushing four socket sixteen core systems that way. At that point yes the Intel two duels on a cracker falls apart.
      BUT and this is a big BUT for a lot of people the Intel hack works and works well.

      I do hope that AMD does well with their four core cpu but to dismiss the current Intel part just isn't fair.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
  9. Sun doesn't get much processor press by Sycraft-fu · · Score: 0, Troll

    Because it isn't main stream. Their stuff is really, really expensive. That doesn't mean there's not a place for it (they'd not be in business if there wasn't) but most people just don't care because they can't pay that price. As an example we just took an old Sun server off of maintenance because we've moved all functions off of it. The cost of maintenance was $2,500 per year. No, that's not a typo. Ok well for that price, we can literally buy a new fairly high performance server from someone like Dell or Gateway (with a 3 year warranty).

    Well, when you are talking in price classes of that nature, most people just don't give a shit. A new Intel or AMD processor is exciting because it is something that is in the realm of what people can actually afford. Even if the item itself is high end, you know it'll be coming down soon enough. Intel's quads were over a grand when they launched, now you get get one for like $300. Sun stuff based on their own chips (and even not) is just damn expensive. If you aren't an enterprise type user it just isn't going to be on the list of things you'll get.

    Thus much less press.

    Also, their processor division has been kinda lagging. The SPARC offerings prior to this really haven't stacked up that well against what Intel and AMD have. We got a Sun Fire V440 and it works fine and all for the SPARC only apps we have, but for things that will run on x86, it gets blown away by Core 2 Duos.

    The Niagara looks cool but the base model is $10,000 which gets you the 4 core version of the chip and 8GB of RAM. If you want the 8 core setup, that's $21,500 minimum. At those prices, there's going to be little mainstream press as that is out of the range of even most companies. Thus most people just don't care, as Sun never will be bringing it to the masses (barring a massive strategy change).

    1. Re:Sun doesn't get much processor press by Anonymous Coward · · Score: 5, Interesting

      With all due respect mate, you don't have a clue. We, like most other financial companies in the world, buy Sun/IBM P5/HPUX/etc stuff because it is *cheap*... seriously, compared to the mainframes that handle the real back end, these babies are practically free.

      Also, if the last thing you have touched is a V440 then you are not exactly up to speed with the cutting edge of Sun products. I promise you that if you had actually ever seen a system running a T1 chip you would not say "their processor division has been kinda lagging". The cool threads stuff is amazing and they are the only people doing anything quite like it. I am not sure if you picked this up from the article but with one chip you get _64_ hardware based threads.

      In our internal benchmarks a £20k T2000 with 1 x 8 core T1 outperformed a £100k+ V880 with 8 x 2 core Sparc. Freakin' cool and excellent value for money. Plus all this fits in two rack units.

      Working in small companies is nice but I promise you that out there in the big wide world "most" companies don't think that $US20k is very much at all to spend on a system that will be part of a critical service.

    2. Re:Sun doesn't get much processor press by Capt+James+McCarthy · · Score: 5, Interesting

      "Ok well for that price, we can literally buy a new fairly high performance server from someone like Dell or Gateway (with a 3 year warranty)."

      It's all realative. Your 'high performance' Dell or Gateway wouldn't do much other then run bind at one of our locations. You are comparing apples to oranges. These systems are not for you to surf the net with, and as for price, well there is a lot to be gained from stability. I still have sparc systems with OEM (minus the disks) that are close to 20 years old running at some locations. Bet your Dell can't say that.

      --
      There are no loopholes. It's either legal or it's not.
    3. Re:Sun doesn't get much processor press by Sycraft-fu · · Score: 0, Troll

      With all due respect mate, it is YOU that don't have a clue. It's wonderful that as a financial company you like Sun. However you need to recognise that 99.99% of the world is not like you. Most companies have no need or ant for stuff like that and it shows in sales numbers. Sun certainly has a place, hence why they are in business (you sell something nobody wants, your company dies) however they aren't going to get the press that Intel and AMD do because most people just don't care. Home users don't care, small business doesn't care, many large businesses don't care, even ones that do, only a limited number of folks at them care (I am going to bet you don't run Sun on all your desktops).

      If you really think that limited segment of the market reflects the market at large, well, then go talk to one of your analysts and ask them why Sun is worth about 10% of Intel. You'll discover that as much as super critical infrastructure may like things like Sun, the world at large doesn't. Thus when a new Intel chip comes out, that's big news. When a new Sun chip does, it makes news only in trade mags or on some tech sites like Slashdot. This is teh same sort of reason that a new Hybrid Honda will get reviewed in major auto mags and tested by consumer reports but you won't even hear about new GE Hybrid locomitves. It's not that the locomotives are worthless, it is that they are worthless to most people, and thus limited news.

    4. Re:Sun doesn't get much processor press by Anonymous Coward · · Score: 1

      You have a funny way of viewing the world I must say. It seems to me that as far as you are concerned, unless I can pop down to PC World and buy one these it does not count as relevant to the world at large. Forget the fact that every financial transaction you make, every TV show you watch, every phone call you make and every device you use is made possible by systems that the muppets down the local computer shop would never even have heard of.

      If all you are trying to argue is that the latest chip from Sun will not make headlines in the news tonight then we are in agreement. However, that does not matter. Most of what is interesting and important in computing and let's be honest, pretty much everything is of no interest to the masses.

      But hey mate, it is a free world so you can go on thinking that the latest ATI graphics card is what matters in the computer industry, no skin off my nose. But here is something you might want to think about, the same companies that don't flinch at paying half a million for a server also don't mind paying serious money to someone to make sure they run properly.. which is nice.

    5. Re:Sun doesn't get much processor press by MysteriousPreacher · · Score: 4, Insightful

      A few points.

      1) Sun is not trying to win the hearts and minds of home users - that is not their market. Sun would see few benefits from pushing their products in the mainstream media. Trade press is where they reach the decision makers. How many Oracle adverts do you see in game magazines and tabloid newspapers? Not very many, they tend to advertise in business oriented outlets such as The Economist.
      2) Some small businesses don't care about computers at all. The companies that need Sun will buy Sun. The companies who can run their business out of a box of post-it notes will do the former.
      3) When you buy mission critical hardware, you don't look for a '3 year warranty'. You look for a service and support contract based on how critical the hardware is to your business. If you can run your business on a home-made 486dx system running Minix then that is probably the best option.
      4) Sun being worth 10% of Intel is irrelevant. The Economist sells far fewer copies than The Sun (a pretty terrible UK tabloid) but I know which one I'd chose for a serious overview of world news.
      5) This is a techie web site so news like this seems pretty relevant here, even if most of us can't afford to buy the kit.

      --
      -- Using the preview button since 2005
    6. Re:Sun doesn't get much processor press by DisKurzion · · Score: 2, Informative

      I'm going to have to agree with the coward on this one. You don't have a clue. You won't see Sun stuff on the desktops. Sun boxes have their place: The high-performance market. Where I work (hint: Feds), we have multiple Sun boxes set up, which run our virtual servers. If there's one thing that you can never get enough of in this kind of setup, it's multiple threading and RAM. The integrated networking is also a huge boost, since that's the last major bottleneck before hitting the clients.

      He wasn't trying to say that Sun deserves more press. Sure, small businesses and even many large businesses don't require that kind of power. But the coward was right: Sun provides good quality at (relatively) dirt cheap prices. Hence why they make this kind of thing.

      You try running 5+ heavily used virtual servers (Each running a component of Oracle) on one Intel or AMD box. Let me know how that goes for you.

      PS - Solaris kicks ass.

    7. Re:Sun doesn't get much processor press by MysteriousPreacher · · Score: 1

      Argh, I'll preview next time.

      Point 2 should have read:

      "2) Some small businesses don't care about computers at all. The companies that need Sun will buy Sun. The companies who can run their business out of a box of post-it notes will do the *latter*."

      --
      -- Using the preview button since 2005
    8. Re:Sun doesn't get much processor press by TheRaven64 · · Score: 2, Interesting

      I promise you that if you had actually ever seen a system running a T1 chip you would not say "their processor division has been kinda lagging". I disagree. The T1 is absolutely incredible engineering for certain workloads, but before that I can't even remember the last competitive chip they put out, and now Alpha is officially dead I consider SPARC to be the most interesting surviving architecture, so I've been paying attention.

      Rock and T2 look very promising, but before that their processor division was lagging so badly they were putting re-badged Fujitsu chips in their high-end machines to try to stay competitive. Between the end of the dot-com era to the release of the T1, Sun's microprocessor division seemed like dead weight. They made a huge gamble to start designing web app optimised chips as the bubble was bursting, and it looks like it will end up paying off, but it comes at the end of a period where 'lagging' is a very polite way of describing their performance.

      By the way, there seem to be a lot of low-power SPARC variants, but I've never seen a palmtop form-factor device containing one. Do they exist? I'd be very interested in one, since SPARC, even SPARC32, is a lot better supported than ARM, in spite of the latter's ubiquity.

      --
      I am TheRaven on Soylent News
    9. Re:Sun doesn't get much processor press by conares · · Score: 1

      "The Niagara looks cool but the base model is $10,000 which gets you the 4 core version of the chip and 8GB of RAM. If you want the 8 core setup, that's $21,500 minimum. At those prices, there's going to be little mainstream press as that is out of the range of even most companies. Thus most people just don't care, as Sun never will be bringing it to the masses (barring a massive strategy change)." OK, great I got the money...does it run games?

      --
      That, that really grinds my gears!
    10. Re:Sun doesn't get much processor press by afidel · · Score: 1

      Sun's stuff isn't expensive, at least not if your workload fits into their profile. We are doing a bake-off between a number of vendors for a Peoplesoft/JD Edwards business intelligence project and the current cheapest hardware is Sun, beating the Windows/HP offering by 10% while being significantly more scalable. Then there is the added bonus that the 3 year operating cost is also 10% less due to needing so many fewer boxes thanks to the Niagra chip on the web/app servers.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    11. Re:Sun doesn't get much processor press by Anonymous Coward · · Score: 1, Informative
      The workloads they excel at aren't as general as you'd like to think. I had a fair amount of hands on experience with some T1s, they would only ever beat a dual opteron if you could get over 8 threads working on a problem and it wasn't I/O bound. Otherwise it was at best a tie and usually a run of the mill dual opteron could pretty easily hang.


      Now that's not to say that there aren't server loads that are like that. Specifically with java app servers, it's not uncommon to have hundreds of threads doing light weight tasks.


      I like the direction that they are moving in though. In a few generations, I'm sure they will either have competitive chips across the board or not make chips anymore and the energy efficiency is a long overdue area to focus on.

    12. Re:Sun doesn't get much processor press by Anonymous Coward · · Score: 0

      It's hardly accurate to say that most companies have no need for high-end server hardware, but obviously the number of companies of any size, let alone with greater than entry level (Intel server) requirements, is much less than the number of end-user PC owners, hence Intels sales figures are larger than Sun.

      There are whole industries that run on Sun hardware. Finance, Communications for starters.

    13. Re:Sun doesn't get much processor press by Anonymous Coward · · Score: 0

      And you get protection for most canned exploits for free, since it's not an x86 CPU. This alone can keep many cracker-wannabes away.

    14. Re:Sun doesn't get much processor press by southpolesammy · · Score: 1

      And it's also really a matter of target markets. Sun's biggest customer segments are the US gov't and large financial institutions, and it's likely that those two customer bases comprise at least 1/3 of Sun's revenues. For them, $20k/server is par for the course, and also inline with what competitors are charging for similar gear.

      The other point made above is that Sun realized a long time ago that they can't compete margin-wise with Intel on the desktop CPU market. Way back when, Sun's revenue used to be workstation driven (hence, the stock ticker....SUNW, W=workstation). But then the Microsoft marketing engine of the 90's allowed Intel to kill the viability of that market for them, and they've since concentrated on the server market.

      Truth be told, they struggled mightily throughout the late 90's and early 00's due to this, but this new architecture really holds promise, and is way ahead of anything that Intel or AMD have out now. Like the original synopsis says, their only real competition is IBM's Power 6, which is a beast of a chip in its own right.

      --
      Rule #1 -- Politics always trumps technology.
    15. Re:Sun doesn't get much processor press by TheLink · · Score: 2, Informative

      While I'm sure your 20k T1 outperformed your 100k v880. It does not show that the T1 is a better choice than an Intel/AMD system.

      Sun's stuff are slower than IBM's POWER line, and they are nowhere in the same league as IBM mainframes, and IBM mainframes are not in the same league as real nonstop computing clusters.

      Mainframes = very good uptimes, but you have _scheduled_ downtimes.
      Stuff like OpenVMS or Tandem = uptimes of _decades_ possible, don't even need scheduled downtimes where you turn everything off, you can run while replacing the hardware. With the Tandem stuff you even have CPUs running the same thing at the same time for real redundancy. Only thing is HP seems to be burying VMS and Tandem.

      Sun? They didn't even have hardware instruction retry till Fujitsu SPARC. For many years it was pretty embarassing that the really high end SPARCs were Fujitsu rather than Sun - the fastest SPARC systems till just a few years ago were all Fujitsu PRIMEPOWER (I haven't bothered checking recently, the last I recall Sun started using Fujitsu stuff for their high end systems).

      Sun got where they were by making relatively cheap Unix RISC workstations and they provided servers for areas where reliability and availability didn't really matter as much as the real "high end" stuff. They caught the internet wave for "cheap" webservers etc and made a lot of money then.

      The problem now with Sun is, they get blasted at the low end by x86, and at the high end they pale in comparison to IBM's stuff.

      For "normal" webserver/db/internet/corporate stuff it's x86.
      In the HPC arena it's x86 (for scale out), and IBM (for scale up).
      So where does Sun fit?

      Just go google for benchmarks of T1 vs Intel vs AMD. The T1 doesn't even do that well for performance/power consumption when compared to the Intel woodcrest CPU: http://www.anandtech.com/printarticle.aspx?i=2772

      For Sun's sake, their Niagara 2 better be magnitudes much better than their T1, if not it'll be out of date even before it's released.

      Don't get me wrong, I'll be happy if Sun succeeds, but they've fallen way behind.

      --
    16. Re:Sun doesn't get much processor press by TheLink · · Score: 1

      If you don't like the Anandtech benchmark I linked to, go check the SAP one:

      http://www50.sap.com/benchmarkdata/sd2tier.asp

      And you'll see what I meant about Sun being squished between x86 and IBM. Look at the T1 showings (add the 4 linked entries together to get the total performance, or go for the single one at: http://www.sap.com/solutions/benchmark/pdf/cert470 5.pdf).

      Compare with x86 or IBM POWER or even Fujitsu's 2 year old stuff (http://www.sap.com/solutions/benchmark/pdf/cert13 05.pdf).

      Go compare the SPEC benchmarks too if there are any T1 ones.

      --
    17. Re:Sun doesn't get much processor press by ancientt · · Score: 2, Informative

      They sent me a Sunfire last year telling me it had capabilities it didn't. I sent it back. If $21,500 is the price tag, then this would outperform what I was considering at approximately the same price. I doubt these have the capabilities I was looking for, (I want Windows in Xen) but the price isn't going to be it's limiting factor.

      By the way, if they're still running it, the Try and Buy program is every bit as good as it sounds. They shipped me a server for the asking, I tested it, and sent it back, all on their dime (except my time.) If I'd talked to salespeople honest enough to say "I don't know" rather than "of course" I wouldn't have had it sent in the first place and might be running one of their servers right now. Maybe next year.

      --
      B) Eliminate all the stupid users. This is frowned upon by society.
    18. Re:Sun doesn't get much processor press by DamonHD · · Score: 1

      I bought a (24-thread) T1 because it was *cheap* for the specs, and because I can use all that parallelism for my Web server ( http://mirror-uk-rb1.gallery.hd.org/ ) and stuff it is doing on the backend, and it is *lovely*.

      I also use various flavours of Intel chips including Core Duos in the MacBook that I'm typing on now and in my new low-power Linux laptop that's replaced 5 old Solaris servers ( http://www.earth.org.uk/low-power-laptop.html ), but I have to say again that the T1/Niagara is lovely to write threaded code for (eg in Java).

      Rgds

      Damon

      --
      http://m.earth.org.uk/
    19. Re:Sun doesn't get much processor press by Fweeky · · Score: 1

      The problem now with Sun is, they get blasted at the low end by x86 Uhm, so it's a good thing Sun do a very nice range of x86 servers, then?
    20. Re:Sun doesn't get much processor press by Anonymous Coward · · Score: 0

      Sun got where they were by making relatively cheap Unix RISC workstations and they provided servers for areas where reliability and availability didn't really matter as much as the real "high end" stuff. They caught the internet wave for "cheap" webservers etc and made a lot of money then.

      Would a 72-way Sun Fire be a cheap Unix RISC workstation or a "cheap" webserver, then? I admit their clientele was riff-raff such as Finance or Insurance.

    21. Re:Sun doesn't get much processor press by TheLink · · Score: 1

      Uh, how does that stop them from getting blasted by x86 stuff from Dell, HP, IBM etc?

      Selling x86 stuff is a bitter pill that Sun has to swallow since they've had duds with SPARC so far (unlike IBM with POWER).

      There's probably a market for better x86 servers and Sun might be the one. BUT currently why should anyone buy a Sun x86 server instead of from the others? So that you can run Solaris 10[1]? Is there any evidence that they make the best x86 hardware for the price?

      [1] IBM/HP consultants will happily charge companies with more money than available skilled staff to help them put/build all sorts of stuff on Solaris on Sun/HP/IBM/Dell/whitebox, what does Sun offer in that area? Other companies will just use stuff like Linux or even windows (IBM/HP will still be happy to "help" you for a fee in any case).

      --
    22. Re:Sun doesn't get much processor press by TheLink · · Score: 1

      Sun Fire? That comes under 'servers for areas where reliability and availability didn't really matter as much as the real "high end" stuff'.

      Sun's lucky that many people were/are willing to pay a lot for something that's not really much more reliable than Linux/BSD on decent x86 hardware.

      Maybe those people were accustomed to Microsoft levels of reliability and assumed you had to pay much more.

      Then again with the sort of people about nowadays it's often no point having servers and software that can run for decades. Just only today someone rebooted one of our servers that was running fine, well as fine as it could be - the actual problem was network related.

      But maybe that could mean something with Dell's level of "quality" making more sense. :(

      --
    23. Re:Sun doesn't get much processor press by pavera · · Score: 1

      I just went to their site, and a Niagra 1 T1 with 6 cores x 4 threads is 3900, not 10k. That is not expensive, a 2x4core intel server with decent RAM, HD, and network interfaces is at least 3k, and I'm pretty sure that niagra is going to blow away the intel solution.

    24. Re:Sun doesn't get much processor press by cartman · · Score: 1

      I am not sure if you picked this up from the article but with one chip you get _64_ hardware based threads.

      Those 64 threads do not execute simultaneously. Sun has lengthened the pipeline for Niagara2 and a added a "pick" stage where 2 threads per processor are chosen to execute. As a result, Niagara2 can execute 16 threads simultaneously. Compare that to an 8-way opteron system which executes only 8 threads simultaneously, but which is superscalar and executes at least 2 instructions/thread/cycle (with twice the cycles, bear in mind--3GHz). In other words, Niagara2 could theoretically execute (16*1.5) 24 billion instr/sec whereas the cheaper 8-way 3GHz opteron could (theoretically) execute more than twice that many. So, speaking of (theoretical maximum) throughput, the opteron clearly has an advantage.

      Of course the opteron will stall for a long time on cache misses and therein lies the true Niagara advantage.

      In our internal benchmarks a £20k T2000 with 1 x 8 core T1 outperformed a £100k+ V880 with 8 x 2 core Sparc. Freakin' cool and excellent value for money.

      That is comparing proprietary hardware from Sun, to other proprietary hardware from Sun. The comparison is beside the point, since the V880 uses US-IVs which are in-order (!!), low clockspeed processors. US-IVs are undoubtedly outrun by x86.

      The question is whether a £20,000 T2000 would outrun a £15,000 8-way Opteron system from Sun. The answer seems to be: yes, modestly, but only for very specific workloads which are severely affected by memory latency (like OLTP) or which are benefited by application-specific hardware in the Niagara cores (like SSL webserving).

      The cool threads stuff is amazing and they are the only people doing anything quite like it.

      Indeed, nobody else is taking the fine-grained multithreaded approach.

      The "throughput computing" strategy was a brilliant move for Sun. It was the best thing they could have done. It was the only way they could make Sparc-compatible chips for commodity servers that are at least somewhat price competitive with x86 offerings. It was really the only way of saving the Sparc franchise.

      With that said, it relegates Sun to a niche.

    25. Re:Sun doesn't get much processor press by Anonymous Coward · · Score: 0

      Dijkstra said "the bottle neck is in the sinchronizing".

      Many threads => Many bottle necks.

    26. Re:Sun doesn't get much processor press by Jynx77 · · Score: 1

      Yes, but what else did you compare it to? We had "experts" at a company I used to work for configure a T2000 as an Oracle database server and it ran at 1/6th the speed of a five year old HP intel box we were using for development while we waited on the "high-end" prod equipment. After a month of fiddling, it still ran at 1/6th the speed. I am app dev guy, haven't been invovled in HW more than I had to for the last 10 years but I must say, Sun was the emporor's new clothes the way it played out on that project.

      --
      It's turtles all the way down!
  10. quad is a quad and I want a cheap 8-way desktop by OrangeTide · · Score: 3, Interesting

    customers just want to fit 4 cores in one socket. That's all that matters. That you can get a 1U with two sockets and put 8 intel cores in it under under $2k is a big deal right now.

    That said I've always wanted to get my hands on some of these new multicore UltraSparcs. I think they have a lot of potential, and the new ones seem extremely powerful.

    Now if only Sun would but the low end one in a mac mini form factor and sell it as a java developers kit then maybe I could play with one. The low end sun fires are something I could almost afford, but I don't really want to keep a 1u on my desk just to try out the technology.

    I think the big 64-bit address space and the ability to run lots of threads seems to fit well with Sun's Java. Not that I am a Java developer, I just think it's a good match, and it seems to be that's why people were using the older CoolThreads systems, enterprise Java.

    --
    “Common sense is not so common.” — Voltaire
    1. Re:quad is a quad and I want a cheap 8-way desktop by Eukariote · · Score: 2, Interesting

      Customers just want to fit 4 cores in one socket. That's all that matters.

      Note that the post was about the number of cores/threads in the Niagra chip design. In terms of chip design, the circuitry on the silicon is what matters, not how you package, integrate, or market it. Moreover, it does matter to a customer if marketing speak fobs him with two dualcore chips on a cracker instead of an integrated four core design.

      Performance does not scale purely with the number of cores, it also matters how efficiently the cores can share resources and intercommunicate. Things like accessing shared memory and inter-process communication are an important part of real world applications. Just try to run a heavy duty threaded server benchmark with a lot of IPC on a faux "Quad" and compare how it scales relative to a true Dual core design, and you'll be lucky to get 1.5-1.6 times the performance instead of twice.

    2. Re:quad is a quad and I want a cheap 8-way desktop by dbIII · · Score: 1

      That you can get a 1U with two sockets and put 8 intel cores in it

      It's even better that you can have two boards like that side by side in 1U, had a few since early this year. For some things this Sun is really going to walk over a pile of processing nodes like that - but it won't be as cheap.

    3. Re:quad is a quad and I want a cheap 8-way desktop by brucmack · · Score: 4, Insightful

      In terms of chip design, the circuitry on the silicon is what matters, not how you package, integrate, or market it.

      I agree with you on this point.

      Moreover, it does matter to a customer if marketing speak fobs him with two dualcore chips on a cracker instead of an integrated four core design.

      I don't agree with you here. What matters to the customer are costs and performance. They shouldn't have to care about how the package works, as long as it works correctly.

      From Intel's perspective, they had two options:

      1. Start with a new design that integrates all four cores on a single chip.
      2. Put two existing chips onto one package. Chips that they've been manufacturing for quite some time, so yields are good and there's headroom for higher clock speeds or lower power consumption.

      From the customer's perspective, those two options correspond to:

      1. A chip that performs a bit better, but probably costs more and definitely comes on the market later.
      2. A package that's got some performance drawbacks in certain situations, but is available now at a reasonable price.

      What do you think Intel and their customers prefer?

    4. Re:quad is a quad and I want a cheap 8-way desktop by Sycraft-fu · · Score: 3, Insightful

      Also Intel's seems to have shown that having two units that need to communicate across the FSB doesn't really cause any problems. Worked fine for their Pentium Ds (2 single cores) works fine for the quads. While bus contention assuredly becomes a problem at some point, with just two units it doesn't seem to be for normal tasks.

      Thus it makes it a worthwhile design to go with. I could see it continuing too. Maybe their next gen chips are 4 cores on a single unit which goes mainstream, and then an 8 core 2 unit job for higher end stuff. At some point there may be too many cores per unit to do with without bus contention, but them maybe not since the speed of the bus keeps getting increased. Also I could see OSes being made aware of this, if it continues, and knowing that each X number of processors is a unit and you can shuffle all you like withing that, but shuffling across units incurs more penalties and thus isn't done unless it has to be. So if a process had 4 threads, and a unit was 4 cores, it'd make sure all the threads were running on the same unit.

      Regardless, you are correct that at this point it is an excellent idea. Doesn't matter if it is the most technically correct solution or not, what matters is that it works well and is cheap.

      We make concessions like that all the time in the computer world. Memory would be a good example. For a good while on desktops, memory, the FSB, and the processor ran at the same speed. You had a 30MHz 386, you were running 30MHz memory. Multipliers weren't a things you worried about. Then, we started to run in to limits of what memory could do. We could scale processors faster than RAM, or at least faster than RAM could be done cheaply. Thus the start of clock multiplied chips. This works, but at some point the memory is just too slow. So then we start getting in to tricks like DDR RAM, which transfers twice per clock cycle, and interleaving RAM, so that the processor has two channels to get faster access and so on. Currently you can have a CPU at one speed, an FSB at another, and memory at a third. Right now I've got a 2.66GHz CPU, a "1333MHz" FSB (it's not really 1333MHz, FSBs are quad pumped so it really runs at 333MHz) and "667MHz" RAM (again not really, it's DDR so the actual memory clock is 166MHz, bus clock is 333MHz, it just does 667 million data transfers per second hence the rate) and this is not an uncommon setup.

      None of this is an ideal setup. Ideally, the FSB would run at the same speed as the processor and so would the RAM. This would lead to the processor having almost no wait time for memory data and very little need for trickery to try and prefetch data and such. However alas, if it were possible at all it would be too expensive to do. Thus we have this somewhat hacked solution. However in reality it matters little, though a hack it may be, it works real well. It has given us memory that can get the data to the CPU in a timely fashion and doesn't break the bank.

    5. Re:quad is a quad and I want a cheap 8-way desktop by TeknoHog · · Score: 2, Insightful

      I'd be glad to have any kind of 4-way SMP system. Whether they're all on different chips or all on the same, I'd still get 4 CPUs of processing power. Of course, inter-CPU communication makes a difference in certain applications, but people have worked with traditional SMP systems for decades, and we know how to make good use of them. Putting them on the same die won't solve the basic problems of parallelization.

      --
      Escher was the first MC and Giger invented the HR department.
    6. Re:quad is a quad and I want a cheap 8-way desktop by Anonymous Coward · · Score: 0

      You can get 1U Quad Opteron systems. So once AMD releases it's quad core ship that would be a 16 core system

    7. Re:quad is a quad and I want a cheap 8-way desktop by OrangeTide · · Score: 1

      Customers I've talked to don't seem to be interested in the two dualies talking over FSB or one quad sharing cache and quickly migrating threads between them. I am well aware of the technical advantages of the designs.

      The benchmarks on quads hold up well for the right kind of work loads, you get some real metrics and show customers are pretty bar graph telling them for $500 more they can have twice as many processors going in a 1U and get a 50-75% boost in performance, they will jump on it. Intel's dual-dual-dual (dual core, dual silicon, dual processor) setup is very cost effective and is extremely easy to sell to customers. The other thing is there are so many ways to measure performance that you can play all sorts of games. In many scenarios I measure transaction latency under various amounts of load, because it tells me a lot about how the design will scale under load.

      Each type of customer(for each part of the industry) has their own idea what performance means, so it's hard to just say 1.5-1.6 times the performance without me just laughing a bit.

      --
      “Common sense is not so common.” — Voltaire
    8. Re:quad is a quad and I want a cheap 8-way desktop by slashname3 · · Score: 1

      "The low end sun fires are something I could almost afford, but I don't really want to keep a 1u on my desk just to try out the technology."

      No you would not be able to keep a Sunfire on your desk. It is way to noisy for that during normal operations. When they boot up there is a tremendous surge in noise which then settles down to a dull roar that would turn you brain to tapioca after a few hours.

      If you have an equipment room to rack them in and you can get away from them they are not to bad.

    9. Re:quad is a quad and I want a cheap 8-way desktop by OrangeTide · · Score: 1

      a 1U ought to be noisy and have powerful fans, which is why I don't like them on my desk. And no. my apartment does not have an equipment room.

      --
      “Common sense is not so common.” — Voltaire
    10. Re:quad is a quad and I want a cheap 8-way desktop by networkBoy · · Score: 1

      "However alas, if it were possible at all it would be too expensive to do."

      It is possible, and it is expensive.
      L1 Cache is at core cpu speed. It is expensive.

      Another way to do it, which you hinted at is interleaved memory. Not the same as dual channel DDR, but genuine interleaved memory.
      Tektronix does this for scope and logic analyzer acquisition memory.
      100MHz sdrSRAM has the potential of 100Mbps/lane if you instead pipe the rising edge of the clock to one chip and the falling edge to another you get 200Mbps/lane, add quadrature clock and phase offsets and you can get 8 chips sharing a "single" clock giving you 800Mbps/lane. This is how the scopes ultimately achieve 10GHz sampling and waveform acquisition.

      The catch? you go through gobs of memory, timing has to be perfect, and it costs a bundle. But, it is possible.
      -nB

      --
      whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
  11. Re:low...... by Anonymous Coward · · Score: 0

    you make it sound like talking to a 2 year old is something bad to compare something worse to... Have you ever sat down and talked to a 2 year old? the conversations you have are much more interesting than that of your peers. You should try talking to a few younger kids, or even have some of your own. Have a good one.

  12. Re:yes but ... by utnapistim · · Score: 3, Funny

    ... will a beowulf cluster of these run linux, or blend?

    --
    Tie two birds together: although they have four wings, they cannot fly. (The blind man)
  13. Niagara by Anonymous Coward · · Score: 0

    "Sun to Release 8-Core Niagara 2 Processor"
    So does that mean the Canadian version is going to be ten times better than the American?

    1. Re:Niagara by Libertarian001 · · Score: 1

      No, but I hear the English are releasing one what goes to 11...

    2. Re:Niagara by dido · · Score: 1

      Maybe something like this?

      --
      Qu'on me donne six lignes écrites de la main du plus honnête homme, j'y trouverai de quoi le faire pendre.
    3. Re:Niagara by Anonymous Coward · · Score: 0

      Yes. Yes it Does. And all strings returned from if statements will end in -eh? And since I cant figure out a good metaphor for it I will just say it: This Canadian Version of which you speak will also be a tourist hole full of jerks with cameras, people on honeymoons feeling duped into going to said tourist hole.

    4. Re:Niagara by Anonymous Coward · · Score: 0

      Gives completely new meaning to the waterfall development model, doesn't it?

  14. on-chip 10G Ethernet ports by bazorg · · Score: 1

    huh? ethernet ports where? anyone care to explain?

    1. Re:on-chip 10G Ethernet ports by Cheesey · · Score: 4, Informative

      High-speed CPUs are all limited by a bottleneck - getting data on and off chip. Putting the Ethernet controllers on chip helps to offset this.

      In the future, it is likely that all the wired buses in your motherboard will be replaced by an internal Ethernet-like network. We are already seeing a trend towards simpler and faster interconnects such as SATA. The next step is to use Ethernet-style connections for every chip-to-chip link, and within the chips themselves too. If this seems unlikely, consider that your PCs memory bus already is basically a network connection. The device at one end (CPU) is in a different clock domain to the device at the other (memory). Data is sent in packets (called bursts) to offset the latency of setting up a transfer.

      --
      >north
      You're an immobile computer, remember?
    2. Re:on-chip 10G Ethernet ports by BiggerIsBetter · · Score: 1

      Wasn't that the promise of technologies like HyperTransport, or are you thinking of something even more serial?

      --
      Forget thrust, drag, lift and weight. Airplanes fly because of money.
    3. Re:on-chip 10G Ethernet ports by pklinken · · Score: 0

      I think they meant Intarnet ports.

    4. Re:on-chip 10G Ethernet ports by Anonymous Coward · · Score: 1, Informative

      I think he's trying to refer to SerDes.

      The T2 has a SerDes switch connecting processor cache, RAM, PCIe buses,
      and the 10 Gb Ethernet interfaces, all on-chip.

      Impressive bandwidth, over 1 Tb/second total if I read correctly.
      See the chart on page 16 of the PDF on the page below:

      http://sun.systemnews.com/articles/108/3/hw/17688

      Lots of tech details in this Sun presentation too, including power (84W max
      for 8 cores/64 threads, including Ethernet and RAM controllers).

    5. Re:on-chip 10G Ethernet ports by imgod2u · · Score: 1

      It's already moved there. It's called PCI-Express (or Hypertransport). Buy a modern motherboard. The various controller chips in there are connected via a multi-lane PCI-Express link.

      I don't agree on the memory though. As far as I recall, memory interfaces are the one thing that have not gone link-based. We're still using Synchronous RAM and all of the major bus interfaces use synchronous interfaces. This makes sense as crossing clock domains takes time which adds memory latency. Packetizing it takes even more time (along with overhead) and, IIRC, no memory interface to day uses such a model.

    6. Re:on-chip 10G Ethernet ports by Anarke_Incarnate · · Score: 1

      Not "ports" but controllers. The controller for (2) 10GbE ethernet is built into the chip itself, so you don't have to go to an off chip device to control them.

    7. Re:on-chip 10G Ethernet ports by fm6 · · Score: 1

      So really, it's on-chip Ethernet controllers, not ports.

    8. Re:on-chip 10G Ethernet ports by zrq · · Score: 1

      In the future, it is likely that all the wired buses in your motherboard will be replaced by an internal Ethernet-like network
      That sounds familar http://en.wikipedia.org/wiki/Transputer#Links. Looks like INMOS had the right idea after all.
    9. Re:on-chip 10G Ethernet ports by owlstead · · Score: 1

      Yeah, they tried to glue on the connectors, but it didn't work.

    10. Re:on-chip 10G Ethernet ports by owlstead · · Score: 1

      Besides having the information of the ethernet ports go directly into the processor, those processors now all have a fast crypto-unit in there. Think SSL acceleration. This can safe a lot of cost for hardware SSL accelerators - because you won't need them in front of your application server anymore. Of course, if you want to have the private server key really safe, you shouldn't put it in a general purpose CPU like a Niagara processor.

      In my opinion, each and every computer should have at least a secure keystore with RSA and ECC cryptography (TPA modules have one), secure hash and 3DES/AES acceleration and a random number generator. The gains are just too high to ignore cryptography in current chips (~100:1 performance difference). For servers like the low clocked Niagara processors, they are a must to compete. It won't be long before AMD and Intel follow suit. VIA C7 processors are already equipped with hardware crypto acceleration.

  15. Freudian Processor? by Anonymous Coward · · Score: 3, Funny

    Am I the only person who read the headline as "Sun to Release 8-Core Viagra 2 Processor"?

    1. Re:Freudian Processor? by Anonymous Coward · · Score: 0

      Yes.

    2. Re:Freudian Processor? by ratbag · · Score: 1

      Yes

    3. Re:Freudian Processor? by ettlz · · Score: 4, Funny

      Sounds like you've got serious uptime on the mind.

    4. Re:Freudian Processor? by SpinyNorman · · Score: 1

      On that note, my SPARC/Solaris development box at work has a current uptime of about 9 months.

      AFAIK Viagra's only good for an hour or so.

    5. Re:Freudian Processor? by fm6 · · Score: 1

      Probably not. Many lamers see lame puns everywhere. Wish they'd keep them to themselves.

    6. Re:Freudian Processor? by Madsy · · Score: 1

      Or Nigeria Processor. I hope this doen't increase the number of fraud mails I get in my inbox.

  16. Re:low...... by blackicye · · Score: 1

    They're nocturnal, like Vampires..or raccoons :P

  17. Re:low...... by pimpimpim · · Score: 1
    I guess that should've been 14-year old then:

    Parent: "Hey, would you like to .."

    14-yo: "You hate me, don't you! I wish I wasn't born!"

    --
    molmod.com - computing tips from a molecular modeling
  18. Re:low...... by dread · · Score: 1

    The two year old here wants me to tell you that "you are mister poo head".

    --
    I've had a wonderful time, but this wasn't it -- Groucho Marx
  19. Not going to be the fastest, but... by zeromemory · · Score: 4, Informative
    Sun donated one of the original T2000 (based on the original 8-core 4-thread/core Niagara processor) systems to a campus organization where I'm a volunteer system administrator, so think I have quite a bit of experience with this processor. Here's my take on the Niagara2, based upon my experiences with the Niagara1:
    • No, this processor is not going to be the 'fastest' processor out there; this processor is designed primarily for workloads that don't require floating-point calculations (web servers, mail, etc), so it's not going to be the go-to processor for places like rendering farms. In fact, float-point performance on the Niagara1 was so terrible that Sun included a special cryptographic accelerator to help with SSL performance (the primary consumer of floating-point calculations on most web servers).
    • This processor architecture absolutely rocks for the purpose it was intended, though. It consumes very little power, but handles service loads amazingly well. We also have a Sun v40z (8-core Opteron server) that would barely be able to keep up with the our T2000 (that's saying a lot), and our T2000 consumes only a little more than half as much power going into our v40z (2.6A @ 120VAC compared to 4.6A @ 120VAC).
    • The inclusion of 10GbE support is going to be absolutely essential and will help make servers based upon the Niagara2 stand-out compared to servers from competing vendors. Why is 10GbE so important? I mean, we already have GbE, and most places barely have an infrastructure for that in place, right? The answer is SAN. 10GbE is going to be necessary if you're going to be using iSCSI to consolidate storage and deliver reasonable performance, and most places are heading in that direction, especially the target market for these systems.
    • Solaris Logical Domains (not to be confused with Sun Containers or Zones) is a hardware-based virtualization technology that was packaged with the Niagara1 and will probably be included with the Niagara2. Using Logical Domains, you can create independent virtual servers running different operating systems and divide hardware resources up between them, down to the individual CPU thread and PCI Express bus leaf level. Unlike software virtualization solutions, all your virtual servers are never dependent on any single virtual server (global zone, dom0, etc). This technology is making hardware virtualization a possibility for many places.

    I think the Niagara is a pretty solid design, but it's not the processor to end all processors. For service workloads, I don't think you can get a better processor, but you probably don't want one of these processors in your workstation. Sun Microsystems is also headed in the right direction, establishing an open-community around these processors and Solaris.
    1. Re:Not going to be the fastest, but... by Alioth · · Score: 4, Informative

      The floating point performance of the new processor should be like night and day compared to the old one you had: the old one apparently only has 1 FPU for the entire device - the new one has an FPU per core.

    2. Re:Not going to be the fastest, but... by BDeblier · · Score: 2, Interesting

      In normal circumstances public key cryptography doesn't touch floating point. It's multi-precision integer calculations that are required for this. But UltraSparc cpus have such a bad integer multiplier that you need to resort to floating point trickery to get a slightly better performance. It's no miracle they had to add a dedicated crypto processor to the Niagara line.

    3. Re:Not going to be the fastest, but... by phoebe · · Score: 1

      This processor architecture absolutely rocks for the purpose it was intended, though. It consumes very little power, but handles service loads amazingly well. We also have a Sun v40z (8-core Opteron server) that would barely be able to keep up with the our T2000 (that's saying a lot)

      A $16,000 machine barely keeps up with a $21,000 machine is saying a lot?

      Sun Fire V40z Server
      $ 16,995.00
      4 Dual-Core AMD Opteron - Model 885

      Sun Fire T2000 Server
      $ 21,495.00
      1 x 1.2 GHz UltraSPARC T1 - 8 Core

    4. Re:Not going to be the fastest, but... by Anonymous Coward · · Score: 0

      You forgot the part where the $16000 machine uses almost twice as much power as the $21000 machine (and thus will need almost twice as much energy for cooling, as well).

    5. Re:Not going to be the fastest, but... by TheRaven64 · · Score: 1, Informative

      Note that this is per core, and not per context. With eight contexts per core, it's still going to be a bottleneck if your code is more than 1/8th floating point calculations. On the other hand, a big part of the performance problem came from register copying from the individual cores to the FPU and back on the T1, and this should be fixed with the T2. It's still not going to be a great floating point chip, but it should be a bit better.

      --
      I am TheRaven on Soylent News
    6. Re:Not going to be the fastest, but... by the+eric+conspiracy · · Score: 2, Informative

      The threads don't execute simultaneously anyway. They are there so that processes can continue when there are I/O waits, i.e. for memory. An FPU per core should be enough.

      http://www.sun.com/2003-1014/feature/

    7. Re:Not going to be the fastest, but... by slew · · Score: 1

      Sun included a special cryptographic accelerator to help with SSL performance (the primary consumer of floating-point calculations on most web servers).

      Interesting, comment about the crypto-processor, except SSL servers usually don't use much floating point (RSA key stuff is large integer mult/modulus stuff, not floating point) so I don't think this factoid is related to the FP performance of T1 (or T2). I always thought that sun had their crypt-accelerators as add-in system boards, hmm, I'll have to take a look at this...
    8. Re:Not going to be the fastest, but... by owlstead · · Score: 1

      A crypto accelerator will do wonders for any server that needs to handle SSL. You can really get high performance if you put a hardware accelerator on any chip above 1 GHz. My C7 can do full SSL on 100Mbit with a truly slow processor, only because of the acceleration. No processor is going to match a hardware processor. That said, if you have a processor that is not fast on integer calculations, the benefits are twice as big.

      Now read this (source: http://www.sun.com/processors/niagara/M45_MPFNiaga ra2_reprint.pdf)

      "The third important change is a significant upgrade of the in-core, asynchronous cryptographic coprocessor. In Niagara 1, the crypto unit handled basic RSA/DSA public-key ciphers. The upgraded crypto unit in Niagara 2 handles just about any cipher that might be of interest, including RC4, AES, DES,and 3DES.It also handles MD-5 and SHA-1/256 hashes. Moreover, running at full core speed, it is designed to keep up with the two new,integrated 10Gb Ethernet ports, allowing encryption/decryption of packets to keep pace with the wire-speed flow of traffic off the network into memory, and out of memory into the network. The DMA engine in the crypto unit shares the core's crossbar port."

      Now note that there will be 8 of these crypto-processors. This will be one of the most crypto-enabled CPU's we've seen in a while.

  20. Re:low...... by steveoc · · Score: 1

    Then again, many of us have moved to Australia (or Portland, OR - which is almost the same thing), so don't read too much into the "time of post"

  21. Niagara by Cctoide · · Score: 2, Funny

    Niagara? I don't want to know what happens when one of these has to compute an integer overflow, do I?

    --
    "Let's face it, it's a good story. Accuracy would kill it."
  22. Re:yes but ... by cyphercell · · Score: 2, Interesting

    can it blend? - yes I'm sure it can, the iphone blended.

    speaking of which how much does this processor cost, and why doesn't Sun Microsystems make laptops, I was looking for Unix machines recently and I decided to go with the Mac book pro, rather than the Linux machines (laptops) at Dell, because of the hardware and general lack of processing power, which doesn't seem to lend itself to virtualizing other Operating systems.

    --
    Under the influence of Post-Cyberpunk Gonzo Journalism
  23. So, will it get rid of Vista/boot delays? by Anonymous Coward · · Score: 0

    I'm still waiting for an assembly that can actually allow me to work w-i-t-h-o-u-t the da.mn b..ox slowing down because something ins.i.d.e has decided to update/upgrade or otherwise do something that has nothing to do with my work. And no, it's virus clean and yes, it has plenty of resources. It's one of the reasons I like Linux, but even that has not yet solved the other problem especially laptop users have:

    Why, after 20+ years of PC chips, does it still take so bloody long to boot up? Only "resume" is reasonably usable. I'm waiting for a BIOS that doesn't show me heaps of wonderful stuff other than when I, for instance, hold a key down during powerup, I'm waiting for an OS that is smart enough to realise that I'm unlikely to rip the video card out of my box, nor is the setup of HDD + CDROM drive going to change anytime soon so it would be nice if it didn't spend a friggin' year probing for change. Probe once, register, get on with it, do not probe again on any reboot. Give me a boot option to request a re-probe instead.

    There's a opne BIOS out there that boots so fast they had to slow it down to give the HDDs a chance to spin up - no sign of improvement for the rest of the world. It's a wide open marketing opportunity..

    . I am obviously in serious need for my medicine, or at least coffee :-)

    1. Re:So, will it get rid of Vista/boot delays? by BiggerIsBetter · · Score: 1

      Sounds like you want to run Solaris 10. It can do exactly that, and it comes up VERY fast on modern hardware.

      --
      Forget thrust, drag, lift and weight. Airplanes fly because of money.
    2. Re:So, will it get rid of Vista/boot delays? by ricegf · · Score: 2, Informative

      Linspire (back in the day - I've been on Ubuntu for quite a while now) worked this way. IIRC you had to hold down a key to rescan for hardware, otherwise it assumed nothing changed and booted very briskly. I'm surprised it didn't catch on with more popular distros.

      Also, I thought http://www.linuxbios.org/Welcome_to_LinuxBIOS would get through POST and to the payload in just a couple of seconds.

    3. Re:So, will it get rid of Vista/boot delays? by Just+Some+Guy · · Score: 1

      So, will it get rid of Vista/boot delays?

      One: it will get rid of Vista in the sense that Vista isn't ported to it.

      Two: you'd reboot these things once every many months. Who cares if it takes half an hour each time?

      --
      Dewey, what part of this looks like authorities should be involved?
    4. Re:So, will it get rid of Vista/boot delays? by PenGun · · Score: 1

      Just leave it on. A PC will last longer if you don't power cycle it. I build em' fire em'up dump on slak and the machine stays on for the 3 years or so I use it.

        I'm on my 5th one. It's worth turning monitors on and off but not the computer.

  24. 5. Profit! by nowhere.elysium · · Score: 0, Flamebait

    Did anyone else notice TFA's mention of Sun's "bang up profits" of $329? Niagara for teh win! Seriously, though: the processor looks cool and all that, but how many of us are likely to seriously be able to play with one anytime soon? (And if anyone answers with 'I am!', I will happily trade jobs with you for a week)

    --
    http://xkcd.com/313/
    1. Re:5. Profit! by Anonymous Coward · · Score: 0

      Sun will send you one to try out for free: http://www.sun.com/servers/coolthreads/tnb/

  25. Re:yes but ... by Anonymous Coward · · Score: 2, Informative

    why doesn't Sun Microsystems make laptops

    They do. Ultra 3 Mobile.

    There are also the units from Tadpole, and I'm sure others

  26. Not fucking funny anymore! by Anonymous Coward · · Score: 0

    Time to flip the record.

    1. Re:Not fucking funny anymore! by Anonymous Coward · · Score: 0

      On the contrary - it has always been funny, and will never stop being funny.

    2. Re:Not fucking funny anymore! by Anonymous Coward · · Score: 0

      Wait till your company adopts it, and you get to be on the receiving end of the funny fucking! Also called "the Vista experience". -- You'll flip in record time!

    3. Re:Not fucking funny anymore! by somersault · · Score: 1

      I've told them no Vista allowed. One employee went ahead and got a Vista laptop (in our Houston office, rather than our UK office), I said no freakin way, and after a chat with me her manager had her take it back to the store..

      If he'd said anything else then that's probably a good time/reason for me to quit and move on! On to living in the middle of nowhere, possibly farming potatoes instead of managing IT for a bunch of crazy oil industry workers.. I've noticed a couple of my hairs are going grey/white, and I'm only 23!

      --
      which is totally what she said
  27. Because they process information like water. by Colin+Smith · · Score: 1

    Then why call then Niagara? Because they'll go through information the way that Niagara Falls goes through water.

    http://www.personaltours.ca/niagara-info.html

    --
    Deleted
  28. CURSES! by Anonymous Coward · · Score: 1, Funny

    How dare you correct your own mistakes!

    I, and my fellow grammar Nazi overlords, were just about to rip your lousy post to shreds.

    1. Re:CURSES! by MysteriousPreacher · · Score: 2, Funny

      If you were a real grammar nazi, you'd be able to point out all the other errors I made. I demand to see your badge!

      --
      -- Using the preview button since 2005
  29. Re:yes but ... by yoder · · Score: 1

    Looks like the Ultra 3 mobile might have beeen a nice laptop, but it's no longer orderable. I didn't look to see if they have anything to replace it.

    --
    "In a time of universal deceit, telling the truth is a revolutionary act!" -- George Orwell (Eric Arthur Blair)
  30. That's not fair! by SpeedyGonz · · Score: 2, Funny

    Only one silly meme per customer please.

    1. Re:That's not fair! by TheBOfN · · Score: 2, Funny

      ...In Soviet Russia Linux running Beowulf clusters blends You!

    2. Re:That's not fair! by somersault · · Score: 1

      "...In Soviet Russia Linux running Beowulf clusters blends You!"

      I for one welcome our new Soylent Green producing, waterfall powered overlords..

      --
      which is totally what she said
  31. Re:yes but ... by Rynth · · Score: 1

    Will it run Doom?

  32. 2x10Gb Ethernet by Doc+Ruby · · Score: 1

    To me the most exciting part is that they're putting 2x10Gb ethernet ports directly on the CPU. The crypto is cool too: I hope it's not encapsulated entirely in the ethernet, so apps can call it directly.

    If they made these CPUs cheap enough, we could put them on PCI-e cards in a Xeon, and run a Linux cluster over the PCI-e, coordinated by apps running on the Xeon. Or maybe stuff a Niagara/PCI-e box with extras, like we used to do with Mac Quadra 950/NuBus cards. But this time with 20Gbps ethernet per node, for a networked grid of nodes.

    --

    --
    make install -not war

    1. Re:2x10Gb Ethernet by asaul · · Score: 1

      The crypto is exposed by kernel interfaces and OS library interfaces. Basically if you link to the /usr/sfw/lib OpenSSL built into Solaris 10, you get the benefit of CPU core crypto acceleration.

      We have a few of these boxes - while the straight line speed is noticeably lacking on some tasks, the box runs everything thrown at it quite well. The odd part for me was realizing that a process using 3% cpu in prstat was actually consuming an entire CPU thread.

      --
      "If everybody is thinking alike, somebody isn't thinking" - Gen. George S. Patton
  33. Oh no, not again.... by ricky-road-flats · · Score: 5, Insightful
    It's already been said, but that's a big glossy load of poop.

    The quads from Intel provide four physical cores per socket. That is the definition of a quad in this context. The exact workings of how many bits of silicon there are, how they talk to each other and to the rest of the system is, to 99.999% of users and computer buyers, background fluff.

    This was the same as when Intel put two single-core chips into a package to release a 'dual core'. Lots of people like you jumped up and down and pointed out it wan't *real* dual core, and how the FSB issue would cripple performance. Amazingly, it wasn't the case - they sold in droves, and real-world performance was good enough to carry Intel through to the 'true' dual core, the Core 2 Duo.

    If the competition had anything out that was the same cost and performed significantly better than the 'fake' quad cores, you would have an argument. But they haven't and you don't. Bear in mind I'm talking about the huge x86/x64 market, not the relatively low volume non-x86 server market.

    What Intel did back then and again now is perfectly sensible. They have millions of high yield, robust dual core chips being churned out, and they have built into the infrastructure the ability to put two into a package, lower the speed a bit to drop the per-core heat output, and sell reasonably priced (now) quad core chips. When the drop to 45nm happens, they will release their 'real' quad cores, and pretty quickly put two of those into a package to start selling oct-core (whatever we're going to call them). And so it goes.

    What's the alternative? Not sell quads until 45nm comes out? Not working out too well for AMD is it? I've asked the question before here and on realworldtech.com - at what point will the FSB problem actually become a painful problem for the Intel chips? Well, not yet (4 core) is the answer, despite dire predictions from the AMD camp for years. My gues is that, shock of shocks, Intel have actually thought it through - and that's why CSI is coming. When the number of cores gets to the point where FSB will actually hurt performance relative to the AMD architecture, that's when CSI will kick in. Maybe at 8 cores, maybe at 16.

    What, you don't need quad core yet? Fine, stop your bitching and choose what's right for you. Vive la difference, and 3 cheers for a market that gives us the choice.

    1. Re:Oh no, not again.... by Eukariote · · Score: 2, Insightful

      The quads from Intel provide four physical cores per socket. That is the definition of a quad in this context.

      Well, yeah, and if I say the context is the motherboard, then I can define a four-socket board holding single-core CPUs to be a quad core chip. It would be equally ludicrous in the context of chip design.

      ... the FSB issue would cripple performance. Amazingly, it wasn't the case - they sold in droves

      I take it you haven't looked at proper SMP benchmarks. Sitting on the same FSB sucked rather badly for SMP scalability. What sells is a function of marketing and perception. When it comes to managing that, Intel rules. That's why you have to search hard to find any real-world SMP benchmarks being done by the various bought-and-paid-for review websites.

    2. Re:Oh no, not again.... by Wesley+Felter · · Score: 1

      Sitting on the same FSB sucked rather badly for SMP scalability.

      If the quads are cheap enough, it doesn't matter. Sure, you only get 50% performance improvement instead of 100%, but if the price is only 20% higher then it's a net win.

    3. Re:Oh no, not again.... by Apocros · · Score: 1

      Well, yeah, and if I say the context is the motherboard, then I can define a four-socket board holding single-core CPUs to be a quad core chip. It would be equally ludicrous in the context of chip design. ignoring the cores-per-die discussion, i find it interesting that you view asic (eg chip) design to stop at the die level. package design and development is a *huge* part of designing and manufacturing an asic. the two parts are designed together, and i find it difficult to view them as completely separate entities.
      --
      "onward!" cried the copper man, little knowing brass corrupts...
    4. Re:Oh no, not again.... by Eukariote · · Score: 1

      I find it interesting that you view asic (eg chip) design to stop at the die level.

      I don't, in general, but in the case of Intel's current quads that is basically where the design stopped: there is no special design provision for an interconnect in the package. The dual-core dies are pinned out as two loads on the same front-side bus, electrically equivalent to having each die packaged separately and sitting in a separate socket connected to the same FSB.

      When you have two dies in a package that are designed to allow a close intra-package coupling by design, like the two-die core/L2 package used for the Pentium Pro when it was first released, then I agree it's valid to view the package as an integrated whole.

  34. Price.. by blinx_ · · Score: 1

    One quite important point with the T2000 cost is that Oracle requires 0.25 license per core as opposed to 0.50 license/core with Intel/AMD systems.

    I've got 2 T2000/32GB ram boxes here and if you remember their limitations and run what they are designed for, they are awesome.

    --
    Resistance is not futile - www.gnu.org
  35. The new Sun Moto: by teknopurge · · Score: 3, Insightful

    "Do No Evil"

    It's like it's 1999 all-over again, except this time Sun actually has revenue in-line with expectations. I continue to maintain Sun is this century's Bell Labs and Xerox PARC all rolled into one.

    1. Re:The new Sun Moto: by fm6 · · Score: 1

      You just cited two outfits that created lots of cool technologies, but never benefited from them commercially, due to the ineptitude of their parent organizations. Hopefully Sun can do better than that.

  36. Re:Man, what is with the defensive Sun nuts? by masdog · · Score: 1

    Your troll-fu is weak, Young Sycraft. You're arguing something we already know, and something that most people will acknowledge - Sun isn't for the hobbyist or the home/small business market.

    But saying they don't get press is a misnomer. They do get press - in the publications that matter to their market. You'll find ads and articles for Sun in places like CIO magazine, Infoworld, and Information Week. Tom's Hardware, in the big scheme of things, is great for commodity hardware review, but when you're building a 24/7 datacenter, they are probably the last place to look for information.

    And finally, with there being a greater focus on virtualization and data center consolidation, a server that can handle 64 simultaneous threads will go a long way to conserving datacenter floor space, lowering cooling costs, and using less electricity. So it may have a greater upfront cost, but if you factor it out over 10 years, it will pay for itself in lower energy bills for the datacenter.

  37. Actally, SUNW stands for.... by Name+Anonymous · · Score: 1

    (hence, the stock ticker....SUNW, W=workstation) Actually, SUNW stands for Stanford University NetWork.
    1. Re:Actally, SUNW stands for.... by southpolesammy · · Score: 1

      Whoops.....

      Guess I have to hand in my Sun CSA now......

      --
      Rule #1 -- Politics always trumps technology.
    2. Re:Actally, SUNW stands for.... by /tmp · · Score: 1

      no, actually the previous poster was correct (sort of)
      the original meaning of SUNW was Stanford University Network Workstation.
      (Sun's original product line was mainly engineering workstations)

      But the meaning has been updated for the new business direction that the
      company is focused on and it now stands for "Sun Worldwide"

      http://www.sun.com/aboutsun/investor/faq/index.htm l#04

  38. Re:Man, what is with the defensive Sun nuts? by ajs · · Score: 2, Informative

    most people just don't care about them. Tom's Hardware is not going to be reviewing them for the enthusiast market, for example, they are waaaaay out of that range. Same shit with the Power 6. Great chip, coming never to a desktop near you. These specialised high end products are just not of mass interest if for no other reason than price. Which Sun has never cared about. They sell to the high performance market, which (outside of trivially distributed applications like serving Web pages or rendering CG) NEEDS the kind of horsepower than Suns can crank out.

    Even if that cheap server breaks right at the end of its warranty, it is still a money saver, a big one. This is almost always a red herring. Price of a system is rarely a company's most significant cost (within an order of magnitude) when you're dealing with high performance computing. It's the people and the data vendor relationships that usually cost you the bulk of your outlay. Hardware of just about any sort is fairly cheap by comparison.

    I understand the market for enterprise systems, I also understand that it is small. Hrm... small? Well, not really. It's got a low number of players, but those players have a voracious appetite for processing power.

    Don't get me wrong. Most of a large financial house doesn't need a Sun server. However, those who do (e.g. quants) really do. Same goes for government, biotech, military contractors, etc.
  39. Niagra 2 my precious by jshriverWVU · · Score: 0

    Me wants it...my precious.

  40. These would make great backup servers by Anarke_Incarnate · · Score: 1

    I have been actively interested in the T1 and T2 series for a while. Currently, my backup server at work is a v880 (Sparc III) with 8 GigE interfaces.
    I could replace it, and get more throughput from a T2000, but the issue was doing restores would lose that edge from poor single thread performance

    The Niagara 2 series is set to have 1.4X the single thread performance, plus the higher simultaneous threads (Though a slightly longer pipeline).

    Since I am moving away from tape and going to Virtual Tape Library tech, I won't be constrained by how many backups I can do and avoid over multiplexing. I plan on doing 24-32 (or even more) simultaneous backups to virtual tape drives without skipping a beat. The only thing then will be keeping the network from being over-saturated.
    Don't have any 10Gbe switches in house yet, but that can't be too far off. I'd likely put in 2 4 port 1Gbe cards and pump them like no tomorrow. I'm getting about 20-30MB/sec from each machine, so assuming 140MB/sec on a GigE port, and 8 of them, I can handle over 1100MB/sec, but doing 32 backups would be about 950MB/sec. It is close, but should work.

    1. Re:These would make great backup servers by asaul · · Score: 1

      When we looked at this a V445 was much more suitable - multiple PCI controllers, more slots and better bandwidth - and enough CPU to keep up with the lot.

      We looked at a T2000 for a similar role, IIRC the problem with a setup like you are proposing is that they only have one PCI controller, so that will be your weakpoint as all your SAN and Network traffic will be hammering that poor puppy. Keep in mind everything coming in on the network, will likely go out the SAN, so your real "work" throughput will be roughly half of the capacity of the PCI controller. The CPU wont enter into it too much - there is more than enough to handle it.

      --
      "If everybody is thinking alike, somebody isn't thinking" - Gen. George S. Patton
    2. Re:These would make great backup servers by Anarke_Incarnate · · Score: 1

      How is it possible they have 1 PCI controller when they have PCI Express and PCI X? The T2000 server has multiple slots and at least 2 controllers. I could throw the extra 4 port GigE on the PCI X slot and fibre channel on the PCIe.

      However, I still don't plan on going T2000. The Niagara 2 should be a different platform and it will be interesting to see how that pans out.

  41. Re:Man, what is with the defensive Sun nuts? by maney · · Score: 1

    So it may have a greater upfront cost, but if you factor it out over 10 years, it will pay for itself in lower energy bills for the datacenter. That's the whole rub of this argument. Sycraft, like most of the PeeCee hardware crowd, is only looking at the original hardware price and warranty cost for 2-3 years. People who buy big iron tend to look at the TCO over a much longer period of time and include more things into their equations (downtime, power, cooling, consolidation, training, etc.).

    It's an apples vs. oranges argument. Pick the tool based on what you need, not on what you see on the shelves at Best Buy or Fry's.
  42. Re:yes but ... by swordgeek · · Score: 1

    Last I heard, they don't. However, the UltraSPARC line isn't really what you want in a laptop anyways. Much better to get an X64 laptop and run Solaris10/x86 on it. You can use the list of tested and proven hardware for Solaris x86 to make sure it'll run without fiddling.

    There's been a lot of (justified) doubt in the past about Sun's commitment to Solaris x86, bit it clearly is the future of consumer-directed Solaris. And it rocks.

    --

    "People who do stupid things with hazardous materials often die." -- Jim Davidson on alt.folklore.urban
  43. Re:yes but ... by yoder · · Score: 1

    I agree about the x64, and I am/was one of those doubters.

    --
    "In a time of universal deceit, telling the truth is a revolutionary act!" -- George Orwell (Eric Arthur Blair)
  44. manages != runs, a.k.a. "the 64 threads bullshit" by Anonymous Coward · · Score: 1, Insightful

    All that and the 64 threads run at 84 watts maximum (not TDP)

    Please note that it doesn't *run* 64 threads simultaneously. It *manages* 8 threads per core -- but each core has only two integer units, one load-store unit, and one floating point unit. At best a core can have ops from four different threads in simultaneous execution, but this will be a very rare case (when int, int, float, load/store happen at same cycle). Most often each core will be able to simultaneously execute instructions from just one or two threads -- which all is still excellent for 84W!

    It just irks me when people read "manages 64 threads" as "is a 64-core uber-chip", when what they have is just a wider version of Intel's idling-eliminating HyperThreading (in each of the eight cores). Surprisingly Sun PR hasn't made much of an effort to remedy the misconception :-P

  45. Re:low...... by texwtf · · Score: 1

    Yeah. I have to use an umbrella every time I put a shrimp on the barbie.

    (Of course that's nonsense, none of us ever use umbrellas)

  46. Re:yes but ... by zerocool^ · · Score: 1


    [...] and why doesn't Sun Microsystems make laptops, I was looking for Unix machines recently and I decided to go with the Mac book pro, rather than the Linux machines (laptops) at Dell, because of the hardware and general lack of processing power, which doesn't seem to lend itself to virtualizing other Operating systems.

    That's odd.

    I'm looking at right now a Dell Latitude D630 that has 3.5 GB of addressable ram, a Core 2 Duo T7300 dual core chip @ 2 Ghz, and an NVidia Quadro graphics card. And it's running Centos 5 just fine.

    ~Wx

    --
    sig?
  47. Reminds me of a classic Onion story... by CopaceticOpus · · Score: 1

    Fuck Everything, We're Doing Five Blades

    (The funniest thing about this article is that a year after they published it, Gillette actually did release a five-bladed shaver!)

  48. Threads Are the Work of the Devil by MOBE2001 · · Score: 2, Funny

    Each core supports eight threads, so the chip handles 64 simultaneous threads, making it the centerpiece of Sun's "Throughput Computing" effort.

    Wow! Only 64 threads, eh? That's the problem with threads, you can't have too many of them because switching from one thread to another is very expensive, cycle-wise. In other words, as long as threads remain the only multitasking mechanism used by the computer industry, super fast, fine-grained multiprocessing will remain a dream. It gets worse. There is another problem with threads that is even worse than this. Threads are inherently asynchronous. Until and unless the computer industry comes to its senses and realizes that asynchronous processing makes it impossible to implement programs with deterministic timing, we will continue to pay the heavy price of software unreliability. Switch to a non-algorithmic, signal-based, synchronous software model (with the supporting CPU architecture), and the problem will disappear. Threads suck! Period. One man's opinion.

    1. Re:Threads Are the Work of the Devil by grigori · · Score: 2, Informative
      Above guy is confused partly because Sun reused the word "thread" to mean "hardware thread" when most people think of "software thread" like in Java or pthread_create().

      Hardware threads are virtual CPUs sharing resources on a core so work can proceed when a thread stalls on cache miss - the hardware switches to a new thread in a SINGLE CLOCK. Not expensive. Time-slicing between many software threads on a few CPUs can be expensive but having many hardware threads to run those threads makes the problem tiny. In fact, this design is great way to make multithreaded applications run real fast.

    2. Re:Threads Are the Work of the Devil by MOBE2001 · · Score: 1

      Above guy is confused partly because Sun reused the word "thread" to mean "hardware thread" when most people think of "software thread" like in Java or pthread_create().

      Thanks for pointing out the false assumption but it does not matter. Like it or not, the computing world is going parallel. What I mean is that the trend is toward a future where fine grain parallelism occurs at the operation level, not the algorithm level as it is now done with threads. The Transputer was a precursor of this trend. It's not going to happen because of threads but in spite of them, I can guarantee you that. There is a simple way to accomplish fine-grain parallelization that does not incur the overhead of threads. However, it would have to be supported at the CPU level.

    3. Re:Threads Are the Work of the Devil by grigori · · Score: 1

      I'm glad we're on same page for definitions! I agree world is going parallel - since you can only push single CPU performance so far and we're already getting diminishing returns. I disagree that there will be fine grain parallelism at the operation level. That's been tried over and over since 360/91 and the pipelined, OOO machines since then, and that doesn't scale. Operand dependancies and branch prediction mean you really can't go that way as far as we need, and makes processors really complex. Look how IBM had to make POWER6 not have the OOO they highlighted in POWER5. No, parallelism at program unit level is the way to go for most workloads. But, it's worth you opening a blog on if you got argument to that says otherwise - go for it.

    4. Re:Threads Are the Work of the Devil by Anonymous Coward · · Score: 0

      switching threads is not hard. the niagra can mux threads in 1 cycle and work on multiple threads at the same time. there is specifically a pipeline stage for thread select as well as the required extra WARFs.

      i beleive the niagra was targeted to apps using independent threads with minimal interaction.

      While some of your conclusions are valid in general, I don't think they address the market Sun is targeting.

    5. Re:Threads Are the Work of the Devil by Anonymous Coward · · Score: 0

      Actually, the T2000 switches threads in 0 clocks, not 1. Even better.

  49. Re:yes but ... by cyphercell · · Score: 1

    Thanks for the info. but I'm sure you can forgive me if I just want something off the shelf. I also, want the virtualization software to just run, and I need to virtualize whatever OS I want. Your system would be nice if Dell was selling them with the option of adding Xen, and incidentally running a *nix system from the start. Honestly, I don't understand why Dell doesn't offer this sort of thing it kinda seems like a waste M$ on a 64bit system, selling Linux without a 64bit option is kinda goofy too.

    --
    Under the influence of Post-Cyberpunk Gonzo Journalism
  50. OT: IBM Power6 by sysadmintech · · Score: 1

    Anyone got links to user discussions or reviews on 4.7GHz Power6 since they've been out for 2 months? I've read all the IBM, Oracle, Data Center promo stuff.

  51. The proper question is by Anonymous Coward · · Score: 0

    "Will a beowulf cluster of these run Vista?"

    Well, no. It wouldn't. I guess the joke wasn't that funny then :(

  52. Re:manages != runs, a.k.a. "the 64 threads bullshi by imroy · · Score: 2, Informative

    Please note that it doesn't *run* 64 threads simultaneously. It *manages* 8 threads per core -- but each core has only two integer units, one load-store unit, and one floating point unit. At best a core can have ops from four different threads in simultaneous execution, but this will be a very rare case (when int, int, float, load/store happen at same cycle). Most often each core will be able to simultaneously execute instructions from just one or two threads -- which all is still excellent for 84W!

    Quite true. For more info on the way its cores work, see the UltraSPARC T1 article on Wikipedia (which I have edited quite a bit). Each core is a barrel processor, meaning each stage in the pipeline is handling an instruction from a different thread. This adds complexity, but in exchange it means that branch mis-prediction is no longer a problem - any branch instruction has already been through the execute stage and the Program Counter modified before the next instruction of the thread gets fetched.

    The other big advantage with the multi-threaded UltraSPARC T1/T2 design is that it has high throughput. While a single-threaded CPU has to wait on cache misses, the T1/T2 just continues chugging along with its remaining threads. It's switching threads on every clock cycle, so each thread gets only 1/8th of the 'power' of each core. But because it's doing something on every single clock cycle, it can do a lot of work - as long as the work is multi-threaded. That's its weakness.

  53. Sacrifices everything for multithread throughput by wirelessbuzzers · · Score: 1

    I was lucky enough to get to play around with a Niagara 1 demo unit a year or so ago, and it was mediocre as a general-purpose server. The system was amazingly fast if you could keep its 32 threads saturated (and I notice that the new one is 64 threads), but if you were only running, say, 8 threads, you would do as well on a more mundane server. I don't have the exact numbers here, but from what I recall:

    The first test was a "make" test. On my desktop machine (generic dual-core Athlon), configure for some large software package (BerkeleyDB, I think, to run more benchmarks on) took a minute, and make -j 3 took 5. On the Niagara, configure took 5 minutes, and make -j 40 took only one.

    For high-concurrency database benchmarks, the cost of synchronization made the Niagara slower than a standard AMD-based server. For a less concurrent load, the Niagara was of course much faster. Interestingly, a dual-core server performed much better here than a dual-processor single-core server, because the synchronization cost was lower.

    For web applications, the Niagara did well for simple applications, but introduced unacceptable latencies for more CPU-intensive ones.

    For anything floating-point, the original Niagara choked due to its single FPU, but that's what the T2 is supposed to fix.

    --
    I hereby place the above post in the public domain.
  54. Re:Man, what is with the defensive Sun nuts? by Miguelito · · Score: 1

    They sell to the high performance market, which (outside of trivially distributed applications like serving Web pages or rendering CG) NEEDS the kind of horsepower than Suns can crank out. Hah... the engineers I support, that design ICs, have been steadily moving off the SPARC platform onto linux/x86_64[*] for several years now. Even the big IC design ISVs are completely standardizing on Linux now. These are not (other then for a very few apps) parallelized applications. These guys simply need to run massively cpu bound compute jobs, some of them (at last stages in the design process) for days.. and Sun just hasn't come even close to being competitive in this area in years.

    There's some (very little) talk of Sol 10 x86, mostly pushed by the "If it ain't Sun/Solaris, it's crap!!" crowd.. but its' not getting much more then a little mention from the ISVs because to them it's another whole platform to support, and why bother when linux is working ok and they did all that work to move to linux (at customer request) a few years back.

    Price of a system is rarely a company's most significant cost (within an order of magnitude) when you're dealing with high performance computing. It's the people and the data vendor relationships that usually cost you the bulk of your outlay. Hardware of just about any sort is fairly cheap by comparison. True, but for us the highest cost is the license costs of the software. So they want to get as many jobs as they can out the door as fast as they can, and SPARC can't do it.

    Now the big push from us (IT and engineering) is for the ISVs to parallelize their tools as much as possible. They were ok for awhile there when CPU speeds kept going up, but now that the CPU makers have basically given up on the speed jumps and are going multi-core, the tools need to parallelize too. Thus far, the multi-core and multi-hosted tools still run orders of magnitude faster on linux hosts vs even the latest from Sun.

    [*]There was a very short stint with IA64 as king for huge RAM jobs (128Gig+ in the box) for the last testing stages, but x86_64 overtook the cpus in speed awhile ago, and now you can get 256Gig+ hosts (we've recently ordered a few of those). We've got a small amount of 32bit linux boxes left too.. mostly for support of older tools.
    --
    - My favorite error message: xscreensaver, running on an old Sparc 5 w/ 8bit color: bsod: Couldn't allocate color Blue
  55. Re:manages != runs, a.k.a. "the 64 threads bullshi by Anonymous Coward · · Score: 0

    Thanks for the clarification.

    The 84 W is impressive considering that in addition to the CPU itself, not only is the 921 Gb/sec RAM controller on the chip (AMD does that too) but also the two 10 Gb Ethernet links (at 50 Gb/sec) and the PCIe interface (at 40 Gb/sec). Normal Intel north bridges don't have 10 GbE, period, and still consume plenty of power alongside the CPU.

  56. Sun's Home in the Datacenter by BookRead · · Score: 1
    The name of the game that Sun is playing is the datacenter. A really significant cost of running a serious enterprise with dozens or hundreds of servers is in the supporting infrastructure, things like electric capacity and cooling. The important metric is computing power per watt. And computing power isn't just CPU power but also things like I/O bandwidth. Sun has always excelled at the I/O portion of the program. The Niagara processors improve the performance of multi-threaded applications but shortening the calls to memory and cache.

    Where I work our datacenter is a bit constrained on space, power, and cooling. Adding these bad boys allows us to support many more applications, websites, and whatever else the business wants with less power and cooling and capital cost than what we used last year. And, yes, you can get a three year warranty on brand name Intel servers but the reliability and serviceability of Sun gear lasts way beyond three years.

    I think their desktops suck. And I wasn't too much of a fan of Solaris until Sol 10. It was boring. Run Solaris x86 if you want to try it cheap. Linux has made it much better by forcing new features liek Dtrace and ZFS. The cost of entry is a bit steep (and over powered) for SME but if you want serious computing power you can do much worse than Sun. They've been written off more times than I care to count (kind of like Apple) but they're still standing.

  57. Re:Man, what is with the defensive Sun nuts? by ajs · · Score: 1

    They sell to the high performance market, which (outside of trivially distributed applications like serving Web pages or rendering CG) NEEDS the kind of horsepower than Suns can crank out. [...] These are not (other then for a very few apps) parallelized applications. You're confusing distributed computing (something that works very well on piles of cheaper boxes) and parallel execution (something that requires very specialized hardware that's not just designed well for parallelization at the chip level, but throughout the architecture. Sun spends an awful lot of time building that kind of hardware.

    Now, I'm no Sun fan boy. I work with Linux exclusively these days because that's the world I live in, and it's what my application wants, but if I were in the business of high-powered computation and I needed a box to run a parallel, non-distributed application, I would run it on a Sun.

    There was a very short stint with IA64 as king for huge RAM jobs (128Gig+ in the box) for the last testing stages, but x86_64 overtook the cpus in speed awhile ago, and now you can get 256Gig+ hosts (we've recently ordered a few of those). We've got a small amount of 32bit linux boxes left too.. mostly for support of older tools. I think you live in the world where very fast, single-threaded, memory-hungry computation is king. In that world, Intel running Linux might be the right call. Just don't make the mistake of assuming that all the world is your back yard.
  58. Re:yes but ... by socz · · Score: 1

    can it blend? - yes I'm sure it can, the iphone blended.


    "Sun Smoke. Don't breath this."
    --
    My abilities are only limited by my imagination
  59. Re:yes but ... by Anonymous Coward · · Score: 0

    SPARC64 is a waste of money. Don't buy it.

    Buy x86 or x86-64 instead.

  60. Netbackup media server by mr_sparc · · Score: 1

    This processor is ideal for a Veritas Netbackup media server. Netbackup is heavily multi-threaded, and getting the backup data across the network to the server will be tremendously aided by the on-chip 10G bandwidth. Symantec has been doing this with great success.

  61. Consolidation with LDoms by johnjmclaughlin · · Score: 1
    A major feature of the T1 processor and the Niagara 2 is called LDoms - logical domains. With this technology, you can divide the processor into a bunch of logical domains, each with is own O/S and dedicated resources. If you don't have a workload that can use 64-way scaling but instead you have 8 workloads that can each use up to 8 threads, then you can use LDoms technology to assign the resources and create 8 virtual servers.

    Imagine 8 web servers each on it's own "server" in an LDom - with the ability to handle multiple requests. While you can do something similar with Solaris 10 containers, LDoms give you control over the memory and independent O/S images.

    LDoms can be used with containers. In my example, those 8 "servers" could be owned by different departments who could use containers to have a test and development containers within the same LDom.

    People are having good results from similar efforts with VMware for a non-trivial free. By the way, LDoms are free.

    Did anyone mention that the T1-based servers such as the T1000 and T2000 don't require a lot of power? 300-350 watts Replacing 8 older Sun SPARC servers with one (as in my example) can save a lot of space and electricity.

    -johnj

    --
    John J. McLaughlin, Editor-in-Chief/CTO, System News Inc. Publishers of "System News for Sun Users"
  62. Re:Sacrifices everything for multithread throughpu by lopgok · · Score: 1

    I was also 'lucky' enough to play around with a Niagara processor.
    I happened to have my own highly threaded app.
    It ran really, really slowly. The sun engineers eventually figured out that two of my
    threads would saturate one of their processor cores.
    (there was zero fp in my code).

    With 32 threads, their fastest T2000 took 10,600 seconds.
    My old, dual xeon 2.4ghz machine took 11,494 seconds.
    An 8 way opteron 2.8ghz machine was 11.6 times faster than the niagara.
    A a Sun Sparc V890 8cpu, 16 core was8.3 times faster than the niagara.

    Since their t2000 cost around $20k, and a dual 2.4ghz xeon cost around $500-$1000,
    that made the price/performance roughly 20-40 times better for a xeon than the niagara.

    Perhaps their new chip is higher performing than their earlier generation.

    You can see all the gory details at www.weasel.com/comp-perf.html
    source code available upon request.