Slashdot Mirror


Sun Releases Starcat

SilentChris writes: "Sun has released the Starcat server, a beast with up to 106 processors running Unix. Anyone have an extra couple [million] bucks lying around?" They're not cheap.

27 of 305 comments (clear)

  1. Yeah by SpanishInquisition · · Score: 3, Funny

    At last a platform to get descent java performance...

    --
    Je t'aime Stéphanie
  2. just wait by smnolde · · Score: 5, Funny

    until it shows up on e-bay from a disgruntled former dot bomb employee who five-fingered it from a linux shop which stole BSD code.

  3. For those beowolf comments by segfaultcoredump · · Score: 5, Informative

    Lets remember, that this system is not intended to replace a beowolf cluster of cheap pc's. It is intended to do something that most beowolf clusters can never do: present a single OS image with half a terabyte of memory that any cpu can access at very high speed.

    This is a system that is very good at things like fluid dynamics and massive database operations. It is not a good idea if all you want to do is get to the top of the list for the SETI@Home project

    1. Re:For those beowolf comments by nion · · Score: 5, Funny
      It is not a good idea if all you want to do is get to the top of the list for the SETI@Home project.

      Been there, done that. Tech here working with the StarFire used to run Seti@Home on idle systems. 64 400MHz UltraSparcs. Team Sun@Home rose rather quickly in the ranks those days, I hear. ;)

      --
      der dee der.
    2. Re:For those beowolf comments by Webmonger · · Score: 4, Informative

      In terms of memory bandwidth and latency, they are very different.

      The fastest networking technologies do not approach the speed and responsiveness of a memory bus. Yet a cluster design uses networking in place of a memory bus some of the time.

      If there's not a lot of data, it doesn't matter much. If there's tons and tons of data, a cluster design is inefficient.

    3. Re:For those beowolf comments by tolldog · · Score: 3, Informative

      Agreed.
      This would make a killer render system, assuming the renderer can handle that many threads.

      This is why beowulf rendering is bad. Network performance for shared memory sucks.

      With renders hitting the 2GB + mark for memory useage, do you really want a network passing that data arround.

      What could happen with systems like this is that the render time vs. load time would get extremely lopsided. 30 minute loads and under a minute a frame. It would force a rethink of how the render jobs get distributed and ran.Best case would be a few of these, for each different departments render needs. But then we are talking 20+ million for rendering. That buys a lot of intel boxes.
      If I was given one, I would try to use it. But I don't think I could ever seriously suggest buying one. But that is me and my particular application.

      --
      -I just work here... how am I supposed to know?
  4. They need one! by MSBob · · Score: 3, Funny

    Sun themselves are in a need for boxes like that as their website seems severely slashdotted right now.

    --
    Your pizza just the way you ought to have it.
    1. Re:They need one! by friscolr · · Score: 3, Funny

      sun put the dot in dot com,
      but slashdot took it out.

  5. Re:106? by segfaultcoredump · · Score: 5, Informative

    The system grows to 106 in the following way:

    There are 18 "cpu/memory" boards that hold 4 cpu's each. This brings the system up to a total of 72 cpu's and 576GB of ram.

    Now, if you want an server that just does number crunching and dont care about I/O, you can then add 'MaxCPU" modules. Each module holds two additional cpu's (no memory) and occupies the hPCI module slot (a hot swap PCI case that can hold what looks like two to four pci cards). You can use up to 17 of the hPCI module slots to hold MaxCPU modules. (there are 18 pci channels on the system, and at least one must be used for accessing the boot disk).

    So there ya have it, 106 cpu's and half a terabyte of ram. I think that in most cases, folks will opt to not use the MaxCPU modules and just stick to the 72 cpu limit.

  6. All I want for Christmas... by digital_freedom · · Score: 4, Funny

    Dear Santa,
    I've been a real good gEek this year. I wrote several white-hat worms to fix IIS holes. I defended IP rights in the Linux kernel. I also mirrored the LOTR trailer.
    Could I please get just one little old Starcat Server from Sun? Please make sure it is the 106 processor version with 576 GB of RAM.
    I will be real good and use my idle time for SETI.

    Your pal,
    digital_freedom

    P.S. Chocolate chip cookies are your favorite right?

  7. Re:Clarification please by Cheetahfeathers · · Score: 3, Informative

    These can be either. It depends on how you configure it. And the fun this is, you can reconfigure it on the fly. You want a cluster in a box? You got it. You want 2 seperate instances of Solaris running, each using 1/3rd the resources of this box, while you pull out the hardware on the rest of the box for maintenance? You got it. This thing is _configurable_. You can hot swap everything except the backplane, pretty much. It's _sweet_.

  8. Cool by Buzzwang · · Score: 3, Insightful

    I'm glad to see that some companies are at least trying to accomplish new things and come out with new products given the state of our economy and markets and such. Even if people think it is overpriced or under-powered and what not that still doesn't degrade the fact that it is a relatively new product in a squishy market. Personally, I own a few hundred shares of SGI stock, but I'm still happy to see any tech comapny suck in their gut, tighten their belt and release a new product in this market. Makes me want to believe the tech markets will turn around sooner than people believe. Kudos to Sun for still working on new products and trying to generally improve things. Now, if Cray would follow suit, I'd be a happy man...

    --
    Things you can say to your dog that you can't say to a girl: "How about a nice bone?"
  9. The marketrons are going to _freak out_. by Anonymous Coward · · Score: 5, Funny

    "Wow, look at all the hits we're getting on the Starcat shopping cart! We're going to make a mint on these suckers!"

  10. partitions by cornflux · · Score: 4, Interesting
    CNET article: The system can simultaneously use 900MHz processors with faster models yet to come. However, each partition requires all processors to run at the same speed, so faster chips will have to run in a partition of their own.

    As someone who does nothing with these types of systems, nor follows them, I think it's great that you can have different processor speeds using "partitions."

    I wonder if memory is treated the same way... i.e., separated by "partitions," or if you also have a choice to use it as one, large unified memory resource... or, I wonder if memory can be dynamically partitioned... hmm.

    Actually, now that I'm thinking about it... are all of the processor partitions considered peers? I mean, are the partitions all treated as if they were a single processor... then treated equally?

    1. Re:partitions by Doctor_D · · Score: 5, Informative

      According to the specs each processor board holds 4 processors and 32 gigs of memory.

      Now, if the starcat treats domains (partitions) the same as the E10k (I haven't been to training yet on it), then each domain at minimum will consist of 4 processors and 32 gigs of ram, ie 1 processor board. Basicaly these doamins are treated as seperate boxes as far as Solaris is concerned. You configure a domain to say contain 2 system boards, and then when you load Solaris, it then sees 8 processors and 64 gigs of memory. This way you can allocate resources as the need fits. But this means it doesn't look like the virtual processor that mainframes present.

      The starcat may deal with processors above 72 in a different way, but I honestly don't know at this time how it deals with them.

      Hope this helps answer your question.

      --
      "If you insist on using Windoze you're on your own."
  11. I'll bet their hoping by ch-chuck · · Score: 3, Interesting

    to get on the National US ID Card database bandwagon with Oracle... It'll only need to store about 300 million records with DNA, fingerprint, picture for facial recognition software, key escrow, etc...

    --
    try { do() || do_not(); } catch (JediException err) { yoda(err); }
  12. Re:Scheduling airplanes? by Derkec · · Score: 5, Interesting
    "those conducting computationally intense tasks such as scheduling airplanes"

    "Huh? I understand that the nation's air traffic controllers may need updated equipment in light of the existing crisis, but how hard can scheduling be? I could see a use for a massively parallel monster like this in, say, flow-through or structural analysis or something, but scheduling? "


    What your missing is that this isn't a matter of airtraffic control. This is a matter of determining which planes and crews to fly to which locations at what times to maximize revenue. This is a classic, big, nasty travelling salesman problem. The bigger of a beast of a machine you get, the closer you get to an optimized solution. I.E. Most passengers willing to pay this most money with the least use of resources. It's a huge problem that needs massive computational power.

  13. Re:I can't build one... by Derkec · · Score: 3, Insightful


    We may want to forgive Sun for being a bit slow in getting their DB server back up and pretty. A huge chuck of their support staff is helping bale out clients whose data centers got blown up out east. On the other hand, it's your brand new product, you gotta make sure it's available to be bought up. But then again, who in their right mind would just go online and buy one of these? I'd bet ALL of these are sold through meetings between sales reps and IT purchasers.

  14. Re:That's it... by Magumbo · · Score: 3, Funny

    Um. I believe Starcat Cluster is a trademark owned by the Little Debbie Food Group Inc., Lubbock Texas. It consists of two Star Crunch patties glued together with Mallow Kreeme filling. Then dipped in a Chocolastic sealant.

    Just watch out. I hear Debbie has quite a few lawyers.

  15. Re:Brought TCO WAAAY down by elmegil · · Score: 3, Funny
    Far cheaper than the time spent waiting for the Sun tech who comes, and then bends the fucking pins on a CPU trying to install it.

    Let's see...

    Sparc 1 box, CPU pretty much part of the MB, unless you get one of those fancy Weitek things. CPU fails, tech replaces MB, no pins to bend.

    Sparc 2, Sparc IPX, etc, same story.

    Sparc 20, suddenly we have CPUs with pins on them. Coulda happened. Of course, that hasn't been current tech for several years.

    UltraSparc line comes out...pressure fittings for the Enterprise servers, no pins to bend. Deskside UltraSparc (like E250, E450), no pins to bend, the CPU is on a card just like Intel does these days.

    Ultrasparc III line comes out, big servers don't even use pressure fittings--if you lose a CPU, you get a new system board. Deskside US III (SunBlade 1000) uses a card similar to older deskside units, and has rails to line it up and a torque tool to seat it. Don't see too many bent pins there.

    So, apparently you got burned once a very long time ago with a Sparc 20. Don't you think it's time to get over it?

    --
    7 November 2006: The day Americans realized corruption and incompetence weren't addressing 11 September 2001
  16. Insecure by Syberghost · · Score: 3, Funny

    I thought we weren't supposed to use strcat anymore, because it's subject to buffer overflows?

  17. Re:106 procs, so what by segfaultcoredump · · Score: 4, Informative

    The SGI origin has a ccNUMA architecture, which makes it great for some tasks, ok for others, and awful for yet others. (the trick is to make sure that your particular app falls under the 'great' category)

    The sun system is an smp based system, everything connects to a common backplane and each board has equal access to all of the other boards. With the sgi, the speed of accessing memory on the local board or boards in the same cabinet is much faster than hits to memory in remote cabinets.

    From what I can tell, Sun is planing on producing a special system board that goes into one of those 18 slots. Thus, with 19 StarCats you can create one big system with 1836 cpu's and 9.7TB of ram. (think of a system in the middle that acts as the center of a star) it will most likely be based on a COMA architecture rather than a ccNUMA. Like the SGI, memory access will depend on the distance between the requesting cpu and the storage location. The difference is that under COMA, if a cpu requests a particular bit of memory a lot, that page is either migrated or copied to a memory bank on that cpu's memory board (so if 5 cpu's all need read only access to the same bit of memory, then they can each have their own copy in a local memory bank. write updates are what make the system a pain in the ass to manage ).

  18. strcat? by J'raxis · · Score: 5, Funny

    We all know strncat() is better.

  19. Re:106 procs, so what by swordgeek · · Score: 3, Interesting

    Just wait for six months. This is the first beast in a series of pseudo-clustered Sunfires. This is roughly a stack of 6800's, and there's going to be a MUCH larger machine released very soon.

    --

    "People who do stupid things with hazardous materials often die." -- Jim Davidson on alt.folklore.urban
  20. Re:SGI Origin 3000 by fgodfrey · · Score: 3, Informative

    What are *you* smoking? The Origin 3800 is certainly *not* using the "same back plane a the sun was 2 gens back". In fact, if you were anything other than a hopeless troll, you'd realize that the Origin 3800 doesn't *use* a backplane at all. You get, on a 512p system, 128 I/O channels, each of them supporting up to 12 PCI slots or 4 XIO slots. I can't remember off the top of my head what the bandwidth per channel is but it's on the order of a gigabyte/second (I wanna say it's 1.6 GB/sec, but I might be wrong).

    --
    Go Badgers! -- #include "std/disclaimer.h"
  21. Where 106 probably comes from by MadDog+Bob-2 · · Score: 3, Informative
    I mean to say what's the difference between 106 (what an odd number) ...

    Given the way other Sun boxen like the E3500 work, I expect that's the 15K has 18 boards, each of which takes 3 modules, either 2xCPU of 8GB RAM.

    That means that 72 CPU / 288 GB memory is 18 boards, each with 2 2xCPU modules and one 18 GB memory module, and the box is full.

    Since you always need some memory, the most CPUs you can get is 17 boards w/ 6 each and one with 4. Of course, that leaves you with 8GB of memory for your 106 CPUs.

    The other end is (17 x 3 + 1 x 2) 8 GB memory modules for 424 GB on a pair of CPUs

    But that's just a guess...

  22. the use of this type of system by Ciannait · · Score: 3, Interesting

    Yes, there's a buy online button. But that's used to get info so one of their sales droids can contact you. It's not like you can slap it on your Visa card. :)

    (Disclaimer, I work a lot with E10Ks, so this post is written mostly from my experience with those.)
    The 15K is basically just an improvement on the E10K architecture, from what I've seen and heard from Sun's SSEs. The E10K started out life as the Cray SuperServer, and was sold to Sun for a song. It's not architecturally perfect. The E10K is set up to allow individual system boards to be part of domains (aka partitions), which can make for some great scalability in the domains. I've seen tiny little one-system-board domains, and domains with 13 fully populated system boards in them.

    One of the major advantages to this platform is the fact that you can hot-swap everything except the centerplane. (Of course, I've never seen a centerplane fail.) The E10K also has Dynamic Reconfiguration, where you can remove system boards from a running domain, but unless your platform is set up in a certain, specific way, this doesn't work as well as advertised. I've personally never used it. The best thing about the E10K is the use of the System Service Processor, which handles all the administrative tasks for the entire cabinet. I've heard that the SSP is now integrated into the 15K, thus eliminating the need for a separate system to perform these tasks and monitoring.

    The only thing I've ever seen this class of system used for is data warehousing. No modeling, no graphics rendering, just Oracle databases. Just because it has a large number of processors, doesn't mean they're going to be suitable for every task imaginable. (I used to have a 180MHz Indy R5000, that got 68kkeys/sec in d.net. My 166MMX got something like 350kkeys/sec.) These are workhorse processors, not sports-car style processors.

    Though I wonder if Sun's gotten around to fixing that nasty ecache parity error problem with their processors... Having a domain randomly crash because the parity bit on a processor got flipped is no fun when you're dealing with a large production database. I have a feeling that problem will continue to plague them in the 15K.

    --
    A good traveller has no fixed plans and is not intent on arriving.