Slashdot Mirror


On the Supercomputer Technology Crisis

scoobrs writes "Experts claim America has been eating our 'supercomputer feed corn' by developing clusters rather than new supercomputer processors and interconnects. Forbes says America is playing catch-up and that the new federal budget items are too little too late. Cray is laying people off due to decreased federal spending and claims lower margin products have forced them to create products based on commodity parts. Red Storm, one of their new Linux-based products, is being delayed to next year."

22 of 347 comments (clear)

  1. it makes sense by dncsky1530 · · Score: 4, Insightful

    when you can build a top 5 supercomputer for under 6 million dollars, using off the shelf parts. Why spend the hundreds of millions of dollars?

    1. Re:it makes sense by Otter · · Score: 4, Insightful
      If you RTFA, an administration panel on high-end computing claims that clusters are inappropriate for certain tasks. I don't necessarily trust the claims of what I assume is an industry-heavy panel, but then I don't necessarily trust the supercomputing expertise of a bunch of Lunix fanboys "administering a network" in their parents' basement either.

      My inclination is to let the market sort itself out, although if supercomputer makers go under, they won't necessarily reappear the moment they're needed.

    2. Re:it makes sense by badboy_tw2002 · · Score: 5, Insightful

      Then trust the fact that not all problems are easily attacked from a parallel perspective. This means problems where working on one section of the dataset affects large amounts of data in other sections. There's a lot of locking and waiting for tasks in other parts of the system to be completed; and a lot of data transfer/need for shared memory, which if you're bussing between cluster components, its going to be slow.

      This doesn't mean that clusters don't have some use in these regards, it just means that for these types of problems no one has figured out an efficient parallel algorithm to use on them.

    3. Re:it makes sense by ch-chuck · · Score: 4, Insightful

      because, sometimes you need two strong oxen instead of 10240 chicken.

      --
      try { do() || do_not(); } catch (JediException err) { yoda(err); }
    4. Re:it makes sense by Rei · · Score: 3, Insightful

      I'll agree that it sounds like an industry heavy panel; furthermore, it sounds like a supercomputer-industry-heavy panel. Furthermore, it sounds like a *nostalgic* supercomputer industry heavy panel. What else could explain lines such as " In contrast, classic supercomputers that rely on very fast, specially designed vector processors "could be programmed in Fortran," Scarafino said. "They could be programmed in a language that mere mortals . . . could program in." ?

      Yes, there are tasks where supercomputers are needed. Most tasks are not among these. If there is a single parallelizable task in a CPU-intensive process, odds are that a cluster is your best bet. For example, even if your core algorithm requires intensive memory locking and must be done in a completely serial manner, if you are going to be running that core algorithm over a range of possible inputs, a cluster will probably be your best choice.

      --
      "You abandoned me! You abandoned my hatred!" "I... I have cuttlefish..."
    5. Re:it makes sense by grawk · · Score: 5, Insightful

      As someone who works for a supercomping center, I can say that some things work VERY well on cheap unix based clusters. I am the primary admin on a 5 TFLOP cluster. We've also got a Cray X1, and while it's only 2.6 TFLOPs, it will eat my IBM's lunch when it comes to some specificly tuned tasks. Much in the same way that we can outperform mac clusters that have significantly higher floating point performance because of the speeds of the interconnects. Supercomputing is about a LOT more than just raw CPU power.

  2. Expected fallout from the Beowulf takeover by beee · · Score: 4, Insightful

    This is an expected and predicted fallout from the recent rise in popularity of beowulf clusters. Slowly but surely managers are realizing, yes, it is possible to have a supercomputer on mass-market hardware, running a free OS.

    Don't see this as bad news... it's a sign that we're winning.

    --


    + Donald Gunth
    + Email: dgunth@quicktek.net
    "Caffeine is the greatest lubricant ever created." -ESR
    1. Re:Expected fallout from the Beowulf takeover by susano_otter · · Score: 4, Insightful

      Since clusters are so much cheaper than mainframes, it's often the case that clusters still offer better performance for the money spent than a mainframe would, even if the cluster isn't really optimized the way the mainframe is, for the task at hand.

      That being the case, wouldn't it make more sense to invest heavily in R&D to solve the cluster's problems and remove its limitations, than to invest heavily in R&D into next-gen mainframes?

      --

      Any sufficiently well-organized community is indistinguishable from Government.

    2. Re:Expected fallout from the Beowulf takeover by Aadain2001 · · Score: 3, Insightful

      Then let the people who have those problems pay for the hardware to solve those problems. For the people who are doing parallised work, clusters make perfect sense. I think the big guys (Cray) are just unhappy to see that most of their business is going away because their hardware isn't needed as much since people are figuring out how to use clusters to get the jobs done.

      --
      Space for rent, inquire within
    3. Re:Expected fallout from the Beowulf takeover by forkazoo · · Score: 3, Insightful

      A cluster has poor latency and bandwidth compared to moving data register to register on a CPU. Big fast CPU's have lots of local bandwidth. Clusters have less. How in frack's sake do you expect to "Fix" that? It is the inherent distinction of a cluster. Separate boxes, with IO connecting them can never be faster at comms than the CPU itself. a 486 has more on chip bandwidth and better latency than Gig-ethernet. Sure, it only has 8 registers... Not a huge range of problems that it can solve entirely on-chip... :)

  3. Inevitable by Marxist+Hacker+42 · · Score: 3, Insightful

    What most people don't seem to understand is that you don't need a supercomputer when a mesh of nodes on a network will do just as well. Just like most people don't understand that a 386 running Linux and Word Perfect 5.1 is just as good of a word processor as a 2.5Ghz Itanium running Windows and Word. Computer power has *usefull* limits as well as technological limits.

    --
    SJW: a person who perceives an injustice, and while correcting it, commits a greater injustice.
    1. Re:Inevitable by Performer+Guy · · Score: 3, Insightful

      No the government didn't abandon these. Infact the government is one of the few remaining purchasers of this type of hardware. It just so happens that a lot of problems including the governments are solved by clusters.

      It could be argued that at least *some* of the ASCI (Advanced SuperComputing Initiative) computers had specialized architectures with loads of bandwidth & low latency interconnect (in their day).

      It's a bit of a joke complaining about a lack of vector computing when every Intel and AMD CPU sold today has floating point vector instruction set extensions with very interesting operators.

      I'd argue that if you take those lamented early 90s supercomputers there's not a problem they can solve faster that a relatively small contemporary cluster or even a single desktop system. A standard 4 CPU single PC desktop system with the right architecture could also spank those legacy systems in memory bandwidth, shocking but true. It just didn't keep pace with the scale and cost reduction of small systems & clusters.

      The real problem here is *relative* performance of supercomputers and commodity components, but as it takes hundreds of millions if not billions to develop a new competitive CPU & architecture and manufacture it, scientists pockets aren't deep enough to pay for those costs (and thank goodness because it's our tax dollars). It is rather pathetic to lament that supercomputers have been outpaced by clusters. The economics make it impossible for supercomputers sold in low numbers to keep pace. Or more reasonably stated, the economics of consumer PC systems makes powerful computing ubiquitous and affordable to the point where it no longer makes economic sense to pursue specialized processors and architectures to try to outperform them.

      If anything is to be done it would be to increase the bandwidth and reduce the latency of cluster interconnect, and guess what, that's EXACTLY what smart people are working on right now.

      As for eating America's seed corn, it is Intel and AMD that sell most CPUs used in clusters today. It is that competition and the pressure of increased development costs that makes custom hardware untennable.

      It is just false to imply that supercomputing technologies fed lower end development. It is a romantic vision of trickledown technology but it is not actually how technological development works. Look at computer graphics, since the commodity PC graphics cards beat big iron from SGI there has been more innovation and development in graphics hardware, not less. There is competition and a willingness to experiment with new features. The same is true with CPUs from Intel and AMD and the architectures and innovations in memory bandwidth they constantly drive forward.

  4. Law of Diminishing Returns by Billobob · · Score: 4, Insightful
    It appears to me as if we have reached the point where supercomputers aren't really as practical as they were before. Fewer and fewer industries need and prefer supercomputers to a cluster of cheap PCs, and the market is simply heading in that direction - nothing really unique happening here other than capitalism.

    Of course people are going to cry that companies like Cray are falling by the wayside, but the truth is that their services simply aren't as needed as they were in years past.

    --
    If you have to ask, you'll never know.
  5. "Feed' Corn? by jmckinney · · Score: 4, Insightful

    I think that should have been "Seed Corn."

  6. Expert complains: by Anonymous Coward · · Score: 4, Insightful

    Free market sucess might lead to us actually having to pay for our own supercomputer research that we use in profit making ventures.

  7. You are all missing the point by Anonymous Coward · · Score: 5, Insightful

    Its the fact that clusters require higher skill to program efficiently for than do single processor systems. Plus you have all of the wasted processing power used for communication between the nodes. Granted, many problems lend themselves well to distributed computing (essentially what a cluster is, but the nodes are closer and communicate faster), but there are also problems that are handled better by a smaller amount of specialized hardware. The other point is that by using off the shelf parts, we are not really innovating in this space like we should be. We are allowing the commodity computer market determine the direction of the supercomputer market.

  8. Without a market you can't survive long term by wintermute42 · · Score: 4, Insightful

    There seems to be some historical revisionism going on regarding the demise of the "supercomputer industry". People are coming out of the woodwork now saying that lack of government support caused the great supercomputer die off.

    As Eugene Brooks predicted in his paper Attack of the Killer Micros, the supercomputer dieoff was caused by the increasing performance of microprocessor based systems. Many of us now own what used to be called supercomputers (e.g., 3GHz Pentinum processors, capable of hundreds of megaFLOPs).

    The problem with supercomputers is that high performance codes must be specially designed for the supercomputer. This is very expensive. As people were able to fill their needs with high performance microprocessors they quit buying supercomputers.

    Many people who need supercomputer levels of performance for specialized applications (e.g., rendering Finding Nemo or The Lord of the Rings) are able to use walls of processors or clusters.

    There are, of course, groups where putting together off-the-shelf supercomputers will not suffice. But these groups are few and far between. As far as I can tell they consist of the government and a few corporations doing complex simulations. The problem is that this is not much of a market. Even if the government funds computer and interconnect architectural research, there does not seem to be a market to sustain the fruits of this research.

    In the heyday of supercomputers there were those who argued that when cheap supercomptuers were available the market would develop. The problem is, again, programming. High performance supercomputer codes tend to be specialized for the architecture. Also, no supercomputer architecture is equally efficient for all applications. It is difficult to build a supercompter that is good at doing fluid flow calculations for Boeing and VLSI netlist simulation for Intel (the first applications tends to be SIMD, the second, MIMD). The end result of these problems tends to suppress any emerging supercomptuer market.

    The reality right now seems to be that those who are doing massive computation must build specialized systems and throw a lot of talent into developing specialized codes.

  9. And if they really want it... by sterno · · Score: 3, Insightful

    If there truly is a demand for those kind of processors, then somebody will likely meet that demand. Right now, it seems that actual demand is so low that they have to drum up this legislation a as a sort of wellfare for vector processor manufacturers.

    It's a simple cost tradeoff. If you can save millions in purchasing computers, it means more money to pay for people to run those computers and do the real work.

    --
    This sig has been temporarily disconnected or is no longer in service
  10. What tasks require high-speed interconnects? by Bruce+Perens · · Score: 5, Insightful
    One of the nice things about clusters is that they encourage people to consider how to decompose a problem so that it can work without a large high-speed shared data memory. Some of the older supercomputers were important because scientists hadn't done this work because there wasn't the economic incentive back then. Now there is one.

    So, what tasks still require a high-speed shared data memory? Answer that, and you'll understand where you can still sell a supercomputer.

    Bruce

  11. No, YOU are actually missing the point. by CatOne · · Score: 3, Insightful

    Granted, it is more difficult to program something (from the ground up) that runs distributed, than it is to program something that runs on a giant 2048-way box.

    Just like it's more difficult to write multithreaded code than it is to write single-threaded code.

    That's where software, and platforms come in. There is a TON of research being done, which uses technologies like Infiniband and Myrinet as interconnects, and can make a cluster "look" like a big monolithic machine. If you as an end user write code that goes down into the TCP stack itself, you're working too hard, and you're going about it the wrong way.

    Put it this way: In 5 years the odds are overwhelming that there will be a good software platform that can let you pick 5000 servers and run your app 10,000 threaded, with everything appearing just like a single process, and running "as it would on a Cray." It's easier to solve this stuff with software -- take your problem (distributed computing) and solve the problem with a different set of technologies (high performance/low latency interconnects, shared address space/DMA across machines, etc).

    Apple's Xgrid is a step in this direction. It's missing a ton of "Supercomputer" functionality right now, but it's a nice cross-machine GUI scheduler. Right now this type of app can address maybe 20% of what supercomputer apps need... in the future maybe more like 98%.

  12. So what by DarkOx · · Score: 3, Insightful

    What does it matter if we don't develop single unit supercomputers. Clearly in a free market if these thing had value they would be persued. There is not predetory tax laws on supercomputer, or any other regulations on domestic use. The only reason development has slowed is there is not much market for the beasts.

    There are many reasons for that too, for one other then in stealer, neculear, mathematic, and bio research feilds few industries need more computing power then can be had off the shelf any day of the week. That was not true yesterday it took all sorts of custom hardware to make CGI happen in films that can be done now in my basement in resonable time frames. So no more super computer market there the ROI is gone I am sure this plays out in all sorts of other engineering feilds as well.

    Many places where you do need super computing power can be done with clusterd systems that are cheap to build and cheap to maintain.

    At least people in the pure science and research fields have learned to be better thinkers and programers, they found ways to do things in parallel that were traditionally serial. Things that still are serial can be made to work on a cluster, sure it might take longer then a single computer considered to be equal FLOPSwise but considering I could either spend all the money I saved makeing my cluster bigger and more powerful so I can get back to equal time or on other profitable efforts while I wait there is again no ROI.

    It so happens that may of the most interestin questions in math, physics and computer science such as quatum theory need massive amounts of parallel work, rather then serial so that works better on a cluster anyway.

    If there is a real reason to do it people will build supercomputer, because there is nothing stopping them other then economics. No need to fear Supercomputers are not going away. Everyone else that needs that kinda proc-ing power will settle for clusters, as well they should. This is just another largly obsolete industry wanting someone to bail them out because they have failed to adapt to a changing market. If they are going to die we should let them, just like we should let the Universitys adapt or die, and the RIAA needs to adapt or die, we need to stop proping up obsolete undustries so new ones can replace them!

    --
    Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html
  13. It is about cost by deadline · · Score: 3, Insightful
    Sigh...

    There never really was a supercomputer market. There was a cold war, that subsidized the supercomputer market.

    Then there is the cost. Companies stopped making SC because they were too expensive. If the guy from Ford wants to pay 1 billion for a supercomputer I am sure someone will build him one. The cost build a FAB is over 4 billion. Why do you think HP teamed with Intel. Why do you think there are so few processor families? You have to make a living in the commodity market where you can sell things in the millions because supercomputers even in their heyday were sold in the hundreds.

    Then there is the problem that many problems are solvable on clusters. So those specialized problems can not depend on other parts of the HPC market to help subsidized their corner of the market. i.e. clusters make the really hard problems more expensive.

    It is question of how much you want to pay to solve your problem? Simple economics actually. If the numbers don't work, the problem doesn't get solved. If the Gov. wants to solve some problems (and during the cold war they did) then they can step in and subsidize the market.

    And don't cry about Japan and the Top500. When the top500 has price column then it will start to be meaningful.

    --
    HPC for Primates. Read Cluster Monkey