Slashdot Mirror


Efficient Supercomputing with Green Destiny

gManZboy writes: "Is it an oxymoron to have an efficient supercomputer? Wu-Chun Feng (Los Alamos National Laboratory) doesn't believe so - Green Destiny and its children are Transmeta-based supercomputers that Wu thinks are fast enough, at a fraction of the heat/energy/cost, according to ACM Queue." 240 processors running under 5.2kW (or less!) is nothing to sneeze at. The article offers up this question: might there be other metrics that might be important to supercomputing, rather than relying solely on processing speed?

9 of 193 comments (clear)

  1. Re:Indeed... by ergo98 · · Score: 2, Informative

    While obviously there is a bit of hyperbole in your statement (I highly doubt there are many systems defined as "supercomputers" that consume less than 3 digits or so of kW...certainly not 400-500W that a worst case enthusiast consumes), I really wonder if home computing has really gotten that much worse. Around 11 years ago I remember getting a 350W power supply for my 386-33 (with Diamond Speedstar 24x!), and this was pretty much par for the course - of course the CPU itself consumed much less power (I think around 5W) than some of the high end CPUs (that can consume up to 80W), however the board and surrounding hardware generally consumed a lot more power then.

  2. Re:Green Destiny by Anonymous Coward · · Score: 1, Informative

    Mod parent DOWN -- was copied from another post!!

  3. from the so-obvious-it-hurts dept. by Tom7 · · Score: 2, Informative

    The article offers up this question: might there be other metrics that might be important to supercomputing, rather than relying solely on processing speed?

    Um, yes?

  4. Do the comparison with VT X instead of ASCII Q by 2nd+Post! · · Score: 3, Informative

    If you do the math with X (10,280 instead of 13,880 performance, 1000sq instead of 21,000sw, and 800kw instead of 3,000kw) you get a 337 fold increase in performance per square foot, rather than 65, and an 832 fold increase in performance per Watt, rather than 300 fold, vs the Cray.

    Of course I dunno the numbers for the Transmeta solution yet!

  5. Imagine reading the beowulf mailing list! :-) by Anonymous Coward · · Score: 1, Informative

    Anyone interested in this sort of thing should check out the Beowulf mailing list - go to www.beowulf.org, and read through the (recent) archives. There's been some talk lately on different metrics.

  6. Re:Does it really save that much power? by Svartalf · · Score: 5, Informative

    You're comparing apples to oranges, not to mention that your info's a little off...

    1) A Nehemiah core C3 runs really close to the same performance of a comparably clocked Celeron, with the same general power consumption of a Samuel2 core (For those that don't know, part of how VIA's chip originally got it's low power is that the FPU was underclocked by a factor of 1/2). It's a nice chip overall, but it's not really intended (nor are they USING it that way) for scientific or gaming applications even though you can use it for that. The C3's winning usages is in something like a media PC, workgroup servers, and embedded systems where you need low power consumption, relatively low cost, and relatively high performance compared to other x86 embedded solutions.

    2) The Crusoe and similar chips are very fast executing VLIW CPUs (very much like the Itanium...) that have code morphing that translates x86-32 instructions into comparable sets of instructions for the VLIW CPU- in fact it's very good at doing this sort of thing. The reason it's less desirable with a desktop or gaming application is that you're exceeding the VLIW code cache regularly, meaning you have to keep recompiling the x86 instructions into the native VLIW ones. For a scientific application, the same task gets executed time and time again and usually ends up with most, if not all, of the code in the pre-morphed code cache. At that point, you're now in the high-performance domain with very little power consumption. The Crusoe in this application would consume less power than the G5 and run just as fast. (Check the article that you're commenting on...)

    Do some thinking outside of the box here, what's good or great on a desktop machine isn't always the optimal choice for supercomputing clusters or HA clusters. Depends on a bunch of factors, including what you're going to be running on the systems in question and what kind of environmental conditions you're going to be facing.

    --
    I am not merely a "consumer" or a "taxpayer". I am a Citizen of the State of Texas
  7. Re:WTF? by Anonymous Coward · · Score: 1, Informative

    the premise of the article was that compute power is NOT the only consideration of operating a supercomputer.

  8. Full Formatted Text by TubeSteak · · Score: 4, Informative
    Sorry, no Tables and no Pictures

    Making a case for Efficient Supercomputing
    From Power
    Vol. 1, No. 7 - October 2003
    by Wu-Chun Feng, Los Alamos National Laboratory It's time for the computing community to use alternative metrics for evaluating performance.Motivation

    A supercomputer evokes images of big iron and speed; it is the Formula 1 racecar of computing. As we venture forth into the new millennium, however, I argue that efficiency, reliability, and availability will become the dominant issues by the end of this decade, not only for supercomputing, but also for computing in general.

    Over the past few decades, the supercomputing industry has focused on and continues to focus on performance in terms of speed and horsepower, as evidenced by the annual Gordon Bell Awards for performance at Supercomputing (SC). Such a view is akin to deciding to purchase an automobile based primarily on its top speed and horsepower. Although this narrow view is useful in the context of achieving performance at any cost, it is not necessarily the view that one should use to purchase a vehicle. The frugal consumer might consider fuel efficiency, reliability, and acquisition cost. Translation: Buy a Honda Civic, not a Formula 1 racecar. The outdoor adventurer would likely consider off-road prowess (or off-road efficiency). Translation: Buy a Ford Explorer sport-utility vehicle, not a Formula 1 racecar. Correspondingly, I believe that the supercomputing (or more generally, computing) community ought to have alternative metrics to evaluate supercomputersspecifically metrics that relate to efficiency, reliability, and availability, such as the total cost of ownership (TCO), performance/power ratio, performance/space ratio, failure rate, and uptime.

    Motivation

    In 1991, a Cray C90 vector supercomputer occupied about 600 square feet (sf) and required 500 kilowatts (kW) of power. The ASCI Q supercomputer at Los Alamos National Laboratory will ultimately occupy more than 21,000 sf and require 3,000 kW. Although the performance between these two systems has increased by nearly a factor of 2,000, the performance per watt has increased only 300-fold, and the performance per square foot has increased by a paltry factor of 65. This latter number implies that supercomputers are making less efficient use of the space that they occupy, which often results in the design and construction of new machine rooms, as shown in figure 1, and in some cases, requires the construction of entirely new buildings. The primary reason for this less efficient use of space is the exponentially increasing power requirements of compute nodes, a phenomenon I refer to as Moore's law for power consumption (see figure 2)that is, the power consumption of compute nodes doubles every 18 months. This is a corollary to Moore's law, which states that the number of transistors per square inch on a processor doubles every 18 months [1]. When nodes consume and dissipate more power, they must be spaced out and aggressively cooled.

    Figure 1

    Without the exotic housing facilities in figure 1, traditional (inefficient) supercomputers would be so unreliable (due to overheating) that they would never be available for use by the application scientist. In fact, unpublished empirical data from two leading vendors corroborates that the failure rate of a compute node doubles with every 10-degree C (18-degree F) increase in temperature, as per Arrenhius' equation when applied to microelectronics; and temperature is proportional to power consumption.

    We can then extend this argument to the more general computing community. For example, for e-businesses such as Amazon.com that use multiple compute systems to process online orders, the cost of downtime resulting from the unreliability and unavailability of computer systems can be astronomical, as shown in table 1millions of dollars per hour for brokerages an

    --
    [Fuck Beta]
    o0t!
  9. New trend in computing. Vector processing by zymano · · Score: 2, Informative