Slashdot Mirror


More on Virginia Tech G5 Cluster: 17.6 Tflops

daveschroeder writes "BBC World's Click Online has a video report (with text transcript) on Virginia Tech's new 1100-node dual 2.0 GHz G5 Terascale Cluster. The report quotes the performance as 17.6 Tflops. As a point of reference, the cluster would be number 2 on the most recent June Top 500 list, behind only Japan's Earth Simulator, and considerably more than doubling the performance of the current number 3 1152-node dual 2.4 GHz Xeon MCR Linux cluster. Assuming the performance figure accurately reflects the LINPACK score (which it should; since the deadline for submissions for the upcoming list of Oct 1 has already passed, one would imagine VT would quote that figure), and depending on new entries for November's upcoming list, the cluster should almost certainly rank in the top 5 - all for only US$5.2 million. The video report is available in Windows Media 9 and Real formats; the relevant portion starts at 13:00."

12 of 390 comments (clear)

  1. Better links for Windows Media by Mwongozi · · Score: 4, Informative
    You can watch just the report itself, no skipping required, by following the links on this page:

    http://www.bbcworld.com/content/template_clickonli ne.asp?pageid=666&co_pageid=3

  2. Re:Can the results be trusted? by sakusha · · Score: 5, Informative

    They have previously discussed this, they use error correction algorithms, no ECC RAM necessary.

  3. New top-500 list will be announced around Nov 18 by slyfox · · Score: 3, Informative

    The new "top 500" list will be announced right before SC2003 and discussed in detail at a session of SC2003 on November 18.

    Look for another (less speculative) story on Slashdot around then.

  4. Re:It runs MacOS X !!! by paulthomas · · Score: 3, Informative

    Ugh.

    It has been said thousands of times by now I'm sure.


    Running Mac OS X does not mean running FreeBSD Mac OS X is a system of frameworks running on top of a Mach Kernel. The only thing that relates Mac OS X to FreeBSD is the userland. In addition to the userland you have: Cocoa, Carbon, Aqua, Java, etc. The FreeBSD portion is minimal.

    And yes, if you want you can run this lower level unix without the rest of Mac OS X. It is called Darwin. It runs on Intel and PPC if you're wondering. No, this doesn't mean that Mac OS X runs on both or ever will.

    Here is a short description of the BSD families.

  5. Re:Can the results be trusted? by Usquebaugh · · Score: 4, Insightful

    Yep, and they're going to be top 5. Between you and them I wonder who has the best knowledge of how to build a cost efficient cluster?

  6. Project leader speaking at conference Oct 28 by daveschroeder · · Score: 4, Informative

    The project leader, Dr. Srinidhi Varadarajan, will be speaking at a session entitled Building Virginia Tech's G5 Supercluster on Oct 28 at the upcoming O'Reilly Mac OS X conference.

    He'll probably reveal some of the technical details, such as the version of Mac OS X used, at that session.

    Also, according to a blog at O'Reilly:

    Next year, all the little known details [about the cluster] will be revealed in a new book. By that time we'll know what the project means for supercomputing and for Apple.

  7. Re:Can the results be trusted? by Waffle+Iron · · Score: 4, Interesting
    So they bog down the software doing something that could be done in hardware?

    Just because it's in hardware doesn't mean it's free. The ECC logic is going to add a small delay to each of trillions of memory accesses. Plain memory can most likely be tuned to run faster than ECC memory.

    If you're running a constrained problem and can verify the results at the end, a single error check in software could consume far less overall time than the continuous ECC hardware checks. The software check would probably catch other types of errors as well (including many errors caused by software bugs).

  8. How does it not add up? by daveschroeder · · Score: 3, Interesting

    Easy: you yourself point out that 1100 * 15.7 = 17.27 ... not 17.6.

    Since the call for papers for the new Top 500 list was Oct 1, and the BBC show aired on Oct 9 with a companion BBC News story dated Oct 12, you'd hope that VT was simply regurgitating the figure that has already been sent to the Top 500 organization.

    And why are you trolling around with one of those super-old benchmarking stories? We've already established that every manufacturer does what they can to show their products in the best possible light. At least Apple documented their test results and methods in full.

    So acually, your logic doesn't make any sense: you jump to the conclusion that it's not real results - even though real results already exist and have been submitted, and the entire story is pretty much about that process, making performance figures a critical piece to get accurate - and that they must have just multiplied some benchmark number by 1100. Then, even though the subject of your own post indicates your recognition that "it doesn't add up", you still apparently assume that the results are somehow doctored, this time for the worse, and you manage to weave in one of the stories that tries to make it look like Apple lied with its benchmarks - which it didn't - which is unrelated to the current issue! How does it "assume" the original scores were accurate?? YOU are assuming that they're just multiplying. You might have been onto something if the multiplication actually came out, but it doesn't, meaning that is NOT what they did.

    Bravo, +1 Troll.

  9. BJs for Geeks by bstadil · · Score: 4, Interesting
    --
    Help fight continental drift.
  10. numbers ok....reading is wrong... by djupedal · · Score: 4, Informative

    The 'project' uses the same amount of electricity as 3,000 average sized homes. There are many more devices deployed than just the 1100 G5s. The cooling system alone is a major power eater. Read the articles :)

  11. 17.6 TFLOPS is Rpeak, not Rmax! by DeeKay · · Score: 3, Insightful

    Simple equation: 4 FPops/cycle (IBM-PPCs) * 2GHz * 1100 G5s * 2 CPUs/G5 = 17.6 TFLOPS!

    No *real* Rmax linpack scores are known yet, and from what i figured the submissions on Oct 1st are just for *inclusion* in the list, real Linpack scores can be submitted till shortly before (or even on!) the conference mid-November..

    This article is BS and should be removed...

    P.S.: 4 FPops/cycle per clock with 2 FPUs i hear you scream - Impossible! - That's due the Multiply/Add FMAC thing that counts as 2 FPops!

  12. Re:Can the results be trusted? by afidel · · Score: 4, Informative

    No, ECC ram typically is just made with faster internals. As an example most ECC comodity ram is CAS2 latency whereas most generic ram is CAS3, so the ECC ram will perform exactly the same as the non-ECC ram. You can buy CAS2 non-ECC ram but it's nearly as expensive as the ECC ram. If you have a simple idiot check at the end of a complex calculation then saving the cost of going with ECC may be worth it but most clusters this large will be used on too many different projects to assume that all of them will have such checks. For an idea of how important ECC is read (a href="http://www.ibm.com/servers/eserver/pseries/c ampaigns/chipkill.pdf">This IBM whitepaper on their chipkill ECC scheme. Even normal SEC ECC ram (what most ECC ram is today) will have aproximately 900 failures per 10TB per three years. I think that IBM is right and that eventually all ram will be RAID-M, that is a RAID5 style array of redundant memory banks that are composed of ECC banks. At future densities this will be necessary because a single high energy particle will have the ability to scramble an entire memory word including it's ECC checking bits.

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.