Slashdot Mirror


Big Mac Officially Ranks 3rd

An anonymous reader noted that according to Wired, it will be announced officially on Monday the Big Mac supercomputer is the third-fastest super-computer. The article also talks about some of the amazing supercomputers in the planning stages. The sort of stuff that will make Big Mac look like that old TI-85 collecting dust in your drawer.

21 of 357 comments (clear)

  1. already official by Anonymous Coward · · Score: 5, Informative
  2. THe bandwidth demands are high. by mindstrm · · Score: 3, Informative

    Interconnect is very important.

    This is nothing like distributed.net.

    For a problem that can be broken into millions of discrete, independent chunks, sure, distributed.net's model is fantastic, and works really well... (seti, folding, distributed.net, etc)

    For something where you need lots of feedback from nodes, (like these benchmarks, and lots of simulation work), bandwidth is everything.

  3. Re:Trademark infringement by lost_it · · Score: 5, Informative

    Virginia Tech has not (and will not) call the computer "Big Mac". BBC used the name when it first started appearing in the news, and everyone else picked it up, IIRC.

    The people in charge of the cluster don't want to call it "Big Mac" because (1) they don't want a lawsuit from McDonalds, and (2) who wants to be associated with nasty, greasy fast food?

    They've worked out a solid candidate for a name (it's not official yet) that isn't quite as catchy as "Big Mac", but it also doesn't have any of the downsides.

  4. Re:Kudos to the Mac (don't forget the others) by tychay · · Score: 5, Informative

    Hmm, guess this means my submission a couple hours ago won't go through (dangit, Wired!)...

    Here is the official press release and the list.

    There is a lot of good points to note all around. The first is the G5 Terascale cluster at Virginia Tech at #3 (10.28 Tflops/s, 2200 CPU, Infiniband) is the first academic computer to break 10 teraflops/s. This extra performance was promised at Mac OS X Developer's conference last month. Not to sure if the price is a testament to Infiniband ($1.5 million cabling, cards, and routers) or the Macs ($4.2 million list).

    Good thing too because in a surprise move the NCSA cluster made the list at #4 (9.82Tflops/s, 2500 CPU, Myrinet). This cluster is built using Dell's running Pentium 4 XEONs and Red Hat Linux! One subtle point to note is that they didn't get all the systems online in time (there should be 2900 CPUs, not 2500). I bet some programmer at PSC and an ex-Chief Scientist of SDSC is appreciating having a hand in edging out NCSA for #3--not to mention Apple beating Dell for #3.

    The fastest Itanium cluster is at #5 (8.63 TFlops/s, 1936 CPU, Quadrics) which is looking like the odd man out boxed in by a PC based systems using Myrinet, the P4 Xeon above, and the most powerful Opteron system at #6 (8.05 Tflops/s, 2816 CPU, Myrinet). Another point of similarity:did I mention it's also using Linux?

    And finally, It's easy to overlook #73, a single compute node of BlueGene/L (1.44 Tflops/s, 1024 CPU). Imagine 128 of these connected together and you have something that will easily take #1 when it's completed even if we handicap it 20-40%. As noted on SlashDot earlier, this will be running Linux.

  5. Re:Check the #5 and #6 by damiam · · Score: 4, Informative

    If you had read anything about this project, you would know that they have implemented their own low-overhead software error correction to compensate for the non-ECC memory. Presumably the benchmarks were done with this enabled.

    --
    It's hard to be religious when certain people are never incinerated by bolts of lightning.
  6. Re:Kudos to the Mac (don't forget the others) by tychay · · Score: 3, Informative

    I miffed some of my links in the parent post:

    Here is the reference to the ex-SDSC scientist.

    Here is the link showing that the Opteron cluster is using Linux Networx.

    Finally in the interest of full disclosure and to pre-empt the anti-Mac zealots, I should mention that the $4.2 million for the G5 machines is probably the education list price, because when you go to Apple Store, putting 2GB of RAM into 1100 2x2Ghz G5's will cost you $4.4 million (+ a little more for having some spare machines).

  7. Re:Its also the CHEAPEST by tychay · · Score: 1, Informative
    goombah writes:
    Now this system is the cheapest of the top 10. its cheaper than many it beat by a factor fo ten

    Let's not get ahead of ourselves, it's more like a factor of two, not ten (it's a factor of 20-30 cheaper than #1 and #2 and some of the older members of the list, sure). The Mac system price cost $4.2 million educational list. However, #4-6 were built using the Pentium 4 Xeon, Itanium, and Opteron respectively. The systems cost for these would be around $9-10 million if they were built today.

    Not that there isn't a lot to say Mac isn't "back". After all, of the four, it offers the highest flops/cpu, the second most flops/cycle (Rpeak is the same/cycle as Itanium but it's not as efficient so it's Rmax gets edged out), and the best price/performance (by a factor of 2 as noted above).

    I hope when IBM introduces 970-based blades next year, a lot of Linux users take a serious look at it. With 90 nm next year and Power5-based GPUL's coming down the pike, Big Blue-based Linux compute nodes should be looking mighty impressive as the most bang for your Linux buck. I can't wait for Opteron and Itanium blades. Combined with the P4 Xeon blades they already have, you could have 4 different CPU families in the same BladeCenter!

  8. Infiniband, not G5 by binkleym · · Score: 5, Informative

    The G5 is a cool processor, but it isn't the reason the VT cluster is so fast, the Infiniband interconnect is. The LINPACK benchmark that is used to determine position on the Top 500 list depends very strongly on the latency of the network connection.

    Infiniband has ~ 8-12 us latency (probably even less by now), while ethernet is an order of magnitude slower. In real-life applications it's actually worse than this suggests.

    We have tested a real-life application (socorro) using both gigabit ethernet and Myrinet (slightly slower than Infiniband), and gigE took 600 seconds to finish a run, while Myrinet took 4.

    VT's cluster is using the largest Infiniband network yet built (or at least announced). The previous largest Infiniband network was O(100) machines. VT could have built the cluster using Xeons, Itaniums, or Opterons and arrived at roughly the same level of performance.

    1. Re:Infiniband, not G5 by Hoser+McMoose · · Score: 2, Informative

      Actually I would tend to disagree, in a manner of speaking. Yes, high-bandwidth/low latency I/O is termendously important in real-world supercomputer applications, but Linpack doesn't always show this, Linpack can be run in parallel pretty easily. The reason why the cluster does so well in this test is very much related to the processor itself, though it's performance in other applications may end up being much more closely connected to it's I/O performance.

      The real key here is that the PPC 970 has a multiply-accumulate instruction and can decode and retire two such instructions each clock cycle. Since Linpack does mainly multiplies and adds on data, this instruction is ideal and makes things go REAL fast. Notice that the Big Mac has an Rpeak value of 8 GFlops per 2GHz PPC970 processor as compared to the 4 GFlops per 2GHz Opteron processor for LLNL Lightning.

      FWIW there are Infiniband clusters using Xeons and Opterons in the list. They don't really do any better, clock for clock and processor for processor, than the Myrinet clusters do in Linpack, though they probably would in many real-world supercomputer applications.

  9. Re:Check the #5 and #6 by The+Infamous+Grimace · · Score: 5, Informative

    "...Then, as you have read it, you can tell the world how you do this "low-overhead software error correction to compensate for the non-ECC memory"..."

    Taken directly from interview with Srinidhi Varadarajan:

    Q: How do you deal with Error Correction in Memory?

    A: There's a lot of traffic on Ars Technica and other places. We do failure recovery, memory doesn't report. One of the things we've noticed is that failures aren't an issue yet. The reason they can be competent is the LINPACK test, which is showing 16 digits of accuracy. We are planning on moving to ECC systems in the future. They may have to run things twice for a bit.

    (tig)

    --
    Ignorance and prejudice and fear
    Walk hand in hand
  10. Re:Trademark infringement by Anonymous Coward · · Score: 1, Informative

    " Excuse me, but Big Mac is a registered trademark of McDonald's" ...lest consumers get confused between a burger and a supercomputer?

    Trademarks are not unique: The patents and trademarks office returns 13 results for big mac, most of which are owned by McDonalds. As usual, you get a more accurate picture by looking at the official register, rather than running to the mcdonalds.com website for legal advise, but you can have many trademarks all using the same wording for different products.

    On a related note, anything with a TM after it means that the company is trying to claim something which is not a trademark, as opposed to the R which indicates that they do indeed 'own' the trademark.

  11. Re:Can someone tell me... by 2nd+Post! · · Score: 1, Informative

    1:) Because the editors of slashdot have decided it should be that way.

    2:) Because the importance and equality of AMD64 is not your decision, nor is it's icon.

    3:) Because all 1,100 nodes of this computer are powered by Apple's PowerMac G5; it just happens that Apple's PowerMac G5 is also the IBM PPC970, and to call it the PPC970 would detract from Apple's achievement. To be fair, it is also IBM's achievement, but they have their own supercomputers on the list.

    4:) No, of course not. To sell out is such a close minded and antagonistic viewpoint. The slashdot editors just happen to like Macs more than you do. You just haven't kept up with how cool Apple is nowadays.

  12. Re:Apples and Oranges by prichardson · · Score: 4, Informative

    Actually, you are horribly horribly wrong about two things.

    You definitely could not do that with Opteron or Xeon systems. VT was in negotiations about price and delivery time with Dell and Apple. Apple beat out Dell's prices (shocking!!!).

    Also, the G5 makes a great cluster computer. It comes standard with gigabit ethernet and has very easy access to parts (no screws required to install anything).

    Finally, the Apples make a good cluster because in 5 years or so when they disassemble it they have 1,100 really nice desktop machines. PC's need to be upgraded more often to serve as a desktop computer (that's why Macs have awesome resale value compared with PCs).

    --
    Help I'm a rock.
  13. Re:Its also the CHEAPEST by Anonymous Coward · · Score: 2, Informative

    The fans operate at low RPMs, and are pretty high quality (compared to some of the shit that's out there).

    None of the Beige G3's that our company is still running (24/7) have blown a single fan. Compared to our Dells (blown motherboards, power supplies, and fans), and even our Suns (two of which are making some very bad fan noise because the bearings are about shot)--which our macs outnumber handily-- they're virtually indestructible.

  14. Bad news for Itanium 2 by afantee · · Score: 4, Informative

    The 1.5 GHz Itanium 2 costs over $3000 per chip, and even the 32-bit Xeon 3.06 GHz is about $1000, while the 2 GHz PPC 970 is about $300 or $400.  In addition, VT wants 64-bit chips, so Xeon is a nonstarter.

    Excluding the Earth Simulator, the 2 GHz G5 has the highest Flops per CPU, even 5% higher than the 1.5 GHz Itanium 2 and 10 times cheaper:

    #2 Alpha 13880 / 8192 = 1.69

    #3 G5 10280 / 2200 = 4.67

    #4 Xeon 9819 / 2500 = 3.92

    #5 Itanium 8633 / 1936 = 4.45

    #6 Opetron 8051 / 2816 = 2.85

  15. Re:Its also the CHEAPEST by Cecil · · Score: 4, Informative

    I would guess that your guess is wrong.

    Have you seen a G5? And do you know why your typical fans fail? Could it be perhaps because they are undersized, underpowered, cheap fans being pushed to their limits to try and brute force enough air churning around to keep your system cool? The G5s have a properly engineered case with a specific path for airflow, and abundant, high-quality fans mounted on rubber shock absorbers to dampen vibration. I suspect that these fans will have few problems. And if they do, won't it be a painful process to replace them: unlatch side panel, remove plastic airflow enclosure panel, put hand on fan assembly, slide out, slide in new fan assembly. Click, done. I wish my servers were that easy to replace things on.

    Cheap hard drives? Yeah, sure, whatever. They use Seagate Serial-ATA drives. I don't know about you, but Seagate stopped fitting into my definition of 'cheap and crappy' about 8 years ago.

    I have never dealt with ECC memory, so I grant you that point, however.

  16. Re:Internet distributed computing by JohnsonWax · · Score: 3, Informative

    My understanding was that, if anything, the distributed.net algorithms unfairly favor the PowerPCs - esp. those with Altivec. I believe the Apple has used that fact in their advertising much to the consternation of many Slashdotters.

    And, of course, problems in computational fluid dynamics and the like tend to also favor Altivec. Not always, but it's pretty common. As such, the value of Altivec shouldn't be tossed off as some kind of parlor trick. For these applications, it's quite relevant.

  17. Re:Its also the CHEAPEST by ZackSchil · · Score: 3, Informative

    Have you ever even looked at a G5, seen its internal design, read the technical documents outlining airflow zones and the large, high quality, low RPM fans, the controller that measures temperature and keeps them moving only when they need to? No? Ever looked inside a generic PC box (as in technical slang and literally a box) with little to no airflow and 2 or 3 noisy fans drilling away as fast as they can the whole time trying to circulate air in and out of the little holes in the back? The G5 has been designed to keep cool with as little noise and wear as possible.

  18. Re:Internet distributed computing by martinX · · Score: 3, Informative

    Don't know about that:

    Apple and IBM jointly developed the PowerPC G5, world's first 64-bit desktop processor. With two double-precision floating-point units , advanced branch prediction logic and support for symmetric multiprocessing, the PowerPC G5 processor builds on previous PowerPC designs, combining an optimized Velocity Engine with a superscalar, super-pipelined execution core that can execute more than 200 simultaneous in-flight instructions. This high-bandwidth core has over 12 discrete functional units that process massive amounts of instructions in parallel.

    And it has got to be the most totally buzzword compliant CPU ever built.

    All from here and here.

    --
    When they came for the communists, I said "He's next door. Take him away. Goddam commies."
  19. Re:Its also the CHEAPEST by ZackSchil · · Score: 2, Informative

    I know why he wrote that, but Mac zealots, from years and years of such comments, are extremely knowledgeable of Apple's product line. Even more so when Apple touts the G5's design as one of the nicer things about it both aesthetically and technically. They tout the cooling zones, etc right on that page.