Virginia Tech Supercomputer Up To 12.25 Teraflops
gonknet writes "According to CNET news and various other news outlets, the 1150-node Hokie supercomputer rebuilt with new 2.3 GHz Xserves now runs at 12.25 Teraflops. The computer, the fastest computer owned by an academic institution, should still be in the top 5 when the new rankings come out in November."
6.40tflops should be enough for anybody
Reflecting on the comment: "hould still be in the top 5 when the new rankings come out in November." There seems to be a serious push for multiprosessor systems, currently the ranking seem to consist of a couple of stars, few big ones(this computer among them) and a huge group of third category, and then the "used to be great" computers. But from my reading of the trends seems that there will be more and more crowding at near the top, so I expect the second category to be much larger, with much smaller differences.
If that were feasible, you could be looking at toppling Earth Simulator at a fraction of the cost.
Would be interesting to know exactly what stuff do these machines do? Maybe they would even be able to share some code so that people can fiddle around with it optimizing (should be fun).
Compare it to this new Cray system. Bang for the buck would make the Apple system better.
It's either on the beat or off the beat, it's that easy.
I moderate therefore I rule!
--
From the article:What I really want to know is what they do with the old machines. The articles speaks of the cluster being 'upgraded' - are the older G5s replaced, or do they just become part of the new cluster?
Still, I suppose there's one or two unwanted G5s - anyone want to send me a couple?
Tedious Bloggy Stuff - hooray?
are not designed for the same type of work as clusters. If a probably is not effeciently parallizable and requires shared memory then a Cray is the only feasible option A Cray is not a cluster. It's like comparing mph for a sports car and truck: the car is faster but they are meant for different types of loads.
Your CPU is not doing anything else, at least do something.
this is the official homepage of the listing:
http://www.top500.org/
they were sold off by MacMall at a slight discount around 6 months ago, along with a certificate of authenticity and a "property of virginia tech" sticker
-mkb
They were sold through retail channels with the addition of a metal nameplate stating its node number.
Lorem ipsum dolor sit amet
but will it run Longhorn?
--- Asking inconvenient questions for over 30 years...
Before you guys ask I RTFA. I was wondering what do they do with the old processors?
The PC Weenies: 11 Years of Online Tech 'Too
This kind of funding I wish my school had. We are going to build a cluster system out of about 20 G4's running yellow dog Linux. But this is my chance to actually do cluster programing.
The funny thing is with the class we are actually trying to figure out things to compile, besides bootstrapping our Linux laptops. =P Man we are geeks.
Linux is like a teepee. It has no windows, no gates, and there's an Apache inside.
Correct me if I'm wrong, but it actually looks an awful like a Beowulf cluster by nature.
Oh, and btw: here are some pictures.
The problem with Slashdot memes is that YOU INSENSITIVE CLOD!
I have it on good insider knowledge, that this entire cluster is going to be put to the best possible usage.
/kidding
Not disease solving, not genetic mapping, not calculating weather patterns.
No, what they're going to do is remaster the Original Star Wars series, right from the laser disc versions!!!!
Imagine, a digitallly remastered bar scene where Han shoots first!!@$!@#!one!@
If power was only equated to speed then you would be correct. However, as other posters have pointed out, there are several reasons why a Cray is a more powerful system besides sheer speed.
http://www.busyweather.com/
I have a freind who is finishing up his masters, and starting his PhD in computer engineering at VT. I asked him about it and he simply said: "they haven't found anything do actually _do_ with it"
2:14am EDT August 29, 1997...
Researcher: "Go to your machine room! And no Command and Conquer until you do your homework!"
Joshua:"Oh yeah? Would you LIKE TO PLAY A GAME?"
The reason is this.. more and more of these 'supercomputer' entries appear to be many machines hooked up together, possibly doing a distributed calculation.
However, would projects such as SETI, GRID, and UD qualify with their many thousands of computers all hooked up and performing a distributed calculation ?
If not, then what about the WETA/Pixar/ILM/Digital Domain/Blur/You-name-it renderfarms ? Any one machine on those renderfarms could be put to use for only a single purpose: to render a movie sequence. Any one machine could be working on a single frame of that sequence. Does that count ?
I seem to think more and more that the answer is 'no', from my perspective. They mostly appear to me as rather simple computers (very often not even the top-of-the-line in their own class), with the only thing going for them that there are many of them.
The definition of supercomputer (thanks Google, and by linkage dictionary.reference.com ) is
And for mainframe
Doesn't the above imply that a supercomputer should really be just a single computer, and not a network or cluster of many computers ?
( The mention of 'terminals' does not mean they're nodes. Terminals are, after all, chiefly CPU-less devices intended for data entry and display only. They are not part of the mainframe's computing capabilities. )
If the above holds true, then what is *really* the world's top 3 of supercomputers ? I.e. which aren't 'simply' a cluster of nodes.
Any mistakes in the above write-up/though process ? Please do point them out
For a while there were CPUs specifically designed to run LISP, aka AI . Symbolics was one of the better knowns one.
It failed in bankrupcy. My vague understanding was that the designing dedicated LISP processors was hard and slow and with little resources they could not keep up. Essentially the Symbolics computers ran LIPS pretty quickly given the MHZ but SUN and Intel kept moving up the MHZ faster than Symbolics could keep up. In the end there were not speed advantage to a dedicated LISP machine, just an increase in price. Economics might change eventually. Who knows.
the 1150-node Hokie supercomputer rebuilt with new 2.3 GHz Xserves now runs at 12.25 Teraflops. The computer, the fastest computer owned by an academic institution, should still be in the top 5 when the new rankings come out in November."
Just in time for the release of Half Life 2. Hmmm...coincidence? I THINK NOT!!!
(those that go to despair.com will recognize this) that "You can do anything you set your mind to when you have vision, determination, and an endless supply of expendable labor." Point being, I'm sure having essentially free labor (sans pizza, of course... ;-) might have cut the price down just a little bit too...
Not to poo poo their efforts, but the whole system was essentially a 'loss-leader' for future supercomputers projects using the G5's and Xserve....
Peace
Prof. Jack Dongarra of UTK is the keeper of the official list in the interim between the twice yearly Top 500 lists:
http://www.netlib.org/benchmark/performance.pdf (see page 54)
There have been some new entries, including IBM's BlueGene/L, at 36Tflops, finally displacing Japan's Earth Simulator, and a couple other new entries in the top 5.
Here's just the top 16 as of 10/25/04:
http://das.doit.wisc.edu/misc/top500.jpg
No matter what anyone says, Virginia Tech pulled an absolute coup when they appeared on the list at the end of 2003: no one will likely EVER be able to be #3 on the Top 500 list for a mere US$5.2M...even if the original cluster didn't perform much, or any, "real" work, the publicity and recognition that came of it was absolutely more than worth it.
Also interesting is that there is also a non-Apple PowerPC 970 entry in the top 10, using IBM's JS20 blades...
Highly concentrated in bio-systems/informatics. Tech just built a HUGE building for bioinformatics. they plan to be doing a lot of processing for that.
But they also plan to sell out processor time "cheap".
and i must say GO HOKIES
They built the origninal and as you say they didn't perform any real work. So whats the point? Its like rich guys that buy ferraris and never drive them.
Well.. maybe. Or Maybe not. But Definitely not sort of.
What, compared to the people who post the Beowulf/Soviet Russia/SCO jokes a million times over? Hard to get a worse sense of humour than them, as even this "new" 7xxxxx user is sick of their lame jokes.
It was late and I should have wrote top 10.... Who cares about the #8 computer, when you can have a top 5 computer!
If you add in VirtualPC... presumably the clustered version.. you should start to get to the level of compute power that was recommended by Microsoft for Longhorn... though it still wouldn't be the high end. Expect some sluggishness..
My question would have to be: Teraflops - is it purely an aggregation of processor power, or does it take into account things like interconnects? That is, interconnects inside a single node, and among nodes.
I ask the question having freshly read this article which explains why Opterons are such a huge leap forward compared to Intel's designs. In a nutshell, the bottleneck on a server isn't generally the CPU but interconnects between components. An opteron has dedicated interconnects to other CPUs, its own memory bank, and other components while Xeons, for example, do all this through a slow FSB.
I'm sorry if I haven't offended anyone
The vast majority of clusters are for simulating very complex systems that require lots and lots of calculations.
You can get a few hints by looking just at their names.
The number one "Earth Simulator Centre" is fairly self-explanatory, going to their website show they create a variety of models for things such as weather, tectonic plate movement, etc.
The number 3 LANL supercomputer "is a key part of DOE's plan to simulate nuclear weapons tests in the absence of actual explosions. The more powerful computers are designed to model explosions in three dimensions, a far more complex task than the two-dimensional models used in weapons design years ago." I imagine that most US government simulations would be doing something simmilar.
When they do real work, all of my questions and concerns will be taken care of. Until then, its a bit frustrating. A tool is only useful when it is used for its intended purpose. Maybe they built the first super computer with the idea that it would never be used, but thats just a little sad to me. Its as if Michelangelo had created a smaller version of David, but destroyed before anyone could see it because it was just a model for the larger work.
Well.. maybe. Or Maybe not. But Definitely not sort of.
My question would have to be: Teraflops - is it purely an aggregation of processor power, or does it take into account things like interconnects?
Take a look at top500.org. For every supercomputer you'll see two numbers, Rpeak and Rmax. Rpeak is a purely theoretical estimate, basically number of floating point operations per clock cycle times the clock frequency times the number of cpu:s in the system. Rmax is the result from running the linpack benchmark.
The linpack benchmark is run on the entire system, so it takes into account all the interconnects, but unfortunately linpack is almost "embarassingly parallel", e.g. for a cluster the interconnect makes almost no difference, ethernet is as good as some supah-dupah $$$ interconnect.
Well the G4s where not originally intended to be used for cluster computing. They where originally desktops for the past year or so. We got replacement systems for them so it was either we could surplus them or we could use them. We do not have the funding to buy systems just for this project so we just use the left overs.
Linux is like a teepee. It has no windows, no gates, and there's an Apache inside.
We'll have the puppet master, but it'll be contained to only a couple computers in the whole world (until you can get 100tflops on your desktop)...
Yes Francis, the world has gone crazy.
should still be in the top 5 when the new rankings come out in November.
Wow, ranked higher than the Virginia Tech football team this year.
If you look at the latest list, VT is already out of the top 5. They are in 7th. The new list is here:
http://www.netlib.org/benchmark/performance.pdf (Page 54)
IBM is first with BlueGene (PowerPC 440), but is also 3rd with their 3564 CPU PowerPC 970 2.2 GHz JS20 system.
1) 36010 - BlueGene/L DD2 - 16384 0.7 GHz PowerPC 440
2) 35860 - Earth Simulator - 5120 NEC processors
3) 20530 - IBM eServer BladeCenter JS20 - 3564 2.2 GHz PowerPC 970 G5
4) 19940 - QsNetII Intel Tiger4 - 4096 Itanium 2 1.4 GHz
5) 19564 - NASA Project Columbia SGI Altix 3000 - 4032 Itanium 2 1.5 GHz
6) 13880 - ASCI Q AlphaServe EV-68 - 8160 Alpha 1.25 GHz
7) 12250 - Virginia Tech Apple Xserve - 2200 2.3 GHz PowerPC 970 G5
8) 11680 - BlueGene/L DD1 - 8192 0.5 GHz PowerPC 440
9) 10310 - IBM eServer pSeries 655 - 2880 1.7 GHz POWER4+
10) 9819 - Dell PowerEdge 1750 - 2500 3.06 GHz Xeon
BTW, I wouldn't be surprised to see the 2.3 GHz Xserves announced by Apple in January for general consumption.
I don't think Apple deserves any of the credit. They made the case for the xServe? and the OS .. nothing hardware. Once again its an IBM feat. The Apple zealots on here are too much. They make Linux zealots look intelligent.
Thank you. Thank you. All of my friends laughed at me when I said I didn't need a motherboard-- I could save a bundle instead and just use a breadboard.
pricing a top of the line dual 250 opteron with a mobo that has similar features to a powermac (gigabit, pci-x, 8 ram slots, firewire 400 and 800.. which no opterons offer, etc) gives you a system at rough price around $2,473.00. that doesn't include a case, powersupply, hard disk, cdrom, keyboard, or mouse like the powermac does.
what planet are you pricing yoru "similar" x86 hardware on? look, i know mac doesn't have a low end $200 pc. but their high end offerings are not only competitive, but cheap.
- tristan
"Everything works in theory, but not pratice."
In theory, anyway.
If I were you guys, I wouldn't be calling their supercomputer a "Hokie supercomputer." Some of them thar Virginians might get a wee rankled thinkin' you said "Hokey supercomputer," and 12+ teraflops ain't too hokey. Who says? The end of my buckshot Blue Ridge rifle, that's who!
;-)
IronChefMorimoto
P.S. - Take my word on this as an ex-North Carolinian-- I called an Appalachian State University server farm rather "dairy" and nearly got my ass shot off.
From the CNET story headline:
u percomputer/2100-1016_3-5426091.html?tag=nefd.top)
e nnington.html) onsite to help with the installation upgrades?
"The fastest Mac supercomputer has gotten faster, thanks to an Xserve makeover." (http://news.com.com/Virginia+Tech+beefs+up+Mac+s
Was that neurotic TLC-to-ABC crossover Ty Pennington (http://abc.go.com/primetime/xtremehome/bios/ty_p
Sorry -- this post was in honor of my wife, who tortures me with that damned show every Sunday night.
IronChefMorimoto
Humans are not born intelligent, but most are born with the ability to become intelligent through a learning process. So the question becomes: How much of a human brain's "software" is coded genetically (like firmware) and how much is learned? Nobody knows the answer. Not yet, anyhow.
It's not obvious that the genetic portion is so complex that we can never figure it out. It might be relatively small and simple, and there may be evolutionary pressures to keep it that way. If this were the case, it might bode well for AI research. Then all you have to do is synthesize this "firmware" enough to enable the AI to start learning.
It was recently announced that the human genome only contains about 25,000 genes. There's a limit to how much complexity you can encode into that. Intuitively, it seems this should be possible to figure out someday.
His estimate was probably based on the common, and incorrect, belief that neurons are purely digital.
So.... are they partly digital? Entirely analog? Quantum in nature?
Don't tease.
My God, it's Full of Source!
OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
Can somebody not at VT rent time on this or is it purely in-house?
Are there any supercomputer rental outfits out there?
I've heard IBM will truck in a box for you, but that's not really 'net savvy. There was a story about Weta leasing during downtime, but that's a side-line.
My God, it's Full of Source!
OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
What, compared to the people who post the Beowulf/Soviet Russia/SCO jokes a million times over? Hard to get a worse sense of humour than them, as even this "new" 7xxxxx user is sick of their lame jokes.
In Soviet Russia, Beowulf cluster jokes are sick of you.
It's not offtopic, dumbass. It's orthogonal.
Yes, it was up for a while, but mostly for testing and tuning.
The one critical problem with the initial cluster was that the Power Mac G5 didn't have ECC memory, meaning that any long calculation would really have to be run twice - or at least until the result was the same - to essentially insure a soft error did not go unnoticed (and no, VT's special "error detecting" software didn't account for this).
The Xserve G5, however, does have ECC memory, making the current cluster just as capable as anything else in the top 10.
I'm not denigrating the original cluster, however: VT played by the rules, and made it to #3 in the world, #2 in the US, and #1 academic for a mere US$5.2M. They also broke the burgeoning Dell/Linux hegemony for cheap clusters, proving that you could use Apple, PowerPC, Mac OS X, and Infiniband to make clusters just as cheap, if not cheaper (note how much better the Apple clusters perform per processor than the closest Dell P4 Xeon 3.06GHz Linux cluster several spots below...additionally, check out this fantastic cost comparison of many of the top machines). Not to mention bringing a new 64-bit player to the HPC table. And one would hope that competition, even in supercomputing, is a good thing.
Gromacs is used in parallel by folding@home with the stanford group. If you run the folding@home program you can see the gromacs data packets being sent to your computer. The program simulates how peptides might fold into proteins by solvating them in a cellular environment.
So the virginia tech supercomputer could also run folding@home (basically modified gromacs).
I think they are headless, tailess and mouseless servers.
There is a spark in every single flame bait point.
So, Apple didn't design the motherboard, or the chipset derivative, or the Apple northbridge? Sure, the bus is based off of HyperTransport.. But you can't take Apple out of the equation.
- oZ
// i am here.
I'm a robotic software researcher, so this notion really affects me.
This post deserves its own slashdot article all to itself. Not only has an AI-driven robot posted on slashdot, but apparently someone has designed the robot to research software. So it would make sense that the robot would be reading slashdot. I think the editors should set up an interview with this AI drone known as SnowZero.
$5 / month hosted VPS on linux = awesome!
But it only has 1150 mouse buttons...
One problem with this list...
It doesn't include the new Xserve cluster that the COLSA corporation had Apple build, which can exceed 25 teraflops.
http://www.apple.com/science/profiles/colsa/
"There are 10 types of people in this world--Those that understand binary, and those that do not..."
dude, have you never seen those "property of bahdeblah sports team athletic club" shirts?
-mkb
yeah only apple would implement a transit system that does'nt go all the way downtown
I don't know were you learnt your maths, but 42.7 doesn't get anywhere near 10 times 12.25, or 122.5 for the slow-at-moving-decimal-places. It doesn't even get close to 5 times the speed, or 61.25. If you do the maths, the factor is actually approx 3.49.
Or, Finite Element Analysis!
Think about processing thousands of million
cubed matrix calculations needed to simulate
the air flow and thermal dynamic calculations
of a jet engine (or of weather patterns).
The matrix operations are readily split and
transferred piecemeal to a computer cluster.
I don't know how much Dell's Tungsten cluster cost but those guys went online last year and got ranked #4 (just behind this Mac cluster) and they're #5 or something now. These bozos have spent a year fscking around with upgrades and from the theoretical #3 (as they were taken out since the cluster couldn't enter production) will have dropped to #7 or more in the next ranking....
Tungsten cost $12 million. Just for the hardware.
System X cost a total of $6 million, and it's still faster.
Not to mention that Virginia Tech was able to pull of a publicity coup and become #3 in the world, #2 in the US, and #1 academic for a paltry $5.2M. And they were "taken out" of the list voluntarily, because they dismantled the entire thing to replace it with Xserve G5s. With the renewed US focus on supercomputing, no one will likely EVER be able to hit #3 on this list for something anywhere close to $5.2M again.
Here's the current list:
http://www.netlib.org/benchmark/performance.pdf
Here's just the current top 20, as of 10/26/04:
http://das.doit.wisc.edu/misc/top500.jpg
Confusingly, you seem to have forgotten that since VT dropped on the list, since VT is still much faster than Tungsten, that means Tungsten also dropped. Tungsten is currently #16. For $12 million. VT's 2.5 Tflops faster - a respectable standalone clusters' worth faster - for half the price. Plus VT got all the huge publicity and news articles, and attracted millions of dollars in funding and grants for their new supercomputer center. Not to mention bringing a whole new OS, platform, interconnect, and processor onto the scene, which will benefit everyone (competition and choice is good, right?).
Also, here's a really great cost/performance comparison of all the top clusters.
Nice try at trolling, but next time don't be so obvious and pathetic about it, especially when Tungsten looks like it clearly is the raw end of the deal, when you have to spend over twice as much money to get a cluster that performs significantly worse, and has worse power requirements.