Virginia Tech to Build Top 5 Supercomputer?
hype7 writes "ThinkSecret is running a story which might explain exactly why the Dual 2GHz G5 machines have been delayed to the customers that ordered them minutes after the keynote was delivered. Apparently, Virginia Tech has plans to build a G5 cluster of 1100 units. If it manages to complete the cluster before the cut-off date, it will score a Top 5 rank in the Linpack Top 500 Supercomputer List. Both Apple and the University are playing mum on the issue, but there's talk of it all over the campus."
...a Beowulf cluster of....
Oh wait, it is a cluster. DAMN!!!!!
there's talk of it all over the campus ....and we all know how reliable campus rumors are! C'mon guys, don't forget to say hi to the Olson twins when you see them on campus next year!
"Quoting famous computer scientists out of context is the root of all evil (or at least most of it) in programming." - K
That's the one thing that favors huge amounts of processors in the same box. All this "the internet is one giant distributed computer" doesn't acknowledge this. A box designed to be separate just will not have the latency advantage of a supercomputer designed from the ground up.
-Libertarian secular transhumanist
"Both Apple and the University are playing mum on the issue, but there's talk of it all over the campus."
Must be a pretty boring campus...
~ "When I'm of that age I'm just going to live up a tree."
Are they gonna run Photoshop on that supercomputer ?
play Doom III.
but there's talk of it all over campus.
Funny, I haven't heard anything about it prior to today. Guess I'm just out of the loop then...
-Julius X
remove "-whatkindofspamdoyoutakemefor-" from email to send
And it'll get skunked by 40 teraflops by Duke's supercomputer every year!
I don't want to start a holy war here, but what is it with you with you G5 zealots? Ive been sitting at my 1100 CPU G5 supercomputer for 20 minutes as it computers a fast fourier transform of an 8Ghz guassian. 20 minutes! At home, on my 60 cpu linux beowulf cluster, the same operation would take 2 minutes if that. Also, while this operation is takiing place, Doom III won't start, and everything else grinds to a halt, even my multithreaded emacs is struggling to keep up as i type this.
My Sun Enterprise 5000 is faster than this machine at times. Super computer addicts, flame me if you want, but I'd rahter hear some inteligent reasons why I should use the G5 supercomputer over cheaper, faster clusters.
I take it you don't look at Think Secret on a regular basis. It is, easily, the most accurate Mac rumors site out there. In fact, they have posted info on numerous occasions that has caught the attention of Apple's lawyers, and have been forced to pull down and issue their standard disclaimer. Say what you will about other rumors sites (most of them simply feed off each other) but there are some startlingly reliable sources informing Think Secret. Frankly, I don't recall the last time they were wrong about anything they've posted.
--Rick "If it isn't broken, take it apart and find out why."
Virginia Tech placed the dual-2GHz G5 order shortly after the G5 was announced. Multiple sources said Virginia Tech has ordered 1100 units
:-)
...)
Wow, that'll make Apple's quarter for sure
Seriously though, why PowerMacs ? I've always been under the impression that intelloid machines are the cheapest commodity hardware around for equivalent processing power, if not the most exciting. Would anybody know why Powermac G5s are a better choice here?
(Note to computer zealots: it's not a flamebait, it's a genuine question, from someone who is rigorously ignorant of the Mac world. And just in case, the first sentence is a joke, too
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash
I got the following email the other day:
Virginia Tech is in the process of building a Terascale Computing Cluster which will be housed in the Andrews Information Systems Building (AISB). For those who are interested in learning more about this project, we will host an information session on Thursday, September 4th from 11 a.m. to noon in the Donaldson Brown Hotel and Conference Center auditorium.
We look forward to seeing you there
Terry Herdman Director of Research Computing.
I'll try to remember to take notes on this and let you all know if there's anything interesting...
Friends help you move. Real friends help you move bodies.
By my count, they'll have an R_peak of 8800 GFLOPS; unless they've got more efficient linpack code than anyone else, that will put them around 7th or 8th place.
Tarsnap: Online backups for the truly paranoid
Of all the Macintosh rumor websites, Think Secret is, by far, one of the most reliable sites I've seen. If it wasn't, I wouldn't be investing $1,200 a year in them for their message board. Of course, if you think you can do better... ;-)
8==8 Bones 8==8
Yeah, chicks dig massive...computers.
No wait, no they don't!
One thing against clusters rather than machines designed for the ground up is memory access. If on a n Single System Image (SSI) system is that any node can access memory of another over fast internconnects. With a cluster the memory has to be transfered over ethernet which even if using 10GB Ethernet is still a number of magnitudes lower than memory
Rus
Cheap UK and US VPS
Right after the Sony Playstation 2 launch, there was a big shortage. Several media stories blamed it on some "unnamed" Middle East country buying them all up to power their missles and supercomputers (because, the rumor claimed, the PS2 was just so powerful).
Wonder if Apple is trying to "pull a Sony" here...
The article makes no mention of the operating system that will be running on this supercomputer. I for one would like to see them get this done w/ OS X rather than use GNU/Linux.
[Connection closed by foreign host]
Orientation today was postponed, however, so I won't have more details until Wednesday =/ I'm looking forward to helping out, though.
"If it manages to complete the cluster before the cut-off date, it will score a Top 5 rank in the Linpack Top 500 Supercomputer List."
Err... I think somebody's getting a bit ahead of themselves here. =) Building parallel computing systems is complicated, and it may end up being quite a bit harder to realize the predicted performance than thought (not an uncommon occurrence). I'll believe it when they have the actual Linpack numbers.
Currently the top 5 consist of 4 machines that have a Therotical maximum speed (Rmax) the are larger then the 10TFLOPS this machine will have. Then you have to translate that into peak speed which is what matter and what this list uses to rank the machines. Peak will be a good deal less, but this mostly has to do with the way the systems are interconnected and not the machines themselves. Say what you may about the G5 but the interconnect is more important.
There is only one machine in the top 5 that this cluster could beat. The rest of the world has had 6 months to build machines too.
This should be a top 10 machine for sure. Good to see more fast machines being built every day.
Schools have many different accounts set up to fund many different things. This is due to how donors donate money and specify that they want it to go toward a certian project or department. One department, say the CS department might have recieved donations from CS alumni. Also, having large projects like this can generate lots of revenue through grants.
Several years ago I did some work on some Virginia Tech "supercomputers" (actually, baby versions of ones on campus that were the same as huge ones they leased time on elsewhere), and I think the people talking about Altivec are on track. I never knew exactly what they did, but at that time the Math, CS, and Engineering groups were working together to simulate wing designs for the YF-22 jet figher prototype. Since I was more of a "sysadmin" (althoug h with a math and CS background) I ignored most of what was going on, but one thing I can tell you was vectors, vectors, and more vectors. The vector is king. It's an assumption, but I'll bet they are still working on similar type studies, and if built, this will be just the beast for it.
Exactly, Virginia tech has a goal to become a top 30 research university. Having known about the plan for some time, this makes perfect sense. The departments who are building the cluster, have gotten very large grants and donations from our great alumni to build this, and become a better university for it. This construction can be compared to the stadium expansions. The stadium expansion is paid out of a different set of funds, as is research. Academic fund is hurting because alumni rarely give money for academic reasons, but more for football or research.
Damn, I've been at this school for a week and I haven't found a single redeeming value. Finally, a cause to hang in there for the next couple of years.
-Waldo Jaquith
While the AltiVec unit is very impressive, The SSE2 unit on the P4 or the Opteron would have nearly the same performance and cost a whole heck of a lot less (I am betting if this rumor is true at all, then Apple has given the units to the school).
What I am wondering is, what OS is this cluster going to run? I mean, have the BSD folks figured out how to scale? No chance it will be OS X...maybe AIX?
[RIAA] says its concern is artists. That's true, in just the sense that a cattle rancher is concerned about its cattle.
The cluster has nothing to do with Apple. They were contemplating using PPC970 CPUs (basically becasue they're cheaper than the Power4's), but AFAIK there were never plans to use complete G5 computers, or anything else made by Apple. And I doubt IBM would take CPUs from Apple to give them to V.Tech. The delay is mot likely due to performance issues; they're probably fine-tuning the OS so it's competitive with x86 / Windows systems.
The grant money that flows into a public research and occasionally teaching institution can be stagering, and absolutely dwarf the money students pay in tuition (sometimes by a factor of 10!). A better question might be, why don't the gradstudents donating their labor, possibly to patents that will be controlled by the university, recieve more consideration, and fair labor law protections.
But I would bet this will be not too dissimilar in use from the HP Itanium2 referenced earlier on slashdot. I would bet one of the paramount concerns this cluster would look at is the effect of farm runoff, and probably climatology too among other things.
--Jimmy has fancy plans; and pants to match.
This won't help sell their clustering hardware:
http://www.apple.com/server/clustering.html
eleven plus two / twelve plus one
With 1100 machines in the cluster, there must be _at least_ 2200 DIMMs. Since these must be 400MHz (PC3200) DDR, they can't be on a large 0.15 micron DRAM process, but most likely between 0.11 and 0.13u.
d /d imm_results.htm
Who cares?
APPLE G5'S DO NOT SUPPORT ECC.
The random bit error rate for 2200 DIMMs with 0.13u cells is roughly one '1' bit dropped to '0' every 9 hours. In other words: good luck getting any reliable, large-scale computation done with this cluster. (And I do mean "good luck" - they might get a run of two or three days without any problems once in a while.)
Now if only Apple would support PC3200 ECC DIMMS, which certainly do exist:
http://www.intel.com/technology/memory/ddr/vali
this cluster might be a bit more useful for real work.
-I'm about to buy another in a day or two...
... there is a pretty big gap between asking for one and actually getting one. Tell you what, let me finish this gig I got happening in Virginia and then we can start dealing with the customers that want to order G5 machines in the onsies - twosies quantities.
... now THAT is hardcore.
You mean you are about to order another in a day or two
Sincerely,
Steve.
sjobs@apple.com
PS - Seven Mac's in ten years isn't hardcore. 1100 units ordered on one purchase order before they even ship (August, 2003)
Glonoinha the MebiByte Slayer
ooops, yuo = own3d!!!!y pe=99&cpumhz=1000&recordid=1&contest=rc572&multi=0
http://n0cgi.distributed.net/speed/query.php?cput
Power PC 7450/7455 G4 1000 MacOS X 10.2 2.9005 RC5-72 10,594,666.00
DecafJedi
DecafJedi
my weblog: apropos of something
That first column says "Report as Error" It's a link to report the number as an error if it is unusually high compared to its neighbors, not to state that the actual figure is an error. Click the link, it will explain the system to you. I must be off because being a homosexual, a Macintosh user, and a priest takes a lot of my time! At least the slashdot trolls seem to think so! Toodles!!
i would take this story to imply that a G5 powered Xserve is not going to be shipping anytime soon..... the Xserve is made to cluster and run in situations like this. i guess the rumor sites can speculate if it's G5 parts available or some other holdup on a G5 Xserve.
/. a year or so ago about a group that went from building a rack and unboxing their G4s to a running cluster in part of a day. i really don't remember the specifics but i think it was something like 30 G4s? i would guess the G5 is not that much harder... and they seem to have Apple helping. maybe they hooked up the optical cards from the Xserve...... we'll see i guess.
unless there is some reason the desktops are better for this project that i did not pick up on?
as for the above question about Macs.... depending on what they want to really do with this, Altivec is really efficient for some computations. all flame wars aside there have always been people clustering Macs for certain uses. i do not know how much of it was user preference or the software they wanted to run or the simplicity of getting the cluster running.
it is supposedly VERY simple to cluster Macs. there was a story on
I've been tempted to order a dual G5. I've resisted the temptation by realizing that my only real reason for wanting it would be to awe friends and co-workers. Pretty shallow. I was ashamed.
What a surprise to find that the folks who buy multi-million dollar supercomputers seek some of the same shallow satisfaction that moves me--bragging rights.
Still, if a single order for 1100 units causes significant delays filling orders for other customers, Apple must not have been expecting to sell many of these things. Maybe I should place an order just to help out.
When all you have is an axe, everything looks like a grindstone.
For those unbelievers, here's a little proof and maybe a bit more
Here's an official word (search for Teraflop).
Also, here's the original e-mail that went out (a month ago) They never mentioned Apple though:
> Date: Mon, 28 Jul 2003 17:36:46 -0400
> From: Jason Lockhart <multimedia@vt.edu>
> Subject: Terascale Assembly Assistance
>
> Hello all,
>
> As you may know the College of Engineering in conjunction with the
> university Information Systems and Computing organization are
> building
> a 10 TeraOp, 1100 node supercomputing cluster. We are in need of
> volunteers to assist in three areas of assembly; cabling, RAM and PCI
> card install, and machine racking. We would like to get as many
> volunteers as possible.
>
> Some logistical things, there is not adequate parking for every
> volunteer to drive and park at the Computing Center. We
> would ask that
> volunteers either carpool (4 or more to a vehicle) or take the BT.
> Initial cabling will be done with the power cables beginning this
> coming Friday, August 1st. We have yet to set a start time,
> but I want
> to get an idea of who is available to assist on that day as well as
> availability for the weekend and early next week. I will forward the
> time to arrive as soon as I nail it down...should be tomorrow. The
> power cabling represents one third of the overall cabling that will
> need to be done.
>
> I need to get a list of those who will be assisting to the facilities
> people at the ISB as soon as possible. This list will be
> used to allow
> volunteers access to the building and the machine room. If
> you're not
> on the list you will not be allowed in! We want to have at
> least 30 to
> 40 people working at any one time on the project. If you know others
> that will be interested in assisting, please have them email me so I
> can get them on the roster.
>
> Thank you for your willingness to participate, and please respond and
> have others respond with "Terascale" in the subject so I can
> filter the
> volunteers properly.
>
> Thanks.
(E-mail signature removed)
As Zack pointed out, iWalk was not a Think Secret report; in fact, we debunked it. For WWDC, we reported that Apple would announce 64-bit Power Macs as well as a videoconferencing camera that we said would be called "iSight," -- I think we're in the clear there. iWorks? I maintain that it is still a future Apple release. As for 12-inch and 17-inch PowerBooks, while we raised the possibility of a release that week, we specifically said we couldn't confirm the delivery date: "It's unclear when Apple plans to announce the upgrades..."
Bottom line? Like any other news organization, Think Secret has occasional misses. But those misses don't appear to include any of the items mentioned here. I think our record speaks for itself.
Nick dePlume
Publisher and Editor in Chief
Think Secret
For the ones who are questioning this existence, the order is shipping, the racks (a ton of them) are there in the main Computing Center server room. First they required all servers to be moved innto racks. Then they started moving servers around, including removing the Petaplex. The power has been upgraded in the server room (the UPS backup generator actually). This caused a morning of basically all the important servers on campus having to go down for one day in the summer - I hated waking up to go switch off machines for that one. The AC has been upgraded to accomidate the huge amount of heat to be put out. It was't until I heard about the cluster that all the chages in the Machine Room made sense. Now they're recruiting help to do the grunt work of putting all the machines in the racks.
/. link to the campus geek list (If someone hasn't beaten me to it).
/. since a cluster this size is noteworthy of the frontpage. (Rumor - and this is rumor sice I haven't goe to direct sources on this - is that it will not be running OS X, and probably BlackLab or YellowDog or SuSE.)
The stated objective was to be on the next 500 list. Dell and HP were considered, but they couldn't fill the order in time (possibly as they have made announcements of other large clusters recently) and Apple promised delivery after someone leaked the story of the cluster meetign with Dell and HP to Apple and Apple jumped at the chance.
Basically, the story is not a rumor from the point of view of the geeks on campus who have been effected by the preperations. I'll probably post the
I'm disapointed about this being only on the Apple section of
I am, and always will be, an idiot. Karma: Coma (mostly effected by
You got a link for dual Opterons for 1500 bucks with all the goodies in the G5? If yes, Id love to see it.
:) )
(seriously - I can take out a loan
Obviously you haven't looked at VT recently. Tuition and fees is only $7,500, out of state. I can only wish that my tuition were that low. Hell, for in-state students, the room and board is the same price as tuition (around $2,000). But of course, you're modded up insightful, because you pulled a random idea out of your ass and presented it as fact.
I'm a lawyer, but not yours. I wouldn't represent someone who thinks taking legal advice from Slashdot is a good idea.
And 8 hours@12.4GFlops...damn you Virginia Tech, you owe me a third of a quadrilion floating point multiplies!
According to Apple, there were "over 100,000" pre-orders for the G5. Now this includes single processor models, but the university's alleged order of 1100 machines is not going to make a big impact on everyone else.
Besides, the real reason that Apple's machines are late is case defects and AGP problems, amongst other issues that Apple has not been forthright about. At the keynote an honest Apple employee told me the machines wouldn't ship until October as there were many little problems and I should wait for the January refresh so I don't get a flaky machine.
And one has to wonder why anyone building a cluster would build it using desktop machines and not use the forthcoming G5 rackmount machines from Apple and IBM... which is supposed to include a quad-processor from IBM.
1. The PPC970 draws from the Power4 lineage, which I have used for a long time. The PPC970 has 2 double precision FPUs, each capable of fused multiply add instructions leading to 4 flops/cycle/processor (2 units*2flops/cycle). This is identical to the Itanium2 FPU microarchitecture. The Opteron on the other hand can only do 2 double precision flops/cycle, which makes it only half as powerful on matrix heavy scientific computations, when compared to the PPC970 or the Itanium 2. The PPC970 should really be compared in FP terms to the Itanium2 at 1/10th of the cost, and at 2GHz it is clocked higher than the top-end 1.5GHz Itanium2 Madison. Moral of the story, read thy arstechnica. 2. The standard benchmarking process (LINPACK) only uses double precision FP. If this rumor is true, then this machine is capable of an Rpeak (LINPACK) of 17.6 Teraflop, which those of you who follow top500 will realize is quite substantial. 3. If they are really using Infiniband, this should be a nice machine. Infiniband provides 10 Gbps (20 Gbps full duplex) of bandwidth, which is much faster than either Myrinet or Quadrics. Also Infiniband latency is 10us and the benchmarking process is bandwidth not latency sensitive. On the other, this stuff is really expensive. If all of this is true, this would be a major engineering endeavor. Also, it is probably cheap. However, all in all, this could well just be a rumor (come on it is thinksecret - remember iWorks). If not, this should be a fairly substantial machine.
It's just too darn hard to make a shared memory computer with 1000's of processors. So the common architecture is to make a cluster of smaller shared memory machines.
It's hard, but not too hard or impossible. The Silicon Graphics Origin 3000 supports 512 processors in a single image system with the stock IRIX kernel and 1024 processors with the "XXL" kernel.
Rumor has it Origin 4000 will support 2048 processors, as will Altix once SGI has done some major work with their kernel patches. (Altix is currently limited to 64 processors per system image).
They went with the G5's because they were the cheapest 64 bit solution and because they would use less power and generate less heat than alternative systems. That is the whole of it.
They are having someone write infinaban drivers for OS X just for this cluster.
I look forward to helping install 4GB of ram + the infinaban cards in each of these bad boys.
It is great having connections!
And that opteron system will use 3x the power and heat. Universities don't spend millions on a 'non efficient choice'. Smart people did the math, and they had facts - which you don't.
Damn, ethernet controllers must really piss you off then, huh?
As long as they put it on Legion, U.Va. will get to maintain state bragging rights. I don't keep up with the football rivalry, but this is much cooler anyway.
WARNING: there is a trojan on your
If the money granted was earmarked for a specific project their is nothing the university can do to transfer it to another need. It is either they use it or lose it. Fortunately these grants usually include monies for maintenance of the systems and the staff required, which means that personnel can be put under this new grant area and alleviate costs from another area. So in the end, grants like this help other areas of the school and will attract and create other sources of revenue.
So how does this anti-intellectual tripe qualify as insightful? Any yahoo can point at and complain about just about any non-trivial project at a research university whether it is public or private. If they were building it just to attain a certain ranking without any research proposals or plans it wouldn't be hard to find fault. Does anyone that could possibly be the case here? I think this sort of empty headed bushwhacking is a cheap shot and contemptible.
Is there something particularly about building any clusters today that is ill advised? Anything specifically about a cluster built with these parts? Why do any science that involves a large expense when the money could be applied to "lowering tuition"? Maybe because an important part of the mission of some universities is to advance the state of knowledge by performing research that would not be done by other segments of society.
Sure, having a top-5 supercomputer is cool, and the bragging rights will garner the school some additional funding and scholarships, but at the end of the day after the benchmarks are run and the empty Jolt(tm) cans are recycled, what will they *do* with it?
Chip H.
Well the only AMD alternative is the 2GHz Opteron whose processor alone costs about 900/processor. You won't be able to get a dual 2GHz Opteron for less than Apple's retail dual 2GHz G5. Too, you have to make sure PCI-X slots are there for the interconnect cards they are using. But as was stated elsewhere, the G5's can process twice the scalar FP instructions of the Opteron or the Itanium per cycle. The University as also relies on vector math and the G5's have a significantly better SIMD unit.