TACC "Stampede" Supercomputer To Go Live In January
Nerval's Lobster writes "The Texas Advanced Computing Center plans to go live on January 7 with "Stampede," a ten-petaflop supercomputer predicted to be the most powerful Intel supercomputer in the world once it launches. Stampede should also be among the top five supercomputers in the TOP500 list when it goes live, Jay Boisseau, TACC's director, said at the Intel Developer Forum Sept. 11. Stampede was announced a bit more than two years ago. Specs include 272 terabytes of total memory and 14 petabytes of disk storage. TACC said the compute nodes would include "several thousand" Dell Stallion servers, with each server boasting dual 8-core Intel E5-2680 processors and 32 gigabytes of memory. In addition, TACC will include a special pre-release version of the Intel MIC, or "Knights Bridge" architecture, which has been formally branded as Xeon Phi. Interestingly, the thousands of Xeon compute nodes should generate just 2 teraflops worth of performance, with the remaining 8 generated by the Xeon Phi chips, which provide highly parallelized computational power for specialized workloads."
I can't find any source at all.
I wonder why it's got such little memory? You can easily run 64GB per socket at full speed with the E5-2600 (16GB x 4 channels) without spending that much money. Heck for maybe 10% more you can run 128GB per socket (You need RDIMM's to run two 16GB modules per bank). They're apparently only running one 16GB DIMM per socket (any other configuration would be slower on the E5) which IMHO is crazy as you're going to have a hard time keeping 8 cores busy with such a small amount.
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
The summary mentions that 2 teraflops are generated by the CPUs while 8 are generated by the Knights Bridge chips. It should say petaflops.
Computer simulation made easy -- LibGeoDecomp
"Petaflops" is not representative of the power of modern supercomputers, many of which use massively parallel integer processing to perform their duties. Sure, you can say that simulating floating point operations with the integer units amounts to the same thing, but it actually doesn't. We have discovered that there are a great many real-world problems for which parallel integer math works just fine, or even better (more efficient) than floating point. And for those, flops is a completely meaningless metric.
We need a standard that actually makes sense.
The Knights Corner chips use GDDR5 memory, bandwidth is a big problem when you have 50+ cores to feed.
This reminds me of an old science fiction story. The designers, builders and programmers assemble. The Switch is flipped. The computer boots. The first question they ask is, "Is there a God?" The machine hums away for a few seconds, then arc welds the power switch open and responds, "There is now!"
I've calculated my velocity with such exquisite precision that I have no idea where I am.
Ooops, scratch that miss-read the summary. There probably is n't a need for that much memory because the kind of problems they are most likely to be dealing with will have massive datasets that don't fit in memory anyway. The limiting factory will be CPU and node interconnect bandwidth so adding extra memory wont make much if any difference to performance.
I'm pretty sure you are mistaken on this point.
Most modern supercomputers get their "flop" count from SSE3/4 and/or GPUs which are not integer, but Floating point processing machines(at least 32-bit single precision fp, but also double precision albeit at a slower rate). These machines most certainly do NOT simulate floating point with their integer units (nor cheat by calling an integer op as an approximate fp op), and they have massive amounts of dedicated hardware SIMD FP processing units to do their heavy lifting.
Of course there are many real world problems that could use parallel integer math and CPUs and GPUs are also capable of lots of SIMD integer ops as well, but that's not how supercomputers are rated these days, they are rated by the number of IEEE FP operations (mostly FMA or fused multipy-add counting as 2-ops) with at least 32-bits of precision.
The integer OPs currently don't count in the current ratings and I don't see that changing any time soon. Important scientific operations like matrix inversion, finite-element analysis, FFTs, and linear programming don't work the same with integer ops, so it is unfair to compare supercomputers by their integer ops.
No Lego!
how does it rank in the TOP100? Or the TOP1000?
yea will it run Doom!
I'm seriously bothered by the fact they couldn't figure out how to put an O at the end of the acronym.
The world is made by those who show up for the job.
The summary is poorly written. It doesn't answer the question most Slashdot users are dying to know about supercomputers. Does it run Linux?
The chief science official of Texas has divined that the computational projects will be:
1. Derive a proof that the universe is only 5000-6000 years old.
2. Derive a proof that God is a silver haired, white man from Texas.
Just how many supercomputers are required for a stampede, Earl? I mean, is it like three or more? Is there a minimum speed?
In the free world the media isn't government run; the government is media run.
I heard about a new raspberry pi supercomputer here yesterday, which is more powerful? Which one makes more bitcoins per MPAA lawsuit? Can I get a Beowulf cluster of these in soviet Russia? or in soviet russia do bitcoints cluster my beowulf?
I got excited for a couple seconds, I thought it was talking about a "Taco Stampede".
#DeleteChrome