Intel Announces Xeon E5 and Knights Corner HPC Chip
MojoKid writes "At the supercomputing conference SC2011 yesterday, Intel announced its new Xeon E5 processors and demoed their new Knights Corner many integrated core (MIC) solution. The new Xeons won't be broadly available until the first half of 2012, but Intel has been shipping the new chips to a small number of cloud and HPC customers since September. The new E5 family is based on the same core as the Core i7-3960X Intel launched Monday. The E5, while important to Intel's overall server lineup, isn't as interesting as the public debut of Knights Corner. Recall that Intel's canceled GPU (codenamed Larrabee) found new life as the prototype device for future HPC accelerators and complementary products. According to Intel, Knights Corner packs 50 x86 processor cores into a single die built on 22nm technology. The chip is capable of delivering up to 1TFlop of sustained performance in double-precision floating point code and operates at 1 — 1.2GHz. NVIDIA's current high-end M2090 Tesla GPU, in contrast, is capable of just 665 DP GFlops."
4chan is still down? Maybe we should lend them a hand.
Faster! Faster! Faster would be better!
time to do some bitcoin mining?
I mostly understand the figures this post states, but it sounds like engineering dialog from 'Star Trek: Voyager'. But, all this means to me is that the chips from last year are now cheaper that they've been out-classed.
Xeon is dead remember
-L. Ellison
When they said nobody needed multicore processors I heard the echos of "640K should be enough for anyone" and "There is no reason for any individual to have a computer in his home" Now they're trying to see how many they can jam on one die. 50 is a pretty odd number, though. Usuall see things in powers of 2 (2, 4, 8, 16) Perhpas they neede space on the die for Mickey or an etched portrait of Jobs.
A feeling of having made the same mistake before: Deja Foobar
Pfffft! My prosthetic horse cock penis can deliver 500 OPH (orgasms per hour)
"DP" is double precision in this case, not the other one;)
A 50 core chip at 1GHz is going to need to perform 20 double precision floating point ops per cycle per core to achieve 1Tflop performance. OK, so 1.2GHz cuts that down to 16flops/clock. Since when can anything Intel Architecture achieve that many flops per cycle? Two 4-element dot products is only 14 flops. I suppose if they did two vector-scaler multiply-adds that would get 16 flops per cycle. So I just answered my own question. But can they really keep the FP unit running continuously at that rate? On all 50 cores?
Today I can go to the store and buy the Nvidia board that they mention. When can I buy a system with a Knight's Corner chip? What about a PCI-E board? The answer is never. It will only be sold to Intel's partners in labs and research environments for special projects. It means very little to most of us.
But they don't.
Wonder if they'll produce a consumer version.
I use an ATI card as my main video card, wouldn't mind sticking a physics card in the other PCI-E slot. The thing is that if I put in an Nvidia card it won't work as a physics card since Nvidia has written the drivers in such a way that if you have a non-Nvida video card as your primary video card Nvidia will not allow you to use their cards just for physics.
So my hope is that if Intel puts out a consumer version then either I'll be able to buy an Intel board just for physics or Nvidia will drop their stupid restriction.
Either way if Intel puts out a consumer physics card I win.
Don't know something? Look it up. Still don't know? Then ask.
We may yet see high-end Intel discrete graphics cards in the future.
Knights Corner sounds like it is basically a high-end GPU without the actual graphics output. This lets Intel position it as a professional product for HPC and supercomputing, and squeeze out as much profit as possible from the early models. Then, once the R&D cost has been amortized and the fab technology is advanced further, they can add a HDMI output, dedicated RAM, and glue logic, and write appropriate drivers to make it a full-fledged graphics card. Of course this may lack some features of the professional Knights Corner (ECC support?) so it won't cannibalize the high-end market. But it has the potential to be much more power-efficient than AMD and nVidia enthusiast products.
I'm at SC11 right now and just attended NIC's MIC presentation. The scaling looks fantastic according to various codes that they compiled to run on it, but what was notably absent was performance relative to traditional x86 chips. The final presenter even said that now that the technology has been demonstrated to work (with minimal porting effort required) the next step will be to optimize and improve performance. The take away is that relative to Intel's other chips, MIC performance wasn't impressive enough to include in the presentation. That's fine in my book because it's an ambitious project, but it sounds like there is still some work to do.
Just shows you the progress in CPU power: ASCI Red was the first supercomputer to go over 1TFlop, and was massive, now we have this with just one chip!
-- The Grand Teddy Bear has Spoken: "Windows 8 Source Code Available NOW! more disgusting than your pr..."
A beowolf cluster of these! but seriously even one wouldn't be efficient enough to be worth it yet even in top-of-the-line OS's. We need a whole new paradigm of algorithms and maybe even a new language to do this right.
If video games influenced behavior the Pac Man generation would be eating pills and running away from their problems.
http://techresearch.intel.com/spaw2/uploads/files/FASTsort_CPUsGPUs_IntelMICarchitectures.pdf
OC it should compile run a current x264 with avx simd fine, but the question is with its wider SIMD and other improvements can it produce a/several high profile 3K/4K super HD video streams far better than realtime on insane settings :) and using CRF=16
Everyone seems to be defining High Performance Computing as CISC/RISC chips with multi-core processors utilizing Instruction Level Parallelism and Thread-Level Parallelism with extremely fast multi-level Caching. HPC High Availability computing is a synergy of CISC/RISC chips combined with Application, Integrated Instruction, Facility, Graphic and Cryptography assisted processor technologies supplemented with Integrated Coupling facilities. These processors must share access to large amounts of fast Dynamic Random-Access Memory and be integrated into fast I/O bus architectures for a variety of High Performance connectivity with networking equipment, data storage, and other peripheral devices. All this hardware must work in concert with a variety of firmware, hyper-visors, and operating systems supporting telecommunications, storage management, databases, application run-time environments, application servers, web servers, online transaction servers, redundancy, security, applications, fail-over, backup, recovery, archival, and administration management. IBM has been doing HPC/HA for quite a while now. Hewlett-Packard, Oracle Sun Microsystems, Microsoft, Intel, EMC, and Cisco seem to be still chasing the dream.