Larrabee Based On a Bundle of Old Pentium Chips
arcticstoat writes "Intel's Pat Gelsinger recently revealed that Larrabee's 32 IA cores will in fact be based on Intel's ancient P54C architecture, which was last seen in the original Pentium chips, such as the Pentium 75, in the early 1990s. The chip will feature 32 of these cores, which will each feature a 512-bit wide SIMD (single instruction, multiple data) vector processing unit."
A little context might help. This isn't the Inquirer for god's sake.
good. sounds like a sensible engineering decision.
on the basis that..
the design is well known, understood and has had rigorous testing in the field
they will no doubt fix any understood errors first
limits the RnD to the multicore section
as long as the chip performs well for the silicon overhead then they should feel free to cram as many in as they want.
seems perfectly sensible to me.
I know I'm wasting my time responding to such a blatant troll, but there's nothing hypocritical about saying that the original Pentium 1 was a pretty bad chip, and the Core 2 Duo is a pretty great one.
Failing to reliably perform basic floating point ops is pretty embarrassing. But Intel's come a long way since then.
From TFA "Heise also claims that the cores will feature a 512-bit wide SIMD (single input, multiple data) vector processing unit. The site calculates that 32 such cores at 2GHz could make for a massive total of 2TFLOPS of processing power."
I don't see how they get to 2 TFLops.
512-bit = 64 bit * 8 way SIMD or 32 bit * 16 way SIMD. Let's go with the bigger of these two and say we are performing 16 single-precision floating-point operations per clock cycle per core. 16 operations per clock per core * 32 cores * 2 billion clocks per second = 1024 single-precision GFlops. It looks more like 512 double-precision GFlops for 300 Watts, which means a DP teraflop on Larrabee will cost you 513 dollars a year at 10 cents/kWh. If we're considering single precision, we can cut this in half to 257 dollars per year per single-precision teraflop.
Compare to Clearspeed which offers 66 DP GFLops at 25 Watts costing 332 dollars for a sustained DP teraflop for a year.
Even the NVidia Tesla has better performance at single precision: you can buy 4 SP TFlops consuming only 700W, or 5.7 GFlops/Watt, for an annual power budget of 153 dollars per teraflop.
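The arithmetic in this comment can be checked with a short sketch. The function and variable names are made up for illustration, the hardware figures are the ones quoted in the thread (not verified specifications), and `ops_per_lane` is an added assumption: the comment counts one operation per SIMD lane per clock, and setting it to 2 (for example, counting a multiply-add as two operations) would double the peak figure.

```python
HOURS_PER_YEAR = 24 * 365   # 8760 hours of sustained operation
PRICE_PER_KWH = 0.10        # dollars per kWh, as assumed in the comment


def peak_gflops(cores, clock_ghz, simd_bits, element_bits, ops_per_lane=1):
    """Peak GFLOPS = cores * clock (GHz) * SIMD lanes * ops per lane per clock."""
    lanes = simd_bits // element_bits
    return cores * clock_ghz * lanes * ops_per_lane


def dollars_per_tflop_year(gflops, watts):
    """Yearly electricity cost of sustaining one TFLOPS on the given hardware."""
    watts_per_tflop = watts / (gflops / 1000.0)
    kwh_per_year = watts_per_tflop / 1000.0 * HOURS_PER_YEAR
    return kwh_per_year * PRICE_PER_KWH


# Figures quoted in the thread: 32 cores, 2 GHz, 512-bit SIMD, 300 W.
larrabee_sp = peak_gflops(32, 2.0, 512, 32)   # 16 lanes of 32-bit floats
larrabee_dp = peak_gflops(32, 2.0, 512, 64)   # 8 lanes of 64-bit floats

print("Larrabee SP GFLOPS:", larrabee_sp)
print("Larrabee DP GFLOPS:", larrabee_dp)
print("Larrabee DP $/TFLOP-year:", round(dollars_per_tflop_year(larrabee_dp, 300)))
print("Larrabee SP $/TFLOP-year:", round(dollars_per_tflop_year(larrabee_sp, 300)))
print("Clearspeed DP $/TFLOP-year:", round(dollars_per_tflop_year(66, 25)))
print("Tesla SP $/TFLOP-year:", round(dollars_per_tflop_year(4000, 700)))
```

With one operation per lane per clock, the model gives 1024 SP GFLOPS, which is where the comment's disagreement with the article's 2 TFLOPS figure comes from.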
Obviously they're not just going to slap a bunch of Pentium cores on there and call it good. But the high-level design can probably start off with the P54, and just rip out stuff that doesn't need to be supported, possibly including:
Scalar floating-point, 16-bit protected mode, real mode, operand size overrides, segment registers, the whole v86 mode, the i/o address space, BCD arithmetic, virtual memory, interrupts, #LOCK, etc, etc.
Once you've done that, you'll have a much simpler model to synthesize down to an implementation. And with a slightly-modified compiler spec, you can crank out code for it with existing compilers, like ICC and GCC.
Oh, so 2 years from now (two lifetimes in the GPU business) Intel will be releasing a chip comparable to this month's ATI HD 4870 X2.
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
there's nothing hypocritical about saying that the original Pentium 1 was a pretty bad chip, and the Core 2 Duo is a pretty great one.
Have you compared the total length of Pentium errata with the length of the Core 2 Duo errata?
Support SETI@home
Intel's basically doing here what Sun did with the Niagara series, but without concurrent threading. I suppose it wouldn't be too tough to add it in, though. The cores in the Niagara are really simple 6 or 7 stage pipelines. They don't do any forwarding, and stall at pretty much every hazard they hit. Instead of adding all the complicated circuitry needed to do advanced pipeline stuff (like forwarding and OoO etc), they just defer execution to a new thread. All the threading is in the cores themselves, so there's no need for OS intervention.
I should add that the Niagaras are pretty awful when it comes to single-threaded performance, though. I use a T2 daily and starting up Firefox can take about 30 or 40 seconds. But they're great if you can manage to parallelize your programs (which is easier said than done).
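The trade-off described here (simple cores that switch threads instead of hiding stalls in hardware) can be illustrated with a toy model. This is only a sketch under loud assumptions: every instruction is assumed to stall its own thread for a fixed `stall_cycles` after it issues, the core issues at most one instruction per cycle, and threads are picked round-robin. None of this is taken from Sun's actual design, and the function name is hypothetical.

```python
def utilization(num_threads, stall_cycles, total_cycles=10_000):
    """Fraction of cycles in which the toy core issues an instruction.

    Assumption: after a thread issues, it cannot issue again for
    `stall_cycles` further cycles (a stand-in for pipeline hazards).
    """
    ready_at = [0] * num_threads   # earliest cycle each thread may issue again
    issued = 0
    next_thread = 0                # round-robin starting point
    for cycle in range(total_cycles):
        # Look for the first ready thread, starting from the round-robin pointer.
        for offset in range(num_threads):
            t = (next_thread + offset) % num_threads
            if ready_at[t] <= cycle:
                issued += 1
                ready_at[t] = cycle + 1 + stall_cycles
                next_thread = (t + 1) % num_threads
                break
        # If no thread was ready, the core simply idles this cycle.
    return issued / total_cycles


for n in (1, 2, 4, 8):
    print(n, "threads:", utilization(n, stall_cycles=3))
```

In this model a single thread keeps the core busy only a quarter of the time, which matches the poor single-threaded experience described above, while four or more threads keep it fully occupied.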
The division bug was far more serious than any of the errata for Core 2, as it caused calculations to go wrong with no indication of why and to go wrong very often (every time you divided).
Making math errors at blazing speeds...
To err is human.
To really screw up, you need the aid of a computer.
There is no "I disagree" mod for a reason. Flamebait, Troll, and Overrated are not substitutes.
/Every/ time you divided? Bull-SHEET. It only affected a certain few pairs of operands.
http://en.wikipedia.org/wiki/Pentium_FDIV_bug
Executive summary: You're a moron who repeats things he doesn't understand.
Hail Eris, full of mischief...
E pluribus sanguinem
The good news is that video card manufacturers have heard the plea and are trying to reduce the power consumption on their newer cards. nVidia's newest GTX series cards draw less power when idle than pretty much anything they've made outside of their Mobile line in years, although they are voracious when running full tilt. As long as you spend most of your time not gaming (which is true of most people) they won't inflate your power bill nearly as much as their maximum power draw might suggest.
I read the internet for the articles.
Why the unhappy face? ARM is probably the only example of the best technology winning that I can think of. You do know their market share dwarfs x86, right? Even just counting the ARM chips in mobile phones, they are outselling x86 chips 3:1, and mobile phones are far from the only places you find ARM cores.
I am TheRaven on Soylent News
I guess back then it would have been difficult to notice. If you were using that 75 MHz beast for nothing but floating point processes it would produce about one error every 117 seconds. I'm sure there are/were a lot of applications (not standard users) that would have really been affected by the flaw. Add in multiple core servers and you could have some pretty hefty issues.
Pretty crazy that these days an error like that would rear its ugly head every 3-4 seconds (Folding@Home or SETI?).
This sig isn't original enough, it's time to come up with something witty...