Intel Details Upcoming Gulftown Six-Core Processor
MojoKid writes "With the International Solid-State Circuits Conference less than a week away, Intel has released additional details on its upcoming hexa-core desktop CPU, next gen mobile, and dual-core Westmere processors. Much of the dual-core data was revealed last month when Intel unveiled their Clarkdale architecture. However, when Intel set its internal goals for what its calling Westmere 6C, the company aimed to boost both core and cache count by 50 percent without increasing the processor's thermal envelope. Westmere 6C (codename Gulftown) is a native six-core chip. Intel has crammed 1.17 billion transistors into a die that's approximately 240mm sq. The new chip carries 12MB up L3 (up from Nehalem's 8MB) and a TDP of 130W at 3.33GHz. In addition, Intel has built in AES encryption instruction decode support as well as a number of improvements to Gulftown's power consumption, especially in idle sleep states."
Can most programmes really be written to take advantage of so many cores? I am not sure I want to have a 6-core processor, of which 5 spend most of the time idling as I am only running a single-core-aware programme. OK, one more core can be used by the OS to make everything snappy, but the question stands.
Perhaps a jump in number of cores will convince people outside the Apple and FreeBSD camps to port Grand Central Dispatch.
Letting the kernel team handle the hairier parts of multi-threaded design should make it easy for barely-optimized software to use powerful hardware.
Could its Apache license work with the #1 OS family?
I'll be your candy shop of infinite deliciousity if you'll be my discotheque of endless rump-shaking.
1.17 billion transistors into a die that's approximately 240mm sq
That's a big chip.
we see things not as as they are, but as we are.
-- anais nin
Talked to our HP rep a few weeks ago about them. As soon as HP ships proliant servers with the new CPU's, we're going to buy 4 of them. Just haven't decided if we're going with 36GB RAM or 72GB RAM. 72GB RAM is only $2000 more than 36GB RAM these days.
Just so you know, I made this joke almost two years ago:
http://hardware.slashdot.org/comments.pl?sid=465898&cid=22548916
They could have gone to 3 cores, like the competition. That seems like the logical thing to do, but they said "Fuck it, we're going to six". What part of this don't you understand? If two cores is good, and four cores is better, obviously six cores would make them the best fucking CPU that ever existed.
http://www.theonion.com/content/node/33930 [theonion.com]
/I'm just waiting for the day Intel says "this one goes to 11"
It's the CPU joke that will never die.
[Fuck Beta]
o0t!
first
looks like you need more cores
So I skimmed TFA (gasp!) and it appears that Intel is finally following AMDs lead by keeping thermal envelopes constant.
I note that this is still a effectively 2 CPUs with 3 cores each, but that's better than legacy Intel approaches, which would have been 3 sets of dual cores.
It will be interesting to see how independent performance benchmarks play out between the new processors that are coming out.
The cesspool just got a check and balance.
blah blah Beowulf blah blah
1.17 Billion transistors. Anyone remember the 6502, the 6800, and then the 68000? 68K transistors was a LOT in 1980 and made for a fantastic 32bit architecture. Now we're at 17000 times that count. Sometime you just have to stop for a moment and think just about the numbers.
This is a server processor. They did it for advanced encryption. The only way this would make more powerful DRM is if there were some sort of key embedded in the CPU (and this is not that.)
This is for encryption, which unlike DRM is actually more than security theatre (when used properly.)
What?
AES acceleration will be useful for VPNs, serving SSL websites, VoIP, full disk encryption ... and so on.
Instead of churning out cores they schould tweak the x86 isa to use multiple cores efficently. 1/2-word Atomic compare and swap is not enough, you cannot make atomic lockless doubly linked lists with that. No wonder something as interesting as http://valerieaurora.org/synthesis/SynthesisOS/ is not possible on x86 without major hacks.
Why put AES on-board? I thought AES was relatively fast as encryption algorithms go. Plus, it is inevitable that AES will be replaced at some point, so why include something so specific in a chip now? It will suck to have to implement that in the processor in 20 years when nobody uses AES any longer. This is the whole point of a processor - include generic instructions that are useful for implementing any algorithm.
>Westmere 6C (codename Gulftown)
Really? I fricking hate codenamed codenames...
In the server space we've gone through the same thing. Sun introduced the T1 with 8 cores and 32 threads (now 64 on T2). Lots of software wasn't suitable for this type of horizontal scaling. But over a period of five years, that changed dramatically.
On the desktop, you can expect the same. For now, not many desktop apps will take advantage of the additional cores. But if Intel would have stuck with 1-2 cores, no software will be written to take advantage of multiple cores.
Chicken or egg ....
Browsers shouldn't have a back button!! It's all about going forward...
Which is obviously the reason AMD's and VIA's CPUs had the same thing over half a decade ago, right? DRM sure was prolific back then.
I note that this is still a effectively 2 CPUs with 3 cores each
You need to skim a little slower. It is a single die, native six core chip. From the article:
Westmere 6C (codename Gulftown) is a native six-core chip
Yes, some functional blocks are "split in two" and assigned to three cores. Those blocks are split up because it makes engineering sense to do so, not because the designers were lazy or "slapped together" two non-six core designs.
If you look at a GPU with 10's or 100's of cores you will see that some functions are "split up" with each piece being assigned to (and located close close by!) some subset of cores. That does not make them "effectively X number of CPU's."
"Liechtenstein is the world's largest producer of sausage casings, potassium storage units, and false teeth."
VPNs, serving SSL websites, VoIP, full disk encryption ... and so on
Well, yeah, but I think the GP's point is that while some of those things have been in mainstream use for years, the use case for DRM is what made it important enough to be implemented.
I'm pretty sure it is one die, with communication possible between any cores. It just looks like 2x3 due to the way it is laid out.
... is the number!
This is a server processor.
Well, the summary says "desktop CPU", while the URL of TFA says "mobile"...
It's official. Most of you are morons.
This comes the same month as the release of 16-core processors by IBM and Oracle, and a 12-core from AMD. This isn't that impressive.
You can bet the sole reason for including this was to support DRM-protected content.
It may very well be useful for DRM, but I would venture that there are several reasons to include further AES instructions in hardware; one of the chief being full disk encryption performance.
Don't think of it as a flame, more like an argument that does 3d6 fire damage.
with an AMD X3 Core 2 Duo or AMD X3X2...
Though I wonder why we are going to 6 rather than 8. Core 2 Quad Duo's? Head 'esplodes....
I just can't wait till the Quad Quads... or something spiffy, like Quad Squared. 16 is probably a ways off from the consumer market anyway.
Jesus Christ APK, you're useless.
If you ignore ACs because they are anonymous - you're an idiot.
Having 6+ cores is a complex issue for the desktop. In the 'olden' days it way easy to do the math that per Mhz, the less cores the faster ( a pair of 1Ghz chips is slower than a 2Ghz CPU if all all other things are equal)
But now, each CPU can have a dedicated memory bus. That means that 2 cores can be FASTER than a single core because 2 cores can have twice the memory bandwidth. Its not necessarily about clock cycles but bandwidth.
More cores also can improve the desktop experience. because flash is currently stuck on a single CPU, your system cant get totally rocked by flash. If you have a really intense process, you could set the CPU affinity to 5 of the 6 cores to make sure you had one left over. The process scheduler should try to do this on its own but its not always successfull.
I run some VM hosts on dual quad cores and processor affinity for virtual machines is very important. On one box I have CPUs that are 2 dual cores on a single CPU and then 2 sockets. A VM on CPU1&2 (numerically 1-8, not 0-7) is ideal. A VM on 2&3 is slow and a VM on 4&5 is rediculously slow because you go from a signle chip, to two chips, then to a core on seperate sockets. This is an extreme case but it is true on the desktop as much as it is anywhere else.
Any high transaction count server that runs multiple processes or threads will make very good use of 6 cores as these process tend to not jump between cores ( or shouldnt if the programmers are concerned about performance)
24 x86 cores just doesn't compare to 1 Fermi with 512 striped down vector processors
CPU hogging programs can be a real pain, especially on a multi-user system. I know of businesses that run keyboard polling applications on a Terminal Server, for example. Each instance easily pin a core at 100%.
If you ever run into a problem like this (on Windows), check out ThreadMaster
Or a game boy or something like that. Of course you would have to wear asbestos gloves to play with the thing... :)
It's a shame they didn't make a mid-level version with no graphics core, a la Core i7 860. As a crypto/security guy, I'd like to try out PCLMULQDQ, the AES instructions and maybe the IOMMU. But if I'm going to get a fancy new computer, I might as well put a decent graphics card in it, at which point their on-die graphics card is simply a waste of space, power, money and latency. And no, I'm not dropping $1k for a 6-core Gulftown.
I hereby place the above post in the public domain.
Well... take a peek @ your Taskmgr.exe (with the processes tab open & the threads column selected + viewable)...
You'll probably see pretty much as I do here, and, as I have for YEARS now no less: That most of the processes running on your system now have 2-N threads running PER PROCESS already
No they have 2-N threads per process, afaict there is no indication in task manager how many of them are running/runable.
Most threads spend most of their time blocked waiting for some kind of event, while occasionally doing some quick action in response and then blocking to wait for an event again. The number of tasks doing serious processing work is usually in the range 0-1 unless you either start multiple intensive tasks at once or you have software that is specifically designed to take advantage of multiple processors/cores.
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
I'm running 2.6.32 with the fabulous new scheduler and all those "low latency" settings painstakingly compiled in. Nevertheless, I'm currently making some DVD isos and while mkisofs is running, firefox has to think for five or six seconds before loading every page. Somehow I doubt that having two extra cores is going to help me here...
"which would have been 3 sets of dual cores."
The have been previous intel hex-core processors which were three dual-core dies in one package.
This is one die though it does seem to have two seperate L3 caches. feeding off a central memory controller and queue (with the two QPI interfaces on the other end of the L3 caches).
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
It was announced in Oct. It has a single shared L3 cache. Anon Intel fanboy? Whatever for? Embarrassed?
The cesspool just got a check and balance.
I think you misunderstand me, most threads aren't blocked because they need access to data. They are blocking because they are waiting for something to happen (the user doing something, a network packet coming in, a timer tick, a soundcard asking for more audio data etc). Just look in task manager and see all those 0% entries in the CPU column.
Desktops are rarely doing more than one intensive job at a time and even more rarely doing more than two. This means that unless your apps can break an individual job into multiple threads (and as I said above the number of threads shown in task manager is no indication of whether an app will do this) or you run a lot of background batch tasks adding cores (particularly beyond the first two) has little benefit.
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
watch castle
I'm not the world's leading expert on processor design, but from what I can tell from the schematics and die pictures I've seen of the Westmere 6-Core CPU's, it only *looks* like they stuck 2 3-core processors on the same die. Everything I have read about Nehalem (and now this current die-shrink) has said how modular and scalable it has been designed to be; Intel saw the limitations of their early dual-core designs in that they *weren't* particularly modular or scalable, and I can't imagine that they would repeat that mistake.
That said, from what I can tell, they laid it out like they did (so that it looks like three cores on one side and three on the other) in order to minimize the maximum distance(s) between certain key components (they stuck the queue and most of the uncore in the middle). It is my assumption that although there appears to be two L3 caches on the die, that any core can directly access any of the cache. I base this mostly on my understanding of how the QPI system works on a DP platform -- one link is for communicating with the Northbridge (Current DP NB is tylersburg 5520 while SP boards usually use X58) while the other is for direct communication with the other processor --- I would assume that if processor 0 can get data directly from the cache on processor 1, that any core on processor 1 can access any data on it's internal shared cache.
It will be interesting to see if this is borne out in the benchmarks. I'm very curious regarding these details. It'd be very nice if the #1 CPU maker was on the same playing field as the current tech of the day. We would all benefit from that situation.
The cesspool just got a check and balance.