Intel Discloses Detailed Skylake Architecture Enhancements
MojoKid writes: Intel is still keeping a number of details regarding its complete Skylake microarchitecture and product line-up under wraps for a few more weeks, but at a public session at IDF, some of the design updates introduced with Skylake were detailed. Virtually every aspect of Skylake has been improved versus the previous-gen Haswell microarchitecture. I/O, Ring Bus, and LLC throughput has been increased, the graphics architecture has been updated to support DX12 and new eDRAM configurations, it has an integrated camera ISP, support for faster DDR4 memory, and more flexible overclocking features. All of these things culminate in a processor that offers higher IPC performance and improved power efficiency. There are also new security technologies dubbed Intel Software Guard Extensions (Intel SGX) onboard Skylake, which support new instructions to create and isolate enclaves from malware and privileged software attack, along with Memory Protection Extensions (Intel MPX) to help protect stack and heap buffer boundaries as well. A new technology, dubbed Intel Speed Shift, also allows Skylake to switch power states faster than previous-gen products, controlling P states fully in hardware, whereas previous-gen products required OS control. The end result is that Skylake can switch P states in 1ms, whereas it takes roughly 30ms with older processors.
I recently bought a socket 2011-v3 motherboard and put a "low-end" 6-core CPU on it. I wonder if this Skylake can be considered an upgrade path..
They've been talking about this since 1978 and here we are no closer.
"Win treats sysadmins better than users. Mac treats users better than sysadmins. Linux treats everyone like sysadmins."
Sounds like the last few generations - lots of incremental improvements and excellent technology but wont amount to much of a difference in general performance.
Sure looks like it
From the errata:
Executing CPUID with EAX = 7 and ECX = 0 may return EBX with bits [3] and [8] set, incorrectly indicating the presence of BMI1 and BMI2 instruction set extensions.
Attempting to use instructions from the BMI1 or BMI2 instruction set extensions will result in a #UD exception.
and in the errata summary, its currently labeled NO FIX so they dont even have a fix that will trap the exception and emulate the instructions (which would perform terribly anyways... but hey, working is better than not working.)
"His name was James Damore."
Intel's pricing (and refusal to offer 6-core mainstream parts) is a consequence of Intel's effective MONOPOLY in the x86 space. AMD's current CPU offerings are a BAD JOKE, offering around 50% per core of Intel's core performance. No serious PC gamer would opt for anything less than a true 4-core i5. AMD isn't even in the picture.
So why did I pay less for my i7 a couple of years ago than I did when I bought my Pentium-4 back in the days when AMD was actually competitive?
Intel's current competition is ARM, not AMD.
HOWEVER, what gamers want is a decent priced (sub 200 dollar) mainstream i5 with SIX true cores.
6 isn't enough of a jump over 4...
Give me 8 true cores and 16 threads, remove the IGP which I don't need for such a CPU...
Yes, yes, I know, Xeon and Haswell-E, but the reality is that the "need" for 8 core chips won't really happen until more of them hit the desktop market, and what AMD sells as 8 core doesn't count.
My AMD system works just fine for games. Not sure what you're doing wrong.
Unless you're just trolling with the "no true Scotsman" argument.
It's AMD not ARM. ARM is not competing in the desktop world. Hardly in the laptop either. Try cell phones and tablets.
If there was ever a true multi-threaded application AMD would take the prize. As such Intel dominates because of single threaded applications.
It's AMD not ARM. ARM is not competing in the desktop world.
Nor is AMD.
The difference is that ARM could compete there, and replace the 90% of desktop systems that don't need much CPU power.
If there was ever a true multi-threaded application AMD would take the prize. As such Intel dominates because of single threaded applications.
There are embarrassingly well-threaded applications where AMD does well. The x264 encoder does a fantastic job and hammers all 8 of the cores in my FX-8320 at >90% utilization, and it was cheerfully faster at that than the i5 3570K I used to keep around. But IPC does ultimately win out, and Haswell's AVX2 support is sufficient to let an i5 4690K generally pull out ahead of my FX. That's especially true on interlaced media, where the deinterlacer's essentially single-threaded and the rest of the chip's basically waiting for that single core to finish before tackling the rest of the workload. For most other uses it's somewhere around a Nehalem quad core: certainly fast enough for what I do, but the overall performance outside of niche applications isn't impressive in absolute terms. At least it took to undervolting well, and it's a friggin' behemoth for virtualization.
There are embarrassingly well-threaded applications where AMD does well. The x264 encoder does a fantastic job and hammers all 8 of the cores in my FX-8320 at >90% utilization, and it was cheerfully faster at that than the i5 3570K I used to keep around. But IPC does ultimately win out, and Haswell's AVX2 support is sufficient to let an i5 4690K generally pull out ahead of my FX.
Try that same task on a i7 4790k and see even more speed... that is exactly the task that hyperthreading is for.
If there was ever a true multi-threaded application AMD would take the prize. As such Intel dominates because of single threaded applications.
Actually, it still generally doesn't...
An Intel Core i7 generally is faster than a 8 core FX chip, even in really, really well threaded applications.
The Intel chip is SO MUCH better per core and with the help of hyperthreading, it still wins.
Of course, this is all a moot point, if you're serious about such work, you're at least on Haswell-E with 8 true cores and 16 threads, or you're on a Xeon and this isn't even a conversation worth having.
For pure speed, it is all Intel.
In fairness Piledriver did a tolerable job against Sandy Bridge... The problem is that Intel hasn't exactly stood still since 2012, and between three generations of minor but cumulative performance and power improvements and the platform updates that came with them, there's a huge difference even in the consumer market. AMD doesn't have an answer to Haswell-E, and Opterons have languished in the same three year old doldrums as their FX cousins. Zen will be a make or break proposition; they can't continue the way they have been.
Yep - nothing like extra logical threads to take maximum advantage of a high performance architecture.
Historically and hysterically stupid post.
Yes, yours is. But what about mine?
Why would Intel not be charging more for a current i7 than they used to charge for a Pentium-4, if they had no competition? AMD has nothing in that market, and, even in lower-end markets, they barely compete on performance and are vastly more power-hungry.
If Intel started charging $1000 for an i3, the market would be flooded with ARM desktops by Christmas.
I agree with all of what you've said...
AMD has a problem in that I think they are serving an ever smaller part of the market, the middle...
People either now want low power draw with "enough" performance, or they want "give me all you've got" performance, while not being totally stupid about the power.
Keep in mind that Haswell-E draws 30% less max power than the top AMD FX chip does, while crushing it in performance. Granted, it costs 5 times as much, but frankly if you're buying a machine for 3 years of use and you are using it to make money, the $1,000 cost of the chip is trivial.
Even the 6 core Haswell-E for $380 or so completely runs off from the best AMD has to offer, for not much more money.
It is a shame, I remember the days of Thunderbird from AMD, heck I remember the days of the 386DX-40, when you could get 486DX-25 performance for half the price. Those were days when a processor upgrade often doubled your performance, today it seems the gains are much smaller.
If AMD was pushing Intel, we might see $300 8 core Skylake chips for sale.
HOWEVER, what gamers want is a decent priced (sub 200 dollar) mainstream i5 with SIX true cores.
6 isn't enough of a jump over 4...
For most home / personal computing (including high end video games) diminishing returns kick in hard past 4 cores. The problem is that in the few cases where tasks can be easily subdivided so as to utilize more than 4 cores, the cores will normally be stuck waiting for memory updates which continues to lag (speed / throughput wise) behind processor compute ability at an increasingly large gap which spans orders of magnitude. Of course the only known way to speed DRAM is to utilize more power, which goes against the general IT development trends (greener computing, more capable mobile).
The processor to memory speed gap is one of the reasons why Intel is investing in novel memory technology (phase change memory, etc.). The recent XPoint memory announcement hinted at potential future usage as "page swap" memory, replacing virtual memory management swapping pages out to disk (mechnical or solid state).
I haven't read all of Intel's releases this week, but one area I'm interested in is seeing how eDRAM (embedded DRAM) aka Crystal Well technology is going to end up being available and utilized across the Skylake line. In memory intensive benchmarks eDRAM has already shown considerably improvement in memory constrained benchmarks in Broadwell mobile processors, wheere it acts as an additional level of cache.
Give me 8 true cores and 16 threads, remove the IGP which I don't need for such a CPU...
Most people don't utilize more than 2 cores for more than 25-33% of the time, so the market for consumer-oriented many core processors just isn't there. People who really need the performance already just buy a Xeon.
Intel's "hyper-threading technology" is one of the biggest disappointments in many years, I wish they would let the branding and feature set die in obscurity like it deserves (IMHO).
Yes, yes, I know, Xeon and Haswell-E, but the reality is that the "need" for 8 core chips won't really happen until more of them hit the desktop market, and what AMD sells as 8 core doesn't count.
Well 8-16 core processors have been around for what, a bit less than a decade now? They won't really happen in the consumer / desktop market, because the market isn't demanding it (with purchasing dollars, not wishful thinking). Look at the very modest take-up of the Haswell-E X99 (LGA2011) 6 to 8 core processors released last year (August-Sept 2014 IIRC).
I love fast computers, personally I have a 6 core i7-5930K, and the performance difference for most home/consumer applications is so trivial that I don't notice a difference over using a 4 core i7-4790K except for in parallel benchmarks.
"Limited Liability Corporation" and "Internet Service Provider" don't make much sense, but then again I'm pretty far behind the times on CPU architecture. Who knows what coprocessors they're spending their insane transistor budgets on these days.
OK, "ISP" appears to mean "Image Signal Processor". "LLC" could mean "Last Level Cache" or "Logical Link Control". "Last Level Cache" makes more sense in context, though this is the first time I've seen that phrase. Usually cache levels are explicitly numbered (first, second, third, etc).
It looks like they spelled out everything else except "IPC" which is obviously(?) "Instructions Per Cycle".
Good job there, author, submitter, and editor!
I love fast computers, personally I have a 6 core i7-5930K, and the performance difference for most home/consumer applications is so trivial that I don't notice a difference over using a 4 core i7-4790K except for in parallel benchmarks.
Of course you don't, because you're at the bleeding edge...
The same was true when the Q6600 (Quad Conroe) launched, it simply didn't make a difference over the E6600 (Dual Conroe) due to the programs we were running at the time.
Windows 10 has much more potential than Windows XP did. Between OneDrive keeping it all synced, Cortana always listening, and the 20 other things running in the background, a core or 2 is busy quite often.
I have many machines used for production and testing, and I can now tell the difference between a dual core and quad core for general application use. In fact, the Q6600 that I have for older machine testing is actually faster in many cases than more modern dual cores, due to having 4 true cores. This doesn't mean it gets work done faster, rather it is SNAPPIER, which is what a lot of people relate a computer to being faster.
There is little done on a desktop that needs super high end single core performance, other than some games. Rather what is needed is to always have a spare core so the responsivness of the machine is instant. When I say "hey Cortana", I expect an instant response. A half second lag is noticeable.
I haven't read all of Intel's releases this week, but one area I'm interested in is seeing how eDRAM (embedded DRAM) aka Crystal Well technology is going to end up being available and utilized across the Skylake line. In memory intensive benchmarks eDRAM has already shown considerably improvement in memory constrained benchmarks in Broadwell mobile processors, wheere it acts as an additional level of cache.
The lag in DDR memory hasn't improved in a long time. The speed is faster, once you get a transfer going, modern DDR3/4 memory is rocket fast. The challenge is that the speed to the first byte hasn't improved for a decade, largely due to the latency and distance of the memory from the CPU, among other things.
eDRAM is a welcome change and frankly I expect to see more such improvements in the future as they run into various walls of physics, ranging from the size of atoms to the speed of light.
Something that hasn't changed in a long time is the overall design of the machine. Conroe was a throwback in many ways to the Pentium III, it was Intel admiting that Netburst was a mistake, without them saying it of course.
Nehalem was a change for the better and you can see it in the improved memory throughput and better design, but it is time for the next move. Skylake, for all its improvements, is just a beefed up Nehalem CPU in many ways.
Windows has changed a lot in the past 10 years. I have looked at what Intel talked about at the IDF and they see a future that I tend to agree with. The Internet of Things will actually cause a need for more CPU power, not less, due to having a hundred things connected. When your main computer is remotely talking to 10 things at a time, you're going to want more cores.
Something that I think Intel should look at is a Big/Little configuration. This will take support from Microsoft, it has to be baked into Windows so Windows knows what KIND of cores you have, but frankly having 4 super power cores and maybe 4 low power cores makes more sense for the use of silicon and battery life.
Not all these CPUs go into a desktop, I'd like a notebook with a 10 hour battery life (because it really isn't 10 hours when you use it and it is 2 years old).
Of course, while we're at it, can we get the weight down? My main laptop is a nice Acer 17" machine that is too heavy to really be called "portable". But it is nice with a true quad i7, 1080p IPS display, dual drives (SSD and HDD), and backlit keyboard (please, can we have all devices have that now?).
I just wish it was half the weight it is now. :(
for /.ers to lose their Douglas Adams references?
"Win treats sysadmins better than users. Mac treats users better than sysadmins. Linux treats everyone like sysadmins."