AMD Considered GDDR5 For Kaveri, Might Release Eight-Core Variant
MojoKid writes "Of all the rumors that swirled around Kaveri before the APU debuted last week, one of the more interesting bits was that AMD might debut GDDR5 as a desktop option. GDDR5 isn't bonded in sticks for easy motherboard socketing, and motherboard OEMs were unlikely to be interested in paying to solder 4-8GB of RAM directly. Such a move would shift the RMA responsibilities for RAM failures back to the board manufacturer. It seemed unlikely that Sunnyvale would consider such an option but a deep dive into Kaveri's technical documentation shows that AMD did indeed consider a quad-channel GDDR5 interface. Future versions of the Kaveri APU could potentially also implement 2x 64-bit DDR3 channels alongside 2x 32-bit GDDR5 channels, with the latter serving as a framebuffer for graphics operations. The other document making the rounds is AMD's software optimization guide for Family 15h processors. This guide specifically shows an eight-core Kaveri-based variant attached to a multi-socket system. In fact, the guide goes so far as to say that these chips in particular contain five links for connection to I/O and other processors, whereas the older Family 15h chips (Bulldozer and Piledriver) only offer four Hypertransport links."
DDR3 is low latency, low bandwidth. GDDR5 is high latency, high bandwidth. Low latency is critical for CPU performance while bandwidth doesn't matter as much. On video cards, GPUs need high bandwidth but the latency doesn't matter as much. This is why gaming PCs use DDR3 for system RAM and GDDR5 on their video cards. Video cards that cut costs by using DDR3 instead of GDDR5 take a massive hit in performance. The XBox One and PS4 use GDDR5 shared between the CPU and GPU, and as a result have the rough equivalent of a very low-end CPU paired with a mid-range GPU.
Great! Can you explain to me why "GDDR5 isn't bonded in sticks for easy motherboard socketing" ?
Everything I write is lies, read between the lines.
The whole point of AMD APUs is low cost gaming. That is, lower cost than buying a dedicated GPU plus a processor. Many already argue that you don't save much by buying an APU. A cheap Pentium G3220 with a AMD Radeon 7730 costs the same as the A10 Kaveri APU, and will give better frame rate. Even if the Kaveri APU prices come down, the savings will be small. If you have to buy the GDDR5 memory, there won't be any savings. It's understandable that AMD didn't take that route.
No, they don't do it because it considerably raises the cost of the chip and it doesn't help improve the "average" user's workload. Many core processors have a bunch of inherent complexity dealing with sharing information between the cores or sharing access to a common bus where the information can be transferred between processes. There are tradeoffs to improving either scenario.
Making the interconnection between cores stronger means that you have a high transistor count and many layers in silicon. Even if you do this, the scenarios in which the cores need to intercommunicate means that either the processes are being dispatched to the processor and the processor has a scheduling algorithm built in (which has its own issues), or you have a new set of instructions allowing software to dispatch code from cpu to cpu. Then race conditions and all sorts of other nonsense are inherently in the CPU itself and you have a whole bunch of problems trying to write code for the mess.
Even if you go the above route, you still have to get the data you process out of the CPU and into RAM. Then you have a bus contention problem: how can multiple cores access different sections of RAM simultaneously? Is the CPU responsible for settling a conflict when two different cores want to write to the same section of RAM at the same time? These issues are largely easily settled with only 2, 3, or 4 cores now (and currently the responsibility for preventing these scenarios is on the OS not the CPU), but they would explode with a 24 core (or more).
It's currently possible to build a CPU with several hundred cores inside (and both Intel and AMD have done it). No one has settled the issues above to make it practical, or invented the new software paradigm to make it easy to fix them.
I like to use Blender to compose my videos.
Help stamp out iliturcy.
Anandtech's writeup (which Hothardware seems to be ripping off) has a much better explanation of what's going on and why it matters.
It's also worth noting that the Anandtech article implies that AMD is still on the fence on Kaveri APUs with more memory bandwidth, and that it may be something they do if there's enough interest/feedback about it.
Motherboard manufacturers see the profit margin's Apple has with RAM that can't be increased in a number of their high-end models and they want in on that action.
Sleep your way to a whiter smile...date a dentist!
They don't care because a desktop with a 24 core AMD CPU is likely to be slower than a 4 core Intel CPU for most popular _desktop_ tasks which are mostly single threaded. They're already having problems competing with 8 core CPUs, adding more cores would either make their chips too expensive (too big) or too slow (dumber small cores).
Sad truth is for those who don't need the speed a cheap AMD is enough - they don't need the expensive ones. Those who want the speed pay more for Intel's faster stuff. The 8350 is about AMD's fastest desktop CPU for people who'd rather not use 220W TDP CPUs, and it already struggles to be ahead of Intel's mid range for desktop tasks: http://www.anandtech.com/bench/product/697?vs=702
A few of us might regularly compress/encode video or use 7zip to compress lots of stuff. But video compression can often be accelerated by GPUs (and Intel has QuickSync but quality might be an issue depending on implementation). The rest of the desktop stuff that people care about spending $$$ to make faster would be faster on an Intel CPU.
A server with 24 cores will be a better investment than a desktop with 24 cores.
Because it's electrically so delicate that you can't keep bit sync when shoving such high frequencies through a slot connector. The price of higher bandwidth, in both the analog and digital senses.
Fine. You do it your way. I want this technological achievement so I can do it mine.
Help stamp out iliturcy.
Your problem - and Intel and AMD have this problem also - is thinking the specifics of the device have something to do with its utility. It doesn't. What matters is how the thing enables people to do what they want to do, and we passed that level of utility in devices a decade ago. I can be telepresent with my children without resorting to either Intel or AMD technologies - all day. Compared to exclusively special uses of spreadsheets and slideshow programs, compatibility differentials of Office suites, that is a win.
It's about the people, not the thing. Wrap your head around that. Make it your religion.
Help stamp out iliturcy.
Sure can. You see currently all GDDR5 is sold to GPU OEMs and they haven't used sockets on graphics card since the mid 90s.
As for TFA they sold some HD43xx boards with a small amount of dedicated RAM and since AMD makes their own board chipsets I see no reason why they couldn't sell a board or two with dedicated GDDR5 of 512Mb to say 2Gb. Lets face it guys you can only go so far with an APU before it would make more sense to have a dedicated GPU instead but with AMD having the same chips that are in the new consoles this might be a good option for a budget gaming rig.
The key will be if they can get the GDDR5 at a price that will still let the board make sense versus just buying say an HD7750 card, but if they could get the boards made so a kit would cost around $250-$270 (current their quad APU kits sell for around $200 sans HDD) this could be a really compelling option for those on a budget that want to game.
ACs don't waste your time replying, your posts are never seen by me.
But the "dirty little secret" that doesn't get brought up enough frankly is a lot of those "single threaded loads" is as rigged as quack.exe was back in the day thanks to every Intel compiler made since 2002 being made to put out crippled code for any chip that Intel doesn't want to push. Oh and for those that use the "Intel just knows their own chips and optimizes for them" excuse that lie has been disproved and the proof was the last gen Pentium 3. You see the last gen P3 was curbstomping the Netburst P4s in early benchmarks, yet when the cripple compiler comes out? Suddenly the very same Netburst chips are winning by 30%!
And the bitch is that any of these so called review sites could test for rigging trivially but they won't for fear of losing Intel advertising revenue. To see if a program is rigged all one has to do is run the code on a Via CPU, Via CPUs allow one to softmod the CPUID so if you change the CPUID from "Centaur Hauls" to "Genuine Intel" and suddenly the chip scores 20%-30%+ higher on the test? Then the program has been rigged by ICC, simple as that.
All of these sites like Anandtech and Tom's have more than enough money to pick up a Via chip and keep it around for testing but they won't bite the hand that feeds so everyone should consider their tests to be tainted and as worthless as Quake tests were with the rigged drivers. It would be nice if someone would run some real tests so we could see real numbers, wouldn't be hard to do as GCC is free and there are plenty of FOSS programs like Firefox one could compile with GCC to give accurate tests but so far no review site will do this for fear of pissing off Intel. How they didn't get busted for antitrust is beyond me, this is every bit as bad as "Windows isn't done until lotus won't run" but they were allowed to bribe AMD to the tune of 2 billion to drop the lawsuit and with it the investigation.
ACs don't waste your time replying, your posts are never seen by me.
The rest of the desktop stuff that people care about spending $$$ to make faster would be faster on an Intel CPU.
You mean something like this? (Or similar solutions for other languages?)
Ezekiel 23:20
Great! Can you explain to me why "GDDR5 isn't bonded in sticks for easy motherboard socketing" ?
Reason is that they are used exclusively for the video cards/GPUs, and are not meant to be accessed directly by the CPU. In cases of integrated video on motherboards, they're not used in the first place. In case of video cards, they are soldered right on the video cards - video cards don't have slots b'cos then, you'll have video cards going into PCIe slots, and then the cards would have slots of their own, and then the height of the GDDR5 modules would potentially eliminate other motherboard slots that may be needed. In other words, configuration complications.
Short answer: GDDR5 is not meant to be socketed on to motherboards
CPUs - the last that I saw - were PGAs - except for embedded systems, I'm not sure of many BGA based CPUs. DDR3 onwards changed its packaging from TSOP to BGA due to excess pin count, where on TSOP, only a larger package would work, but then, the length of the wire bonds would become a factor in the speed of the sub-system (CPU to DDR3). Also, while TSOP is cheaper than BGA for lower pin counts, when the pin counts become comparable - ~50, the equation flips - BGAs become cheaper than 56 pin TSOPs. As a result, memory started getting packaged in BGA, which were already in use by flash memory, especially in portable applications such as cell phones, PDAs and so on.
But even those DDR3 modules are available on sticks, even in their BGA packages.