AMD's Next-Gen Steamroller CPU Could Deliver Where Bulldozer Fell Short
MojoKid writes "Today at the Hot Chips Symposium, AMD CTO Mark Papermaster is taking the wraps off the company's upcoming CPU core, codenamed Steamroller. Steamroller is the third iteration of AMD's Bulldozer architecture and an extremely important part for AMD. Bulldozer, which launched just over a year ago, was a disappointment. The company's second-generation Bulldozer implementation, codenamed Piledriver, offered a number of key changes and was incorporated into the Trinity APU family that debuted last spring. Steamroller is the first refresh of Bulldozer's underlying architecture and may finally deliver the sort of performance and efficiency AMD was aiming for when it built Bulldozer in the first place. Enhancements to Fetch and Decode architecture have been made, as well as increased scheduler efficiency and cache load latency, which combined could bring a claimed 15 percent performance-per-watt performance gain. AMD expects to ship Steamroller sometime in 2013 but wouldn't offer timing detail beyond that."
They all sound like sexual positions.
Things like hitting the 1GHz mark first, and making a workable 64bit chip that also speaks x86 only get you so far. AMD needs to come up with something cool, else they're doomed to play catch-up.
I want to delete my account but Slashdot doesn't allow it.
I think 15% would put them around even with Intel. That means it's a toss up except, oh wait, their boards are ungodly expensive. I really don't know why, probably chipset manufacturing cost or something. A really nice MSI B75 board with all metal caps is $65 and my usual H77MA-G43 board is a mere $90. All the AMD ones i looked at that had the features I wanted (basically same ones as those last two I mentioned) are all $100 and usually well over a hundred. Just to get SATA III at all was terribly expensive on any AMD board with any chipset. So unless it has an onboard 1833MHz memory controller, that's the end of that. But who knows, boards might come out super cheap for the new chips.
It is supposed to come out when Haswell does next year. Who will buy it? No mention of AVX2 either.
AMD boards have better PCI-E lanes then intel chips.
With Intel you need to go high end to get more then 16 lanes + DMI
Intel's integrated GPUs are now "good enough" for most people. Those who game won't want integrated AMD if integrated intel isn't good enough...
I run: Windows, OS X, Linux, FreeBSD. Just because you have a hammer, doesn't mean everything is a nail.
AMD may be getting its shit together when in regards to chip design. but I'm still going Intel on my next PC because of their superior Linux drivers. At the moment I'm an unhappy owner of a laptop with a AMD graphics card that can't do anything because the drivers are useless. I'm looking forward to a new laptop with an Intel Ivy Bridge processor (I don't think I can wait Haskell).
The thing is, the low end don't care for masses of PCI lanes. They run integrated video. The high end want a fast CPU as well.
I run: Windows, OS X, Linux, FreeBSD. Just because you have a hammer, doesn't mean everything is a nail.
It's an interesting story but I think in general Americans are tired of what happens in the Middle East, and some idiot protester getting herself killed garners as much sympathy as those idiot pot-smoking hikers who got caught trespassing. If somebody dies doing something dumb, they tend to get a yawn in response.
The best policy for this country is to stay out of other affairs and work on our own problems. That's why support for the war, occupying Afghanistan, and the Israel/Palestine thing are so low on the radar for most Americans.
I wish we had world peace, but to be honest I'm more concerned about fixing the problems in my backyard and not half the world away. Plus those Israeli/Palestine conflict will never end and both sides do some heinous stuff, so it's a waste to care too much about it. Both sides are monsters, there can't be a winner. It's like having Hitler wrestle Stalin, hard to root for either one.
I think AMD's work here will provide some great evolutionary speedups that will be significant to many people. Unfortunately for them, at the same time AMD is bringing out these small "free lunch" general improvements, Intel will be bringing out Haswell -- which in addition to such evolutionary improvements has some really fantastic, significant new features that'll provide remarkable performance boosts.
These are all pretty specialized features, yes, but they service some very high-profile benchmark areas: video processing and concurrency are always on the list, and AMD will get absolutely crushed when apps start taking advantage of it.
I'm a developer, a major optimization geek both micro- and macro-. I thrive playing with instruction latencies, execution units, and cache usage until my code eeks out as much performance as possible. Of course we'll never know until the CPUs are released for everyone to play with, but right now my money is on Intel.
AMD is in serious trouble here. I hope I'm wrong.
Is that new resonant clock mesh technology still planned to launch with this new series? I remember reading they were planning to break the 4GHz barrier?
Having dedicated decoders for the IPU is definitely on the right rrack, but a shared fetch is still means there is a bottleneck in getting those cores fed.Also, apparently, the changes hit the L1 performance, so they had to add some cache to make up for it. So, there is some room for improvement, and this does help, However, I just don't see it as the big step that AMD needs against Intel. Intel's dies are smaller, they are making better use of space, and this is a huge advantage. Intel has 10 core dies, and has room to go to 12. In the server space, this is a big gain in density.
Also, they have a lead in hyperthreading, which does seem to boost multithreaded performance enough in highly parallel server scenarios to hold up against the chip multithreading concept that Bulldozer supports.
What is strange is that AMD need to really get a vision of a unified platform. AMD could really have a great ultrabook and media PC platform, but against, they let Intel take a lead in this area.
Unicorns *could* pop into existence.
Cheap flying cars *could* be invented.
Gravity *could* cease to exist.
The dead *could* come back to life.
Etc., etc., etc.
Lots of things *could* happen. It doesn't mean that they will happen or even that they are remotely possible. "Could" in a headline is a cheap way to make you think something is likely to happen or is just around the corner. It isn't.
Steamroller
That is like asking "What colour do you want this database in?" OpenCL doesn't do anything for transcoding video, it is an API for talking to graphics cards. Now, GPUs can be used for video transcoding, of course, using OpenCL or other APIs (CUDA, DirectCompute). However how well they do depends on what the graphics card is. An AMD 7970, that'll throw down some serious performance. An ATi 3400 doesn't even have driver support for OpenCL and if it did would be very slow.
So there isn't going to be any way to directly compare speeds because the speed of something using OpenCL will depend on the GPU that runs underneath it. On some systems that may be very fast on others it may not be available.
Also along those lines a difference from the user standpoint is that it is integrated in the CPU. You don't have to go buy a GPU, the CPU itself just does it. This is something that is rather nice, and hopefully some day we reach the point where CPUs are powerful enough that we don't need GPUs. The more a CPU can handle well by itself, the less people that have a need for an addon GPU.
Sorry but lots of PCIe lanes are just not the kind of thing that matters to non-high end users or people who focus on stats rather than real world performance. To even have a situation on a desktop board where it could theoretically matter you have to have multiple graphics cards. The 1x slots hang off the southbridge and have their own bandwidth separate from the lanes on the CPU for the video card. So if you stick on two GPUs then yes, you don't have enough to give them both 16 lanes.
However it turns out to not matter. We have more bandwidth than we need with PCIe, particularly now with 3.0. You test a card in 16x vs one in 8x and you find no difference in performance. So it just doesn't matter even if you have multiple GPUs.
Of course multiple GPUs are rather a high end proposition. Many people don't even bother with a GPU at all, they just use the onboard graphics which these days are surprisingly good (I've played with the integrated graphics on my new Ivy Bridge laptop since it has switchable graphics and for many games, you don't need anything more). Even those that do choose to have addon GPUs, most choose just one. I've got a GTX 680 in my desktop and there is just no need for anything more, it handles all games superbly. I'd have to move up to multiple surround monitors or something before I'd start needing more than one GPU.
So it is a situation where you only end up needing more lanes in a high end environment, thus I don't see the big deal in not having them. It is the kind of thing you won't notice.
What kind of workload needs more than 16 PCIe lanes, but doesn't similarly need a higher-end processor?
DATABASE WOW WOW
Will it be faster than the Phenom II 985?
I would like to stay with AMD because they don't usually limit the features to the high-end or server CPUs. For example, I can use ECC on my workstation. Performance matters, though, and it doesn't seem great unless they do something like 20 cores for a reasonable price.
Interesting as this is the Finnish word for friend. Finnish cooperation here ?
Personally I thought the whole idea was retarded except for the mobile chips like Brazos, on the desktop the idea was completely stupid and on the server even more so. For those that don't know the original plan was to go "Full APU" and have the GPU take the place of the FP on chip, which would be a much simpler and weaker design than in years past thus freeing up more TDP for more cores. Why is this dumb? Well what if you want to use the GPU AND do some floating point heavy task? Or what if you don't want the integrated GPU because you can't OC worth a crap with the GPU built in?
All correct, but I could live with those aspects. I usually don't OC, and if I know I want the GPU AND do some floating point heavy task, I could get an additional discrete GPU. There is, however, a worse one:
Memory bandwidth congestion. A typical lower midrange graphics card with 128 bit data bus and GDDR3 is significantly slower than the same model with GDDR5. In an APU, the GPU part has to share the even lower bandwidth of the DDR3 main memory with the CPU part.
When the LLano was new, Anandtech published a preview:
http://www.anandtech.com/show/4448/amd-llano-desktop-performance-preview
It shows some comparisons to discrete graphics cards, including the HD 5570 which represents the lower midrange graphics card w/128bit mentioned above.
In gaming frame rates, the HD 5570 beats the LLano even when it runs on DDR3-1866 RAM, which was not a JEDEC standard at the time. With standard DDR3, the difference gets bigger. Which shows clearly the LLano is limited by memory bandwidth and really could use four-channel memory as in Intel's socket 2011.
With bigger and faster GPUs, the bandwidth demands will only grow, and for a Bulldozer APU with matching GPU part even four-channel memory may be insufficient..
C - the footgun of programming languages
I had a steamroller in high school. That thing kicked your ass.
Lets get this over with... Fuck Off
I am guessing those who want to do more GPGPU type stuff, so if you get a supported video card or multiple video cards, then you can potentially get some great performance. Of course, if running dual-GPU stuff is what you want, you should NOT be bothering with a cheap motherboard.
Wow...ouch. Thanks for the heads up, I knew they were having bandwidth problems (the net is filled with advice and tutorials on getting the absolute fastest memory your Brazos or Liano will take because it seriously boosts performance, which I can attest to when I swapped the stock 1066 for 1333 in my Brazos netbook) but I hadn't thought ahead to what it would take to feed an 8 core AND a GPU on the same die...ouch.
But that just proves my point though, that in any place other than mobile situations it is a DUMB idea. It works in mobile because battery life is king and by being able to shut down a good portion of the GPU and FP when you are just doing basic web surfing you cut down on both power and heat, while still having the ability to activate the GPU when you wanna watch that video on the bus or use your netbook over HDMI as a media tank.
The problem is what works in mobile does NOT work in desktop and server, its different use cases. And in desktop and server roles as the link you provided shows (thanks again, good read) the bandwidth problem takes an otherwise powerful unit and gimps the living hell out of it. Since we know they will have to have even more powerful parts to compete with Intel that means by using the GPU as the FP the bandwidth is just gonna get worse, as now you have both the more powerful GPU AND a multicore CPU both fighting over a seriously limited pipe...ouch again.
Yeah...no bones about it the APU design for anything other than laptops and basic office boxes is a BAD design.
ACs don't waste your time replying, your posts are never seen by me.
Please reply to something related to what I've written instead of just attaching some stuff about the low end servers I was not writing about. The words "massively parallel" should have been a clue. The cost difference IS that large in the sector I was writing about. There is nothing in my words above that even hints at anything outside of that niche.
The above costs are from quotes for complete servers, so the cost difference is as described. Do I have to put things in bold with illustrations in this place?
ASSDOZER! (from the movie "Idiocracy")