Slashdot Mirror


AMD Details High Bandwidth Memory (HBM) DRAM, Pushes Over 100GB/s Per Stack

MojoKid writes: Recently, a few details of AMD's next-generation Radeon 300-series graphics cards have trickled out. Today, AMD has publicly disclosed new info regarding their High Bandwidth Memory (HBM) technology that will be used on some Radeon 300-series and APU products. Currently, a relatively large number of GDDR5 chips are necessary to offer sufficient capacity and bandwidth for modern GPUs, which means significant PCB real estate is consumed. On-chip integration is not ideal for DRAM because it is not size or cost effective with a logic-optimized GPU or CPU manufacturing process. HBM, however, brings the DRAM as close to possible to the logic die (GPU) as possible. AMD partnered with Hynix and a number of companies to help define the HBM specification and design a new type of memory chip with low power consumption and an ultra-wide bus width, which was eventually adopted by JEDEC 2013. They also develop a DRAM interconnect called an "interposer," along with ASE, Amkor, and UMC. The interposer allows DRAM to be brought into close proximity with the GPU and simplifies communication and clocking. HBM DRAM chips are stacked vertically, and "through-silicon vias" (TSVs) and "bumps" are used to connect one DRAM chip to the next, and then to a logic interface die, and ultimately the interposer. The end result is a single package on which the GPU/SoC and High Bandwidth Memory both reside. 1GB of GDDR5 memory (four 256MB chips), requires roughly 672mm2. Because HBM is vertically stacked, that same 1GB requires only about 35mm2. The bus width on an HBM chip is 1024-bits wide, versus 32-bits on a GDDR5 chip. As a result, the High Bandwidth Memory interface can be clocked much lower but still offer more than 100GB/s for HBM versus 25GB/s with GDDR5. HBM also requires significantly less voltage, which equates to lower power consumption.

98 comments

  1. Security implications by sinij · · Score: 0

    This will make password cracking that much easier. My understanding is that space-time tradeoff drastically changes with increased bandwidth for RAM. I am not smart enough to derive a formal proof, but I think a number of key-stretching functions just bit the dust.

    1. Re:Security implications by Anonymous Coward · · Score: 0

      Then they never should have been used. Any type of encryption should be viewed through the lens of at least several orders of magnitude of processing speed when being considered as safe or not.

    2. Re:Security implications by Anonymous Coward · · Score: 1

      They just went from 330GB/s on the 290X to ~500GB/s. This will do nothing at all to security. Also we still have no idea how memory latency is impacted (shorter paths, but also lower clocks). If they can scale down to APUs and lower cost GPUs then there is some really great potential.

    3. Re:Security implications by jellomizer · · Score: 1

      Moore's law cover this security concern. Expect computations to keep doubling this your key that takes a year to crack will take a few hours in the course of a few days in the next decade.

      --
      If something is so important that you feel the need to post it on the internet... It probably isn't that important.
    4. Re:Security implications by Anonymous Coward · · Score: 0

      Have you ever implemented an actual encryption system? You do not have infinite CPU speed, ya know. At one point or another you have to make a tradeoff.

    5. Re:Security implications by Anonymous Coward · · Score: 0

      This will make password cracking that much easier. My understanding is that space-time tradeoff drastically changes with increased bandwidth for RAM. I am not smart enough to derive a formal proof, but I think a number of key-stretching functions just bit the dust.

      Exponents provide actual protection.

      Multiplication makes people who should know better feel better.

    6. Re:Security implications by suutar · · Score: 1

      I'm not sure how increased bandwidth could make a difference beyond sheer number of tries per second. On that scale, it's (very) roughly a factor of 2 difference, so add a bit and move on, no?

    7. Re:Security implications by Anonymous Coward · · Score: 0

      It's not like Bitcoin mining hardware hasn't already made passwords effectively useless.

    8. Re:Security implications by r.freeman · · Score: 1

      It's not like Bitcoin mining hardware hasn't already made passwords effectively useless.

      What a retarded statement. Even all bitcoin hardware in the world can not break one (real) password,

      nor any password that is hashed also with something other/more then sha256.

    9. Re:Security implications by Anonymous Coward · · Score: 0

      Nope.
      Unless your password is a weak piece of crap.

      It will still take life-ending numbers to crack any reasonably good password or encryption system.
      Even with moores law unrealistically continuing for this whole century, it will still not dent anything good.

      However, with increasing machine intelligence, it could theoretically lead to better dictionary attacks, or even machines capable of, or attempting to, emulate your thinking patterns based on things it knows about you in order to figure out the sort of password system you have.
      Good thing my algorithmic password stack was decided on several coin tosses.

    10. Re:Security implications by Anonymous Coward · · Score: 0

      If you're referring to rainbow tables, those only work if you don't salt your passwords.

      As long as use you techniques which require exponential increases in work factor, you're golden for the foreseeable future. Notably password hashes like PBKDF2, bcrypt, scrypt, etc, are linear work factor increases, and they increase your cost just as much as an attacker. That means you constantly need to increases the work factor parameter as hardware evolves. I don't bother with them. My time is better spent replacing password-based systems than trying to put lipstick on a pig.

      scrypt, BTW, is supposed to be unique is that it's more memory intensive than other methods. So, yes, this kind of thing dings scrypt. Big shocker. Again, it's why I don't bother with such crap. Salting your passwords is the best you can do. Using PBHDF2 or scrypt isn't going to make it realistically any more difficult to crack "pa55w0rd".

    11. Re:Security implications by CycleMan · · Score: 1

      Expect computations to keep doubling this your key that takes a year to crack will take a few hours in the course of a few days in the next decade.

      Not to worry; I'll change my key several times before then.

    12. Re:Security implications by Anonymous Coward · · Score: 0

      That depends on what the password was hashed with (the assumption should be that the attacker has your password hash, plus the salt and algorithm). For things salted and hashed with MD5 or SHA-1, any password of length 8 or less is crackable in hours/days using hardware costing a few thousands of dollars. Assuming ~6 bits per character, that's 48-55 bits of strength for an 8-character randomized password. And it can be a lot lower if part of the password is an English word, maybe as low as 28-32 bits of strength.

      Picture three or four $300 graphics cards in a computer case with SLI/Crossfire mode, capable of cracking a MD5 hashed password in under a week. That's darned cheap ($2000/node) and easy to scale up as password cracking is an embarrassingly parallel problem. Assume that the attacker needs 1 week per password and runs the machine for 5 years. That puts the cost of cracking your MD5 hashed password down around $10 (including electric costs).

      Longer passwords help a bit. Twelve characters might get you into the 55-60 bit territory and fifteen characters puts you up around 60-70 bits. (Longer passwords typically have lower complexity per character, unless you are using password management software.) The other approach is to use a hash like SHA-256 which is run for 5,000 to 15,000 iterations or some other key-stretching technique / algorithm.

    13. Re: Security implications by Anonymous Coward · · Score: 0

      Latency, in nanoseconds, is likely unchanged in any significant way.

    14. Re:Security implications by Agripa · · Score: 1

      Also we still have no idea how memory latency is impacted (shorter paths, but also lower clocks).

      Latency is dominated by reading the DRAM itself and not the interface frequency which is why latency which is specified in clocks is roughly proportional to interface frequency.

  2. Repeated links? by valinor89 · · Score: 1, Insightful

    What is the point of providing more than one link if all of them point to the same page?

    1. Re:Repeated links? by Hognoxious · · Score: 2, Informative

      Because he gets a referral fee in both cases? All MojoKid's submissions are clickbait.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    2. Re:Repeated links? by ArcadeMan · · Score: 3, Funny

      I hate people who do that...

      And please don't forget to register via the links below to get free Bitcoins and Dogecoins so I can get more referrals!

  3. All of you should buy AMD whenever possible by Andrio · · Score: 2, Funny

    They're a substantially less evil force than either Intel or NVIDIA.

    --
    The Internet King? I wonder if he could provide faster nudity.
    1. Re:All of you should buy AMD whenever possible by Anonymous Coward · · Score: 0

      But are they less evil than Google, Comcast, or the US Government?

    2. Re:All of you should buy AMD whenever possible by Anonymous Coward · · Score: 1

      AMD Vs NVIDIA! Asian CEO kung fu showdown!

    3. Re:All of you should buy AMD whenever possible by lister+king+of+smeg · · Score: 0, Flamebait

      They're a substantially less evil force than either Intel or NVIDIA.

      But Intel and Nvida give me graphics better drivers for linux systems than the crap amd gives.

      --
      ---Saying gnome 3 is better than windows 8 not so much a compliment as it is damning with light praise.
    4. Re:All of you should buy AMD whenever possible by maugle · · Score: 2

      Is that still true in Nvidia's case? I originally bought an Nvidia card because of the supposed Linux-friendliness, but it's been giving me trouble. Meanwhile, I'm hearing about how AMD has opened up a lot more and is making good strides with its Linux drivers.

    5. Re:All of you should buy AMD whenever possible by Adriax · · Score: 2, Insightful

      I buy AMD because it's cheaper. I have yet to notice a real performance difference.
      The fact my money doesn't go towards bribing benchmark/review sites and prebuilt manufactures to lock competition out of the market is just a bonus.

      --
      I don't suffer from insanity, I enjoy every minute of it!
    6. Re:All of you should buy AMD whenever possible by bhcompy · · Score: 1

      ? People don't use Matrox for linux? What year is it?

    7. Re:All of you should buy AMD whenever possible by arbiter1 · · Score: 2, Interesting

      AMD's linux driver is still pretty far behind even though they claim to make some strides. when a r9 290x gets beat by a gtx750ti you know its pretty bad.

    8. Re:All of you should buy AMD whenever possible by lister+king+of+smeg · · Score: 1

      ? People don't use Matrox for linux? What year is it?

      no we all moved on to fdfx Voodoo cards.

      --
      ---Saying gnome 3 is better than windows 8 not so much a compliment as it is damning with light praise.
    9. Re:All of you should buy AMD whenever possible by Anonymous Coward · · Score: 0

      After the HDD crashed that had my Windows 8.1 installation on (mostly for gaming), I installed Ubuntu 14.04. My graphics card is Radeon R9 270x 2GB and my CPU is a Phenom II X4 955 3.2 GHz. First thing I did was install Steam (note, I am using the open source drivers, not the closed-source proprietary ones from AMD) and all the games I could for Linux. I can play Counter-Strike Source at 200-300 FPS just like Windows. Some games actually run faster, most are about the same. CSGO is just a bug laden POS so it's no surprise that one has issues, but overall I'm very happy by ditching Windows 8.1 for Ubuntu 14.04 and using the open source drivers on my machine. Things have come a long way.

    10. Re:All of you should buy AMD whenever possible by im_thatoneguy · · Score: 0

      I kept buying AMD to save a penny and my experience was iffy drivers (even on Windows) and two dead fans. Now I go NVidia and have had far fewer problems.

    11. Re:All of you should buy AMD whenever possible by Adriax · · Score: 1

      Neither amd nor nvidia makes the actual card, just the chips and design spec. Busted fans would be the fault of the card manufacture. And it's a good bet the same manufacture makes crappy nvidia cards too.

      Drivers are a valid point. Though not one I've experienced myself.
      Worst i can say about drivers is nvidia won't let you easily uninstall the support software if you replace a burnt out card with an amd. The uninstaller wouldn't run without a valid nvidia card installed.

      --
      I don't suffer from insanity, I enjoy every minute of it!
    12. Re:All of you should buy AMD whenever possible by Anonymous Coward · · Score: 0

      Honestly, after AMD tried to pay forum trolls free GPUs to have them infiltrate tech communities and blanket everything with misinformation with their northamericaninfluencer bullshit, I can't take posts like this seriously anymore.

      I'll do my own research from people who don't get paid to benchmark things and buy the best GPU for my use regardless of brand, thank you very much.

    13. Re:All of you should buy AMD whenever possible by X0563511 · · Score: 1

      The problem with AMD drivers (at least when I last tried) is they work great for some cards, and others hardly do anything beyond instigate kernel panics.

      --
      For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...
    14. Re:All of you should buy AMD whenever possible by Adriax · · Score: 1, Interesting

      Basic math.
      Top of the line AMD FX-9590 processor, $260. Top of the line Intel i7-5960X processor, $1050. (blatantly lazy, I just sorted newegg's processor list by price)
      Intel's performance lead over AMD, nowhere near 4x.
      I can't think of a normal situation that justifies that big of a price difference for the performance. I can build a whole second AMD computer for the price difference.

      Intel spent billions over the last 15 years kneecapping the competition. Any advancements they have right now is less than what we could be using today if the fight had been fair. Intel could very easily have been the top dog in a fair market, but we will never know.
      So you know what, go ahead. Call me a zealot all you want. I'll just stay happy over here knowing I don't supporting a company that is actively hampering progress.

      --
      I don't suffer from insanity, I enjoy every minute of it!
    15. Re:All of you should buy AMD whenever possible by drinkypoo · · Score: 4, Insightful

      Typical zealot. I use whatever tool is best for the job, be it AMD or Intel.

      History has shown that, like Microsoft, if you give Intel money they will use it for evil. Specifically, it will fund illegal anticompetitive behavior that retards progress in computing.

      If you're happy with that, keep giving Intel money. But keep in mind that yes, it really does make you an asshole when you give known assholes money on purpose.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    16. Re:All of you should buy AMD whenever possible by drinkypoo · · Score: 1

      Is that still true in Nvidia's case? I originally bought an Nvidia card because of the supposed Linux-friendliness, but it's been giving me trouble.

      Sorry to hear that, try another driver, an older one if necessary. I have a multitude of nVidia cards, and they all work right if you pin down the right driver. I have 6150 LE onboard in nForce-chipset boards, I have a Quadro 295 NVS in an HP C2D I just bought, might upgrade it to a C2Q for $50, really slick low-power setup there with support for CUDA 6.5 anyway, not too bad. 240GT, 450GTS OC, 750 Ti. Seriously all working great under Linux, but seriously none of them using the suggested driver.

      I keep hearing stories about nVidia driver being hard to install manually but I'm doing Linux From Scratch right now (I'm currently starting over after a successful LFS 7.7 build now that I'm educated, and combining CLFS 3 and LFS 7.7 along with building with GCC 5.1 and multilib glibc 2.21, libressl, compiz...) and I've found that the nVidia driver still installs flawlessly on LFS just like it did on say Slack back when the nvidia driver was new. But like I said, you have to go driver hunting. For example the driver which comes with CUDA 6.5 won't build with a modern kernel and/or toolchain, so I had to hunt up a different driver (340.76) to support my NVS 295. And when I got my spanking new 240GT, it wasn't even officially supported by the driver yet but being 3/4 of a 250GTS it did actually work... as long as I installed the next-to-newest driver, and not the very newest one. Then another version or two later they added explicit support for my card.

      On the other hand, I tend to try an ATI card every two or three GPUs, and I am always pissed off. I bought a gateway netbook with R690M chipset and that still isn't properly supported several years later. fglrx never supported it (said it was "too old") and radeon still trashes the display when used.

      If you want proper Linux support, you're better off with nVidia. If you want open-source driver support for a moderately new card, you may be better off with ATI, but it depends on the specific GPU. If you want open-source driver support for a very new card, that's pretty much just Intel anyway.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    17. Re:All of you should buy AMD whenever possible by Anonymous Coward · · Score: 1

      That's just the nature of high end products. They command a price premium. Particularly when the functionality is not available from other suppliers.

      To use Slashdot's beloved car/computer analogies, a high end supercar will cost you 10x the price of a regular car, but it can't go 10x faster, can't accelerate or decelerate 10x faster, can't go around a track 10x faster.

    18. Re:All of you should buy AMD whenever possible by Trongy · · Score: 1

      That's a bad comparision.
      The AMD FX-9590 ($260) is beaten by the i7 4790 CPU ($310).
      The $1050 i7-5960X CPU doesn't have any competition, which is probably why Intel charges such a large premium. I couldn't justify the purchase of one, but some people have to have the best, regardless of the cost.

      AMD cpus are only slightly cheaper than roughly equivalent Intel CPUs.
      I think they are good value for multithreaded performance, but Intel still has the lead for single thread performance which is important to most desktop users.

      I still don't know why AMD doesn't have a NUC/BRIX style platform. These are growing in popularity for media boxes and non-gamer desktops. They don't need the fastest or lowest power CPUs, just a middle of the road performance. I think AMD could compete quite will in this sector.

    19. Re:All of you should buy AMD whenever possible by Anonymous Coward · · Score: 1

      You're comparing apples to oranges by quoting Intel's top-end CPU prices.

      The competitor for the AMD CPU that costs around $260 is an Intel CPU that runs about $330. And that Intel CPU is about 30-45% faster for single-threaded stuff, and has enough under the hood to keep up on the multi-threaded tasks. It's sad but true, Intel is still the king of single-threaded performance and they have the die area to put enough cores on the chip to keep up with AMD's "many cores" approach.

      Still more expensive, but not 4x as expensive like you claim. If the AMD chip really did compete with Intel's high-end chip, then AMD would be charging more for it. Instead, AMD prices have to be set against the Intel chip against which they can go toe-to-toe.

      (I bought only AMD for over a decade. But their Steamroller/Piledriver/Bulldozer architecture just doesn't deliver for the things that I want to do. So I bought an i7 this year.)

    20. Re: All of you should buy AMD whenever possible by Anonymous Coward · · Score: 0

      Evil in what sense? The "making money is evil" sense? In that case, sure AMD is the good guy... Otherwise, don't be naive.

    21. Re:All of you should buy AMD whenever possible by inasity_rules · · Score: 1

      It is a bit of Russian Roulette...

      The propriety drivers aren't bad on my aging 6630M and run most things acceptably (except KSP, but that may be related to KSP on linux in general...?). Still no support for switchable graphics (in a dynamic and meaningful way), and the thermal management is slightly broken. On occasion I get a thermal shutdown.

      --
      I have determined that my sig is indeterminate.
    22. Re:All of you should buy AMD whenever possible by Lonewolf666 · · Score: 1

      The Open Source AMD drivers tend to lag a few years behind, especially in performance, so the latest generation of cards (HD 7xxx and its rebrandings) is hardly better under Linux than the HD 6xxx series, despite being more capable hardware in theory. Really new models may not work at all or only in 2D.

      Closed source AMD drivers still have an iffy reputation. Personally, I'd avoid them unless I really need the performance or a specific feature.

      --
      C - the footgun of programming languages
    23. Re:All of you should buy AMD whenever possible by Anonymous Coward · · Score: 0

      You pay US income tax, right?

    24. Re:All of you should buy AMD whenever possible by AgentElrond · · Score: 1

      I use the open source driver on an R9 270 optimized for quiet operation on Ubuntu 14.10. Mainly I use the machine for playing Skyrim at 1920x1080, which works surprisingly well. I'm sure I could get better performance with Catalyst or running on Windows, but performance is good enough for me: 25 fps outdoors, 60 indoors, using pretty much maxed graphics settings and the usual ton of mods.

      I use the open source driver because I've found it to be absolutely bullet proof stable and offer the best support for things like multi monitor, screen rotations, multiple instances of X and switching between them and the console, frequent suspend/resume cycles, etc. The other advantage is the complete absence of hassle - installation involved plugging it in and turning it on and I haven't had to bother with it since.

      If you use an AMD card and haven't tried the open source driver in a while it's worth checking out, the improvement recently has been impressive. I believe newer versions than I'm running have better performance still, but... systemd. And I don't feel like running xorg-edgers.

    25. Re:All of you should buy AMD whenever possible by drinkypoo · · Score: 1

      You pay US income tax, right?

      As little as possible, and only because men with guns will come to incarcerate you if you don't give them more money to hire more men with guns. Go back and re-read my comment until enlightenment reaches you.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  4. HotHardware Spam by Luthair · · Score: 0, Troll

    Do we really need to see an article from MojoKid every day to drive traffic to his site?

    1. Re:HotHardware Spam by drinkypoo · · Score: 2, Funny

      Do we really need to see an article from MojoKid every day to drive traffic to his site?

      The answer depends on how that question is interpreted. No, we don't need to see such an article. Yes, such an article is necessary to drive traffic to said site, if that's your goal anyway.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    2. Re:HotHardware Spam by Anonymous Coward · · Score: 0

      Do we really need to see an article from MojoKid every day to drive traffic to his site?

      We see the same few names getting stories posted nearly every single day. That means one (or a combination) of a few things:

      • The quality of submissions of these authors is consistently head and shoulders above everyone else, and/or they flood the queue with so many stories that something filters through almost every day
      • The quality of submissions in the queue by the rest of the authors is consistently putrid (although, since the Firehose was implemented, it's pretty obvious that that's always been the case)
      • Dicedot has an interest (monetary or otherwise) in driving traffic to certain sites (all of Nerval's Lobster's stories point to news.dice.com, itwbennett to IT World, Mojokid to Hot Hardware, HughPickens.com's user name is hughpickens.com, etc)
      • The authors are personal friends/acquaintances of someone on staff and get special treatment
      • Or maybe something else entirely, I'm not clairvoyant

      Not that any of this matters, of course. Posting AC because I fully expect this post to be modded into oblivion.

    3. Re:HotHardware Spam by Luthair · · Score: 1

      I take it the HotHardware people have mod points today

  5. Power savings by asliarun · · Score: 5, Interesting

    One has to give it to AMD. Despite their stock and sales taking a battering, they have consistently refused to let go of cutting edge innovation. If anything, their CPU team should learn something from their GPU team.

    On the topic of HBM, the most exciting thing is the power saving. This would potentially shave off 10-15W from the DRAM chip and possibly more from the overall implementation itself - simply because this is a far simpler and more efficient way for the GPU to address memory.

    To quote:
    "Macri did say that GDDR5 consumes roughly one watt per 10 GB/s of bandwidth. That would work out to about 32W on a Radeon R9 290X. If HBM delivers on AMD's claims of more than 35 GB/s per watt, then Fiji's 512 GB/s subsystem ought to consume under 15W at peak. A rough savings of 15-17W in memory power is a fine thing, I suppose, but it's still only about five percent of a high-end graphics cards's total power budget. Then again, the power-efficiency numbers Macri provided only include the power used by the DRAMs themselves. The power savings on the GPU from the simpler PHYs and such may be considerable."

    http://techreport.com/review/2...

    For high end desktop GPUs, this may not be much, but this provides exciting possibilities for gaming laptop GPUs, small formfactor / console formfactor gaming machines (Steam Machine.. sigh), etc. This kind of power savings combined with increased bandwidth cna be a potential game changer. You can finally have a lightweight thin gaming laptop that can still do 1080p resolution at high detail levels for modern games.

    I know Razer etc already have some options, but a power efficient laptop GPU from the AMD stable will be a very compelling option for laptop designers. And really, AMD needed something like Fiji - they really have to dig themselves out of their hole.

    1. Re:Power savings by Kjella · · Score: 2

      One has to give it to AMD. Despite their stock and sales taking a battering, they have consistently refused to let go of cutting edge innovation. If anything, their CPU team should learn something from their GPU team.

      Well unlike their CPU division the GPU division hasn't been the one bleeding massive amounts of cash, at least not until the GTX 970/980 generation from nVidia. Though with the R300 OEM series being a R200 rebrand they seem to be running out of steam, one limited quantity HBM card won't fix their lineup.

      This kind of power savings combined with increased bandwidth cna be a potential game changer. You can finally have a lightweight thin gaming laptop that can still do 1080p resolution at high detail levels for modern games.

      You still need power for shaders that is about 80-90% of a GPU's power consumption. In fact, AMDs problem is that even if they could swap out the GDDR5 for HBM today they still lose on performance/watt to nVidia's Maxwell 2. And the interposer is basically a giant semi-processed silicon die, it might be good for performance but it's probably not good for cost.

      Anyway, the slides are kinda impressive just like with their Zen processor and whatnot. But AMD has a rather poor track record of delivering products on schedule that live up to their hype. It's now been eight months since nVidia launched the GTX970/980 and we're still waiting for something new from AMD. You can't win if you don't get your tech into products and ship them.

      --
      Live today, because you never know what tomorrow brings
    2. Re:Power savings by Anonymous Coward · · Score: 0

      Disclaimer:I own shares in AMD and this news gets me really excited.

      As far as AMD vs Nvidia nowadays, Nvidia has been crushing AMD on power consumption and performance. If AMD can deliver with this innovation maybe they can get enough momentum and capital to compete with Intel and Nvidia again. Recent reviews have consistently shown AMD GPUs being too power hungry and always gets docked for points. Let's hope this can be the turning point for AMD. All of us as customers should hope as much to prevent an Intel-CPU and Nvidia-GPU monopoly.

    3. Re:Power savings by Anonymous Coward · · Score: 1

      "On the topic of HBM, the most exciting thing is the power saving. This would potentially shave off 10-15W from the DRAM"

      Umm, going to have to disagree with you there. We create very high end real time visual simulations that are inherently bandwidth bound, i.e. the limit to model complexity is generally how fast vertices and textures can be moved around.

      Most of my customers are going to be way more excited about potentially doubling our tripling the fidelity of their visualization than saving a few $ on power consumption.

      Sure, there are dense computing applications where performance per watt matters, but that doesn't represent the entire use case out there.

    4. Re:Power savings by Anonymous Coward · · Score: 0

      It's a little weird to consider their "GPU team" as such. They acquired a wildly successful GPU company that was already on the cutting edge to get it. AMD has a long history of swinging for the fence, hanging in there, being reborn with help from the value niche, rinsing, and repeating. They might get there someday, but you shouldn't discount their status just because they haven't been able to maintain parity with a shady behemoth like intel.

    5. Re:Power savings by Adriax · · Score: 1

      I'm not exactly a routine patron of review sites, but from my last bit of research the top sites all dock points because amd made the cards.
      Same with cpus. The first site google returned claimed no performance difference between my 6 year old 4 core 2.3ghz amd, my new 8 core 4.2ghz amd, and the single core intel my father's 8 year old netbook runs.

      There is a lot of blatant intel/nvidia bias out there and it's really annoying to wade through to get an idea of real performance.

      --
      I don't suffer from insanity, I enjoy every minute of it!
    6. Re:Power savings by Lonewolf666 · · Score: 1

      There is more in the works (Zen CPU cores), and I hope that will work out too. I have some AMD shares as well, but I'd also love to see serious competition again.

      On the CPU side, AMD is even more behind Intel than it is on the GPU side behind Nvidia. If Zen can fix that, it will be even more important than catching up to Nvidia.

      And considering APUs, one major drawback in the last years used to be memory bandwidth. Where discrete cards do reasonably well with GDDR5, APUs really get crippled by having to share DDR3 between CPU and GPU. A stack of HBM memory for the graphics RAM should do wonders here ;-)

      --
      C - the footgun of programming languages
    7. Re:Power savings by mr_mischief · · Score: 1

      Now, imagine if the whole APU could use HBM as the combined system RAM.

    8. Re:Power savings by non0score · · Score: 1

      I don't know why you're attributing this. One of the leading firms for 3D stacking is Tezzaron, and they've been around for more than a decade. You can give credit to AMD for taking on the risk of 3D memory, but not for innovating it -- other companies did most of that work.

    9. Re:Power savings by Lonewolf666 · · Score: 1

      Sounds interesting, but might be too expensive in the short run and not as urgent. "Normal" system RAM tends to be larger than typical VRAM sizes but not as bandwidth-critical. Also, classical DIMM modules allow upgrading if necessary.

      Where HBM as the combined system RAM looks interesting are consoles. The PS4 in particular already has
      - an APU based design from AMD
      - fast but expensive GDDR5 RAM
      - a fixed, non - upgradeable memory size

      That looks like a scenario which is just waiting for HBM. Though probably not in the current generation, as one of the main advantages (even higher bandwidth) might be irrelevant to a console with fixed performance requirements.

      --
      C - the footgun of programming languages
    10. Re:Power savings by Agripa · · Score: 1

      Well unlike their CPU division the GPU division hasn't been the one bleeding massive amounts of cash, at least not until the GTX 970/980 generation from nVidia.

      Their CPU division has also been bleeding people since the K8 was released. Apple has more ex-K8 employees than AMD. I have been told this this started and was primarily caused by Hector Ruiz.

    11. Re:Power savings by Anonymous Coward · · Score: 0

      You don't have to have a r9 290 to feel the heat. I am using 2 R9 285s and when GPU 1 kicks on the box temps jump up 10-15 C. That is with both GPUs @ 85% +. The mobo temps go from 31C to 45C and this is just to run one 26K monitor. I think that is what the Tonga was designed for 1 one 26K monitor with 2Gb memory. The CPU is externally water cooled AMD 8 core that runs at 50-60% capacity when gaming. Reducing from 8 dram chips to 1 one would make a difference on Tonga chip and increase the memory also. Loosing 20 watts of heat means 40 watts total when running 2 GPUs. Heat is OK in the winter, but it is not winter here.

        I don't think AMD will ever break the wintel lock on commercial notebook/tablet/desktop production, ever, even with the next gen of APUs. Winders did not release a 64 bit version desktop OS until intel was ready to produce them. Windews could done a desktop a XP version that would address past the 32 bit self imposed address barrier too, but chose not do it. 64 bit desktop Windoze was 1-2 years after I was running 64 bit Linux on a AMD 64bit desktop CPU.

    12. Re:Power savings by mr_mischief · · Score: 1

      The difficulty in replacing it is of course a factor. People could buy a CPU+memory package as an upgrade for a motherboard, though, and still use card-slot peripherals like extra network cards and such. Many motherboards even in the enthusiast space now have networking, sound, lots of USB, and lots of SATA onboard.

      I'd pay a few hundred dollars for a mid-line CPU with mid-high GPU and 16 or 32 GB of ultrafast memory on a card that mounts onto the motherboard in a great big socket. Heck, that motherboard could then have DIMM slots for a few GB of slower DDR3 just as a RAM cache for the drives. Better yet for a server would be 32GB of HBM with 128GB of DDR3 or DDR4 swap space.

      This all assumes HP or IBM doesn't make it to market with memristors / big racetrack memory that's much faster still. Any of that, too, though, would benefit from being more tightly coupled to the CPU.

  6. I think my next card is going to be AMD by davydagger · · Score: 3, Interesting

    nVidia fanboy since switching to Linux. A combination of them releasing their new unified driver, the latest nvidia chips being notoriously hard for nouveau, and now this, I think my next card is going to come from AMD

  7. Uneven height and cooling by valinor89 · · Score: 2

    All the graphics I have seen represent the HVM memory being much higher or thick that the GPU core part. Is that how it will be? And if so how are they going to make the lid make good contact with the hot parts when they are "recessed"?

    1. Re:Uneven height and cooling by mrchaotica · · Score: 1

      Go back to using separate heatsinks instead of one gigantic one?

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    2. Re:Uneven height and cooling by Anonymous Coward · · Score: 0

      From what I've seen on the few GPU HSFs I removed this is already the case (at least the uneven height between the 2 dies.) The HSF manufacturers used pads (loose fits) for the RAM and grease (tight fit) for the GPU. The tolerances may have tighten in the last few years but I'd bet they still go for precession milling to make the GPU contact and looser precession to make the RAM contact.

    3. Re:Uneven height and cooling by Rockoon · · Score: 1

      As far as cooling, note that the ram is also clocked "much lower" than traditional gddr and uses less power. In practical terms this means "this ram solves heat issues."

      --
      "His name was James Damore."
    4. Re:Uneven height and cooling by mr_mischief · · Score: 1

      They could use separate heatsinks or mill a recess into a bigger one for the memory.

  8. I take two things from this. by edxwelch · · Score: 2

    The good: next year AMD start making APUs with HBM. The only thing that was holding back the iGPU was memory bandwidth. So, now they can put a 1024 shader GPU on the die and not have it starved by bandwidth. That will have interesting applications: powerful gaming laptops much cheaper than those with a discreet GPU and HPC (especially considering HSA applications)
    The bad: this year AMD is only releasing one new GPU, Fiji. The rest are rebadges. And there is no new architecture. Even Fiji is making do with GCN 1.2

    1. Re:I take two things from this. by Blaskowicz · · Score: 1

      Not earth-shattering, but Tonga which is GCN 1.2 is the GPU that needs most a re-release and I think they release a new GPU this year, Iceland. That one is low end 128bit, and is about what you need to run the new AMD linux driver - which sadly requires GCN 1.2 (?) from what I think I've read.

    2. Re:I take two things from this. by 0123456 · · Score: 1

      The good: next year AMD start making APUs with HBM. The only thing that was holding back the iGPU was memory bandwidth. So, now they can put a 1024 shader GPU on the die and not have it starved by bandwidth.

      Yeah, now you just need to get 500W out of that chip that combines a power-hungry AMD CPU with a power-hungry AMD GPU.

    3. Re:I take two things from this. by Anonymous Coward · · Score: 0

      Making do? What's broken in GCN 1.2 that needs a new architecture to fix?

  9. No trolls please by magarity · · Score: 1

    which was eventually adopted by JEDEC 2013

    I hope the organizers were careful not to invite any patent trolls to that round.

  10. easier solution by slashmydots · · Score: 0

    They're moving the RAM modules closer to the controlling chip? Why can't they simply make the copper path that connects them slightly thicker and out of purer copper if they want mad bandwidth? Isn't that how all electronic modulated pulses work?

    1. Re:easier solution by Anonymous Coward · · Score: 1

      I'm glad you brought that up. Clearly, the AMD engineers spent all this time working on the wrong solution.

    2. Re:easier solution by matfud · · Score: 3, Informative

      No. To drive an external bus requires a lot of silicon space to handle the capacitance resistance and distance. This also requires a lot of power.

      Stacked chips required far smaller drivers. The distance is in the mm rather than decimeters. The insulators are far better (as the current and voltage can be far smaller). Capacitance is also far lower. And you do not need to have 1024 bit data paths + address + signaling on the motherboard which makes motherboards far simpler and cheaper to make. Not counting the problems with signal propagation along different length paths on a motherboard (designed into the chip in this case) or having interactions from the multilayer PCB traces.

      So yes there are very good reasons to do this.

    3. Re:easier solution by Anonymous Coward · · Score: 0

      Why can't they simply make the copper path that connects them slightly thicker and out of purer copper if they want mad bandwidth?

      Crazy all the constraints people working modern process have to deal with. Things we take for granted.. area is money, propagation delay, inductance between nearby components and layers and even capacitance of little bitty wires turn into big deals in addition to normal problems of working design and margins. Then you pile on local and global design restrictions from the foundries and pray you didn't screw up and leave some stray inductance from a nearby metal layer to add too much charge to a component and prematurely burn out your chips costing your company millions.

    4. Re:easier solution by matfud · · Score: 1

      Oh yes when you get to the hundreds of Mhz-Ghtz range the signaling properties and distance become very important. Electrical signals in copper do not travel at the speed of light. Even if they did you still would have timing issues due to different lengths of the copper traces. on the MB. It takes a lot of skill to negate this single effect. The wider the bus the harder it is to achieve

      Unless you get Monster cables of course!.

    5. Re:easier solution by Anonymous Coward · · Score: 0

      At a certain point, short of putting memory on chip, the only way to move forward is going to be a fibre bus that connects memory and peripherals.

    6. Re:easier solution by slashmydots · · Score: 1

      Have you seen how their company is doing lately?

    7. Re:easier solution by matfud · · Score: 1

      That is debatable. Bumping the signals from the silicon to light seems to be quite slow (sort of time expensive as these things go). Almost enough to negate the transmission time overhead. Many people are working on this issue (lasing elements on the silicon substrate and various other good ideas)

      But this seems to work. it avoids the large bus drivers it avoids the distance issues. It avoids the routing issues.
      Note that I said "avoid" a lot there:)

    8. Re:easier solution by Anonymous Coward · · Score: 0

      The LC factors get messy above 2Ghz and the components need to be closer to reduce the effective Z factors. Tuning Ghz 100 watt transmitter modules is fun until you get a RF burn.

  11. DRAM Latency Still Sucks. by Anonymous Coward · · Score: 0

    High bandwidth is kinda pointless when you still have to wait dozens of cycles between a request going on the bus and the data appearing,

    1. Re:DRAM Latency Still Sucks. by 0123456 · · Score: 2

      GPUs tend to be more bandwidth-constrained than latency-constrained, as they typically have long pipelines that can hide much of the latency. Though I'm sure you could make them really latency-constrained if you wrote shaders badly.

    2. Re:DRAM Latency Still Sucks. by Anonymous Coward · · Score: 1

      Not when your data accesses are very very pipelined like in the case of 3D graphics. Efficient rendering code is generally setup to render a small number of large batches precisely because of this pipelining.

  12. Memory addresses round robin by hackwrench · · Score: 1

    And if you stagger the memory addresses round robin and maybe offset the clock of each stack appropriately, might there be a performance gain as well?

    1. Re:Memory addresses round robin by Anonymous Coward · · Score: 0

      All modern memory systems will stripe nearby addresses across different banks of memory. That trick's been known for decades; everyone uses it. There's no relative performance gain from continuing to use it.

  13. This isn't new, NVIDIA is working on it too by D.McG. · · Score: 1

    Googling the JEDEC document number JESD235 from the article found several references with NVIDIA talking about this for 2 years now for their Pascal series of chips after Maxwell.

    future-nvidia-pascal-gpus-pack-3d-memory-homegrown-interconnect
    http://en.wikipedia.org/wiki/High_Bandwidth_Memory
    http://www.cs.utah.edu/thememoryforum

    1. Re:This isn't new, NVIDIA is working on it too by Xenx · · Score: 1

      From the summary: AMD partnered with Hynix and a number of companies to help define the HBM specification and design a new type of memory chip with low power consumption and an ultra-wide bus width, which was eventually adopted by JEDEC 2013

      AMD started this with Hynix 4 years ago, so obviously the tech itself isn't brand new. The new news, is the pending release of hardware using said technology.

  14. AMD should Patent this memory and collect the $$$$ by Anonymous Coward · · Score: 0

    Here is an example of significant technical advancement.
    I hope AMD is patenting this new RAM and collects big $$$$ from any technology company that uses it.
    Unlike Apple and Microsoft's stupid patents, this RAM does deserve patent protection.
    Also AMD badly needs the $$$$.

  15. Control-F by ckatko · · Score: 1

    Control-F "heat"

    [No Results]

    My thoughts exactly.

    That's not to say this isn't great stuff, but thermal issues have always been the first or second most important problem with stacked memory. The other problem being fabrication and routing. You can't put a heatsink on a die when the die is wedged between 5 other dies. So you're hoping a heatsink on the top and bottom is enough for the middle wafers, or you're running some sort of tiny heat exchanger system.

    1. Re:Control-F by Anonymous Coward · · Score: 0

      That is the problem for searching for a single word in a document when there are multiple ways to say the same thing. Try searching for "power" instead of "heat" and find multiple references to reduced power consumptions.

  16. ati/amd video card trap by mOzone · · Score: 1

    How about they work on the drivers? i love amd cards and new HBM is super ..but every 1-2 years i fall for AMD.ATI video card hype and fall into the driver hell trap. 260x and 285x horrid linux/windows support . i can live with heat or noise issues but driver crash's on a black desktop with icons ..come on before they break out new gpu how about fix the drivers ..nvidia gets hate for there drivers but they work and dont go bonkers

    1. Re:ati/amd video card trap by Anonymous Coward · · Score: 0

      For you, perhaps. People love to pretend that their experience is the definitive one, but in my experience all three driver sets (especially on Linux) are godawful. All of the companies should stop with this ridiculous head-up-ass mentality and do what AMD is doing, and just make as much of the driver open as possible, because it's not like it's going to give away any trade secrets anymore. Just hide those in your firmware like everyone else.

  17. Re:Huh? by 0123456 · · Score: 1

    So confused. GPU memory bandwidth really hasn't been an issue.

    Uh, yes, it has. That's why GPU manufacturers often made mid-range cards by putting the same chip on a board with a half-wide memory bus; saves money on RAM, and is significantly slower.

  18. Comment removed by account_deleted · · Score: 1

    Comment removed based on user account deletion

  19. Re:Huh? by Anonymous Coward · · Score: 0

    So confused. GPU memory bandwidth really hasn't been an issue. The 100 GB's barrier was broken many years ago. Is MAD just NOW realizing this? The issue with memory bandwidth is not in the GPU, but on the CPU side where memory bandwidth currently can be around 10-50 GB's. But that doesnt matter considering the real issue is Hard Drive bandwidth. At only a few hundred MB's a second, that issue needs to be addressed first.

    AMD seems to be stuck in mid 2006 for some reason.

    1: Please note the difference between Gb and GB. Its a factor of 8, and pretty relevant here (I haven't seen cards with 100+ GB/s. I have seen ~200Gb/s).

    2: I'm a graphics programmer, and personally I've seen more cases of video memory bandwidth being the limit than compute. Its a major issue and I've done a lot of work trying to do better bandwidth use. I have written applications that basically run at a speed proportional to the video memory bandwidth. Its very clearly not a solved problem at 1080p, and is 4x worse at 4k. All the high FPS VR stuff isn't helping either. We need more bandwidth more than just about anything else on the high end graphics side these days.