Slashdot Mirror


Samsung Begins Mass Production of World's Fastest DRAM (hothardware.com)

MojoKid writes: Late last year marked the introduction of High Bandwidth Memory (HBM) DRAM courtesy of AMD's Fury family of graphics cards, each of which sports 4GB of HBM. HBM allows these new AMD GPUs to tout an impressive 512GB/sec of memory bandwidth, but it's also just the first iteration of the new memory technology. Samsung has just announced that it has begun mass production of HBM2. Samsung's 4GB HBM2 package is built on a 20 nanometer process. Each package contains four 8-gigabit core dies built on top of a buffer die. Each 4GB HMB2 package is capable of delivering 256GB/sec of bandwidth, which is twice that of first generation HBM DRAM. In the example of NVIDIA's next gen GPU technology, code named Pascal, the new GPU will utilize HBM2 for its frame buffer memory. High-end consumer-grade Pascal boards will ship with 16GB of HBM2 memory (in four, 4GB packages), offering effective memory bandwidth of 1TB/sec (256GB/sec from each HMB2 package). Samsung is also reportedly readying 8GB HBM2 memory packages this year.

65 comments

  1. Go AMD! by Foxhoundz · · Score: 5, Insightful

    You are the underdog and the warrior that has stood the wrath of Nvidia! I will keep buying your products until the bitter end!

    1. Re:Go AMD! by Anonymous Coward · · Score: 0

      exactly how i feel as well xD

    2. Re: Go AMD! by Anonymous Coward · · Score: 0

      AMD CPU and nVidia GPU is the way to go.

    3. Re:Go AMD! by saloomy · · Score: 4, Informative

      The HBM2 memory is manufactured by Samsung and likely to be used in both GPU boards. Even the summary states "In the example of NVIDIA's next gen GPU technology, code named Pascal, the new GPU will utilize HBM2 for its frame buffer memory. High-end consumer-grade Pascal boards will ship with 16GB of HBM2 memory (in four, 4GB packages), offering effective memory bandwidth of 1TB/sec (256GB/sec from each HMB2 package).".

      As far as AMD vs. NVIDIA... competition breeds innovation, I'm happy they complete and make better packages to fight for the GPU crown.

    4. Re: Go AMD! by Anonymous Coward · · Score: 0

      Not if you want upgrade paths and similar or better TDP profiles on their upgradable range. ESPECIALLY on laptops but it equally applies to desktop sockets. This is where Intel has a better selection, of TDP profiles on their compatible sockets. AMD really sucks here.

    5. Re:Go AMD! by Anonymous Coward · · Score: 0, Troll

      Pity their drivers suck.

      I've bought two AMD GPUs. Both times I've been burned by poor drivers and poor after sales support, even though the hardware itself was quite good. I haven't had the same problem with any of the four nVidia GPUs I've had. Yes I've had bad nVidia drivers at times, and even a defective nVidia card (replaced by vendor) but they were usually fixed pretty quickly, even for older cards. The last driver update for my AMD APU broke all hardware accelerated video decoding, tough luck, that was the final release, no longer supported. Yet every few weeks it still nags me to update to these broken drivers. Try to find out if my APU has windows 10 drivers, I have a 6xxx series GPU. They say 6xxx series cards will have drivers, but my APU is not supported. They say the APU is really a 4xxx series card with a 6xxx series name. How is that my problem? You labelled it a 6 series part, you said you would support 6 series parts in windows 10, release the drivers for it. I know it exists, you released a beta for it. Won't do it for business reasons (read: time for you to buy newer hardware from us).

      Never again AMD.

    6. Re:Go AMD! by Gadget_Guy · · Score: 1

      It's a shame you had problems, but I haven't had any driver problems from my AMD cards. This includes the 6250-powered APU running Windows 10 that I am using to write this message.

    7. Re:Go AMD! by Anonymous Coward · · Score: 0

      I never knew there was such a strong market for constant disappointment.

    8. Re:Go AMD! by oic0 · · Score: 1

      I had the reverse issue. Owned AMD cards since the 2000 series. Very little driver problems, nothing remarkable. Then I got an Nvidia 970. Glitchy drivers all around and a corrupt mouse cursor all the time. Windows 10 came out. Still having the same issues but add flickers and the screen going funky colors. They finally fixed all of that after several months of windows 10 but overall my experience with my 970 has been full of driver bugs (most related to being at 4k 60hz, but if the card supports it, it should support it!).

    9. Re:Go AMD! by Anonymous Coward · · Score: 0

      The 6xxx series was a mix of various generations.

      The 6250 was on the E1 series of APU and is supported on windows10. The 6320 for the E450 APU, for instance, is not supported on windows 10.

    10. Re:Go AMD! by smallfries · · Score: 0

      Thank you for your sacrifice. I will shed a tear for you as I play on my GTX980ti.

      --
      Slashdot: where don knuth is an idiot because he cant grasp the awesome power of php
    11. Re:Go AMD! by Anonymous Coward · · Score: 0

      Sounds like you had a bunch of issues related to being at the bleeding edge with a single card.
      This is less surprising and is often the "bleeding" part that early adopters put up with.

      Users using mid range cards on established operating systems with much less onerous requirements shouldn't have to put up with similar issues, nor previously working functionality stopping with no response from AMD after months, nor support being dropped from serviceable hardware not due to technical requirements, but business ones.

    12. Re:Go AMD! by Anonymous Coward · · Score: 0

      Absolutely the wrong approach. Sometimes the underdog is in the position of being the underdog for a reason - it's not automatically a virtue. I'm aware of Intel's shady past (and present) but it's not as if AMD haven't made their own bone-headed mistakes and decisions that have lead to their current position. We need results as well as words, and heck, the Linux situation with AMD ain't great when compared to NVIDIA's (proprietary) offerings.

    13. Re: Go AMD! by mSparks43 · · Score: 1

      Do amd have decent Linux support yet.

      Not bought anything from them for about 9 years now after I made the switch to linux for my desktop and amd had near zero linux support.

    14. Re: Go AMD! by qbast · · Score: 2

      What? AMD can barely reach the lower-middle end now. The time when AMD was actually a viable alternative to Intel (around first Athlon) are long gone.

    15. Re: Go AMD! by Anonymous Coward · · Score: 0

      No they don't.
      Not trolling, I am long time Nvidia user, but will ditch it as soon as open source drivers on AMD will reach parity on speed/features. Features parity is reached and in some way is outdone, but performance is just trash yet.

    16. Re:Go AMD! by Anonymous Coward · · Score: 0

      Ummmm.... your graphics card is faulty. Sounds like bad vram. You should warranty it.

    17. Re: Go AMD! by ACE209 · · Score: 1

      Exactly my way.

      The AMD CPUs are quite a bit cheaper than the Intel ones. And usually are enough for the games I want to play.

      The temperatures this CPUs reach can be a bit frightening. Mine runs usually between 60C and 70C. But so far none left the magic smoke out.

      My usage of nVidia Cards comes more from habit than from strong opinions about either of the companies.

      --
      "we are all atheists about most of the gods that societies have ever believed in. Some of us just go one god further."
    18. Re:Go AMD! by vyvepe · · Score: 1

      I used both AMD and nVidia cards and I did not have driver problems with any of the cards. But nVidia hardware did die after about a year :-/

    19. Re: Go AMD! by Anonymous Coward · · Score: 0

      I got my 6850 amd card a few years ago specifically for the linux support. At least in gesture I wanted to support them for it... It's support was remarkably worse than nvidia in just about every way when it came to actually using it on my TV.

    20. Re: Go AMD! by Anonymous Coward · · Score: 0

      We're talking graphics now, but wait for the Zen processors. If any of the leaked material is true the Zen architecture is going to be an asskicker. It's enough to have stopped me from a new i7-5390K build to wait and see what Zen brings.

    21. Re: Go AMD! by Redbehrend · · Score: 1

      Nvidia talks crap about amd and doesn't share anything it develops. Yet as soon as amd comes out with something new, they use it....ooh the irony.

    22. Re:Go AMD! by Anonymous Coward · · Score: 0

      A "Troll" mod does not mean "I disagree".
      This was no troll, merely a frustrated end user.

    23. Re:Go AMD! by dave420 · · Score: 1

      It is supported under Windows 10 - I know because my little media PC at home has the E450 APU with the 6320, and it runs Windows 10 just fine.

    24. Re: Go AMD! by Anonymous Coward · · Score: 0

      Zen does sound good, but all of the claimed improvements should only bring it up to Ivy bridge performance and who knows about power. But yes, it's sounds good enough to wait and see. Bring on the benchmarks!

  2. AMwho? by Anonymous Coward · · Score: 0, Troll

    AMD is now just another ARM maker, they lost the x86x64 war, now they're going to lose the ARM war too (too many competitors in the ARM market), AMD also lost the GPU war.

    Where do AMD belong? The only place I see AMD, is on support forums with people crying things are not working, or they cannot upgrade, my drivers are rubbish or some other broken reason.

    1. Re:AMwho? by Anonymous Coward · · Score: 0, Troll

      Where do AMD belong?

      Crowed about by delusional fanboys.

      I love those guys. I want them to keep wasting money on inferior products, because like it or not, competition is a very necessary thing. Without AMD, Intel and nVidia would both quickly become fat and lazy.

    2. Re:AMwho? by Anonymous Coward · · Score: 0

      There IS competition, ARM, MIPS in the embedded space and also the light books (chromebooks and tablets and IoT space which will be HUGE).

      Graphics competitors also include Matrox (dominant in big displays and medical / scientific / vision systems).

      If AMD flop, their assets will be bought up, probably split between Intel and ARM.

      Perhaps even a group of investors to keep it running. New management is needed, new direction is needed. AMD need to do a Microsoft and turn around.

    3. Re:AMwho? by Anonymous Coward · · Score: 0

      Intel's newest gen of low power CPUs are on par with ARM's power usage and Intel is about 30% more efficient when it comes to work done per unit of energy. That's a sizable gap that hasn't gone away over the past decade since ARM tried to break into servers.

  3. Draym! by turkeydance · · Score: 1

    that's NVIDIA-ah to you.

  4. More Slashvertisements by Anonymous Coward · · Score: 0

    Great, just what we need, more ads on Slashdot. Weren't these Slashvertisements supposed to have different colors to distinguish them from real stories? Is DICE so desperate for money that they have to run ad after ad as stories like the previous one and this one? Geez! I remember when Slashdot used to talk about open source, Linux, and stuff that mattered, not ads.

    1. Re:More Slashvertisements by Anonymous Coward · · Score: 0

      OK, Rob. You said you'd go. Now go.

    2. Re: More Slashvertisements by Anonymous Coward · · Score: 0

      Well it is mojokid... You can never set your expectations low enough. Wonder how much kickback he gets.

  5. This stuff is meant to be pretty fancy. by AbRASiON · · Score: 3, Informative

    The initial AMD Fury card was a bit of a disappointment, I mean it is quite fast for it's size and it's also quite fast for only 4GB memory onboard, but it didn't thrash the nvidia 980Ti it competes with, despite being a newer technology with more memory bandwidth.
    I haven't investigated (nor do I care to) as to /precisely/ why, but it may be the AMD GPU itself is simply not powerful enough to use that bandwidth effectively or the 4GB holding it back due to texture size.

    *THAT* being said, that's phase 1 of HBM, phase 2 is about to kick in this year for both AMD and nvidia and premium video cards will be utilising this technology in the high end for certain.

    The other thing that's frequently mentioned when these are brought up is that this on chip (or is it on package?) memory is going to be utilised in some of AMD's mid tier APU chips (the CPU / GPU combined ones) which should make some onboard video surprisingly damn good in the coming future. Perhaps not dedicated GPU good but may compete well with low to mid tier dedicated GPU's now.

    Also for compute functions for scientific stuff or whatever people use all that number crunching stuff with dedicated GPU's for, this will be far better. (Apparently it's similar to Intel Xeon Phi or some such? (Knights landing) https://en.wikipedia.org/wiki/...

    I guess ultimately what has enabled this technology to exist is stacking ram (?) since they can fit 4GB of memory inside a single, very small chip.
    (Here you can see the existing stuff, 1GB in a single chip, the 4 smaller chips around the GPU) https://www.google.com.au/sear... soon to be 4GB in presumably the same physical space and 8GB shortly

    It looks to me like stacked ram is the future in many things (SSD capacity booming due to this)
    It's all pretty exciting for the future of bandwidth, 1TB/s is pretty nice and I imagine it'll only go up from there.
    (I read some theories recently about 'stacking' CPU's too, although the heat may become an issue? but if they can lay out 48 layers of memory inside a chunk of silicon, why not lay out multiple processors) however that's for the smart people to figure out.
    Please read the replies to this post as I don't follow as closely as I used to and several pieces of information here might be slightly off.

    1. Re:This stuff is meant to be pretty fancy. by complete+loony · · Score: 2

      ... may be the AMD GPU itself is simply not powerful enough to use that bandwidth effectively ...

      Building the right balance of compute units, memory capacity & bandwidth is a hard problem. Games developers will target high frame-rates on their target hardware, optimising or cutting back features until everything works well enough.

      We will have to wait and see if developers will find creative ways to use this bandwidth increase.

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    2. Re:This stuff is meant to be pretty fancy. by Anonymous Coward · · Score: 0

      I think a great deal of that has to do with AMD have shitty drivers for their cards.

      You can see it when they compare DirectX 11 to DirectX 12 implementations of stuff.

      When it comes to running the stuff in Direct X 11. Compared to Nvidia, AMD sucks nuts and swallows when it comes to performance.
      But as soon as they put it in Direct X 12, AMD starts to kick ass.

      So honestly think it is a case of AMD have crap drivers that kneecap it when it comes to DirectX 11 and older stuff while in the newer Direct X 12 implementations which give the developers "closer to the metal" access to the cards and do the stuff themselves, it runs like a bat out of hell.

    3. Re:This stuff is meant to be pretty fancy. by Anonymous Coward · · Score: 1

      All that but you can't tell it's from its?

    4. Re:This stuff is meant to be pretty fancy. by rahvin112 · · Score: 2

      Game designers design for the lowest common denominator. This memory advance will be irrelevant until it's in 50% of the GPU's out there. This has been true for years and will remain true for years to come. No sane game developer would target the performance of a card that 90% of GPU's can't support.

    5. Re:This stuff is meant to be pretty fancy. by Anonymous Coward · · Score: 0

      Studios seem to go for the high end of the current crop, because the games have development times measured in years, by which time it'll be low to mid-tier. This has, however, ceased being the case for PC since some studios started primarily targeting consoles and porting.

      Indies target whatever they can scrounge or whatever they're comfortable with.

    6. Re:This stuff is meant to be pretty fancy. by Anonymous Coward · · Score: 0

      "This has been true for years and will remain true for years to come. No sane game developer would target the performance of a card that 90% of GPU's can't support."

      *cough* Crysis.

    7. Re:This stuff is meant to be pretty fancy. by TheDarkMaster · · Score: 1

      I also find it strange. In theory one Fury should be faster than a GTX980 but what happened was the opposite. I think the reason is the fact that the HBM chip behaves like a NAND flash chip: Slow on individual access but fast when you access multiple chips in parallel in a RAID0 like scheme

      --
      Religion: The greatest weapon of mass destruction of all time
    8. Re:This stuff is meant to be pretty fancy. by iampiti · · Score: 1

      I don't actually think it'll be relevant until it's available in the consoles since very few houses develop exclusively for PCs.
      That would delay the mainstream usage of HBM until the next console generation

  6. HBM memory... by Grog6 · · Score: 1

    What I want is a Motherboard that will use one of these stacks to feed my 4-channel Intel socket 2011-3 processor.

    The current max for memory is ~25Gb/s/channel, so 4 channels from one device still leaves a lot on the table for improvement.

    Two processors could keep one busy... :)

    --
    Truth isn't Truth - Guliani
    1. Re:HBM memory... by Anonymous Coward · · Score: 0

      A CPU wouldn't even scratch those things - a cache line is still 16 bytes and CPUs just can't work with it. You need a fundamentally different approach to computation to get anywhere close to saturating that bandwidth. Very, very, VERY big SIMD. Big vector units. The bus is hilariously wide compared to ye' standard PC bus. Even a GPU core is going to struggle to move that data around under the most ideal conditions.

      Think about it. With a 4Ghz clock even if you could load 64bits/8 bytes in one clock you're only moving 32 billion bytes around 32GB/sec peak if you're doing no processing whatsoever and not stalling. You'd need 32 processors just to be able to load 1TB/sec bandwidth without doing anything else and many more processors to actually do anything. This memory is just not designed for a stock PC CPU.

      To get more performance we need to bring latency down - I can only imagine that would be much worse for this memory.

    2. Re:HBM memory... by Anonymous Coward · · Score: 0

      You can't do that. HBM is slower (transaction rate per wire) and wider: you will need a lot more pins (AKA new CPU and crazy socket) to have a chance at that, though the pins can be lower quality and will consume less power to drive (this is where the potential cost savings come from).

      For typical single threaded CPU workloads, HBM might actually be a loss, despite its high bandwidth. A GPU is a bunch of ALUs organized to handle latency as much as possible by having a large pool of pending work (its like hyper-threading * 100), in addition to being very parallel. CPUs are the opposite: they are designed to pay massive costs (power, area, complexity) to minimize latency. For your CPU memory by far the biggest limiting factor is latency, not bandwidth. HBM very likely has worse latency than your existing DDR3 SDRAM. I would like further stats on HBM latency numbers though.

      Its hard to imagine a reasonable task that would benefit from HBM that is better run on a CPU than GPU.

    3. Re:HBM memory... by willy_me · · Score: 1

      About sockets - HBM is integrated into the IC package thereby negating the need for additional IC pins. The claim of poor latency is also a non-issue. The various different caches sitting between the ALU and memory are there to hide memory latency and bandwidth limitations. Even if HBM has higher latency, which I doubt, the CPU cache would largely hide this fact.

  7. Better FDTD simulations by Anonymous Coward · · Score: 1

    I found out the hard way that memory bandwidth was the bottleneck for this activity.

  8. In random units... by Anonymous Coward · · Score: 0

    1TB/sec = approx. 34.7 hours of 8KB 1080p porn videos per second...

  9. For servers also, with ECC included! by Anonymous Coward · · Score: 0

    for use in high performance computing (HPC), advanced graphics and network systems, as well as enterprise servers

    Samsung’s 4GB HBM2 also enables enhanced power efficiency by doubling the bandwidth per watt over a 4Gb-GDDR5-based solution, and embeds ECC (error-correcting code) functionality to offer high reliability.

  10. Amazing bandwidth, no better latency. by ameline · · Score: 4, Interesting

    The latency for HBM and other technologies of its ilk are no better (even slightly worse) than DDR3.

    It's no good for large last level caches -- but 8 of those 8 TB stacks would make for a nice 64GB of RAM with 2TB/sec bandwidth. I'd like to see that connected to a good CPU.

    --
    Ian Ameline
    1. Re:Amazing bandwidth, no better latency. by Anonymous Coward · · Score: 0

      the underlying physics of accessing a DRAM cell have not changed... one can increase the bandwidth of the chip-to-chip interface all you like, but it takes time to precharge/sense/etc, and so latency is what it is.

    2. Re:Amazing bandwidth, no better latency. by Anonymous Coward · · Score: 0

      IO bandwidth is nothing without the FLOPS to use it, and FLOPS are (mostly) useless without the IO to feed them.

      I forsee no present or near-future CPU able to utilize 20x the memory bandwidth.

      GPUs, on the other hand, have processors that are obscenely overpowered relative to main memory to the point that you consider computing things as free. A K80 can perform nearly 6 trillion calculations per second, but its main memory bus can supply only 60 billion new floats per second: Less than 100 flops per gld/gst instruction and it starves. More remote fetches only get more abhorrently worse. Waiting for PCIe to do an RDMA from another card on the node, or for transfer to CPU memory? It might take an appreciable fraction of a second to transfer a single gigabyte! 10gbps ethernet is like the howling eternity between words.

      2TBps to global memory is what it would take to bring the FLOPS:IO ratio in GPUs back to something *sane*, i.e. something that works *well* for things other than embarassingly parallel problems.

    3. Re:Amazing bandwidth, no better latency. by Anonymous Coward · · Score: 0

      Even with shitty latency, I'd like to see a kernel module that can use the HBM memory on some of the APU's or those on the GPU units to be used for filesystem shenanigans like swap.

      I know that there is already a module to use GPU VMEM as a sort of tmpfs. Could be updated to utilize these HBM chips.

    4. Re:Amazing bandwidth, no better latency. by Anonymous Coward · · Score: 0

      It would be nice to see SRAM cache size grow to the point where cache can be abandoned and a simpler/faster CPUSRAM model adopted

    5. Re:Amazing bandwidth, no better latency. by Anonymous Coward · · Score: 0

      It's no good for large last level caches

      They are better scratch pad memories as parts of software controlled memory hierarchy, which give a better multicore scalability compared to implicit caches.

    6. Re:Amazing bandwidth, no better latency. by Anonymous Coward · · Score: 0

      For graphics, bandwidth is still pretty damn important

    7. Re:Amazing bandwidth, no better latency. by Anonymous Coward · · Score: 0

      The latency is worse per cycle, but much better when measured in absolute time. HBM cuts GDDR5 latency in half. The reason for this is if you look at the actual clock rate at which the memory runs, the high bandwidth memory is not much faster, but the width of the data is. This means the high bandwidth memory has a worse ratio of bandwidth to clock rate, making the latency look bad.

  11. 64 bytes by Anonymous Coward · · Score: 0

    cache lines are 64 bytes on any intel cpu made in the last 100 years.

    1. Re:64 bytes by behrooz0az · · Score: 2

      dog years?

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion. -- Spazmania (174582)
  12. Chips made with the tech in mind. by Z80a · · Score: 1

    I wonder how many kludges and "offloads" amd/nvidia will be able to pull off with this sort of external bandwith, increasing the performance/lowering costs in the process.

  13. AMD will likely be the first to ship HBM2 GPU too by Kartu · · Score: 1

    It has been said that Pascal GP104 will use GDDR5X. If Nvidia repeats the cycle GP104 will be their flagship and big Pascal GP110 wont be GeForce ready until next cycle some time in 2017.

    Which would make sense, considering that nVidia has no experience with HBM.

    AMD - Hynix collaboration on HBM started a while ago, by the end of 2013 they've only "finalized HBM 3D memory", it took 2 more years to ship Fury series GPUs with HBM::
    http://linustechtips.com/main/...

  14. Fury Nano rocks after price drop (499$) by Kartu · · Score: 1

    After price drop on Fury Nano, it costs like 980 (non TI) (499$) while handily beating it in most games, tiny form factor.

  15. :faceplam: by Anonymous Coward · · Score: 0

    Reading anything about HBM always makes me want to do a facepalm! The reason is that HBM is NOT fast... per say. It is only really fast BECAUSE IT HAS A 1024BIT BUS! Any DRAM riding on such a freaking wide bus will technically be "fast" If we actually cut down the bus width, we find that HBM is actually slower bit-per-bit than say a modern DDR5 equipped video card. Example, Nvidia GT 730 (DDR5 version) video card uses DDR5 on a 64bit bus and achieves 40GB's a second at 1250Mhz. HBM on the other hand at 64 bits wide achieves 32GB's a second.

    In short, any video card OEM can widen the bus width of their product and achieve the same or higher results. So I ask, what's the point to HBM? Placing it on, or near the die package so there's less room for heat to dissipate so it either fails prematurely, or runs into a TDP ceiling? I just don't get it.

    1. Re::faceplam: by Anonymous Coward · · Score: 0

      Hint: P = f C U^2