Next-Gen GPU Progress Slowing As It Aims for 20 nm and Beyond
JoshMST writes "Why are we in the middle of GPU-renaming hell? AMD may be releasing a new 28-nm Hawaii chip in the next few days, but it is still based on the same 28-nm process that the original HD 7970 debuted on nearly two years ago. Quick and easy (relative terms) process node transitions are probably a thing of the past. 20-nm lines applicable to large ASICs are not being opened until mid-2014. 'AMD and NVIDIA will have to do a lot of work to implement next generation features without breaking transistor budgets. They will have to do more with less, essentially. Either that or we will just have to deal with a much slower introduction of next generation parts.' It's amazing how far the graphics industry has come in the past 18 years, but the challenges ahead are greater than ever."
As a Radeon 7970 early adopter, I am completely fine with this. It still more than kicks butt at any game I throw at it, and hopefully this slow pace will mean that I'll get another couple of good years out of my expensive purchase.
Meanwhile, Intel is about to give us 15 core Ivy-bridge Xeons. A year from now we'll have at least that many Haswell core Xeons, given that they have the same 22nm feature size.
How many cores will 14nm Broadwell parts give us (once they sort the yield problems?) You may expect to see 4-5 billion transistor CPUs in the next few years.
Yay for Moore's law.
Played across 24 monitors. Who really needs this crap?
I started out gaming on the computer on computers with no GPU, and when I got one with a Rage Pro 4MB it was awesome. Then I got a Voodoo card from 3DFX with a whopping 8MB and it was more awesomer. Now you can get whatever that will do whatever for however many dollars.
I really don't see the game programming keeping up with the GPU power. I'm at least 2 GeForce's behind the latest series (560ti) and I can play any game at 1200p resolution with a very decent framerate. Yes I beta-tested Battlefield 4. How much more is enough? I don't want them to stop trying, but somebody needs to ask where it reaches the point of diminishing returns. They could focus on streamlining and cheapening the "good enough" lines...
Of course it's come far in the last 18 years. Last 2 years? Not so much. In fact GPU advancement have been _pathetically_ slow.
The Xbox One and PS4, for example, will be good at 1080p but ultimately only a few times faster than the previous generation consoles. Same thing with PC graphics cards. Good luck gaming on a high resolution monitor spending less than $500. Even Titan and SLI are barely sufficient for good 4K gaming.
I'm ready for the next (or next-next) gen display, holographic display hovering in midair. Preferably with sensors that can detect my interactions with them. Wonder how far out THAT is now ?
More GPU power translates into more detailed geometry and shaders as well as more GPGPU power to calculate more detailed physics.
No, seriously. I have yet to find a graphics card that will accelerate 2D line or bitmapped drawings, such as are found in PDF containers. It isn't memory-bound, as you can easily throw enough RAM to hold the base file, and it shouldn't be buffer-bound. And yet it still takes seconds per page to render an architectural print on screen. That may seem trivial, but to render real-time thumbnails of a 200 page 30x42" set of drawings becomes non-trivial.
If you can render an entire screen in 30ms, why does it take 6000ms to render a simple bitmap at the same resolution?
(the answer is, of course, because almost nobody buys a card for 2D rendering speed - but that makes it no less frustrating)
Is it just my observation, or are there way too many stupid people in the world?
The 'point' of this very crappy article is that each process node shrink is taking longer and longer. Why bother connecting this to GPUs, when self-evidently ANY type of chip relying on regular process shrinks will be affected?
The real story is EITHER the future of GPUs in a time of decreasing PC sales, rapidly improving FIXED-function consoles, and the need to keep high-end GPUs within a sane power budget ***OR*** what is the likely future of general chip fabrication?
Take the later. Each new process node costs vastly larger amounts of money to implement than the last. Nvidia put out a paper last year (about GPUs, but their point was general) that the cost of shrinking a chip may become so high, that it will ALWAYS be more profitable to keep making chips on the older process instead. This is the nightmare scenario, NOT bumping into the limits of physics.
Today, we have a good view of the problem. TSMC took a long time to get to 28nm, and is taking much longer to get off. 20nm and smaller aren't even real process node shrinks. What Intel dishonestly calls 22nm and 14nm is actually 28nm with some elements only on a finer geometry. Because of this, AMD is due to catch up with Intel in the first half of 2014, with its FinFET transistors also at 20nm and 14nm.
Some nerdy sheeple won't believe what I've just said about Intel's lies. Well Intel gets 10 million transistors per mm2 on its 22nm process, and AMD, via TSMC, gets 14+ million on the larger 28nm process. Defies all concept of maths when Intel CLAIMS a smaller process, but gets far less transistors per area against a larger process.
It gets more complicated. The rush to SHRINK has caused the industry to overlook the possibilities of optimising a given process with new materials, geometry, and transistor designs. FD-SOI is proving to be insanely better than finFET on any current process, but is being IGNORED as much as possible by most of the bigger players, because they've already invested tens of billions of dollars in prepping for FinFET. Intel has had two disastrous rounds of new CPU (Ivybridge and Haswell), because FinFET failed to deliver any of the theoretical on the process they 'call' 22nm.
Intel has one very significant TRUE lead, though- that of power consumption in its mains-powered CPU family. Although no-one gives a damn about mains-powered CPU power usage, Intel is more than twice as efficient than AMD here. Sadly, their power advantage largely vanishes with mobile, battery powered parts.
Anyway, to flip back to GPUs, AMD is about to announce the 290x, the fastest GPU, but with a VERY high power usage. Both AMD and Nvidia need to seriously work to get power consumption down as low as possible, and this means 'sweet spot' GPU parts which will NOT be the fastest possible, but will have sane compromise characteristics. Because 20nm from TSMC is almost here (in 12 months max), AMD and Nvidia are focused firstly on the new shrink, and finFETs, BUT moving below 20nm (in a TRUE shrink, not simply measuring the 2D profile of finFET transistors) is going to take a very, very, very long time, so all companies have an incentive to explore ways of improving chip design on a GIVEN process, not simply lazily waiting for a shrink.
Who knows? FD-SOI offers (for some chips) more improvements than a single shrink suing conventional methods. It is more than possible that by exploring material science, and the physics of semiconductor design, we could get the equivalent of the advantages of multiple generations of shrink, without changing process.
The Xbox One and PS4, for example, will be good at 1080p but ultimately only a few times faster than the previous generation consoles.
I believe the same sort of slowdown happened at the end of the fourth generation. The big improvements of the Genesis over the Master System were the second background layer, somewhat larger sprites, and the 68000 CPU.
Good luck gaming on a high resolution monitor spending less than $500.
Good luck buying a good 4K monitor for substantially less than $4K.
How strange that it just happens that WiFi encryption standards fall with the power of last generation cards... Just a coincidence, I'm sure.
Sure is strange though how both NVidia and AMD announced their cards shortly after the adoption of TLS 1.2 by major American e-commerce websites and web browsers.
During TSMC earnings call the CEO mentioned that there are tape-outs for GPUS for 16nm Finfet, but not 20nm - hinting that Nvidia and AMD will skip that node altogether.
http://seekingalpha.com/article/1750792-taiwan-semiconductor-manufacturing-limited-management-discusses-q3-2013-results-earnings-call-transcript?page=3
"Specifically on 20-nanometers, we have received 5 product tape-outs, and scheduled more than 30 tape-outs in this year and next year from mobile computing CPU and PLD segments"
"On 16-FinFET. Technological development is progressing well, risk production is on schedule by the end of this year. More than 25 customer product tape-outs are planned in 2014, including mobile computing, CPU, GPU, PLD and networking applications. "
All of the 3D rendering APIs are capable of proper, full-featured 2D rendering. The same hardware accelerates both just as well. The problem is that most apps are just not using it and/or that they are CPU bound for other reasons. PDFs, for instance, are rather complex to decode.
Not totally true. Stroke/path/fill rasterization work is not supported by current 3D rendering APIs (and thus not accelerated by 3d hardware). Right now the stroke/path/fill rasterization is done on the CPU and merely 2D blit-ed to the frame buffer by the GPU. The CPU could of course attempt convert the stroke/path into triangles and then use the GPU to rasterize those triangles (with some level of efficiency), but that's a far cry from "proper, full-featured 2D".
Fonts are special cased in that glyphs are cached, but small font rasterization isn't generally possible to do with triangle rasterization (because of the glyph hints).
Since SW doesn't even attempt to use HW for modern 2D operations, it will likely be a long time before HW will support this kind of stuff...
True. And once graphical realism in a human-created game universe reaches its practical limit, game developers will have to once again experiment with stylized graphics. This parallels painting, which progressed to impressionism, cubism, and abstract expressionism.
...can't come soon enough. We desperately need a change in paradigm. Hopefully we might have something around 2020. *crossing fingers*
If you're out of silicon to work with, you can't just keep on going to throw transistors at a performance problem. You will have to get smarter with what you do with the transistors. If the GFX card makers add innovative features to the on-board chips, they could solve many bottlenecks we still face with utilizing the massive parallel performance we have on these cards. Both for science and for GFX I'm sure there is a list of "most wanted features" or "biggest hotspots" they could work on. For example, the speed at which you can calculate hashes with OCLhashcat differs extremely for NVidia and AMD graphics. NVidia clearly is missing something they don't need a smaller silicon process for. There must be plenty of this sort of improvements both AMD and NVidia can make.
I was promised a flying car. Where is my flying car?
The most powerful chips out there are still far below the capacity of a human brain.
I don't want just to play games, I want to retire and leave my computer to do my work for me.
At this point, we already have better software models for the brain than hardware to run it.