Intel, NVIDIA Take Shots At CPU vs. GPU Performance
MojoKid writes "In the past, NVIDIA has made many claims of how porting various types of applications to run on GPUs instead of CPUs can tremendously improve performance — by anywhere from 10x to 500x. Intel has remained relatively quiet on the issue until recently. The two companies fired shots this week in a pre-Independence Day fireworks show. The recent announcement that Intel's Larrabee core has been re-purposed as an HPC/scientific computing solution may be partially responsible for Intel ramping up an offensive against NVIDIA's claims regarding GPU computing."
Seriously.
I am now posting using my GPU. It's at least 50x faster!
Isn't it like saying "Ferrari makes the fastest tractors!" (yeah, I know!), which may be true, as long as they can actually carry out the things you want to do.
I don't know about the limits of OpenCL/GPU-code (or architecture compared to regular CPUs/AMD64 functions, registers, cache, pipelines, what not), but I'm sure there's plenty and that someone will tell us.
I don't expect slashdot "editors" to actually edit, but could you at least link to the most applicable past story on the subject? It's almost like you people don't care if slashdot appears at all competent. Snicker.
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
AMD must feel very conflicted...
At least as far as parallel computing goes. CPUs have been designed for decades to handle sequential problems, where each new computation is likely to have dependencies on the results of recent computations. GPUs, on the other hand, are designed for situations where most of the operations happen on huge vectors of data; the reason they work well isn't really that they have many cores, but that the operations for splitting up the data and distributing it to the cores are (supposedly) done in hardware. In a CPU, the programmer has to deal with splitting up the data, and allowing the programmer to control that process makes many hardware optimizations impossible.
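To make the distinction concrete, here is a minimal sketch in plain Python (not GPU code). The function names are made up for illustration. A running total has a dependency between iterations, while an elementwise scale can be split into chunks and processed in any order, which is the kind of work a GPU distributes across its cores.

```python
def running_total(values):
    """Sequential: each output needs the previous output."""
    total = 0.0
    out = []
    for v in values:
        total += v  # depends on the result of the previous iteration
        out.append(total)
    return out


def scale_and_offset(values, a, b):
    """Data-parallel: every element is computed independently."""
    return [a * v + b for v in values]


def scale_in_chunks(values, a, b, lanes):
    """Split the input into up to `lanes` chunks and process each one
    independently. On parallel hardware the chunks could run at once."""
    size = max(1, (len(values) + lanes - 1) // lanes)
    chunks = [values[i:i + size] for i in range(0, len(values), size)]
    out = []
    for chunk in chunks:  # order does not matter for correctness
        out.extend(scale_and_offset(chunk, a, b))
    return out
```

The chunked version gives the same answer as the unchunked one no matter how the data is split. The running total cannot be split this naively, because each chunk would need the final total of the chunk before it.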
The surprising thing in TFA is that Intel is claiming to have done almost as well on a problem that NVIDIA used to tout their GPUs. It really makes me wonder what problem it was. The claim that "performance on both CPUs and GPUs is limited by memory bandwidth" seems particularly suspect, since on a good GPU the memory access should be parallelized.
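One way to read the "limited by memory bandwidth" claim is as a roofline-style bound: a kernel cannot run faster than the device's peak arithmetic rate, and it also cannot run faster than memory bandwidth times the number of flops it performs per byte moved. The sketch below is only an illustration. The peak and bandwidth figures are hypothetical round numbers, not measurements from TFA or from any real part.

```python
def attainable_gflops(peak_gflops, bandwidth_gb_s, flops_per_byte):
    """Roofline-style upper bound on sustained throughput."""
    return min(peak_gflops, bandwidth_gb_s * flops_per_byte)


# SAXPY (y = a*x + y) in float32: 2 flops per element and 12 bytes moved
# (read x, read y, write y).
SAXPY_FLOPS_PER_BYTE = 2 / 12

# Hypothetical devices, chosen only to show the shape of the argument.
cpu_bound = attainable_gflops(peak_gflops=100.0, bandwidth_gb_s=30.0,
                              flops_per_byte=SAXPY_FLOPS_PER_BYTE)
gpu_bound = attainable_gflops(peak_gflops=1000.0, bandwidth_gb_s=150.0,
                              flops_per_byte=SAXPY_FLOPS_PER_BYTE)
```

With these made-up numbers the peak rates differ by 10x, but for a low-intensity kernel both bounds come from the bandwidth term, so the gap shrinks to the ratio of the two bandwidths. Whether the problem in TFA actually behaves this way is a separate question.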
It's clear that Intel wants a piece of the growing CUDA userbase, but I think it will be a while before any x86 processor can compete with a GPU on the problems that a GPU's architecture was specifically designed to address.
The author doesn't understand what the straw man argument is. He thinks it is bringing up anything that isn't specifically mentioned in the original argument. Nvidia stating that optimizing multi-core CPUs is difficult and that the Nvidia architecture has hundreds of applications seeing a huge gain in performance now is a valid point even if the Intel side never mentioned the difficulty of implementation.
This sentence no verb.
What the hell kind of sales pitch is "We're only a little more than twice as slow!"
It's gonna work, too.
Humanity sucks at math.
From the article, you can narrow the gap:
"with careful multithreading, reorganization of memory access patterns, and SIMD optimizations"
Sometimes, though, I don't want to spend all week making optimizations. I just want my code to run and run fast. Sure, if you optimize the heck out of a section of code, you can always eke out a bit more performance, but if the unoptimized code can run just as fast (on a GPU), why would I bother?
On top of being highly capable at massively parallel floating point math (the bread and butter of the Top500 and almost all real-world HPC applications), GPU chips benefit from economies of scale by having a much larger market to sell chips to. If Intel has an HPC-only processor, I don't see it really surviving. There have been numerous HPC-only accelerators that provided huge boosts over CPUs and still flopped. GPUs growing into that capability is the first large-scale phenomenon in HPC with legs.
XML is like violence. If it doesn't solve the problem, use more.
Does anyone under the age of 25 really care anymore about processor speed and video card "features"?
I only ask because 15 years ago I cared greatly about this stuff. However, I'm not sure if that is a product of my immaturity at that time, or the blossoming industry in general.
Nowadays it's all pretty much the same to me. Convenience (as in, there it is sitting on the shelf for a decent price) is more important these days.
Evergreen had a *huge* lead over pre-Fermi nVidia chips, and still leads in 32-bit precision (and by extension most of what the mass market cares about), but its 64-bit precision lags Fermi. Of course, Evergreen beat Fermi to market by a large margin.
Magny-Cours is currently showing significant performance advantage over Intel's offerings while at the same time AMD's Evergreen *mostly* shows performance advantages over nVidia's Fermi despite making it to market ahead of Fermi.
AMD is currently providing the best tech on the market. This will likely change, but at the moment, things look good for them.
The day I build a computer with an Nvidia graphics processor as a CPU is when it's time to call 911, cause I will have completely lost my mind.
Just kiss and make up already. Intel and nVidia have but one choice: to join forces and try collectively to compete against AMD/ATI. Anything less, and they're cutting their nose off to spite their respective faces.
Yeah, speciality silicon for a small subset of problems will stomp all over a general purpose CPU. No big news there.
Why is Intel even bothering to whine about this stuff? They sound like a bunch of babies trying to argue that the sky isn't blue.
This makes Intel look truly sad. It's completely unnecessary.
A Pirate and a Puritan look the same on a balance sheet.
I wonder if matrix inversion could be done with an ASIC with a massive performance improvement over typical CPUs? I'm thinking of hardware that is designed to natively describe very large (sparse?) matrices efficiently, and perform elementary matrix ops on these matrices.
Is this possible? Can you think of a way of implementing this, in terms of actual transistor logic?
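This doesn't answer the transistor-level question, but as a reference point, here is a minimal software sketch of one common way to describe a sparse matrix compactly: compressed sparse row (CSR) storage, with a matrix-vector product on top of it. The function names are made up for illustration, and CSR is an assumption about what "natively describe" might look like, not something proposed in the thread.

```python
def to_csr(dense):
    """Convert a dense row-major matrix (list of lists) to CSR arrays."""
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for j, x in enumerate(row):
            if x != 0:
                values.append(x)   # nonzero value
                col_idx.append(j)  # its column
        row_ptr.append(len(values))  # where the next row starts
    return values, col_idx, row_ptr


def csr_matvec(values, col_idx, row_ptr, vec):
    """Multiply a CSR matrix by a dense vector, touching only nonzeros."""
    out = []
    for r in range(len(row_ptr) - 1):
        acc = 0
        for k in range(row_ptr[r], row_ptr[r + 1]):
            acc += values[k] * vec[col_idx[k]]
        out.append(acc)
    return out
```

Only the nonzero entries are stored and visited. The indirect lookup `vec[col_idx[k]]` is the irregular memory access that any dedicated hardware for this would have to handle well.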
The reason that Intel is whining is in the context of large number-crunching systems or high-end workstations. Rather than Intel selling thousands of chips for the former, Nvidia (and to a lesser extent AMD) gets to sell hundreds of GPU chips. And for the workstations, Intel sells only one chip instead of 2 to 4.
No, I don't trust in god. He'll have to pay up front, like everybody else.
I remember reading here on /. that it got abandoned by Intel.
So what you're saying is nVidia will become a patent holding company and probably make just as much money as they're making now.
Intel decided to bail on marketing an in-house high performance GPU. But, they'd still like a return on their Larrabee investment. I don't doubt they would have been pushing the HPC mode anyway, but now, that's all they've got. Unfortunately for Intel, they've got to sell Larrabee performance based on in-house testing, while there are now a number of CUDA-based applications, and HPC-related conferences and papers are now replete with performance data.
To Intel's and AMD/ATI's advantage, NVIDIA has signed on with the OpenCL effort, so as the first two start getting drivers out, they can give the latter a run for their HPC-GPU money. At the moment, though, it's all talk.
Luke, help me take this mask off
Don't get me wrong, I like what Intel is doing, but c'mon, you are understating this:
and the SIMD instructions that have been added to Intel/AMD CPUs in recent years really are the same thing you get with GPU programming, just on a bit smaller scale.
It's an order of magnitude different (and I know from experience coding CPU and GPU)
i7 960 - 4 cores, 4-way SIMD
GTX 285 (not 280) - 30 cores, 32-way SIMD
SP GFLOPS:
i7 960 - 102
GTX 285 - 1080
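For anyone wondering where figures like these come from, peak single-precision throughput is usually estimated as cores x SIMD lanes x clock x flops per lane per cycle. The sketch below reproduces the two numbers above under assumptions that are not in the post: a 3.2 GHz clock and 2 flops per lane per cycle for the CPU, and 8 physical lanes per core, a 1.5 GHz shader clock, and 3 flops per lane per cycle for the GPU. The 32-way figure quoted above is taken here to be the logical SIMD width, with 8 as the assumed number of physical lanes.

```python
def peak_sp_gflops(cores, lanes, clock_ghz, flops_per_lane_per_cycle):
    """Theoretical peak single-precision GFLOPS (an upper bound only)."""
    return cores * lanes * clock_ghz * flops_per_lane_per_cycle


# Assumed parameters, chosen to match the figures quoted in the post.
cpu_peak = peak_sp_gflops(cores=4, lanes=4, clock_ghz=3.2,
                          flops_per_lane_per_cycle=2)
gpu_peak = peak_sp_gflops(cores=30, lanes=8, clock_ghz=1.5,
                          flops_per_lane_per_cycle=3)

ratio = gpu_peak / cpu_peak  # a bit over 10x under these assumptions
```

These are paper peaks. Sustained performance on real code is lower on both sides and depends heavily on the workload.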
No matter what, AMD really wins in this one.
AMD has the potential to win, but currently are in last place. Intel is aggressively solving all of the problems that previously gave AMD an advantage, and NVIDIA has aggressively put in place the things HPC wants (e.g. easy to code in C for the platform - I've done it and it is easy, also adding ECC and caching, etc.)
Using Badaboom, a CUDA app, you can rip DVD copies down to your iPod in minutes, not hours.
Unfortunately, the Badaboom people are idiots and are taking their sweet time porting to the 465/470/480 cards.
I'd love to see a processor fast enough to beat a GPU at tasks such as these. The same goes for CD-to-MP3 conversion on CUDA: it's like moving from a hard drive to a fast SSD.
I mean, when you get down to it, they seem really overpriced. No video output, their processor isn't anything faster, so what's the big deal? The big deal is that 4x the RAM can really speed shit up.
Unfortunately there are very hard limits to how much RAM they can put on a card. This is both because of the memory controllers, and because of electrical considerations. So you aren't going to see a 128GB GPU or the like any time soon.
Most of our researchers that do that kind of thing use only Teslas because of the need for more RAM. As you said, the transfer is the limiting factor. More RAM means you have to shuffle data back and forth less often.
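A rough back-of-the-envelope model of why card RAM matters for transfer cost: if the working set fits on the card, it is uploaded once and reused across passes, and if it doesn't, it has to be streamed over the bus on every pass. This is a deliberately crude sketch. The function name and every number in it are hypothetical, and real code can overlap transfers with compute.

```python
def transfer_seconds(dataset_gb, card_ram_gb, passes, link_gb_s):
    """Time spent moving data to the card, ignoring compute and overlap."""
    if dataset_gb <= card_ram_gb:
        moved_gb = dataset_gb           # upload once, reuse every pass
    else:
        moved_gb = dataset_gb * passes  # re-stream the data on each pass
    return moved_gb / link_gb_s


# Hypothetical workload: 3 GB of data, 10 passes, 5 GB/s host-to-card link.
small_card = transfer_seconds(dataset_gb=3.0, card_ram_gb=1.0,
                              passes=10, link_gb_s=5.0)
large_card = transfer_seconds(dataset_gb=3.0, card_ram_gb=4.0,
                              passes=10, link_gb_s=5.0)
```

In this toy case the card with 4x the RAM spends a tenth of the time on transfers, simply because the data stays resident between passes.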
But I think the timescale will be a very long one.
I mean ideally, we want only the CPU in a computer. The whole idea of a computer is that it does everything, rather than having dedicated devices. Ideally that means that it does everything purely in software, that the CPU is all it needs. For everything else, we seem to have reached that point but graphics are still too intense. Have to have a dedicated DSP for them.
However, we'll keep wanting that until the CPU can do photorealistic graphics in realtime. That is a long way off yet. Even GPUs can't do that. Once GPUs can, the trick is then being able to scale that down to become a realistic subset of the CPU, rather than a dedicated unit. You can't very well scale CPUs up to massive sizes and power consumptions.
So I've no doubt it'll happen, but I think not for 20+ years.
I have an Intel T8100 and a 8600M GT in my laptop. I tried badaboom (GPU) and Any Video Converter (CPU, multithreaded). The CPU based software encodes x264 faster, and has more features than badaboom. So what you say is not true for mid level GPUs. Also, I am more comfortable running my CPUs at 95C than my GPU, due to better reliability.
From Wikipedia, "OpenCL (Open Computing Language) is a framework for writing programs that execute across heterogeneous platforms consisting of CPUs, GPUs, and other processors." In other words, write your massively parallel programs using OpenCL and then run them on the device (or combination of devices) that executes your program the fastest.
Hopefully, OpenCL will have the same catalyzing effect on HPC that OpenGL had on computer graphics, but time will tell.
Word of warning to Intel: Almost nobody wants to hand-code assembly to run your SIMD instructions. People doing HPC (at least the ones using CUDA) are scientists and engineers who typically have better things to worry about than reading through detailed tomes on the i7 architecture. Make it more convenient (i.e. via OpenCL) or continue to lose market share in this area.