Nvidia Calls Out Intel For Cheating In Xeon Phi vs GPU Benchmarks (arstechnica.com)

← Back to Stories (view on slashdot.org)

Nvidia Calls Out Intel For Cheating In Xeon Phi vs GPU Benchmarks (arstechnica.com)

Posted by ryuzaki0 on Wednesday August 17, 2016 @03:20AM from the calling-out dept.

An anonymous reader writes: Nvidia has called out Intel for juicing its chip performance in specific benchmarks -- accusing Intel of publishing some incorrect "facts" about the performance of its long-overdue Knights Landing Xeon Phi cards. Nvidia's primary beef is with the following Intel slide, which was presented at a high performance computing conference (ISC 2016). Nvidia disputes Intel's claims that Xeon Phi provides "2.3x faster training" for neural networks and that it has "38 percent better scaling" across nodes. It looks like Intel opted for the classic using-an-old-version-of-some-benchmarking-software manoeuvre. Intel claimed that a Xeon Phi system is 2.3 times faster at training a neural network than a comparable Maxwell GPU system; Nvidia says that if Intel used an up-to-date version of the benchmark (Caffe AlexNet), the Maxwell system is actually 30 percent faster. And of course, Maxwell is Nvidia's last-gen part; the company says a comparable Pascal-based system would be 90 percent faster. On the 38-percent-better-scaling point, Nvidia says that Intel compared 32 of its new Xeon Phi servers against four-year-old Nvidia Kepler K20 servers being used in ORNL's Titan supercomputer. Nvidia states that modern GPUs, paired with a newer interconnect, scale "almost linearly up to 128 GPUs."

58 comments

Min score:

Reason:

Sort:

Hmmm. by Anonymous Coward · 2016-08-17 03:22 · Score: 4, Funny

So Intel is the Volkswagon of CPUs?
1. Re:Hmmm. by K.+S.+Kyosuke · 2016-08-17 03:26 · Score: 1
  
  It kind of is, in the sense that most people have it.
  
  --
  Ezekiel 23:20
2. Re:Hmmm. by oic0 · 2016-08-17 03:29 · Score: 2
  
  Not quite. This is deceptive advertising. You can deceive customers all you want so long as you have enough fine print don't outright lie.
3. Re:Hmmm. by Mashiki · 2016-08-17 03:33 · Score: 4, Informative
  
  Only if you think this is new. Intel has been doing shit like this for year and keeps getting caught, there were two lawsuits against them from AMD a few years ago where they ended up paying AMD around $7B USD for doing things like this and other forms of anti-competitive behavior which resulted in multi-billion dollar fines. Then again, nvidia has been caught doing the same. Probably the best example most recently is with their "Hairworks" API, which is likely going to land them in hot water again. Nvidia got nailed a few years ago for anti-competitive behavior over shaders.
  
  --
  Om, nomnomnom...
4. Re:Hmmm. by Luthair · 2016-08-17 03:38 · Score: 1
  
  Didn't Apple get sued for doing this to some PowerPC benchmark to claim it was the fastest PC?
5. Re:Hmmm. by CajunArson · 2016-08-17 03:38 · Score: 1
  
  Poor poor AMD. At least that's what their marketing and PR departments like to say.
  A real AMD employee who -- like most of the actual engineers -- no longer works there has a different story though:
  http://vrworld.com/2011/06/24/...
  
  --
  AntiFA: An abbreviation for Anti First Amendment.
6. Re:Hmmm. by Luthair · 2016-08-17 03:40 · Score: 2, Informative
  
  It wasn't benchmarks, it was changing their commercial (as in users pay for it) compiler to ignore CPU flags for non Intel parts and not documenting it, then various illegal behaviour with effectively paying OEMs to not use AMD parts.
7. Re:Hmmm. by Adriax · 2016-08-17 03:53 · Score: 0
  
  So an engineer who doesn't understand marketing spoke out about a company's change in marketing direction?
  Smoking gun right there. So much worse than intel sabotaging compilers and bribing manufactures to force customer selection.
  
  --
  I don't suffer from insanity, I enjoy every minute of it!
8. Re:Hmmm. by bluefoxlucid · 2016-08-17 03:54 · Score: 3, Informative
  
  It's not so much "older" as "different" in the artificial benchmarking world. Real-world loads don't tend to follow benchmarks religiously, and the newer benchmark might favor a configuration that's not as good in real-world loads.
  The classical marketing maneuver is to select from multiple sets of up-to-date benchmarks and pick the ones that favor your particular product. CPUBoss usually shows that one CPU outperforms another consistently (except for single-core vs threaded with dissimilar cores or SMT--fast clock wins single-core, many-cores wins threaded); and frequently shows the same benchmark tool using different strategies and rating each CPU faster than the other based on how it was configured, or shows that one benchmark favors one CPU and another favors the other.
  This goes all the way up to real-world functional tests, where you select games which perform better because of some feature or strategy of your GPU and CPU. You have better shaders? Pick a shader-heavy game. Heavy parallelism? Pick a game that meshes with that. You've got fewer parallel operations, but a higher clock? Avoid games that work best with 387-core GPUs and pick ones that like that 1185MHz clock. Show off 6 or 7 games running at freakishly-high 292fps.
  
  --
  Support my political activism on Patreon.
9. Re:Hmmm. by CajunArson · 2016-08-17 03:57 · Score: 0, Flamebait
  
  How exactly did Intel sabotage GCC or sneak in to AMD's offices and break AMD's own compiler?
  Oh you mean Intel didn't go out of its way to optimize its compiler for AMD chips? Tough cookies. If AMD is so amazingly great and wants to have "fair competition" here's an idea: Tell those geniuses at AMD to go out and write their own damn compiler for their supposedly "perfect" chips.
  Bitching and moaning that Intel's compiler produces perfectly functional code for AMD hardware (it did, BTW, never broke a single program running on AMD hardware) but doesn't optimize for AMD's hardware is sour grapes from a second rate company.
  If AMD is so magically "innovative" like all the delusional fanboys still believe, then they can optimize their own hardware and there's no responsibility for Intel to go out and do the work for them.
  
  --
  AntiFA: An abbreviation for Anti First Amendment.
10. Re:Hmmm. by sexconker · 2016-08-17 04:04 · Score: 0, Troll
  
  You're a retard.
  Intel went out of its way to cripple software compiled using the ICC if it detected a non-Intel CPU, ignoring the standard flags the CPU exposed for extension and feature support.
  The ICC is probably the most-used compiler for major software products. This is textbook anti-competitive behavior. There's a reason they got sued and lost.
11. Re:Hmmm. by Penguinisto · 2016-08-17 04:29 · Score: 1
  
  For awhile, it got even worse than that...
  
  --
  Quo usque tandem abutere, Nimbus, patientia nostra?
12. Re:Hmmm. by ausekilis · 2016-08-17 04:56 · Score: 1
  
  * VW emissions and mileage tests as reported during in-place testing and may not accurately represent real-world driving results.
  
  Thank you, VW. I'll expect my $10 million check in the mail.
13. Re:Hmmm. by PhrostyMcByte · 2016-08-17 04:56 · Score: 2
  
  Intel optimized per-architecture, not per feature. This had the end result of AMD chips taking the generic path and being slower, but I wouldn't call this tactic dirty. Why would Intel go out of their way to optimize for a competitor?
  CPUs have a wide variety of timing and pipeline limitations, and optimizing purely for feature set will never get you peak performance -- this is why GCC has the exact same per-architecture optimization support.
14. Re:Hmmm. by Anonymous Coward · 2016-08-17 05:14 · Score: 0
  
  Not quite. This is advertising.
  
  FTFY
15. Re:Hmmm. by hairyfeet · 2016-08-17 05:34 · Score: 2, Insightful
  
  I guess "Windows isn't done unless Lotus won't run" by your logic is completely reasonable behavior, it WAS their OS...right?
  Or maybe if you'd stop waving your little Intel flag as hard as your squeeing fangirl ass can you'd know they didn't "not optimize" for other chips, they purposely designed their compiler to put out broken code on other chips so badly in fact that you could take a Via CPU (the only CPU that allows you to change the CPUID in software) and by simply changing the CPUID from "Centaur Hauls" to "Genuine Intel" you magically got a 30% performance boost...wow, the power of of CPUID huh?
  Of course what it really was was a classic case of "Windows isn't done unless Lotus won't run" and this kind of behavior is typical of Intel, hence why they had to shell out 1.4 billion for market rigging and anti competitive behavior in the EU just 2 years ago. Would you like a quote from the judgement?
  "The Commission demonstrated to the requisite legal standard that Intel attempted to conceal the anti-competitive nature of its practices and implemented a long term comprehensive strategy to foreclose AMD from the strategically most important sales channels. ... The General Court considers that none of the arguments raised by Intel supports the conclusion that the fine imposed is disproportionate. On the contrary, it must be considered that that fine is appropriate in the light of the facts of the case.
  In other words the exact same shit MSFT got busted for and frankly they should get no less than what MSFT did, 10 years of being monitored by the courts to keep them from pulling shit like this again.
  
  --
  ACs don't waste your time replying, your posts are never seen by me.
16. Re:Hmmm. by K.+S.+Kyosuke · 2016-08-17 05:35 · Score: 1
  
  So they deliberately chose to use an inferior code path for AMD because they were convinced it would be better for the customer?
  
  --
  Ezekiel 23:20
17. Re:Hmmm. by K.+S.+Kyosuke · 2016-08-17 05:39 · Score: 1
  
  That is kind of ironic. Do think that nVidia and AMD are doing more or fewer application-specific tweaks in their drivers today? Because I don't believe that the latter is the case. (After all, Microsoft does it even with regular Windows software.)
  
  --
  Ezekiel 23:20
18. Re:Hmmm. by rahvin112 · 2016-08-17 06:08 · Score: 1
  
  They all do it. Nvidia is notorious for it, probably more so than Intel, going so far as to bin chips and creating special review boards and firmwares that make the review cards 30% faster than the retail versions.
  This is nothing more than a Pot meet kettle moment. Intel must be making waves in HPC with Phi to draw this strong of an Nvidia comment.
19. Re: Hmmm. by Anonymous Coward · 2016-08-17 07:01 · Score: 0
  
  Whoa whoa. Who said anything about "customers"? We are talking profits here.
20. Re:Hmmm. by PhrostyMcByte · 2016-08-17 07:12 · Score: 1
  
  There's no check for AMD CPUs. It runs cpuid and finds a code path in a lookup table for that specific CPU -- if it's not there, it gets the generic path.
21. Re:Hmmm. by sjames · 2016-08-17 08:07 · Score: 1
  
  No. ICC certainly did look at feature flags and use them to the utmost UNLESS the cpu was AMD, then it used the worst performing code paths available. The telling part is that you could preload a library that replaced the IsThisIntel function (not the actual symbol) with a function that always returns true and greatly improve performance on an AMD processor (sometimes beating the performance on an Intel processor). The existence of that function is very much Intel going out of it's way to de-optimize AMD performance. By any reasonable measure, ICC would have been a superior product had they not expended effort to snatch defeat from the jaws of victory in support of another department.
22. Re:Hmmm. by sjames · 2016-08-17 08:23 · Score: 0, Troll
  
  There is a check for Intel CPU. Since AMD is !Intel, it gets the crappiest possible code path. Override the detection with a preload and suddenly performance improves many-fold.
  The Intel code path then follows the relevant standard of choosing code path based on feature flags.
23. Re:Hmmm. by cheesybagel · 2016-08-17 09:09 · Score: 1
  
  The 'per-architecture' check included a strcmp with "GenuineIntel". There are processor flags to check if a processor supports an API already. I mean Intel designed the X86 ASM spec the least they could do is follow it in their own software.
24. Re:Hmmm. by Carewolf · 2016-08-17 10:32 · Score: 1
  
  That is kind of ironic. Do think that nVidia and AMD are doing more or fewer application-specific tweaks in their drivers today? Because I don't believe that the latter is the case. (After all, Microsoft does it even with regular Windows software.)
  Considering NVidia has special driver releases right after new games that are "optimized" for the new game. Plus a special bundled tool to "optimize" games to your graphics card... Yeah, it is not even a secret any more.
25. Re:Hmmm. by Mashiki · 2016-08-17 21:37 · Score: 0
  
  Funny you're marked troll. Guess the intel fanboys don't like the truth. Here's the reality, if you have a AMD chip there's a program called IPC that removes that flagging in executables built with that. It's fairly well known in gaming circles, and people usually see a 10-40% increase in performance from their games. Sadly it only works with non-encrypted exe's and so on. So some steam games you're pretty out of luck with.
  
  --
  Om, nomnomnom...
26. Re:Hmmm. by sjames · 2016-08-18 09:25 · Score: 1
  
  It is interesting that I got marked troll over an easily verified statement of fact. I sometimes wonder is it's just extreme fanbois or paid shills.
Here's the real reason for Nvidia's complaints by CajunArson · 2016-08-17 03:30 · Score: 4, Interesting

The real reason that Nvidia is bitching up a storm is that KNL has received a very positive reception in the HPC world.
Oh, and KNL is actually an absolute bargain in comparison to the requirements to get a high-end Pascal system setup, not only because you can buy an entire KNL system (not just a GPU card) starting at only $5000, but because it's self-hosting and doesn't need a high-end Xeon CPU just to feed the GPU. To put it in perspective, you could build a cluster of 26 KNLs for the price of one of those 8-way systems Nvidia is selling.
http://www.colfax-intl.com/nd/...

--
AntiFA: An abbreviation for Anti First Amendment.
1. Re:Here's the real reason for Nvidia's complaints by Creepy · 2016-08-17 03:57 · Score: 4, Insightful
  
  Yeah, but their beef isn't about the cost, it is about the speed comparisons. Intel never has tried to compete in the GPU performance space - they are happy with being in the low cost space. If you just compare what you get for a certain cost I have no idea, but I'm guessing having so many more Intel chips in your cluster will add significant power and space requirements at the very least. You may actually be better off with the nVidia solution in the long run.
2. Re:Here's the real reason for Nvidia's complaints by Anonymous Coward · 2016-08-17 04:20 · Score: 0
  
  wrong.
3. Re:Here's the real reason for Nvidia's complaints by Anonymous Coward · 2016-08-17 04:40 · Score: 0
  
  Irrelevant. They're still shitting on it for marketing purposes. Seriously, nVidia is hardly in a position to bitch about someone else gaming the benchmarks.
4. Re:Here's the real reason for Nvidia's complaints by Guybrush_T · 2016-08-17 04:49 · Score: 2
  
  Yes, the HPC world is waiting for KNL because they don't want to port their old codes to CUDA. But that's just the expectation : people are starting to realize that running a Xeon code on KNL is by no mean immediate and you won't get much performance boost without a serious application rewrite ... just like porting to GPUs, maybe slightly easier though.
  But on the performance side, it is very clear that KNL performance is terrible. The fact that Intel only shows scaling figures is quite funny : it is very easy to make a slow code scale, because computation times are high compared to communication times. To have good scaling, you can either have a faster interconnect or a slower CPU. Since they're never showing performance comparison but only "scaling", I'd bet it is the latter.
  To illustrate, say the speed of your code is 1 on 1 CPU, and 32 on 32 CPUs, scaling is perfect. If the speed is 100 on one GPU, and 2400 on 32 GPUs, the scaling is not perfect and you can show the scaling curve from Intel saying "hey, we scale better !". That's ridiculous.
5. Re:Here's the real reason for Nvidia's complaints by rahvin112 · 2016-08-17 06:20 · Score: 0
  
  HPC is not GPU. It's a whole other area of computing that has little if anything to do with graphics. The Knights landing chips are kinda like a GPU in the sense that there are lots of tiny cores good at one or two operations but it differs significantly from a standard GPU type chip that Nvidia produces in that its x86, and it's cores are a little more general purpose than a standard nvidia CUDA core.
  Think of it this way, Knights Landing is marketed as 70 Atom-like cores, rather than 1200 CUDA cores. Because it's x86 you don't need a Xeon in there running command and control for the CUDA cores. The Knights Landing chip can run all on it's own and it has the interconnects and shared memory to run all those atom cores at full pace such that it can do the matrix math just like CUDA but without any of the overhead of a Xeon cpu and PCIE Cuda cores coordinating across the PCIe bus. The advantage is supposed to be that because KL is x86 you don't need to recompile the code to CUDA to get it to run while at the same time getting the same power as an Nvidia build at half the cost.
  Clearly KL is making waves or nvidia wouldn't be complaining. The cost of recompiling is massive, being able to run older software on these is a huge advantage, even if you can't get the same speed, especially because it's cheaper than the equivalent nvidia build.
6. Re:Here's the real reason for Nvidia's complaints by Anonymous Coward · 2016-08-17 06:32 · Score: 0
  
  What? the highest-end 72-core Phi is over $6000. The Tesla cards are $5000.
7. Re:Here's the real reason for Nvidia's complaints by PhrostyMcByte · 2016-08-17 08:06 · Score: 1
  
  Yes, the HPC world is waiting for KNL because they don't want to port their old codes to CUDA. But that's just the expectation : people are starting to realize that running a Xeon code on KNL is by no mean immediate and you won't get much performance boost without a serious application rewrite ... just like porting to GPUs, maybe slightly easier though.
  
  Exactly this. AVX-512 is now much more GPGPU-like than traditional SIMD, so even transitioning AVX-256 code to it isn't going to be trivial. I would not expect random code to perform better on it without serious work.
8. Re:Here's the real reason for Nvidia's complaints by sjames · 2016-08-17 08:27 · Score: 1
  
  If you have some actual reason to believe that, please share so others can make a good decision. If not, why chaff the discussion?
9. Re:Here's the real reason for Nvidia's complaints by rahvin112 · 2016-08-17 09:08 · Score: 1
  
  According to this Phi can be had for $200 at the low end. You can't buy a Nvidia Tesla product for that.
  https://www.phoronix.com/scan....
10. Re:Here's the real reason for Nvidia's complaints by Anonymous Coward · 2016-08-17 22:01 · Score: 0
  
  Wasn't the AVX-512 about be integrated into the next generation of Xeons? Intel probably would like to get the ecosystem humming sooner rather than later.
11. Re:Here's the real reason for Nvidia's complaints by Anonymous Coward · 2016-08-17 22:35 · Score: 0
  
  That Xeon Phi 31S1P card is a Knight's Corner part, it's probably being flogged off cheap as it's obsolete.
  KNC is a very different animal to KNL, you no longer need a host processor and to attach the accelerator via PCIe with KNL.
12. Re:Here's the real reason for Nvidia's complaints by Anonymous Coward · 2016-08-18 08:34 · Score: 0
  
  Because it's x86 you don't need a Xeon in there running command and control for the CUDA cores. The Knights Landing chip can run all on it's own and it has the interconnects and shared memory to run all those atom cores at full pace such that it can do the matrix math just like CUDA but without any of the overhead of a Xeon cpu and PCIE Cuda cores coordinating across the PCIe bus.
  Xeon's IPC gives an advantage in terms of the Amdahl's law over the slower cores. I'd like to see Intel integrating the high IPC cores with reasonable amounts of Phi throughput cores to give a more balanced system. Nvidia, on the other hand has the benefit of using the available memory bandwidth much more effectively and the chip locality optimized algorithms, nVlink and RDMA over fabrics remove those some of those bottlenecks.
  
  The advantage is supposed to be that because KL is x86 you don't need to recompile the code to CUDA to get it to run while at the same time getting the same power as an Nvidia build at half the cost.
  The fact is that the old codes not yet optimized for modern architectures need a rewrite anyway. OpenACC and OpenMP can help with the job of enabling performance in the new, glorious world of heterogeneous systems for the old software as it is being rewritten and optimized.
Pot meet kettle by HalAtWork · 2016-08-17 03:37 · Score: 1

Both parties are quite guilty here

--
Twinstiq, game news
Layoffs by 110010001000 · 2016-08-17 03:58 · Score: 1

Intel is the next tech giant to have mass layoffs. It obviously is hitting a dead end if it is arguing over such small increments of performance. Moores Law was fun while it lasted, but depending on transistor count for performance gains isn't going to work.
1. Re:Layoffs by sexconker · 2016-08-17 04:08 · Score: 1
  
  They've had multiple rounds of layoffs recently (or one round with the reported number increasing frequently).
  Intel will be a husk in less than 10 years if they keep this shit up.
What Else is New by Anonymous Coward · 2016-08-17 04:11 · Score: 0

Intel - Mellanox, Intel - NVidia, Intel - IBM tugs of war: both sides present the opposite results, so much opposite that the suspicion rises naturally. Intel's performance and marketing departments, what are you guys and gals mucking about?
Worlds smallest violin for Nvidia by Moheeheeko · 2016-08-17 04:13 · Score: 4, Insightful

After they spent the last decade going out of their way to force "improvements" into new games (that either never work or cause severe issues for pc games) just to make their overpriced cards look better than AMD, they can go fuck themselves.
1. Re:Worlds smallest violin for Nvidia by thegarbz · 2016-08-17 06:19 · Score: 2
  
  If by improvements you mean optimsations, and by overpriced cards you mean get what you pay for, and by better than AMD you mean better than AMD then yeah you're 100% right.
  Now you can repeat the same statement for AMD.
  And for Intel
  And for ARM
  And for every other chip manufacturer who targets a specific market with specific products.
Manufacturer Exaggerates Product's Virtues by mschwanke97402 · 2016-08-17 05:12 · Score: 1

Yawn, wake me when there's some actual news.
Also, anyone who puts much faith in Intel's claims is either naive or a company shill. This simply business as usual for Intel.
Cheater A calls out Cheater B for cheating! by Gravis+Zero · 2016-08-17 05:20 · Score: 1

People in glass houses shouldn't throw stones... and buy some damn shades because seriously, nobody wants to see that!

--
Anons need not reply. Questions end with a question mark.
1. Re:Cheater A calls out Cheater B for cheating! by Carewolf · 2016-08-17 10:37 · Score: 1
  
  People in glass houses shouldn't throw stones... and buy some damn shades because seriously, nobody wants to see that!
  Cheaters think everybody cheats. In this case they might be right.
Get used to it Nvidia by JosephDoeden · 2016-08-17 06:06 · Score: 1

Intel has your markets in their crosshairs now and they have a track record of awesome when it comes to commercial chip success.
1. Re:Get used to it Nvidia by Anonymous Coward · 2016-08-17 06:50 · Score: 0
  
  Intel has your markets in their crosshairs now and they have a track record of awesome when it comes to commercial chip success.
  And they have a track record of festering garbage that's worse than useless for GPUs.
Maroon by Anonymous Coward · 2016-08-17 06:53 · Score: 0

The PowerPC MAC was faster than an equivalent X86 system even running MacOS at the time they converted to Intel. The CPU however is more expensive to purchase and ran hotter than the X86. RISC architecture has always been faster than CISC, mainly due to the lack of a need for IRQ polling.
Apple's decision to move to X86 was strictly a numbers and profit game, not a better performance game.
SLASHDOT FBI JUST DON'T UNDERSTAND CONSUMER TECH by Anonymous Coward · 2016-08-17 06:59 · Score: 0

No concept of what people on Slashdot know already. FBI you are retarded as fuck.
Are you trying to be a mirror of arstechnica, softpedia, zdnet, intel, and cnet now?
Fucking tools. Go back to your bat caves.
Programming GPUs! Arrgh! by Anonymous Coward · 2016-08-17 07:44 · Score: 0

The biggest difference is that Knight Landing is a very high core count CPU. You can run an OS on it and program it in the same way you program any other conventional computer.
In contrast, programming a GPU is a pain in the neck unless the task at hand is very well mapped to the hardware.
Hypocrisy? by Anonymous Coward · 2016-08-17 08:21 · Score: 0

"Pot, meet Kettle". Wasn't Nvidia doing some shady things a while back to make their benchmarking look better that it was?
arguments that are 30 years out of date by Anonymous Coward · 2016-08-17 08:58 · Score: 0

IRQ polling? I'm a kernel developer, contributor to linuxppc, and system software engineer at a cpu company. Sorry but the claim is total bullshit. The terminology is barely coherent.
A better superscalar design and cache architecture tends to be a bigger deciding factor than CISC versus RISC. Certainly with CISC you're wasting some engineer's time designing translation and pipeline issue that would have otherwise been unnecessary. CPUs are frequently found with over a billion transistors these days, with a B, as in (1e9). A few extra transistors to do the CISC junk is no longer a significant portion of your wafer, unlike 30 years ago.