Slashdot Mirror


FutureMark Confirms nVidia's Benchmark Cheating

jlouderb writes "As first reported by ExtremeTech, Futuremark has confirmed that nVidia is cheating on its 3DMark2003 benchmark through eight driver optimizations. The 3D graphics performance war just keeps getting more and more interesting!" See our previous story.

38 of 404 comments (clear)

  1. This is why artificial benchmarks don't matter by Anonymous Coward · · Score: 2, Insightful

    Test with the applications/games people really use, and they can't optimize for them without, well, optimizing for them! If they want to make Quake III faster, great.

    1. Re:This is why artificial benchmarks don't matter by mskfisher · · Score: 5, Insightful

      Wrong - as they point out in the article, these "optimizations" are usually reductions in quality. They don't just improve performance.

      --
      0x0D 0x0A
    2. Re:This is why artificial benchmarks don't matter by pbranes · · Score: 2, Insightful
      Who modded this up? They say specifically in the article that this is still *not* optimization, it is cheating!

      In fact, they say in the article that with "applications/games people really use", it is even harder to detect driver cheats.

    3. Re:This is why artificial benchmarks don't matter by paranode · · Score: 4, Insightful

      This is a problem with Nvidia. The only reason they are competing well with ATI is because they cut so many corners to get their benchmark scores up. It certainly would be nice if Nvidia concentrated on real-world apps and games but it seems like they do not. If you look at the benchmarks historically between ATI and Nvidia's closely competing cards, you'll find that they are closely matched in default runs. However, try turning on 4x anti-aliasing or anisotropic filtering and watch the older, slower, ATI cards beat out the shiniest new Nvidia cards. ATI's image quality has always been superior to Nvidia's. They are all about quantity and need to be focusing more on quality.

    4. Re:This is why artificial benchmarks don't matter by mskfisher · · Score: 4, Insightful

      In terms of the clipping planes, you're right.

      But the nVidia driver also substituted a shader for one of the water effects, which degraded/modified the image quality.

      And past history has shown that companies are willing to sacrifice quality for performance (see ATI's Radeon 8500 drivers and Quake 3 for an example)...
      It's almost like this is a cold war, of sorts, between the testers/benchmarkers and the card manufacturers.

      --
      0x0D 0x0A
  2. This is why.. by craigtay · · Score: 5, Insightful

    You don't base your findings on one benchmark. Whenever I go to a site like tomshardware.com they have several different ways to benchmark. Each card has its own strengths, and if a card has cheated it will show up like that.

  3. Isn't this standard practice? by Nathan+Ramella · · Score: 4, Insightful
    There's no law about fudging benchmarks on a third party application.

    While this isn't a huge suprise, I am happy that there are smart folks out there who spend time to uncover this kind of information. Kudos to you for your efforts!

    Videocard Benchmarks are about as believable as the the 'World's Best Grampa' award.

    -n

    --
    http://www.remix.net/
    1. Re:Isn't this standard practice? by James+Lewis · · Score: 4, Insightful

      It doesn't need to be against the law. Their motive for doing this in the first place was the expectation that their card would gain a better reputation by doing well in that benchmark by cheating. Instead, it has backfired and seriously hurt their reputation. Having a community that can uncover these unsavory practices is deterrent enough.

    2. Re:Isn't this standard practice? by GlassHeart · · Score: 2, Insightful
      Nvidia (and ATI before) are guilty of using deceit to attempt to sell more video cards. Thus, they are guilty of fraud.

      No, they are not guilty of fraud. They did not misrepresent their benchmark score; merely to optimize for the benchmark score. Whether or not benchmark scores are representative of general real world performance is not their responsibility.

      This is similar to Intel realizing that MHz meant everything to silly consumers, and optimizing their CPUs to achieve the highest MHz rating possible. As Apple has proven, it's possible to match Intel's performance in niche applications with alternative CPU architectures running at much lower CPU clock speeds.

      These are shady business practices, and is good reason to avoid a vendor for, but it's probably not illegal. You just had the wrong assumption that benchmark numbers meant real performance. That's not NVidia or Intel's fault.

    3. Re:Isn't this standard practice? by dead+sun · · Score: 2, Insightful
      Let me play devil's advocate. IANAL and all, but here's my devil's advocate view.

      Has nVidia (or ATI for that matter) ever claimed that any benchmark was indicitive of real world performance? Sure, they may boast individual benchmark numbers and say that their card is fast, despite having optimized routines for individual benchmarks, but they're really only claiming that their cards are fast, which is a subjective measure, and achieve those numbers on those tests, which they do. The benchmark writers may try to claim that their tests are somehow indicitive of the real world, but that doesn't really concern the hardware companies, but rather the benchmark writers themselves.

      Yes, they would love for you to believe this on your own, and I doubt that they're going to come forward and say that these third party benchmarks were optimized for (until they're caught, but then only maybe). They may even go so far as to say that card X outperforms card Y by Z amount in a certain benchmark when it does, despite not being indicitive of that relative performance in other arenas. Any tie between benchmark scores and real world performance is implicitly created by those who look at the benchmark expecting real world scores. There's not some grand rule anywhere saying you cannot write optimizations to execute some piece of code, benchmark or otherwise. Besides, those who actually follow computing hardware know not to rely on benchmarks for the true numbers. Sometimes they can be useful tools, but they certainly should not be used as a ground truth of performance. If everything that's been said is true and they aren't making any sorts of claims about relating where they have cheated to real performance, where is the fraud?

      Personally I'm saddened that this happened, but not really that suprised. I think if hardware manufacturers would stop doing stupid things like this then maybe benchmarks would be a little more useful. I'm sure that there's a more useful use of time than writing optimizations for benchmarks. It brings the companies down a bit in my eyes and now I'll be watching a bit more for any sort of doublespeak or implicit statement that isn't really there from these two.

      --
      If not now, when?
  4. cheats, not "optimizations" by Anonymous Coward · · Score: 2, Insightful

    Calling them optimizations gives what nVidia is trying to do a level of legitimacy which is undeserved. If you read the Futuremark paper, you will see that they are clearly cheating.

    It would be as if a CPU manufacturer substituted its own algorithms stealthily in a CPU performance benchmark and only when running that benchmark.

    Sure, you get a higher number, but you aren't measuring what the benchmark designer intended to measure.

  5. History repeats itself a thousand times over... by voxel · · Score: 5, Insightful

    This has been done for many years, even the last decade. A good friend of mine works and has worked for almost every major video card company in the buisness for the last decade. What is his job? Make sure THEIR video card gets the best scores on the latest and greatest video cards.

    I am sorry to tell you all, but just because Nvidia was CAUGHT this time, doesn't mean they haven't been "cheating" (by optimizing for a specific benchmark) for the last 6 years.

    I would bet every driver release contains code to help out benchmarks and even specific games. Why do you think Nvidia just said with there latest driver release " *Up to 30% faster frame rates ( *With Unreal Tournament 2002)".

    Its just once in a great while someone notices a performance jump TOO big, or just wants some news worthy-ness and decides to put out a nice PDF file.

    - Jeff

    --
    Modesty is one of life's greatest attributes
    1. Re:History repeats itself a thousand times over... by Cyno · · Score: 4, Insightful

      Maybe this is a good arguement for open source drivers? Afterall I'm paying for the hardware.

    2. Re:History repeats itself a thousand times over... by Xibby · · Score: 1, Insightful

      Is it a cheat, or is it an optimization?

      Well, given that it's qute likely that other products will be based on Doom3, QuakeIII, and the various Unreal engines, is it really bad to optimize your drivers to run those applications as best as possible?

      They are intended to be fast moving games, so even if such optimizations do degrade the visual quality a bit, it's not likely that you as the person playing the game will notice that the soke trail on the rocket that just exploded in your chest is only rendered across 25% of the distance between you and your attacker instead of the 50% the developer of the game intended.

      And in the case of Unreal Tournament 2002, they are at least telling you that the drivers have been optimized specifically for that title (if you read the small print.)

      Now go do some Linux vs Windows benchmarks without doing some sort of tweaking so that Linux has the advantage in whatever it is you're benchmarking. No? Not going to do it? I didn't think so. :)

      --
      I'm going to go back in my box and will think within the limits of my box: MS Sucks Linux Good I read too much Slashdot.
    3. Re:History repeats itself a thousand times over... by SnowDog_2112 · · Score: 3, Insightful

      Optimizing for a given product is fine. Heck, I appreciate it! If I know that the company has spent time looking at specifically one game and has polished the driver for that game, that's one more data point I have on whether to buy it.

      That's completely different from what happened here. They looked at a particular test where the camera travels on a set path and hard-coded it so that things were beautiful on that path. As soon as you hop the camera off the rails, the driver goes to crap.

      Gah. Read the article. Or, if you're not up to that task, read the dozen posts before mine which say the same thing.

      --
      Not representing or approved by my company or anybody else.
  6. ATI cheaping too by IpsissimusMarr · · Score: 3, Insightful

    Our investigations reveal that some drivers from ATI also produce a slightly lower total score on this new build of 3DMark03. The drop in performance on the same test system with a Radeon 9800 Pro using the Catalyst 3.4 drivers is 1.9%. This performance drop is almost entirely due to 8.2% difference in the game test 4 result, which means that the test was also detected and somehow altered by the ATI drivers. We are currently investigating this further.

    It not about cheating... but about how much you cheat.

    --
    "Engineers do the work of man, Physicists do the work of God"
  7. So what? by bobm17ch · · Score: 2, Insightful

    Different graphics cards have different strengths and weaknesses - much moreso than in previous years.

    eg. Fillrate, Vertex manipulation, Texture rasterizer, Shader technology, Texture sampling techniques, Shadow buffering etc.. etc...

    Some cards will be better than other at these tasks, and some games will take advantage of differing ratios of these technologies.

    The unreal engine has a reliance on poly-count and texture resolution, and it looks like the doom engine will tend to tax shader, and multitexture units more than the polygon throughput side of things.

    In other words, gfx cards are now so flexible that their abilities in these individual areas must be assessed in isolation depending on your choice of game/engine/technology.

    As little as 2 years ago all that mattered was fillrate, and this was essentially what the direct3d/opengl api's could stress in hardware.

    IMO, price seems to be the most useful benchmark for the newest cards.

    --
    \\ Mitch
  8. Re:Don't don the tinfoil hats prematurely... by Tweakmeister · · Score: 2, Insightful

    The problem with comments like these is why should Nvidia HAVE to buy into the FutureMark program, which is a "monopoly" in the benchmark market.

    I believe Nvidia's stance is "we don't really care for 3dmark all that much" ...

    --

    Colossians 2:8

  9. Re:Don't don the tinfoil hats prematurely... by Anonymous Coward · · Score: 1, Insightful

    Never attribute to malice what can be explained by stupidity.

  10. Re:well.. by uityup · · Score: 3, Insightful

    I would be amused to see ATI try and sue over this considering that they also appeared to cheat the benchmark on game test 4. I wonder if this is because they weren't able to catch and manipulate any other tests. New benchmark for driver writers: how effectively can the coder cheat the performance benchmarks?

  11. Re:Leave Coke out of it by Anonymous Coward · · Score: 1, Insightful

    No harm except for the sugar and caffeine that they pump into your body. If you don't think it does any harm then just try and quit for a week. When you start feeling the effects of withdrawal let me know.

    I've been caffeine free for 6 months now and the week long migraine when I quit was well worth it.

  12. Would it really kill Taco if... by WIAKywbfatw · · Score: 2, Insightful

    ...Slashdot was to host a BitTorrent of this and similar files for faster, cooperative downloading?

    I've said it before, and I'll say it again: doing this would be a win-win situation. It's a pity that the editorial team are too busy playing with MAME/whatever to actually do something of real benefit to the wider community.

    --

    "Accept that some days you are the pigeon, and some days you are the statue." - David Brent, Wernham Hogg
  13. Re:Don't don the tinfoil hats prematurely... by rhavyn · · Score: 4, Insightful

    When they did nothing more then change the splash screen, the nvidia card gave out different results. That seems to be detecting the benchmark and cheating.

    Try reading the article.

  14. Wasted Code by JeffRC · · Score: 5, Insightful

    Just think about this the next time you do a 5MB driver download. How much of that code is specifically for detecting and defeating benchmarks? How much of the cheats are part of the instability problems in your system?

  15. A reviewer's job is what? by kwerle · · Score: 3, Insightful

    Funny, I seem to remember Toms Hardware being rabidly AMD fanboyish about 1.5 years ago when AMD still had the fastest processor. I'm not saying they aren't biased fanboys, what I'm saying is they're fairweather fans.

    Isn't that the definition of a good reviewer? Fans of the current top of the line stuff - damn their history?

    To keep it on-topic, I also seem to remember ATI doing the exact same thing nVidia is now doing with quake "optimization" for the 8500 cards... Do a google search for "quake quack"

    Case in point...

  16. ATI cheating too? by Kegetys · · Score: 2, Insightful

    So, ATI results drop from this new patch too? Doesn't this mean that ATI is also cheating? If so, then how do we know that there isn't more cheats ATI is using, as this new patch is only made to exploit the nVidia ones. ATI has access to the developer version of 3D mark, so they could hide their cheats much more efficiently.

  17. Re:lies and statistics. by Surak · · Score: 5, Insightful

    The problem with 'real world testing' when it comes to video cards aimed at the gamers market is basically the difference between a few lousy FPS between the two top-of-the-line cards (and each have similar features, performance-wise) will be virtually indistinguishable in most cases.

    I think people shouldn't get all macho when it comes to this stuff. Honestly, it's like the difference between a 350 hp engine and a 351 hp engine. It doesn't amount to a hill of beans worth of difference except on paper.

    Get over it people.

  18. Old hat by Anonymous Coward · · Score: 1, Insightful

    Benchmark cheating on video drivers has been going on ever since video driver benchmarking. I worked for a graphics card manufacturer for a few years and a driver development company for a few years and remember a few interesting "optimizations". (This was in the 1991-1997 timeframe.)

    We had cheats -- excuse me, optimizations -- that were specific to a particular version of Winbench. Other "cheats" were true optimizations... they were just optimizations you'd probably only see in the benchmark program. A few were known to reduce performance in the "real world" but produced higher Winmarks... so we turned them on only when we detected Winbench was running.

    One of the major motivations towards benchmarks that use real applications/games was because of these "optimizations" that were only useful in the benchmarking program. Optimizations for those are still cheats, in the sense that they are intended to improve benchmark results, but at least they are optimizations that have a real-world positive effect.

    This is basically a "nothing new under the sun" story. But it's good for these kinds of articles to come out every once in a while, because it helps keep things from getting too out of hand.

    -(realname) posting anonymously.

  19. Good for futuremark by egarland · · Score: 3, Insightful

    I think it's awesome that Futuremark has come out swinging on this one. NVidia has obviously cheated horribly on these benchmarks. ATI aparently has also taken the low road on these but not as low as NVidia.

    NVidia is losing. Their chips and cards are worse than ATI's. What's worse than that, though, is that they are still trying to pretend that it's not the case. They need to seriously sit down and work on their designs but instead they are pissing money away working on cheating on benchmarks. That is a really bad sign for a company. It means managament is diverting money away from becoming successful twords appearing to be successful. A mentality like that is disasterous to the real value of a company.

    SELL! SELL NOW! Buy again when they have fixed their mangement and design issues.

    Contravertial != Overrated. Reply if you disagree, I'll read it.

    --
    set softtabstop=4 shiftwidth=4 expandtab nocp worlddomination
  20. Re:Stop your FUCKING whining, Slashdot! by PhoenixK7 · · Score: 3, Insightful

    "Who came out with a standard API that ALL manufactures could use without resorting to the arcane obfuscation of OpenGL? That's right, cuntfaces...

    It was Microsoft."

    Right. All manufacturers... whose hardware works with windows. I'll take cross platform compatability thank you very much.

    Before you might argue that nobody uses OpenGL, what about all those licensees of the Quake 3 engine? And what about all those who will license the Doom 3 engine?

  21. This is silly. by tempshill · · Score: 2, Insightful

    You could not prove to any court that NVidia is using deceit. NVidia improved their driver so that a certain set of operations runs faster. There is nothing deceitful about this.

    Even if they were to state on the box that they have the card that performs best on the 3DMark2003 benchmark, it would still be a truthful statement. Logically, it's a flaw of the benchmark that it is able to be exploited.

    If there is any deceit involved, it would be if someone were to claim that the result of this one benchmark conclusively proves that the NVidia card is superior.

  22. Re:lies and statistics. by DigiShaman · · Score: 3, Insightful

    Now that's just fucking wrong! I can understand returning a card because it didn't do as well as you thought or not worked at all. But to get a free product out of it. Damn, have some moral backbone.

    --
    Life is not for the lazy.
  23. thanks, but... by Shadestalker · · Score: 4, Insightful

    Dear nvidia / ATI / etc.,

    Please optimize your drivers and hardware for the actual applications and games I run, not the synthetic benchmarks designed to simulate workloads. Benchmarks don't use your products, end-users do.

  24. Re:Of course by Anarchos · · Score: 2, Insightful

    Hell I'd read the articles if they weren't slashdotted 90% of the time, such as the current story.

    --

    "A good conspiracy is an unprovable one." -Conspiracy Theory
  25. That's not a cheat... by Dr.+Zowie · · Score: 2, Insightful
    or instance, at some point there was a trick for a test with lots of occlusion to clip (discard) polygons that would eventually be occluded. However, these discarded polygons were actually calculated at run-time and not precomputed, so if you changed the test, it would still work right.


    You have just described an optimization, not a cheat. The point of cheats is that they take advantage of knowledge that's not available to normal processes. If your "cheat" takes no such advantage (e.g. calculating its shortcuts at runtime based only on the actual rendering data) then it's actually an optimization.

  26. Benchmarks not using 3DMark by Anonymous Coward · · Score: 1, Insightful

    Have a look at these benchmarks and tell me if nVidia made a cheat for all of them as well. http://www6.tomshardware.com/graphic/20030512/inde x.html

    Tom's Hardware does a really good job of actually testing the hardware with multiple softwares to get an accurate test of the card.

    BTW -- Those "enhanced" drivers did nothing for my score.

  27. Re:Worse than that! by Sj0 · · Score: 2, Insightful

    The point is that ATI had a pretty tenuous justification (that they were optimizing for Quake 3 as it's the engine behind a large number of games), but if this is the case then nvidia has none.

    Only if they didn't understand what they were doing, which I doubt. Since there aren't many quake-based games that are named quake.exe, and at the time, Quake 3 was an aging game used mainly for benchmarks, and the stunning similarity between the two, you're just searching for a way to justify it.

    This case certainly isn't black and white. If you recall, 3dmark and nvidia are kind of in this PR war right now because 3dmark uses general shader code, which the ATI specifically handles better, whereas the nvidia handles specially optimized shader code quicker. Considering that either card is made for these fictional "next generation games" which the gaming media has been warning us of since the days of the Geforce 3, I've decided that it doesn't really matter either way. My old Geforce 4 MX is STILL chugging the newest games away just fine, despite all the doomsayers prophecies, and the gaming media is still warning us about these mythical games that are supposedly just around the corner, so I think I'll hold off my next video card purchace somewhat.

    --
    It's been a long time.
  28. The other problem by roystgnr · · Score: 2, Insightful

    "Real World Testing" in general means that they're testing the card on games that are out on the shelf, finished products, right now; i.e. games which were targeted at video cards years old. In other words, one card does 150fps at the highest quality settings, another does 155fps, and when both of them are run on my 80hz refreshing monitor, the results are exactly the same.

    Instead, I want testing that approximates the sorts of games that I'll want to buy years from now. Unfortunately those games don't exist yet. In lieu of those games existing, I can look at these eye candy benchmarks to get some idea of what the performance of video cards will be once they're pushed to their limits. How many polygons or how many dynamic lights can programmers squeeze into a scene before the frame rate drops to something unacceptable? How fast can the card whip through those pixel shader programs that everybody is going to be rendering fur and metal and such with in soon? That's what these sorts of benchmarks are supposed to do: tell me how my prospective new purchase will perform on games in the future.