Slashdot Mirror


How Many Android OEMs Cheat Benchmark Scores? Pretty Much All of Them

An anonymous reader writes "After Samsung got caught out cheating on benchmarks (Note 3, Galaxy S4) AnandTech has done a detailed analysis of the state of benchmark cheating amongst Android OEMs. With the exception of Motorola, literally every single OEM they've looked at ships (or has shipped) at least one device that does benchmark-specific CPU optimizations. AnandTech also thinks it will get worse before it gets better. 'The hilarious part of all of this is we’re still talking about small gains in performance. The impact on our CPU tests is 0 - 5%, and somewhere south of 10% on our GPU benchmarks as far as we can tell. I can't stress enough that it would be far less painful for the OEMs to just stop this nonsense and instead demand better performance/power efficiency from their silicon vendors.' The article notes that Apple doesn't do any of the frequency gaming stuff."

40 of 189 comments (clear)

  1. Easy solution by TheRealMindChild · · Score: 4, Interesting

    The benchmark software should randomize the process name on launch

    --

    "When life gives you lemons, don't make lemonade. Make life take the lemons back!" -- Cave Johnson
    1. Re:Easy solution by h4rr4r · · Score: 2

      So then you just look for behavior. Oh look a bunch of unnatural activity, change the governor to performance instead of on_demand.

    2. Re:Easy solution by hairyfeet · · Score: 4, Insightful

      Even easier...don't use benchmarks which are easily rigged, choose real world applications and cook up some tests using them!

      The problem is ALL the benchmarks have been rigged by somebody for ages...X86? Any benchmark compiled using ICC is rigged against AMD and boosts Intel by 30% (look up "Intel Cripple Compiler" if you want to learn more, end result is benchmarks are completely useless on X86 unless compiled with GCC) and as we see here ARM? Rigged by everybody. This kind of crap has been going on since "Quack.exe" and is why you shouldn't trust ANY benchmarks. I'll always remember one of the reviews before the guy was told about ICC when he was testing netbooks "These new netbooks with AMD seem to be MUCH snappier...yet it loses to the low end Atom by nearly 30%, and I just don't know why"...the reason why is the benchmarks are all rigged!

      So if you want to bench an ARM chip? Root the phone, install Bash, and run a script that runs real applications in real world scenarios. there are more than enough apps out there that cooking up their own informal benches should NOT be hard and until they do? Just remember to replace the word benchmark with bullshit every time you see one of these articles.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    3. Re:Easy solution by realityimpaired · · Score: 2

      Oh look a bunch of unnatural activity, change the governor to performance instead of on_demand.

      That shouldn't make that much of a difference... if the governor is in the low_power setting, limiting the CPU to minimum clock speed, then there'll be a performance hit on benchmarks, but with on_demand, by the time the system has realized it needs to change the governor to performance, the clock speed will have already been increased to maximum due to CPU load. Meanwhile, you're wasting clock cycles monitoring what stuff is doing....

    4. Re:Easy solution by hairyfeet · · Score: 3, Informative

      If you are trying to say AMD's compiler rigs? Sorry you are full of shit, the reviewer that first found the cripple code in ICC did rigorous testing of AMD's compiler and found it was merely GCC with some extra X64 flags added to better support 64 bit, that is it. Oh and you can download the source for the ACC and see for yourself, its based on GCC and is fully GPL V2 compliant.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    5. Re:Easy solution by realityimpaired · · Score: 2

      Fair enough, And not an unreasonable measure.

      Truthfully, and I know I'm in the minority in a discussion about benchmarks, cell phones have been "good enough" for quite a while, and I bought mine based on feature list and price. I bought the cheapest phone that had all of the features I wanted: FM radio, GPS, bluetooth, wifi tether, and that's it. Didn't even care whether it had 4G connectivity (though it did). Even a bottom-end $100 smart phone with an 800MHz single core processor is in the performance range that will be good enough for most of us, and for the serious gamers, they can worry about getting the most expensive quad core on the market, or, like I did, get a good quality tablet to accompany the phone. With entrants like the Asus MeMo Pad HD 7" (a sub-$200 quad core tablet with plenty of memory, there are others but that's the one I bought), it's getting ridiculously easy to get good performance in your devices for just about anything you throw at it in the real world.

  2. Or, alternatively by Truth_Quark · · Score: 5, Insightful

    The phone manufacturers should not be dicks.

    1. Re:Or, alternatively by Anonymous Coward · · Score: 3, Interesting

      On the contrary, the testers should just be bigger dicks. "We detected benchmark-specific optimizations in products #1, #2 and #3, so they all got zero points."

    2. Re:Or, alternatively by L4t3r4lu5 · · Score: 4, Informative

      Pfff. Car manufacturers tape up the air intakes and door seams on their cars to do fuel economy runs, just to eek out the every last 0.1mpg. Running your car like that for any reasonable period of time would wreck the engine pretty quick.

      Benchmarks are about as useful as manufacturer spec sheets. Take both with a a few metric tonnes of salt.

      --
      Finally had enough. Come see us over at https://soylentnews.org/
    3. Re:Or, alternatively by idontgno · · Score: 2

      Pfff. Car manufacturers tape up the air intakes and door seams on their cars to do fuel economy runs, just to eek out the every last 0.1mpg. Running your car like that for any reasonable period of time would wreck the engine pretty quick.

      Does any manufacturer report their own measured mileage? Within the US, the only number that matters is measured by the EPA, on a dynamometer.

      --
      Welcome to the Panopticon. Used to be a prison, now it's your home.
    4. Re:Or, alternatively by gnasher719 · · Score: 4, Insightful

      On the contrary, the testers should just be bigger dicks. "We detected benchmark-specific optimizations in products #1, #2 and #3, so they all got zero points."

      That seems quite arbitrary. What about "To test the battery, we tested how many minutes the battery lasts while running benchmark X". The cheaters will get shorter battery life.

    5. Re:Or, alternatively by ftobin · · Score: 4, Informative

      Actually, manufacturers do report their own efficiency numbers, and the EPA spot-checks them.

      http://business.time.com/2012/12/10/more-reason-to-be-skeptical-about-new-car-mpg-claims/

    6. Re:Or, alternatively by Pieroxy · · Score: 2

      Why would they get shorter battery life?

      They can't artificially boost performance without a drain on the battery.

    7. Re:Or, alternatively by Pieroxy · · Score: 3, Insightful

      Just buy an iPhone. Apple doesn't cheat apparently.

    8. Re:Or, alternatively by dj245 · · Score: 2

      Actually, manufacturers do report their own efficiency numbers, and the EPA spot-checks them.

      http://business.time.com/2012/12/10/more-reason-to-be-skeptical-about-new-car-mpg-claims/

      Not only that, but they are allowed to use the same numbers if the drivetrain and weight of the vehicle is the same as to a previously tested vehicle.

      --
      Even those who arrange and design shrubberies are under considerable economic stress at this period in history.
  3. And Apple by Anonymous Coward · · Score: 5, Informative

    With the exception of Motorola...

    And Apple. Apple and Motorola/Google are the only two companies that don't boost their devices for benchmark tests. If you're going to give credit to one, please do be fair and give credit to the other.

    I respect both of them for that level of integrity and I hope they stick to their guns and remain honest.

    I may be an Apple fanboy (and I am) but I'm really looking forward to seeing what Motorola starts releasing in about a year once Google's able to, as they said, flush things out of the system and start releasing truly Google-designed products.

    1. Re:And Apple by Anonymous Coward · · Score: 3, Interesting

      Posting anonymously since I'm a motoroogle employee... you'll be disappointed. I certainly am. At this point, I expect google to shut us down or spin us off.

    2. Re:And Apple by h4rr4r · · Score: 3, Insightful

      I fear Motorola is dead.

      The Moto X was supposed to be a device with high amounts of post Google input. It is a high priced mediocre mid range device. They still cater to carriers unlike Apple. They need to compete with the 5S, One X and S4. Instead they are competing with last years product.

      Moto needs to copy Apple in some things. Make 1 main device, sell the old one or a cheaper version as well. Max 2-3 devices. Refresh them once a year. This lets third parties make all kinds of doodads for the device. Put them on all carriers and update them at the same time. If need be get the FCC to lean on Verizon's C block requirements. Other than that, unlock the bootloaders and make donations of devices to the popular mod groups. Most users will never care, but the PR among the tech crowd would be huge and they influence other buyers.

    3. Re:And Apple by Dixie_Flatline · · Score: 5, Interesting

      It's not a mid-range device. It's only mid-range if you look at the spec sheet and nothing else. Its (non-gamed) benchmarks are actually pretty good for all this talk of 'mid-range'. They did the same thing Apple did and tried to balance out performance with battery life. They didn't put the biggest screen in it, and they have optimised silicon to listen for commands without keeping the CPU on all the time.

      Specs aren't the war that anyone should be trying to win in the mobile space. That kind of thinking is why there are phones that only last half the day.

    4. Re:And Apple by JanneM · · Score: 3, Interesting

      I've heard a number of people I trust comment on how the Moto X just feels really good in the hand and how the screen and size is just right. So the phone is certainly not bad at all, and most of us don't actually buy the high-end phones any more than we buy the high-end cars or bikes. A really good mid-range phone is exactly what I want; I'm already making calf-eyes toward the coming Sony Z1 Mini.

      What's hurting Motorola for me, personally, is that it's simply not on sale in most of the world, and seems unlikely to ever be. It's not just about being able to get it where I live, but having a phone designed from the start to be usable in all major regions of the world as I travel.

      --
      Trust the Computer. The Computer is your friend.
    5. Re:And Apple by king+neckbeard · · Score: 3, Informative

      Perhaps you should take a refresher course on reading comprehension. The first sentence mentions that Android OEMs were analyzed. Apple is not an Android OEM, so they were not included at that point.

      --
      This is my signature. There are many like it, but this one is mine.
    6. Re:And Apple by Karlt1 · · Score: 3, Insightful

      Apple is not the exception. They just fudge the numbers in a different way. Really, they're all bastards though and I don't trust them any further than I can throw them.

      And a little deeper analysis will show....

      a) Apple accounted for numbers the same way they always did.
      b) Apple does not count phones as sold if they are still in Apple Stores inventories.
      c) If Apple were channel stuffing, phones would readily be available at carriers.

      The fact is that the "analyst" were wrong and are trying to cover their tails.

      Do you really believe that there is not enough of a demand in the launch countries to sale 9 million phones in one weekend?

      Do you think that Apple will be forced to take a write down (like MS and BlackBerry) for unsold inventory in the channel?

    7. Re:And Apple by Dixie_Flatline · · Score: 2

      It can't be worse than the Samsung's I've handled. And the HTC One is premium and beautiful, but that doesn't seem to be working out.

      I will accept that you personally are calling it a mid-range device on the build quality alone, but so much of the commentary has focussed on how it has 'mid-range specs' and is overpriced for the screen/CPU/GPU that's in there.

      There are trade-offs to be had. iPhones are top-tier devices, but they've got smaller screens than the top-tier Android phones. The S4 is a plasticky top-tier device with a big screen and a whole bunch of useless software features. Given these things, I'm not convinced the Moto X (and I admit, I haven't sat down and played with one) deserves its mid-tier accusations.

    8. Re:And Apple by Trillian_1138 · · Score: 3, Interesting

      Just picked up a Moto X to replace my aging Galaxy Nexus. I'm on Verizon, which makes me skeptical I'll see another Nexus phone anytime soon. I like the near-stock Moto X setup, and the little tweaks (active notifications, touchless control, etc) are pretty fun. It's a pretty great phone, and I hope it helps boost Motorola in the Android world. I do think they priced it too high (although I still bought it, so take that as you will..) but I highly recommend the phone to anyone who plays the occasional, not to graphic-intensive games, wants their phone to be quick and responsive, likes stock Android, etc. It's an awesome phone.

  4. Re:Probably wont get better by h4rr4r · · Score: 3, Insightful

    They started that game long before you could get 1 GB hard drives.

  5. CPU Benchmark Shenanigans ROI?? by Anonymous Coward · · Score: 2, Informative

    'The hilarious part of all of this is we’re still talking about small gains in performance.'

    The even more hilarious part is that OEMs are going to the trouble to do this when CPU benchmark scores are a very small factor in the decision of most consumers to buy these phones. I doubt that the ROI is higher than say, oh, improving the user experience of the GUI or call QoS.

  6. Hilarious... by Savage-Rabbit · · Score: 3, Insightful

    The hilarious part of all of this is we’re still talking about small gains in performance.

    The hilarious part of all this is that most people really don't give a rat's as about performance when selecting a phone or even a tablet. The criteria are things like: how does it handle? How intuitive is the UI? Can I watch my favorite online video feeds on this thing? Are any buttons in annoying palaces? What's the price? Does this thing have software to view and edit MS Office files I get sent by mail? The only performance tests these smartphone and tablet things usually get is playing around with a display example in the shop and seeing if the UI is nice and snappy. Nobody excepts tech nerds gives a rats ass that a Samsun Galaxy 4 get a few more FPS in Modern Combat than an iPhone 5.

    --
    Only to idiots, are orders laws.
    -- Henning von Tresckow
    1. Re:Hilarious... by coolsnowmen · · Score: 2

      I might not care as much about performance as I care about battery life, and a rating of that is performance/Watt.

    2. Re:Hilarious... by the_humeister · · Score: 2

      Here are some tests I've done on my devices compiling Povray 3.6 (single threaded) and rendering the benchmark scene. I was rather surprised that my new Nexus 4 (Qualcomm S4) is slower per core than my SGS II (Samsung Exynos 4). Yet the Nexus 4 doesn't feel any slower and is my main phone now due to the larger and higher resolution screen.


      Athlon II x4 (2.8 GHz): 179.82 pps ; 64.22 pps/GHz
      Exynos 5 (1.7 GHz): 77.36 pps ; 45.51 pps/GHz (-mfloat-abi=hard -mcpu=cortex-a9 -mthumb -mthumb-interwork)
      PowerPC 750 (700 MHz): 20.47 pps ; 29.25 pps/GHz

      Exynos 4210 (1.2 GHz): 29.90 pps ; 24.91 pps/GHz (-mfloat-abi=hard)
      Pentium 4m (1.5 GHz): 36.24 pps ; 24.16 pps/GHz
      S4 Pro APQ8064 (1.5 GHz): 31.81 pps ; 21.20 pps/GHz (-mfoat-abi=hard -mcpu=cortex-a15)
      Atom N270 (1.6 GHz): 28.96 pps ; 18.10 pps/GHz

      Unfortunately can't post all the data I have because this site thinks there're too many 'junk' characters already...

    3. Re:Hilarious... by Savage-Rabbit · · Score: 3, Insightful

      Apple decided the only way they can differentiate their iPhone 5s amidst all the comments that they are no longer innovative is to create a competitive market based on useless CPU performance numbers, just like what Apple did with Retina displays. Before Retina, nobody cared about pixel density. Before A7, nobody cared about CPU performance or its bittyness. Before the iPhone 5s camera, nobody cared about the size of the CCD pixel on their phone camera.

      Dude, It's not that long ago that articles like this slashvertisment were plastered all over the web accompanied by comments filled with enthusiastic boasts by hoards of Android fans detailing how iPhone performance sucks ass. Even if blisteringly fast benchmark performance is pretty low down on the list of most people out to buy a smartphone I still can't fault Apple for trying to put a sock in the collective mouth of the Android community. There is a certain personal satisfaction to be had from making the choir of hard core Android fans shut up about benchmarks until Samsung comes up with a still faster device (hopefully free of benchmark cheating this time) even if the customers will probably hardly notice this stupid pissing contest.

      --
      Only to idiots, are orders laws.
      -- Henning von Tresckow
  7. The best part by aitikin · · Score: 3, Funny

    Is that the table is titled "I Can't Believe I Have to Make This Table". Made me smile.

    --
    "Don't meddle in the affairs of a patent dragon, for thou art tasty and good with ketchup." ~ohcrapitssteve
  8. If you measure it, it gets better by tompaulco · · Score: 4, Insightful

    If you measure it, it gets better. Nobody ever stops to think why. They mostly just think they are good managers. But in fact, if you create a measuring tool to measure qualities of a device, the manufacturers will work to make that measurement better. If you make a measurement to determine how your employees are performing, they will perform better according to that measurement. That's just the way it works.
    You can't measure everything, so you're best bet is to try to keep the measurement methods secret and change them frequently. Unless, of course, your measurements are intended to improve a particular area, then by all means, measure on.

    --
    If you are not allowed to question your government then the government has answered your question.
  9. Wrong, they are boosting clock speed above normal by SuperKendall · · Score: 2, Informative

    I was under the impression that what they were doing was ensuring that the clock speed was running at full, not slowed down for power saving, etc.

    No. They are running at a clock speed that no real application will see under any circumstance, either the GPU or CPU cock increased.

    It is within what the parts are rated for but not what the device was built to run at normally.

    I assume some games and other applications also force the processor to full

    There is no way to build a game on Android that can run at the speed the benchmarks are getting run at on each of the devices "cheating".

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
  10. Re:Cheating or just gaming the benchmark? by DJRumpy · · Score: 2

    You do realize that this is a battery operated device, and that such 'tweaking' dramatically impacts that battery life. Couple that with them not reporting the actual battery life while running the GPU/CPU coverclocked, and you are essentially lying out of you eye teeth.

  11. Re:Wrong, they are boosting clock speed above norm by Anonymous Coward · · Score: 2, Informative

    No, he was right. The phones CAN and DO reach that clock speed. Read the AnandTech article. The graph of CPU speed shows it quite clearly.

  12. Re:"Pretty Much All of Them" by flimflammer · · Score: 4, Insightful

    No, not really. Comparing iOS and Android directly on performance is silly. They're two totally different ecosystems and hard performance numbers don't change much. That's like a typical user picking a Mac or Windows PC because one performs 5% better at random tasks, ignoring the fact that the offerings between each machine is radically different and pure performance numbers are only a tiny part of the whole picture.

    Apple has no reason to cheat because they have no competition that merits the risk of cheating on. It might have been a different story had iOS hardware been available from multiple vendors.

  13. Re:Wrong, they are boosting clock speed above norm by ifiwereasculptor · · Score: 4, Funny

    either the GPU or CPU cock increased.

    Whoa, can cell phones do that now? I hold these things to my ear, for Christ's sake!

  14. Mechanical Turk Benchmarks by swb · · Score: 2

    Instead of automated benchmarks of hardware, why not real world human benchmarking where a group of people is given a set of tasks to do on a given cell phone platform and see who can do them faster?

    Automated technical benchmarks make sense when the workloads more or less approximate the benchmark -- video gaming, 3D modeling, disk throughput, etc.

    But unless I'm living totally in the dark, most people aren't buying cell phones oriented towards single-task performance (eg, gaming). They get used for many tasks and the fact that I can run some obscure CPU task faster than some other model doesn't tell me if it makes it faster to open an app, etc.

  15. Re:"Pretty Much All of Them" by triffid_98 · · Score: 3, Funny

    No, Apple already has that part down to a science.

    Step #1
    Hype new iPhone
    Step #2
    Release new iPhone
    Step #3
    Immediately release new iOS update.
    Step #4
    Watch existing iPhone users complain after the iOS update cripples older models.
    Step #5
    Laugh maniacally after existing iPhone users stand in lines waiting for new uncrippled iPhones.
    Step #6
    PROFIT

  16. Re:Cheating or just gaming the benchmark? by dj245 · · Score: 2

    You do realize that this is a battery operated device, and that such 'tweaking' dramatically impacts that battery life. Couple that with them not reporting the actual battery life while running the GPU/CPU coverclocked, and you are essentially lying out of you eye teeth.

    This may not be as straightforward as you think- when batteries are cold, they have a different "extractable" capacity compared to when they are hot. Running the CPU/GPU at full tilt is going to warm up the battery. It is kind of a guess as to how much is actually left in there, which could easily explain this.

    --
    Even those who arrange and design shrubberies are under considerable economic stress at this period in history.