In Tests Opteron Shows Efficiency Edge Over Intel, Again
Ted Samson writes "In their latest round of energy-efficiency tests between AMD Opteron and Intel Xeon, independent testing firm Neal Nelson and Associates find AMD still holds an edge, but it's certainly not cut-and-dried. Nelson put similarly equipped servers through another gauntlet of tests, swapping in different amounts of memory and varying transaction loads. In the end, he found that the more memory he installed on the servers, the better the Opteron performed compared to the Xeon. Additionally, at maximum throughput, the Intel system fared better, power-efficiency-wise, by 5.0 to 5.5 percent for calculation intensive workloads. For disk I/O intensive workloads, AMD delivered better power efficiency by 18.4 to 18.6 percent. And in idle states — that is, when servers were waiting for their next work load — AMD consistently creamed Intel."
Here is the whitepaper, instead of the summaries.
Xeon 51xx (Woodcrest) is essentially the same as Core 2 Duo (Conroe); it is very power-efficient.
The submitted article is terrible. The full paper is much more informative. For example, the full paper gives the system specs (both systems at 3.0GHz) and shows that the Opteron system is much cheaper ($2800 vs. $4170 for 2GB configuration).
No, the point is people are starting to care about the total power usage of their 500-zillion server colo facility, where even a 5% reduction in power usage can mean hundreds or thousands on the power bill.
http://www.worlds-fastest.com/d.pdf/wfw991.pdf
(Granted, it was buried several links deep.)
The article does not mention it, but SLES 10 enables cpufreq and the ondemand governor by default.
AMD power utilisation with reduced frequency in idle is higher than that of a Xeon system which consumes nearly nothing when you slam it down to 250MHz.
Uh, the lowest frequency of the Xeon 5160 is 2GHz.
If you fully load them down, my X2s use nearly as much as the Core2 systems - but when lightly loaded, my experience mirrors that of the article, that the X2 systems use significantly less power.
In our call center, we built a large batch of X2-based systems - nothing too fancy, just an X2/3800, two gigs of memory, a 250-gig drive, a DVD burner, a 6200tc video card, and 19" LCD monitors. The cases and power supplies were pretty cheap - I think $35 for the case and a "400-watt" power supply. (Yes, the quotes are there for a reason.)
In order to size out the UPS units, we broke out the old, trusty Kill-A-Watt. In logging into a PDC server, browsing the web, checking email, etc., then logging out, the peak draw for one machine and monitor together was 140 watts, with the load *most* of the time at 80-100 watts. Those are some spankily low numbers, especially when you consider that the monitor's contribution was probably 25-40 watts.
And, as we speak, I have a dual-socket, dual-core opteron with a 15K SCSI raid array and 8 gigs running just a few feet away from me, with 4 instances of Prime95 running. Kill-A-Watt says 296 watts with all of that going on. This is going to replace an old 4x700 MHz Xeon server which draws 500-700 watts. The power factor, however, is just 0.7 - I really need a better power supply in there.
Oh, you're not stuck, you're just unable to let go of the onion rings.
Actually, a close look at the tests show is that they got the AMD to cycle down under no-load conditions, but couldn't get the Intel chip to do the same.
As you said, this probably has more to do with the OS, Motherboard, and BIOS than the chip being used.
Over and over again people try and compare the efficiencies between two "seemingly" identical servers / machines. But truly, how can you declare a winner (and base it on something like a 5% efficiency margin) when the two machines are using different power supplies? A 600 Watt for the Intel, 500 Watt for the AMD. I can't find those models listed on Delta's website at quick glance, but it'd be a stretch to imagine that two different power supplies have the exact same efficiency curves. I mean, I'd believe if they were accurate to within maybe 3%, so now we're arguing over whether or not Intel and AMD are more than 2% different in efficiency? Come on people. The whitepaper does say they assume there might be a 1% difference between the two power supplies, but that's based on "eyeballing" the efficiency curves.
We know that Intel takes a hit with FB-DIMM memory especially as you add more memory modules.
Another inconsistency appears to be related to the case design, where the cases for the Intel machines appeared to be providing inadequate cooling for the memory modules, causing the system management controller to bump up fan speed considerably. So now we're comparing two systems with different power supplies and with different requirements for cooling which may or may not be related to the actual architecture but may be impacted by a design consideration made by the case manufacturer. How would these results change with different power supplies or a different case. Are the differences the same in a 2U case? A tower? Does it get worse? Better? I know that our Mac Pro's NEVER speed up the fans above the 500/600 RPM's that they bottom out at.
As noted by others, the paper is completely devoid of any discussion regarding CPU frequency / voltage scaling that may or may not be handled by the BIOS or Linux resident programs (cpuspeed daemon). It's possible they haven't even checked for it. As our company has both Intel and AMD linux boxes, I can testify that linux is very sensitive to motherboard/cpu combinations when it comes to cpu scaling and it's "possible" that this could be playing a MAJOR role in the idle performance values. It'd be nice to see it addressed.
Lastly, there's no discussion as to the optimizations made to the software being run on each of the boxes. Is the code compiled for each architecture individually taking into account support for 3DNow / SSE instructions, cache sizes, etc? Obviously more efficient or less efficient code execution would have a MAJOR impact on these studies, enough so that companies usually spend a large amount of time playing with compiler options to get the best performance on a given architecture. And when you're arguing over performance comparisons in the sub 20% difference arena, code efficiency should be addressed, especially if it's not a big commercial package that "everyone" in the industry would be using. Anyhoo, just my thoughts.
Don't expect, though. Core 2 Duo is ahead of Athlon 64 X2 in gaming performance... but when you go multi-socket, Opteron servers still spank similar Xeon systems thanks to the integrated memory controllers and the superior HyperTransport interconnect. In enterprise or scientific use data traffic can be the crucial thing to overall performance -- more so than actual ALU horsepower -- and that's what AMD got right many moons ago with the very first K8 'Hammers.
TO START
PRESS ANY KEY
Where's the 'ANY' key? I see Esk, Kitarl, and Pig-Up...