Xeons, Opterons Compared in Power Efficiency

← Back to Stories (view on slashdot.org)

Xeons, Opterons Compared in Power Efficiency

Posted by Zonk on Friday December 15, 2006 @02:56AM from the nothing-like-some-friday-morning-power-analysis dept.

Bender writes "The Tech Report has put Intel's 'Woodcrest' and quad-core 'Clovertown' Xeons up against AMD's Socket F Opterons in a range of applications, including widely multithreaded tests from academic fields like computational fluid dynamics and proteomics. They've also attempted to quantify power efficiency in terms of energy use over over time and energy use per task, with some surprising results." From the article: "On the power efficiency front, we found both Xeons and Opterons to be very good in specific ways. The Opteron 2218 is excellent overall in power efficiency, and I can see why AMD issued its challenge. Yes, we were testing the top speed grade of the Xeon 5100 and 5300 series against the Opteron 2218, but the Opteron ended up drawing much less power at idle than the Xeons ... We've learned that multithreaded execution is another recipe for power-efficient performance, and on that front, the Xeons excel. The eight-core Xeon 5355 system managed to render our multithreaded POV-Ray test scene using the least total energy, even though its peak power consumption was rather high, because it finished the job in about half the time that the four-way systems did. Similarly, the Xeon 5160 used the least energy in completing our multithreaded MyriMatch search, in part because it completed the task so quickly. "

8 of 98 comments (clear)

Min score:

Reason:

Sort:

Hmm, so which better reflects real-world usage? by pla · 2006-12-15 03:05 · Score: 4, Interesting

the Opteron ended up drawing much less power at idle than the Xeons
...
the Xeon 5160 used the least energy in completing our multithreaded MyriMatch search, in part because it completed the task so quickly.

So what does this mean for people shopping for servers?

If your servers constantly tick along at nearly 100% CPU use, you might do better going with the Xeon system. If your machines basically sit idle most of the time with an occasional spike for a few seconds when it actually does something, the AMD would save you more on electricity.

Of course, this raises a third possibility - Would running a number of virtual servers on one large Xeon machine waste more energy than it saves, or give a net gain?
1. Re:Hmm, so which better reflects real-world usage? by ptbarnett · 2006-12-15 04:36 · Score: 2, Interesting
  
  Although some people will pipe in with their number crunching sever stories, are there any normal usage servers that really come in at 100% CPU usage?
  For capacity planning purposes, most of my clients target 40-50% CPU utilization on servers. If it starts creeping above 60% on a consistent basis (or is forecasted to do so soon), they begin the acquisition process to either upgrade or add servers.
  Queuing theory (M/M/1) shows that while the average response time doesn't increase that much, the standard deviation increases rapidly as utilization grows above 60%. Restated in simpler terms: a larger proportion of response times become significantly larger -- to the point that users start to notice and either complain or go elsewhere.
  Other system components often keep you from reaching that target,
  Yes, system overhead starts to increase rapidly on most systems as you approach 100% CPU utilization. In many cases, total throughput actually decreases above system utilization of about 85-90%.
  most 24-7 servers I've seen do most of their work during a certain period then spend the rest of their time twiddling their thumbs.
  I've looked at usage patterns for a number of systems. Whether they are public (online banking) or internal-use-only, they all seem to have the same pattern: usage peaks about 10:00 AM, with a smaller secondary peak about 1:30 PM. The second peak usually disappears on Friday afternoon.
2. Re:Hmm, so which better reflects real-world usage? by twiddlingbits · 2006-12-15 04:42 · Score: 2, Interesting
  
  If I'm do General Purpose computing I would trade the 10W difference in power consumption for the redundancy and flexibility of the 4-way Opteron. With two 4 way boxes you can use one as the failover for the other, or load balance between them keeping low CPU use on each. General purpose computing really doesn't need the power of an 8-way SMP solution even with 1000's of users. You can virtualize either the 4 way or the 8 way with VMWare or Zen or Solaris Containers so that (IMHO) is a wash.
  
  It's really back to the old Horizontal vs Vertical scaling argument which involves a lot of factors along with power consumption. If floor space in your data center is a premium you probably want the 8ways as you can double your server density per rack (assumes you have the power and cooling). If your servers idle most of the time, space is not an issue and you are at close margins on data center power and cooling the Opteron 4way might be a better choice. There are also cost differences to consider. Opterons are usally priced below Xeons so if the botton line hardware costs are important that pushes to Opterons. You also have to look at the number of HBAs and network connections a 4way and an 8way will support. There are SO many combinations to consider including how much IT growth will occur it is mind boggling! It all depends on the strategic and tactical decisions made by the Data Center Team and the IT Organization, some places are all about performance and some are all about cost and others try to get a knife edge balance. Also keep in mind what you buy today is probably obsolete in 18 months and likely will be replaced in 36-48 months.
  
  There is also a 3rd Option. If you don't mind running on Sun SPARC equipment then the SPARC T1 based severs blow both options out of the water in terms of power consumption (just don't do a lot of floating point..they suck at that). If you are running Java and other products that have SPARC and Solaris 10 (Linux soon) versions then changing to a SPARC architecture might get some really big gains. However if you are a .NET shop or a Windows server shop you are stuck with the X86 Architecture with Xeon or Opteron.
Conclusions converted to $$$ by ben+there... · 2006-12-15 03:14 · Score: 2, Interesting

"The eight-core Xeon 5355 system managed to render our multithreaded POV-Ray test scene using the least total energy, even though its peak power consumption was rather high, because it finished the job in about half the time that the four-way systems did. Similarly, the Xeon 5160 used the least energy in completing our multithreaded MyriMatch search, in part because it completed the task so quickly."

Presumably, the article tests power consumption because businesses are concerned with how much running each of these systems will cost them. If the Xeons managed to win in power consumption because they completed the task in half the time, that has other cost-saving benefits even beyond power consumption. They can use fewer systems to complete tasks within the deadline, complete tasks ahead of schedule (making their business slightly more agile), and/or spend less money on animators waiting for their animations to render.
Re:God, I'm sick of this architecture by ben+there... · 2006-12-15 03:23 · Score: 2, Interesting

Aren't newer x86 processors essentially CISC that convert the instructions down to RISC? And RISC processors, like G4/G5, that use instruction sets such as Altivec are actually using some aspects of CISC?

That was my understanding, after reading articles like this one on Ars Technica. If true, it would make fighting over CISC vs. RISC not make a lot of sense.
Well too bad get used to it by Sycraft-fu · 2006-12-15 04:28 · Score: 2, Interesting

It's not going anywhere. Intel actually wanted to replace it though it's arguable if their replacement was better or worse but AMD won out the 64-bit round with x86-64. That's what Linux uses, that's what Windows uses, it's a done deal.

Now personally to me you sound like someone who's spent a little too much time in a computer science architecture class soaking up theories about ISAs and too little time actually looking at how chips are made these days and what works. When you get right down to it, x86 works just fine. The chips built on it are very fast, the compilers are able to generate efficient code for it, it plain works in the real world. You may not like it, but it does work well in the real world.

Will something like the Cell kill it? Maybe, but forgive me if I'm more than a little skeptical. There's been things that are going to kill x86 for a long time and none of it has panned out. You can try and make your ISA as brilliant as you like, what it really seems to get down to is good chip design for the money, and Intel and AMD are hard to beat at that.
Re:AMD needs to get back in the game, quick by Anonymous Coward · 2006-12-15 05:57 · Score: 1, Interesting

What I don't get is why are people constantly comparing Intel's quad-core with AMD's revision F hardware? Is this a marketing ploy from Intel to try and make it seem like AMD isn't in the game anymore? Revision F hardware from AMD means that the hardware is *capapble* of taking the future quad core chips, but AMD has not released them yet. In all fairness, what has Intel done for the public without AMD (or Sun for that matter) coming out with their own multi-core competitive product that saved us so much power? Let's be real here - if AMD and Sun didn't come out with multi-core, would we really be where we are today, or would we be cooking our pizzas over the already-overclocked and overheated intel chips that they were trying to squeeze so much out of? Why did it take competitive vendors to make Intel really re-think their methodology? Because they wanted to make more $$ off an old technology. I am in the least bit impressed with Intel. Of course they released a quad-core to market faster than AMD - they've been doing it for years. But again, all they did was take an idea which was not their own and go to market faster. What they don't get is AMD is working up another idea that will blow away the quad-cores. But of course, once Intel sees what they're doing and figures it out, they'll just be faster to market again... Let's give it some time and give AMD and Sun the credit they deserve for changing the future of the processor market today.
Re:God, I'm sick of this architecture by Salamander · 2006-12-15 14:52 · Score: 3, Interesting

You're forgetting the basic formula from Hennessy and Patterson:

WorkPerSec = WorkPerInstruction * InstructionsPerCycle * CyclesPerSecond

Yes, CISC has better work per instruction, except for one glaring issue I'll get to in a moment, but - for various reasons explained throughout H&P - it loses on the other two and thus overall. That's why nobody's making new processors that are CISC internally any more; they just couldn't hit the issue widths and clock speeds are achievable with a RISC core (even if that core has a CISC ISA bolted on the front). What's missing here is that not all work is useful work. As anyone who has accidentally coded an infinite loop knows, executing lots of instructions is not necessarily a good thing. The glaring issue I mentioned earlier is that a lot of the instructions executed on a register-poor architecture like x86 are not doing useful work. Register thrashing means i-cache bandwidth is wasted fetching instructions which are then used to waste d-cache bandwidth, which more than outweighs any advantage from variable-length instructions.

So, you say, wouldn't variable-length instructions on a register-rich processor be the best of both worlds? Not so fast. A regular instruction set makes superscalar execution easier because it means that multiple instructions can be fetched literally at the same time without having to examine the first one to figure out where the second one begins and so on. It also makes deeper pipelines easier because it allows many internal activities (e.g. register allocation, hazard detection) to start after a simple pre-decode stage, in parallel with the remainder of decode. Either way, regular instruction sets allow for more parallelism - and parallelism in some form is the generally the key to CPU performance. If you're willing to give up performance by eschewing most modern processor-design techniques, which might be the case for a deeply embedded system with extreme size and/or power requirements, then variable-width instructions might still be a reasonable choice. In that case you might as well use an older architecture; there are plenty to choose from. For new processor designs, though, variable-width instructions are almost invariably a way to lose.

--
Slashdot - News for Herds. Stuff that Splatters.