12-Core ARM Cluster Beats Intel Atom, AMD Fusion
An anonymous reader writes "Phoronix constructed a low-cost, low-power 12-core ARM cluster running Ubuntu 12.04 LTS and made out of six PandaBoard ES OMAP4460 dual-core ARMv7 Cortex A9 chips. Their results show the ARM hardware is able to outperform Intel Atom and AMD Fusion processors in performance-per-Watt, except it sharply loses out to the latest-generation Intel Ivy Bridge processors." This cluster offers a commendable re-use of kitchenware. Also, this is a good opportunity to recommend your favorite de-bursting tools for articles spread over too many pages.
Or, could they just not do the MIPS/Watt calculations without actually building the thing?
I dont know the exact model used but the first one I could find online was 182 bucks * 6, thats a grand just to prove a point (+ other hardware), hope it was worth it to beat a 60$ atom
"Besides winning on performance and efficiency, the Core i7 3770K system would cost less than the cost of a six PandaBoard ES cluster setup."
So a single Ivy Bridge system, which takes up much less rack space, no cluster network ports, outperforms and costs less than the ARM cluster. Is that the definition of a no-brainer?
I must have been under a rock for the past few years, but are Ivy Bridge processors really more power-efficient than Atoms, Fusions and even ARMs? I thought they were designed more for speed than efficiency, while the others were made for low consumption. Was I wrong? On the internet?
PlusFive Slashdot reader for Android. Can post comments.
I'm getting Dramamine for everyone on Slashdot to counteract the ARM FUD.
1. Look at both the AMD and Intel boards for the low-end processors... notice anything? They have all of these... features like PCIe, real memory interfaces, SATA controllers, etc. etc. All of these features consume power. Huge amounts? Not really, but compared to both the E-350 and the Atom CPUs, the amount of power being measured for each board is including a very large amount of power that has zero to do with the CPU. Guess what would happen if I took an E-350 or Atom and put it in an equivalent to the Panda board?
2. Apparently ARM's marketing department ran out of money to pay the poster to describe the Ivy Bridge system used in this test. Here's the short results:
a. In the parallel benchmarks used in this test that are a (probably unrealistically) best-case scenario for the ARM cluster, a single Ivy Bridge CPU was 5 times faster.
b. Oh but ARM says: So what if Ivy is faster! It's a power hog... look it used over 100 WATTS OMG!!!! Well guess what? On a performace per-watt scale, the Ivy Bridge system is THREE TIMES BETTER THAN ARM.
c. Oh but the ARM fanboys will say that Intel cheated by using a better lithographic process!! Well guess what: ARM loudly brags that it is better because it is an IP only company, so you have to take the good with the bad.
4. Oh one more thing... the Ivy Bridge system had REAL PERIPHERALS like real memory, reali PCIe, a real SSD, etc. etc. that by themselves probably used more power than at least one of the ARM boards, probably 2 of them. Oh and by the way.. the power used for the network fabric needed to network those ARM boards... *NOT* included in the power consumption figures so ARM had that as an extra advantage! So in many ways the Ivy Bridge system was intentionally disadvantaged.. and was still THREE TIMES MORE EFFICIENT ON A PER-WATT BASIS THAN ARM IN A SERIES OF BENCHMARKS THAT ARE BEST-CASE-POSSIBLE SCENARIOS FOR ARM.
5. For all of those ARM fanbois who are about to say that PCIe, real RAM interfaces, real SATA support, etc. etc. are inelegant artifacts of the stupid x86 instruction set well.. bite me. The last 5 years of ARM trolls who have literally gone down the feature list of every feature that x86 has that ARM doesn't and found a way to call the features that ARM lacks stupid and moronic (until ARM implements them years later and then claims to have come up with them first) is pissing me off.
AntiFA: An abbreviation for Anti First Amendment.
It took how many arm computers to match one old intel chip? i didn't realize arm was that underpowered.
any time soon?
http://www.readability.com/articles/sagdka0j
I was impressed that it gets the first 11 pages, and then it includes a 'Next Page' link to in-line the remaining pages. The problem is it didn't get the performance images, which are in separate iframes.
tomorrow who's gonna fuss
The point is how much mips do you do per watt or fraction of watt.
Because if you scale up, probably any "new" and more powerfull CPU will beat a "cluster" of low power ones.
They are low power for a reason. To process data without draining batteries.
If you want to go all guns blazing, then do a test with video card GPUs, because they also do MIPS, albeit, specialized ones...
So, interisting project... a bit useless conclusions.
So intel finally beats arm in performance/watt, but a 2 board cluster beats intels lowest power offering. So, basically intel has finally eroded the advantage arm has in servers, but arm still maintains an edge in small, low power devices. I love that arm has been so competitive in certain areas. Its good to see something other than x86 everywhere. Imagine if there was no iphone. Imagine if there was no competition and arm was still just a slow, but modern and power efficient core? ARM has come a long, long way in the past few years. Competition drives innovation. We need more.
zosxavius photography
So I'm asking myself how 12 ARMs equal the power consumption of one Atom. So I have sit through all the page loads. The "Atom" is a complete off-the-hself "Net Top" box designed to maximize performance (spinning hard drive and high-end graphics card) with the sole constraint of being noiseless -- i.e. the Atom was chosen by the NetTop manufacturer for low heat, not for low energy consumption.
OK, then for the comparison with Ivy Bridge, I wasn't surprised. I've been salivating about the low-power versions of Ivy Bridge for several months now. But this comparison wasn't even againt that. They used the highest clock cycle highest power 3770K variant, which is rated at 77W. There is a 45W version for a bit lower clock speed. (BTW, Intel "produces" low-power variants the same way they "produce" high-clock variants -- they test the chips after manufacturing to see which ones draw less power.)
So, basically, the comparison is completely pointless and a waste of time.
If you're looking at highly parallelizable workloads shouldn't the GPU in the AMD part be part of the equation?
Safari's reader seems to make good work of that. One long page, all the photo's and no adds.
IMO they did two basic things wrong:
1. They tested the raw CPU power, when it's common knowledge ARMs aren't meant for heavy computations.
2. Using 6 pandaboards is probably the least cost effective way to get a 12-core ARM cluster (but the only way available off-the-shelf right now)
To address 1, they should change the benchmark to something IO-bound instead of CPU-bound, say a database or a static-file webserver. For 2, they'd have to wait for some ARM server boards, which combine all the CPUs on a single board.
Wow! Imagine a Beowulf cluster of these!
.. Imagine a Beowulf cluster of those!!
To Terminate, or not to Terminate, that's the question - SCSIROB
What they did not mention is
The panda board cost $182 at Digi-key - for that same $182 per board, one can buy the Intel Atom
mini ITX board for $62 (search for Intel Desktop Board D425KT Innovation Series - motherboard - mini ITX - Intel Atom D425
or http://www.google.com/products/catalog?q=intel+atom+mini+itx&oe=utf-8&rls=org.mozilla:en-US:official&client=firefox-a&um=1&ie=UTF-8&tbm=shop&cid=182897328813518723&sa=X&ei=idTcT9fgF6GO2AXl6-WTDA&ved=0CNEBEPMCMAE#scoring=tp)
so for $182 and $4 you can buy 3 Intel Atom mini ITX board and still out run your ARM in cost and performance.
and when the whole performance test finish - you can hook up those atom and boot up window XP and use some useful OS/application like Photoshop
The quad core Cortex-A15s have even better perf/watt. Better cache architecture in them. Support for 40-bit physical addressing. ARM is quickly catching up to Atom and Fusion in terms of performance.
“Common sense is not so common.” — Voltaire
Real servers need ECC RAM. I'd be reluctant to even run a home file server without it, if that server contains critical data.
Does ARM support ECC? If not, then it can be ruled out on that basis alone. Atom and Bobcat can also be ruled out at this time since neither support ECC RAM.
A while back Intel announced a 2-core, 1.2 GHz Sandy Bridge "Pentium 350" that has a max TDP of 15W and has the standard server chip package, including ECC support. This would be nice for small, low-power servers. But for some reason, I haven't been able to find them on sale anywhere (online or off), even though Intel's site says it was launched Q4 2011.