Slashdot Mirror


ARM In Supercomputers — 'Get Ready For the Change'

An anonymous reader writes "Commodity ARM CPUs are poised to to replace x86 CPUs in modern supercomputers just as commodity x86 CPUs replaced vector CPUs in early supercomputers. An analysis by the EU Mountblanc Project (PDF) (using Nvidia Tegra 2/3, Samsung Exynos 5 & Intel Core i7 CPUs) highlights the suitability and energy efficiency of ARM-based solutions. They finish off by saying, 'Current limitations [are] due to target market condition — not real technological challenges. ... A whole set of ARM server chips is coming — solving most of the limitations identified.'"

55 of 238 comments (clear)

  1. IMHO - No thanks. by Anonymous Coward · · Score: 2, Insightful

    PC user, hardcore gamer and programmer here; for me, energy efficiency is a lesser priority than speed in a CPU. Make an ARM CPU compete with an Intel Core i7 2600K, and show me it's overclockable with few issues, and you got my attention.

    1. Re:IMHO - No thanks. by Stoutlimb · · Score: 5, Insightful

      No doubt your CPU would win. But when looking at power/price as well, you'd have to pit your CPU against 50 or so ARM chips in parallel. For some solutions, it may be a far better choice. One size doesn't fit all.

    2. Re:IMHO - No thanks. by Anonymous Coward · · Score: 2, Interesting

      architecture is complicated. but in terms of ops per mm^2, or ops per watt, ops per $,
      cycles per useful op, the x86 architecture is a henious pox on the face of the
      earth.

      worse yet, your beloved x86 doesn't even have any source implications, its just
      a useless thing.

    3. Re:IMHO - No thanks. by c0lo · · Score: 4, Funny

      The article is aimed at supercomputers, not commodity PC. You are not the target.

      While not the target, you'll be collateral damage anyway.

      --
      Questions raise, answers kill. Raise questions to stay alive.
    4. Re:IMHO - No thanks. by king+neckbeard · · Score: 5, Informative

      You aren't operating in the supercomputing market. There, what matters is the how much processing you can get for how much money. You can always buy more chips, and power usage and cooling are both signficant factors. That's why x86 became dominant in that space. It was cheaper to buy a bunch of x86 chips than to buy fewer POWER chips. In terms of computing power, a POWER7 will eat your i7 for breakfast, but they are ungodly expensive.

      --
      This is my signature. There are many like it, but this one is mine.
    5. Re:IMHO - No thanks. by Colonel+Korn · · Score: 5, Informative

      architecture is complicated. but in terms of ops per mm^2, or ops per watt, ops per $,
      cycles per useful op, the x86 architecture is a henious pox on the face of the
      earth.

      worse yet, your beloved x86 doesn't even have any source implications, its just
      a useless thing.

      In TFA's slides 10 and 11, Intel i7 chips are shown to be more efficient in terms of performance per watt than ARM chips. However, they're close to each other and Intel's prices are significantly higher.

      --
      "I zero-index my hamsters" - Willtor (147206)
    6. Re:IMHO - No thanks. by dbIII · · Score: 3, Interesting

      Then you use something else as well. High performance computing server rooms already have a mix of stuff, especially since the AMD chips can give you a 64 core machine with half a terabyte of memory for $14K but it's not as fast per core as the two way Xeons. The parallel stuff is done on the plentiful and slower cores while the single treaded stuff is done on the faster cores - then GPUs do whatever parallel stuff you can feed them (memory and bandwidth limiting issues keep them from doing some tasks)

    7. Re:IMHO - No thanks. by KiloByte · · Score: 4, Interesting

      Damage or a winner? I feel so bad about having a cheap, efficient, and above all, quiet box.

      I bought this 4*2GHz baby, and the only reason it's not my main desktop yet is a weird and asinine requirement for monitor resolution to be exactly 720 or 1080 (WTF?!?). I think I'll replace my old but perfectly working pair of 1280x1024 monitors (I hate 16x9!), and put the big loud clunker to the cellar. I just hate the noise so much. x86 machines with no moving parts are extremely hard to get, and have terrible performance/price. Anything that requires lots of processing power: compilation, running Windows VMs, etc, can be done remotely from the cellar just as well, while a 2GHz arm is fast enough to do client stuff, running a browser being the most demanding part.

      And what else do you need to reside directly on the machine you plop your butt at?

      --
      The creatures outside looked from Alt-Right to Antifa; but already it was impossible to say which was which.
    8. Re:IMHO - No thanks. by symbolset · · Score: 4, Interesting

      The problem you have is the software tools you use sap the power of the hardware. Windows is engineered to consume cycles to drive their need for recurrent license fees. Try a different OS that doesn't have this handicap and you'll find the full power of the equipment is available.

      --
      Help stamp out iliturcy.
    9. Re:IMHO - No thanks. by 0123456 · · Score: 2

      I feel so bad about having a cheap, efficient, and above all, quiet box.

      So do I. I can't even hear my i7 machine when playing games on it, whereas the old Pentium-4 sounded like a vacuum cleaner.

    10. Re:IMHO - No thanks. by Dcnjoe60 · · Score: 2

      50 arm cpu's eh, problem comes to fact of something that can scale to that many cpu's.

      Well the article is about arms being used in supercomputers, so scalability is probably not going to be a problem.

    11. Re:IMHO - No thanks. by LordLimecat · · Score: 2

      THe core i7 might very well still win. Remember that intel is more efficient in computing work per watt, and an Ivy Bridge core i7 3770k uses 77w. If your average arm chip uses 2 watts, that means that ~30 arm chips will still get beaten by the core i7....

    12. Re:IMHO - No thanks. by aztracker1 · · Score: 4, Insightful

      Exactly, then again, there are plenty of non-cpu intensive loads.. part of the popularity and growth of NodeJS is that a lot of jobs are IO bound, and even a lot of web services/sites are spending most of their time waiting on files, or network resources/services... 10 arm CPU's handling 10K simultaneous requests, is as good as 1 uber-cpu handling 10K simultaneous requests... for that matter, there's been a lot of work done in MessageQueue routing, and distributed databases... ARM is a pretty good fit for an environment designed to scale horizontally. Some of the first things I wanted to try on my Raspberry Pi were MongoDB and NodeJS, with the thought that a couple dozen of them might work better with more resilience than a few larger systems...

      For the record, I think addressing a bit more memory, and larger/faster storage channels are what's holding back some of these systems.. which aren't a problem at super-computer scale.. but for someone wanting to put together a small cluster, it gets irritating.

      --
      Michael J. Ryan - tracker1.info
    13. Re:IMHO - No thanks. by Redmancometh · · Score: 2

      Useless for what you do. The second performance...not performance per watt...PERFORMANCE becomes an issue..ARM is a steaming pile of shit and you know it. If you're doing anything more than what the above AC said (keep playing soduku, and portal) it can't handle it. How about everyday consumers who need a tablet that can actually do work? A gimp version of windows is not going to get the job done. Some of the Samsung Slate tablets however come with an x86...and are actually fully functional! Can you point to an ARM tablet that can do everything it can? Or any other x86 tablet for that matter?

      I know it's not about the software. However, unfortunately, sometimes raw productivity is all that matters. Sometimes the latest windows RT garbage dump or iOS xyz isn't going to hold water. The fact of the matter is the software that will run on a system defines how productive that device is going to be. Me and you might be able to put a proper operating system on one of these...but your whole company? Hell no.

    14. Re:IMHO - No thanks. by aztracker1 · · Score: 3, Informative

      The last two times I ran Linux on my desktop I ran into issues that weren't impossible to overcome, just a pain in the ass to deal with... I had a desktop with two graphics cards in sli, and two monitors.. getting them both working in 2006 was a pain, I know that was seven years ago, but still... far harder than it should have been.. in 2007, my laptop was running fine, upgraded to the latest ubuntu, nothing but problems.. In the first case, XP/Vista were less trouble, in the second, Win7 RC1 ran better... I also ran PC-BSD for a month, which was probably the nicest experience I've had with something outside win/osx on my main desktop, but still had issues with virtual machines that was a no-go.

      Given, my experiences are pretty dated, and things have gotten better... for me, linux is on the server(s) or in a virtual machine... every time I've tried to make it my primary OS has been met with heartache and pain. I replaced my main desktop a couple months ago, and tried a few Linux variants.. The first time, I installed on my SSD, then when I plugged in my other hard drives, it still booted, but an update to Grub screwed things up and it wouldn't boot any longer. This was after 3 hours of time to get my displays working properly.... I wasn't willing to spend another day on the issue, so back to Windows I went. I really like Linux.. and I want to make it my primary desktop, but I don't have extra hours and days to tinker with problems an over-the-wire update causes... let alone the initial setup time which I really felt was unreasonable.

      I've considered putting it as my primary on my macbook, but similar to windows, the environment pretty much works out of the box, and brew takes things a long way towards how I want it to work. Linux is close to 20 years old.. and still seems to be more crusty for desktop users than windows was a decade and a half ago in a lot of ways. In the end, I think Android may be a better desktop interface than what's currently on offer from most of the desktop bases in the Linux community, which is just plain sad... I really hope something good comes out of it all, I don't like being tethered to Windows or OSX... I don't like the constraints... but they work, with far fewer issues... the biggest ones being security related... I think that Windows is getting secure faster than Linux is getting friendlier, or at least easier to get up and running with.

      --
      Michael J. Ryan - tracker1.info
    15. Re:IMHO - No thanks. by 0123456 · · Score: 2

      I had a desktop with two graphics cards in sli, and two monitors

      Given SLI barely works in Windows, expecting it to work in Linux was optimistic. I recently booted up a Linux Mint DVD on my laptop to try it out and... everything just works. Even using the 'recovery partition' to reinstall Windows on there takes over three hours, reboots about thirty times and breaks with barely decipherable and completely misleading error messages if you installed a hard drive larger than the one that came with it.

       

      Linux is close to 20 years old..

      And the BSD core in MacOS is close to 40 years old.

      Android would make a lousy desktop interface, just like Window 8. It was designed for phones and is barely a usable tablet interface. Of course, it probably is more usable than Gnome 3.

    16. Re:IMHO - No thanks. by Khyber · · Score: 2

      "For supercomputers? Battery life isn't a term."

      You say that until the power grid fails and your generator fails to kick on, leaving you with only battery backup in place.

      --
      Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
    17. Re:IMHO - No thanks. by Anonymous Coward · · Score: 3, Insightful

      A single ARM 4 core A-15 running 1.5 GHz per core blows away any competing chip at the same specs, on power AND price. It's not limited to the calculations x86 are and can process graphics and physics better as a result.

      Translation: It gets raped sideways on single-threaded performance and you have to double up on sockets right out of the gate.
      It's a bit of a misconception about ARM and x86. ARM wins of watts/socket and mhz/watts, but Intel's i7s cream ARM on performance/watt, once you account for those two factors, ARM isn't as competitive as you might think. Now, I'm not saying it isn't competitive, just that it's nowhere near as one-sided as you might be led to believe by cherry-picking.

    18. Re:IMHO - No thanks. by gl4ss · · Score: 2

      No doubt your CPU would win. But when looking at power/price as well, you'd have to pit your CPU against 50 or so ARM chips in parallel. For some solutions, it may be a far better choice. One size doesn't fit all.

      50 costs more in silicon than a single x86.

      basically you need a "new generation" of arm chips. but they'll have to compete against a new generation of x86 chips - and remember, x86 chips are priced as they are only because they're fastest you can buy!.

      the thing is, we have been listening to this for years, that in few years arm will take over everything. yet it hasn't.

      instead of supercomputing, I would foresee the lowest tier of rent-a-webservers to move to arm.. what's a better business than renting a machine that costs 40 bucks total for 5 bucks a month?

      --
      world was created 5 seconds before this post as it is.
    19. Re:IMHO - No thanks. by Teun · · Score: 2

      Always :)

      --
      "The likes of Facebook and WhatsApp are free to those whose privacy is of zero value."
    20. Re:IMHO - No thanks. by BasilBrush · · Score: 3, Interesting

      Why would an ARM chip use 2 Watts?

      â-- ARM Cortex-A9
      â-- 1 ops / cycle @ 800 MHz - 2 GHz
      â-- 0.25 - 1 Watt

      â-- ARM Cortex-A15
      â-- 4 ops / cycle @ 1 - 2.5 GHz*
      â-- 0.35 Watt

    21. Re:IMHO - No thanks. by Teun · · Score: 2

      Maybe you should ask your mom what those Preview and Continue Editing buttons below your fresh commend mean?

      --
      "The likes of Facebook and WhatsApp are free to those whose privacy is of zero value."
    22. Re:IMHO - No thanks. by BasilBrush · · Score: 3, Informative

      what do you think goes on at the other end of the copper/fibre cable?

      No supercomputing whatsoever. I'm not a physicist, a mathematician, a code breaker nor anyone else with supercomputing needs. My HTTP request for web page is quite likely served by a single core. Maybe 2.

    23. Re:IMHO - No thanks. by unixisc · · Score: 2

      Alpha's high price was due to DEC trying too hard to achieve prized speeds, and thereby having plenty of fallout, resulting in their need to jack up prices on those that did pass their tests. Had DEC gone for different speed bins, instead of just one, they could have priced it lower and sold it to markets which would have happily considered an Alpha, but where price was less critical.

    24. Re:IMHO - No thanks. by tolkienfan · · Score: 2

      A single Sandy Bridge system will outperform many dozen Raspberry PIs.

  2. Does it really matter? by gman003 · · Score: 4, Interesting

    Most of the actual processing power in current supercomputers comes from GPUs, not CPUs. There are exceptions (that all-SPARC Japanese one, or a few Cell-based ones), but they're just that, exceptions.

    So sure, replace the Xeons and Opterons with Cortex-A15s. Doesn't really change much.

    What might be interesting is a GPU-heavy SoC - some light CPU cores on the die of a supercomputer-class GPU. I have heard Nvidia is working on such (using Tegra CPUs and Tesla GPUs), and I would not be surprised if AMD is as well, although they'd be using one of their x86 cores for it (probably Bulldozer - damn thing was practically built for heavily-virtualized servers, not much different from supercomputers).

    1. Re:Does it really matter? by Victor+Liu · · Score: 5, Informative

      As someone who does heavy duty scientific computing, I wouldn't say that "most" of the actual process power is in GPUs. They are certainly more powerful at certain tasks, but most applications run are legacy code, and most algorithms require substantial reworking to get them to run with reasonable performance on a GPU. Simply put, GPU for supercomputing is not quite a mature technology yet. I am personally not too interested in coding for GPUs simply because the code is not portable enough yet, and by the time the technology might be mature, there might be a new wave of technology (like ARM) that could be easier to work with.

    2. Re:Does it really matter? by Anonymous Coward · · Score: 2, Insightful

      False. According to the Top 500 computer survey from November, 2012 (Category: Accelerator/Co-Processor), 87% of systems are not using any type of GPU co-processor, and 77% of the processing power is coming from the CPU.

      This is, however, a decrease from the June 2012 survey, so GPU is certainly making inroads, but it is not yet the main source of computation.

      http://www.top500.org/statistics/list/

      I still remember when the IBM Blue architecture came out, using embedded PowerPC processors and it was a huge power savings. It was a big deal, but far from a complete solution (limitations in RAM with no disk/swap).

      There is certainly a growing demand for a better power/performance solution in order to reduce total cost of operation. The individual performance of each processor doesn't matter as much when you have applications which are written to take advantage of 100,000s of processors in parallel.

    3. Re:Does it really matter? by Junta · · Score: 5, Informative

      Of the last published top500 list, 7 out of the top 10 had no GPUs. This is a clear indication that while GPU is defintely there, claiming 'Most of the actual processing power' is overstating it a touch. It's particularly telling that there are so few as overwhelming the specific hpl benchmark is one of the key benefits of GPUs. Other benchmarks in more well rounded test suites don't treat GPUs so kindly.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    4. Re:Does it really matter? by symbolset · · Score: 5, Interesting

      These ARM cores are halfway between the extremely limited GPU cores and the extremely flexible X86 cores. They may be the "happy medium".

      --
      Help stamp out iliturcy.
    5. Re:Does it really matter? by KiloByte · · Score: 5, Informative

      Also, a lot of algorithms, perhaps even most, rely on branching, which is something GPUs suck at. And only some can be reasonably rewritten in a branchless way.

      --
      The creatures outside looked from Alt-Right to Antifa; but already it was impossible to say which was which.
    6. Re:Does it really matter? by Victor+Liu · · Score: 2

      On the point or portability, there's then a distinction of your focus. If you do research on numerical methods, then yes, you would write highly optimized code for a particular machine, as an end in and of itself. I myself am merely a user, and our research group does not have the expertise to write such optimized code. We pay for time on supercomputing clusters, which constantly bring online new machines and retire old ones. Every year our subscription can change, and we are allowed to use resources on different computers. Therefore, from my standpoint, portability is very important. Otherwise, if we were to write our own code in-house, we basically have a 1 year (ok, fine, maybe 2 or 3 year) window in which to develop, test, and run it. It just doesn't seem worthwhile to spend so much effort developing a one-time use piece of code. I'd rather write something which will outlive my stay in the research program.

    7. Re:Does it really matter? by JanneM · · Score: 2

      System and numerical libraries and compilers are of course written specifically for the machine. But user-level apps (and a lot of scientific computing uses finished apps) are ported across multiple systems.

      Portability is not as big an issue as it was a generation ago, as most supercomputers basically are Linux machines today, and made to more or less look like a typical Linux installation from a user-application level, with a POSIX API; pthreads, OpenMP and OpenMPI; a standard set of numerical libraries; and often even gcc-compatibility in order to minimize the effort of porting. A notable exception is GPU-based machines (that are in the minority today, despite the OP assertion); they don't have a common API to write for, so using them is substantially harder at a user-level.

      And at a user level (but unike system libs) porting or coding time very much matters. Let's say your project is going to need a month of wall-clock computing time during the course of a year or two. If switching to a GPU-based system would shrink that by 50% - two weeks - then the effort to move your model code, app, and libraries had better take less than two weeks of work or you're going to waste project time, not save it.

      --
      Trust the Computer. The Computer is your friend.
    8. Re:Does it really matter? by ThePeices · · Score: 5, Funny

      Also, a lot of algorithms, perhaps even most, rely on branching, which is something GPUs suck at. And only some can be reasonably rewritten in a branchless way.

      nonsence, I play Farcry3 on my GPU, and it renders branches just fine thank you very much.

  3. Questions... by storkus · · Score: 5, Interesting

    As I understand it, Intel still has the advantage in the performance per watt category for general processing and GPUs have better performance per watt IF you can optimize for that specific environment--both things which have been commented to death endlessly by people far more knowledgeable than I.

    However, to me there are at least 3 questions unanswered:

    1. ASICs (and possibly FPGAs): Bitcoin miners and DES breakers are the best known examples. Where is the dividing line between where your operations are specific enough to emply an ASIC vs not specific enough and needing a GPU (or even CPU)? Could further optimization move this line more toward the ASIC?

    2. Huge dies: This has been talked about before, but it seems that, for applications that are embarrassingly parallel, this is clearly where the next revolution will be, with hundreds of cores (at least, and of whatever kind of "core" you want). So when will this stop being vaporware?

    3. But what do we do about all the NON-parallel jobs? If you can't apply an ASIC and you can't break it down, you're still stuck at the basic wall we've been at for around a decade now: where's Moore's (performance) law here? It would seem the only hope is new algorithms: TRUE computer science!

    1. Re:Questions... by XaXXon · · Score: 2

      The reason for the question is that nothing in Moore's law says anything about single-threaded performance doubling every 1.5 years as many thing.

      Moore's law is the observation that, over the history of computing hardware, the number of transistors on integrated circuits doubles approximately every two years.

  4. Re:Not buying it. by MikeBabcock · · Score: 3, Informative

    I don't buy your response: http://top500.org/statistics/list/ ... click accelerator and hit submit.

    87.6% of the top 500 super computers have no NVIDIA etc. coprocessing

    --
    - Michael T. Babcock (Yes, I blog)
  5. So, when can I buy an ARM ATX board? by LaughingRadish · · Score: 2

    Hopefully this means we should start seeing ARM-using motherboards in an ATX form-factor. The Pi and Beaglebone are nice, but I want something that's eassentially just like a commodity x86 motherboard except it uses ARM.

    1. Re:So, when can I buy an ARM ATX board? by LaughingRadish · · Score: 2

      Mini-ATX or Mini-ITX will do fine. I just haven't seen any that have the kinds of things you take for granted on x86 boards. I want an ARM board with SATA ports, PCIe slots, and DIMM (or SODIMM) slots. Is that too hard to produce? I don't see anything like this anywhere.

  6. No, they won't. by Dputiger · · Score: 5, Informative

    Current ARM processors may indeed have a role to play in supercomputing, but the advantages this article implies don't exist.

    Go look at performance figures for the Cortex-A15. It's *much* faster than the Cortex-A9. It also draws far more power. There's a reason why ARM's own product literature identifies the Cortex-A15 as a smartphone chip at the high end, but suggests strategies like big.LITTLE for lowering total power consumption. Next year, ARM's Cortex-A57 will start to appear. That'll be a 64-bit chip, it'll be faster than the Cortex-A15, it'll incorporate some further power efficiency improvements, and it'll use more power at peak load.

    That doesn't mean ARM chips are bad -- it means that when it comes to semiconductors and the laws of physics, there are no magic bullets and no such thing as a free lunch.

    http://www.extremetech.com/computing/155941-supercomputing-director-bets-2000-that-we-wont-have-exascale-computing-by-2020

    I'm the author of that story, but I'm discussing a presentation given by one of the US's top supercomputing people. Pay particular attention to this graph:

    http://www.extremetech.com/wp-content/uploads/2013/05/CostPerFlop.png

    What it shows is the cost, in energy, of moving data. Keeping data local is essential to keeping power consumption down in a supercomputing environment. That means that smaller, less-efficient cores are a bad fit for environments in which data has to be synchronized across tens of thousands of cores and hundreds of nodes. Now, can you build ARM cores that have higher single-threaded efficiency? Absolutely, yes. But they use more power.

    ARM is going to go into datacenters and supercomputers, but it has no magic powers that guarantee it better outcomes.

  7. I want by EmperorOfCanada · · Score: 2

    I have long pined for a server with maybe 10 4 core ARM CPUS. Basically my server spends its time serving up web stuff from memory. Each web request needs to do a bit of thinking and then fire the data out the port. Disk IO is not an issue nor is server bandwidth. Quite simply I don't need much CPU but I need many CPUs. A big powerful intel is of less interest.

    Also by breaking up the system into physically separate CPUs I suspect that an interesting memory accessing architecture could be conjured up preventing another potential choke point.

    1. Re:I want by zbobet2012 · · Score: 2

      Supermicro 1u 64 cores. Bunch of other Mobos (some more than 1u) on this page. Cheap is relative to the buyer I suppose, but to my (admittedly very large) company these things are rather cheap unless you start stacking them with lots of dense memory.

  8. Xilinx Zync anybody? by Z00L00K · · Score: 4, Informative

    Has anybody else seen/considered the Xilinx Zync? It's a mix of ARM kernels and FPGA, which could be interesting in supercomputing solutions.

    For anyone willing to tweak around with it there are development boards around like the ZedBoard that is priced at US$395. Not the cheapest device around, but for anyone willing to learn more about this interesting chip it is at least not an impossible sum. Xilinx also have the Zynq®-7000 AP SoC ZC702 Evaluation Kit which is priced at US$895, which is quite a bit more expensive and not as interesting for hobbyists.

    Done right you may be able to do a lot of interesting stuff with a FPGA a lot faster than an ordinary processor can and then let the processor take care of stuff where performance isn't a critical part.

    Those chips are right now starting to find their way into vehicle ECUs, but it's still in an early phase so there aren't many mass produced cars yet with it.

    As I see it - supercomputers will have to look at every avenue to get maximum performance for the lowest possible power consumption - and avoid solutions with high power consumption in standby situations.

    --
    If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
    1. Re:Xilinx Zync anybody? by janisozaur · · Score: 2

      there was a successful kickstarter campaign some time ago that introduced precisely those chips [Zynq-7020] at quite affordable prices: http://www.kickstarter.com/projects/adapteva/parallella-a-supercomputer-for-everyone/ they are going to have them available for sale soon at http://www.parallella.org/ after the kickstarter pledges are fulfilled. at $100 + shipping they are far more affordable than what you mention, they are also committed to having most (if not all?) things open, so be sure to check out their website.

  9. One Size Doesn't Fit All -- Same in Supercomputing by gentryx · · Score: 4, Informative

    There is already one line of supercomputers built from embedded hardware: the IBM Blue Gene. Their CPUs are embedded PowerPC cores. That's the reason why those systems typically have an order of magnitude more cores than their x86-based competition.

    Now, the problem with BG is, that not all codes scale well with the number of cores. Especially when you're doing strong scaling (i.e. you fix the problem size, but throw more and more cores on the problem), then the law of Amdahl tells you that it's beneficial to have fewer/faster cores.

    Finally I consider the study to be fundamentally flawed as it compares the OEM prices of consumer-grade embedded chips with retail prices of high-end server chips. This is wrong for so many reasons... you might then throw in the 947 GFLOPS, $500 AMD Radeon 7970, which beats even the ARM SoCs by a margin of 2x (ARM: ~1 GFLOPS/$, AMD Radeon: ~2 GFLOPS/$).

    --
    Computer simulation made easy -- LibGeoDecomp
  10. Power Efficiency - MIPS vs ARM by Taco+Cowboy · · Score: 2

    I may be wrong here, but I get the impression that the MIPS architecture is much more power efficient than that of the ARM architecture

    If they are going to talk about building up a big iron using CPUs which are of high power efficiency, I reckon the MIPS cpu might be more suitable for this task than one from the ARM camp

    --
    Muchas Gracias, Señor Edward Snowden !
    1. Re:Power Efficiency - MIPS vs ARM by julesh · · Score: 4, Insightful

      I may be wrong here, but I get the impression that the MIPS architecture is much more power efficient than that of the ARM architecture

      If they are going to talk about building up a big iron using CPUs which are of high power efficiency, I reckon the MIPS cpu might be more suitable for this task than one from the ARM camp

      I don't think it is. Best figures (albeit somewhat out-of-date) I can find for a MIPS-based system is 2GFLOPS/W for a complete 6-core node including memory. ARM Cortex A15 power consumption is a little hard to track down, although it's suggested that a 4-core 1.8GHz configuration (eg Samsung Exynos 5) could run at full speed on 8W (if the power manager let it; the Exynos 5 throttles down when it consumes more than 4W). Performance per GHz/core is about 4GFLOPS, so this system should be able to pull in about 28.8GFLOPS (or twice that if using ARM's "NEON" SIMD system to full advantage). Add in ~2W for 1GB DDR3 SDRAM, and that's 2.9GFLOPS/W. Assuming that the MIPS system I found is not the best available (as the data was from 2009 it certainly seems likely better is available now), the two appear to be roughly comparable.

    2. Re:Power Efficiency - MIPS vs ARM by niftymitch · · Score: 2

      I may be wrong here, but I get the impression that the MIPS architecture is much more power efficient than that of the ARM architecture

      If they are going to talk about building up a big iron using CPUs which are of high power efficiency, I reckon the MIPS cpu might be more suitable for this task than one from the ARM camp

      MIPS is an under invested older but great technology.
      Another historic winner was the DEC Alpha.

      As the folk at Transmeta (and others) demonstrated logic to decode any random ISA and drive a RISC core faster than the old VAX microcode days is very possible. This seems to be the way of modern processors. So ARM/x86/x86_64 ISA almost does not matter except to the compiler and API/ABI folk. If you want to go fast feed your compiler folk well.

      --
      Truth is stranger than fiction, but it is because Fiction is obliged to stick to possibilities; Truth isn't. Mark Twain.
    3. Re:Power Efficiency - MIPS vs ARM by Bert64 · · Score: 2

      Another advantage of MIPS is that 64bit MIPS is already mature, having been around since the early 90s... 64bit ARM on the other hand is new and not widely supported yet.

      --
      http://spamdecoy.net - free throwaway anonymous email - avoid spam!
    4. Re:Power Efficiency - MIPS vs ARM by KonoWatakushi · · Score: 2

      As the folk at Transmeta (and others) demonstrated logic to decode any random ISA and drive a RISC core faster than the old VAX microcode days is very possible. This seems to be the way of modern processors. So ARM/x86/x86_64 ISA almost does not matter except to the compiler and API/ABI folk. If you want to go fast feed your compiler folk well.

      One of the best ways you can help the compiler folk is with an orthogonal and sensible architecture. Furthermore, consider that generating good code is a problem that must be solved for every language, so starting with a good ISA makes for a lot less work.

  11. That's what is so funny to me by Sycraft-fu · · Score: 4, Insightful

    Slashdot seems to have lots of ARM fanboys that look at ARM's low power processors and assume that ARM could make processors on par with Intel chips but much more efficient. They seem to think Intel does things poorly, as though they don't spend billions on R&D.

    Of course that would beg the question as to why ARM doesn't and the answer is they can't. The more features you blot on to a chip, the higher the clock speed, and so on, the more power it needs. So you want 64-bit? More power. Bigger memory controller? More power. Heavy hitting vector unit? More power. And so on.

    There's no magic ju ju in ARM designs. They are low power designs, in both sense of the word. Now that's wonderful, we need that for cellphones. You can't be slogging around with a 100 watt chip in a phone or the like. However don't mistake that for meaning that they can keep that low consumption and offer performance equal to the 100 watt chip.

    1. Re:That's what is so funny to me by TopSpin · · Score: 3, Interesting

      There's no magic ju ju in ARM designs.

      The magic ju ju is the ARM business model. There is one trump card ARM holds that precludes Intel from many portable devices; chip makers can build custom SOCs in-house with whatever special circuits they want on the same die. Intel doesn't do that and they don't want to do it; it would mean licencing masks to other manufactures like ARM does. For example, the Apple A5, manufactured by Samsung, includes third party circuits like the Audience EarSmart noise-cancellation processor, among others. It is presently not feasible to imagine Intel handing over masks such that Apple could then contract with some foundry to manufacture custom x86 SOCs. This obviates Intel from many portable use cases.

      That feature of the ARM business model might be very useful to large scale computing. One can imagine integrating a custom high-performance crossbar with an ARM core. Cores on separate dies could then communicate with the lowest possible latency. Using a general purpose ARM core to marshal data to and from high-performance SIMD circuits on the same die is another obvious possibility. A custom cryptography circuit might be hosted the same way.

      Contemporary supercomputers are great aggregations of near-commodity components. However, supercomputing has a long history of custom circuit design and if the need arises for a highly specialized circuit then a designer may decide that integrating with ARM to do the less exotic leg work computing that is always necessary is a good choice.

      --
      Lurking at the bottom of the gravity well, getting old
  12. Not only Performance per $ by gentryx · · Score: 2

    ...but also reliability (because supercomputers are really large and one failed node will generally crash the whole job, thereby wasting gazillions of core hours; that's one reason why SC centers buy expensive Nvidia Tesla hardware instead of the cheaper GeForce series) and IO and memory bandwidth and finally integration density. That one Intel chip can be more tightly integrated as it won't generate as much excess heat per GFLOPS (according to TFA...).

    --
    Computer simulation made easy -- LibGeoDecomp
  13. Not this week... by niftymitch · · Score: 2
    Not this week....
    I am a fan boy for the small ARM boards... I have built an MPI cluster out of Raspberry-Pi boards and it is not even close except as a teaching exercise where it excels.

    However many site services can be dedicated to these little boards where corp IT seems to dedicate virtual machines.

    Department Web Servers... with mostly static content... via NFS or a revision control system like hg.
    Department and internal caching name servers... NTP servers and managed central storage for each building or closet.

    The impact of the little ARM boards has kicked Intel in their lethargy-loaded-behind. Their next generation sub 25 Watt systems will take names and kick but as long as IT does not overload them with WindowZ.

    IT departments will find that the management advantage of chromebox devices connected to quality screens compelling.

    Users will find that flipping open the company ChromeOS laptop will put them on the same page as the big screen in the office...

    It is true that this is not 100% ready for prime time for all of us but the handwriting is on the wall.

    --
    Truth is stranger than fiction, but it is because Fiction is obliged to stick to possibilities; Truth isn't. Mark Twain.
  14. GNU/Linux on ARM by tepples · · Score: 2

    A gimp version of windows is not going to get the job done.

    On the other hand, a Windows version of GIMP does get a lot of jobs done that don't quite need Adobe Photoshop.

    But seriously, the reason Windows RT is "gimped" is because Microsoft has refused to endorse recompiling desktop applications. That's not a failing of ARM, as ARM ran RISC OS on Acorn computers, as much as a power grab by Microsoft.

    Some of the Samsung Slate tablets however come with an x86...and are actually fully functional! Can you point to an ARM tablet that can do everything it can?

    Some ARM tablets run Ubuntu. Other Android tablets run Debian in a chroot, with video out through an X11 server app for Android. These can't run Windows applications in Wine the way x86 applications do, but they work for any GNU/Linux application that has been recompiled for ARM.