Japan's Fujitsu and RIKEN Have Dropped the SPARC Processor in Favor of an ARM Design Chip Scaled Up For Supercomputer Performance (ieee.org)

← Back to Stories (view on slashdot.org)

Japan's Fujitsu and RIKEN Have Dropped the SPARC Processor in Favor of an ARM Design Chip Scaled Up For Supercomputer Performance (ieee.org)

Posted by msmash on Friday July 6, 2018 @04:52AM from the shape-of-things-to-come dept.

Japan's computer giant Fujitsu and RIKEN, the country's largest research institute, have begun field-testing a prototype CPU for a next-generation supercomputer they believe will take the country back to the leading position in global rankings of supercomputer might. From a report: The next-generation machine, dubbed the Post-K supercomputer, follows the two collaborators' development of the 8 petaflops K supercomputer that commenced operations for RIKEN in 2012, and which has since been upgraded to 11 petaflops in application processing speed. Now the aim is to "create the world's highest performing supercomputer," with "up to one hundred times the application execution performance of the K computer," Fujitsu declared in a press release on 21 June. The plan is to install the souped-up machine at the government-affiliated RIKEN around 2021. If the partners achieve those execution speeds, that would place the Post-K machine in exascale territory (one exaflops being a billion billion floating point operations a second). To do this, they have replaced the SPARC64 VIIIfx CPU powering the K computer with the Arm8A-SVE (Scalable Vector Extension) 512-bit architecture that's been enhanced for supercomputer use, and which both Fujitsu and RIKEN had a hand in developing. The new design runs on CPUs with 48 cores plus 2 assistant cores for the computational nodes, and with 48 cores plus 4 assistant cores for the I/O and computational nodes. The system structure uses 1 CPU per node, and 384 nodes make up one rack.

21 of 40 comments (clear)

Min score:

Reason:

Sort:

Fujitsu SPARC M3000 & Solaris 10 by buzz_mccool · 2018-07-06 05:26 · Score: 2

For many years my daily driver was a M3000 Fujitsu 4 core 2.86Ghz SPARCVII with 64GB of RAM and a internal RAID disk running Solaris 10. It still would be too had my employer not moved to a combination of Windows and Linux. Very reliable, scalable, hardware and OS.
Re: Yes but by fubarrr · 2018-07-06 05:27 · Score: 1

No
Hot stuff! by Sponge+Bath · 2018-07-06 05:31 · Score: 1

FTA: "30 to 40 megawatts..."
"first, a feeble spark, next a flickering flame, then a mighty blaze, ever increasing in speed and power." - Nikola Tesla
Re:512 bits eh? by Anonymous Coward · 2018-07-06 05:43 · Score: 1

SVE is a flexible in terms of vector length, with maximum width of 2048 bits. The 512 was likely selected due to power, bandwidth and chip real-estate reasons. We should know more after the Hot Chip representation, available free for everybody at this December the latest.
Re:512 bits eh? by CajunArson · 2018-07-06 05:53 · Score: 1

"SVE is a flexible in terms of vector length, with maximum width of 2048 bits."
AVX can get wider too if it makes sense. The fact that an exotic purpose-built supercomputer chip that won't be in real operation until 2021 is clearly taking design cues from a 2017-era consumer-grade part that you can use to play video games kind of shows how this nonsense about "ARM" vs. "x86" doesn't mean anything in the real world.
Incidentally, for all the crap about "RISC" that gets tossed around here like a magic get-out-of-thinking-free card, anybody who actually knows what RISC actually means would tell you that any and all SIMD units violate RISC's principles.

--
AntiFA: An abbreviation for Anti First Amendment.
K is faster than what LINPACK indicates by u19925 · 2018-07-06 05:58 · Score: 1

K was listed at number 10 in Nov-2017 benchmark and about 1/9th the speed of the fastest machine. But on HPCG benchmark (http://www.hpcg-benchmark.org ), it was listed number 1. Even now, it is number 2 behind the Summit in HPCG. Not bad for a computer debuted in 2011. Hope they can maintain performance on HPCG as well.
The proposed computer's LINPACK power efficiency is good but not that impressive. Summit is 8.8 MW and proposed computer is 8 times faster at about 4-5 times power consumption. So efficiency increase by a factor of 2 in 3 years (it is expected to be ready by 2021).
Re:512 bits eh? by dfghjk · 2018-07-06 06:37 · Score: 4, Insightful

"anybody who actually knows what RISC actually means would tell you that any and all SIMD units violate RISC's principles."
No, SIMD units do not "violate RISC's principles" and your saying so only demonstrates your own shallow understanding of what these principles are. RISC may mean "reduced instruction set computing" but that doesn't mean the end game is the smallest instruction set.
RISC is about making an architecture that is easy to implement by eliminating instructions that aren't needed for good performance. In doing this, designers can spend their gates on performance wins rather than on logic that can just as easily be reproduced in software. RISC is about putting gates where they count and not wasting them where they don't.
When you view RISC by it's actual principles, and not just by what the letters stand for, it's quite easy to see how SIMD units are entirely compatible. Of course, RISC as a differentiator has been obsolete for decades now and only exists as a point of argument for people who don't understand. RISC was an interesting topic...in 1988. You're 30 years behind.
ARM supercomputers by DontBeAMoran · 2018-07-06 06:55 · Score: 1

Remember over a decade ago when people were claiming that Apple replacing PowerPC CPUs for Intel CPUs would never ever happen?
We still have people here on Slashdot and on Mac-related forums claiming that Apple will never ever replace Intel CPUs with their own ARM CPUs.
We now have supercomputers built with ARM CPUs. Given Apple's love of computing-power-to-watts ratios, and their need to control as many parts of their hardware as possible, it's only a matter of time. ARM-powered Macs are coming.

--
#DeleteFacebook
1. Re:ARM supercomputers by iggymanz · 2018-07-06 09:32 · Score: 1
  
  You mean we have ARM based supercomputers planned or being built.... are there any running yet?
2. Re:ARM supercomputers by DontBeAMoran · 2018-07-06 09:59 · Score: 1
  
  Funny, but I remember the exact same predictions 5 years ago. Also 10 years ago. It's always "ARM is coming, real soon now!!"
  Except that 5 or 10 years ago, Apple didn't have ARM CPUs powerful enough to hold their own against Intel's i3/i5/i7 CPUs. And now they're designing their own, custom-made for their needs. And seeing the low-power, low-end CPU they're using in their MacBooks, Apple could swap to ARM tomorrow and the MacBook computing power would actually increase while lowering the cost.
  And ARM computers are important because watt-for-watt, ARM will give you a laptop that lasts a lot longer on the same battery, giving the chance for Apple to either make a laptop that still runs 10 hours on a single charge while lowering the cost and weight, or making a laptop with the same battery and weight that will run for 20 to 30 hours.
  
  --
  #DeleteFacebook
3. Re:ARM supercomputers by DontBeAMoran · 2018-07-06 10:00 · Score: 1
  
  Sure there are! Haven't you seen the new amazing cluster of 16 Raspberry Pi Zero?
  
  --
  #DeleteFacebook
4. Re:ARM supercomputers by iggymanz · 2018-07-06 13:53 · Score: 1
  
  my beolwulf cluster of old cell phones can kick its ass, if I can find all the chargers
5. Re:ARM supercomputers by serviscope_minor · 2018-07-06 19:53 · Score: 1
  
  Maybe you're right, but not for the reasons you state:
  We now have supercomputers built with ARM CPUs.
  Yes, but we also have SPARC powered supercomputers and we haven't had SPARC laptops since Tadpole in the 1990s. What makes for a good supercomputer and a good laptop are quite different things. Part of what makes for a good supercomputer is a fast, very wide floating point unit. But even more, supercomputers live and die on the interconnect. One thing you'll notice is that The Riken, stil very competetive and #1 on some benchmarks has an on die interconnect. Those things add up to making a good supercomputer.
  What makes ARM useful in this regard is that it's a flexible enough design that they can add on their fast, wide vector FP unit and the interconnect, not to mention a fast, wide memory bus, but no longer have to maintain a now ageing CPU core.
  In the other direction, Intel are already targeting the power balance for laptops. So, it doesn't mean we won't see ARM laptops, but the fact they're good for supercomputers has little bearing.
  
  --
  SJW n. One who posts facts.
Can we please stop with this outdated joke? by DontBeAMoran · 2018-07-06 06:57 · Score: 1

Nothing can run Crysis.

--
#DeleteFacebook
Re:512 bits eh? by technosaurus · 2018-07-06 07:16 · Score: 1

They likely chose 512 so that they didn't need to make all the necessary compiler improvements from scratch.
Re:well duh by Anne+Thwacks · 2018-07-06 08:04 · Score: 1

WHY not. Sparc is GPL'd and people still run OpenBSD on Sparc systems.

--
Sent from my ASR33 using ASCII
Looks like a Borg component by mnemotronic · 2018-07-06 08:04 · Score: 1

You will be armsimilated. All your Linux are belong to us.

--
The Russians have won. They have made the world a cesspool of distrust, greed, fear and hate.
Re:512 bits eh? by tempmpi · 2018-07-06 09:20 · Score: 1

You don't understand what "this guy" (David Patterson and Andrew Waterman) is writing. Their main complaint with regular SIMD is that it makes the instruction set grow quickly over time, as CPU performance is scaled up and SIMD units are made wider but old instructions still need to be supported for backward compatibility. They support vector instructions similar to ARM SVE, as those allow scaling up the performance by adding wide execution units without requiring the introduction of new instructions at the same time. As an additional benefit, those instructions can reduce the code size compared to regular SIMD. So by the standard of that article ARM SVE is not SIMD. The SIMD instructions they call harmful are only those older ones such as ARM Neon, SSE, AVX, AVX2, ... that use a fixed SIMD size.
And at the same time, their main complaint with those instruction sets is how they evolve over time when backward compatibility is required. If you are building a CPU for a specific purpose, let's say HPC or an embedded application, where you don't have to support old code, regular SIMD with only vector size support can be a very good choice and very RISC-like choice.

--
Jan
Re:512 bits eh? by tempmpi · 2018-07-06 09:34 · Score: 1

AVX can get wider too if it makes sense.
No, if you want to make AVX wider, then you would need to introduce a new instruction set. Intel did this several times. They started floating point SIMD with SSE which is 128-bit, then introduced the new AVX instruction set to support 256-bit wide SIMD and then AVX-512 to support 512-bit wide SIMD. If they want to scale up to 1024 or 2048-bit wide SIMD in the future than again, they will have to add new instructions and still need to support the old instructions for legacy software.
With traditional SIMD, your instruction set and the maximum width of your execution units are coupled. SVE removes this coupling. You can write SVE code now for a CPU with a 512-bit wide execution unit and would get a speedup later, without recompilation, if someone builds a CPU with a 2048-bit wide execution unit.

--
Jan
That's a terrible chipset, not the good one by davecb · 2018-07-06 13:07 · Score: 1

That was the glow-in-the-darek, go as fast as possible, not-parallel-enough Intel-like kludge. if you want a good SPARC, look at the T5. The T5 was small, cool, heavily parallel and utterly not what Oracle wanted (;-))

--
davecb@spamcop.net
Re:512 bits eh? by unixisc · 2018-07-08 14:20 · Score: 1

RISC and SIMD address totally different issues. RISC is about the architecture of one CPU. SIMD is about how multiple CPUs work concurrently on various data sets