NEC SX-9 to be World's Fastest Vector Computer
An anonymous reader writes "NEC has announced the NEC SX-9 claiming it to be the fastest vector computer, with single core speeds of up to 102.4 GFLOPS and up to 1.6TFLOPS on a single node incorporating multiple CPUs. The machines can be used in complex large-scale computation, such as climates, aeronautics and space, environmental simulations, fluid dynamics, through the processing of array-handling with a single vector instruction. Yes, it runs a UNIX System V-compatible OS."
so awesome when girls post...
(This would waste some of the compute power, but if the total time saved from not changing the application exceeds the time that could be saved using more of the cycles available, you win. It is this problem of creating illusions of whatever architecture happens to be application-friendly at a given time that has made much of my work in parallel architectures - such as the one produced by Lightfleet - so interesting... and so subject to office politics.)
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
"Easter Island's Weather Forecasting Service believes operation of the NEC SX-9 would realize a 53% savings under Windows Server 2008 compared to under UNIX"
What's with these new fangled measurements.
I'd like to know what it is in Libraries of Congress per Jiffy
A game has objectives and is competitive, anything else is just play
I wonder how well it will do with the really cool vector games like Asteroids or BattleZone or Tempest or...
"what's your vector, Victor?"
The Kai's Semi-Updated Website Thingy
I dunno: maybe this thing could run faster at higher temperatures in lower gravity?
(/pretending to know what I'm talking about)
I used to carry a bottle of whiskey for snake bite. And two snakes. -Nefarious Wheel
Ahh... so you're planning to turn Aero on.
Well, distributed is often seen as poor-mans parallel, but in this case they don't compare. Vector units have large arrays of data and perform the same operation on all of them at once. Think array or matrix operations being done in one step rather than needing loops. This is where a SIMD architecture takes off.
The only unit I ever got to play with had a 64x32 grid of processors, you could add a row of numbers in log2(n) steps instead on n. It was cool because you could tell each processor to grab a value from the guy next to him (or n steps in a given direction from him) and so on. You could calculate dot products of matrices very quickly.
The distributed stuff you mentioned is mostly farming. Take a big loop of independent steps, break them up and pass them out to a (possibly) heterogeneous collection of processing nodes. Collect the answers when they finish. Render farms work the same way. It's a good way to break up some problems, but it's not what a vector unit does.
Now, I haven't touched this stuff for eleven years so my facts are possibly wrong. I'm sure someone will be along to correct me.
The only text that can ever follow the words "up to" in computing is "0.1 *". As in "speeds of up to 0.1 * 102.4 GFLOPS". Every time a marketing droid published a press release, a kitten dies.
- Adam L. Beberg - The Cosm Project - http://www.mithral.com/
If you build it, nerds will come. Soylentnews.org
user@host:~ $ ls
You are about to list the files in this directory.
Are you sure you want to do this? [y/n] y
Enter Administrator password:
We're sorry, using MS Bash 4.00 Basic you do not have
the proper privilege level to view system files.
Please purchase MS Bash 4.00 Mega, Ultra, or Extreme.
Would you like to purchase one of these products now? [y/n] y
We're sorry, this product is not upgradeable. Please
reinstall your operating system, choosing "clean install"
during the upgrade process. Thank you for choosing the
rich user experience provided by MS Bash 4.00.
MS Bash must now restart your computer.
My grandmother used anecdotal evidence all the time, and she lived to be 120 years old.
http://www.nec.de/hpc/hardware/sx-series/index.html
There are four PDFs there; the brochure is a four-colour glossy, but there is some real information. Sadly, the interesting-looking white papers are for the SX6, two generations earlier.
SX9 summary: 65nm technology, 3.2GHz clock speed, eight vector elements handled per cycle with two multiply and two add units, which is where the 102.4Gflop/CPU figure comes from. 16 CPUs in a box about the size of a standard 42U rack.
Totally absurdly fast (ten 64-bit words per cycle per CPU) access to a large (options are 512GB or 1TB) shared main memory; absurdly fast (128GB/second) inter-node bandwidth.
May contain traces of nut.
Made from the freshest electrons.