Japan's Newest Linux Supercluster: 13TB RAM

← Back to Stories (view on slashdot.org)

Japan's Newest Linux Supercluster: 13TB RAM

Posted by timothy on Wednesday November 3, 2004 @04:17AM from the hey-is-that-DDR-you've-got-there dept.

green pizza writes "Following its sale of a 10240 processor cluster to NASA, Silicon Graphics Inc has announced that it's supplying a 2048 processor Altix 3700 Bx2 to the Japan Atomic Energy Research Institute. Aside from running Linux on Itanium2 processors, the beast also features 13 TB of RAM!"

4 of 163 comments (clear)

Min score:

Reason:

Sort:

Nuclear research by Big+Nothing · 2004-11-03 04:26 · Score: 5, Informative

The puter will be used for nuclear research (bushspeak: nucjular reesatch) by the Japan Atomic Energy Research Institute. More info about the organisation, their projects, etc. can be found at: http://www.jaeri.go.jp/english/index.cgi.

--
SIG: TAKE OFF EVERY 'CAPTAIN'!!
Re:bottleneck by amorsen · 2004-11-03 04:39 · Score: 5, Informative

The whole point of Altix is that it's a single system image, not a cluster. Every processor can access all 13TB. That doesn't mean communication is free, of course, but it's vastly faster than your favourite Beowulf cluster.

--
Finally! A year of moderation! Ready for 2019?
Luckily by bmajik · 2004-11-03 04:45 · Score: 5, Informative

SGI has been working through this in hardware for over 10 years.

The distributed shared memory concept of the Altix (first seen on Origin 200 / Origin 2000 in the commercial space, and previously based on the Standford DASH/FLASH projects) uses a hardware based memory router.

Each PE has local ram and local CPUs and a "MAGIC" chip that routes cache invalidations, memory block "ownership", etc messages to other PE's as necessary. Unlike SMP designs, cache coherencvy doesn't destroy the whole shebang because its not a shared bus, it's a heirarchial directory system. I.e. PE0 knows it only needs to contact PE3, PE6, and PE13 to invalidate a cache block. Turns out that thats much more efficient than broadcasting a message to PE0-PE63 saying "invalidate this block!"

Now, as far as _all_ processor sharing the full 13TB - i am not sure.

The memory density / system image equation is sort of a tradeoff, as more PE's require more router hops in the topology. More router hops increase latency. SGI has sold 256 and 512p single-image systems, and may have gone up to 1024 or 2048p / system.

To be perfectly honest, the system-system latency is different than the intra-system latency, but nothing like it would be on an x86-with-ethernet shared nothing cluster.

SGI's big installations are cool as they have advantages of both SMP and MPP designs.. each autonomous machine gives you signle-image benefits but with really high proc counts.. . and then you link a bunch of those together to get this outrageously sized machine.

--
My opinions are my own, and do not necessarily represent those of my employer.
Not the largest memory capacity by Anonymous Coward · 2004-11-03 04:51 · Score: 5, Informative

Sorry to spoil the excitement for everybody but actually, Columbia far exceeds the Japanses system's memory capacity at 20 TByte. See this description for details of Columbia's config.