100 Million-Core Supercomputers Coming By 2018
CWmike writes "As amazing as today's supercomputing systems are, they remain primitive and current designs soak up too much power, space and money. And as big as they are today, supercomputers aren't big enough — a key topic for some of the estimated 11,000 people now gathering in Portland, Ore. for the 22nd annual supercomputing conference, SC09, will be the next performance goal: an exascale system. Today, supercomputers are well short of an exascale. The world's fastest system at Oak Ridge National Laboratory, according to the just released Top500 list, is a Cray XT5 system, which has 224,256 processing cores from six-core Opteron chips made by Advanced Micro Devices Inc. (AMD). The Jaguar is capable of a peak performance of 2.3 petaflops. But Jaguar's record is just a blip, a fleeting benchmark. The US Department of Energy has already begun holding workshops on building a system that's 1,000 times more powerful — an exascale system, said Buddy Bland, project director at the Oak Ridge Leadership Computing Facility that includes Jaguar. The exascale systems will be needed for high-resolution climate models, bio energy products and smart grid development as well as fusion energy design. The latter project is now under way in France: the International Thermonuclear Experimental Reactor, which the US is co-developing. They're expected to arrive in 2018 — in line with Moore's Law — which helps to explain the roughly 10-year development period. But the problems involved in reaching exaflop scale go well beyond Moore's Law."
Can't we just start calling this a 'supercore' or something? When the numbers get that high it kind of goes beyond what most people can visualize. Like describing how hot the Sun is....let's just says it's "exactly 1 Sun hot".
I judt got a nre Kinesis keybiartf so please excusr ant egregiou typos.
Wait, what? You lost me. Are you from the future? How can you describe the state of the art as "primitive"?
-Peter
my old trusty VIC20 ftw
Can You Imagine a Beowulf Cluster of These?
The CFL condition that limits the maximum time step one can take shows no sign of relenting. Score has been Courant (the C in CFL) 1, Moore 0 for the last three decades.
sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
All this effort at creating parallel computing ends up solving very few problems. HPC has been struggling with parallelism for decades, and no easy solutions found yet. Note that these computers are aimed at solving a particular problem (e.g. modeling weather) and not at being a vehicle to quickly solve any problem. When the comparable multi-processing capacity is in your cell phone, what are you going to do with it?
2B|^2B
Technically, shouldn't 640K processors be enough for every one?
sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
What's "Portland, Ore"?
Oh, you mean "Portland, Oregon"? This is a website. It isn't fucking Twitter or SMS, how hard is it to write three more letters?
We know that Slashdot is U.S.A.-centric so we'll forgive the missing "Portland, Oregon, U.S.A." part, but for crying out loud, at least write the whole state name.
The exascale systems will be needed for high-resolution climate models, bio energy products and smart grid development as well as fusion energy design.
Sounds like a pork program. What are "bio energy products", anyway. Ethanol? Supercomputer proposals seem to come with whatever buzzword is hot this year.
It's striking how few supercomputers are sold to commercial companies. Even the military doesn't use them much any more.
It's interesting that 4 of top 5 supercomputers are running AMD, while 402 of the Top500 are running Intel.
What's the cause of this? Value? Energy-saving? Performance?
We know what answer it is going to give. 42. Save the money.
sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
The first Jaguar was a single megaflop.
We're still at the point where unthreaded languages (like PHP) are still viable. For example, we use PHP in a complex, multi-server, multi-core cluster, and it's "share nothing" approach scales quite nicely, in that having more and more users hitting the systemm on separate servers doesn't really cause a problem, since there's virtually no cross-communication going on.
But there's a scalability limit in what you can do "PER PROCESS". There are some very processor intensive functions that simply take a while to do (such as rendering a 100 page report, then converting to PDF) and there's currently no way to spread the load in PHP beyond a single core.
At the other extreme, we have almost the same problem - having such a large number of cores that resources commonly shared among threads and processes is really no longer feasible.
Languages like Erlang have a "shared nothing" approach, but not at the process/thread level, but at the function level. Individual functions within a process are themselves "share nothing" and thus can easily scale across multiple cores, processors, and servers in a networked cluster. (at least, this is the theory)
So how 'bout it, folks? Where are the benchmarks showing how languages DESIGNED to take advantage of parallel processors and clusters actually scale up in the real world? Is Erlang the cat's meow when discussing systems of this scale?
I'm not expecting to see my example process (100 page PDF reports) scale up smoothly to 250,000 cores, but I sure would like to see it scale up smoothly to a dozen or two!
I have no problem with your religion until you decide it's reason to deprive others of the truth.
that will settle who 'owns' what forever.
it's newclear powered, user friendly, & completely bug free, as well free to use, forever.
Maybe this thing will have enough power to run Windows by 2018??
"Before God we are all equally wise - and equally foolish"
Albert Einstein
When the comparable multi-processing capacity is in your cell phone, what are you going to do with it?
Stream high definition porn... duh.
How many cores do we need to simulate a human brain?
The question is: will it be enough to run Aero?
The fastest system only has 224k Cores? Oh, dear. We definitely need bigger systems, then. :D
And I suppose Deep Thought has nothing to worry about yet, either. Yet.
(The fictional version, that is. The "real" one has already been outdone by Rybka.)
Yup. 100-million core driven Second-Life server, like the Matrix.
New Economic Perspectives
Wow, not just one million-core supercomputer, but 100 of them?
I am currently accepting investors to help build a one billion core supercomputer to create high resolution climate models that take into account the waste heat from a 100 million core supercomputer making a high resolution climate model.
(Seriously, how much heat is that thing going to put out?)
Note that these computers are aimed at solving a particular problem (e.g. modeling weather) and not at being a vehicle to quickly solve any problem.
That's not entirely accurate. HPC systems are designed to solve a class of problems. That's not the same thing as a "particular" problem. Jaguar has, in fact, solved many different problems, including fluid flow, weather, nuclear fusion and supernova modeling. It's not going to run Word any faster than your PC but that's not what you buy a supercomputer to do.
They should obviously start working with the Mandlebulb people..
Is this going to be the new processor requirement for running Flash in a web browser?
Take Nobody's Word For It.
These computers will be good for solving problems involving lots of independent operations. A processor can process one operation at a time, but since these operations do not depend on each other, the operations can be sent to several processors at once. Imagine a big foreach loop, for example.
Or maybe someone will want to have 100 million Google Chrome tabs open.
I'm personally imagining putting dnetc on one of these things.
I'm still waiting for that 10GHz Pentium Intel promised for 2004.
Parallel computing is great for solving NP-Complete problems. If you have enough cores for every possible solution you can have all possible paths process at the same time and compare the results.
That's not entirely accurate. HPC systems are designed to solve a class of problems. That's not the same thing as a "particular" problem. Jaguar has, in fact, solved many different problems, including fluid flow, weather, nuclear fusion and supernova modeling. It's not going to run Word any faster than your PC but that's not what you buy a supercomputer to do.
So you're saying that OpenOffice would still take forever to start.
Republican reign for the next 1000 years after Obama fails.
Yours In Moscow,
Kilgore T.
if only they had built 1,000,001!
...Such as, say, FPGAs (www.maxeler.com) or GPUs (www.ati.com) ...
One such accelerator card can replace ~ 100 cores for common applications such as finite differences or MonteCarlo
Hence you would need "only" a million blades.
Made by sharks with frikkin lasers on their heads.
Is it just me or does this news message just shout Dr. Evil?
What's it like to be a primitive caveape in the modern world?
IBM's design goal for an exascale system is to limit it to 20 megawatts of power ,,,
Just keeping that sucker cooled will contribute to global warming. I hope they're going to use all that waste heat for something.
To put this into perspective, it would take over 4 and a half million bowls of Super Colon Blow to equal the computation power of just 1 of these things!
"But the problems involved in reaching exaflop scale go well beyond Moore's Law."
The above quote shows quite well that the writer doesn't understand what Moore's Law is about.
Check Wiki about "thinking machines", "transputer" and if you have more than 1 CPU/Core, launch a game and see if all cores used effectively without needing massive additional work from game publisher.
Technology is primitive, even a billion processor machine doesn't save it from being primitive. It is the software at least.
"Portland, Oregon, U.S.A., Earth, Milky Way, Cluster TXH-170718, Universe 01 (we think)"
I'm always wary of making an infamous "50 MB of memory is all you'll ever need" type of claim, so I like to believe that we'll figure out how to use greater processing power by the time it gets here. We haven't had too much trouble with that so far. As far as actual use, if we ever get products like Morph (http://www.youtube.com/watch?v=IX-gTobCJHs), there might be a need for massively parallel processing. At the very least, such computing power would likely be needed to make such products.
My webcomic
if speed is the goal, then it needs to be a power of two number of cores so that you don't have to implement logic checking for a valid core address. That logic would eat performance from every action performed by the machine. So, until you develop affordable decimal logic hardware implementations that can scale in size the way the binary logic does, we're gonna keep making computers that work fast the way we do now and it's gonna involve powers of 2. And get off my lawn.
"If still these truths be held to be
Self evident."
-Edna St. Vincent Millay
just had an ugly thought...."Windows17 for PC (Personal Cloud)"
HA! I just wasted some of your bandwidth with a frivolous sig!
Come on, that's just silly. I can understand why we might a few million-core supercomputers, but who would need 100 of them?
Bow-ties are cool.
Low latency, high bandwidth interconnect that can mesh 100 Million cores: The Next Big Problem in computer architecture.
Can I just say... FUCK YES. Thank you!
As someone who grew up in the Portland (Maine) area it annoys me to no end when people talk about things in "Portland" and neglect to disambiguate - especially when they're talking about the other Portland. :)
Bow-ties are cool.
Yes. So by extending it to a million core machine, OpenOffice would take million x forever* to load if one instance is opened per core.
*Forever = two seconds after a mouse click.
Face your daemons!
To which Oak Ridge National Laboratory replied "Fuck everything, we're doing 500 million cores."
I take it THIS is a machine that might run Vista well. Too late SP3 aka Windows 7 is out.
These posts express my own personal views, not those of my employer
Stream it? With that much processing power it should be able to create it on the spot: "Computer, let's start today's scenario with Angelina Jolie surrounded by...."
Any sect, cult, or religion will legislate its creed into law if it acquires the political power to do so.
If you think massively parallel supercomputers are like throwing logs on a fire, you should also think rocket motors are like throwing more fuel on a fire. Both are abstractly about the same goal of condensing more 'reaction' into a single motor, and both are devilishly complex in all the details which distinguish them from scaling a camp fire to a bonfire. Scaling up manufacturing and delivery of the fuel units to all pack consistently into one system which can be controlled in practice, while avoiding issues like uncontrolled resonance, heat dissipation...
Yeah, I can see it..
You can close your eyes to reality but not to memories.
Probably will need a fusion plant to power and cool the thing. But still sounds awesome. They briefly mention data/memory flow issues, but don't really address it. It is getting to the point where data flow will be as important as processing power, especially as you have escalating processors. You can run as many operations as you want, but if it can't be delivered somewhere useful, then they are wasted. I am also very interested on how the overhead will be managed when this many processors are involved. Multi-processors are not quite 2x (or 4x, 8x, etc) than just one processor due overhead, and even a really specialized scaled OS & I/O system won't be able to overcome this many processors.
Now for some fun:
It could probably power real time rendering of a Beowulf cluster of Natalie Portmans in grits while making us submit "All Our Base" to our new "Insert-Here Overlords". (Did I miss any?)
Vote monkeys into Congress. They are cheaper and more trustworthy.
Q: Is there a God?
A: There is now!
Let’s put this into perspective: Intel’s next generation of chips are estimated to have ~2 billion transistors (release in 2010 or 2011). Moore's Law would predict that chips would have ~32 billion transistors in 2018. That’s an estimated 3.2E+18 transistors spread over the 100-million cores
That’s 32,000,000 times the number of neurons in the human brain.
Department of energy?
Mapping weather systems?
Cracking high bit encryption schemes? Listening to every phone call happening on the planet and mapping social patterns?
BORING!
No, I want to see a 100 million core supercomputer render one of those 3D "Mandelbulbs" and let me do some real-time exploring with a VR helmet.
Now THAT would be a worthy use for such resources!
That and being able to grow virtual beings from DNA samples.
-FL
All your cores are belong to us!
when these exascale systems start asking questions and/or making demands?
Parallel computing is great for solving NP-Complete problems. If you have enough cores for every possible solution you can have all possible paths process at the same time and compare the results.
That's tough to manage when the number possible paths grows exponentially with respect to the input size.
Oak Ridge National Laboratory, home to the world's fastest supercomputer, does a lot of work for national security. At the labs housing the top ten supercomputers, at least five do weapons and defense research. And that's just what the public knows about. I would be shocked if there weren't similar supercomputers working on intelligence and classified projects.
Even if the computers aren't stamped with "U.S. Army", the military does indeed use many of them. The wonderful side effect of their push to simulate things like aging nuclear weapons is that it helps develop the technology for peacetime purposes like renewable energy and pharmaceuticals.
How many problems can these systems really solve?
Well, only the ones where you need to conserve mass, momentum and energy; pretty niche market really...
I am sure Stalker SOC will still crash.
-Oz
But even when choosing a data type for money, an app designer still needs to know base 2 to find the appropriate type in MySQL capable of holding amounts up to $327.67 (smallint), $83,886.07 (mediumint), over $21 million (integer), or more money than the gross world product (bigint).
A 100 million cores supercomputer will solve a 26 city traveling salesman problem. Frankly, if your salesman needs to visit more cities than that, you could probably gain more by re-targeting your sales to larger markets.
"Who is the Journal of Quantum Physics going to believe?" --Stephen Hawking
If each core could model one small patch of skin on one of those busty 3D models, then, well, ... wow!
Sorry but throwing more processing power at crap software doesn't help... Microsoft realised that recently. :P
Exascale computing may seem mind bogglingly implausible at first glance, but one forgets that logic switch density goes up with the square of the process size reduction. A 1000-fold increase in computing is merely a 10x reduction in process size. Intel seems confident silicon can approach this, although it may be the realm of graphene and nanotubes.
1997/8 The first teraflops class supercomputers. We now have 32-45nm silicon.
2008/9: First petaflops class supercomputers. Today, teraflops computing is available in your desktop. A single $100 800 core GPU is theoretically a match for the 1997 #1 supercomputer.
2018/19: A single $100 ASIC should be capable of a petaflop. 3-4nm would be required to keep pace. Enter the era of exascale computing.
Oddly Moore's law detractors have been so consitently wrong, the burden of proof is now on the critic.
After logging in slashdot still does not take you back to the page you were on. It's been that way for 20 years.
can it run Crysis???
...what might happen if we could run a copy of The Sims on a truly massive supercomputer. It would need to be somewhat customised for that particular machine/environment, of course, but I think it could be interesting.
There were times when I did see something close to genuinely emergent behaviour in the Sims 2, or more specifically, emergent combinations of pre-existing routines. You need to set things up for them in a way which is somewhat out of the box, and definitely not in line with real world human architectural or aesthetic norms, but it can happen.
Makes me think; if we could run the Sims, or the bots from some currently existing FPS, parallel on a sufficiently large scale, we might eventually start seeing some very interesting results come from it, at least within the contexts of said games.
Average human lifespan is 80 years. Assuming the author is roughly 30, then the author need only fear being ridiculed for underestimating the future for the next 50 years. Assume Moore's Law continues to hold and assume that scheduling problems constrain individual motherboards to 16x16 cores (16-way SMP is your limit, it's hard to imagine hardware inside the CPU is going to be any easier than hardware outside the CPU).
This means a desktop system in 50 years time can realistically expect to be limited to the equivalent of 256 cores that are each running at 100 terahertz. It won't be quite that architecture, but it should have that level of power.
It seems very wise to be scoffed at by a few people now and then hailed as a Visionary in his old age. It's not like anyone would give him anything now for being accurate, but Visionaries get their own TV shows, awards, endorsements - serious cash!
Far, far better to be a Visionary.
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Easy solutions =have= been found, but then Inmos was sold off. The mice were furious.
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Nice reference.
What are these computers used for, though? Weather simulations? Building bombs?
Designing nuke reactors, designing aircraft (transsonic flows are nasty, hypersonic is a nightmare), designing bombs, weather simulations (or so they say), designing racing cars (almost all Formula 1 teams own or rent supercomputer time and Bloodhound - the 1000 MPH car - can't be designed any other way), processing data from particle accelerators (just collecting terabits of data per second isn't easy), designing new generations of microprocessors and pattern analysis in genetics.
Of these, I'm most familiar with processing data from accelerators. The work I did at Daresbury (20 years ago, back when there was still an SERC) involved collecting data from a very tiny accelerator. A mere 20 MeV. The design was over-specced by a bit (they wanted to be able to handle 2^65536 32-bit words of data) but the system really did max out both the processing and networking capabilities that existed at the time. And Eurogam was a small, small project by nuclear physics standards. The LHC, if they can ever keep it running, will be logging terabits of data per second. The European high-performance grid exists because the alternative would be convoys of 40' trucks hauling SANs. And you know how the French drive.
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Make a Beowulf cluster of these and violla, simulated brain!
Tsukasa: All I really want, is to be left alone...
Chances are just like 10 years ago, the architecture won't even be the same. This is useless, 100 million cores are only needed by google for its porn searches.
"They confiscated everything, even the stuff we didn't steal!"
If you are fighting about 1000 vs. 1024 cores, you haven't got enough of them yet.
10^8 cores isn't that much.
Human body: 10^14 cells, 10^11 neurons, 10^14 synapses.
It would be enough to simulate a brain maybe if each core simulated 1000 neurons and it is interconnected as well as a brain. Basically if it's a brain.
You could simulate a brain at 1000 neurons per core but it has to be cheap enough, small enough, low enough power consumption and dissipation, well enough interconnected and - okay basically you have to have a brain.
It would be very useful in biology, though even at the recent petacomputer discussions there was question about whether data should really be stored, it is so expensive to do so. Ideally you would put a drop of blood in and the data would be driven in real time through the system, which would
The problem is data storage. I was in a seminar about the petacomputer being built in Japan. The people were saying that there is a real question about whether data should be stored and how.
If each core consumes 10 watts, which is small for an Intel system, you are really talking about a gigawatt then. Fortunately, this is a the same problem as mobile computing, and innovation is converging to solve both.
They want their Matrix fan back...
Tired of all the isms, don't exploit people as an employer, or a government, mmmmK?
That settles it. I'm NOT purchasing any new hardware until 2018.