10GHz Processors And Moore's Law
AntiFreeze writes "There is an interesting story on MSNBC about Intel's attempts at producing chips capable of running at faster than 10 gigahertz. There was a previous /. article in early December about this here. This article from MSNBC is much more detailed (both technically and non) than the original article referenced from December, and provides a very intriguing look at what Intel's planning to do over the next four years, and what they'll have to show the general public as soon as April 1st. And as always, there's the heated /. argument about Moore's law buried in there, too."
While everything is obviously shielded, it is still amusing to speculate on the cooking potentials of the insides of your PC.
What is more worrisome is the problem of heat. I recall reading someplace that right now a typical processor runs the energy of a 60 watt light bulb through that piece of ceramic.
When we multiply this with the frequency shifts and the number of transistors, it becomes worrisome.
I occasionally have visions of computers glowing like a flying saucer [smile]
"It is a greater offense to steal men's labor, than their clothes"
Better off with multiple slower CPUs, like 1.5 GHz and Beowulf them. More machines to take care of, but better than rushed/poor fabbing of CPUs. Plus you get redundancy and almost unlimited scalability. And ungodly bandwidth if you use gigabit cards instead of just 100bt. It's the way to go for pretty much everything unless you have something custom for one cpu (which is rare these days)
Oh, but Murphy's law has been experimentally tested by many people many times. It is most definitely a law of physics :-)
Better off with multiple slower CPUs, like 1.5 GHz and Beowulf them. More machines to take care of, but better than rushed/poor fabbing of CPUs. Plus you get redundancy and almost unlimited scalability. And ungodly bandwidth if you use gigabit cards instead of just 100bt. It's the way to go for pretty much everything unless you have something custom for one cpu (which is rare these days)
Actually if you are going to have a system of highly interconnected cpu's like in a beowulf cluster then you are limited fairly severly in scalability. This is mostly due to the size of the memory bus. Even if you move up to gigabit ethernet cards the bus is a big limiting factor.
Secondly the class of tasks that a cluster is useful for is not that big. It does nothing towards making a really bloated program run any faster. They are not very good for real time tasks because once you have chopped up a problem and distributed it to all of the processors you have very little time to work on it and get the results back in time.
While very useful the cluster is not likely to be the solution to potential end of Moore's law like growth.
When I want your opinion I will beat it out of you.
Ok, so what happens when we hit a practical mile-stone? Will faster general purpose CPU's achieve such a limit that it costs 10 times as much to achieve 10% performance gain?
Here are the alternatives. Get away from pipelining (which is a hack that facilitates ever-increasing clock-speeds).. Return to optimized and specialized adders / multiplers, etc. Now that we make things in parallel with 2 - 4 adders, simply produce CPU's with 24 adders, each with no inter-vening pipeline buffers.. The number of transistors significantly goes down for each adder, and through the use high conductive materials (such as diamond) you can achieve large surface area chips. (This assumes that you take on the reverse of existing P4's.. You have the control log and memory interfaces running at 10GHZ while your adder runs at say 100MHZ, which each gate switching with nearly 1/20GHZ probagation delay)
Step two is even more obvious. Specialized hardware.. In the video world, we have only to compare software OpenGL to hardware OpenGL. specialized hardware is monumental because it's the ultimate parallel algorithm. Those algorithms such as MFC, or possibly even OS calls could be hardware controlled.. Granted it makes upgrades a lot harder, but don't we find ourselves spending the money on new video cards every year and a half now? How often does someone upgrade winNT? It already costs $150 for the OS upgrade, what's an additional $50 for the PCI / adaptive AGP card?
To facilitate smoother transitions, I think that FPGA or ASICS might have a popularity explosion. As far as I know, they're still manufactured with huge gate-widths.. Bring an ASICs into the "10GHZ" range, and you have the potential for incredible performance.
In fact, the CPU as we know it might fade away into the anals of history over time. A return to cartraiges perhaps?
-Michael
-Michael
They'll have something to show on April 1st? Am I the only one who raised an eyebrow at this bit?
uh huh. They said the Scanning Tunnelling Microscope was impossible for exactly the same reasons, now people are building them at home.
How we know is more important than what we know.
"We expect to have the first full field-scanned images by April 1," said Chuck Gwyn, program director for EUV. ;)
Wouldn't happen to look like this would it?
-----
"Almost isn't good enough - but it's almost good enough."
-----
"Almost isn't good enough - but it's almost good enough."
-Me
Then why does almost every single linux company I know of (regardless of their field) have *at least* a 6-node beowulf cluster. It's not for SETI, my friend. Some folks need that power without having to get a crazy expensive Sun/HP/SGI/DEC/Aviion or with some performance-crippled 8-way xeon. If you BREAK UP the task, it works better. Gigabit is more than enough for databases, etc.
Like I said, there are certain tasks that a cluster is great for, but there are quite a few that it doesn't do you any good to have a cluster. This is mostly due to the fact that you have traded what amounts to very high memory latency for more available cycles.
Secondly there are some tasks that are very hard to break up into sub-problems and so it is very hard to apply a cluster to those types of problems.
A good way to know if a cluster will help solve a problem is to look at how much the processors must share data while working on that problem. If you can limit a processor to writing to a data space that other processors do not depend on then you have a problem that may be well suited for a cluster. If however the result space and the data space overlap then your memory bus can get easily swamped trying to keep everybody's memory up to date.
When I want your opinion I will beat it out of you.
Never say something is impossible. I learned this many years ago, when I said a certain putpixel routines was impossible to get any faster.
I was proven wrong.
Please read "Impossible for Dummies", this must brighten you up.
This is a replacement signature.
A more careful journalist would hopefully have written:
--
Blaming GW Bush for the Iraq war is like blaming Ronald McDonald for the poor quality of food.
Lowering the voltage has some good effects - the main one is that the power consumption drops as the square of the voltage (assuming Ohms law). However lowering the voltage causes everything to run slower. The old fashioned 4000 series CMOS chips were much faster at 15 volts than they were at 5 volts.
Chips get faster when they shrink because the capacitances decrease as the surface area of a conductor shrinks; cut the feature size by a factor of two in both directions and the capacitance is down by a factor of four. However there is another effect which occurs as everything shrinks; the insulation between features shrinks, and that shrinking feature increases the parasitic capacitance between the two features.
In the past the increase in capacitance caused by the thinning of insulators has not been a significant effect in limiting clock speeds but there comes a point where the effect does become important. In neurons the cell walls are so thin that the capacitance effects of the thin dielectric limit signal propagation speeds in the neuron to about 180 miles per hour or so. Long axons have thick sheaths to cut the capacitance and increase the signal propagation speeds.
This increasing capacitance with the decreasing dielectric thickness combined with the decreasing speed from the lowered voltages will eventually put an effective cap on the clock speed of silicon devices. The only big trick left in the book is too switch to Diamond based semi conductors - which are as much better than silicon than silicon was than germanium - and that will give us some more speed. Above a certain frequency Nature itself changes the way it does things. At RF frequencies bulk devices like crystals function - at the frequencies of light waves only atomic devices can switch from one state to another quickly enough.
In other words at some point in the near future we are going to reach a point where simple die shrinking won't be enough to crank up clock speeds any more. Enjoy things while they last - but another factor of a thousand increase in clock speed (Apple II one megahertz to present day one gigahertz) is going to be very difficult to achieve.
Are we going to have a party when we reach this milestone?