Intel Squeezes 1.8 TFlops Out of One Processor
Jagdeep Poonian writes "It appears as though Intel has been able to squeeze 1.8 TFlops out of one processor and with a power consumption of 62 watts." The AP version of the story is mostly the same; a more technical examination of TeraScale is also available.
It's quite fun to consider that when the original joke was made, the processing power of that Beowulf cluster would probably been quite close to the processing power of the processor discussed in the article.
Nothing is impossible. We just haven't quite worked out how to do it yet.
Does this permit the practical use of any truly breakthrough apps?
Does it suddenly make previously crappy technologies worthwhile? I.e., does image recognition or untrained speech recognition become a mainstream technology with this new processing power?
The first thing that jumped out at me was the presence of MACs. They are the heart of any DSP. So, this chip is good for computation although not necessarily processing. As other posters have pointed out, this chip could become a very cool GPU. It should also be awesome for encryption and compression. Given that the processor is already an array, it should be a natural for spreadsheets and math programs such as Matlab and Scilab. Having a chip like this in my computer just might obviate the need for a Beowolf cluster. :-)
It is entirely not true that you could replace today's fastest computer with this kind of technology and get the same performance. These new Intel CPU's are really difficult to program efficiently. You would only get good performance on certain problems sets.
Ray tracing is embarassingly parallelizable, and while I'm no expert, two terraflops might just be enough calculating power to do a pretty good job at scene rendering, maybe even in real time. To think this performance would be available from a standard 65nm die that uses 65 watts... that really could make a difference to gamers!
This clearly isn't for CPU's. It's for building GPU's and more importantly for intel get a part of the huge growing market demand for general purpose programming on GPU's. We'll have to call them something other than GPU's in 5-10 years as they'll do all sorts of other jobs too.
s crete_gpu_return).
IBM saw this coming and went with the Cell, AMD saw this coming and bought ATi, NVidia already has a card that has all these shader units. Intel would be stupid not to respond. They've already admitted a discrete GPU part is on the way (http://www.reghardware.co.uk/2007/01/23/intel_di
Only the other day there was a story (either the register or inquirer that's AFAIK has been now deleted...) about their GPU part being a whole chunk of in order x86 parts on a chip. Pieces of the jigsaw are slotting togheter. Makes programming GPGPU stuff easy for many. Intel want to move x86 architecture onto GPU's.
Ah well, I wonder when we'll get that story confirmed. Intel are clearly up to something... I think we'll know what shortly. All in all it spells trouble for NVidia as being left out of the CPU part of the equation with Intel, AMD and in some respects IBM all with combo's.
Anon because I've signed way too many NDA's...
Because it doesn't take special problem sets and programming on the current supercomputers?
My blog. Good stuff (when I remember to update it). Read it.
...is a version of the Sims 2 rewritten so that the Sims have a much greater degree of genuine autonomy, and for said version to be run without human intervention (and recorded) for a period of months or years on a multiple TFlop system. If the environment was made a lot more detailed than it is in the retail version of the game, and if the Sims were given somewhat more capacity for learning than what they've currently got, something tells me the results of such an experiment might be extremely interesting, given enough time.
Or more like the T9s... So the 32way crossbar switch, with 32 processors that I have working in the garage is coming back into fashion... Now if all the work that we did on interconnect topologies and their performance in networks up to size 1024 nodes might be useful. Hey we might even make something from the book!.... Welcome back to the late '80s Intel - do yourselves a favour - read the literatature - we've done the painful stuff already - you don't need to waste money on the fundemental research - its been done!