Google's Tensor Processing Unit Could Advance Moore's Law 7 Years Into The Future (pcworld.com)

← Back to Stories (view on slashdot.org)

Google's Tensor Processing Unit Could Advance Moore's Law 7 Years Into The Future (pcworld.com)

Posted by BeauHD on Wednesday May 18, 2016 @12:10PM from the Moore's-Law dept.

An anonymous reader writes from a report via PCWorld: Google says its Tensor Processing Unit (TPU) advances machine learning capability by a factor of three generations. "TPUs deliver an order of magnitude higher performance per watt than all commercially available GPUs and FPGA," said Google CEO Sundar Pichai during the company's I/O developer conference on Wednesday. The chips powered the AlphaGo computer that beat Lee Sedol, world champion of the game called Go. "We've been running TPUs inside our data centers for more than a year, and have found them to deliver an order of magnitude better-optimized performance per watt for machine learning. This is roughly equivalent to fast-forwarding technology about seven years into the future (three generations of Moore's Law)," said Google's blog post. "TPU is tailored to machine learning applications, allowing the chip to be more tolerant of reduced computational precision, which means it requires fewer transistors per operation. Because of this, we can squeeze more operations per second into the silicon, use more sophisticated and powerful machine learning models, and apply these models more quickly, so users get more intelligent results more rapidly." The chip is called the Tensor Processing Unit because it underpins TensorFlow, the software engine that powers its deep learning services under an open-source license.

19 of 86 comments (clear)

Min score:

Reason:

Sort:

Probably not advancing Moore's law by starless · 2016-05-18 12:32 · Score: 4, Insightful

Moore's law relates to the number of components in an integrated circuit.
I really doubt these things put more transistors onto a piece of silicon.
1. Re:Probably not advancing Moore's law by AchilleTalon · 2016-05-18 12:44 · Score: 3, Insightful
  
  You are right. This is bullshit. He didn't mention anything about the density of transistors. So, since this is a specialized chip, the performance claim cannot be compared to a general purpose chip.
  
  --
  Achille Talon
  Hop!
2. Re:Probably not advancing Moore's law by dbIII · 2016-05-18 13:50 · Score: 3, Insightful
  
  Also from the summary:
  
  "which means it requires fewer transistors per operation"
  So as you suggest it's definitely a misuse of the term used to roughly describe part of Intel's roadmap when Moore was there.
3. Re: Probably not advancing Moore's law by WarJolt · 2016-05-18 18:17 · Score: 2
  
  Many people confuse Moore's law and Dennard scaling. Dennard scaling is dead. We can still etch smaller transistors. We have trouble dissapating the heat, so even if we have more transistors most of it has to be dark.
  Once you boil it down to the math they are doing mostly giant matrix multiplies. Optimizing a particular type of computation is not really related to transistor density or power dissapating. What has evolved is our understanding of algorithms.
Re:Slashdot: Google's no cost PR agency by SnowPickles · 2016-05-18 12:47 · Score: 2

It's because today was Google I/O, Google's developer conference, so a lot of projects were announced. Still, Slashdot could have combined all these into story with links to details rather than spam us with a ton of Google stories. However, this TPU project might be the most interesting thing to come out of the conference. Not because the chips are novel (it's just the same principles as GPUs but taken to a further extreme), but because it sounds like Google's getting into low level chip manufacture. We'll have to wait and see if Google can deliver more FLOPS per dollar/watt than the leading co-processor manufacturer, NVIDIA.
Yeah, like DSPs... by RyanFenton · 2016-05-18 12:57 · Score: 5, Interesting

Specialized processing chips have been several 'generations' ahead in terms of processing per dollar for many decades. In the 90's at least, DSPs were doing audio/video processing much cheaper by performing many machine-level steps simultaneously in one 'cycle' with less power than a general processor, by leaving out the features and cost of a general processor. And all you had to do to use them was test them on a hardware emulator, flash them, then pop them into production test run until you were good enough to deploy. Depending on the chip, they could run on a trickle of power, without active cooling, and match a much most costly general chip for pennies.
I mean, it's how we got cell phones, and LOTS of other things, including most things in a computer that aren't the CPU.
But isn't Moore's law more about transistors per unit cost, rather than performance per cost? Seems like a fundamental misunderstanding in the headline... which seems about as common as specialized chips in modern technology.
Ryan Fenton
1. Re:Yeah, like DSPs... by peragrin · 2016-05-18 13:43 · Score: 2
  
  True, our knowledge of physics has limited moose's law. once we get into quantum scale computing though it should have another good run.
  
  --
  i thought once I was found, but it was only a dream.
2. Re:Yeah, like DSPs... by RyanFenton · 2016-05-18 15:02 · Score: 3, Informative
  
  Linky
  "The complexity for minimum component costs has increased at a rate of roughly a factor of two per year. Certainly over the short term this rate can be expected to continue, if not to increase. Over the longer term, the rate of increase is a bit more uncertain, although there is no reason to believe it will not remain nearly constant for at least 10 years."
  - Gordon E. Moore, April 19, 1965
  It's both cost and density, and continued to be so as it crystallized into the transistor doubling every 18 months figure. To double the density, only at exorbitant cost would not really be an increase in accessible technology. It's not just the technology being invented and monopolized, but being rolled out and usable by the entire field. Increasing computational complexity is still the most important part - but cost has always been a part of the idea too.
3. Re:Yeah, like DSPs... by konohitowa · 2016-05-18 16:20 · Score: 3, Funny
  
  Perhaps is time for squirrels law, no?
Re:Too superior for humanity by Anonymous Coward · 2016-05-18 12:57 · Score: 3, Insightful

It's easy to disparage the efforts of somebody trying big things when they don't go as planned isn't it. But they are trying and the next step they take could be that 'big thing'. What have you done lately?
Tensor Processing Units not new by Shinobi · 2016-05-18 13:54 · Score: 4, Informative

Tensor Processing Units are not new. SGI used to offer that for their Octane, aimed pretty heavily at the satellite image analysis crowd.
1. Re:Tensor Processing Units not new by jasnw · 2016-05-18 17:30 · Score: 2
  
  I strongly suspect that Google's "Tensor" is not the same as a mathematical tensor, which is what the SGI chips were working with. This use of the work smells more of marketing than mathematics. As in "OMG, Google is using TENSORS!!! to do their AI language processing. TENSORS dammit!!!"
2. Re:Tensor Processing Units not new by Namarrgon · 2016-05-18 18:25 · Score: 3, Informative
  
  Right, this isn't a general-purpose DSP but a custom ASIC designed to run their TensorFlow graphs efficiently.
  
  --
  Why would anyone engrave "Elbereth"?
3. Re:Tensor Processing Units not new by ceoyoyo · 2016-05-19 00:20 · Score: 2
  
  A tensor is an array, possibly with dimension > 2 if you want to be picky. TensorFlow absolutely does use tensors.
Re:Amazing... NOT!! by msauve · 2016-05-18 14:10 · Score: 2

What's amazing (or maybe not, based on their history) is that Google doesn't know what Moore's law says.

"order of magnitude better...performance per watt for machine learning. This is roughly equivalent to fast-forwarding technology about seven years into the future (three generations of Moore's Law)"

Uh, no. Moore's law says nothing at all about performance. It speaks to the number of transistors. It was Dave House who predicted a doubling in performance every 18 months (Moore predicted doubling transistor counts every 2 years). Both were based on the size and/or speed of transistors, neither took changes in processor architecture into consideration.

--
"National Security is the chief cause of national insecurity." - Celine's First Law
Re:Amazing... NOT!! by harperska · 2016-05-18 14:34 · Score: 2

Most people who aren't chip architects don't really care one way or another about transistor density, other than that it was a convenient proxy for performance, the frequent doubling of which ordinary people do care about. Now that transistor density has largely hit a physics wall, perhaps we need a new term for the projected trajectory of performance that would have continued had physics allowed transistors to be infinitely small, which engineers are attempting to satisfy by coming up with novel architectures instead.
Re:Google! YEY! by Anonymous Coward · 2016-05-18 15:16 · Score: 3, Informative

Google I/O is happening at the moment.
TPU sounds familiar by FreshnFurter · 2016-05-18 21:12 · Score: 3, Interesting

Was that not something introduces about 20 years ago by Silicon Graphics? Or am I getting old
http://manx.classiccmp.org/mir...
Re:Amazing... NOT!! by flargleblarg · 2016-05-19 04:30 · Score: 2

Nope. People do care about density, which [...]
haha, you're cute.
Go ask 50 random people on the street if they care about transistor density and report back to us.