Slashdot Mirror


Google's Tensor Processing Unit Could Advance Moore's Law 7 Years Into The Future (pcworld.com)

An anonymous reader writes from a report via PCWorld: Google says its Tensor Processing Unit (TPU) advances machine learning capability by a factor of three generations. "TPUs deliver an order of magnitude higher performance per watt than all commercially available GPUs and FPGA," said Google CEO Sundar Pichai during the company's I/O developer conference on Wednesday. The chips powered the AlphaGo computer that beat Lee Sedol, world champion of the game called Go. "We've been running TPUs inside our data centers for more than a year, and have found them to deliver an order of magnitude better-optimized performance per watt for machine learning. This is roughly equivalent to fast-forwarding technology about seven years into the future (three generations of Moore's Law)," said Google's blog post. "TPU is tailored to machine learning applications, allowing the chip to be more tolerant of reduced computational precision, which means it requires fewer transistors per operation. Because of this, we can squeeze more operations per second into the silicon, use more sophisticated and powerful machine learning models, and apply these models more quickly, so users get more intelligent results more rapidly." The chip is called the Tensor Processing Unit because it underpins TensorFlow, the software engine that powers its deep learning services under an open-source license.

86 comments

  1. Too superior for humanity by Anonymous Coward · · Score: 0

    so that they'll end up on the pile of broken dreams just like all those other amazing Google projects that we're going to save the world.

    1. Re:Too superior for humanity by Anonymous Coward · · Score: 3, Insightful

      It's easy to disparage the efforts of somebody trying big things when they don't go as planned isn't it. But they are trying and the next step they take could be that 'big thing'. What have you done lately?

    2. Re:Too superior for humanity by Anonymous Coward · · Score: 0

      I developed a TPU, standing for Thought Processing Unit, and planned 4 generations, back in the late 70s. Now it seems I can interface with one from Google. Therefore like printer drivers were among the first devices developed, what I want is a Google AI API made to measure for my own TPU driver. My own TPU continued to be developed on a daily basis and therefore provides a robust PCA interface.

  2. Amazing... NOT!! by Desler · · Score: 0

    What's supposed to be amazing about this? Isn't decades-old knowledge that you can make computations faster by reducing the precision and using faster approximation methods?

    1. Re:Amazing... NOT!! by Anonymous Coward · · Score: 1

      Except it's normally done on the algorithm level, not the chip level.

    2. Re:Amazing... NOT!! by JBMcB · · Score: 1

      How many working, deployed hardware implementations have there been?

      --
      My Other Computer Is A Data General Nova III.
    3. Re:Amazing... NOT!! by Anonymous Coward · · Score: 0

      Sure, but Google is leveraging their extensive experience at truncating things. Given that, I wouldn't bet against them.

    4. Re:Amazing... NOT!! by Anonymous Coward · · Score: 0

      Every GPU that uses half-precision floating-point vector units to increase FLOPS?

    5. Re:Amazing... NOT!! by msauve · · Score: 2

      What's amazing (or maybe not, based on their history) is that Google doesn't know what Moore's law says.

      "order of magnitude better...performance per watt for machine learning. This is roughly equivalent to fast-forwarding technology about seven years into the future (three generations of Moore's Law)"

      Uh, no. Moore's law says nothing at all about performance. It speaks to the number of transistors. It was Dave House who predicted a doubling in performance every 18 months (Moore predicted doubling transistor counts every 2 years). Both were based on the size and/or speed of transistors, neither took changes in processor architecture into consideration.

      --
      "National Security is the chief cause of national insecurity." - Celine's First Law
    6. Re:Amazing... NOT!! by harperska · · Score: 2

      Most people who aren't chip architects don't really care one way or another about transistor density, other than that it was a convenient proxy for performance, the frequent doubling of which ordinary people do care about. Now that transistor density has largely hit a physics wall, perhaps we need a new term for the projected trajectory of performance that would have continued had physics allowed transistors to be infinitely small, which engineers are attempting to satisfy by coming up with novel architectures instead.

    7. Re:Amazing... NOT!! by fustakrakich · · Score: 1

      So, you're saying this is the first time they are making computations on a computer?

      --
      “He’s not deformed, he’s just drunk!”
    8. Re: Amazing... NOT!! by Anonymous Coward · · Score: 0

      No, that's not what he is saying, at all.

      What he is saying is chips generally (except for some special things like certain A/D D/A converters) are generally designed to implement precision to a certain specification. Probabilistic methods are not otherwise hard coded into the logic. From what little I can see from the announcement, the algorithm embodied in the hardware here is probabilistic (which gets translated through the PR machine into 'allowing for errors').

    9. Re:Amazing... NOT!! by Anonymous Coward · · Score: 0

      There has been quite a bit of research in recent years into computing solutions that aren't fully accurate. Most of the time an off the shelf component is run at a lower voltage than required to produce fully accurate results. The power savings from doing this allows multiple devices (or more cores on the same device) to run in parallel whilst using same power that a single fully accurate device/core would consume granting greater parallelism within the same power envelope.

    10. Re:Amazing... NOT!! by colinrichardday · · Score: 1

      Wouldn't seven years be between 4 and 5 generations?

    11. Re:Amazing... NOT!! by instinct71 · · Score: 1

      This is BS. Yes, there have been research papers on approximate computing in HW but none of them can give error bounds for applications. Your errors reported will be only on the inputs you run, anything else and all bets are off.

    12. Re:Amazing... NOT!! by msauve · · Score: 0

      "Most people who aren't chip architects don't really care one way or another about transistor density, other than that it was a convenient proxy for performance"

      Nope. People do care about density, which provides the ability to condense useful functions which used to require a desktop sized computer or a large video camera onto a pocketable device like a smartphone. People don't care about performance. We've got computers a thousand times faster than they were 20 years ago, but they're only slightly faster to the user. Why? Because people simply don't care about actual performance. They're happy to trade it away to have gimmicky UI doo-dads on their screens.

      --
      "National Security is the chief cause of national insecurity." - Celine's First Law
    13. Re:Amazing... NOT!! by flargleblarg · · Score: 2

      Nope. People do care about density, which [...]

      haha, you're cute.

      Go ask 50 random people on the street if they care about transistor density and report back to us.

    14. Re: Amazing... NOT!! by Anonymous Coward · · Score: 0

      Your definition of "slightly faster" is different than everyone else.

      On that note, will stop reading further as we're nowhere near on the same page.

  3. IS thIS Google Advertising day? by Anonymous Coward · · Score: 0

    I only see four or five stories from them, so maybe not...

    1. Re:IS thIS Google Advertising day? by Anonymous Coward · · Score: 0

      Jew mad broceph?

    2. Re:IS thIS Google Advertising day? by Anonymous Coward · · Score: 0

      Google has a big conference going on, you can probably expect a couple more

  4. Probably not advancing Moore's law by starless · · Score: 4, Insightful

    Moore's law relates to the number of components in an integrated circuit.
    I really doubt these things put more transistors onto a piece of silicon.

    1. Re:Probably not advancing Moore's law by AchilleTalon · · Score: 3, Insightful

      You are right. This is bullshit. He didn't mention anything about the density of transistors. So, since this is a specialized chip, the performance claim cannot be compared to a general purpose chip.

      --
      Achille Talon
      Hop!
    2. Re:Probably not advancing Moore's law by Anonymous Coward · · Score: 0

      Yes, welcome to the new improved slashdot... just like reddit, but uh, more advertisements

    3. Re:Probably not advancing Moore's law by Anonymous Coward · · Score: 1

      Came here to say the same thing.

      Moore's law isn't about performance, it's about economy. But smaller components do allow higher clock speeds and more functionality, so performance tends to increase too.

    4. Re:Probably not advancing Moore's law by dbIII · · Score: 3, Insightful
      Also from the summary:

      "which means it requires fewer transistors per operation"

      So as you suggest it's definitely a misuse of the term used to roughly describe part of Intel's roadmap when Moore was there.

    5. Re: Probably not advancing Moore's law by Anonymous Coward · · Score: 0

      No, you misunderstand. What they are talking about is extending performance gains, which is often (that is, not their fault) called 'extending Moore's law'

    6. Re: Probably not advancing Moore's law by WarJolt · · Score: 2

      Many people confuse Moore's law and Dennard scaling. Dennard scaling is dead. We can still etch smaller transistors. We have trouble dissapating the heat, so even if we have more transistors most of it has to be dark.

      Once you boil it down to the math they are doing mostly giant matrix multiplies. Optimizing a particular type of computation is not really related to transistor density or power dissapating. What has evolved is our understanding of algorithms.

    7. Re: Probably not advancing Moore's law by Anonymous Coward · · Score: 0

      My hunch is that it is like a math coprocessor geared towards simpler floating point. Does a neural net need 8 bytes of precision? 4? Maybe they are using only 20 bits?

    8. Re: Probably not advancing Moore's law by Anonymous Coward · · Score: 0

      Which is actually better IMO.

    9. Re: Probably not advancing Moore's law by Anonymous Coward · · Score: 0

      multiplying giant matricies. Seem to remember that being called a Tensor. wonder why this shares the same name.

  5. Sing the praises by Anonymous Coward · · Score: 0

    of Teh G and ye shall live for eternity -- in HELL, in the mouth of The Beast.

    1. Re:Sing the praises by Anonymous Coward · · Score: 0

      Ooh, as long as I can pull my cock out, that sounds like fun!

  6. Re:Slashdot: Google's no cost PR agency by SnowPickles · · Score: 2

    It's because today was Google I/O, Google's developer conference, so a lot of projects were announced. Still, Slashdot could have combined all these into story with links to details rather than spam us with a ton of Google stories. However, this TPU project might be the most interesting thing to come out of the conference. Not because the chips are novel (it's just the same principles as GPUs but taken to a further extreme), but because it sounds like Google's getting into low level chip manufacture. We'll have to wait and see if Google can deliver more FLOPS per dollar/watt than the leading co-processor manufacturer, NVIDIA.

  7. Yeah, like DSPs... by RyanFenton · · Score: 5, Interesting

    Specialized processing chips have been several 'generations' ahead in terms of processing per dollar for many decades. In the 90's at least, DSPs were doing audio/video processing much cheaper by performing many machine-level steps simultaneously in one 'cycle' with less power than a general processor, by leaving out the features and cost of a general processor. And all you had to do to use them was test them on a hardware emulator, flash them, then pop them into production test run until you were good enough to deploy. Depending on the chip, they could run on a trickle of power, without active cooling, and match a much most costly general chip for pennies.

    I mean, it's how we got cell phones, and LOTS of other things, including most things in a computer that aren't the CPU.

    But isn't Moore's law more about transistors per unit cost, rather than performance per cost? Seems like a fundamental misunderstanding in the headline... which seems about as common as specialized chips in modern technology.

    Ryan Fenton

    1. Re:Yeah, like DSPs... by wjcofkc · · Score: 1

      Moore's law as it stands has always been a wall we would hit sooner than predicted by any self respecting nerd. It's time that it gets redefined.

      --
      Brought to you by Carl's Junior.
    2. Re:Yeah, like DSPs... by Anonymous Coward · · Score: 0

      > It's time that it gets redefined.
      What?
      No it's not.
      Moore's law is defined. When we hit the wall we can have some new law.
      It will be the end of Moore's law. It doesn't need to be and shouldn't be redefined.

    3. Re:Yeah, like DSPs... by peragrin · · Score: 2

      True, our knowledge of physics has limited moose's law. once we get into quantum scale computing though it should have another good run.

      --
      i thought once I was found, but it was only a dream.
    4. Re:Yeah, like DSPs... by Kjella · · Score: 1

      But isn't Moore's law more about transistors per unit cost, rather than performance per cost? Seems like a fundamental misunderstanding in the headline... which seems about as common as specialized chips in modern technology.

      Actually just transistor density, not cost at all. But in the popular press there's no invalid use of computers and Moore's law, it can be about performance, cost, size, IPC, battery life or whatever. Anything that's twice as good in two years or whatever best fits your story follows Moore's law. In this case it's not even an actual use, it's comparing totally unrelated improvements to imaginary iterations of it.

      --
      Live today, because you never know what tomorrow brings
    5. Re:Yeah, like DSPs... by RyanFenton · · Score: 3, Informative

      Linky

      "The complexity for minimum component costs has increased at a rate of roughly a factor of two per year. Certainly over the short term this rate can be expected to continue, if not to increase. Over the longer term, the rate of increase is a bit more uncertain, although there is no reason to believe it will not remain nearly constant for at least 10 years."

      - Gordon E. Moore, April 19, 1965

      It's both cost and density, and continued to be so as it crystallized into the transistor doubling every 18 months figure. To double the density, only at exorbitant cost would not really be an increase in accessible technology. It's not just the technology being invented and monopolized, but being rolled out and usable by the entire field. Increasing computational complexity is still the most important part - but cost has always been a part of the idea too.

    6. Re:Yeah, like DSPs... by konohitowa · · Score: 3, Funny

      Perhaps is time for squirrels law, no?

    7. Re:Yeah, like DSPs... by Anonymous Coward · · Score: 0

      Moore's law is essentially did now.

    8. Re:Yeah, like DSPs... by dtmos · · Score: 1

      *sigh* There's never a mod point around when you need one.

    9. Re:Yeah, like DSPs... by Anonymous Coward · · Score: 0

      Your mom is did.

    10. Re:Yeah, like DSPs... by Memnos · · Score: 1

      They've repeatedly tried to popularize Squirrel's Law, but it always gets off to a Rocky start.

      --
      I don't trust atoms -- they make up stuff.
    11. Re:Yeah, like DSPs... by Anonymous Coward · · Score: 0

      that's a load of bullwinkle if i've ever heard one.

  8. Hyperbole by Tough+Love · · Score: 1

    I will trust Intel, AMD and NVidia to sustain Moore's law as it pertains to general purpose computing, not Google. Google gets plaudits for advancing neural net hardware, if indeed they didn't just buy the tech and slap the Google brand on it, which is likely. Thw hyperbole just erodes credibility, in other words, makes me wonder how many other exaggerations will turn up in this department. Yes, it's a fact it plays Go well, and no doubt does a lot of other things well. Let's stick to the facts please. It didn't advance rendering by 7 years, it didn't advance compiling, in fact it didn't advance just about anything I do with my own computer. Putting my own jaded spin on it, it mainly advances Google's ability to invade my privacy and monetize me.

    --
    When all you have is a hammer, every problem starts to look like a thumb.
    1. Re:Hyperbole by Anonymous Coward · · Score: 0

      narcissist.

  9. I'm done with slashdot by Anonymous Coward · · Score: 1

    End of message.

  10. Google! YEY! by Frosty+Piss · · Score: 1

    This is the FIFTH Google story today.

    --
    If you want news from today, you have to come back tomorrow.
    1. Re:Google! YEY! by inode_buddha · · Score: 1

      Yeah, but OTOH they also have an article about AIDS, so it's all good.....

      --
      C|N>K
    2. Re:Google! YEY! by Anonymous Coward · · Score: 3, Informative

      Google I/O is happening at the moment.

    3. Re:Google! YEY! by Anonymous Coward · · Score: 0

      Slashdot Gobbles Google Cocks. News at 11

  11. Tensor Processing Units not new by Shinobi · · Score: 4, Informative

    Tensor Processing Units are not new. SGI used to offer that for their Octane, aimed pretty heavily at the satellite image analysis crowd.

    1. Re:Tensor Processing Units not new by Shinobi · · Score: 1

      Now that I check, there was also an XIO board with a TPU for the Origin/Onyx 2/Origin 200 machines

    2. Re:Tensor Processing Units not new by Anonymous Coward · · Score: 0

      They claim it is not related to SGI's stuff which was a DSP.

    3. Re:Tensor Processing Units not new by jasnw · · Score: 2

      I strongly suspect that Google's "Tensor" is not the same as a mathematical tensor, which is what the SGI chips were working with. This use of the work smells more of marketing than mathematics. As in "OMG, Google is using TENSORS!!! to do their AI language processing. TENSORS dammit!!!"

    4. Re:Tensor Processing Units not new by Anonymous Coward · · Score: 0

      loool which is exactly what googles is. the stupidity here amazes me now, its so funny its tragic.

    5. Re:Tensor Processing Units not new by Namarrgon · · Score: 3, Informative

      Right, this isn't a general-purpose DSP but a custom ASIC designed to run their TensorFlow graphs efficiently.

      --
      Why would anyone engrave "Elbereth"?
    6. Re:Tensor Processing Units not new by ceoyoyo · · Score: 2

      A tensor is an array, possibly with dimension > 2 if you want to be picky. TensorFlow absolutely does use tensors.

    7. Re:Tensor Processing Units not new by Shinobi · · Score: 1

      The SGI TPU was not merely a rebadged DSP. You had multiple application specific pipelines on it for example.

    8. Re:Tensor Processing Units not new by GonzoPhysicist · · Score: 1

      Yup, and if you want to be really pedantic even scalars are tensors of rank zero.

      --
      horror vacui
    9. Re:Tensor Processing Units not new by jasnw · · Score: 1

      And anything that does anything using arrays of any kind is "technically" using tensors. However, I suspect that Google is just using it for the psuedo-geek-sexy sound of it. Or maybe they're using an array of old Tensor high-intensity lights?? If so, my humble apologies.

  12. Re:Slashdot: Google's no cost PR agency by Anonymous Coward · · Score: 0

    Not sure anyone has said who is "manufacturing" these units for google. I doubt google is producing them themselves.
    Would love to know who it is though.

  13. Stop talking about performance per watt. by Anonymous Coward · · Score: 0

    Because usually it's combined with "green" "eco" and "slower than the last generation of hardware". I don't care about performance per watt; I care about performance. Granted in a data center setting wattage(and heat) is important, however, that hardly translates to talking about Moore's law.

    1. Re:Stop talking about performance per watt. by Blaskowicz · · Score: 1

      Not if your stuff is massively parallel. For example on a 100 watt GPU, doubled performance per watt translates to doubled performance - if there's no significant bandwith bottleneck etc.
      If you're the bad ass dude who wants a 250 watt GPU and nothing else it's the exact same deal. There are a few cards that eat even more but nobody will make a kilowatt GPU (single) just for you.

  14. Nice, but no thanks for a threat to humanity. by sethstorm · · Score: 1

    Perhaps when it isn't something that is likely to take multiple lines of work out in one shot, I'll be for it.

    --
    Twitter supports and protects racists - by smearing their critics with the "Hate Speech" label.
  15. Welcome to the Machine! by Anonymous Coward · · Score: 0

    says "Samaritan" (ref to the TV series 'Person of Interest')

    sorry but no. Google is a no-go zone for me these days.
    They are bent of boing evil to humanity in order to get a few $$$$ more Ad revenue.
    When that dries up, they will attempt to rule all our lives 24/7/52

    1. Re: Welcome to the Machine! by Anonymous Coward · · Score: 0

      'boing evil' sounds very evil.

  16. It could by Anonymous Coward · · Score: 0

    I could flap my arms and fly to the moon. Well, I *could*. Why does everyone like to gag on the cock of google?

  17. Machine learning trying too hard to be brainy! by Anonymous Coward · · Score: 0

    Less precision computing, machine learning and neural networks. Mm.. trying too hard to emulate the brain here. Tell me when you reach 20W power draw and 40 petaflops.

  18. TPU sounds familiar by FreshnFurter · · Score: 3, Interesting

    Was that not something introduces about 20 years ago by Silicon Graphics? Or am I getting old
    http://manx.classiccmp.org/mir...

    1. Re:TPU sounds familiar by slew · · Score: 1

      Was that not something introduces about 20 years ago by Silicon Graphics? Or am I getting old
      http://manx.classiccmp.org/mir...

      If you are old enough, you might remember, some of the googleplex used to be SGI buildings...
      Maybe google did some remodeling and found some antiques ;^)

  19. Ai advancement scalng by Anonymous Coward · · Score: 0

    Well, Moores Law is about hardware. Never heard of Denning.
    In the early days, hardware speed and transistor density determined the capabilities of a computer prety well.
    Plain old logic and linear programmng. This was advanced somewhat by specialized hardware when array processng
    and GPus were developed. Capability was still linked mostly to hardware. Software contributed to the capabilities
    by being written to use the hardware and parallel programming techniques. All of these can be measured in terms
    of FLOPS.
          The advance of AI is different. Measurement here should be based on the standard math/logic ( FLOPS ) and some set of measure
    of the information manipulation and processing capabilities. Memory and data handling come into play.
    Pattern recognition and analysis of data are integral to this also. And - to be sure that all is accounted for - advances will
    require more database capability, more disk storage, and more computing hardware....

    Ai capabilities are still using hardware, so anyone who uses moores law for Ai description is only mildly retarded.
    ( little i in Ai since intelligence is still specific - idiot savant...)
    I propose several alternatves, based on animal and human development.
    Peak law ( what is the peak capabilty of the Ai?). All intelligences peak at some level... how high?
    Stupid Law ( how stupid can it potentially be?). Self-explanatory.
    CS Law ( common-sense measurement - humans have this law also...). Self-explanatory.
    Humor law ( recogniton and use of humor IS an intelligence measure ).

  20. No it couldn't by Anonymous Coward · · Score: 0

    Moore's law was about feature size and advances continue to slow.

    This product is about parallelism. Nothing new.

  21. 90% hype by Anonymous Coward · · Score: 0

    At least.

  22. Drat by jsepeta · · Score: 1

    Not related to the wizard Tensor of Tensor's Floating Disc.

    --
    Remember kids, if you're not paying for the service, YOU ARE THE PRODUCT THAT IS BEING SOLD.
  23. Stop the presses by Krakadoom · · Score: 1

    Am I reading this right, the basic gist of the article is that custom, purposebuilt chips are faster at very specific tasks than general all-purpose chips?

    Such amaze, much fast, wow.

  24. click bait by Anonymous Coward · · Score: 0

    >...Could Advance Moore's Law 7 Years Into The Future

    No, they built an ASIC that accelerates a SPECIFIC APPLICATION - doesn't do shit to Moore's law.

  25. They should just call it "Google's Law" by Pyramid · · Score: 1

    "Google's Law" states that whenever you need to make something sound cool and innovative, just misuse and existing term like "Moore's Law", because reporters are stupid.

    --
    ~Any apparent grammatical or typographic errors are caused by defects in your display device.
  26. Polling issues by Keybounce · · Score: 1

    See, this is what's wrong with polls.

    "Do you care about transistor density?", will get a lot of "Huh?", which will be interpreted as "no, does not care".

    "Transistor density is one of the biggest factors in computer speed. Do you care about making transistors more dense in the future to improve computer speed?" will get a lot of "Yes", because people want the computer speed, even if they don't understand transistor density.

    Polling has gotten a bad name because the loaded questions spew the results, and make headlines.

    Oh, I didn't give this the right headline -- "Slashdot Readers Care More About The Headline", because of the poll question, "Will a bad, uninformative headline that doesn't interest you result in your not clicking on the story?"

  27. Can it mine bitcoin? by Keybounce · · Score: 1

    The real question on advances over GPU's: Can it mine bitcoin faster and for less electricity?

    1. Re:Can it mine bitcoin? by TeknoHog · · Score: 1

      Shirley, you're joking. FPGAs took over GPUs for Bitcoin mining in 2011, and later ASICs.

      --
      Escher was the first MC and Giger invented the HR department.
    2. Re:Can it mine bitcoin? by Keybounce · · Score: 1

      Actually, I did not know that.

      Meanwhile, given that this new TPU has lower watt per computation, maybe a better question is, can it mine bitcoins cheaper?