Slashdot Mirror


MIT Develops New Chip That Reduces Neural Networks' Power Consumption by Up to 95 Percent (mit.edu)

MIT researchers have developed a special-purpose chip that increases the speed of neural-network computations by three to seven times over its predecessors, while reducing power consumption 94 to 95 percent. From a report: That could make it practical to run neural networks locally on smartphones or even to embed them in household appliances. "The general processor model is that there is a memory in some part of the chip, and there is a processor in another part of the chip, and you move the data back and forth between them when you do these computations," says Avishek Biswas, an MIT graduate student in electrical engineering and computer science, who led the new chip's development. "Since these machine-learning algorithms need so many computations, this transferring back and forth of data is the dominant portion of the energy consumption. But the computation these algorithms do can be simplified to one specific operation, called the dot product. Our approach was, can we implement this dot-product functionality inside the memory so that you don't need to transfer this data back and forth?"

55 comments

  1. How does this compare with Google's? by Mostly+a+lurker · · Score: 3, Interesting

    The tensor processing units Google developed also seem very capable compared to regular processors. Does anyone know how MIT's new chips stack up against what Google already has in operation?

    1. Re:How does this compare with Google's? by DrTJ · · Score: 4, Insightful

      The MIT press release says next to nothing, unfortunately. AFAICT, they don't reference any published article, or any kind of link to more information, so it is hard to assess. I really wanted to know more so I'm a little disappointed with MIT.

      There are a few things that indicate that this is not even comparable to Google's TPU:
      1. The lack of more information.
      2. They label it as a prototype.
      3. The top person link goes to a first year graduate student (making a real ASIC takes a slightly larger team, I hear).

      Without more detailed information, this is hard to distinguish from PR.

    2. Re:How does this compare with Google's? by Ayano · · Score: 1

      It's probably funded by a company or the team/professor is working on Patenting it. This is typical of emergent technology not funded with public money.

      --
      I don't read AC
    3. Re:How does this compare with Google's? by bluefoxlucid · · Score: 2

      I'd hope the MIT chip could do math better. 3-7 times faster, 5%-6% as much power draw? That's 0.7%-2% as much power consumption per computational operation.
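
      The arithmetic behind that range can be checked with a short sketch. It assumes "power per operation" means relative power draw divided by relative speed, and it uses only the endpoint figures from the summary (3x to 7x faster, 5% to 6% of the power). The function name is made up for illustration.

```python
def relative_energy_per_op(power_fraction, speedup):
    """Energy per operation relative to the baseline chip.

    Energy = power * time, and time per operation scales as 1 / speedup,
    so the ratio is power_fraction / speedup.
    """
    return power_fraction / speedup


# Figures from the summary: 3x-7x faster, 5%-6% of the power draw.
best = relative_energy_per_op(0.05, 7)   # lowest power, highest speedup
worst = relative_energy_per_op(0.06, 3)  # highest power, lowest speedup

print(f"best case:  {best:.2%} of baseline energy per operation")
print(f"worst case: {worst:.2%} of baseline energy per operation")
```

      Under those assumptions the two endpoints come out at roughly 0.7% and 2%, matching the range quoted above.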

    4. Re:How does this compare with Google's? by burtosis · · Score: 1, Troll

      It's probably funded by a company or the team/professor is working on Patenting it. This is typical of emergent technology not funded with public money.

      Not yet funded with public money. Don't worry, some dumbass grad student(s) who singlehandedly redesign and optimize processors for neural networks won't keep that company position for long. No, as soon as it looks remotely profitable, and after their dumb ass works 100 hrs/week for three years to get a thesis to market, they will comfortably find themselves out the door with a firm boot mark in their butt as their only asset remaining from the company. Don't worry, big money can commit financial crimes like fraud and collusion and not be held accountable, so forcing the deplorables out won't have to be by the books. The true visionaries are the venture capitalists who will take that technology and make it into something truly profitable to the right people, who ultimately deserve, no - are entitled to it.

      It's the American inventor's modern nightma^h^h^h^h^h^h dream.

    5. Re:How does this compare with Google's? by ShanghaiBill · · Score: 4, Informative

      Does anyone know how MIT's new chips stack up against what Google already has in operation?

      This seems to be different.

      Google's TPUs reduce power and increase speed, but are targeted for internal use in data centers. You can't buy one.

      This MIT chip is targeted toward home use and mobile devices.

      Both chips do fast low-precision matrix ops. The TPU uses eight-bit multipliers. TFA is poorly written, but it appears that the MIT chip does analog multiplication. From TFA: "In the chip, a node’s input values are converted into electrical voltages and then multiplied by the appropriate weights. Summing the products is simply a matter of combining the voltages. Only the combined voltages are converted back into a digital representation and stored for further processing."

      If this is true, then that could be a huge boost in efficiency, but results would not be exactly repeatable: You could get different results for the exact same inputs.

      Another feature is that the neurons in each layer produce a single binary output. That is obviously simpler than the TPU's 8-bit outputs, and is analogous to how biological neurons work. But it limits which algorithms can be used. RBMs (Restricted Boltzmann Machines) use single bit outputs, and were used in the first successful "deep" networks, but have more recently fallen out of favor. Single bit outputs make backprop more difficult, although it sounds like this chip is targeted more for deployment than for learning.
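
      A rough software model of the behaviour described above, as a sketch only. It assumes weights of +1 or -1, a simple sign threshold for the single-bit output, and Gaussian noise standing in for analog imprecision. None of those details are confirmed by TFA, and every name here is hypothetical.

```python
import random


def analog_neuron(inputs, weights, noise_sigma=0.0, rng=None):
    """Model one node: multiply, sum into one 'voltage', emit a single bit."""
    if rng is None:
        rng = random.Random()
    # Summing the products stands in for combining voltages on a shared line.
    combined = sum(x * w for x, w in zip(inputs, weights))
    # Assumed analog imprecision, added to the combined value only.
    combined += rng.gauss(0.0, noise_sigma)
    # Only the combined value is digitized, here down to one binary output.
    return 1 if combined >= 0 else 0


inputs = [0.9, 0.2, 0.4, 0.7]
weights = [1, -1, 1, -1]  # assumed binary weights

# Noise-free: the same inputs always give the same bit.
exact = analog_neuron(inputs, weights)

# With noise: the same inputs can land on either side of the threshold.
noisy = [
    analog_neuron(inputs, weights, noise_sigma=0.5, rng=random.Random(seed))
    for seed in range(200)
]

print("noise-free output:", exact)
print("distinct noisy outputs:", sorted(set(noisy)))
```

      The closer the combined value sits to the threshold relative to the noise, the more often repeated runs would disagree, which is the repeatability concern raised above.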

    6. Re:How does this compare with Google's? by bondsbw · · Score: 1

      You could get different results for the exact same inputs.

      Great, now we can have insane AIs.

      --
      All my liberal friends think I'm a conservative, all my conservative friends think I'm a liberal.
    7. Re: How does this compare with Google's? by Anonymous Coward · · Score: 0

      I don't know the IP policy at MIT, but at the last three universities I worked at, students, grad students and researchers retained majority ownership & control of patents they got with the help of the university legal team, and had the option to pay for their own patent lawyer if they wanted 100% ownership of the patent. There were a couple of ways to attach strings to funding that complicated or restricted the process for permanent-position researchers, but that did not affect grad students (and at two of the three, grad student rules applied to postdocs too).

    8. Re:How does this compare with Google's? by HeckRuler · · Score: 1

      Price and computational utility aside, they sound GREAT for researching how biological neural networks work.

    9. Re:How does this compare with Google's? by ShanghaiBill · · Score: 2

      Price and computational utility aside, they sound GREAT for researching how biological neural networks work.

      I doubt that. This chip is designed to do fast and efficient matrix operations, which only work well if the neurons are in distinct and ordered layers. Biological brains don't do that. Also, biological brains learn by strengthening connections as they are used, in a process very different from the backprop algorithm used in ANNs, and it isn't clear if this new chip actually does any learning rather than just running a pre-programmed network.

      We will learn much more about biological brains from projects like OpenWorm, which is an attempt to understand and emulate the brain of C. elegans, a nematode.

      It is not clear that ANNs will be improved much by better understanding of BNNs. They work in different ways, and ANNs are much faster. You may be better than a computer at face recognition, but the computer is improving quickly and is WAY faster, scanning thousands of images per second.

    10. Re: How does this compare with Google's? by burtosis · · Score: 2

      I'm jaded because I had my technology-push, multi-million-dollar startup stolen even though I was involved with several professors and graduate students, and licensed the technology from the university. We weren't allowed to buy off the patent costs, which for this portfolio (and due to government waste) wound up being around 400 thousand USD. Though I did have to pay off those costs before I was allowed any income disbursements from the royalty payments I made to myself, so in essence I paid for the patent portfolio, but retained no rights to it. It's quite likely the same story with MIT. They are so fucked, I feel sorry for those grad students. Without taking on money, they will have no product; if they take on money there is a 99% chance of complete and total fuckage. Trust me, I lived through exactly this scenario.

    11. Re: How does this compare with Google's? by Anonymous Coward · · Score: 0

      Hi, 504s on your link, can you summarize please?

    12. Re: How does this compare with Google's? by burtosis · · Score: 1

      It's just a comment I made before. It doesn't 504 on any browser I tried, but I'll repost:

      From someone who has had their not-that-small startup get absolutely trashed by VC: I agree with most of what you wrote; my main disagreement is that it's a worse landscape than you paint. You are leaving out a complete disregard for all laws or actions that they probably won't be held accountable to. Here is how I was scammed.

      It was a university startup, and while I had the largest ownership by a good margin, we started with around 12 owners, including some faculty, and licensed the technology through the university (you don't own what you invent at universities, just like at companies). This made politics an issue from day one, as emails from senior university officials from the business development office had comments like "who cares, fuck the students", and the law services butchered the articles of incorporation when a simple boilerplate would have been better. I was working two and a half full-time jobs managing the technology, and as this was my first company I had quite a bit to learn.

      We eventually took on money to produce product, but this basically "required" taking on a CEO with experience who, due to various NDAs keeping information from us, turned out to be a typical financial criminal. After the first CEO colluded with this new-hire CEO, he was able to vote shares not yet vested through the milestones outlined in his agreement, thanks to a stupid and ignorant loophole in our articles and the agreement language. Combined with the shares we lost in the opening round, that cost us control of the company. The CEO then made a predatory purchase agreement with the contract manufacturer, who also happened to be the largest VC. This 10 million dollar purchase was hidden from financial disclosure during a subsequent investment round.

      When the company had a shortfall and couldn't pay, an emergency shareholder meeting was called for one week after an announcement on Christmas Eve night. There it was announced that the 10 million dollars invested in the company was now worthless because the company was insolvent, and that we were now so lucky to have our entire company bailed out (with a 14-1 dilution) by undisclosed people who only paid 400k. The whole deal was kept secret to a few select large VCs who fucked all the others (and me) over using inside knowledge of the company. They wouldn't provide any of the legal documentation required by law before the meeting, and when a class action lawsuit started up, the independent counsel investigating took verbal confirmation that they did in fact have a secret document that had disclosed the 10m off-the-books deal. I should have known when I tried to hire a law firm and the first 12 had conflicts that I was really fucked.

      tl;dr: VC will just take your company and kick your withered corpse to the curb, but only after milking all of your contacts and resources dry and then burning the bridges on your behalf. The only reason you should take on money is if you are damn sure you can get the upper hand and fuck them over financially, because that's the only reason VCs invest in startups.

  2. Just imagine by pablo_max · · Score: 2

    Just imagine a Beowulf cluster of these things ;)

    1. Re:Just imagine by Anonymous Coward · · Score: 0

      or being able to run Crysis!

  3. doesn't sound new by Anonymous Coward · · Score: 0

    "The general processor model is that there is a memory in some part of the chip, and there is a processor in another part of the chip, and you move the data back and forth between them when you do these computations"
    Registers and L1 cache? Wow, that's new!

    1. Re:doesn't sound new by tsqr · · Score: 1

      "The general processor model is that there is a memory in some part of the chip, and there is a processor in another part of the chip, and you move the data back and forth between them when you do these computations" Registers and L1 cache? Wow, that's new!

      Nice sarcasm. Maybe if you had been paying attention you would have been able to discern that "the general processor model" is a description of the status quo, not the new approach.

  4. But can it by AHuxley · · Score: 1

    mine cryptocurrencies?
    Some new ranking for chips in terms of minability.

    --
    Domestic spying is now "Benign Information Gathering"
  5. Where is Michael Biehn? by Anonymous Coward · · Score: 0

    Call Michael Biehn immediately. We need to resolve this situation ....

  6. "Smart" homes are stupid by pablo_max · · Score: 2

    Looking at what is available today, I would have to say that today's smart world is incredibly stupid. Not to mention fractured, with loads of standards, apps and doodads.
    When Google took over Nest I had high hopes.
    I had imagined they would do something clever like install their phased array mics into the "smart" fire alarms that could be in almost every room. Then from anywhere you could ask Google something. But no... you need to find some stupid crappy little speaker and keep shouting HEY, GOOGLE, HEY GOOGLE, HEY GOOGLE, until it finally can hear you.
    With these chips, they could take that idea even further. Install connected appliances, connected switches and sockets, and then figure out the patterns of usage and voices to "learn your ways" and begin to anticipate things.
    Oh.. Bob always turns on the TV right after he grabs a beer from the fridge around 6pm. The fridge just opened, so I will turn on the TV for him.. also it was cold today, so I will adjust the heat in that room so Bob's ass doesn't get too cold on his leather La-Z-Boy.
    I should think that is all totally possible today.

    1. Re:"Smart" homes are stupid by Anonymous Coward · · Score: 0

      ...And not even remotely creepy right?

    2. Re:"Smart" homes are stupid by mikael · · Score: 1

      Add custom "voices" to these devices like Cylons, Cybermen, Daleks, Basil Fawlty, Village Idiot and Zen from Blake's 7, and they would probably sell infinitely more when combined with locally processed speech recognition.

      --
      Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
    3. Re:"Smart" homes are stupid by Anonymous Coward · · Score: 0

      "Shut up friends. My internet browser heard us saying the word Fry and it found a movie about Philip J. Fry for us. It also opened my calendar to Friday and ordered me some french fries."
      --farnsworth

  7. Video Cards by SuperKendall · · Score: 1

    Since video cards have specialized processors that handle dot products (and all sorts of other matrix computations) like mad, how is what they are proposing much better than existing GPU's? In particular it seems like nVidia has been doing a ton of work tailoring GPU's to be used with neural networks.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
    1. Re:Video Cards by Anonymous Coward · · Score: 2, Informative

      how is what they are proposing much better than existing GPU's?

      How about reading the summary?

      GPU's aren't exactly known for being energy efficient.
      This chip is more energy efficient since it doesn't need to move the data to a central processor that might even be on another chip.
      It distributes the ALU's among the memory so it doesn't have to move the data as far.

      Also to get an idea of the scale we are working with here, speed of light / 5 cm is about 6 GHz.
      If you want to work fast you don't want to move data long distances.
      There is a limit to how fast information can travel, and on a bidirectional bus you have to wait until the last word reaches the destination before you switch direction.
      Reduce the data path to a mm and you have a lot more margin to work with.
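
      A quick sanity check of that figure, as a sketch. It assumes signals travel at the vacuum speed of light; real on-chip and board-level signals are slower, so the practical limits are tighter. The function name is illustrative.

```python
C = 299_792_458.0  # speed of light in vacuum, m/s


def max_crossings_per_second(distance_m):
    """Upper bound on how many times per second a signal can cross distance_m."""
    return C / distance_m


for label, distance_m in [("5 cm", 0.05), ("1 mm", 0.001)]:
    rate_ghz = max_crossings_per_second(distance_m) / 1e9
    print(f"{label}: about {rate_ghz:.1f} GHz")
```

      Shrinking the path from 5 cm to 1 mm raises this ceiling by a factor of 50, which is the extra margin the comment refers to.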

    2. Re:Video Cards by religionofpeas · · Score: 1

      For starters, video cards have too much precision.

    3. Re:Video Cards by mikael · · Score: 1

      GPU's have vector processors for their shader cores that do 32-bit and 64-bit floating-point processing (IEEE 754 standard). To do a floating-point addition, for example, you have to align the two floating-point values by exponent, do the calculation, and recalculate the new exponent and mantissa.

      Other solutions are to use fixed-point integers (8-bit, 16-bit and 32-bit) and even actual digital-to-analog conversion and back again.
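
      To make the fixed-point alternative concrete, here is a minimal sketch of an 8-bit dot product. It assumes signed 8-bit values with one shared scale per vector, which is one common scheme and not necessarily what any particular chip does; the helper names are hypothetical.

```python
def quantize(values, scale):
    """Map real values to signed 8-bit integers using a shared scale."""
    return [max(-128, min(127, round(v / scale))) for v in values]


def fixed_point_dot(xs, ws, x_scale, w_scale):
    """Dot product done with integer multiply-accumulate, rescaled once."""
    qx = quantize(xs, x_scale)
    qw = quantize(ws, w_scale)
    # No exponent alignment needed: every term shares the same scale.
    acc = sum(a * b for a, b in zip(qx, qw))
    return acc * x_scale * w_scale


xs = [0.5, -0.25, 0.75]
ws = [0.1, 0.2, -0.3]

exact = sum(x * w for x, w in zip(xs, ws))
approx = fixed_point_dot(xs, ws, x_scale=1 / 128, w_scale=1 / 128)

print("floating point:", exact)
print("8-bit fixed point:", approx)
```

      The two results differ slightly because of rounding during quantization; that loss of precision is the price paid for the simpler integer arithmetic.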

      --
      Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
    4. Re:Video Cards by billyswong · · Score: 1

      and even actual digital-to-analog conversion and back again.

      Which is exactly what they do in the article. But the restriction to binary weights means the chip as designed is incompatible with current mainstream neural networks.

  8. Amazon/Apple/Google want the uplink data... by kbonin · · Score: 1

    The major vendors aren't nearly as interested in dropping the system hardware cost as they are in having plausible access to live microphone streams. Since the user is the product, and privacy is irrelevant, it's now all about the data mining for advertising and related behavioral research. This also keeps the IP in the neural networks away from competitors' and open source developers' prying eyes. These chips might be used for some preprocessing, but these vendors want that data stream to continue as long as possible...

  9. Amazon/Apple/Google want the uplink data... by Anonymous Coward · · Score: 0

    And the miracle solution is...? Don't use any of them. See how easy that was. You all are like a drug addict complaining about his addiction.

  10. Neat by XSportSeeker · · Score: 1

    Interesting development...
    But my understanding of this whole deal, and I might be wrong, is that we already have more than enough to make AIs local... this isn't a problem of capability, this is companies behind AI assistants trying to harvest as much data as possible from their customers and turn a profit from it, and/or to use it for themselves.

  11. The question on everyone's miner: by Gravis+Zero · · Score: 1

    So... how good is it at computing SHA256 hashes? ;)

    --
    Anons need not reply. Questions end with a question mark.
    1. Re:The question on everyone's miner: by Anonymous Coward · · Score: 0

      SHA256 doesn't really use dot product.

    2. Re:The question on everyone's miner: by religionofpeas · · Score: 1

      Not at all useful. Neural nets require large memory bandwidth and multiplications. SHA256 hashing needs dedicated logic for SHA256 hashing, and very little memory bandwidth. Besides, there are already much better chips for hashing.

  12. Color me skeptical by ckatko · · Score: 4, Interesting

    That sounds like something an FPGA could do from the very beginning.

    The only new thing here would be possibly LARGER amounts of memory stored in between the fabric (reducing off-chip access, and increasing the number of LUTs not tied up as memory cells), and possibly, like they said, combined "access and modify" operations.

    But I think the article itself doesn't understand what it's talking about then.

    And as general purpose as FPGAs are in idea, they have been "custom adapted" to different tasks (and layout/fabric) since inception. So the question here is, are they talking about some kind of ASIC advancement that they didn't have before?

    >The chip can thus calculate dot products for multiple nodes — 16 at a time, in the prototype — in a single step, instead of shuttling between a processor and memory for every computation.

    This appears to be the only actual advancement/tech/change, being extruded out into an entire fluff article for college PR purposes.

    Personally, I'm way more interested in getting my hands on an "FPGA in CPU" ever since back in college when Altera was bought by Intel. Imagine a CPU that can be told to add CUDA cores when you start a game, or SHA cores when you start a server. Altera specializes in live-reconfigurable FPGAs: FPGAs that can be "flashed" in whole or in part while still running.

    1. Re:Color me skeptical by boa · · Score: 1

      Personally, I'm way more interested in getting my hands on an "FPGA in CPU"...

      Maybe something like this?
      http://www.stretchinc.com/tech...

    2. Re:Color me skeptical by Anonymous Coward · · Score: 0

      That sounds like something an FPGA could do from the very beginning.

      The selling point was "energy efficient".

      Imagine a CPU that can be told to add CUDA cores when you start a game, or SHA cores when you start a server.

      You need a couple more examples than that.
      If the choice is between CUDA cores and SHA cores you can just have them hardcoded and have more than twice as many of each compared to what will fit in an FPGA.
      FPGAs waste a lot of space on configurable routing.

    3. Re:Color me skeptical by religionofpeas · · Score: 2

      FPGAs aren't really good for massive amounts of multiplications. Modern FPGAs have dedicated multipliers, but they only have a few of them. And the reason they have dedicated multipliers is because the general FPGA fabric sucks at doing multiplications.

    4. Re:Color me skeptical by mikael · · Score: 1

      Sun Microsystems used to have a patent for smart VRAM for video cards back in the 1990s. These put the basic OpenGL logic ops (and, or, xor, etc.) onto the video memory so that entire blocks of pixels could be processed at accelerated pixblt speeds. Basically as fast as rows of chip memory were being pulled out and sent to the video output, and as deep as how many bit-planes you had. Back then that was 32 bits.

      FPGA's are used to simulate CPU and GPU cores. Sometimes they are bundled inside a PCI slot board and used to accelerate proprietary algorithms used for things like financial analysis. They are faster than CPU's but slower than ASIC's.

      --
      Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
  13. "embed them in household appliances" by Anonymous Coward · · Score: 0

    Neural network in my toaster...awesome! When will I be able to buy one?

  14. Sounds like hybrid memputing by wierd_w · · Score: 2

    Such things include "Computational Ram"
    https://en.wikipedia.org/wiki/...

    There is also a very old idea of using memory elements directly to compute results, which is true memputing. (There are few examples of this, because it is costly as an architecture-- but your brain is a pretty good biological example. The same components are used for data storage, as well as data processing.)

    Given that such "Computational Ram" devices already exist in the wild, I fail to see why more novel hardware is needed, except as a refinement of concept?

    1. Re:Sounds like hybrid memputing by mikael · · Score: 1

      Most brains are basically a sheet of computational neurons (outer periphery) while the interconnects warp this into a wrinkly structure like a Hilbert space filling curve in order to reduce connection distances. Internally, oxygenated blood is pumped outwards to bring in glucose and oxygen while taking away excess heat. Empty spaces are filled with fluid in order to maintain constant internal temperature.

      --
      Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
    2. Re:Sounds like hybrid memputing by Required+Snark · · Score: 1

      Prior art from 1972

      A Logic-in-Memory architecture for large-scale-integration technologies

      A computing machine is described which is structured around a distributed logic storage device called the Processing Memory. This machine, the Brookhaven Logic-In-Memory Processor (BLIMP), is meant only as a vehicle for simulating and evaluating its concepts, rather than for eventual fabrication. In particular, it is shown that the architecture used is very well suited to large-scale-integration (LSI) implementation technologies. It was first necessary to redefine the various goals of logic design optimization in the context of LSI implementation. Then an elemental building block of the Processing Memory is described as having evolved from associative memory circuits. It is shown that a computer such as BLIMP which utilizes the Processing Memory concept can meet the goals of design optimization for LSI. Design techniques for this project were developed as they were required. Of particular importance is a simulation system called MODEL, which documents the structure and analyzes the behavior of the proposed system.

      --
      Why is Snark Required?
  15. cryptocurrency mining by Anonymous Coward · · Score: 0

    But can it mine cryptocurrencies?

    A better question is "could a breakthrough in quantum computing make mining cryptocurrencies trivial?" (making the Dutch tulip market collapse look like a minor market correction).

  16. Middle Out? by Anonymous Coward · · Score: 0

    All this talk of moving things back and forth reminds me of Middle Out Compression...

  17. Um... by Gibgezr · · Score: 0

    That could make it practical to run neural networks locally on smartphones

    I thought EVERY smartphone had a neural network chip in it already: that's how modern auto-focus works, unless I was misinformed.

    1. Re:Um... by Anonymous Coward · · Score: 1

      you were misinformed

    2. Re:Um... by Anonymous Coward · · Score: 0

      You are misinformed.

    3. Re:Um... by Gibgezr · · Score: 1

      Partly? Some quick research shows that many phones DO have neural-network code running on them, sometimes for passive auto-focus (active auto-focus, the type that uses range-finding sensors, does not seem to be very common on smartphones). Apple and Huawei actively advertise the use of NN tech in their phones, but not necessarily for auto-focus (face recognition and image enhancement seem to be what they are selling). The dedicated chip part is usually referred to as the ISP, or Image Signal Processor, a dedicated DSP chip that can run NN code. There are lots of papers out there that explain the NN tech with regard to smartphones, going back a couple of decades. So it seems like SOME phones do this, but not all.

  18. wow by Anonymous Coward · · Score: 0

    so does this mean I can run this chip with a double A battery?

  19. Skynet by Anonymous Coward · · Score: 0

    Isn't this the chip they reverse engineered from that bit of waste found in the steel plant after the Terminator was destroyed?

  20. Fuzzy Logic by cormandy · · Score: 1

    Oh just great! Looks like I'll soon have to replace all my old appliances which employ "fuzzy logic" with new appliances that employ neural networks.

  21. You mean like a Xilinx Zynq? by Anonymous Coward · · Score: 1

    ARM processor + FPGA fabric?

    And of course, the Virtex series has had (not incredibly well supported) partial reconfiguration on the fly for at least 10 years, and you can instantiate a CPU core of your choice.