Google's Custom Machine Learning Chips Are 15-30x Faster Than GPUs and CPUs (pcworld.com)
Four years ago, Google was faced with a conundrum: if all its users hit its voice recognition services for three minutes a day, the company would need to double the number of data centers just to handle all of the requests to the machine learning system powering those services, reads a PCWorld article, which describes how the Tensor Processing Unit (TPU), a chip designed to accelerate the inference stage of deep neural networks, came into being. The article shares an update: Google published a paper on Wednesday laying out the performance gains the company saw over comparable CPUs and GPUs, both in terms of raw power and the performance per watt of power consumed. A TPU was on average 15 to 30 times faster at the machine learning inference tasks tested than a comparable server-class Intel Haswell CPU or Nvidia K80 GPU. Importantly, the performance per watt of the TPU was 25 to 80 times better than what Google found with the CPU and GPU.
Welcome our new Google overlords. (or whatever...)
"I say we take off, nuke the site from orbit. It's the only way to be sure."
outperforms general purpose chips?
Wow.
Man is this a "duh" moment. Purpose built ASICs are extremely fast and low power for what they accomplish. That's why we use them. Look at a small desktop network switch: Little tiny processor that can pass 16Gb/sec of traffic around. Try and put 8 NICs in a computer and have it switch traffic and you'll be amazed at how much power you need. The reason the switch is small is it is purpose built: Its ASIC does nothing but switch Ethernet packets.
Same deal with some things on a CPU. You find that decoding an AVC video stream takes next to no CPU power on modern CPUs, yet decoding an MPEG-2 video takes some. Why? Because they have a small bit of dedicated logic for AVC decoding (usually some other formats too). It is low power because it is dedicated.
Always the question in designing a system is flexibility and unit cost vs fixed function and up front cost. A CPU is great because it can do anything, and you can just buy them straight out; tons of companies have them available for purchase right now. However, they take a lot of silicon and power to perform a given task. An ASIC takes a bunch of up front money to design and do a manufacturing run, but is very small and efficient. However, it can't be reconfigured to do anything else; any change needs a full respin. In the middle there is something like an FPGA. Which one is right for an application just depends on the balance of a lot of factors.
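To put rough numbers on that up front cost vs unit cost trade-off, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the function name `break_even_units` is made up, and the dollar figures are placeholders rather than real ASIC or CPU prices. It only compares purchase cost and ignores power, which would push the break-even point lower for an efficient ASIC.

```python
import math

def break_even_units(nre_cost, asic_unit_cost, general_unit_cost):
    """Smallest unit count at which the ASIC route costs no more in total.

    nre_cost: one-time design and manufacturing-run cost (hypothetical).
    asic_unit_cost / general_unit_cost: per-unit prices (hypothetical).
    Returns None if the ASIC is not cheaper per unit, because the
    up front cost is then never recovered.
    """
    saving_per_unit = general_unit_cost - asic_unit_cost
    if saving_per_unit <= 0:
        return None
    return math.ceil(nre_cost / saving_per_unit)

# Placeholder figures: $2M up front, $50 per ASIC vs $450 per CPU.
units = break_even_units(2_000_000, 50, 450)
print(units)
```

Below that volume the off-the-shelf part wins on cost; above it the fixed-function part does, which is roughly the balance described above.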
Just got a big bump in market valuation.
I thought the TPU was for hard drive encryption. Or is it doing double duty?
if all its users hit its voice recognition services for three minutes a day, the company would need to double the number of data centers
The performance bottleneck in machine learning is training the system and the amount of training data, not the number of users running the model. Not sure I understand how usage is so directly proportional to computing costs.
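One way to see why usage can drive the cost: every request needs its own forward pass through the model, so inference work grows linearly with users, while training work does not depend on how many people use the result. A minimal sketch, assuming made-up numbers throughout (`daily_compute` and every figure below are hypothetical, not taken from the article or Google's paper):

```python
def daily_compute(users, audio_seconds_per_user, ops_per_audio_second,
                  training_ops_per_day):
    """Split a day's compute into training and inference (all inputs hypothetical)."""
    # Each second of audio from each user needs its own forward pass.
    inference = users * audio_seconds_per_user * ops_per_audio_second
    # Training cost is set by the model and dataset, not by user count.
    return {"training": training_ops_per_day, "inference": inference}

small = daily_compute(1_000_000, 180, 1_000_000, 10**15)
large = daily_compute(2_000_000, 180, 1_000_000, 10**15)
print(small["inference"], large["inference"])
print(small["training"], large["training"])
```

Doubling the users doubles the inference term and leaves the training term alone, so with enough users the serving side dominates.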
Fast Federal Court and I.T.C. updates
How does the TPU do on regular CPU and GPU type tasks? It's really an Apples to Oranges comparison either way.
Algorithms and processor sets are not artificial intelligence and neural networks. Biggest fish story of the 21st century. Cool technology, horrendously disingenuous PR narrative.
But 1000x as expensive?
outsourced foreign cheap labor!
Shouldn't the performance numbers between CPU and GPU be different enough that comparing a TPU to them would produce very different numbers?
The performance improvement range seems to be too small, right?
Wasn't there some movie where they have these cool chips and then the robots take over the world and some dude has to go back in time to bang his friend's mother?
Why link to an article that regurgitates the same information, when you could link to the actual blog post by Google: https://cloudplatform.googleblog.com/2017/04/quantifying-the-performance-of-the-TPU-our-first-machine-learning-chip.html
And while we are at it, why not also link to the paper: https://drive.google.com/file/d/0Bx4hafXDDq2EMzRNcy1vSUxtcEk/view
See subject: You impersonate me & downmod my real posts & can't prove me wrong technically/validly so I've won, obviously!
* LOL - & you do it via UNIDENTIFIABLE anonymous posts (also proving you're some BUTTHURT loser I've torn up so badly in technical debates on hosts files you're reduced to such BITCH tactics, lmao...)
APK
P.S.=> Thanks whoever you are impersonating me + doing so by UNIDENTIFIABLE cowardly trolling worm stalking/harassing tactics - you're just tipping your hand you can't get the better of me... apk
So you're saying, no one blows harder than you?
Oh good, so our dystopian future can be realized just that much faster then...
but how many fps does it get running the new Mass Effect? Oh it can't?
That's like saying a software defined radio is not a radio.
It's right -- but it's also completely wrong.
And the important part in the context here... yeah, the completely wrong part.
You can create a perfectly fine neural network with a general purpose von Neumann or Harvard architecture CPU. Speed and efficiency are issues, that's all, and that's what the TPU is designed to address.
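As a small illustration of that point, here is a sketch of a tiny neural network's inference pass in plain Python, running on whatever general purpose CPU you have. The weights are hand-picked (not trained) so that the network computes XOR; the names `dense`, `relu` and `xor_net` are just for this example and have nothing to do with how the TPU is programmed.

```python
def relu(values):
    """Rectified linear unit applied element-wise."""
    return [max(0.0, v) for v in values]

def dense(weights, biases, inputs):
    """One fully connected layer: a multiply-accumulate per weight."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]

# Hand-picked weights: two hidden units, one output unit.
W1 = [[1.0, 1.0], [1.0, 1.0]]
B1 = [0.0, -1.0]
W2 = [[1.0, -2.0]]
B2 = [0.0]

def xor_net(x1, x2):
    """Forward pass only, i.e. the inference stage."""
    hidden = relu(dense(W1, B1, [x1, x2]))
    return dense(W2, B2, hidden)[0]

for a in (0.0, 1.0):
    for b in (0.0, 1.0):
        print(a, b, xor_net(a, b))
```

The work is almost entirely the multiply-accumulate loop in `dense`. A real model does the same thing with millions of weights, which is where speed and efficiency become the issue.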
I've fallen off your lawn, and I can't get up.
This is likely another demonstration of "those who have the money, make more money."
Solar panels: You can save all kinds of money. If you can afford to install the system in the first place.
Investments: You can make all kinds of interest. If you have money to invest.
Toilet paper: You can save lots of money. If you buy it in bunches on sale. But if you can't spare the funds... your TP costs more than it does for the person with a few bucks to spare who buys it in bulk. They likewise have storage space for it, etc.
And so on.
Both the K80 and Haswell are a couple of generations old - I'd like to see the performance increase vs Pascal based GPU cards and whatever is the latest in the Intel camp.
What does the machine language for these things look like? Does anybody know of a bare-bones example to illustrate how it does a simple sample neural net? Is it only for the offset shifting kind of NNs common for language AI, or other kinds also?
Table-ized A.I.
>> Google's Custom Machine Learning Chips Are 15-30x Faster Than GPUs and CPUs AT MACHINE LEARNING
There, I fixed it for you.
See subject & nothing does it as efficiently as APK Hosts File Engine 9.0++ SR-7 32/64-bit https://www.google.com/search?hl=en&source=hp&biw=&bih=&q=%22APK+Hosts+File+Engine%22+and+%22start64%22&btnG=Google+Search&gbv=1/
Ads/script & malware rob speed/security/privacy
Hosts add speed (via hardcodes/adblocks), security (vs. bad sites/malware/poisoned dns), reliability (vs. dns down), & anonymity (vs. dns requestlogs/trackers).
Less power/cpu/ram + IO use vs. DNS/routers/addons/antivirus + less security bugs/complexity & faster vs. addons/routers/remote dns!
Avoids DNSChangers in routers/IP settings & dns redirects (99.999% of ISP DNS != patched vs. it) + lightens DNS load & resolves faster from local system RAM!
* Via what u NATIVELY have in the IP stack in FASTER kernelmode!
APK
P.S. - Safe https://www.virustotal.com/en/file/e01211ca36aa02e923f20adee0a3c4f5d5187dc65bdf1c997b3da3c2b0745425/analysis/1433430542/
See subject: Many like + use my work (quoted): I'm going to continue using the Host File Engine. Your software is well written, functional. The Host File Engine performs exactly as promised by mmell
his hosts program is actually pretty good by xenotransplant
I've never tried to belittle (APK's) work, I've flat out said it's good by BronsCon
APK is kinda right. I've tried his hosts file generating software. It works by bmo
I like your host file system by Karmashock
I find your hosts file admirable by vel-ex-tech
his hosts tool is actually useful for those cases in which one does indeed want to locally block stuff outright while consuming minimum system resources by alexgieg
* Recommended & hosted by Malwarebytes' hpHosts!
APK
P.S.=> No one's as big a BLOWHARD as you UNIDENTIFIABLE cowardly troll! Thanks 4 proving I'm winning when all u have's stalking me & downmodding my posts yet not proving me technically wrong... apk
(Disclaimer, not an AI or machine learning expert but interested in learning!)
So will this chip (or board) be available outside of Google? I've heard they've released (some of) their AI/machine learning code, so it would be good if, once you made a working application, you could buy one of these things and speed it up. Would be especially useful for applications where access to the cloud was unavailable or intermittent at best (think self driving cars, drones, spacecraft).
I guess a PCI card that would go in a server would be best, but maybe a dedicated peripheral could work.
Any other companies working on similar hardware? Are there any standards, like OpenGL for AI?
is faster at that task than a device designed for something else!
THIS IS NEWS THAT MATTERS?!?!?
How much?
Your "YOUS" gave it away & how'd your words taste as you EAT THEM (trying to impersonate me Bob) https://politics.slashdot.org/comments.pl?sid=10458715&cid=54192877/ ?
* Bit like your FOOT IN YOUR MOUTH ramming them back down your chicken-neck throat & washing them down w/ the bitter taste of SELF-defeat? Yes... lol!
APK
P.S.=> Eating your words isn't GOOD nutrition Bob the superWEASEL (hopefully you'll die of malnutrition)... apk