Google's Latest Machine Vision Breakthrough

Posted by Soulskill on Tuesday July 23, 2013 @05:10PM from the can-now-gauge-your-receptivity-to-ads-by-scanning-your-face dept.

mikejuk writes "Google Research recently released details of a Machine Vision technique which might bring high power visual recognition to simple desktops and even mobile computers. It claims to be able to recognize 100,000 different types of object within a photo in a few minutes — and there isn't a deep neural network mentioned. It is another example of the direct 'engineering' approach to implementing AI catching up with the biologically inspired techniques. This particular advance is based on converting the usual mask-based filters to a simpler ordinal computation and using hashing to avoid having to do the computation most of the time. The result of the change to the basic algorithm is a speed-up of around 20,000 times, which is astounding. The method was tested on 100,000 object detectors using over a million filters on multiple resolution scalings of the target image, which were all computed in less than 20 seconds using nothing but a single, multi-core machine with 20GB of RAM."

13 of 113 comments

Porn Collection by sycodon · 2013-07-23 17:21 · Score: 5, Funny

Can it sort and identify duplicates automagically in my porn collection?

--
When Fascism comes to America, it will call itself Anti-Fascism, and tell you to give up your guns.
1. Re:Porn Collection by Impy the Impiuos Imp · 2013-07-23 19:05 · Score: 5, Funny
  
  Can it sort and identify duplicates automagically in my porn collection?
  Sure! It sorted your stuff into these categories:
  400-lb. naked guys kissing
  Stuff reported to the NSA
  Someone's drawing of a dragon humping a car
  Taylor Swift
  Over 750,000 pictures in all!
  
  --
  (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
2. Re:Porn Collection by K. S. Kyosuke · 2013-07-23 20:37 · Score: 3, Funny
  
  Does it support boolean operators? So that, you know, you could find 400-lb. naked guys kissing Taylor Swift and similar material?
  
  --
  Ezekiel 23:20
20GB?? That's it??? by rosshalz · 2013-07-23 17:27 · Score: 5, Funny

"... using nothing but a single, multi-core machine with 20GB of RAM" Phew... here I was thinking it'd need some unrealistically high specs from my PC!!
1. Re:20GB?? That's it??? by Anonymous Coward · 2013-07-23 17:50 · Score: 3, Insightful
  
  This isn't 2005, 32GB in a workstation costs peanuts nowadays. Come out from under your rock.
  Cashews, maybe, but not peanuts.
Yeah, well by Anonymous Coward · 2013-07-23 17:58 · Score: 3, Funny

my cat can spot a Dentabite bag from across the room in 20 milliseconds, does that mean my cat has 20TB of RAM?
Re:Coming to mobile? by real-modo · 2013-07-23 18:12 · Score: 5, Informative

Wait, your phone can decode video?!? In real time, playing the movies at normal speed? How many kilograms does it weigh, and how long is the power lead? How big is the mortgage on it? (/socraticmethod)
The computer innovation process broadly goes like this: first algorithm sort-of works but is incredibly inefficient - tweaks on this - a rethinking of the whole approach that leads to massive speed-ups - further refinement - implementation of the algorithm in hardware, where it becomes just another specialized processor - everybody profits!
This article is about the third, or possibly fourth, phase of the process. If it works out, phase 5 is straightforward. By itself, phase 5 typically leads to two orders of magnitude increase in performance, three orders of magnitude decrease in power consumption, and two to four orders of magnitude decrease in cost.
Phases 6 and 7 happen if and when enough people find the provided service useful. (If technologies are no good, that's when only rich people have them. Successful technologies, everyone gets access to eventually.)
Re:Spatial Hashing by Anonymous Coward · 2013-07-23 18:56 · Score: 5, Informative

Yes, it's a breakthrough. It won the best paper award at this year's Conference on Computer Vision and Pattern Recognition, a tier 1 computer vision conference.
Hashing invariant properties in images isn't new, but banded winner-take-all hashing of histograms-of-oriented-gradient part filters and then using matches across those bands to identify a test feature's nearest neighbors, while simultaneously computing an upper bound or exact dot products of those test features with their nearest learned features, for up to 100,000 objects with small amounts of memory, is new.
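For anyone who wants to see what "banded winner-take-all hashing" means in practice, here is a minimal sketch in plain Python. It is not Google's implementation: the function names, the parameters (`num_hashes`, `k`, `band_size`) and the toy vectors are all made up for illustration, and the dot-product bounding step is left out entirely. The idea it shows is that each hash looks at `k` randomly chosen entries of a feature vector and records only which one is largest (an ordinal comparison, no multiplications), and that groups of those codes ("bands") are used as hash-table keys to vote for stored filters.

```python
import random
from collections import defaultdict


def make_permutations(dim, num_hashes, k, seed=0):
    # Hypothetical helper: each hash inspects k distinct random indices,
    # which is equivalent to the first k entries of a random permutation.
    rng = random.Random(seed)
    return [rng.sample(range(dim), k) for _ in range(num_hashes)]


def wta_hash(vector, permutations):
    # One code per permutation: the position (0..k-1) of the largest
    # value among the k selected entries. Only the ordering matters.
    return [max(range(len(p)), key=lambda i: vector[p[i]]) for p in permutations]


def bands(codes, band_size):
    # Group consecutive codes into tuples that can serve as dict keys.
    return [tuple(codes[i:i + band_size]) for i in range(0, len(codes), band_size)]


def build_index(filters, permutations, band_size):
    # Map (band number, band contents) -> ids of filters sharing that band.
    index = defaultdict(set)
    for fid, f in enumerate(filters):
        for b, band in enumerate(bands(wta_hash(f, permutations), band_size)):
            index[(b, band)].add(fid)
    return index


def candidates(feature, index, permutations, band_size):
    # Count, per stored filter, how many bands match the test feature.
    # Filters with many matching bands are the likely nearest neighbors.
    votes = defaultdict(int)
    for b, band in enumerate(bands(wta_hash(feature, permutations), band_size)):
        for fid in index.get((b, band), ()):
            votes[fid] += 1
    return dict(votes)


if __name__ == "__main__":
    perms = make_permutations(dim=8, num_hashes=6, k=3, seed=1)
    stored = [[5, 1, 9, 3, 7, 2, 8, 4], [1, 2, 3, 4, 5, 6, 7, 8]]
    idx = build_index(stored, perms, band_size=2)
    print(candidates(stored[0], idx, perms, band_size=2))
```

Because only the rank order inside each group of `k` entries is used, the codes do not change when the feature is rescaled or shifted by a monotonic transform, and only filters that collect votes would need any further scoring.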
Re:Coming to mobile? by 91degrees · 2013-07-23 18:57 · Score: 4, Funny

Phase 7 is profits. You obviously assumed phase 6 was "???".
Re:Can it find Waldo?... by pspahn · 2013-07-23 19:03 · Score: 3, Funny

Some years ago, I had an idea for a tool that would, in a nutshell, identify a plant simply from a photo and some metadata (time of year, geolocation, etc.). I know how it would work (and it would work), but I came to the conclusion that someone (i.e. Google) would use the methods to develop a tool that would do the same thing but for human faces.
It was at that point I decided to leave that box closed.

--
Someone flopped a steamer in the gene pool.
Re:Can it find Waldo?... by Anonymous Coward · 2013-07-23 19:14 · Score: 3, Informative

There are several not-too-creepy apps that can identify plant species by a smartphone-photo of a single leaf.
http://leafsnap.com/about/
They seem to request metadata directly via your phone's location and time-of-request (their server, not your phone, does the pattern-matching). Which is convenient, although it may place you at a time and place you may rather not be placed, for instance if burying pirate gold under a particular tree.
Re:Captcha's be gone? by Anonymous Coward · 2013-07-23 19:45 · Score: 5, Funny

So Captcha's will become even easier to crack? Great, the sooner we can get rid of them, the better. As it is they are getting impossible to read by humans, thanks to idiots who don't know how to design them.
But there's no need to get rid of them if we'll all have a handy browser plugin that can decode them for us at the press of a button!
Re:Coming to mobile? by faffod · 2013-07-24 00:30 · Score: 3, Insightful

Current mobile seems to cap out at 2GB of RAM. There is a reason for this - power consumption. RAM requires a continuous trickle of power to maintain state. An increase in RAM leads to a direct increase in power consumption. Mobile improvements are going to be focused on power consumption rather than raw power. Moore's law will be followed, but it will not result in something that is 2x more RAM, it will result in something that is 2x less power drain. Ok, I will grant you that it will probably be a mix - some increase in RAM, some increase in computation, but a significant increase in battery life.

To go from 2GB to 30GB following Moore's law would take 8 years. I contend that it will take longer than that because we won't see exact doubling of specs due to improvements in power. Either way, 10 years is far enough out that I think the summary claiming that this will come to mobile is far fetched for now.
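The eight-year figure can be checked with a quick back-of-the-envelope calculation. The sketch below assumes capacity doubles every two years; that doubling period is an assumption, since the comment does not state one, and the function name is made up for illustration.

```python
import math


def years_to_grow(start_gb, target_gb, years_per_doubling=2.0):
    # Number of doublings needed, times the assumed time per doubling.
    # years_per_doubling=2.0 is an assumed reading of "Moore's law".
    doublings = math.log2(target_gb / start_gb)
    return doublings * years_per_doubling


if __name__ == "__main__":
    print(round(years_to_grow(2, 30), 1))
```

Going from 2GB to 30GB takes a little under four doublings, so just under eight years at that assumed rate; a slower doubling period stretches the estimate proportionally.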