Google's Latest Machine Vision Breakthrough

← Back to Stories (view on slashdot.org)

Google's Latest Machine Vision Breakthrough

Posted by Soulskill on Tuesday July 23, 2013 @05:10PM from the can-now-gauge-your-receptivity-to-ads-by-scanning-your-face dept.

mikejuk writes "Google Research recently released details of a Machine Vision technique which might bring high power visual recognition to simple desktops and even mobile computers. It claims to be able to recognize 100,000 different types of object within a photo in a few minutes — and there isn't a deep neural network mentioned. It is another example of the direct 'engineering' approach to implementing AI catching up with the biologically inspired techniques. This particular advance is based on converting the usual mask-based filters to a simpler ordinal computation and using hashing to avoid having to do the computation most of the time. The result of the change to the basic algorithm is a speed-up of around 20,000 times, which is astounding. The method was tested on 100,000 object detectors using over a million filters on multiple resolution scalings of the target image, which were all computed in less than 20 seconds using nothing but a single, multi-core machine with 20GB of RAM."

25 of 113 comments (clear)

Min score:

Reason:

Sort:

Porn Collection by sycodon · 2013-07-23 17:21 · Score: 5, Funny

Can it sort and identify duplicates automagically in my porn collection?

--
When Fascism comes to America, it will call itself Anti-Fascism, and tell you to give up your guns.
1. Re:Porn Collection by Impy+the+Impiuos+Imp · 2013-07-23 19:05 · Score: 5, Funny
  
  Can it sort and identify duplicates automagically in my porn collection?
  Sure! It sorted your stuff into these categories:
  400-lb. naked guys kissing
  Stuff reported to the NSA
  Someone's drawing of a dragon humping a car
  Taylor Swift
  Over 750,000 pictures in all!
  
  --
  (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
2. Re:Porn Collection by K.+S.+Kyosuke · 2013-07-23 20:37 · Score: 3, Funny
  
  Does it support boolean operators? So that, you know, you could find 400-lb. naked guys kissing Taylor Swift and similar material?
  
  --
  Ezekiel 23:20
3. Re:Porn Collection by cristiroma · 2013-07-23 21:08 · Score: 2
  
  No, because it would go mad and overheat the CPU, looking through all that porn.
20GB?? That's it??? by rosshalz · 2013-07-23 17:27 · Score: 5, Funny

-"... using nothing but a single, multi-core machine with 20GB of RAM" Phew.. here i was thinking it'd need some unrealisticalll high specs from my PC!!
1. Re:20GB?? That's it??? by technix4beos · 2013-07-23 17:38 · Score: 2
  
  My current iMac has 32GB of RAM, so I don't see it as being too far fetched.
  
  --
  user@host$ diff /dev/urandom /dev/uspto
2. Re:20GB?? That's it??? by Anonymous Coward · 2013-07-23 17:50 · Score: 3, Insightful
  
  This isn't 2005, 32GB in a workstation costs peanuts nowadays. Come out from under your rock.
  Cashews, maybe, but not peanuts.
3. Re:20GB?? That's it??? by quadrox · 2013-07-23 18:21 · Score: 2
  
  My laptop came equipped with 8 GB RAM - I upgraded it two 32 GB after finding out just how cheap RAM has become nowadays. I must admit to rarely being able to come anywhere near filling this up, but that is a good thing in itself. I still have some ~200 megapixel panoramas I need to stitch with hugin, so maybe it will come in handy then.
4. Re:20GB?? That's it??? by 140Mandak262Jamuna · 2013-07-24 00:26 · Score: 2
  
  At work I have two machines 32 cores 256 GB (one linux, one windows) for regression testing, a 32 core 24 GB machine for development and a 16 core 16 GB machine for paperwork, like emails, Rally, presentations etc. The spec is actually on the low end for a professional. Heck, we order the most powerful graphics cards on headless workstations without display (to do massively parallel computations).
  
  --
  sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
Yeah, well by Anonymous Coward · 2013-07-23 17:58 · Score: 3, Funny

my cat can spot a Dentabite bag from across the room in 20 milliseconds, does that mean my cat has 20TB of RAM?
1. Re:Yeah, well by pspahn · 2013-07-23 19:00 · Score: 2
  
  And on the same note, a dog can predict where a ball is going to be when you bounce it off a wall, but that doesn't qualify it to go around processing physics simulations.
  
  --
  Someone flopped a steamer in the gene pool.
Captcha's be gone? by Suhas · 2013-07-23 18:03 · Score: 2, Interesting

So Captcha's will become even easier to crack? Great, the sooner we can get rid of them, the better. As it is they are getting impossible to read by humans, thanks to idiots who don't know how to design them.
1. Re:Captcha's be gone? by Anonymous Coward · 2013-07-23 19:45 · Score: 5, Funny
  
  So Captcha's will become even easier to crack? Great, the sooner we can get rid of them, the better. As it is they are getting impossible to read by humans, thanks to idiots who don't know how to design them.
  But there's no need to get rid of them if we'll all have a handy browser plugin that can decode them for us at the press of a button!
Re:Coming to mobile? by real-modo · 2013-07-23 18:12 · Score: 5, Informative

Wait, your phone can decode video?!? In real time, playing the movies at normal speed? How many kilograms does it weigh, and how long is the power lead? How big is the mortgage on it? (/socraticmethod)
The computer innovation process broadly goes like this: first algorithm sort-of works but is incredibly inefficient - tweaks on this - a rethinking of the whole approach that leads to massive speed-ups - further refinement - implementation of the algorithm in hardware, where it becomes just another specialized processor - everybody profits!.
This article is about the third, or possibly fourth, phase of the process. If it it works out, phase 5 is straightforward. By itself, step 5 typically leads to two orders of magnitude increase in performance, three orders of magnitude decrease in power consumption, and two to four orders of magnitude decrease in cost.
Phases 6 and 7 happen if and when enough people find the provided service useful. (If technologies are no good, that's when only rich people have them. Successful technologies, everyone gets access to eventually.)
Re:Coming to mobile? by real-modo · 2013-07-23 18:18 · Score: 2

Argh! There is no phase seven. Buffer overflow error.
Re:Can it find Waldo?... by lxs · 2013-07-23 18:24 · Score: 2

No but it can spy on you day and night.
Re:Spatial Hashing by Anonymous Coward · 2013-07-23 18:56 · Score: 5, Informative

Yes, it's a breakthrough. It won the best paper award at this year's Conference on Computer Vision and Pattern Recognition, a tier 1 computer vision conference.
Hashing invarient properties in images isn't new, but,
banded winner-take-all hashing of histograms-of-oriented-gradient part filters and then using matches across those bands to identify a test feature's nearest neighbors, while simultaneously computing an upper bound or exact dot products of those test features with their nearest learned features, for up to 100,000 objects with small amounts of memory, is new.
Re:Coming to mobile? by 91degrees · 2013-07-23 18:57 · Score: 4, Funny

Phase 7 is profits. You obviously assumed phase 6 was "???".
Re:Can it find Waldo?... by pspahn · 2013-07-23 19:03 · Score: 3, Funny

Some years ago, I had an idea for a tool that would, in a nutshell, identify a plant simply from a photo and some metadata (time of year, geolocation, etc). I know how it would work (and it would work), but I came to the conclusion that someone (ie. Google) would use the methods to develop a tool that would do the same thing but for human faces.
It was at that point I decided to leave that box closed.

--
Someone flopped a steamer in the gene pool.
Re:Can it find Waldo?... by Anonymous Coward · 2013-07-23 19:14 · Score: 3, Informative

There are several non-too-creepy apps that can identify plant species by a smartphone-photo of a single leaf.
http://leafsnap.com/about/
They seem to request metadata directly via your phone's location and time-of-request (their server, not your phone, does the pattern-matching). Which is convenient, although it may place you at a time and place you may rather not be placed, for instance if burying pirate gold under a particular tree.
Re:Coming to mobile? by cnettel · 2013-07-23 22:31 · Score: 2

"...might bring high power visual recognition to simple desktops and even mobile computers... computed in less than 20 seconds using nothing but a single, multi-core machine with 20GB of RAM."
Right... and by mobile computers you mean computers that I can lug from one desk to another.
Like the MacBook Pro Retina with 16 GB? The point of their approach seems to be lots and lots of RAM to do table lookups. The memory subsystem in a normal laptop is plenty fast for that. Bandwidth would be more of a problem than total space in a cellphone. If we had a compelling case for loads of RAM in a smartphone, it would be possible to design one without going wildly beyond current power or cost envelopes. A few more years of Moore and things will be fine.
Per object memory by gshegosh · 2013-07-23 22:38 · Score: 2

20GB per 100000 objects is 209kB per object. Don't know what resolution each image was, but I think 200kB is quite small.
Nice by nospam007 · 2013-07-24 00:04 · Score: 2

It would be nice if it could identify bird species (or other animals) preferably up to specific individual animals, like they do it with whales and penguins already.
I'd gladly pay money for such a program instead of getting only a free version, where I can check if aunt Mary with a drink in hand is in any photo in my collection.
We have already been waiting for years to get a program that can identify bird songs after shazaa came out, no luck yet, but hey, after all many towns have already a program that tells them: Somebody shot somebody with a .45, 0.23 miles in that direction, so there is still hope.
Re:Coming to mobile? by faffod · 2013-07-24 00:30 · Score: 3, Insightful

Current mobile seems to cap out at 2MB of RAM. There is a reason for this - power consumption. RAM requires a continuous trickle of power to maintain state. An increase in RAM leads to a direct increase in power consumption. Mobile improvements are going to be focused on power consumption rather than raw power. Moore's law will be followed, but it will not result in something that is 2x more RAM, it will result in something that is 2x less power drain. Ok, I will grant you that it will probably be a mix - some increase in RAM, some increase in computation, but a significant increase in battery life.

To go from 2GB to 30GB following Moore's law would take 8 years. I contend that it will take longer than that because we won't see exact doubling of specs due to improvements in power. Either way, 10 years is far enough out that I think the summary claiming that this will come to mobile is far fetched for now.
Re:Coming to mobile? by loufoque · 2013-07-24 00:58 · Score: 2

Surely you realize the video decoding on phones is done with dedicated hardware.
You could do it on the CPU though, the latest models (Galaxy S4 and all) should be powerful enough.