Face Recognition - Real or Science Fiction?

Posted by ryuzaki0 on Wednesday October 25, 2006 @02:42AM from the oops-i-did-it-again dept.

An anonymous reader writes "Facial recognition software has been touted as one of the technologies that will change our future, particularly in law enforcement. How close are we to being recognized by a computer anywhere we go, as portrayed in movies like Minority Report? According to the industry's recent public relations releases, these products are closer than we think. The reality, though, is that current products work only when utilizing a small comparative sample, and any attempt by an individual to disguise themselves typically throws off the results. To see how far this technology needs to go before becoming mainstream, one site utilized government-tested face recognition software, available freely through MyHeritage.com, to compare hundreds of famous people, animals, and cartoons to a database of 2,000 celebrities. Some of the results showed promise for the technology, but most were just funny: for example, who would mistake Barbara Streisand for Shrek, or Lance Bass of N'Sync for a Teletubby?"

29 of 202 comments

trick question by Lurker2288 · 2006-10-25 02:44 · Score: 5, Funny

"who would mistake Barbara Streisand for Shrek, or Lance Bass of N'Sync for a Teletubby?"

I think it's more a question of 'how many beers' than of 'who.'
1. Re:trick question by WindBourne · 2006-10-25 03:36 · Score: 3, Insightful
  
  Or of what computer program, camera, and lighting you have. And add facial hair (perhaps on all), cosmetics (again on all), or even haircuts. Basically, it will always fail on those who do not want to be recognized. But down the road (20 years), it will work well on those who are not suspecting it, i.e. it will be a good way to track down regular citizens when the government is granted the power to grab whoever they want and attribute it to, say, terrorism. Fortunately, we are a long way from that. Or are we?
  
  --
  I prefer the "u" in honour as it seems to be missing these days.
2. Re:trick question by slysithesuperspy · 2006-10-25 10:49 · Score: 2, Funny
  
  I've never seen them in the same place at the same time. Hmm...
I've heard this for years by 2.7182 · 2006-10-25 02:45 · Score: 5, Interesting

After working in computer vision for 5 years I've realized that most problems aren't hard - they are not well defined. Mathematically face recognition is not a problem that can be stated.

Many other problems in CV are like this - edge detection, segmentation, etc. But people write hacks that work in restricted conditions and say they've solved them.

And look, you could always just put on those Groucho Marx glasses.
1. Re:I've heard this for years by PieSquared · 2006-10-25 03:08 · Score: 5, Interesting
  
  I think the real problem is what it looks at. The shape of your face is what it looks at. What if you put a little clay or really thick makeup around your jaw and cheek bones to change your visible facial structure... and of course facial hair can be shaped to look like pretty much anything is under it without even adding anything artificial to your face. And of course, you'll need multiple frames of reference with a wide angle between them to get any useful information anyway... you can't really judge depth from a single frame, and if you try, a little eyeshadow will throw it off. I can only see facial recognition as proving that you aren't someone smaller than you are, not that you are a specific person. And of course you could always get one of those masks from Mission: Impossible! Yea, that's what I thought!
  
  --
  Does a line appended to your comment give your post meaning in and of itself, or only in relation to those without?
2. Re:I've heard this for years by Illserve · 2006-10-25 03:48 · Score: 4, Insightful
  
  Many other problems in CV are like this - edge detection, segmentation, etc. But people write hacks that work in restricted conditions and say they've solved them.
  
  Having worked in brain science for years I can say that the brain itself is a collection of hacks.
  
  It's just a very huge collection that covers all of the bases that we find ourselves in from day to day. Put a brain in a situation it's not designed to handle and it breaks down just as badly as many artificial CV algorithms do.
3. Re:I've heard this for years by john83 · 2006-10-25 04:15 · Score: 2, Funny
  
  That's just what I need: Tesco X-raying my head every time I walk into the store to direct me to specials on items I might want. It'll break as soon as I develop super-powers from all the X-rays though.
  
  --
  Strange women lying in ponds distributing swords is no basis for a system of government.
4. Re:I've heard this for years by perlchild · 2006-10-25 05:37 · Score: 2, Insightful
  
  Are skulls that unique? I was sure the uniqueness was contributed to by not just bone structure, but muscle structure, cartilage (the nose) and pigmentation. In fact, do we know for sure that no two faces are identical? Until facial recognition can tell fraternal twins apart better than a human can, perhaps we shouldn't put it in mission-critical environments, should we?
5. Re:I've heard this for years by bzipitidoo · 2006-10-25 05:39 · Score: 2, Interesting
  
  I also worked on a project to compare images. The idea was general purpose, not just for faces only. Unfortunately, it didn't really work. I took the woefully slow prototype and rewrote it so it worked correctly, and was far faster. And all that did was help destroy their illusions and delusions that it was going to work. Before, they could be optimistic because it wasn't fast enough to do hundreds of tests, and so they were able to point to extremely small sets of data upon which it had apparently mostly worked. Progress of a sort. Meanwhile, potential sources of more funding were not interested in the general purpose. Some wanted facial recognition for security uses, and another group wanted pr0n detection for censorship uses.
  I don't know why facial recognition is so sexy and hot right now. Quite a few ventures claim to have something that works, but when you dig into the few details they provide, you find a lot of vapor. Some sort of do work, but they need lots of help. The faces have to be positioned, scaled, and oriented the same. Some employ methods for handling that automatically, and some lean on people to do that. The lighting has to be just so. Pretty much every method must be trained. And if those difficulties aren't enough, the methods have to be fast. Of course we'd like them to be always right, but even after all the help, a method is doing awesome if it's right 90% of the time.
  Currently, people are far, far better at this problem, and we're still fooled quite often, otherwise no one would bother with disguises. But some expect computers to do better. Check a photo of a person against a database of a _million_ photos to see if there's a match? Could people do that? I doubt it. Facial recognition by computer is fantasy, and probably will remain fantasy for many years yet.
  "Do you see anything there?"
  I looked at the broad plumed hat, the curling love-locks, the white lace collar, and the straight, severe face which was framed between them. It was not a brutal countenance, but it was prim, hard, and stern, with a firm-set, thin-lipped mouth, and a coldly intolerant eye.
  "Is it like anyone you know?"
  "There is something of Sir Henry about the jaw."
  "Just a suggestion, perhaps. But wait an instant!" He stood upon a chair, and, holding up the light in his left hand, he curved his right arm over the broad hat and round the long ringlets.
  "Good heavens!" I cried in amazement.
  The face of Stapleton had sprung out of the canvas.
  "Ha, you see it now. My eyes have been trained to examine faces and not their trimmings. It is the first quality of a criminal investigator that he should see through a disguise."
  "But this is marvellous. It might be his portrait."
  "Yes, it is an interesting instance of a throwback, which appears to be both physical and spiritual. A study of family portraits is enough to convert a man to the doctrine of reincarnation. The fellow is a Baskerville--that is evident."
  
  --
  Intellectual Property is a monopolistic, selfish, and defective concept. It is "tyranny over the mind of man"
6. Re:I've heard this for years by lawpoop · 2006-10-25 05:41 · Score: 3, Informative
  
  Brain hacks seem to be fundamentally different than computer hacks. Or, the brain seems to have a collection of hacks that we have almost no understanding of, in addition to the hacks that we do understand.
  
  Ever since the advent of solid state electronics, it was said to be only a matter of time before robots would be sweeping, washing dishes, performing surgery, etc.
  
  Things that we think are really simple, that even retarded people can do, like recognize a face or a voice, understand speech, move bipedally with grace (hell, with any number of legs -- 2, 4 or 6), pour a glass of water, etc. are *hard* for robots and AI. We don't even have a model for how these things work. Even really dumb animals like turkeys can run through their environments and successfully hunt and catch flying insects.
  
  We do have robots that are getting good with articulation, like Asimo, but we still aren't sure whether they are using the same 'tricks' that organisms use. That is to say, they are a solution to the problem of bipedal motion, but we don't know if they are the same solution that the human mind is. I'm not sure that we have even a model of what solutions organisms use.
  
  Meanwhile, things that we think are difficult, like playing chess, factoring polynomials, or other kinds of difficult math, are easy for a computer. Now we know that the brain can do complex math like trigonometry in order to accomplish tasks like catching a ball, but that doesn't help the average person play chess or do complex math on paper. However, the average person excels at these hard AI problems, like having a conversation or pouring a glass of water.
  
  --
  Computers are useless. They can only give you answers.
  -- Pablo Picasso
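A rough worked example of the million-photo check questioned in the replies above. The per-comparison false match rate used here (1 in 10,000) is a hypothetical figure chosen purely for illustration, not a measured property of any product, and the comparisons are assumed to be independent. The function names are invented for this sketch.

```python
def expected_false_matches(gallery_size, false_match_rate):
    """Average number of wrong 'hits' when one probe photo is
    compared against every photo in the gallery."""
    return gallery_size * false_match_rate


def prob_at_least_one_false_match(gallery_size, false_match_rate):
    """Chance that at least one gallery photo is wrongly matched,
    assuming each comparison fails independently."""
    return 1.0 - (1.0 - false_match_rate) ** gallery_size


GALLERY = 1_000_000   # "a database of a million photos"
FMR = 1e-4            # hypothetical: 1 false match per 10,000 comparisons

print(expected_false_matches(GALLERY, FMR))
print(prob_at_least_one_false_match(GALLERY, FMR))
```

Under those assumed numbers, a single search would be expected to turn up on the order of a hundred wrong candidates, so even a matcher that looks accurate one pair at a time hands a human a long list to sort through.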
recognized by AcidLacedPenguiN · 2006-10-25 02:46 · Score: 3, Interesting

This is all well and good, but the minute I get falsely identified as a criminal just for being in the bar district late at night in the wrong place/wrong time I won't be too happy. . .

--
disclaimer: I've been known to store numbers in my ass for which to dig out when quantities are required.
so I guess... by theStorminMormon · 2006-10-25 02:46 · Score: 2, Insightful

So I guess next time a teletubby or Shrek wanders through a mall, they're totally going to throw off the face-recognition software.

Is it just me, or does that seem like a stupid way to test the software? If you want to show that rudimentary disguise is an easy way to get around it, that's valid, but just messing with the sample of potential matches by throwing in cartoon characters destroys the validity of the "study".

-stormin

--
The Southern Baptist Convention has creationism. On Slashdot, we have porn.
But I thought by xirtap · 2006-10-25 02:47 · Score: 3, Informative

I thought they used chips in the eyes of people in Minority Report, not face recognition.
1. Re:But I thought by john83 · 2006-10-25 02:52 · Score: 3, Informative
  
  I thought they used chips in the eyes of people in Minority Report, not face recognition.
  Retina recognition, I think.
  
  --
  Strange women lying in ponds distributing swords is no basis for a system of government.
MyHeritage site by LoverOfJoy · 2006-10-25 02:48 · Score: 4, Informative

I've tried out the software and it was fun for some laughs. I'm not sure how it works exactly, but I can tell that the angle of the face makes a difference. When I put in one picture of myself where I'm looking ever so slightly to the right, I'm matched with photos of celebrities looking in that direction. When I put in a similar photo facing the other direction, I get a different set of celebrities looking in the other direction. There are a few overlaps, and those are the ones I think I look the most like (although it's a stretch to say I have anything that could pass as a celebrity look).
1. Re:MyHeritage site by Wubby · 2006-10-25 03:05 · Score: 2, Insightful
  
  I saw the same thing. Also, if the person in the image is doing something with their face (smiling, open mouth, wide eyes) it tends to match with images of people doing the same thing. Kinda simplistic, more like a trick than a tool.
  
  --
  Sig
  Appended to the end of comments you post. 120 chars
Inevitable. by Lethyos · 2006-10-25 02:49 · Score: 5, Insightful

Not to nitpick excessively, but you could easily substitute portions of this article with terms like (and relating to) “Internet”, “personal computer”, “telephone”, “car”, and others. Asking ourselves if a technology is “real or science fiction” when it already exists (albeit in a primitive form) is silly. Of course it exists; the question itself cites examples. Perhaps the meaningful questions might be along the lines of: “what are the challenges associated with making it accurate?” or “what impact will facial recognition have on society?”

--
Why bother.
Legal hoops by solevita · 2006-10-25 02:52 · Score: 2, Interesting

I'm wondering about the legality of all this, especially in a criminal justice system. My DNA, for example, can't be used in court as evidence unless certain hoops have been jumped through; the prosecutor needs a reason to obtain a DNA sample and then procedures must be followed.

I wonder if the same systems will apply to a computer-analysed image of my face; will there be criteria for when this image is admissible in court? Will I have rights concerning my image? Or are we just going towards a 1984-style system? Interesting, because this hasn't been the result of DNA admissions to court, despite the seemingly more robust nature of this evidence.
1. Re:Legal hoops by Quadraginta · 2006-10-25 03:17 · Score: 3, Insightful
  
  Using a computer-captured image of your face in Court would presumably come under the same rules as using a photograph of your face. More or less, if you appear in public, your image can be used.
  
  The more interesting question, I suggest, is whether a computer recognition of your face is going to be in any way equivalent to a human recognition of your face.
  
  For example: if you stroll into a 7-Eleven, and the donuphage with a badge sitting there swilling coffee thinks you look like a famous bank robber whose mug has been circulated by the FBI, then he's entitled to take you into custody, and search you (for his own safety and those nearby, et cetera). If he finds half a gram of coke on you, you're in trouble. Now suppose it isn't the cop's eye/brain combination that "recognizes" you as a bank robber, but rather his shoulder-mounted camera/computer combination. Is he still entitled to act in the same way?
  
  You can argue it both ways: (1) the camera/computer is almost certainly always going to be worse at this kind of thing than the eye/brain. Recognition is about the single most important thing our eyes and brains do, and they are highly optimized for it by natural selection. If it could be done better and faster, we would do it. So, we should trust the camera/computer less. But (2) the camera/computer is not subject to the vagaries of human psychology, mood, et cetera. The cop may take you in unreasonably because he doesn't like your skin color or length of hair; the camera/computer isn't subject to the same prejudices. So maybe it's better to trust the mindless device.
2. Re:Legal hoops by B3ryllium · 2006-10-25 03:30 · Score: 4, Insightful
  
  or, maybe it's better to not carry a half-gram of coke on you.
The miracle of technology by Lisandro · 2006-10-25 02:52 · Score: 2, Funny

For example, who would mistake Barbara Streisand for Shrek, or Lance Bass of N'Sync for a Teletubby?

So, I see it's working correctly!
Here to Stay by DumbSwede · 2006-10-25 03:11 · Score: 4, Insightful

I believe Minority Report used retina scans, but that nit aside, facial recognition works to a degree and will only get better. Security cams will eventually upgrade to HDTV resolutions, perhaps augmented with very high resolution stills when a potential match is made. This will all take more processing power, but almighty god Moore will eventually give us this day our daily CPU load.

About false positives. So what? Eyewitnesses make mistakes also. Eventually, perhaps very soon, machines will surpass humans in this arena just as they have in others. Can anyone here on Slashdot defeat Deep Blue at Chess?

As to the legality or ethics, what can be done will be done, at least in public areas. If it would be legal for a human to do (they haven't outlawed humans scanning for suspects in public areas) then it will be legal for machines to do despite the unease many will feel knowing they are constantly being watched.

--
Letter To Iran
Belittling technology by MobyDisk · 2006-10-25 03:15 · Score: 2, Interesting

Some of the results showed promise for the technology, but most were just funny -- for example, who would mistake Barbara Streisand for Shrek, or Lance Bass of N'Sync for a Teletubby?"
That's just trolling. The software was instructed to find the celebrity who most closely matched a cartoon character. It didn't mistake anyone for a cartoon character. And since cartoon characters are not within the scope of what the software is for, it shows that it worked better than expected. Attempts like this to belittle the success of the technology are akin to Ad Hominem attacks, and have no merit in a discussion.
re: hacks and restricted conditions by King_TJ · 2006-10-25 03:23 · Score: 4, Interesting

But don't we almost always get a computer to solve a problem that's not strictly a mathematical one using "hacks that only work in restricted conditions"?

Our spell-checkers in our word processors don't actually know anything about the rules of a language, phonics, etc. They just do lookups from a dictionary. If a word's not listed, it has no idea if it's spelled properly or not -- even if the misspelling is one that's simply not a possible correct sequence of letters for the language. Most don't even realize if a word is misspelled in the context of the sentence, as long as it matches a correct spelling in the word list.

Until we figure out how the human brain recognizes faces as individuals, we can't expect anything *but* a clever hack for a computer to do the same. And truthfully, I suspect the human brain takes many things into account to do a "recognition" on a person. How often do you see somebody in the store that you're pretty sure you know from a previous job, school, etc. but you're not quite sure? I've had this happen a few times, and to make a better determination, I had to take other factors into account, like the sound of their voice if I heard them speak, the way they walked, or maybe an expression that came across their face. Humans "key in" on specific things that help them remember a person. And depending on which "features" they chose, they may or may not be effective. (Say you remember a gal really well because of her long, flowing hair? If she cuts it real short, there's a good chance you won't recognize her at all anymore if she walks by you.)
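The dictionary-lookup spell-checker described in the comment above can be sketched in a few lines. This is a toy illustration, not how any particular word processor works: the tiny word list and the function name are made up for the example.

```python
# Toy word list; a real spell-checker ships a far larger one.
WORD_LIST = {"their", "there", "dog", "is", "over", "the", "a"}


def unknown_words(sentence):
    """Return the words in `sentence` that are not in WORD_LIST."""
    words = [w.strip(".,!?\"'").lower() for w in sentence.split()]
    return [w for w in words if w and w not in WORD_LIST]


# A non-word is flagged only because it is missing from the list...
print(unknown_words("The dgo is over there."))   # ['dgo']
# ...but a real word used in the wrong context passes unnoticed.
print(unknown_words("The dog is over their."))   # []
```

The checker has no notion of grammar or context, which is the "hack that only works in restricted conditions" the comment is pointing at.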
Re:GA guided NN's by MBCook · 2006-10-25 03:29 · Score: 2, Interesting

The problem is the inputs. Do you input sets of geometry (eyes are X" apart, at an angle of 0.53 degrees, chin is .5" below lips, blah blah blah), the raw image, or something else? If you use the raw image, you'd need a system in the front end to scale/rotate the images to be in about the same place, otherwise you probably have no chance (unless you want your neural net to do that TOO, which would make training harder and take longer).
Even if you use geometry (we have a vague understanding of what makes people look similar or beautiful) you'll still run into problems. You have problems of perspective (not all pictures are taken straight on).
Garbage in, garbage out. The best solution is to provide tons of information and let the neural net sort out what matters and what doesn't (they are quite good at that) but that will require more training which means more time.
So in the end you may build a good system. But to use it you must provide it with geometry of a face that someone picks out after fixing the perspective on a photo. Or it works much like our brains and accounts for all that, but it will take you 6 years of non-stop training alone.
And what is a success? Two people who look similar? A perfect match? What if your software rates a picture of a celebrity impersonator (looking like the celebrity) over a picture of that celebrity looking different (movie role, disheveled mugshot, etc)? Is that a success?
And how do you rate the people for the training input? Sure a neural net can figure out the way to something where we know the end, but what about when we don't quite know the end?
It probably took evolution a VERY long time to get good at recognizing individuals. And even then, we are not that great (mistaken identity, all cocker spaniels look alike until you spend more time with them, etc).
It's a neat problem, but it is seriously tough even with the "voodoo magic" that a neural net would provide over trying to come up with a straight formula.

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.
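The front-end "scale/rotate" step mentioned in the comment above might look something like the sketch below, assuming the two eye centres have already been located somehow (by hand or by a separate detector). The function names, the choice of the eyes as landmarks, and the 60-pixel target spacing are all assumptions made for illustration; perspective correction is not handled.

```python
import math


def eye_alignment(left_eye, right_eye, target_distance=60.0):
    """Return (angle_degrees, scale) that would level the line between
    the eyes and bring them `target_distance` pixels apart."""
    dx = right_eye[0] - left_eye[0]
    dy = right_eye[1] - left_eye[1]
    angle = math.degrees(math.atan2(dy, dx))      # tilt of the eye line
    scale = target_distance / math.hypot(dx, dy)  # zoom factor needed
    return angle, scale


def align_point(point, left_eye, angle_degrees, scale):
    """Map an image point into the normalized frame: shift so the left
    eye is the origin, rotate by -angle, then scale."""
    theta = math.radians(-angle_degrees)
    x = point[0] - left_eye[0]
    y = point[1] - left_eye[1]
    xr = x * math.cos(theta) - y * math.sin(theta)
    yr = x * math.sin(theta) + y * math.cos(theta)
    return (xr * scale, yr * scale)


left, right = (100.0, 100.0), (130.0, 140.0)
angle, scale = eye_alignment(left, right)
# After alignment the right eye sits level with the left one,
# target_distance pixels away.
print(align_point(right, left, angle, scale))
```

Geometry measured after such a normalization is at least comparable between photos, which is the precondition the comment says a raw-image approach would otherwise have to learn on its own.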
Must be science fiction... by __aaclcg7560 · 2006-10-25 03:33 · Score: 2

Every morning I wake up to look into the mirror and it's a different face that I don't recognize. Maybe I need to upgrade my mirror?
Initially, cameras would be spaced like eyes by benhocking · 2006-10-25 03:44 · Score: 2, Interesting

If I was training to match V1-4, I'd have the input come from two "eyes" with inputs similar to what our eyes actually provide to our brain. We know quite a bit about visual cortex, but there's a lot we don't know. Initially, I'd train it using a batch of photographs for a single person (we'll call her "Momma") and then I'd train with a few others (where a match is a match only if it's the same person). From there, I'd create histograms of parameter settings that seem to do an adequate job on this small set, and then use this reduced parameter space to create populations that are evaluated after training on millions of photographs. (The photographs can be placed in front of the eyes - once for each photograph, mind you, and not for each "individual" being tested - just like we can recognize photos and not just people.)

I could imagine narrowing the parameter space down to 100 or so unknown parameters, and each training session might take several hours. Given enough resources (e.g., the Pittsburgh Supercomputer Center), I'd run population sizes of 500 or so (in parallel), so that you could possibly go through 4-5 generations per day. In a month, you might have some pretty good individuals. Of course, my research area is the hippocampus and not the visual cortex, so it might take significantly more than 100 parameters to even begin to set this up.

Now, someone else pointed out that such computers would not have the biases that we humans have, but that's not necessarily true. If you train the computer using an input set of 950,000 "white" people and 50,000 "black" people, it would tend to make the mistake of thinking that "black" people look a lot like each other. (Studies done with speech recognition have shown that neural networks trained on Japanese have a much harder time telling "l" from "r" than those trained on English.)

--
Ben Hocking
Need a professional organizer?
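The training budget in the comment above can be checked with simple arithmetic. The population size (500) and the 4-5 generations per day come from the comment; treating "a month" as 30 days and the function name are assumptions of this sketch.

```python
def month_budget(generations_per_day, population=500, days=30):
    """Return (hours per generation, generations in the period,
    total individual training runs in the period)."""
    hours_per_generation = 24 / generations_per_day
    generations = generations_per_day * days
    training_runs = generations * population
    return hours_per_generation, generations, training_runs


for per_day in (4, 5):
    hours, generations, runs = month_budget(per_day)
    print(f"{per_day}/day: {hours:.1f} h per generation, "
          f"{generations} generations, {runs} training runs")
```

So "several hours" per training session is consistent with 4-5 generations a day, and a month on those terms amounts to roughly 120-150 generations, with each generation's 500 individuals trained in parallel.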
retinal scanning by Jack+Sombra · 2006-10-25 03:49 · Score: 2, Informative

"How close are we to being recognized by a computer anywhere we go, as portrayed in movies like Minority Report?"
Now I could be wrong, but I am pretty sure Minority Report was portraying retinal scanning, not facial recognition.
Mathematical rigor is a part of vision by AndOne · 2006-10-25 06:13 · Score: 3, Insightful

I'm afraid I'm going to call shenanigans on some of this. I've been doing vision work for about 5 years now with a hefty dose of image and signal processing in the mix (working as a grad student in the field right now, in fact).

Edge detection is well defined. The Canny and Shen-Castan (think that's the name) detectors are about as close to a mathematically optimal edge detector as one can get. There is in fact a well-developed body of theory regarding differentiation of signals. The problem doesn't lie in the mathematical models involved. It lies in how many people want to use those models. Edge detection suffers from spurious edges or edge flakes, which are a symptom of noise in the signal at differentiation (i.e. differentiation enhances noise, integration smooths it).

Segmentation can also be well defined; you just have to be clear on what it is you're segmenting. Are you working in a color space, texture, motion? That matters. However, you can get some very good results in these fields. See GPCA techniques for some examples of doing it. Or even modified PCA + EM or PCA + k-means (clustering theory). Again, very well defined.

Mathematically there are several models for face recognition. One can examine the ideas of eigenfaces (not my personal favorite, but it's there), kernel-based SSD-type approaches to find key points, partial face detection followed by recognition over a sequence of images used to reconstruct the face, and more. The problem isn't the math. It's that when you project a model you are essentially destroying an entire degree of freedom, which is a huge deal. Further, just as you can match a partial fingerprint or a partial ear print, you can match partial face chunks. The problem with makeup or facial hair comes when one relies on global matching techniques or uses only 2D information to do the matching.

Now I'll be the first to say that a lot of computer vision is a solution in search of a problem, or that people do use a number of cheap hacks and dirty tricks to get things working, but saying it's not mathematical is a lie. I can turn around and see at least 3 books at a glance that detail the mathematics that are a part of vision and image processing. So please don't confuse people's fuzzy use or lack of understanding of the math for there being no math.

Note: Machines are also bad at a number of tasks humans are really good at, but the same can be said in reverse: there are many tasks that humans are very bad at but that machines excel at. Absolute range detection is a good example. Humans are very bad at telling you the exact range to an object, even with some sort of scale reference in the scene. Computers, on the other hand (while suffering from noise in the signal), are still able to achieve significant accuracy depending on the range. You can see Tyzx for an example of a company that makes highly accurate stereo rigs. (They were around as of 2 years ago at least and I assume they're still going strong.) Cheers

--
I don't care what you say, all I need is my Wumpabet soup.
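The remark in the comment above that differentiation enhances noise while integration smooths it can be checked numerically. The sketch below is a one-dimensional toy, not an edge detector: a flat signal with Gaussian noise, a first difference standing in for differentiation, and a 5-sample moving average standing in for local integration. The seed, sample count, and window size are arbitrary choices made for the example.

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is repeatable

# A flat signal plus unit-variance Gaussian noise: there are no real
# edges here, so any "edge" response is spurious.
noise = [random.gauss(0.0, 1.0) for _ in range(10_000)]

# Differentiation: first difference between neighbouring samples.
diff = [b - a for a, b in zip(noise, noise[1:])]

# Local integration: 5-sample moving average of the same signal.
WINDOW = 5
smooth = [sum(noise[i:i + WINDOW]) / WINDOW
          for i in range(len(noise) - WINDOW + 1)]

var_noise = statistics.pvariance(noise)
var_diff = statistics.pvariance(diff)
var_smooth = statistics.pvariance(smooth)

# For independent noise, theory says the difference has about twice
# the variance of the input and the 5-sample mean about one fifth.
print(var_noise, var_diff, var_smooth)
```

This is why practical edge detectors smooth before they differentiate: the smoothing step trades some localization for fewer spurious responses.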