Tiny, Blurry Pictures Find the Limits of Computer Image Recognition (arstechnica.com)
A new PNAS paper takes a look at just how different computer and human visual systems are. Humans can figure out that a mangled word "meant" something recognizable while a computer can't. Likewise with images: humans can piece together what a blurry image might depict based on small clues in the picture, where a computer would be at a loss. The authors of the PNAS paper used a set of blurry, tricky images to pinpoint the differences between computer vision models and the human brain.
They used pictures called "minimal recognizable configurations" (MIRCs) that were either so small or so low-resolution that any further reduction would prevent a person from being able to recognize them. The computer models did a better job after they were trained specifically on the MIRCs, but their accuracy was still low compared to human performance. The reason for this, the authors suggest, is that computers can't pick out the individual components of the image whereas humans can. This kind of interpretation is "beyond the capacities of current neural network models," the authors write.
They used pictures called "minimal recognizable configurations" (MIRCs) that were either so small or so low-resolution that any further reduction would prevent a person from being able to recognize them. The computer models did a better job after they were trained specifically on the MIRCs, but their accuracy was still low compared to human performance. The reason for this, the authors suggest, is that computers can't pick out the individual components of the image whereas humans can. This kind of interpretation is "beyond the capacities of current neural network models," the authors write.
This story is rather lacking without a single example of what they're talking about.
Can't you just press enhance to make them perfect again? I saw it on CSI...
Simply use google to predict what they meant to say.
The explanation is simple: context. We humans have many context information on our brains, very useful to infer knowledge from a wide range of noisy inputs (such as blurry pictures). If we train a computer to identify some aspects of blurry images *within specific context*, the computer will do a decent job.
Wonderful. Now I'll be forced to look at really time blurry pics every time I want to do anything on the web.
This seems to be the project the article is talking about: http://www.wisdom.weizmann.ac.il/~dannyh/Mircs/mircs.html
Ah, arrogance and stupidity, all in the same package. How efficient of you. -- Londo Mollari
Jerry Lettvin, in the 1960's did experiments on single optical nerve cells that showed how the retina itself enhances and discovers edges. Human vision is not a "pixel image", it's based on collecting and amplifying *edges* and differentials. Until the computer processing and the cameras, themselves, used for computer vision get this built in at the most basic levels of the CCD and immediate processing, a great deal of the most critical data is thrown out before any more sophisticated ""computer brain" can apply its algorithms.
The problem is with the algorithms used, not the capabilities of computers. If done right, for this specific task at hand, a computer would beat a human every time. For example, a computer looking at a 2x2 pixel square image of a letter could compare it against what it knows every character scaled down to 2x2 looks like under various scaling algorithms, the brightness levels of the four available pixels, and tell you with very high accuracy what it's looking at. A human, on the other hand, would have no clue.
computers can't pick out the individual components of the image whereas humans can
Yet in the realms of traditional machine vision the trick was employed long ago. When the deep networks came in the scene they discovered the same mechanism without assistance. Confusing, erroneous summaries be damned!
What I wonder is if is possible to create a computer virus that screws the machine so much that besides be used to host cat porn, but also make people sick, like a cancer machine or something.
when I was 13 and I liked it!
Hi! I make Firefox Plug-ins. Check 'em out @ https://addons.mozilla.org/en-US/firefox/addon/youtube-mp3-podcaster/
Uh-huh-huh-huh it says PNAS.
Just zoom and enhance.
Computers are not intelligent. People are. Most. Many. Okay, some. Okay, me!
To recognize a bald eagle, I don't need to be fed with a million eagle pictures; show me one flying eagle once or may be two or three times, it's done. Human visual cortex must be using ways of rotating a 3-D object and projecting how the object will appear if viewed in 2-D from different angles; also it can do simple scaling (bigger/smaller); and how color changes can affect [grey-scale/color].
Machine learning takes a million eagle pictures and does something of a curve-fitting to know how far a new point is from current cluster of points. It has no idea of 3-D objects/what effect a rotation could do; etc. In this case it's like a brute-force method versus a more sophisticated algorithm.
also a human will try to match the shown picture to whatever set of objects he knows already. e.g. a 3 year old who has learnt only say first 10 alphabets, if shown P may say it's D.. 'coz to him/her the closest match is D. computer vision may not do this.. because with a training set size running into millions.. potentially every class will claim a hit.
What the heck? I thought all you have to do is zoom and click 'enhance' and a computer can make a reasonably clear picture no matter how blurry the original.
Have gnu, will travel.
This human's neural network detected zoomed in faces with black bars covering their eyes, presumably out of shame, while they struggle with and guzzle a long, pinkish-red, penis-shaped object in their mouth.
PDF (Page 2): http://www.pnas.org/content/ea...
Excuse me for a moment.
They can only use it for 30 days before paying up to Khaled Mardam-Bey.
Computers are not intelligent. People are. Most. Many. Okay, some. Okay, me!
Nice try, APK, but you lost that argument five years ago.