Is Google's AI-Driven Image-Resizing Algorithm Dishonest? (thestack.com)

← Back to Stories (view on slashdot.org)

Is Google's AI-Driven Image-Resizing Algorithm Dishonest? (thestack.com)

Posted by EditorDavid on Saturday November 19, 2016 @08:34AM from the infinitely-more-sizes dept.

The Stack reports on Google's "new research into upscaling low-resolution images using machine learning to 'fill in' the missing details," arguing this is "a questionable stance...continuing to propagate the idea that images contain some kind of abstract 'DNA', and that there might be some reliable photographic equivalent of polymerase chain reaction which could find deeper truth in low-res images than either the money spent on the equipment or the age of the equipment will allow." An anonymous reader summarizes their report: Rapid and Accurate Image Super Resolution (RAISR) uses low and high resolution versions of photos in a standard image set to establish templated paths for upward scaling... This effectively uses historical logic, instead of pixel interpolation, to infer what the image would look like if it had been taken at a higher resolution.

It's notable that neither their initial paper nor the supplementary examples feature human faces. It could be argued that using AI-driven techniques to reconstruct images raises some questions about whether upscaled, machine-driven digital enhancements are a legal risk, compared to the far greater expense of upgrading low-res CCTV networks with the necessary resolution, bandwidth and storage to obtain good quality video evidence.
The article points out that "faith in the fidelity of these 'enhanced' images routinely convicts defendants."

45 of 79 comments (clear)

Min score:

Reason:

Sort:

Wait, what? by Rei · 2016-11-19 08:41 · Score: 5, Interesting

People are using this sort of thing in court?
I think these is a very interesting field for consumer needs, but I have to agree, that's disturbing if they're allowing what... let's face it... is data made up by an AI that "looks right", to convict people.

--
Wingus, Dingus! Listen up!
1. Re:Wait, what? by MichaelSmith · 2016-11-19 09:02 · Score: 2, Insightful
  
  But they were guilty.
  
  --
  http://michaelsmith.id.au
2. Re:Wait, what? by knightghost · 2016-11-19 09:08 · Score: 4, Interesting
  
  Yes, they use it in court. I once watched a federal prosecutor use this and lie so blatantly to the court that his own (image) expert witness sued him for false representation. Yet the defendant was still convicted based almost entirely on that upscale image "evidence" and served several years in prison.
3. Re:Wait, what? by Oligonicella · 2016-11-19 09:49 · Score: 3
  
  It's bad because it's allowing a piece of software to become a witness. One you cannot ask questions of unless you want to force the importing of the whole development team for each trial. Just having "working knowledge" of how the software functions is insufficient.
  And because "I extrapolated from other cases what the defendant should look like" wouldn't go over well if given by a human expert.
  "so it is possible" - Same extrapolation from scant information.
4. Re:Wait, what? by msauve · 2016-11-19 10:50 · Score: 1
  
  But, CSI .
  
  --
  "National Security is the chief cause of national insecurity." - Celine's First Law
5. Re:Wait, what? by msauve · 2016-11-19 11:17 · Score: 1
  
  "Why is that a bad thing?"
  
  Because it is manufacturing evidence from whole cloth.
  
  --
  "National Security is the chief cause of national insecurity." - Celine's First Law
6. Re:Wait, what? by rtb61 · 2016-11-19 12:07 · Score: 1
  
  So should a person be prosecuted for one hair follicle considering http://www.webmd.com/skin-prob.... Keep in mind that means 365,000 per year you scatter around for which you are now legally liable. So exactly for how long can DNA be recovered from a hair follicle, after you lose it.
  
  --
  Chaos - everything, everywhere, everywhen
7. Re:Wait, what? by Rei · 2016-11-19 12:42 · Score: 1
  
  Indeed. It's the digital equivalent to an artist looking at a vague picture and painting in details onto it.
  
  --
  Wingus, Dingus! Listen up!
8. Re:Wait, what? by Anonymous Coward · 2016-11-20 01:07 · Score: 1
  
  The article links to http://www.crime-scene-investigator.net/admissibilitydigitaleveidencecriminalprosecutions.html, which has summaries of dozens of cases where 'enhanced' images were admitted as evidence. Given that there seems to be a pretty high standard for evidence (and the fact it wasn't developed for forensic techniques), I think the article is exaggerating the likelihood of google's algorithm being admitted. This could be a good way to get sketches of suspects, starting with security footage or something similar.
  Also, all the sample images look the same. I suspect they've been scaled down so no differences between google's algorithm and the others are visible. Whoops.
Enhance! by Yvan256 · 2016-11-19 08:42 · Score: 1

Obligatory Futurama
1. Re:Enhance! by Incadenza · 2016-11-19 09:04 · Score: 1
  
  obligatory Peter Jackson, skip to the 9:00 mark for the enhancement action.
2. Re:Enhance! by Anonymous Coward · 2016-11-19 09:33 · Score: 1
  
  red dwarf http://www.dailymotion.com/video/x2qlmuy
This is only meant to distract you... by Nova+Express · 2016-11-19 08:52 · Score: 2

...from the fact that Google is run by shape-shifting reptoids.
WAKE UP, SHEEPLE!!!!! /Cue obligatory XKCD

--
Lawrence Person (lawrencepersonh@gmailh.com (remove all "h"s to mail)
http://www.lawrenceperson.com/
Depends on enhancement by guruevi · 2016-11-19 08:55 · Score: 4, Informative

You can't really upscale resolution but you can "enhance" images (especially raw ones) to a point. A lot of shots may be over or underexposed with some details left in one or more of the channels but visually blocked out, having thousands of minuscule changes and filtering go through a human in the hope of seeing something would be nearly impossible and having a filter to weed them out is helpful.
JPEG and similar compression are like MP3 - you can filter out what the algorithm defines as outside of the human realm to perceive but a lot of those assumptions are faulty leading to noticeable artifacts. However it is very hard to recover the data lost in "lossy compression" although you can make some assumptions to recover them.
The other problem with using these filters is that they're called artificial intelligences. They are not intelligent and calling them that leads to an assumption of infallibility. They're a form of Bayesian filtering and we've been using that since at least the days of OS/2 to "enhance" images, I used a demo of a program back then that did just that: inferences on JPEG to make a type of vector image. We just use more powerful clock cycles and more storage to have them perform better but they're not and never will be magic.

--
Custom electronics and digital signage for your business: www.evcircuits.com
1. Re:Depends on enhancement by rl117 · 2016-11-19 21:16 · Score: 2
  
  Agreed. You can enhance an image correctly if that processing only makes use of information in the original image. For example, deconvolution, despeckling, contrast enhancement. These change the image, but the process is either neutral (no information loss) or lossy (some information loss). You can't *add* missing information to an image, because that implies making assumptions about the image which are likely to be incorrect for most cases. Validating such assumptions are correct is extremely difficult. In the case of the google filtering, this is fine if it's purely for aesthetic purposes, but definitely not if it's used for any serious purpose. In the domain I work in (scientific and medical imaging), this would be classed as fraudulent misrepresentation of data, and would get you fired. In fact, a member of my faculty was fired just last week for academic fraud after being discovered to have been misrepresenting their image data over their whole career--it's taken extremely seriously.
2. Re:Depends on enhancement by Solandri · 2016-11-20 05:31 · Score: 1
  
  You're assuming that image enhancement algorithms are "neutral" solely if they use information already in the photo and don't add missing information to an image. But the very act of choosing which algorithms to use to "enhance" an image is not neutral - it's biased towards enhancements which disproportionately fit our expectations for how the real world works.
  
  For a real example, look at the upscaled photos of the boy's face in TFA. The upscaling algorithms other than bicubic look for edges, and strive to keep them sharp after the upscaling because edges are very important to our visual system. So if the original photo wasn't actually of the boy, but of a billboard which had a blurry photo of the boy, then the bicubic upscaling would actually be accurate. These other upscaling algorithms would actually be making up information by exaggerating the edges (e.g. his eyelashes) even though that information wasn't present in the original. They'd be guessing that the weak lines (eyelashes) in the original photo were in fact very sharp but very thin lines, and upscaling as if they were because that's usually correct. i.e. a probabilistic assumption about how to upscale was encoded in the algorithm itself. In the case of the non-Google algorithms, it was encoded by the programmers of the algorithms. In Google's case, the algorithm just happened to be taught by machine learning. The end result isn't really that different - they all "add" information to the photo by using assumptions about what a higher-resolution original usually actually look like.
  
  It's also important to realize the human visual system isn't one where simply adding more detail produces a "better" image. We cue off of certain traits, and enhancing those traits disproportionately improves the subjective quality of an image, even when it's actually decreasing the objective quality. A good example is unsharp masking. It actually degrades image quality by distorting the image to exaggerate edges (darkens the dark half of an edge, brightens the bright half). But because our eyes have neurons which fire when they detect edges, it makes this worse-quality image appear sharper and better because the "enhanced" photo triggers those neurons more frequently or heavily. This is also the reason we keep seeing faces on Mars. Our brain has neurons which scream "that's a face!" at us whenever they see anything remotely face-like. An algorithm tailored towards those neurons would enhance face-like qualities in photos and make us see nonexistent faces, even though it didn't "add" anything to the photo.
  
  If all you have is a low-res photo, then that's all you have, period. If you want to upscale it 2x, bicubic is probably the only neutral way to do it. (Nearest neighbor introduces high frequency noise by exaggerating pixel boundaries. Pixels are represented on displays as squares because that's the way to maximize light transmission from the backlight. Theoretically, pixels are points, not squares.)
I'm intelligent, gears are dumb. Intelligent==fail by raymorris · 2016-11-19 09:17 · Score: 4, Insightful

> They are not intelligent and calling them that leads to an assumption of infallibility.
That's an interesting comment. I'd think the opposite. I'm intelligent, and often wrong. Gears are dumb, and always perform multiplication correctly, never giving the wrong result. To me, intelligence implies the ability to come up with different answers, some of which may be wrong. If it can't come up with unexpected answers, it's just a dumb machine, I'd think.
Not dishonest, probabilistic! by Gravis+Zero · 2016-11-19 09:18 · Score: 2

Enhancing an image for increased resolution isn't dishonest... unless you present it as the absolute truth. The reality is that it's a probabilistic view of the unenhanced version which is to say that it probably looks as presented in the image but there are other possibilities that could match that image. Honestly, I doubt it's worse than a human's memory of image because human's don't store information as PNGs and our recall is far from perfect.

--
Anons need not reply. Questions end with a question mark.
1. Re:Not dishonest, probabilistic! by fph+il+quozientatore · 2016-11-19 20:02 · Score: 1
  
  The main problem here is judges and jurors not having a clue. Which is related to them having zero scientific training.
  
  --
  My first program:
  Hell Segmentation fault
Who put the stick up his ass? by pavon · 2016-11-19 09:18 · Score: 4, Informative

All upscaling algorithms are making up data based on assumptions on what "typical" hi-res images should look like given their low-res counterparts. That doesn't mean they are lying or misrepresenting. Furthermore, some assumptions are most statistically valid than others, and some produce more aesthetically pleasing results than others, actually resulting in images that are genuinely more likely to be closer to the true image than nearest neighbor.
Nowhere in google's paper are they suggesting that these images be used for forensic purposes, nor claiming that they are finding "deeper truth" or additional information in the images than what actually exists. They developed an approach that produces better results for common classes of images than previous algorithms, which is useful for a large number of applications that don't require the same level of rigor that forensics do.
1. Re:Who put the stick up his ass? by Tablizer · 2016-11-19 12:46 · Score: 1
  
  All upscaling algorithms are making up data based on assumptions on what "typical" hi-res images should look like given their low-res counterparts. That doesn't mean they are lying or misrepresenting.
  It's essentially using AI and statistics to guess. While not "lying or misrepresenting", it should be considered just that: a guess.
  If anyone is convicted based on such AI guesses, they should be let out of jail.
  
  --
  Table-ized A.I.
Pretty much garbage for static images by undefinedreference · 2016-11-19 09:34 · Score: 1

You can't get something from nothing. That's a fact. Humans can fill in some gaps and AI could probably do the same, but there is no guarantee the results are correct.
On the other hand, if it could actually discern more from a video (which humans can also do, but probably not quite as well), it might be able to "enhance" individual images to some extent and have accurate results.
That people can be convicted by the results is a little scary, but at some level no different from a jury misinterpreting a low resolution image. Aside from the fact it was a single opinion that swayed that of the entire jury.
1. Re:Pretty much garbage for static images by Viol8 · 2016-11-19 09:41 · Score: 1
  
  "That's a fact. Humans can fill in some gaps and AI could probably do the same, but there is no guarantee the results are correct."
  True, but for something like straight lines or curves that may have missing sections, filling in the missing bits would probably give a reasonable fascimile to the original. But sure, at the end of the day, whatever they call it , its just educated guesswork by a program. In most cases though it won't matter so long as it *looks* sharper and more detailed, whether the fine detail is correct probably won't concern many people.
2. Re:Pretty much garbage for static images by yes-but-no · 2016-11-19 10:24 · Score: 4, Insightful
  
  You can't get something from nothing.
  2, 4, 8, x, 32, 64. Can you guess x?
  
  It's not from nothing.. image captures nature; nature runs under physics; n physics under mathematical laws. So it is reasonable to guess what a missing pixel-block will be based on other sets of observations of similar situations.
3. Re:Pretty much garbage for static images by Anonymous Coward · 2016-11-19 11:41 · Score: 3, Insightful
  
  There are an infinite different functions that follows the pattern that generates different results for x.
  The problem when using it for forensics is that you will put the person following the pattern you implemented in jail, not the one that actually is guilty.
4. Re:Pretty much garbage for static images by Imrik · 2016-11-19 13:13 · Score: 1
  
  How about:
  2, x, 8, y, 32, z. Can you guess x, y and z?
5. Re:Pretty much garbage for static images by fph+il+quozientatore · 2016-11-19 20:00 · Score: 1
  
  Have you seen this nice little problem? http://mathworld.wolfram.com/C... 1, 2, 4, 8, 16, x. Can you guess x?
  
  --
  My first program:
  Hell Segmentation fault
6. Re: Pretty much garbage for static images by breakermelvin · 2016-11-19 21:57 · Score: 1
  
  Myopic, but my brain constructs sharp edges on probably sharp edges... Telegraph poles, skylines .... Fails miserably on reading text on distant signs. Fails on recognising distant faces. Other clues help ... gait, them doing recognition gestures...
7. Re:Pretty much garbage for static images by thegarbz · 2016-11-19 22:39 · Score: 1
  
  I can guess x to within +/-1 of the number you're after.
  That is quite significant in terms of filling in the blanks.
Re:I'm intelligent, gears are dumb. Intelligent==f by CaptainDork · 2016-11-19 10:11 · Score: 2

This.
I've been in the business 49 years starting when the slide rule was the calculator of choice.
"Artificial Intelligence" (AI) started with a basic definition that always circled back to the human brain as a reference for "intelligence."
In later years, a more realistic description of AI required us to drop the human brain part, but many people failed to catch the move.
A machine will only be intelligent when it can commit suicide because Facebook is down.

--
It little behooves the best of us to comment on the rest of us.
Re:I'm intelligent, gears are dumb. Intelligent==f by msauve · 2016-11-19 11:37 · Score: 1

I had a friend who often pointed out that a common definition of "life" (from the first Google hit for "definition of life": growth, reproduction, functional activity, and continual change preceding death) only works if you exclude fire.

--
"National Security is the chief cause of national insecurity." - Celine's First Law
Re:I'm intelligent, gears are dumb. Intelligent==f by guruevi · 2016-11-19 11:38 · Score: 1

Perhaps infallible is the wrong word.
The problem to the lay person is that the 'AI' in contemporary media is portrayed as a sort of super-intelligence that is purely logical and thus superior to humans (and subsequently morally 'better' as well). It's easy to say by an attorney that a non-human, self-aware entity enhanced a perfect digital replica of the scene, it is therefore free of any human bias and thus a 'perfect' proof.
To go with your gears example, when people use gears all the time and they're always right, imagine you develop a complex gear system that can do something no human has ever done nor can feasibly verify and you call it an 'mechanical intelligence', people will assume it's always right given the prior simple gears have always been right.

--
Custom electronics and digital signage for your business: www.evcircuits.com
Subspaces and stuff by John+Allsup · 2016-11-19 12:08 · Score: 1

I had thought of the possibility of this years ago. The basic idea is that, if you downsample an image, and then upsample it again, information is lost in the low resolution version that must be reconstructed somehow. Essentially what you need is a means to make educated guesses as to the missing information. Traditional codecs are based on the maths that results when the codec is intended to reconstruct an arbitrary image. If we constrain the space of possible images, such as photos of the same person, the amount of information necessary to specify the image is less. Where deep learning comes in is finding convenient subsets of 'all possible images' such that, if we assume an image lies within a given subset of 'all possible images', we can make better guesses as to the missing information than if the image was totally arbitrary.

--
John_Chalisque
1. Re: Subspaces and stuff by fferreres · 2016-11-19 12:35 · Score: 1
  
  Best post explaining the actual value. Also, it should be possible to measure the performance if the gurssing algorithms by comparing the lost from test images vs their original high res versions.
  
  --
  unfinished: (adj.)
Hmm. by Shane_Optima · 2016-11-19 12:26 · Score: 1

I thought this was just something TV / moviemakers had been doing since the 90s to purposefully annoy geeks.

"Zoom in on D2."

"Enhance!"
Re:A REALLY REALLY bad sharpen pass by Nebulo · 2016-11-19 12:52 · Score: 1

What... what did I just read?
I wonder if this was a botpost or a human typing.
Re:Train the AI on porn by Tablizer · 2016-11-19 12:54 · Score: 1

Train the AI on porn
then sit back and enhance enhance ENHANCE ENHANCE
"Oh shit, surgery marks, they are FAKE, there goes my woody."

--
Table-ized A.I.
Hallucinations as evidence by dumky2 · 2016-11-19 13:51 · Score: 1

This should be easy for a defense attorney to invalidate. Hallucinated images (assembled largely from a corpus of previous images to "enhance" some evidence) are not the same as an image that is run through an abstract de-blurring algorithm.
It's probably easy to demonstrate the problem with some examples, so that judge and jury "gets it".

--
These comments are mine; I do not speak for my employer.
Re:In Court by gweihir · 2016-11-19 14:38 · Score: 1

You know, in a bidding police-state it is far more important to get convictions than to convict the person that actually did it. "Tools" like this (and as an engineer and scientist, I am offended by the very idea that has been implemented here) are a welcome way to make it appear that everything is in order.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Wrong title. by Thanatiel · 2016-11-19 14:49 · Score: 1

The AI is not dishonest, it has been designed to make-up stuff.
Its a bit like doing the fractal compression of an image, then restore to an higher resolution than the original. You will get a more detailed image, but it's content will have been made-up.
Fractal compression existed well before Google and no idiot used this feature as proof AFAIK.
I cannot believe anybody in his right mind would take any "make-up" algorithm as reliable evidence. One has to be pretty ignorant, or criminally insane, to use what is a (very nice) party-trick in a court of laws.

--
Irrelevant news and morons using moderation to mod down what they disagree on. 2018 resolution: so long.
As long as they use the proper command by Provocateur · 2016-11-19 16:34 · Score: 1

As NCIS episodes have demonstrated, the video analysts have to issue the command Enhance! for this thing not to lie

--
WARNING: Smartphones have side effects--most of them undocumented.
How did your DNA get in the house? Really? by Anonymous Coward · 2016-11-19 18:41 · Score: 1

How did your DNA get in the house? Really?
1) False match.
2) Carried in by animals, insects, etc.
3) On the sole of someone's shoe.
4) From dumpster-diving.
5) Planted, by cops or others.
I could go on all day.
1. Re:How did your DNA get in the house? Really? by slew · 2016-11-19 21:41 · Score: 1
  
  How did your DNA get in the house? Really?
  1) False match.
  2) Carried in by animals, insects, etc.
  3) On the sole of someone's shoe.
  4) From dumpster-diving.
  5) Planted, by cops or others.
  I could go on all day.
  How is that different than near-sighted and/or racist eye-witnesses, and jail-house snitches? Not really different. The only difference is tv show like CSI that "glorify" DNA evidence and vilify other forms of circumstantial evidence.
2. Re:How did your DNA get in the house? Really? by rtb61 · 2016-11-20 11:21 · Score: 1
  
  By far the easiest way to transfer in DNA evidence is by public transport. From your head to your coat, brush up against someone, now on their coat, they go home take off coat and hair drops on floor in bedroom. Now, something goes bad and your are done. Yeah, I go the numbers wrong one zero too many but even at 36,500 take public transport regularly and you hair will end up scattered throughout the city.
  
  --
  Chaos - everything, everywhere, everywhen
Bzzzt My algorithms say the black man did it by hoggoth · 2016-11-21 04:55 · Score: 1

Holy shit!
Remember when they found that bank loan "artificial intelligence" programs were discriminating based on the racial profile of your zip code? The program learned from the human examples they were given.
So it isn't impossible that algorithms that insert "likely" pixels into images would perhaps add minority colored pixels in an urban looking scene and white colored pixels in a suburban scene. You can't use image data that didn't come from the actual scene in court!!!!

--
- For the complete works of Shakespeare: cat /dev/random (may take some time)