Slashdot Mirror


Is Google's AI-Driven Image-Resizing Algorithm Dishonest? (thestack.com)

The Stack reports on Google's "new research into upscaling low-resolution images using machine learning to 'fill in' the missing details," arguing this is "a questionable stance...continuing to propagate the idea that images contain some kind of abstract 'DNA', and that there might be some reliable photographic equivalent of polymerase chain reaction which could find deeper truth in low-res images than either the money spent on the equipment or the age of the equipment will allow." An anonymous reader summarizes their report: Rapid and Accurate Image Super Resolution (RAISR) uses low and high resolution versions of photos in a standard image set to establish templated paths for upward scaling... This effectively uses historical logic, instead of pixel interpolation, to infer what the image would look like if it had been taken at a higher resolution.

It's notable that neither their initial paper nor the supplementary examples feature human faces. It could be argued that using AI-driven techniques to reconstruct images raises some questions about whether upscaled, machine-driven digital enhancements are a legal risk, compared to the far greater expense of upgrading low-res CCTV networks with the necessary resolution, bandwidth and storage to obtain good quality video evidence.

The article points out that "faith in the fidelity of these 'enhanced' images routinely convicts defendants."

8 of 79 comments (clear)

  1. Wait, what? by Rei · · Score: 5, Interesting

    People are using this sort of thing in court?

    I think these is a very interesting field for consumer needs, but I have to agree, that's disturbing if they're allowing what... let's face it... is data made up by an AI that "looks right", to convict people.

    --
    Wingus, Dingus! Listen up!
    1. Re:Wait, what? by knightghost · · Score: 4, Interesting

      Yes, they use it in court. I once watched a federal prosecutor use this and lie so blatantly to the court that his own (image) expert witness sued him for false representation. Yet the defendant was still convicted based almost entirely on that upscale image "evidence" and served several years in prison.

    2. Re:Wait, what? by Oligonicella · · Score: 3

      It's bad because it's allowing a piece of software to become a witness. One you cannot ask questions of unless you want to force the importing of the whole development team for each trial. Just having "working knowledge" of how the software functions is insufficient.

      And because "I extrapolated from other cases what the defendant should look like" wouldn't go over well if given by a human expert.

      "so it is possible" - Same extrapolation from scant information.

  2. Depends on enhancement by guruevi · · Score: 4, Informative

    You can't really upscale resolution but you can "enhance" images (especially raw ones) to a point. A lot of shots may be over or underexposed with some details left in one or more of the channels but visually blocked out, having thousands of minuscule changes and filtering go through a human in the hope of seeing something would be nearly impossible and having a filter to weed them out is helpful.

    JPEG and similar compression are like MP3 - you can filter out what the algorithm defines as outside of the human realm to perceive but a lot of those assumptions are faulty leading to noticeable artifacts. However it is very hard to recover the data lost in "lossy compression" although you can make some assumptions to recover them.

    The other problem with using these filters is that they're called artificial intelligences. They are not intelligent and calling them that leads to an assumption of infallibility. They're a form of Bayesian filtering and we've been using that since at least the days of OS/2 to "enhance" images, I used a demo of a program back then that did just that: inferences on JPEG to make a type of vector image. We just use more powerful clock cycles and more storage to have them perform better but they're not and never will be magic.

    --
    Custom electronics and digital signage for your business: www.evcircuits.com
  3. I'm intelligent, gears are dumb. Intelligent==fail by raymorris · · Score: 4, Insightful

    > They are not intelligent and calling them that leads to an assumption of infallibility.

    That's an interesting comment. I'd think the opposite. I'm intelligent, and often wrong. Gears are dumb, and always perform multiplication correctly, never giving the wrong result. To me, intelligence implies the ability to come up with different answers, some of which may be wrong. If it can't come up with unexpected answers, it's just a dumb machine, I'd think.

  4. Who put the stick up his ass? by pavon · · Score: 4, Informative

    All upscaling algorithms are making up data based on assumptions on what "typical" hi-res images should look like given their low-res counterparts. That doesn't mean they are lying or misrepresenting. Furthermore, some assumptions are most statistically valid than others, and some produce more aesthetically pleasing results than others, actually resulting in images that are genuinely more likely to be closer to the true image than nearest neighbor.

    Nowhere in google's paper are they suggesting that these images be used for forensic purposes, nor claiming that they are finding "deeper truth" or additional information in the images than what actually exists. They developed an approach that produces better results for common classes of images than previous algorithms, which is useful for a large number of applications that don't require the same level of rigor that forensics do.

  5. Re:Pretty much garbage for static images by yes-but-no · · Score: 4, Insightful

    You can't get something from nothing.

    2, 4, 8, x, 32, 64. Can you guess x?

    It's not from nothing.. image captures nature; nature runs under physics; n physics under mathematical laws. So it is reasonable to guess what a missing pixel-block will be based on other sets of observations of similar situations.

  6. Re:Pretty much garbage for static images by Anonymous Coward · · Score: 3, Insightful

    There are an infinite different functions that follows the pattern that generates different results for x.

    The problem when using it for forensics is that you will put the person following the pattern you implemented in jail, not the one that actually is guilty.