Google Brain Creates Technology That Can Zoom In, Enhance Pixelated Images (softpedia.com)
Google Brain has created new software that can create detailed images from tiny, pixelated images. If you've ever tried zooming in on an image, you know that it generally becomes more blurry. You'd just get larger pixels and not a clear image. Google's new software effectively extracts details from a few source pixels to enhance pixelated images. Softpedia reports: For instance, Google Brain presented some 8x8 pixel images which it then turned into some pretty clear photos where you can actually tell facial features apart. What is this sorcery, you ask? Well, it's Google combining two neural networks. The first one, the conditioning network, works to map the 8x8 pixel source image against other high-resolution images. Basically, it downsizes other high-res images to the same 8x8 size and tries to make a match on the features. Then, the second network comes into play, called the prior network. This one uses an implementation of PixelCNN to add realistic, high-res details to that 8x8 source image. If the networks know that one particular pixel could be an eye, when you zoom in, you'll see the shape of an eye there. Or an eyebrow, or a mouth, for instance. The technology was put to the test and it was quite successful against humans. Human observers were shown a high-resolution celebrity face vs. the upscaled image resulted from Google Brain. Ten percent of the time, they were fooled. When it comes to the bedroom images used by Google for the testing, 28 percent of humans were fooled by the computed image.
Google can put together images based on smaller images that look like faces.
I don't care how fancy the algorithm is, the original data was lost. This is still just a guess about the original content. It's just a better guess than was possible before.
I just hope law enforcement doesn't think they can use this to solve any crimes.
One of our competitors trademarked the term "hypothesis". From now on, we will call them "boneheaded ideas".
Is this Blade Runner-esque? Decker summoned some wicked camera technology. Don't bother me with those pesky limits to the physical laws.
Feed it minecraft screenshots and japanese porn, and see what the result is.
Inheritance is the sincerest form of nepotism.
So in other words... from a small picture of the earth viewed from orbit, Google can now show me my house AND the address on the UPS package sitting at my doorstep?
Amazing!
If it doesn't do uncrop, it's lame.
I have discovered a truly marvelous proof of killer sig, which this margin is too narrow to contain.
now we know the perception and purpose,
this is perfect for your "real" profile pic on dating sites, just upload a google enhanced image of your self created from your 8x8 pixel image. yes this celebrity is really me.
TV Detective: "We have this security video showing the murder."
TV Lab Rat: "It's too grainy to tell how is it?"
TV Detective: "Can't you enhance it?"
TV Lab Rat: "Sure. Who do you want it to look like?"
A while ago, someone made the nnedi upsampler that uses neural networks to upsample. It's still one of the best image upsamplers available.
Google's approach is quite a bit different. Where nnedi worked to better extract detail out of what was already in the image, Google seems to literally fill in detail that was probably in the source but maybe not. Much, I guess, like how our own memories work. It's an interesting approach and the results look quite fantastic. My only question is how well it will work on a random sampling.
I see you still don't actually know what a Fourier transform is.
The summary's explanation of what this does isn't correct. It says:
Google's new software effectively extracts details from a few source pixels to enhance pixelated images.
It doesn't extract details from a few source pixels. It invents details to add to those source pixels, based on the knowledge that the pixelated image is of a face, and of what faces look like. It produces something that plausibly fits the input data. How close this is to the original image, pre-pixelation, depends on what images were in its training set.
This is an interesting piece of work, but it doesn't mean that you can recover data that has been discarded.
Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
Does this imply that movies have been lying to us all along? :-)
At least make the world a better place...
by making this a MAME scaler.
Cwm, fjord-bank glyphs vext quiz
https://arxiv.org/pdf/1101.007...
You are welcome on my lawn.
Anti-aliasing is not "providing data that is not actually there", anti-aliasing is removing data that is actually not there, more specifically high-frequency aliases of the low-frequency information. Hence the name anti-aliasing.
In practice, this means that the best any algorithm can do to improve a (general) blocky image, is smooth the edges of the blocks. What this algorithm apparently does, as far as I understand, is just correlate the blocky image to a library of known facial features. This has nothing to do with FFT or anti-aliasing whatsoever.
Apart from that, unless your name is "M", I am afraid
I know this because I am friends with both of the "B"s in BBM.
doesn't really mean anything.
A Fourier transform takes a signal and converts it into a bunch of sine waves that when combined will reproduce the original signal.
A signal is just varying amplitude over time. A sine wave is a signal where the amplitude is sin(t), where t is time. It turns out that all signals can be constructed by combining sine waves. The more sine waves you have the close to the original signal you can get.
Why it is useful to convert a signal to sine waves? Say you decide you are going to use 20,000 sine waves, the first one a 1Hz wave, the second one a 2Hz wave, 3Hz, 4Hz all the way up to 20,000Hz. You use a Fourier transform to convert the signal to those sine waves. Now you increase the amplitude of the low frequency sine waves, say 200Hz and below. Now do another Fourier transform to convert from sine waves back into a signal. Congratulations, you just pumped up the bass on your music.
In technical terms the signal is in the time domain, and the sine waves are in the frequency domain.
Disclaimer: This is a simplification, it's more complex than this but without getting into the heavy maths of it, this is basically what's going on.
const int one = 65536; (Silvermoon, Texture.cs)
SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC