Detecting Faked Photographs Gets Easier
nusratt writes "Some years ago, an issue of 'Whole Earth' had a convincing cover-photo of a flying saucer cruising low over downtown San Francisco in broad daylight. The accompanying feature article proclaimed that photographs can no longer be trusted as evidence of anything, because of the ease of doctoring images digitally and undetectably. Now, Dartmouth Professor Hany Farid and graduate student Alin Popescu 'have developed a mathematical technique to tell the difference between a "real" image and one that's been fiddled with.' Farid says, 'as more authentication tools are developed it will become increasingly more difficult to create convincing digital forgeries'." There's also an NYT story.
While it may become increasingly difficult to forge digital images, and even forge hard currency, the result could be of two possibilities;
1. Forgers get smart and use older cameras to take a picture of a digital forgery to pass as an original, using blurring techniques offered by physical means and lens... etc (easy)
2. Forgers quit being forgers (unlikely)
3. Alteration technologists create armor against image forgery detection algorithms (possible)
For me, I think any time spent trying to beat the detection of forgeries would be a good thing in terms of art and creativity -- not to mention the possibility of better digital growth algorithms to join layers mathematically seamlessly (which could be used in games and simulation engines for better realism). However, law enforcement agencies might try to combat the circumvention of forgery detection by charging people with crimes for only trying to make their images more realistic and improve technology. It's a messy issue, that will sort itself out over time.
In Doom 3 Bloopers, a mod I've started on, I am looking at ways of integrating realworld imagery into the mod, and this detection stuff could actually help me to better integrate my own art and images if I can find a way around it. Let's face it, if the math says it's an original, the human eye will be fooled, which is the goal of most video game design. If anyone wants to help along those lines, they should contact me!
The dangers of knowledge trigger emotional distress in human beings.
I think authentication tools make it easier. As someone who's tried a little photo manipulation in the past, I can tell you that the hardest thing is knowing when something's right. If you have an automated tool that can tell you when it's right, it becomes easier. Of course, that relies on the tools working...
While an equation to make sure that a photo is not forged is all well and good, it is self deafeating. Just like simple encryption, which is good for people with simple problems, exploring the equation will yield a way to fool it. If you have someone who truely wants to forge a photo, they probably could still defeat the check.
Now they'll be able to prove that my photo of Rumsfeld shaking hands with Saddam in 1983 is a fake.
Rumsfeld was really shaking hands with an alien, and Saddam was shaking hands with Elvis, but the resulting merger of the two photos was much more provocative.
You are in error. No-one is screaming. Thank you for your cooperation.
Farid and his students have built a statistical model that captures the mathematical regularities inherent in natural images.
I wonder if they've considered the potential applications in image compression?
...but it was only accurate when the photo contained an image of Admiral Ackbar.
While this might not be a problem for gross manipulations (the faked John Kerry/Jane Fonda photo being a recent example) I can imagine a class of images where subtle manipulations caused great effects and were not readily distinguishable from compression artifacts.
cheap labor conservatives - they want to keep you hungry enough to be thankful for minimum wage.
I fail to see how an image can have a "random set of pixels" as the article suggests. With color and brightness balancing, the pasted areas of an image blend in with the rest of the image (especially when the artist uses Photoshop's clone brush to combine the two images). This sounds to me like the computer could very well throw out false positives for images that have extreme color or brightness differences.
Farid's algorithm looks for the evidence inevitably left behind after image tinkering. Statistical clues lurk in all digital images, and the ones that have been tampered with contain altered statistics.
That makes sense. However, it seems like these statistics would be based on very minor details. What happens if you run their analyzer on an image that has been altered by using lossy image compression, such as JPEG compression? Lossy image compression is designed to obliterate details humans wouldn't notice; some of these details might be significant to their statistical model. Would JPEG-compressing an image make it impossible to determine its veracity? Would their software just tag all JPEG images with compression below a certain threshold as being "unnatural" (since they have been, after all, digitally altered-- just not digitally altered in a content-relevant way...)
And don't some digital cameras use lossy formats such as JPEG as their native storage format?
The article says the research is founded by the Department of Homeland Security. That means that despite the many useful possible aspects of this technology, it will probably never see the light of day. If the DHS is going to be using this technology to identify faked photos, it would be greatly in their interest for the full algorithm and its implementations to not be made available to the public-- since after all if people know what the DHS is using to determine faked photos, they can target the algorithm.
Now that I have written that, looking around, it appears I am actually wrong. If you look at Mr. Farid's personal page, it appears he will be publically presenting a paper covering the fakeness-detection algorithm. I hope the full algorithm will be presented to the academic community.
People making fakes of UFOs and breasts on the Olsen twins will discover new techniques to overcome the detection. It will always continue in a spammer/spam-filter fashion. I'm sure that we'll soon see tools to automatically take the mathematical principles into account and automatically correct fakes that can be detected.
But hey, I'm pretty sure this one pic of the Olsen twins I got from Kazaa is real anyway.
put the what in the where?
So if it detects digitally altered images, why can't we just 1) Alter the image to our heart's content 2) Print it on a high quality printer ($500- $2K at CDW) 3) Scan it back in with a nice scanner
"The object of war is not to die for your country, but to make the other bastard die for his." - Patton
If you are looking for more detailed information, along with equasions, here is a link to one of their recent publications on the topic: http://www.cs.dartmouth.edu/~farid/publications/sp 04.html
NYTimes story link, which is actually more informative and interesting than Dartmouth's own story. In particular, the instant that I started to read the NYT story, I dope-slapped myself for not having thought of the reverse implication of the technology, namely that it might be used to prove that a contraband image (such as child-porn) is NOT faked (and therefore is genuinely illicit).
The easy answer to that lies in the original compression artifacts remaining - any new fragment/change will not keep these in a statistically similar fashion, and thats what my understanding of this software is.
Smudging a part of an image would remove these artifacts, and would be near impossible to reproduce - like the paper grain on a canvass oil painting.
liqbase
Run it on the goatse guy! If the results are positive I can finally start sleeping again.
There are stories about successful defense against digital photographs in criminal cases. "Enhancements" using photoshop can be considered evidence tampering. So this technique can have a life-altering implication for some people.
Anyone who wants to know more can read the actual papers here: http://www.cs.dartmouth.edu/~farid/publications/ ...papers 1,2,4, and 5 seem to be what this research is about, but others probably cover it too.
For Doctored Photos, a New Flavor of Digital Truth Serum By NOAH SHACHTMAN
Published: July 22, 2004
From the material found on his hard drive, Bryan Sparks of Springfield Township, Ohio, seemed guilty when he was arrested in 2002. The sexually explicit pictures of minors appeared to put him on the wrong side of child pornography laws. But at his trial this spring, Mr. Sparks was acquitted because no one could tell for sure whether the images were authentic or just clever digital forgeries.
Mr. Sparks was eventually convicted on a separate charge of rape and sentenced to life in prison. But the uncertainties that surrounded his case, and others like it, are driving researchers to develop software that can automatically figure out which digital pictures are real and which ones are fake.
"It used to be that you had a photograph, and that was the end of it - that was truth," said Hany Farid, an associate professor of computer science at Dartmouth College who is a leader in the field. "We're trying to bring some of that back. To put some measure of guarantee back in photography."
At stake is more than the fate of possible child pornographers. The United States military has become increasingly reliant on digital images from drones and satellites to give soldiers a sense of the battlefield. Law enforcement officers routinely use digital cameras to photograph crime scenes. Newspapers and magazines are now dependent on digital photographs that can be easily doctored.
Over the last three years, Professor Farid and his students have become experts at forgery, making hundreds of images that look authentic but have in fact been digitally tweaked. License plate numbers are changed. A single stool standing on a checkerboard floor is suddenly a pair of stools. Dents on a car are wiped away with a few mouse clicks.
The skillful tampering disturbed the images in ways that the human eye could not detect. But Professor Farid says his algorithms can spot them and sound the alarm.
For example, when two images are spliced together - like the picture of a shark attacking a helicopter that has circulated around the Internet in the past few years - one or both of the original pictures usually has to be shrunk, enlarged or rotated to make the pieces fit together. And those changes, no matter how artful, leave clues behind.
Take a picture that is 10 pixels by 10 pixels, for a total of 100. Stretch it to 10 by 20 pixels, and image-editing software like Adobe Photoshop will assign the picture's original pixels to every other slot in the new picture. That leaves 100 pixels "blank," or without values. Image-editing software fills in the gaps by examining what their neighbors look like, and then applying an average. To oversimplify, if pixel A is blue, and pixel C is red, the blank pixel B will become purple.
This kind of averaging becomes "pretty obvious" after some analysis of the image, Professor Farid said.
In tests on several hundred doctored photos, this technique for detecting changes proved to be virtually foolproof if the picture quality was high enough. Uncompressed TIFF image files, which contain enormous amounts of data, were like an open book to Professor Farid's team.
But Professor Farid said that for now the technique does not work as well with files created in JPEG, the compressed picture format most commonly used online. As the size of a JPEG file shrinks, the correlations between pixels become much less obvious. "At 90 percent quality, it falls apart very quickly," Professor Farid noted.
Jessica Fridrich, a research professor in electrical and computer engineering at the State University of New York at Binghamton, is approaching the fraud problem from the other side. She is trying to figure out who took the digital picture in the first place.
Just like the rifling in a gun barrel leaves a distinctive pattern on the bullets it fires, a digital camera has a signature of sorts. Today's digit
If you have to ask, you'll never know.
I wrote about this technology a while ago, in "True or False? Investigating Digital Images." A keypoint is that the Dartmouth College team thinks that their technology, or a similar one, will soon be incorporated in the U.S. legal system to authenticate images. At the above link to my blog, you'll also find an analysis of a forged image and more references, including the full research paper published by the IEEE Transactions on Signal Processing journal.
Farid and his students have built a statistical model that captures the mathematical regularities inherent in natural images. Because these statistics fundamentally change when images are altered, the model can be used to detect digital tampering.
That's pretty much all the detail on the method to detect image altering. Seems reasonable, but:
1) How many real photos deviate how far from the statistical "norm" (i.e., how likely are false positives when checking for alteration?)
2) How long before there are tools that can inject the proper (expected) statistical characteristics into a faked image?
These are not addressed in the article. Anyone have more info?
everything in moderation
perhaps the technique could be reverse-engineered to allow the forgers to know whether or not their images can be detected as forgeries, and use this information to enhance their forging techniques to evade the detection tools...
If there is an algorithm that can detect whether or not a photograph is forged, wouldn't it be possible to have another 'competing' algorithm that randomly altered bits and used the detection-algorithm as its success scale? (The better it becomes at not being detected, the better the genetic algorithm is doing.) Seems that that'd work fairly well, and would apply to lots of other technologies that require seamless photograph overlays.
So good point. That does seem to be a problem. The NY Times article has more details than the other; it is worth reading.
it's a trap!
Lasers Controlled Games!
... they want to be able to pass off their "intelligence" photos as the real mc coy easier, so as to not get busted when they release crap, like the phony fat osama bin laden video(wicked fake) and the (possibly) berg beheading. this technology would help them to establish bonafides with the offical forgeries. We are *this* close to a running man scenario here with them being able to frame people or to alter public opinion with phony video and pictures. If they can create one and have it slip through the checking algorithms, then they are home free.
On the other hand, it would be nice to go back and look at a lot of older photos to see what's what with them now.
Lately there's a smidgen of controversy over some of the mars photos, this would be a great place to use the technique.
One wonders whether this will lead to a legal distinction between lossily compressed images and others. While audiophiles have long been ape for lossless compression, not as much a need has been felt for graphics. Where do lossless graphic compression efforts stand? Is this an area where a proprietary standard might lead to big $$$?
I survived the Dick Cheney Presidency 7 to 9 AM 7-21-07
cheap labor conservatives - they want to keep you hungry enough to be thankful for minimum wage.
To me this strikes me as the same sort of "solution" as DRM is, sure it stops Joe Six-Pack from putting Britney Spears' head onto a porn stars body, but it will not stop anyone who knows what they are doing when it comes to digital image manipulation.
If you are one in a million, then there are six thousand people who are just like you.
sp04.html
ih04.html
sacv03.html
And, we have two new papers currently in review (abstracts are currently on-line, and preprints will be available soon):
sp05a.html
sp05b.html
Some of these techniques work, as some have pointed out, only on high-quality jpeg or uncompressed images, while others work on lower-quality images. We are only in the early stages of development, and are currently working to extend some of these ideas to low-quality jpeg and gif images (though this will likely be a harder problem given that the compression artifacts will overwhelm any statistical perturbation resulting from tampering). One outcome of this may be that a legal standard is set that enforces images brought into a court of law to be of a certain resolution and compression quality.
I will be the first to admit that each of the techniques that we have developed can be reverse-engineered, though doing so is more difficult for some techniques than others. It is our hope, however, that as we and others continue to develop more techniques it will become increasingly more difficult (though never impossible) to simultaneously foil each of the detection tools.
I strongly suspect that the solution will be some sort of hardware image signing, rather than after-the-fact examination. Canon already offers a Data Verification Kit for their superb EOS 1-DS digital SLR. They don't give too many details, but my guess is that they can attach a cryptographically signed hash of the image data into the file header so that it's possible to confirm the integrity of the data later. Since the EOS 1-Ds can only save data in raw and JPEG formats, and since this doesn't make sense for raw data (which has to be processed to be turned into a viewable image) it seems likely that they have this working with JPEGs.
There's no point in questioning authority if you aren't going to listen to the answers.
...or the guy faked his own photo for the article.
C'mon now. math geek... handsome... math geek... handsome...
Sorry, I'm not buyin' it.
Before you design for reuse, make sure to design it for use.
The New York Times article actually says something about the methods used. It basically works by looking at the kinds of operations that need to be performed when interpolating between pixels in manipulations such as changes in size, rotation, etc.
And, "Professor Farid said that for now the technique does not work as well with files created in JPEG, the compressed picture format most commonly used online. As the size of a JPEG file shrinks, the correlations between pixels become much less obvious. 'At 90 percent quality, it falls apart very quickly," Professor Farid noted.'"
"How to Do Nothing," kids activities, back in print!