USAF Wants To Find Steganographic Content

Posted by timothy on Saturday January 10, 2004 @09:29PM from the sir-yes-sir-we-must-examine-porn-sir dept.

Bud Higgins writes "The U.S. Air Force has posted a Small Business Technology Transfer Program (STTR) solicitation in which they seek proposals for the automated detection of steganographic content. They seek an application that should run both unobtrusively in the background and in a manual mode, and provide the user the capability to scan all email attachments, downloaded materials and accessed files with an appropriate steganalysis algorithm, reporting any abnormal results (i.e. the presence of steganography). I personally don't think that is feasible, but maybe a good programmer can prove me wrong. A link to the solicitation AF04-T008 can be found here. For those who are not familiar with the SBIR/STTR program, it provides up to $850k for 3 years of research." This sounds very similar to what Niels Provos did over a several-year period at University of Michigan's CITI and released under a free license. I hope the USAF doesn't spend too much of my money without considering extending that research.

13 of 267 comments

Hrm by Cave+Dweller · 2004-01-10 21:38 · Score: 5, Insightful

Those of you paranoid enough will probably chime in with something along the lines of "Yeah, but Echelon probably has something like this built-in already!". Anyway, isn't the point of steganography to hide information in such a way that you *cannot reliably* tell whether the information was there in the first place?

I'm not sure what they're looking for here; perhaps a better steganography algorithm?
1. Re:Hrm by johannesg · 2004-01-10 23:26 · Score: 4, Insightful
  
  They might be looking for an algorithm that establishes just how random the "random bits" of a file are. For example, you would expect the least significant bits in a jpeg to be more or less random - any degree of organisation there could be a hidden text or something else.
  I would expect such an algorithm to have specific knowledge of various file formats, since randomness in a jpeg is not quite the same as randomness in, for example, an .EXE file.
  I would further expect that my approach would be soundly defeated by first encrypting the information to be hidden, since encrypted data looks a lot more random than normal data anyway.
  Personally I doubt it can be done. You might be able to defeat specific steganographic algorithms, but the general case cannot be solved. It would be a bit like having a universal decryption algorithm...
2. Re:Hrm by Ugmo · 2004-01-11 02:56 · Score: 3, Insightful
  
  I would further expect that my approach would be soundly defeated by first encrypting the information to be hidden, since encrypted data looks a lot more random than normal data anyway.
  
  It would still be somewhat valuable to know that encrypted messages were being sent even if you do not know what the content is. If you know bad guy #1 is posting some steg-encoded pictures on his porn site and bad guy #2 visits it on a regular basis (along with thousands of other non-bad guys), you could at least get a clue that something is up if bad guy #1 changes the frequency or number of his updates. In short, traffic analysis.
  
  If you cannot detect any kind of steg whatsoever, you can't even get this info.
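The check johannesg describes above, measuring how random the "random bits" really are, can be sketched roughly as follows. This is only an illustration under stated assumptions, not any real steganalysis tool: the cover is a list of made-up 8-bit samples, and `embed_lsb` and `max_bit_position_bias` are hypothetical helper names. It also shows his caveat that a payload which already looks random (a stand-in for ciphertext here) slips past such a simple test.

```python
import random

def embed_lsb(samples, payload):
    """Overwrite the least significant bit of each sample with payload bits, MSB first."""
    bits = [(byte >> (7 - i)) & 1 for byte in payload for i in range(8)]
    if len(bits) > len(samples):
        raise ValueError("payload does not fit in the cover")
    out = list(samples)
    for i, bit in enumerate(bits):
        out[i] = (out[i] & ~1) | bit
    return out

def max_bit_position_bias(samples):
    """Group the LSBs into 8-bit chunks and return the largest deviation
    from 0.5 in the frequency of 1-bits at any of the 8 bit positions."""
    lsbs = [s & 1 for s in samples]
    n_bytes = len(lsbs) // 8
    worst = 0.0
    for pos in range(8):
        ones = sum(lsbs[8 * k + pos] for k in range(n_bytes))
        worst = max(worst, abs(ones / n_bytes - 0.5))
    return worst

rng = random.Random(0)  # fixed seed so the demo is repeatable
cover = [rng.randrange(256) for _ in range(8000)]  # hypothetical cover samples
plain = b"attack at dawn. " * 62                   # 992 bytes of ASCII text
scrambled = bytes(rng.randrange(256) for _ in range(len(plain)))  # stand-in for ciphertext

for name, data in [("cover", cover),
                   ("plaintext payload", embed_lsb(cover, plain)),
                   ("scrambled payload", embed_lsb(cover, scrambled))]:
    print(name, round(max_bit_position_bias(data), 3))
```

ASCII text never sets the top bit of a byte, so the plaintext payload shows a bias close to 0.5 at that bit position, while the untouched cover and the scrambled payload both stay near zero. A real detector would need format-specific statistics, as the comment says.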
stego wrapped pgp by Macgyver7017 · 2004-01-10 21:49 · Score: 3, Insightful

Maybe statistical analysis can determine if a given image or other medium is possibly hiding information. But if that information is encrypted, doesn't it look like random data without the key? Without knowing the key or even the cipher used to encrypt it... how can it be shown to actually be information? "That's just random noise/corruption in my images, Your Honor... I don't know what you're talking about"
Re:Feasible? by RomulusNR · 2004-01-10 21:51 · Score: 5, Insightful

Uh, sure, the "this is supposed to be random noise" trick will work about as long as the average spam-filter-avoidance trick lasts.

"The enemy is sending out an abnormally large amount of random noise data. Must just be having microphone trouble. Nothing to see here."

Roger that.

No +1, cause I've been drinking...

--
Terrorists can attack freedom, but only Congress can destroy it.
Re:Oh yeah? by Soko · 2004-01-10 21:58 · Score: 5, Insightful

Take off the tinfoil hat, dude. Checking all pics on the net for steganographic info is virtually impossible - just too much info to sort through in a reasonable time frame.

They likely want this to scan documents leaving their internal network in an attempt to catch people who are sending out sensitive or secret info. To me this looks like the USAF is plugging a leak, not going on the hunt.

Soko

--
"Depression is merely anger without enthusiasm." - Anonymous
Interesting by arvindn · 2004-01-10 22:03 · Score: 4, Insightful

Looks like detection of steganographic content might be a significantly easier problem than decoding it. The reason is that normal compressed images don't have redundancy -- i.e., the image file size is no larger than it needs to be for the quality (information content) that it has. But embedding a message introduces redundancy, by an amount proportional to the capacity of the stego system. This can be detected; the programmer only needs to have a good grasp of the image format, domain transformation techniques, etc.
But I had this little idea. Suppose we "pollute" normal images with random data with, say, 1% redundancy. What I mean is, whenever you create an image you take some random data and steganographically embed it in the image. Write a gimp plugin or something so that the process is transparent and automatic. Your file only becomes 1% bigger, so it's no big deal. Not everyone needs to do this, just sufficiently many people so that the vast majority of the positives of stego detection systems are going to be false positives. As long as the message is encrypted before embedding, it is provably impossible to tell a genuine stego image from a false positive, assuming that the underlying encryption isn't broken. So you get a secure stegosystem with 1% efficiency "for free".
[dons tinfoil hat]
We'd all better soon start doing something like this, given where governments are going.
/me runs off to patent office
1. Re:Interesting by Anonymous Coward · 2004-01-11 00:18 · Score: 3, Insightful
  
  Actually, if the plugin uses a good enough random source then it's not possible to distinguish the results from good steganography. That's kind of the point. The problem that the original poster is trying to solve is that good steganography is too good at looking like completely random data, and there's not that much completely random data when real-world codecs and image formats are involved...
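The arithmetic behind arvindn's "vast majority of the positives are going to be false positives" can be sketched as below. The rates are invented purely for illustration, the function name is hypothetical, and the sketch assumes an idealised detector that flags every image with embedded data and nothing else.

```python
def genuine_share_of_hits(genuine_rate, decoy_rate):
    """Share of flagged images that really carry a message, assuming the
    detector flags every image with embedded data (real or decoy) and no others."""
    flagged = genuine_rate + decoy_rate
    if flagged == 0:
        raise ValueError("no flagged images")
    return genuine_rate / flagged

# Hypothetical numbers: 1 image in a million carries a real message,
# while 1 image in 100 was padded with random data by the plugin.
share = genuine_share_of_hits(genuine_rate=1e-6, decoy_rate=1e-2)
print(f"{share:.6f}")
```

With these made-up rates, only about one flagged image in ten thousand is genuine, which is the point of the decoy scheme: the decoys do not hide the hits, they bury them.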
Perfect Programming is not needed for it to work by leoaugust · 2004-01-10 22:06 · Score: 4, Insightful

I personally don't think that is feasible, but maybe a good programmer can prove me wrong.

The "solution" can be implemented with the current laws and regulations, and I think the programmer is only a small part of making this system work. A lot of enforcement authorities have to come together, and the current evidence suggests that they will. Of course, by the time they figure this out, people will have learned to hide data in other creative ways - the eternal cat-and-rat game ...
Consider this
the automated detection of steganographic content.

If Adobe (and others) could be forced to include in their code methods to detect currencies (see the Slashdot story "Photoshop CS Adds Banknote Image Detection, Blocking?") and not disclose it until they were caught by some vigilant users, what makes us so smug that other major companies with "closed" software are not already in-bed-with-the-feds? So, it is conceivable that the automatic detection may be going on and we wouldn't be any the wiser.
They seek an application that should run both unobtrusively in the background and in a manual mode,

See the Adobe example of how such "spyware" can be forced to run "unobtrusively."
and provide the user the capability to scan all email attachments, downloaded materials and accessed files with an appropriate steganalysis algorithm,

Major email providers like Yahoo and Hotmail already provide automatic scanning for viruses, AOL is including automatic scanning for spyware, MicroTrend (?) already has Online Virus Scanning of your Hard Drive (!), and so under the threat of the Patriot Act (and its ilk) many of these companies can be forced to scan everything that goes in and out of their systems.
reporting any abnormal results (i.e. the presence of steganography).

This is the key. Now the threshold for "abnormal" has been reduced so much (almanac carriers as potential terrorists, CAPPS passenger detection based on names and 15 flights were cancelled last month based on this, anti-war protestors as possible terrorists and hence being tailed by the Feds etc.) that the problem of false alarms no longer dogs the current administration and law enforcement agencies.
This is the crux. When the error threshold is reduced so much that the high rates of error are no longer problematic, then any solution (whether efficient or not) can be implemented. Who cares whether it works well or not? Until now, false alarms were what stopped such 1984-ish scenarios from unfolding. Once you accept high error rates, and accept even high collateral damage as the price of doing "business," you can have a solution to almost anything implemented - whether it deserves to be implemented or not is a whole different issue. But who cares? You got nothing to hide - Right?

--
To see a world in a grain of sand, and then to step back and see the beach where the sand lies ...
steganography vs. compression by graf0z · 2004-01-10 22:28 · Score: 4, Insightful

The basic problem with steganography is that it hides content in noise but compression reduces noise.
It is easy to 'steganohide' content in uncompressed noisy files like tiff or wav. But that content gets destroyed by the lossy compression that most multimedia formats use (jpeg, mpeg, divx, mp3, ...). If it survives, it's called a watermark, but (un)fortunately nobody has yet found a watermark algorithm that is robust against lossy codecs and added noise.
So you have to steganohide your content after compressing. But compressed files have much less noise, and that noise is not random noise but has statistical quirks. If you just hide your content as white noise and add it to the file - that's detectable, because it changes the statistical behaviour of the file!
Instead you have to write a specific steganographic algorithm for each lossy compression format you want to hide content in! It has to respect the 'format noise character'. That's what Niels Provos did for pnm and jpeg with OutGuess.
/graf0z.
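graf0z's first point, that naively hidden content does not survive lossy compression, can be illustrated with a toy sketch. The assumptions are loud ones: `crude_lossy` is a hypothetical stand-in that merely snaps samples to multiples of 4, nothing like a real jpeg or mp3 codec, and the cover is an arbitrary list of 8-bit values.

```python
def embed_bits(samples, bits):
    """Write one payload bit into the least significant bit of each leading sample."""
    if len(bits) > len(samples):
        raise ValueError("payload does not fit in the cover")
    stego = list(samples)
    for i, bit in enumerate(bits):
        stego[i] = (stego[i] & ~1) | bit
    return stego

def extract_bits(samples, count):
    """Read back the least significant bit of the first `count` samples."""
    return [s & 1 for s in samples[:count]]

def crude_lossy(samples, step=4):
    """Hypothetical stand-in for a lossy codec: snap each sample to a multiple of step."""
    return [(s // step) * step for s in samples]

cover = [(37 * i + 11) % 256 for i in range(64)]  # arbitrary 8-bit "samples"
secret = [1, 0, 1, 1, 0, 0, 1, 0] * 8             # 64 payload bits
stego = embed_bits(cover, secret)

print(extract_bits(stego, 64) == secret)               # after a lossless copy
print(extract_bits(crude_lossy(stego), 64) == secret)  # after the lossy step
```

The first check holds and the second fails: once every sample has been snapped to a multiple of 4, all the least significant bits are zero and the payload is gone. That is why the hiding has to happen after compression, in a format-aware way.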
Re:I don't think this can possibly work. by JKR · 2004-01-11 00:58 · Score: 3, Insightful

The problem is that emailing streams of random data around looks pretty suspicious. You want to hide random-looking data in a NON-random stream (that has a legitimate purpose, e.g. an image file). THAT's why you can detect it.
Even random data has to fit in. For example, it used to be the case that the A/D stage of some cheap sound cards was so noisy that the recording from line-in gave you a 16-bit audio sample stream with the bottom 4 bits effectively random (like dithering but much, much worse). However, the noise (while random in nature) was shaped in a particular way, so if you just hid your encrypted secrets in those 4 bits it would be obvious that the "noise" wasn't appropriate.
Jon.
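One way to picture JKR's point about shaped noise is to compare a simple statistic of the "natural" noise with the same statistic for an encrypted payload. The sketch below is a guess at the flavour of the argument, not a model of any real sound card: the shaped noise is just a two-sample moving average of random 4-bit values, the payload is uniform random nibbles standing in for ciphertext, and `lag1_autocorr` is a hypothetical helper.

```python
import random

def lag1_autocorr(xs):
    """Correlation between each value and its successor (lag-1 autocorrelation)."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs)
    if var == 0:
        return 0.0
    cov = sum((xs[i] - mean) * (xs[i + 1] - mean) for i in range(n - 1))
    return cov / var

rng = random.Random(1)  # fixed seed so the demo is repeatable
n = 5000
raw = [rng.randrange(16) for _ in range(n + 1)]
shaped = [(raw[i] + raw[i + 1]) // 2 for i in range(n)]  # crude "shaped" 4-bit noise
payload = [rng.randrange(16) for _ in range(n)]          # stand-in for encrypted nibbles

print(round(lag1_autocorr(shaped), 2), round(lag1_autocorr(payload), 2))
```

Neighbouring values of the shaped noise are clearly correlated, while the payload's are not. Dropping the payload into those 4 bits would change a statistic that an observer can measure without knowing any key.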
Re:Oh yeah? by SlashdotLemming · 2004-01-11 01:25 · Score: 4, Insightful

They likely want this to scan documents leaving their internal network in an attempt to catch people who are sending out sensitive or secret info. To me this looks like the USAF is plugging a leak, not going on the hunt.

That's exactly one of the reasons for the technology. The DoD has an obligation to protect sensitive information. There are a crazy number of hoops that need to be gone through to get unclassified info off of a classified system. They can't have people encoding stuff in pictures of Barney then walking away with it.

I know the usual paranoids are up in arms about the AF doing this, but the same people would flood "The DoD is so stupid" if it were found out that people were abusing the technology to transport classified info.
Establishing innocence on false positives--how? by dpbsmith · 2004-01-11 03:40 · Score: 3, Insightful

In these days when the FBI thinks possession of an almanac makes you suspicious...what happens to you if some half-baked experimental steganography-detection program looks at billions of .jpgs, gets to an image you've included in an eBay auction description, and detects some not-quite-decodable signal just above the noise that it interprets as "there's definitely something hidden in that image, even though we can't tell what"?

How do you prove that you're innocent?

How do you prove that your image does NOT contain steganography?

Worse yet, suppose you are using steganography--say, a watermark to prevent people from stealing your image. Will the FBI believe what you tell them is the decoded content?

I mean, a few decades ago some nutcase analyzed Shakespeare's First Folio and decided that it was printed in a mixture of two slightly different fonts that constituted a binary code with a message proving that it had been written by Sir Francis Bacon. (No kidding). That proves that it's easy for someone who's looking for steganography to find it, whether it's there or not.

--
"How to Do Nothing," kids activities, back in print!