Program Hides Secret Messages in Executables

Re:stenography by gunne · 2003-03-02 00:17 · Score: 2, Informative

Steganography would be more precisely defined as "information hiding". It doesn't require that it is impossible to find the data hidden, but it tries to conceal the existence of the data.
Cryptography on the other hand does not try to try to hide the existence of information, it just tries to hide what message is embedded in that information.
Cryptography != Steganography, but they can be used in conjunction.

For those who encounter compilation problems... by Anonymous Coward · 2003-03-02 00:17 · Score: 4, Informative

Add -ldl to the LDFLAGS in the Makefile.

First used in a86.com by Ninja+Programmer · 2003-03-02 00:17 · Score: 4, Informative

This is a well known technique that was used in the mid-80s by Eric Isaacson in his product "a86". See here: http://eji.com/a86/

Eric Isaacson used the technique to mark executables, so that he could determine if they were created with an unregistered copy of a86.

Re:Redundancy? by brejc8 · 2003-03-02 00:18 · Score: 4, Informative

Some instructions have dont care bits in them.
You could remove these bits in order to compress the file but they occur so rarely its not worth it.
And yes the redundency would have been used up.

--
Mouse powered Chips, Open source Processors and Lego

The meaning by Anonymous Coward · 2003-03-02 00:19 · Score: 3, Informative

It just means that you can encode certain stuff in equivalent ways (*). Like: mov eax, 0 xor eax, eax sub eax, eax are all equivalent in functionality to zero the eax register.

* = Taking into account flags and instruction size restrictions, etc.

The "redundancy" comes from these facts. So, it's not size redundancy as such, and you can't remove the redundancy. It's more like permutations of the instructions are equivalent (length stays the same).

Re:Redundancy? by sql*kitten · 2003-03-02 00:19 · Score: 4, Informative

Can someone explain to me exactly what this means? Will all i386 executable binaries have unnecessary redundancy? Could the size of the binary be harmlessly reduced by removing it? If so, then why isn't this done?

It means that if you want to add 50 to a number, you can choose to do (+50) or (-(-50)). They both take up the same amount of space and do the same thing. But if you first process a program to ensure that all additions and subtractions are actually additions, then you can encode data into the list of additions by making some of them into subtractions.

Re:stenography by JohnFluxx · 2003-03-02 00:21 · Score: 4, Informative

er...

Steganography requires that it is impossible to prove that data is being hidden there. (Without reference to other material, etc etc).

From The Free On-line Dictionary of Computing (09 FEB 02):

steganography

Hiding a secret message within a larger one in such a way that others can not discern the presence or contents of the hidden message. For example, a message might be hidden within an image by changing the least significant bits to be the message bits.

Re:You might have gotten hoaxed. by Ninja+Programmer · 2003-03-02 00:23 · Score: 2, Informative

This is technically impossible, for two reasons.

Yeah, I know another unchecked perpetual motion machine story from timothy. But no, in this case, the story is not wrong, its just 15 years old (the technique was used 15 year ago, I mean.)

The key point is to exploit x86 instruction set redundancy to find a few bits of entropy here and there strewn throughout the executable code. RISC instructions have the same potential. For example:

add r0, r1, r2
add r0, r2, r1 # not much different

Re:Redundancy? by BenV666 · 2003-03-02 00:29 · Score: 5, Informative

Can someone explain to me exactly what this means?

It means exactly what it says, there is more than 1 road that leads to Rome.... combining instructions in different ways leads to the same results.

Will all i386 executable binaries have unnecessary redundancy?

Almost everything can be done in several ways. Consider these 2 pieces of asm:

XOR DX,DX MOV AX,3 MOV BX,4 MUL BX

verses

MOV BX,4 MOV AX,3 XOR DX,DX MUL BX

Same results, same size, different order :)

Could the size of the binary be harmlessly reduced by removing it? If so, then why isn't this done?

Often the binary can't get much smaller without having effect on efficiency of the code, as far as I trust compilers that is :) (ASM rules!!! :)) I.e.

MOV AX,A000 MOV ES,AX

verses

PUSH A000 POP ES

Same effect while the latter saves 1 byte in code.

Re:You might have gotten hoaxed. by ethnocidal · 2003-03-02 00:30 · Score: 3, Informative

You are both correct and incorrect. While it's obviously not possible to simply go through changing instructions, operators and operands without consideration as to the effect on the program, it is possible to leverage functionally identical instructions to represent a bit.

If you read the article, a trivial example would be subtracting -5, rather than adding +5. The presence of a subtraction operation, rather than an addition operation can signify a binary digit.

Unfortunately, due to the consistent output from compilers, this is not steganography - you can both tell that the executable has been altered, and read the message! His plans for the future (parameter organisation, etc.) may be more relevant, but at the moment this is a proof of concept implementation, not a usable system.

Anyone interested in other forms of steganography could do worse than to read Andrew Tanenbaum's page on the subject.

Re:Redundancy? by Ninja+Programmer · 2003-03-02 00:30 · Score: 2, Informative

It means that if you want to add 50 to a number, you can choose to do (+50) or (-(-50)).

Actually on the x86, those two are not equivalent. They set the carry flag in opposite directions.

Re:Redundancy? by etcpasswd · 2003-03-02 00:30 · Score: 5, Informative

From my understanding, it appears that he chooses a complentary pair of instructions: addition-subtraction. Then you designate "1" to addition instruction, and "0" to subtraction. So, if you look at only these instructions, your executable can contain a binary string (addition and subtraction instructions).

Now what the author does is, alter the original binary string to that bit-string data of our interest (of the same length). This process requires flipping of instructions. For example, if some instruction is addition (1), but your data requires it to be (0) bit, you change the instruction to subtraction, and change the operand to a negative of the original value. Same applies to flipping a '0' to '1'.

Addition-subtraction works because there are no overflow issues (atleast with signed ints). Since this is also a very common operation, your executable is likely to be large enough to "hold" sizeable data.

Re:Virus by KDan · 2003-03-02 00:34 · Score: 5, Informative

Never. The information, though contained in an executable file, is not itself executable (unless you went and took that information out and then executed it separately. The whole point is it does not affect the execution of the program that you hide the information into. So you can put whatever information you want in there (even the code for a virus) and it will still not be a virus, because that information will never get executed unless you actively go in there, extract it, paste it as an executable file somewhere else (eg in memory) and then execute it - so you'd need another virus to do this, basically.

Daniel

--
Carpe Diem

Hiding messages within messages by Bender+Unit+22 · 2003-03-02 00:36 · Score: 4, Informative

Hiding messages within messages are used often in many contexts, like the radio broadcasts in WW2 sending "birthday greetings" among other things

Re:You might have gotten hoaxed. by peope · 2003-03-02 00:36 · Score: 3, Informative

This is no hoax

I has the same properties as:
a*b gives the same result as b*a.

You have options on what instructions to use which yields the same results.

Lets say a*b is a 1 and b*a is a 0. You could describe a byte with eight occurancies of the (a*b || b*a) operation.

a*b b*a b*a a*b a*b a*b b*a a*b == 10011101

A common practice with x86 is to use XOR AX, AX instead of MOV AX, 0 to clear the AX register.

However, this is not interchangeable since they do not have the same size. The XOR method is often used because it is faster and have less size IIRC.

Re:You might have gotten hoaxed. by ZigMonty · 2003-03-02 00:36 · Score: 4, Informative

This is technically impossible, for two reasons.

Did you read the article?

First, executables are called executables because the computer interprets them. They are made of instructions, and unlike a document you cannot simply tamper with things because it will confuse the computer when it tries to run the executable.

Of course you can tamper with executables! As long as your modified version does the same thing, there is no harm done. If you change the addition of a positive number to the subtraction of a negative number, you get the same result if you run it. You run through the binary and if the current bit of data to be hidden is a 0, you don't modify that particular addition instruction and if the data bit is 1 then you *do* modify it. If you compare the modified binary to an original, you can see all the changes and extract the hidden data.

Second, and most importantly, the size of the file is dependent on the size of the bytes within the file. Because the bytes in the file have differing values depending on the instructions they encode, altering the data will alter the size unless you're borrowing from one byte to inflate another -- and in this case, again, you run afoul of the first problem.

This makes no sense to me. The replacement instruction is the same size as the original.

I'm surprised the editors didn't review this before approving it for posting. This is really pretty elementary to anyone who understands object code.

I don't doubt that you understand object code but you don't seem to understand this technique.

Re:Redundancy? by erc · 2003-03-02 00:38 · Score: 2, Informative

PUSH/POP is significantly slower than two MOV instructions on an x86, though...

--
-- Ed Carp, N7EKG erc@pobox.com PGP KeyID: 0x0BD32C9B What I'm up to: http://intuitives.mine.nu

Speaking of which, Ogg Vorbis for Nero by Anonymous Coward · 2003-03-02 00:56 · Score: 1, Informative

The first vorbis plugin for Nero is out.

One less thing for the mp3-lUsers to complain abou

Re:You might have gotten hoaxed. by chrisseaton · 2003-03-02 01:20 · Score: 2, Informative

You're confusing "byte" and "char". "char" is related to character sets, "byte" has nothing to do with them. Just because you're using 16 bit unicode does not change the size of the byte, it simply means that your "char" is two "bytes" (if your bytes are 8 bits). Why would a unicode system half the resolution of memory just because of the character set used? You could have a byte of 8 bits, a character of two bytes, or a byte of 128 bits and a character of 256 bits. No connection between the two.

Re:stenography by jaavaaguru · 2003-03-02 01:46 · Score: 2, Informative

Stenography is the art of writing in Shorthand. :-)

--
Follow me

Re:stenography by Hank+the+Lion · 2003-03-02 04:35 · Score: 3, Informative

It should be easy enough to get around this. The statistical telltale is only due to the fact that El-Khalil consistently uses the same type of instruction to encode a certain bit value. Have Hydan XOR the hidden message with a secret key that produces the right distribution of ones and zeros prior to encoding the message and the problem disappears.

I'm afraid this will not work.
Problem is: 'normal' programs will do 'sub 50' instead of 'add -50'. If you don't want to be visible that a message is contained, you cannot change that. But if you don't change that (in about 50% of the cases), you can't hide the information! The only key that would work here would be as long as the message itself!

The technique you propose will work to get a more even distribution of ones and zeros, but not the 'all zeros' (sub 50) distribution that is present in 'standard' programs.

Re:Redundancy? by SomeGuyFromCA · 2003-03-02 04:59 · Score: 2, Informative

It exploits redundancy in the i386 instruction set by defining sets of functionally equivalent instructions.

Can someone explain to me exactly what this means? Will all i386 executable binaries have unnecessary redundancy? Could the size of the binary be harmlessly reduced by removing it? If so, then why isn't this done?

You're confusing redundancy in the program (extra instructions executed) with redundancy in the instruction set (extra instructions available).

The i386 set has add and subtract instructions where only one is strictly needed. From what I've read, this tool works by changing a sub 50 to an add -50, taking advantage of this. (Or a add 30 to sub -30.)

The problem is, no person or complier would write code this way unless they had a particular reason to. Such as hiding something.

--
if the answer isn't violence, neither is your silence / freedom of expression doesn't make it alright

UTF-16 in the Java language by yerricde · 2003-03-02 05:07 · Score: 2, Informative

There is no "byte" data type in C

There are distinct "byte" and "char" data types in the Java programming language. The "byte" is 8-bit as expected in PC-type and RISC architectures, but because the Java programming language's native character encoding is UTF-16 Unicode, "char" is 16-bit.

--
Will I retire or break 10K?

Re:stenography by amRadioHed · 2003-03-02 06:59 · Score: 3, Informative

All stenographic methods that I've heard of leave some signs of tampering. For instance, the common method of hiding information in an image file by fiddling with the least signifigant bits in the RPG values is completely undetectable to the eye, however a statistical analysis of those low bits will reveal an unnatural amount of randomness. Really this is unavoidable since most any innocent looking data is going to have some natural order to it.

--
We hope your rules and wisdom choke you / Now we are one in everlasting peace

Re:Messages can be found in games too by willpost · 2003-03-02 08:00 · Score: 2, Informative

Unfortunately, David Pridie, aka. Martial Artist, the programmer that wrote the first message, "passed away very suddenly on the morning of Friday, January 12, 2001. He died in his home, listening to music and playing a computer game. An attack of bronchial asthma was established as the cause, something which he had complained of the past week or so before. "

"At the time he got himself and H2O in quite a bit of hot water with Nintendo. He figured it was his small piece of immortality"
He was right

http://www.pridie.org

Re:Yes, it can be done... by peter · 2003-03-02 10:12 · Score: 2, Informative

Hydan doesn't give you any deniability, does it? I just read the artice; I haven't tried the program, but if you use a well-known method of embedding info, it's not very steganographic anymore. The bad guys can just run hydan on your executables and see what comes out.

If you want deniability even in the face of torture, you want rubber hose crypto. You might also want to use an authentication method more complicated than a password, so they'll have to torture you in the computer room instead of the dungeon, and they can't break your fingers or damage your higher brain functions.

--
#define X(x,y) x##y
Peter Cordes ; e-mail: X(peter@cordes , .ca)

Re:The problem is if you have two copies by peter · 2003-03-02 10:39 · Score: 2, Informative

Not in this day and age, because everyone uses strong hashes. I suppose the error-detection code that they preserved was CRC-32, or an checksum (add up all the bytes). There is no known way to efficiently figure out how to change a file without changing its MD5 or SHA1 hash. Any cryptographically strong hash will make undetectable modification computationally infeasible.

--
#define X(x,y) x##y
Peter Cordes ; e-mail: X(peter@cordes , .ca)

Re:stenography by Anonymous Coward · 2003-03-02 12:40 · Score: 1, Informative

OK, but geeks forget that possible/impossible isn't a binary state, like 1 and 0. It's a about likelihood. Is there a 1% chance that this file contains a hidden message? Or is it more like 90%?

"Abstraction is selective ignorance." -- Andrew Koenig. It is sometimes neccessary to forget some things in order to adequately analyze a situation. However, this misses the point...

Steganography is like a one-time-pad. If I transmit a message encrypted with a one-time-pad, and an attacker intercepts the message, then it is impossible for the attacker to determine the actual contents of the message. In steganography, when the message is intercepted it is impossible for the attacker to even tell whether or not he has the message. This is the ideal. In practice, one-time-pads can be broken, and steganographic messages can be detected. Which brings us to the point of the original poster. If I can run a simple statistical analysis on the transmissions, and find the programs with hidden messages in them, then this method effectively becomes the steganographic equivalent of a Caesar cipher. In other words, it's worthless. In the kitten example you are concerned not only with hiding the message, but, most importantly, with hiding the communication channel. You would most certainly not just send the picture to M. bin M. There are a number of rather obvious ways to correct this.

As an example of a practical steganographic system there are communications systems which continuously stream garbage across the channel. When you want to use the channel the garbage is modified so that your message is encoded, and likewise for the response. Any attacker in the middle has no way of knowing what is garbage and what is not. Again, in practice there are attacks on this type of system, but they rely on extra information besides just access to the communication channel. Note that this is an, admittedly rough, approximation of posting random images with lsb modifications on the internet

Slashdot Mirror

Program Hides Secret Messages in Executables

28 of 243 comments (clear)