MD5 Proven Ineffective for App Signatures

Hmm... by Char-i-o's · 2007-12-01 23:41 · Score: 1

Need more salt.

Perhaps they should consider using something else than SHA-1? SHA-2 anyone?

Nothing new by grumbel · 2007-12-01 23:43 · Score: 5, Insightful

Unless I am missing something this is really nothing new. The same has been demonstrated with a webpage and javascript years ago, i.e. two different webpages producing the same MD5, doing it again with an .exe doesn't really sound all that interesting, especially since the attacker still needs to manipulate both the good .exe and the evil .exe and when he has access to the good .exe you are toast anyway.

This of course doesn't mean we should continue to use MD5, but the attack is really of rather theoretical nature.

Re:Nothing new by Anonymous Coward · 2007-12-01 23:46 · Score: 2, Informative

http://www.codeproject.com/dotnet/HackingMd5.asp

14 Sep 2005
Re:Nothing new by Instine · 2007-12-02 00:16 · Score: 1, Redundant

Bingo. While it may be of some interest to security freaks, the theoretical nature of this and so many other 'stories' means there's little news in them. Can we have fewer "if the attacker has full access to exe/db/OS/acc/.... they can do terrible things by simply [insert psuedo exploit here]"? We know.

--
Because you can - or because you should?
Re:Nothing new by Bert64 · 2007-12-02 00:16 · Score: 5, Interesting

If he has access to the good exe *before* it's signed, why not simply replace it with the malicious one so that the malicious one gets signed and distributed instead of the good one...

--
http://spamdecoy.net - free throwaway anonymous email - avoid spam!
Re:Nothing new by Anonymous Coward · 2007-12-02 00:40 · Score: 5, Informative

No, this is different. In the case of the colliding webpages, bit level inspection immediately reveals what's going on: both "good" and "bad" version are included in the webpages, with an if-statement to choose which one to display.

When you inspect these binaries at bit level, they contain only the "good" or the "bad" version, and some random data appended to it to make the MD5 hash of the files collide. This technique thus also works for file formats which don't have control statements such as "if" or "file starts at offset". See also: http://www.win.tue.nl/hashclash/Nostradamus/, scroll down to: "Didn't Daum and Lucks do something like this in 2005?"

Marc Stevens already constructed these "chosen-prefix" collisions for X.509 Certificates, see the HashClash project page. What's new in these results, is that it did not require massively distributed computing efforts, only one Playstation 3 and less than two days of computation. There is no paper available yet as to how he achieved this major optimization, but his MSc thesis gives a clue: see "future work" at the end of section 7.4.
Re:Nothing new by MathFox · 2007-12-02 00:44 · Score: 4, Informative

This is a different kind of attack: the "old" collision prefix attack had two blocks X and Y with the same hash that allowed one to create two programs:
X; if (X) then GOOD else EVIL
and
Y; if (X) then GOOD else EVIL
but the evil code would be in the signed good program, it would not be run.

The new attack is different: it is a method to generate blocks GX and EX for two random files such that the files GOOD+GX and EVIL+EX hash to the same checksum.

--
extern warranty;
main()
{
(void)warranty;
}
Re:Nothing new by The+New+Andy · 2007-12-02 00:44 · Score: 2, Informative

This is a different attack. The previous attacks meant that you could make two files with the same MD5 by making them completely identical, except for one small block which was any known collision.
This attack means that you get to choose the two files, and the attack generates two blocks to append to the original files which mean they hash to the same value.
So the exploits before have been:
File 1: x = [A] if (x == [A]) { do one thing } else { do something else } File 2: x = [B] if (x == [A]) { do one thing; } else { do something else; }
Where [A] and [B] are blocks which collide (and they are aligned on block boundaries for calculating the hash).
The new attack is exploited like this::
File 1: do one thing; ignore the rest of the file; [C] File 2: do something else; ignore the rest of the file; [D]
Where [C] and [D] are generated by this algorithm. This means that on a quick glance, the code doesn't look completely silly and there is no trace of what the hidden content says (just a bit of random stuff at the end of a file which might suggest that it is there)
Re:Nothing new by houghi · 2007-12-02 01:00 · Score: 1

The attacker does not need access to good.exe. He however can send me evil.exe and tell me to look on for the MD5SUM and verify for myself that said program is valid if I so desire.

Yes, you COULD then download it from the trusted site. However people interested in doing this are not interested in the failed attapts. They are interested in the attapated that worked.

--
Don't fight for your country, if your country does not fight for you.
Re:Nothing new by grumbel · 2007-12-02 01:28 · Score: 2, Informative

### The attacker does not need access to good.exe.

He *does* need access to good.exe. You can't generate a file that matches a given MD5, what you however can is generate two files that have the same MD5 and different content, both good.exe and evil.exe contain appended data to make the sums match. Its still a weakness, but a much less critical one then being able to generate a file for a given MD5.
Re:Nothing new by owlstead · 2007-12-02 02:15 · Score: 1

At my work I have two bash scripts that do the same. Unfortunately some invalid characters mess up the script. Now it does something useful (it actually prints: "something useful" if I remember correctly), or it crashes. This is because you still have to put some arbitrary data at the end of the script. Some command to make the interpreter stop would probably fix that, but since it was only proof of concept for me, I didn't go that far.

As the article mentions, this could be really bad for third party signing (e.g. MS Authenticode). Since I've seen that they sign practically anything that comes from a "trusted email address" that means practically squat. Of course, it is not fun to test if such an attack is somewhere in a text or binary file. The companies would be better off not using MD5 for such purposes.

Leave MD5 in legacy applications that are not vulnerable. Try and avoid SHA-1 as well. Everybody should upgrade to SHA-256 or SHA-512 for new applications or protocols that can be upgraded. The article is a good read-up for people that have not yet understood the problems with MD5, nothing more, nothing less.
Re:Nothing new by jthill · 2007-12-02 02:20 · Score: 1, Informative

Because you want an honest party to verify the "good" one, sign its MD5 with their trusted key, and actually distribute the good one.
Then you can in chosen circumstances replace it with the bad one (on, say, specific installs), and an ordinary audit will see the trusted signature on the package you thoughtfully provided on DVD.
Or think contracts: any signed-MD5 signature for a document in a format that ordinarily includes random-looking garbage is now untrustworthy, because what that person signed may have nothing in common with what you're being shown.

--
As always, all IMO. Insert "I think" everywhere grammatically possible.
Re:Nothing new by kasperd · 2007-12-02 02:51 · Score: 3, Interesting

After having read the actual article I realize that there in fact is something new in it. The slashdot story put all the focus on software signing, which is not the interesting part of the article. The interesting part of the article is, that they have found a new and stronger way to produce collisions. For one thing it is going to be a lot less obvious that a file is crafted. The original attack required all the colliding files to contain all the meaningful content with some psuedorandom content to select between them. The new attack doesn't require this, in fact you could even produce collisions beteween files of different formats. Like a jpg file and an exe file with the same md5 hash. But still it is just a collision attack, it produces collisions between two crafted files. They don't produce collisions between a collision between an arbitrary original file and one crafted file.

--

Do you care about the security of your wireless mouse?
Re:Nothing new by Kjella · 2007-12-02 03:00 · Score: 2, Interesting

Sneaking it past security control perhaps? Here's good.exe, run it in a sandbox all you like and it won't do anything funny. Then mark this MD5-sum as good and add it to the list of trusted installers, while I'll replace it with evil.exe before distribution/installation in the production environment.

For a pracical example:
1. Become a kernel contributor on some obscure driver.
2. Add a magic number somewhere, which is the good twin.
3. Wait for this to flow upstream to Linus, then downstream to all the distros.
4. Find a way to hack a mirror of your distro of choice
5. Replace the signed kernel with your trojaned kernel, that'll still be signed
6. Wait for people to install trojaned systems (enterprise systems!)
7. Profit (there is no ???)

Of course, this assumes you can use it knowing just the little magic bits. If you need to be the one compiling both good and evil using the exact same source, then it's very limited.

--
Live today, because you never know what tomorrow brings
Re:Nothing new by norton_I · 2007-12-02 03:54 · Score: 1

What you describe is not as easy as you make it sound. You can fiddle with the "good" source code so that it passes whatever hash test you want, but even assuming it gets included into the kernel with zero changes, what gets signed by redhat are the binary module or the whole kernel package. Even assuming the source to your module is not changed at all, the binary module is not likely to be the same as the version you compiled. At the very least, redhat includes a build number in every kernel package (also included in each kernel module), and unless you know in advance what the final released build number is going to be, your carefully constructed hashes will be invalidated.

People should stop using MD5 for secure hashes, but this vulnerability is already well known, and really requires signing a binary provided by an untrustworthy source to be effective. It is probably much easier to just get people to trust you (perhaps start a 3rd party RPM repo), then give them bogus binaries.
Re:Nothing new by fatphil · 2007-12-02 04:16 · Score: 1

Nope, this attack is new. This is taking fixed prefixes, and appending a suffix to create the collision. Previous attacks required being able to put something _before_ your fixed payloads.

--
Also FatPhil on SoylentNews, id 863
Re:Nothing new by OrangeTide · 2007-12-02 05:29 · Score: 1

#!/bin/sh
echo Hello World
return
42k k22 452n4 ,mnas,dn a,.sn

--
“Common sense is not so common.” — Voltaire
Re:Nothing new by Ghubi · 2007-12-02 06:14 · Score: 1

Actually isn't it GX + GOOD and EX + EVIL?
Re:Nothing new by Red+Pointy+Tail · 2007-12-02 13:10 · Score: 1

Your reasoning is correct, but it is flawed because having this MD5-exploit does not make it practically any easier to get away with publishing your exploits.

1) Even without this 'MD5-exploit', if the distributor is malicious from onset, then it is easy to just distribute a driver with a hidden exploit and sign it off as valid. Exploits can be written such that it functions like it is supposed to most of the time but provides a backdoor.

2) If you say that injecting this hidden exploit instantly show up in a code audit so the publisher can't get away with it -- then wouldn't a code audit also detect the verbiage that needs to be tacked on to the good.exe to make the MD5 match with the evil one, and also raise a red flag?
Re:Nothing new by Anonymous Coward · 2007-12-02 14:49 · Score: 0

Because you can use the "good" as the executable you distribute, say for 6 months. People start using it, noticing no problems/malware. After a while you release the "bad" one, with the same md5 checksum.

Well everyone's boned then by tietokone-olmi · 2007-12-01 23:53 · Score: 3, Interesting

[...] This may mean that the attacker needs insider access to the party operating the trusted software integrity protection or code signing process.

An attack that requires insider access? Well colour me frightened!

Or don't. That's more accurate anyhow.

Re:Well everyone's boned then by Anonymous Coward · 2007-12-02 00:13 · Score: 2, Interesting

Perhaps you should read this article with particular reference to the table 'Stages in the life cycle of cryptographic hash functions'. By the way you are one or two stages behind.

Right by setrops · 2007-12-01 23:54 · Score: 1

"This may mean that the attacker needs insider access to the party operating the trusted software integrity protection or code signing process.'"

Let's see now, the attacker already has access to the machine and is probably the one creating or comparing the MD5. Is the problem really with MD5?

Well, duh! by YA_Python_dev · 2007-12-01 23:59 · Score: 4, Insightful

The problem has nothing to do with salt, and can be certainly temporarily "fixed" switching to SHA-1 or, even better, SHA-2. But the real root of the problem here is that, for the attack to work, someone signed as trusted a binary file that contained malicious code in the first place, even if in a disable form.

Let me explain that. First, this is very old news: we know since 2004 that collision can be found in MD5 hashes (two different files with the same md5sum), and there now are tools that can generate collisions in seconds. All you need is a common prefix and suffix for both files and two block of 128 bytes that are generated automatically and you can insert between the prefix and the suffix to create the two files.

Applying this to pretty much any file type that can contain binary data (even XML 1.1!) is trivial. For an executable file you can simply insert code in your prefix/suffix that looks at the pseudo-random 128 bytes and does radically different things depending on it. This as already been demonstrated for HTML+JS and even for postscript files.

Bottom line: if you have an executable file from an untrusted source it may contain bad things (the attack described requires that both the original signed file and the file that you are actually executing are generated by the same hostile source).

--
There's a hidden treasure in Python 3.x: __prepare__()

Re:Well, duh! by bhima · 2007-12-02 00:23 · Score: 1

I thought SHA-1 which similarly compromised and folks were advised to use the SHA-2 variants SHA-256 & SHA-512... with SHA-384 being mostly silly except for very specific applications where key length was limited but computation was not.

--
Nothing in the world is more dangerous than sincere ignorance and conscientious stupidity.
Re:Well, duh! by Anonymous Coward · 2007-12-02 01:22 · Score: 0

This type of attack always seem to depend on adding data to an altered blob so the hash turns equal again (the way I understand it at least). So maybe the problem could be solved by not only storing the hash but also the lenght of the data it was hashed from? I think for example gentoo does this already when checking the SHA1 & MD5 sum of source tarballs.
Re:Well, duh! by cheater512 · 2007-12-02 01:47 · Score: 1, Insightful

Well every hash function will eventually have flaws like this.
Its inevitable.
Re:Well, duh! by hotrodent · 2007-12-02 02:09 · Score: 2, Insightful

Something I've never understood about this problem: Why is the following not an easy "fix"?

1) Generate an MD5 hash for a file.
2) Generate an SHA-2 hash.
3) .. more as needed ...
4) Concatenate the results for a "super hash"
5) Profit?

Surely to manipulate 2 (or more) schemes to ensure the super hash is the same on a tampered file would be _many_ orders of magnitude harder?
Trying to make the SHA-2 match would destroy all the previous work done to make the MD5 match, then fixing the MD5 would change the SHA-2 again.
IANAC (cryptographer) so excuse my ignorance on this if I'm missing something.
Re:Well, duh! by Anonymous Coward · 2007-12-02 04:44 · Score: 0

It's not adding data (length) to the blob, it's changing the data bit-by-bit. So the lengths are the same.
Re:Well, duh! by fatphil · 2007-12-02 05:29 · Score: 1

"someone signed as trusted a binary file that contained malicious code in the first place,"

Just plain wrong. Read the article.

--
Also FatPhil on SoylentNews, id 863
Re:Well, duh! by fatphil · 2007-12-02 05:36 · Score: 1

It has been proven that such concatenation isn't as strong as the sum of the strengths as the two hashes; if you can crack one of the component hashes, then you can generally generate a vast number of collisions to throw at the rest other hash. Sure, it's stronger than the strongest individually, but it's best to just use a single full-width hash.

--
Also FatPhil on SoylentNews, id 863
Re:Well, duh! by icepick72 · 2007-12-02 06:07 · Score: 1

the attack described requires that both the original signed file and the file that you are actually executing are generated by the same hostile source

In other words, it's as easy as accidentally downloading it from the wrong web site. The files will still look right. The malicious web site would have obtained the executable from the originating download site and modified it according. Exceedingly simple and nefarious to attack the unsuspecting.
Re:Well, duh! by grumbel · 2007-12-02 07:20 · Score: 1

### The malicious web site would have obtained the executable from the originating download site

No, it would have to *modify* the original file on the original server to make this work.
Re:Well, duh! by hackstraw · 2007-12-02 07:39 · Score: 1

irst, this is very old news: we know since 2004 that collision can be found in MD5 hashes (two different files with the same md5sum), and there now are tools that can generate collisions in seconds. All you need is a common prefix and suffix for both files and two block of 128 bytes that are generated automatically and you can insert between the prefix and the suffix to create the two files

OK, that sounds fun. But at least for open source code, the md5checksum is on a compressed archive, not a raw executable. Wouldn't this collision still be practically impossible on an archive?
Re:Well, duh! by mlts · 2007-12-02 08:26 · Score: 1

Hash functions are a relatively young cryptographic technology, only have been around for about fifteen years in the mainstream. Unfortunately, due to this, there are still a lot of bugs to be ironed out. MD5 is (IIRC) only the second generation of hashes. Bulk cryptography has had far longer to evolve, from enigma-type rotor algorithms, to DES, and finally to AES, where each algorithm has been scrutinized for years for any possible issues.

I commend NIST for having a SHA-3 crypto contest. I'm pretty sure something solid will come of it, and if someone *does* know how to generate collisions with the chosen algorithm, they will keep it to themselves very quietly, and not expose it.

One security precaution I do think people should do is go by not just the MD5/SHA hash, but also sign the size of the file. Its a lot harder to generate an evil.exe when it has to be exactly the same size as the good.exe.
Re:Well, duh! by icepick72 · 2007-12-02 08:40 · Score: 1

Don't need the original server in this exploit proof. If anything you just need source code and your own web page, both easy to obtain (open source), and make the user believe you have a correct version ...

It says: Now, we can publish good.bin in the Internet for people to download it, and later, we can replace it with evil.bin. Now, the users will get infected, without noticing and convinced that there is no tampering, because the MD5 signature is the same for both files, in others words we have MD5(good.bin) == MD5(evil.bin).
Re:Well, duh! by mrmeval · 2007-12-02 10:30 · Score: 1

It's not intuitive but stringing them together doesn't work well. I'd just dump md5 and go with more proven code.

I don't understand why a gnupg digital signature is not used by default.

--
I'd go on a Vegan diet but the delivery time from Vega is too long. --brownkitty
Re:Well, duh! by Garridan · 2007-12-02 11:46 · Score: 1

Nope. That's really not a problem.

MD5 is what, 64 bytes? Your file will probably be less than a terabyte (2^40 bits), so it'll have a representation of 40 bits, or 5 bytes. So now you have a 71 byte hash. Are you going to tell me that 71 bytes is really "a lot harder" than 64? You're delusional.

The problem is that data is intrinsically incompressible. If you map the space of all finite sized binary words into the space of all binary words of n bits, you have an infinite-to-one map. If you want collisions to be "hard", you increase the size of your target space. If you want 'em to be really hard, you're playing against the future, which is almost always a losing bet.
Re:Well, duh! by kayditty · 2007-12-02 16:27 · Score: 1, Informative

No. MD5 is 16 bytes.
Re:Well, duh! by Garridan · 2007-12-02 17:47 · Score: 1

w00t. 16, to 21 bytes then. That's still a *tiny* key. Given that the popular collision method is "insert garbage into the file", one needs only "replace file contents with garbage" instead.
Re:Well, duh! by cheater512 · 2007-12-03 01:46 · Score: 1

Making bad.exe the same size is dead simple actually as long as its smaller to begin with.
You can pad it out without too much trouble.

Whats tougher is making dual collisions. Say MD5 and SHA1.

Not accurate, not new by Niten · 2007-12-02 00:05 · Score: 4, Insightful

MD5 collision attacks aren't really new, although this is a powerful example. An equally meaningful example of a collision attack on the algorithm, in the form of two different PostScript files with the same MD5 hash, was provided at least two years ago (IIRC).

The key to understanding the limits of this demonstration's significance is to realize that a collision attack is quite different from a prefix attack. These researchers were able to create a pair of executables having the same hash value by specially constructing them as such; crafting a new executable to match a specific hash value corresponding to some other party's executable is vastly more difficult to achieve.

So while this demonstrates MD5 to be useless for uses where the purported signatory is to be included in our threat analysis -- as has already been demonstrated to us by other researchers -- the algorithm is still relatively safe if our only goal is to ensure that a given executable almost certainly came from a specific party (rather than showing that it is a specific executable from said party). In other words, one could conceivably use MD5 to verify that the Ubuntu packages on that FTP server were in fact produced by Canonical. So no, demonstration does not mark MD5 as completely useless for code signing; the most common applications of code signing are entirely unconcerned with collisions in the hash function.

In conclusion: the title is terribly misleading, or possibly just misinformed. Boo! Hiss!

Re:Not accurate, not new by Niten · 2007-12-02 00:08 · Score: 1

Err, "prefix attack"? It's too early in the morning for me to be posting to Slashdot...

pretext attack, for the record.
Re:Not accurate, not new by cortana · 2007-12-02 00:34 · Score: 1

You're at stage six, "Explain why a simple collision attack is still useless, it's really the second pre-image attack that counts".

http://www.linuxworld.com/cgi-bin/mailto/x_linux.cgi?pagetosend=/export/home/httpd/linuxworld/news/2007/111207-hash.html :)
Re:Not accurate, not new by ferespo · 2007-12-02 10:14 · Score: 1

Totally off topic, but it seems that you know something about the subject and I have a question. When a file is encrypted through a strong algorithm and there are private keys between the parts, it is still necessary to add something like MD5 to preserve integrity? I mean, in this particular case, it appears that confidentiality assures data integrity (in the general case, of course) ? Is it possible in this context to alter an encrypted file without violating confidentiality?
Thanks
Re:Not accurate, not new by Niten · 2007-12-02 16:34 · Score: 1

(I'm no expert on this by any means, but I hope I can partially answer your question...)

When a file is encrypted through a strong algorithm and there are private keys between the parts, it is still necessary to add something like MD5 to preserve integrity?
I'm not sure I completely understand what you mean by "private keys between the parts." But if you mean entirely symmetric encryption, then in many circumstances a cryptographic signature -- and therefore a hash function to produce the signature -- is unnecessary, as an attempt to tamper with a transmission without knowing the secret key would result in meaningless gibberish on the receiving end. The major caveat is that, in the context of such a message, random data must be easily recognizable as meaningless gibberish: this will be the case if the transmission is a plaintext written message or something structured (XML, a structured binary format, etc). But if the transmission is of some sort of arbitrary data, then an attacker might be able to do some damage simply by tricking the receiving end into interpreting essentially random (from the attacker's perspective) data as a valid input from a trusted sender. Replay attacks can also be a threat, and cryptographic signatures are often used to prevent them, but it's sort of a separate issue...

Where hash functions and signatures really come into play is in public-key cryptography: here anyone can encrypt a message (actually, encrypt a symmetric key which was used to encode a message) to the recipient, so the fact that a message is properly encrypted is meaningless and additional means are required to demonstrate that the message came from a trusted party -- however "trusted party" may be defined.
Re:Not accurate, not new by jhol13 · 2007-12-02 17:32 · Score: 1

Relatively safe?

No.

MD5 is completely utterly broken. Sites like http://www.itconsult.co.uk/stamper.htm are now useless. Actually they are worse than nothing: this can fool people.

The fact that I can create good and bad version of programs is very bad: I can put the good, wait for scrutiny and then change to the bad. People have no easy way of knowing which one they downloaded (and are using).

It is very apparent it is only matter of year or two before someone can replace any file with another. Why wait for that? Why not replace MD5 as signing method now - before the shit really hits the fan?
Re:Not accurate, not new by ferespo · 2007-12-03 02:33 · Score: 1

You got it very well, I meant symmetric encryption. Thank you very much
Re:Not accurate, not new by Eivind+Eklund · 2007-12-04 21:07 · Score: 1

It is sometimes possible, depending on the exact way we use the symmetric cipher and the exact symmetric cipher.
Three examples:
First, a stream cipher. A stream cipher is a cipher that generate a pseudorandom stream of data, which you then XOR with your data stream. The most well-known stream ciphers are RC4 (used in e.g. SSL) and the one-time pad (having a bunch of true random data you XOR with your data, where the random stream is the same length as your data). The attack here is that the attacker can randomly flip bits in your data, at whatever offset he choose.
Second, a standard block cipher, employed in block-at-a-time mode (Electronic Codebook - ECB - Mode). Most standard symmetric ciphers are block ciphers - on encrypt, they take one block of plaintext (uncrypted data) and a key and produce an encrypted block (ciphertext); on decrypt, they take a block of ciphertext and a key and produce corresponding the plaintext block. In "block at a time mode" you just sequentially split the plaintext into blocks, encrypt each block with the key, and write it to the output. There are three weaknesses here: First, the attacker can replace blocks of your data with other blocks of your data, even if he can't know the content of the data. If the data is highly structured and he knows the structure, he may be able to do a targetted attack by replacing a block. Secondly, he may randomly trash a block. Again, problem with structured data. Thirdly, he may learn of the content of the data by learning which blocks are equal. This is a significant information leak, which can help with the two modification attacks above (and is very unfortunate in its own right). This brings us to the third large example, which is
Third example, a block cipher used in block chaining (CBC) mode. Here, each block of plaintext is XORed with the previous block of ciphertext before being encrypted. This is probably the most common way to use block ciphers, as they are no longer vulnerable to the information leak above. In CBC mode, a one bit corruption in a ciphertext block will trash the corresponding decrypted data block - and cause a controlled one-bit change in the next data block. Without integrity checking, this can be VERY unfortunate.
There are other block cipher modes available, with other properties, including some that are supposed to provide integrity. I am not really familiar with those, so I'll stop here. The point was just to show that you absolutely have to take integrity into considering; the symmetric crypto doesn't give it to you for free, at least not in many significant cases.
Eivind.

--
Doubting the existence of evolution is like doubting the existence of China: It just shows that you're uninformed.

Birthday Attack by tangent3 · 2007-12-02 00:15 · Score: 4, Insightful

This is an example of a Birthday Attack. 1. Attacker generates Good.exe and Evil.exe which hashes to the same MD5 2. Attacker passes Good.exe to the key owner to sign 3. Key owner signs and release Good.exe and Good.exe.MD5 4. Attacker releases Evil.exe as Good.exe This of course, requires some serious social engineering to work. MD5 is outdated, yes, but at the moment it is still resilient against a normal attack where an attacker has to generate an Evil.exe to hash to the same MD5 as an already-available Good.exe

Re:Birthday Attack by Anonymous Coward · 2007-12-02 00:28 · Score: 4, Informative

Sorry but you are wrong. The attack uses two md5 inputs which collide to construct two programs which are otherwise identical. The program can then be contrived to exhibit different behaviour depending on which of the two colliding inputs was used. This is nothing to do with the birthday paradox (except that it may have been used to find the collisions in the first place). Otherwise you description of the attack is accurate.
Re:Birthday Attack by owlstead · 2007-12-02 02:02 · Score: 1

Mod parent up. The person is only a A/C, but I cannot see how this is related to the birthday paradox.
Re:Birthday Attack by Anonymous Coward · 2007-12-02 05:35 · Score: 1, Informative

This is nothing to do with the birthday paradox (except that it may have been used to find the collisions in the first place).

That's the whole point of the parent, isn't it? Unless I'm missing something, the collision is generated by a birthday attack approach and the parent was exactly right. It's why the attacker needs to write both GOOD.EXE and EVIL.EXE; if he could generate an arbitrary md5 sum, he could replace any GOOD file and this would be much worse.

IMHO it's still pretty annoying (though not new).

use two hash functions by m2943 · 2007-12-02 00:17 · Score: 4, Informative

The particular scenario they describe is irrelevant; MD5 checksums aren't intended to protect against that. If the attacker can manipulate the original file, he can usually simply alter it to become malicious itself.

The case that matters is producing a program with the same checksum as a given program, without the ability to manipulate the correct program beforehand. That's still hard.

Nevertheless, code signing mechanisms in general should probably be prepared for flaws in hash functions. It might be best always to use two hash functions and to have some strategy of migrating. That way, if one hash function gets compromised, there is still another one in place and can be used until the original one has been replaced.

Re:use two hash functions by mollymoo · 2007-12-02 02:53 · Score: 3, Interesting

The particular scenario they describe is irrelevant; MD5 checksums aren't intended to protect against that. If the attacker can manipulate the original file, he can usually simply alter it to become malicious itself.

The problem as I see is that the harmless version can be released and gain trust. That version can be tested and inspected, even checking the binary wouldn't reveal malicious code because there wouldn't be any malicious code to find - no dodgy looking system calls, for example. Just a chunk of seemingly random data, which could be disguised as a lookup table, compressed image or whatever. At some later point, after the harmless version has gained trust, its use has become more widespread and the rate of downloads has increased correspondingly, it can be replaced by the malicious version. So while you could initially release a malicious version, being able to first release a harmless version can widen the impact of an attack.

--
Chernobyl 'not a wildlife haven' - BBC News
Re:use two hash functions by m2943 · 2007-12-02 04:16 · Score: 1

Why would you need MD5 forgeries for that? A package maintainer can simply gain trust with a sequence of good binary releases and then, at some point, release a correctly signed bad binary corresponding to a small source code change.
Re:use two hash functions by maxwell+demon · 2007-12-02 04:59 · Score: 1

But if you have built up trust that way, I'm sure simply releasing an "improved version" would also get you installed on a lot of machines, even though the md5sums differ (indeed, it would be very suspicious if the md5sum of the improved version still is the same as of the original one).

--
The Tao of math: The numbers you can count are not the real numbers.
Re:use two hash functions by noidentity · 2007-12-02 09:41 · Score: 1

It might be best always to use two hash functions and to have some strategy of migrating. That way, if one hash function gets compromised, there is still another one in place and can be used until the original one has been replaced.

Isn't using two hash functions the equivalent of using one hash function that is the mathematical equivalent of the two?
Re:use two hash functions by AnyoneEB · 2007-12-02 13:17 · Score: 1

Well, yes, but it at least gets you more bits. In addition, I think MD5 and SHA1 use sufficiently different mathematics that it would be difficult to come up with an attack technique which was effective on both of them. Instead you would likely have to try to find collisions in the two separately and keep going until they both collide for the same data, which does not help much in finding a collision.

--
Centralization breaks the internet.
Re:use two hash functions by noidentity · 2007-12-02 13:49 · Score: 1

If this is true, then they should create a new MD5SHA1 hash function that can be treated as a unit and a single hash value when being displayed. Or would that be considered too impure/inelegant?
Re:use two hash functions by ToasterMonkey · 2007-12-02 14:03 · Score: 1

That version can be tested and inspected, even checking the binary wouldn't reveal malicious code because there wouldn't be any malicious code to find - no dodgy looking system calls, for example. Just a chunk of seemingly random data, which could be disguised as a lookup table, compressed image or whatever. I'll ask the same thing many others are asking. If you can sneak garbage into a binary, why not sneak the actual payload in? And... there are people who inspect binaries and are thrown off by random garbage code? What exactly are they looking for then? If I throw a fistful of metal widgets under someone's hood, would I expect to fool the driver or his mechanic? The attack most of you are suggesting is so bizarrely over-complicated.

It's like a crazy plan to put a suspicious looking EMPTY package in the mail.. followed by an equally puzzling plan to somehow deliver the suspicious looking EVIL package later on.

Let me give it another shot...
1. Already have access to source and/or binary to be released.
2. You can't sign your own package.
3. Submit a benign, funny looking package for signing & release.
4. Switch out evil, funny looking package for distribution later, keeping the same valid signature.

Is this a plot from Mission Impossible IV? This is so silly. The attacker would already have to have access to the source, and distribution system, but not the signing mechanism for this attack to be worthwhile. You are so screwed at that point, any weakness in MD5 doesn't matter one bit.
Re:use two hash functions by AnyoneEB · 2007-12-02 15:59 · Score: 1

If you are going to start using a new hash function, you might as well use SHA-2 or something else with no known attacks. MD5 and SHA1 attacks are interesting because a lot of existing applications use them, but seeing as there are known partial attacks against both, one would likely be best off avoiding them for new applications.

--
Centralization breaks the internet.
Re:use two hash functions by m2943 · 2007-12-04 16:32 · Score: 1

The issue is not whether two secure hash functions are better than one, the issue is whether two different, possibly insecure hash functions are better than one. And they are, because the probability that two independent hash functions become compromised at the same time is much smaller than a single function getting compromised.

Ah yes, this again by Effugas · 2007-12-02 00:18 · Score: 4, Interesting

OK, it's pretty damn cool to see people 'round here referencing my work on Javascript MD5 collisions :)

The relevant links are:

http://www.doxpara.com/research/md5/t1.html
http://www.doxpara.com/research/md5/t2.html ...and the original paper:

http://www.doxpara.com/research/md5/md5_someday.pdf

I'm pretty sure I talked about third party attestation in that paper.

A more interesting point was made to me just the other day, which is that there's always enough ambient entropy in any real world system to deviate between trusted and untrusted behavior. In other words, for a turing complete app, you *can't* create a meaningful hash, because you aren't capturing all bits that will drive the execution flow. So, getting code signed really doesn't assert anything other than a business relationship. App signatures don't actually work, for any arbitrarily good hash.

Re:Ah yes, this again by TorKlingberg · 2007-12-02 02:11 · Score: 1

With a fairly short hash there will always we several valid codes with the same hash, but doing different things. However if finding such pairs is difficult enough, hash signing works. This difficulty is the whole point of hashes.
Re:Ah yes, this again by Anonymous Coward · 2007-12-02 03:40 · Score: 0

ur face contains ambient entropy
Re:Ah yes, this again by Henry+V+.009 · 2007-12-02 04:18 · Score: 2, Insightful

A more interesting point was made to me just the other day, which is that there's always enough ambient entropy in any real world system to deviate between trusted and untrusted behavior. In other words, for a turing complete app, you *can't* create a meaningful hash, because you aren't capturing all bits that will drive the execution flow. So, getting code signed really doesn't assert anything other than a business relationship. App signatures don't actually work, for any arbitrarily good hash.
That is simply wrong. For any "arbitrarily good" hash there exist collisions out there between bad_app and good_app. However, if finding bad_app or bad_app2, etc., is computationally impracticable (which is the definition of a good hash), then the hash is quite useful.

Currently md5 is fairly broken, in that a person can specially prepare good_app and bad_app. However, it is not yet completely broken (like CRC, if it were ever used as a secure hash), in that it is not yet possible for someone to take an arbitrary md5-signed file (like the recently released ubuntu iso, to give an example) and generate a collision. The birthday paradox makes the former much easier than the later.

This is a REMOTE attack, and reasonably potent by CarpetShark · 2007-12-02 00:33 · Score: 3, Informative

An attack that requires insider access? Well colour me frightened!

If you'd read the article, you'd see that one of the (prominent) possible attack scenarios listed is that of software distribution: distribute a good file, with the intent of replacing it later. For example, in debian, even with MD5 checksums on all your data, and tools reporting what's changed during the software update, this would still allow downloading infected files, without noticing.

It's a danger both from malicious distributors, and from hacked distribution sites.

Re:This is a REMOTE attack, and reasonably potent by quigonn · 2007-12-02 01:13 · Score: 4, Funny

If you'd read the article,

Reading the article? THIS IS SLASHDOT!!!!!!1!

--
A monkey is doing the real work for me.
Re:This is a REMOTE attack, and reasonably potent by cloudmaster · 2007-12-02 04:35 · Score: 1

Yes, you have to be an "insider" to distribute the good file.
Re:This is a REMOTE attack, and reasonably potent by OrangeTide · 2007-12-02 05:34 · Score: 1

If you're an insider and you tell me that you're updating me to a new version with a completely different MD5 I would probably trust you too. The new version could have any bizarro code in it, because I already trust you. MD5's strength or weakness doesn't matter at that point.

True it is interesting that you can make MD5 do these neat tricks. But as in insider your non-malicious garbage appended exec should likely fail to pass the review process. People will ask "why do you have this funny bit of assembly that appends some hex??". If there is no review process then it doesn't matter and you can drop in your malicious code right away. If you're not inserting the garbage at the source level, and by passing the review. again you could MD5 *anything* anyways. So it doesn't matter.

--
“Common sense is not so common.” — Voltaire
Re:This is a REMOTE attack, and reasonably potent by Anonymous Coward · 2007-12-05 00:10 · Score: 0

Yes, and here's a "tip" for you: Take care of your customers!

ONE block, surely by CarpetShark · 2007-12-02 00:39 · Score: 2, Interesting

Surely the point is that, if you can generate two blocks that do this, then you can generate one block to pair with a previously known block -- such as something in open source code.

Re:ONE block, surely by jthill · 2007-12-02 01:23 · Score: 5, Informative

TFA points out specifically that no one knows how to target a specific hash code. All they can do is make two files converge on the same hash code by inserting data into *each* of them.

--
As always, all IMO. Insert "I think" everywhere grammatically possible.
Re:ONE block, surely by colmore · 2007-12-02 12:37 · Score: 1

It's a little obscure in the article (and elsewhere) but you have to generate two executables, both of which are manipulated to hit a common target. Basically some hashcodes are easier to hit than others. You can't hit an arbitrary one.

Anyway, use one of the SHA algorithms

--
In Capitalist America, bank robs you!

Use GnuPG instead by gweihir · 2007-12-02 00:39 · Score: 4, Insightful

As many projects have done for years. md5 sums as crypto-protection are more or less a historic way to do it.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.

Re:Use GnuPG instead by Anonymous Coward · 2007-12-02 03:25 · Score: 0

The default hash algorithm for GPG is SHA1--which is becoming weaker.

Door locks are insecure if you can get a key too! by Nezer · 2007-12-02 00:49 · Score: 2, Insightful

This may mean that the attacker needs insider access to the party operating the trusted software integrity protection or code signing process. Isn't this a bit like saying that door locks are insecure although you may need access to a party trusted with the keys in order to exploit? Aren't these "trusted parties" *always* a potential weak-link in the security chain?

So MD5SUM veriefies downloads only by houghi · 2007-12-02 01:03 · Score: 1

I alsways thought that MD5SUMS where there only to verify wether a download was successfull or not.

--
Don't fight for your country, if your country does not fight for you.

Re:So MD5SUM veriefies downloads only by grumbel · 2007-12-02 01:34 · Score: 2, Informative

An MD5 checksum file alone serves no other purpose then to check that the download is correct, since an attacker that can upload a changed file could also just change the MD5 checksum file. Things look a little different if you get the MD5 from a different trusted source or when the MD5 file is signed by a GPG key.

Not a real life scenario... by Matthieu+Araman · 2007-12-02 01:05 · Score: 3, Insightful

Real life scenario :

developper A produce software X(for example openssh), calculate hash of program X and sign the hash with his PGP key.
He then put all these files on mirrors servers on Internet (but not his private PGP key !)

One mirror is hijacked by B.
B wan't to replace X by X' with the same hash than X

This article doesn't provide anything as it says MD5(X+a)=MD5(Y+a), which imply you have to change A in the first place which can't be done easily (and if you can change the original program, then what's the point ?)

Re:Not a real life scenario... by SagSaw · 2007-12-02 05:38 · Score: 1

developper A produce software X(for example openssh), calculate hash of program X and sign the hash with his PGP key.
He then put all these files on mirrors servers on Internet (but not his private PGP key !)

/ One mirror is hijacked by B.
B wan't to replace X by X' with the same hash than X

True, but imagine that developer A is the attacker. That developer can create two different versions of his program: One benign version to go trough verification, QA testing, and any customer acceptance testing, and a malicious version which is actually distributed. Now developer A can slip in malicious features which would otherwise be discovered during testing.

Also, keep in mind that B might also have access to the signing key. If so, he can create X' and X'' from X which have the same MD5 sum. X', without malicious content, is made available for download. B can then use X'' for whatever he wants. For example, he can arrange that only his target gets X''. Since there have been no reports of problems with X', and the signature on X'' checks out and matches the signature on X', the target may never know that he has received a different version of X than everybody else.

--
Come test your mettle in the world of Alter Aeon!
Re:Not a real life scenario... by Matthieu+Araman · 2007-12-02 07:42 · Score: 1

If I trust developer A and he include bad thing, he will risk ruining his reputation so that's not a md5 problem.
The problem is also not the same with a binary only exe and some open source programs where you have access to the source and all the change in the repository are signed with a pgp key.
That way, it's easier to verify that a binary is made from a given source.
It's a lot harder to try hiding something like md5 garbage in source form.
Re:Not a real life scenario... by Anonymous Coward · 2007-12-02 07:46 · Score: 0

But protecting me against attacks from the author is not something any sort of hash/signature system can do anyway. The author could attack a specific host just by checking the IP address (or any other unique attribute) of that host in his code, and executing the malicious version only on a match. Such code would pass the QA tests and would not generate problem reports for testers or other users. And it would work even if you hashed with more than one algorithm and signed with more than one key, because the program itself doesn't need to be changed between the "good" and "bad" versions.

So unless someone can think of an attack that doesn't require the cooperation or compromise of someone already capable of signing the code, I'm not gonna worry about this too much.
Re:Not a real life scenario... by SagSaw · 2007-12-02 09:22 · Score: 1

If I trust developer A and he include bad thing, he will risk ruining his reputation so that's not a md5 problem.

Yes and no. In some cases, the risk of damage is low enough that it makes sense to trust the developer. In other cases, the risk is great enough that there should be safe-guards in place to ensure that a rogue developer can't insert malicious code. Safe-guards which rely on the MD5 hash should no longer be relied upon to prevent a rogue developer from inserting malicious code.

It's a lot harder to try hiding something like md5 garbage in source form.

Not necessarily. Take an empty line in the source file and add various combinations of tabs and spaces. For some languages, you might be able to add other white space characters as well. Or, you could add 128-bits worth of initialized variables and hide the fact that they do nothing. In any case, creating 128 possible changes to the file should be easy to do in a manner which passes a cursory inspection.

--
Come test your mettle in the world of Alter Aeon!

A workaround? by finnw · 2007-12-02 01:41 · Score: 1

Create trusted_program.

Take trusted_program and /bin/false, use this technique to generate trusted_program2 and false2.

Post both trusted_program2 and false2 on your web page along with their shared md5sum and invite the user to download them (presumably the user trusts you and your web server or he wouldn't download them in the first place.)

The user is now confident that you cannot replace trusted_program2 with malicious_program without changing the md5sum, because this technique only works with two prefixes, not three.

--
Is Betteridge's Law of Headlines Correct?

Re:A workaround? by sqlrob · 2007-12-02 05:37 · Score: 1

Here's an example with 12. Linked to from TFA even.

what it really means .. by rs232 · 2007-12-02 02:08 · Score: 2, Interesting

"if you can change the original program, then what's the point ?)"

Well, what it means is that an evil software megacorporation could publish a digitally signed app that could be replaced with another presumably nefarious prog later on ..

Re:Not a real life scenario...

--
davecb5620@gmail.com

Nothing new by kasperd · 2007-12-02 02:24 · Score: 1

As others have pointed out, there is nothing new in this. The same has been demonstrated with other languages before. For example a few years ago it was demonstrated with postscript, and that was as far as I know the first demonstration with meaningful content. While that may have come as a surprise to some people, it was only a minor curiosity to people understanding how md5 works. Doing this thing with exe files is less significant than it was to do it with postscript files for the following reason. You are not likely to sign an exe file from an untrusted source, because there is no way to verify if the content is malicious, and most people know this. In fact that is the very reason for having signatures in the first place. With postscript files it is different, to most people a postscript file is just a document. With a document you can read it, and then you know exactly what the content is. At least I can understand why people would think that way. So less social engineering is required to get your crafted postscript file signed than it would to get a crafted exe file signed. If you can get somebody to sign your carefully crafted exe file, then this attack doesn't matter anyway. Because if you could get your malicious code into the signed executable, there are lots of other ways to trigger it. Having an interchangable piece of pseudorandom binary data in the file itself is a neat way to trigger your code. But it can be based on other external factors such as timing, the IP address of the machine, the existence of certain other files on the system, or just some secret sequence of inputs. It is all just the matter of putting a backdoor into a piece of code, and attacking md5 in this way is not even the most convenient backdoor you could make.

If it was possible to make a crafted file that would match the md5 of some existing file, which you had no control over, then there would be a lot more reason to worry. Luckily that is not the case yet. Still the demonstration of collisions does serve as a warning, that md5 is weak and maybe somebody will be able to completely break it at some point. There has been a reason to worry about that ever since the collisions were first demonstrated. The construction of additional collisions with meaningful content doesn't change that threat in any way. If you were not worried before this news but you are worried afterwards, it is because you didn't understand the threat.

--

Do you care about the security of your wireless mouse?

Comment removed by account_deleted · 2007-12-02 02:29 · Score: 1

Comment removed based on user account deletion

Security hashes by billcopc · 2007-12-02 02:40 · Score: 1

Okay so someone was a bit late to learn that MD5 collisions are indeed possible. Congrats, you're still retarded!

It's not exactly hard to understand that a 128-bit hash is going to be less unique than a multi-kilobyte executable. I believe 3rd grade math has that covered. With processor speed increasing steadily, these things become easier to break with each passing day.

--
-Billco, Fnarg.com

Re:Security hashes by SagSaw · 2007-12-02 05:11 · Score: 1

It's not exactly hard to understand that a 128-bit hash is going to be less unique than a multi-kilobyte executable.

In theory, it should require 2^64 attempts to find two identical messages with the same MD5 hash. That should be enough to routine prevent brute-force attacks for the foreseeable future, except possibly for attackers with deep enough pockets to build special MD5 cracking machines. This issue isn't simply that the 128-bit hash is less unique than the multi-kilobyte executable being signed.

What the authors of this paper have done is expanded on a known flaw in the MD5 algorithm to allow them to find two messages with the same signature in 2^50 attempts instead of 2^64 attempts (requiring 1/16384th the time). What should only have been routinely possible for an attacker with the money to build dedicated hardware is now practical for anybody who wants to dedicate a Playstation3 to the task for a couple of days.

Also note that this attack "only" lets the attacker create two files with the same MD5 hash. It does not allow the attacker to create a new file with the same MD5 hash as an existing file. For now, the means the attacker still can't create a malicious executable to replace an existing, benign, executable unless the attacker also has access to the signing key or can change the published MD5 hash of the executable. However, if the attacker has access to the signing key or can change the MD5 hash published for the executable, he can create two versions of the executable. The attacker could submit the benign version verification and QA testing but distribute the malicious version instead.

--
Come test your mettle in the world of Alter Aeon!
Re:Security hashes by billcopc · 2007-12-02 12:49 · Score: 1

I don't trust MD5 hashes anyway. If I can't trust the source, I don't run its code.

--
-Billco, Fnarg.com

Haw haw by Anonymous Coward · 2007-12-02 03:02 · Score: 0

Still got him his moderation boosts, tho.

Md5 as a signature by Almahtar · 2007-12-02 03:38 · Score: 1

I see a lot of comments about how, since this attack requires access to the file both before and after signing, this is a non-issue. In most cases you're right, but get creative.

You have a lengthy verification process for new software - you check it over thoroughly to make sure it can be trusted, and after you certify it as trustworthy you sign it and only need to re-certify if the signature changes next time you download it from me.

I deliver a new version of the software to you (the "good" version), you certify and sign it (using MD5, unfortunately for you). I swap out the "evil" one, and next time you download it -- sure enough, the signature verifies it's fine.

What if you even had a virus scanner that used MD5's on executables for lazy re-scanning when they'd been modified?

I'm not sounding the "holy crap we're doomed" alarms, just pointing out that if you can take two different files and get the same "signature" from them, it's not a very good "signature", now is it?

Re:Md5 as a signature by Grampaw+Willie · 2007-12-02 04:44 · Score: 1

I deliver a new version of the software to you (the "good" version), you certify and sign it (using MD5, unfortunately for you). I swap out the "evil" one, and next time you download it -- sure enough, the signature verifies it's fine

That shouldn't work: MD5 requires the recipient to regenerate the HASH and then check the signature. I have no idea why they think this is a performance improvement as you are going to have to scan the entire content of the messsage ( program ) in order to regenerate the hash.

so by checking the signature on the HASH instead of on the whole document all then have done is to weaken PGP signature checking capability. But not much: the odds of two source files producing the same HASH ain't real good (HASH is like a CRC ) .

Mod parent down: first paragraph flat wrong. by Anonymous Coward · 2007-12-02 04:09 · Score: 0

the real root of the problem here is that, for the attack to work, someone signed as trusted a binary file that contained malicious code in the first place, even if in a disabled form.

Wrong. True of other breaches, not this one.

Optional Secutity ? tee hee ROF,LMFAO by Grampaw+Willie · 2007-12-02 04:25 · Score: 1

FromMSDN Library:

Use digital signatures WHEN you want to distribute data, and you want to assure recipients that it does indeed come from you. Signing data does not alter it; it simply generates a digital signature string you can bundle with the data.

(emphasis added)

If you want security it has to be in effect 100% of the time. Not just here and there WHEN we have time for it and we don't bypass it to improve performance.

the issue here is not whether MD5 is vulnerable but whether it is being used all the time like it needs to be

anything and everything that is executable needs a signature that can be verified before it is executed and until that standard is made mandatory RATS will continue to have a festival which will only get worse and fast.

NO SIGNATURE? NO EXECUTE.

Cryptography such as these digital signatures is pretty good these days: proper use will render any attack on the cryptography itself a poor choice of options.

But Bruce Schneier notes in is recent book that all too often cryptography is like putting a post in the middle of a field and hoping the attacker runs into it. If there is anyway around the post the attacker will just take the easy way out and never bother the cryptography. He's not playing your game; he's playing a different game and he is governed only by the opportunities left open to him.

Signatures should be required on all eMails as well and any eMail without a signature that you recognize and approve should go into quarantine so you can dispose of it.

Use multiple hash functions... by Roogna · 2007-12-02 04:33 · Score: 0, Redundant

Back in the day I remember always being told that a single hash function was never secure for verifying information... and that for security you should use two -different- algorithms or more. Simply because an attacker can manipulate the data to collide in a single function, it's that much more difficult to manipulate the data to collide in two entirely different hash spaces.

Did this concept change over the years, or is it just me? heh

You could do this... by ultramkancool · 2007-12-02 04:47 · Score: 0

or you could just crack the MD5 check itself, since you are modifying the program anyways, seems kinda pointless.

Bost projects I've seen.. by Junta · 2007-12-02 05:29 · Score: 1

Use public-private key signing rather than hashes (a hash is pretty limited, for *every* file to transfer, there must be a checksum in existence on the client side that got there through a 'secure' means. Signing means they just need to be confident they got your public key once and from then on out, your signatures can be proven/disproven on files without need for further guaranteed secure means.

About the only place I see MD5 sums used much is for large iso files, get the md5 sum from the distribution site, then grab the iso from a mirror and make sure it's ok. For apt and yum, where signatures are checked automatically, it's pretty certain they use public-private key signing.

--
XML is like violence. If it doesn't solve the problem, use more.

Re:Bost projects I've seen.. by gweihir · 2007-12-02 07:02 · Score: 2, Insightful

I agree that today basically the only use of md5's is integrity checks against transmission and storage errors. I sometimes use them on backups.

You are quite right, that md5 does not provide and connection to the signer. With a PGP/GPG signature, once I have the correct public key, I can verify all and every signature made with it. And if I do not have the correct key, the first genuine signature will result in an error. Howeber I guess most people do not bother. Even if it is easy. For a kernel download, e.g., it adds about 5 seconds.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Re:Bost projects I've seen.. by AnyoneEB · 2007-12-02 13:24 · Score: 2, Informative

As I understand it, the normal way to generate a digital signature is to use a hash algorithm like MD5 or SHA1 and then encrypt the hash with a private key. Then you verify by hashing the file and decrypting the signature with the public key and checking to see if they match. Therefore, distributing signatures instead of hashes is orthogonal to the discussion at hand. If the hash is broken, then the signature is broken, too.

See Wikipedia for more information on digital signatures.

--
Centralization breaks the internet.

Re:Voting machines by Dare+nMc · 2007-12-02 05:31 · Score: 1

why not simply replace it with the malicious one so that the malicious one gets signed and distributed instead of the good one

Think voting machines. So far that has been the most requested approach, a verified hash code from open source, that is verified on each machine...

MD5+SHA1? by Midnight+Thunder · 2007-12-02 06:01 · Score: 1

Surely a hybrid MD5+SHA1 signature would prove better? You can find weaknesses in each, but putting them together and the likelihood of the both weaknesses appearing at the same time would be greatly diminished. Other than extra CPU requirements, are there any issues with this approach?

--
Jumpstart the tartan drive.

Re:MD5+SHA1? by kasperd · 2007-12-02 06:30 · Score: 1

Other than extra CPU requirements, are there any issues with this approach?
Depends on how you look at it, and what you are trying to protect against. In most cases combining two hashes in that way will produce something that can be attacked faster than brute force. You'd get better security with a hash function that was designed to be 288 bits in the first place. (Or even 256 bits). In fact even if the two hashes had no weaknesses at all, you could still produce a collision for the concatenation slightly slower than an attack against the strongest of the two.

An attack could work the following way. Produce a collision for the first block of sha1 (2^80 work using birthday attack, I think there might be faster ways). Take the output IV and repeat the attack for the next block. You are going to need about 64 blocks in total. That is 2^86 work. Now run through all combinations of your blocks to find a collision for md5 (that amount of work is nothing compared to the previous work). So this combined hash can be attacked in 2^86 work not 2^144 like you would expect from a 288 bit hash.

--

Do you care about the security of your wireless mouse?
Re:MD5+SHA1? by grumbel · 2007-12-02 07:40 · Score: 1

True, but on the other side relying on a single hash leaves you with a single point of failure, if the hash function is compromised, you don't have anything trusted left to check against. If on the other side you have two hash function you have a good chance that you figure out that one of your hash functions isn't secure any more and thus have enough time to replace it with something new, since there is always that second, still secure, function left.
Re:MD5+SHA1? by kasperd · 2007-12-02 07:55 · Score: 1

If on the other side you have two hash function you have a good chance that you figure out that one of your hash functions isn't secure any more and thus have enough time to replace it with something new
Don't expect a better warning just because you are using two different hash functions. With md5 you did get the warning a few years ago. Still most uses of md5 are secure. If you were to be using two, there is no guarantee that you are going to be told about the first break, if somebody want to attack the combination, they might decide to keep silent until they have a working attack. It all boils down to the intentions of whoever finds the weakness. Besides, if you wanted to pay the price in terms of CPU usage, you could go for a 256 bit hash.

--

Do you care about the security of your wireless mouse?
Re:MD5+SHA1? by Cairnarvon · 2007-12-02 08:00 · Score: 1

Similarly, since ROT13 is such a weak encryption, why not just apply it twenty times? It's bound to be stronger, no?
Okay, stupid example, but you can't assume just piling on additional algorithms is going to make things more secure. While it's probably reasonably safe for MD5+SHA1, you're almost certainly much better off just using, say, SHA-512 instead.

Mod parent down ; missed the point by Anonymous Coward · 2007-12-02 07:19 · Score: 0

The grandparent wasn't trying to say that hashes can't identify applications. Instead it was trying to say that there's no such thing as a good app and a bad app. It's impossible to tell what an app will do in advance since very few apps can be entirely understood without understanding their entire runtime environment which is impossible.

However, the grandparent is still wrong; IMHO. App signing says a) this is the app that I built; you can trust it to behave that way. b) this app was built by XX who's reputation you can check up on. In general, a good application is designed so that it behaves properly independent of the different environmental inputs, within the scope of "normal" computer behavior (radiation attacks, for example, not included). An MD5 sum is not telling you that the formal proof of the application is good (there probably isn't one); instead it is allowing you to predict how the application was designed.

Re:Mod parent down ; missed the point by Henry+V+.009 · 2007-12-02 08:02 · Score: 1

I thought he might possibly have meant that. And yes, it's still wrong.
Re:Mod parent down ; missed the point by Effugas · 2007-12-03 03:12 · Score: 1

No, I'll stand by my original point. Predicting the behavior of a turing-complete environment is quite literally the halting problem. People keep trying to "prove" software, and let me tell you first hand, it keeps failing miserably. I think you're ignoring the reality that most software transforms external data, meaning unpredictable environmental inputs (even mouse clicks!) are first class citizens.

The problem is that MD5 can only hash the bag of bits available at compile time. It misses the accumulated bits at runtime. This isn't an MD5 bug, it's a bug in the concept that you can sign behavior or predict what software will do.

Signing can only establish behavior. I didn't used to believe that -- I was wrong.

Alternative approach using AES-256 by gr8dude · 2007-12-02 07:23 · Score: 1

Although AES-256 is not a hashing algorithm, I've seen it applied in hashing. Since it is a block cipher, when you encrypt a file, at the last iteration you have a chunk of 256 bits, which is used as a digest. If you change anything in the file, the change will propagate to other blocks (if encryption is done in CBC mode), so the last block (i.e. digest) will be different.

--
The saddest poem

One hash to rule them all--wot? that's all? by slashdotard · 2007-12-02 09:45 · Score: 1

There seems to a yet another massive explosion of XOR thought that seems to ignore the possibility of using more than one hash to sign an object, whether it be code, text or other data.

It has occurred to a few people--just a few people--to sign objects with both MD5 and SHA-1.

It seems that it is more difficult to get both MD5 and SHA-1 collisions by quite some orders of magnitude. Someday, perhaps, it can be done but not today. Well, no one has said so, at least.

Anyone for some OR thinking?

--
me. --a by-product of public education

Re:Best projects I've seen.. by gweihir · 2007-12-03 00:56 · Score: 1

Oh, the discussuion basically says that hash-only is broken anyways, even if the hash remains secure. Therefore breaking this specific hash does not matter a lot.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.

Slashdot Mirror

MD5 Proven Ineffective for App Signatures

117 comments