SHA-1 Collisions for Meaningful Messages

SHA-256 for ever by Anonymous Coward · 2006-08-27 01:50 · Score: 0

Or at least for the next 3 years.

yeah... nonsense by macadamia_harold · 2006-08-27 01:52 · Score: 5, Funny

Using the new method, it is possible, for example, to produce two HTML documents with a long nonsense part

To achieve this, the method uses material pulled from myspace.com.

--
Push Button, Receive Bacon

Re:yeah... nonsense by Anonymous Coward · 2006-08-27 02:04 · Score: 2, Funny

To achieve this, the method uses material pulled from myspace.com.

Falling back to sco.com, Microsoft.com, digg and slashdot, in that order.
Re:yeah... nonsense by niceone · 2006-08-27 04:03 · Score: 1

Can I have this algorithm as a friend then? I'm strugling to reach double digits.

--
ccalam - acoustic versions of new songs.

Quite simple to check file size also by JonathanR · 2006-08-27 01:53 · Score: 1

I would have thought this not such a big issue for software developers who aren't incompetent.

Re:Quite simple to check file size also by Anonymous Coward · 2006-08-27 02:08 · Score: 0

If I understand correctly they use some kind of birthday-paradox-approach and creating a file with the same file length shouldn't be a big problem. (Or have the same filename, creation-/modification-date, etc)
Re:Quite simple to check file size also by Apage43 · 2006-08-27 02:12 · Score: 2, Insightful

Where's the "Correct filesize" kept? If it's stored in the document, it's still possible (Though more difficult) to change it and make a collision.
Re:Quite simple to check file size also by Xugumad · 2006-08-27 02:38 · Score: 2, Insightful

I have to say, trusting SHA-1 to do what it says on the tin, is not incompetent. Naive, sure, but not incompetent.
Re:Quite simple to check file size also by Athenais · 2006-08-27 02:45 · Score: 1

Would it be possible to develop a hashing algorithm that also contains information on the file's correct size? Perhaps by using it as a variable in the transformation matrix, so a given block of data would have an entirely different hash depending on what the total file size was.

(Of course collisions would still be possible, just more difficult--but that's true about any hash of any length less than file size, is it not?)

--
Bored With ProgressQuest?
Re:Quite simple to check file size also by The_Wilschon · 2006-08-27 03:19 · Score: 1, Interesting

What if it were tacked onto the end of the hash? So your hash would look like

071883abcdef18234760abdefcd38f387bc93-78b

And 78b would be the size of the file. Not being a cryptologist, not even remotely, I am probably missing something quite simple and obvious here, but at first glance it seems like this would work.

--
SIGSEGV caught, terminating

wait... not that kind of sig.
Re:Quite simple to check file size also by wfberg · 2006-08-27 03:34 · Score: 3, Informative

I'm not quite sure what your comment means.
As the heise.de article points out; the twins are of equal length - the file size would be the same.
Finding hash twins whereby the chosen one is, oh, let's say 160 bits longer is a degree less sophisticated.

--
SCO employee? Check out the bounty
Re:Quite simple to check file size also by wirelessbuzzers · 2006-08-27 03:36 · Score: 2, Informative

I would have thought this not such a big issue for software developers who aren't incompetent.

I don't know the details of this particular attack, but usually attacks on hashes like this produce two documents with the same file size. Certainly the MD5 collisions a couple years ago had the same file size.

--
I hereby place the above post in the public domain.
Re:Quite simple to check file size also by Lockejaw · 2006-08-27 03:39 · Score: 2, Interesting

That would work for typical everyday use (like a checksum next to a link to a downloadable file). Of course, this is assuming that the birthday attack wasn't keeping a uniform file size.

It would also take a bit more (though maybe not much) to apply it to digital signatures.

--
(IANAL)
Re:Quite simple to check file size also by edmudama · 2006-08-27 04:09 · Score: 3, Informative

From the article:
Using the new method, it is possible, for example, to produce two HTML documents with a long nonsense part after the closing </html>tag, which, despite slight differences in the HTML part, thanks to the adapted appendage have the same hash value.
So it appears that both the original and the new messages need that appendage. This isn't just about adding an appendage to a known, appendage-less document.

--
More data, damnit!
Re:Quite simple to check file size also by CastrTroy · 2006-08-27 06:21 · Score: 1

Yes, but the summary states that it is a similar HTML doc with lots of nonsense put at the end so that the hash matches. I very much doubt in this case that the files are the same. Plus, if you know something about the document, lit the size, or the fact that the file should be XHTML 1.0 compliant, without any trailing garbage that doesn't make any sense, then it makes the hash a whole lot harder to crack. The thing is, if you rely 100% on the hash, then you could get screwed, however, if you verify the hash, and then perform some additional check on the file, it's much harder to get screwed.

--

Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
Re:Quite simple to check file size also by Simon+Garlick · 2006-08-27 11:38 · Score: 1

So SHA-1 is broken. In other news, sky found to be blue.

Come on guys, SHA-1 was broken years ago. That people are now devising attacks is hardly newsworthy.

--

-----
PGP Key ID 0xCB8FF658
Re:Quite simple to check file size also by DigitAl56K · 2006-08-27 19:36 · Score: 2, Interesting

That's fine, except where the interpreted data can be created from more than one set of actual data. An example might be when creating a hash of a zip archive. If I take the original zip archive, and recompress it using a more efficient algorithm I can reduce the actual data size. I could even go so far as to manipulate the compression algorithm for every byte encoded until I found an encoding that enabled me to produce a hash collision and still match the original file size.

For other document types, I could, for example, take a marked-up UTF-16 document and change the character encoding to UTF-8, saving roughly half the space for a document whose characters were mostly English glyphs, then execute the same attack. For rich documents, space could be saved by minimally degrading image quality. For binaries, by running packers on them.

Adding the filesize to the hash string doesn't offer as much protection as you might first think when at the end of the day you can still find ways to create a collision.
Re:Quite simple to check file size also by Stellian · 2006-08-27 20:17 · Score: 2, Informative

I have to say, trusting SHA-1 to do what it says on the tin, is not incompetent. Naive, sure, but not incompetent.

Let's not forget this attack is against a reduced 64 round version of SHA-1.
SHA-1 still does what it says on the tin: the best attack known against the 80-round version is 2^63, which is still not practical.
Re:Quite simple to check file size also by LeDopore · 2006-08-28 06:38 · Score: 1

What The_Wilschon means (I think) is that unless the original html document had a long padded sequence outside the tag, the sizes of the real and fake pages wouldn't be the same. However, if the phony page could have a slightly smaller size than the original between the html tags (say, a bank login page with 16 bytes less), then adding the total file size doesn't really help you.

The only thing I think that would really help is sending an html page without any comments and data that's not processed, and declaring that there shouldn't be any hidden data. The problem here is that the random data used to make the hashes collide could be white text against a white background (or other invisible changes), so the bottom line is that you're right, wfberg, there's no way to make sure you're not going to get pwned without a secure hashing algorithm.

--
Expected time to finish is 1 hour and 60 minutes.
Re:Quite simple to check file size also by owlstead · 2006-08-30 03:13 · Score: 1

*Both* of the pages need that "garbage" at the end to make this work, just like the MD5 attack. In that sense, you can still trust MD5 certificates. They have already created different X509 certificates using the MD5 attack that both verify, and have different keys. But these certificates need to be generated by the same person creating the attack. It is still impossible to have a random message, create a MD5 or SHA1 hash and then create a collission.

Newsflash by Anonymous Coward · 2006-08-27 01:57 · Score: 0

I would have thought this not such a big issue for software developers who aren't incompetent.

That tiny minority and their program code remain unaffected.

git by Anonymous Coward · 2006-08-27 02:00 · Score: 0

This is bad news for the git.

Re:git by kasperd · 2006-08-27 09:05 · Score: 3, Interesting

This is bad news for the git.
It is not a major problem. First of all to exploit it, you'd have to generate a collision and have one of the two versions accepted in mainstream. Second you'd have to get the wrong version onto some user's machine before the correct version. Linus explained this in a posting somewhere after the original SHA-1 weakness was published. And though Linus AFAIK does not have any education in cryptography, he has demonstrated, that he clearly knows how to apply cryptographic primitives in a sound way. I have a PhD in cryptography, and I have read about the design of git, and I did not spot any weakness.

For now though from a theoretical viewpoint this is a major weakness, it still requires way too much processing power to be realistic. And the way git is designed, I don't think it is going to be any major problem switching to a new hash once cryptographers starts to agree which one should be considered secure in the future. Once they start using a new hash, you can actually still safely use old repositories based on SHA-1. Because once there is no longer being added new data based on SHA-1, a collision is no longer enough to perform an attack, rather you need a second preimage, something which there has not yet been demonstrated an efficient way to produce.

--

Do you care about the security of your wireless mouse?

Yeah, well.... by Anonymous Coward · 2006-08-27 02:04 · Score: 0

a4055725b7794277bd9da3a7904c011cb24edbec

Not like if it was AES by bruno.fatia · 2006-08-27 02:11 · Score: 2

It's a hash algorithm... no big deal ? Just like it was proven possible to break MD5 for binary executables and others them (both) won't be dropped for, say, storing passwords on a database.

Re:Not like if it was AES by rg3 · 2006-08-27 02:20 · Score: 3, Informative

I'm not an expert so don't take me seriously about it, but: I think hash algorithms are very important when signing things. For example, SHA-1 is the default hash algorithm used in GPG to sign messages. When the first attacks were mentioned I changed that to use RIPEMD-160. If you download something that has a SHA-1 sum to check both correctness and autenticity, it's a problem. Someone could modify the original tarball, for example, introducing a trojan horse, and append some other not useful data to it so the sum matches. I don't know if that's technically feasible as I say, but I imagine the possibilities are not so far. And, furthermore, for me this is another important warning that we should move out of SHA-1 ASAP. BTW, if I recall correctly BitTorrent uses SHA-1 to verify the 256KB chunks. There, having a fixed-size chunk is an advantage for this case, but, as I said, I wouldn't trust SHA-1 much longer. A further step and people could build evil BitTorrent clients that, at least, could corrupt your downloads introducing noise chunks.
Re:Not like if it was AES by bruno.fatia · 2006-08-27 02:54 · Score: 1

I agree they could be useless in some time but for now I'd pretty much trust MD5 and SHA-1 for login hashes. SHA-256 could be an alternative to SHA-1 in that case.
Re:Not like if it was AES by petermgreen · 2006-08-27 03:22 · Score: 1

right now the same person has to create the good and bad original files then get people to sign the good one before he distributes the bad one.

and upon close inspection the good one would look very suspicious (lots of random garbage)

so its a break of sorts but nowhere near as bad as a preimage attack would be.

--
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register
Re:Not like if it was AES by n0-0p · 2006-08-27 03:23 · Score: 1

The SHA and MD5 attacks are bait-and-switch, so the attacker must control both messages. The current research doesn't imply any weaknesses for HMAC applications, such as password storage. So yes, SHA and MD5 are probably still quite acceptable for these purposes if they're already in place.
Re:Not like if it was AES by shokk · 2006-08-27 03:28 · Score: 3, Insightful

The problem is that your old keys and the messages they encrypted are available for cracking now and forever. Most people only encrypt important messages, which are easy to look for in a mailbox, and at a later time could be easy to crack. There's probably even a good change the data in that mail could still be important.

Now, if all emails were encrypted, it would be harder to immediately see what messages in a mailbox deserve your attention. But then at a later date CPU speed may make that a negligible difference.

--
"Beware of he who would deny you access to information, for in his heart, he dreams himself your master."
Re:Not like if it was AES by suv4x4 · 2006-08-27 07:29 · Score: 2, Interesting

Someone could modify the original tarball, for example, introducing a trojan horse, and append some other not useful data to it so the sum matches.

That's the neture of hash keys anyway. The security weakness is not into collisions possibly existing, but how fast and feasible it is to find them.

It's a simple logical rule: if you have a 256-bit key, this is 2^256 possible hash combinations. If you put in a folder all possible 257 byte long text files, then each file will have a key that matches a the key of another file in that folder.

Make it a 258 byte file, and you have 4 possible 258-byte long files with the same hash. Now make it a 100kbyte file...

Again, it's not about the fact collisions exist, it's apparent they do, it's whether you can abuse this fact or not. MD5 has been compromised for certain applications, for example, but doesn't make it useless.
Re:Not like if it was AES by RhettLivingston · 2006-08-28 01:48 · Score: 1

Back in the days where a simple CRC32 was used to verify file integrity, I designed a deployment system for a test station that eliminated the need to include a file listing the CRC32 values of all of the other files on the test stations disk. All files on the disk were executables, so I simply wrote a program that appended a record to the end of all files causing all of the files to have a 0 CRC32 value. I then added a CRC32 calculation to the OS loader and failed any load that didn't come up with 0. With any methodology that uses a short enough hash, the design of a program to do this should be trivial.

This is a big deal by gweihir · 2006-08-27 02:12 · Score: 4, Insightful

One thing is that cryptographic hash functions should be easier to make secure than ciphers. At leaste that is what many cryptogtaphers thought. The other is that up to now you could rely on SHA-1 to be collision resistant, no matter what. The argument that you have a large part of the message being "garbage" does not give any real security. Many, many applications can still be attacked, and they need not even be broken for that.

While expected since last year, selecting and using crypto-hashes just got a lot more difficult and error prone.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.

Re:This is a big deal by Anonymous Coward · 2006-08-27 02:25 · Score: 1, Insightful

One thing is that cryptographic hash functions should be easier to make secure than ciphers.

How so? By default, a hash is breakable since its reducing the infinite input into a finite output. Ciphers do not.
Re:This is a big deal by kassemi · 2006-08-27 02:31 · Score: 2, Insightful

Whirlpool is a good choice these days. It's longer than most of the hashes out there, but I don't believe there have been any attacks yet demonstrated against it.

For those pythoners out there I wrote a quick wrapper for it that should get you started. Excuse any site errors and just hit refresh

--
What the hell's a "gewie?"
Re:This is a big deal by FooBarWidget · 2006-08-27 02:44 · Score: 3, Informative
For anyone wanting to use Whirlpool in their apps, here are libraries that you can use:
- Whirlpool library for Ruby. This is written by me, based on the sample C implementation by the inventors.
- The above library can also be used in C apps. Just copy whirlpool-*.[ch] to your project. See whirlpool-algorithm.h for API.
- The GNU Crypto library for Java contains a Whirlpool implementation.
Re:This is a big deal by Ckwop · 2006-08-27 03:02 · Score: 4, Interesting

Whirlpool is a good choice these days. It's longer than most of the hashes out there, but I don't believe there have been any attacks yet demonstrated against it. For those pythoners out there I wrote a quick wrapper for it that should get you started. Excuse any site errors and just hit refresh

Seconded. Whirpool uses similiar mathematics to AES so an attack that breaks Whirpool is likely (although not certain by any stretch of the imagination) to also break AES.

I think much like it is harder to design a cipher that resists attack when you use an LFSR as your base primitive it is hard to design a hash that is secure that uses an Unbalanced Fiestel Network (UFN).

This is why I do not advocate moving to the higher SHAs. I believe that some weakness will be discovered and it will be found the UFN made it worse.

If you're going to use AES, you've already thrown all your eggs in the Wide-trail design basket. If you're going to do that for the cipher, you might aswell do the same for the hash too.

In fact, in most cases you will use the hash has part of an authentication primitive anyway. In this case, there's a good argument for dumping a new hash and using an encrypt-authenticate mode of operation instead of something like HMAC. That way, you reduce the number of assumptions which have to be true for the system to be secure, which can only be a good thing.

In short, if you need to authenticate use your favourite encrypt-authenticate mode. If you need a hash for some other purpose, use Whirlpool.

Simon
Re:This is a big deal by Anonymous Coward · 2006-08-27 03:09 · Score: 0

Also in PHP > 5.2
Re:This is a big deal by bcrowell · 2006-08-27 05:23 · Score: 1

About a year ago, I switched my open-source grade-recording app for teachers from using SHA1 for digital watermarking to using Whirlpool. The good news is that my users should gradually be upgrading, and new users will already be using the new, more secure version. The bad news is that it now depends on the Perl Digest::Whirlpool library, which is on CPAN, but hasn't been packaged by any distros that I know of. Because of that, it's a lot harder for the average user to install than it used to be.

--
Find free books.
Re:This is a big deal by jd · 2006-08-27 06:35 · Score: 3, Interesting

Whirlpool is also in mhash, which is now many versions on from the ones supplied by Fedora Core and Gentoo. Oh well. It's also in the Linux kernel's crypto library. Whirlpool is a damn good hash and uses the same principles as the Rijndael cipher, which means that the underlying concepts have been deeply analyzed twice - once in each form - showing the basic ideas are fairly solid. Being long should reduce the risk of collisions so is actually a strength in many cases - particularly as we're talking bytes, not megabytes.

Tiger is another good hash function - faster than Whirlpool, smaller for those embedded cases where even the bytes matter, and I believe it is not known to have any attacks against it. Tiger also appears in mhash, not certain if it's in the kernel but it should be.

I don't see that it is really of any consequence whether anyone has actually demonstrated an attack on SHA - the point of security is to not wait until AFTER the house has been plundered to upgrade. SHA is FIPS-180, but if there is even a theory on how it might become broken, I would urge people to use something stronger. Security that is only certain to be good against skript kiddies is really not very useful security.

--
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Re:This is a big deal by FooBarWidget · 2006-08-27 07:18 · Score: 1

What I did is writing my own Whirlpool Perl module, which is included in the main program. The XS file is here. It depends on whirlpool-*.[ch] as available in my Ruby Whirlpool library.
Re:This is a big deal by bcrowell · 2006-08-27 07:26 · Score: 1

Interesting, thanks for the link. AFAIK, there's nothing wrong with the one on CPAN, though. It's just that you can't do an "apt-get install perl-whirlpool," or the equivalent on other distros.

--
Find free books.
Re:This is a big deal by Kjella · 2006-08-27 11:21 · Score: 1

In fact, in most cases you will use the hash has part of an authentication primitive anyway. In this case, there's a good argument for dumping a new hash and using an encrypt-authenticate mode of operation instead of something like HMAC. That way, you reduce the number of assumptions which have to be true for the system to be secure, which can only be a good thing.

Alright, so there's this SSH packet coming down the line. For those of you not in the know, it loos like this:
[encrypted block][hmac of decrypted block]. The only reason the HMAC should be a strong hash is so you can't infer from the HMAC what the contents are. Got another block matching that HMAC? Doesn't matter. You'd need to know the encryption keys to make the encrypted packet too. Breaking SHA1 would only really be a threat to signing operations - HMACs work fine just the way they are.

--
Live today, because you never know what tomorrow brings
Re:This is a big deal by PhotoGuy · 2006-08-27 23:08 · Score: 1

The other is that up to now you could rely on SHA-1 to be collision resistant, no matter what.
This statement makes little sense to me. "Collision resistant"? It's still quite collision resistant. But by the very definition of such a hash (mapping more than n messages, into n hashes), means a hash could never be collision-*proof*. Since any hash is not going to be collision proof, you can't truly "rely" upon it in the absolute sense.
Perhaps a more precise definition is that there is now a known way to craft collisions intentionally. I think this will be a potential problem for any hash solution for the forseeable future.

--
Love many, trust a few, do harm to none.
Re:This is a big deal by gweihir · 2006-08-28 02:25 · Score: 1

How so? By default, a hash is breakable since its reducing the infinite input into a finite output. Ciphers do not.

That is nonsese. Breaking a Hash means to find collisions with reasonable effort. Breaking a cipher means to reverse it with reasonable effort.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Re:This is a big deal by gweihir · 2006-08-28 02:29 · Score: 1

But by the very definition of such a hash (mapping more than n messages, into n hashes), means a hash could never be collision-*proof*.

See? You answered your question yourself. Except that in crypto terms SHA-1 is now not collision restistant anymore.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Re:This is a big deal by Anonymous Coward · 2006-08-28 04:09 · Score: 0

The other is that up to now you could rely on SHA-1 to be collision resistant, no matter what.

It still is collision resistant. Much like a watch being water resistant to 100m doesn't guarantee that it won't have water damage if you submerge the watch in less than 100m of water, SHA1 collisions may occur, but it won't be a common occurance.
Re:This is a big deal by Anonymous Coward · 2006-08-28 15:09 · Score: 0

" The other is that up to now you could rely on SHA-1 to be collision resistant, no matter what. "

No you couldn't. SHA-1 reduces any message whatsoever to a 160-bit hash value. There are more than 2^160 possible messages of any length whatsoever. Therefore many messages map to the same hash value. No cryptographer has ever pretended otherwise.

Original by JonathanR · 2006-08-27 02:15 · Score: 1

So the original document that is trying to be forged/modified has to convienently have a "long nonsense part" after the tag. How real-world relevant is that?

Re:Original by Apage43 · 2006-08-27 02:22 · Score: 1

Because very often, anything after the end of the content, may it be an HTML document or image of some sort, isn't displayed to the user, so it's possible to forge these. It's not undetectable by any means, any one who suspects the document of being forged will be able to prove it, but as it's still possible not to notice it's a forgery (To the typical user it WILL look fine and WILL have a valid hash), it's still cause for concern.
Re:Original by Anonymous Coward · 2006-08-27 02:24 · Score: 0

Imagine a PGP-signed message in HTML-format. Someone can easily forge a message to pass the PGP-test, but include HTML nonsense between comment tags.
Re:Original by untouchableForce · 2006-08-27 02:25 · Score: 3, Informative

It's quite relevant for those using it as a way to verify executables are the way developer had intended. Like the attack last year they're saying it would be possible for someone craft an exe without a virus, generated a checksum for it, get it linked to from major websites (after passing a virus scan of course), and then drop a virus in the end of the file and not have the checksum change. That's the real-world relevance.

--
Moderation is not supposed to be used as an indicator of agreement.
Re:Original by edmudama · 2006-08-27 04:12 · Score: 1

please define "easily" ... did you read the article?

--
More data, damnit!
Re:Original by Anonymous Coward · 2006-08-27 05:54 · Score: 0

Your computer probably stores your passwords as a hash. When a password is entered its hash is computed and that hash is compared to the one on file. If they match then access is granted. The password isn't viewable on your hard-drive but the hash is.
Re:Original by JonathanR · 2006-08-27 12:50 · Score: 1

How easy is it to incorporate a lot of garbage content into an executable?
Re:Original by matfud · 2006-08-28 06:04 · Score: 1

very easy. Executables can contain arbitrary data. You just have to make sure the execution path does not take you into it. static data such as icons are often stored in executables.

Re:This is NOT a big deal by packetmon · 2006-08-27 02:26 · Score: 4, Informative

Even if their test hardware could be accelerated from 33 MHz to 4 GHz, the process would still take 170,000 years. And even if a giant cluster of such machines were used, no collisions would be found within a realistic timeframe of a few years.

The second reason to keep cool is just as important, if not even more so: hackers will have to execute a pre-image attack to manipulate, for instance, a contract that has been digitally signed. In other words, hackers will have to find a second, manipulated contract with the same hash value as the real contract. In principle, the number of operations needed is thus far greater (2160). Indeed, as far as we know all attacks to date have only concerned collisions, and Wang et al. does not change that. There are no known methods to reduce significantly the number of operations needed for pre-image attacks.

Don't you think you're flying off the meter here a bit... Just because a collision was found means truly little. So a garbage laced HTML page was created after the actual HTML closing tag... 1) No one will see what comes after that unless you like viewing the source of a webpage as opposed to an actual page. 2) You should read up on birthday paradoxes. If someone created two similar messages, it would take years for them to figure out how to compute a hash to match. Now in the field of sending out something so so so secure, what makes you think that even if a someone did re-computate a hash to match, that message would be worth anything years down the line. Someone would have to be able to accomplish a collision, re-computate the hash in their new message and send it all within minutes for it to truly be a threat.

Let's look at this scenario... A massive kernel update is made to say Linux... The information is hashed, posted, and everyone is now going to update their Linux boxes... Unless someone is so quick fast to intercept along this path, most are safe unless they choose to verify the hash years down the line (which by then would be worthless). So unless someone can exploit this within minutes (no more than I would guesstimate 36 hours), I see little reason to get all bent out of shape over this...

--
Infiltrated dot Net

Know-How Article by Anonymous Coward · 2006-08-27 02:32 · Score: 0

For all those failing "to see a problem" or guessing about the implications: this actually is quite severe, and it doesn't just concern "broken" applications.

Heise has some fine backround in their Know-How article about cracked hashes.

Re:This is NOT a big deal by hankwang · 2006-08-27 02:42 · Score: 4, Informative

1) No one will see what comes after that unless you like viewing the source of a webpage as opposed to an actual page.

Common web browsers (I just tried Opera, FF, and Lynx) will happily display everything after the closing tag. You would have to put it inside  comment delimiters, but then it doesn't matter whether it is before or after the closing tag. Unless the attack doesn't work if the --> has to come at the end, but then you can just omit the closing tag. Only an XHTML-compliant browser would complain. From cursory scanning TFA it is not clear to me what the reason is for mentioning the closing </html.

--
Avantslash: low-bandwidth mobile slashdot.

How about this combination: by Name+Anonymous · 2006-08-27 02:46 · Score: 4, Interesting

Provide the following 3 pieces of data:

1) Message/file length
2) SHA1 hash
3) MD5 checksum or some other hash/checksum that's calculated way differently from SHA1.

Providing the length means that the person trying to change the data needs to keep it the same length which makes it more difficult.

Using 2 different hashing/checksumming methods means they have to be able to match both of them in order to be able to switch the data.

The more restrictrictsion we toss on the data, the harder it is to manipulate. I do think that using more than 2 or 3 hashing/checksumming methods would be overkill however.

Re:How about this combination: by Anonymous Coward · 2006-08-27 04:18 · Score: 0

wrong, the more restrictions you place upon the data, the more precise the information you're providing about that data!
Re:How about this combination: by FLEB · 2006-08-27 05:18 · Score: 2, Insightful

In cases of verification (rather than security) isn't more specificity better? I'd agree that double-hashing something like a secret password causes a loss of security, but if you're double-hashing a file to verify its contents, more specificity means it's harder to get a match by garbage-packing.

I really am asking-- I'm not all that up on the guts-and-wherefores of encryption/hashing, and I've wondered about this question as well.

--
Information wants to be free.
Entertainment wants to be paid.
You just want to be cheap.
Re:How about this combination: by Name+Anonymous · 2006-08-27 05:34 · Score: 1

wrong, the more restrictions you place upon the data, the more precise the information you're providing about that data!
Actually when you going for data verification of non-secret data, more precise information is better.
Re:How about this combination: by Xenna · 2006-08-27 17:37 · Score: 1

That's exactly the method I use in my upload confirmation messages:

Channel: 1691 (TEST)
Upload object: 1820 (Test AH)
Time of upload: 24-05-2006 14:12:41
Size of file: 65298 bytes
Uploader: doe (John Doe)
File received as: report.doc
File stored as: doe.doc
File SHA1 digest: e9dd2b06972aac8ef63a5b33e75775ad88d84556
File MD5 digest: df914e41a7d85d4bfd897d368a052f8b
Re:How about this combination: by Lord+Ender · 2006-08-28 02:18 · Score: 1

No. You're wrong.

The only reasonable use for stuff like this is the birthday attack. In the birthday attack, the attacker generates both documents with the hash collisions. They both have a big block of invisible junk in them. That leaves plenty to play with.

Your suggestion does nothing against the birthday attack.

--
A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.

Why not include message size? by Pedrito · 2006-08-27 03:10 · Score: 0, Redundant

It seems to me that all the attacks on both of these hash systems (SHA-1 and MD5) involve different message sizes which is easily fixed by using both hash and message size as a verification. I honestly don't come close to understanding the math involved in these hashes, but I get the idea that the complexity involved in creating a hash using the exact same amount of data is far higher.

Maybe factoring in message size as part of the hash is the solution? I don't know. Again, the math is way above me. But you could certainly do it by adding on some extra bits to the hash itself. A bit costly in size, but seems like maybe it might be worthwhile.

Re:Why not include message size? by TraxPlayer · 2006-08-27 03:23 · Score: 1

No, the MD5 attack produce the files with the same lenght.

--
If the code and the comments disagree, then both are probably wrong. - Schryer
Re:Why not include message size? by Apage43 · 2006-08-27 03:27 · Score: 1

And what's to stop a would-be attacker from changing the included message size?
Re:Why not include message size? by Anonymous Coward · 2006-08-27 03:32 · Score: 0

One possible way is to hash the message appended to the size - while this is not totally secure it does have the advantage of specifying more bits which can be verified by other means thus making the task far harder - especially as they would be at the end of the message.
Re:Why not include message size? by Cynic · 2006-08-27 03:35 · Score: 1

Both SHA-1 and MD5 specify including the length of the message as part of padding to the next 512-bit block.
Re:Why not include message size? by wirelessbuzzers · 2006-08-27 03:38 · Score: 1

t seems to me that all the attacks on both of these hash systems (SHA-1 and MD5) involve different message sizes which is easily fixed by using both hash and message size as a verification. I honestly don't come close to understanding the math involved in these hashes, but I get the idea that the complexity involved in creating a hash using the exact same amount of data is far higher.

No, usually the attacks produce 2 files with the same size.

Maybe factoring in message size as part of the hash is the solution? I don't know. Again, the math is way above me. But you could certainly do it by adding on some extra bits to the hash itself. A bit costly in size, but seems like maybe it might be worthwhile.

Yeah, SHA-256 adds some bits onto the hash itself, as do longer versions like SHA-384 and SHA-512. But it's unclear how much that helps; sometimes cryptographic attacks don't get much harder if you increase the size of the data.

--
I hereby place the above post in the public domain.
Re:Why not include message size? by Anonymous Coward · 2006-08-27 07:14 · Score: 1, Informative

Not so: only SHA1 does; MD5 does not. This means that, given two files with the same MD5 has, any fixed file concatenated after both will again produce the same hash. This is how the (MD5-)colliding pair of postscript documents was generated.

to find one... how big will the html get by kemo_by_the_kilo · 2006-08-27 03:23 · Score: 1

I mean this find is awesome... but to get a matching SHA-1, how big does the html get with the jibberish. I mean it just has to create the same hash, but i dont think it would be a couple of lines.... then again the first post about myspace content is prolly on to something, i guess there is a little truth in all jokes.

Re:to find one... how big will the html get by sowth · 2006-08-27 05:42 · Score: 1

IANACE (I am not a crypto expert), but...
Well, since you can easily squeeze about 6 bits for readable gibberish (meaning up/lowercase letters numbers and a few punctuation marks), I calculate you should only need 27 characters (160 bits/6). However that is only an estimate. You may need more because the inbetween bits may affect the result. If you were willing to go with unreadable characters, then 20 bytes should be enough.
Either way, I don't see why you'd need more than one line, so this attack may not be as noticable as one may think. I suppose if you were trying to use a set of real words forming realistic sentances, you'd need a much larger set, however I imagine you'd probably have a greater chance of it not being noticed...

Already expected by matrixhax0r · 2006-08-27 03:48 · Score: 1

Well, a variation of SHA-1, now called SHA-0 was broken similarily more that a year ago (before MD5 was broken). It was just a matter of time they figured out how to break full SHA-1. In the mean time, people who knew about this switched to SHA-256 which is still known to be good.

--
If it's no on fire, it's a hardware problem.

NO SHA-1 COLLISIONS HAVE EVER BEEN FOUND by SiliconEntity · 2006-08-27 03:51 · Score: 5, Informative

NO SHA-1 COLLISIONS HAVE EVER BEEN FOUND!

Ahem.

Sorry, my caps lock key got stuck there.

No SHA-1 collisions have ever been found. The first person or group to find one will achieve considerable fame. I say this as an attendee of both last week's Crypto conference and the immediately following hash function workshop.

The work factor estimated for a SHA-1 collision is something over 2^60 cycles. That would put it on par with the biggest calculations that have ever been done (publicly anyway). So far nobody has put together a sufficient effort to achieve a collision.

At the hash function workshop, cryptographer Antoine Joux published a set of recommendations for how such a hash collision effort should be mounted, in order to minimize the damaged from a published collision. The main goal is to make it difficult to take a published collision and use it to create harmful effects in various ways. Hopefully Joux's guidelines will be followed if and when a SHA-1 collision finding project gets started.

multiple hashes MD5 and SHA-1 by KalvinB · 2006-08-27 04:10 · Score: 2, Interesting

If the MD5 of the two different strings that had the same SHA-1 value are different then there's no real reason to panic. For an added level of security you could also calculate the byte length of the data.

Software will just need to be updated to calculate two hashes. Good luck finding two sets of data that are different yet have the same length, the same SHA-1 hash and the same MD5 hash.

--
Work Safe Porn

Re:multiple hashes MD5 and SHA-1 by Anonymous Coward · 2006-08-27 04:46 · Score: 1, Insightful

Even if you find that set, you don't overcame the most hard thing to do : have it real malicious code. Because, well, that's a hell to do a SHA-1 collision, and it's even more of a hell to do it with a code purposed to do anything usable and not just random crap.
Re:multiple hashes MD5 and SHA-1 by Lord+Ender · 2006-08-28 02:53 · Score: 1

Good luck finding two sets of data that are different yet have the same length, the same SHA-1 hash and the same MD5 hash.

You are a crypto n00b! Good luck? It's called a birthday attack. And calculating 2 hashes on the same data is not fundamentally different from just having a longer hash.

All you are proposing is, instead of using SHA1(data), use NewHash(data) where
NewHash(data)=SHA1(data),MD5(data)

That just a longer hash, and just as attackable by the birthday attack.

--
A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.

Add size of file by yaminb · 2006-08-27 04:41 · Score: 1, Redundant

I can never quite figure out why the MD5 and SHA and all these other algorithms don't include the original message size as part of their hash. This would eliminate all attack vectors that stem from adding or deleting information from a file. But I guess that's too simple a solution for these mathematicians.

Re:Add size of file by evanbd · 2006-08-27 05:38 · Score: 2, Informative

And actually reading the hash function descriptions is too complicated a solution for you? Oh, I forgot, you want your karma and most mods will assume you know what you're talking about because they don't read the specs either.
MD5, SHA, and every other hash function I've ever read the spec for appends some zeros followed by the original message size (the zeros are so it comes to an integer number of blocks) as the first thing it does. For exactly this reason.
At a guess, this attack requires that the two files be the same size. (But I haven't actually read TFA.) And attacks that delete or add data but can't manage to correct the file size are the minority anyway.
Re:Add size of file by Goaway · 2006-08-27 05:42 · Score: 1

You can never quite figure it out because you are completely clueless about crypto. Hope that helps.

PS: The attacks on both algorithms create files of equal size.
Re:Add size of file by the_greywolf · 2006-08-27 05:48 · Score: 1

As I understand it, the RIPEMD family pad only the final block with zeroes to end the message on a block boundary, and this is where the "extra bytes" are added. I agree with the GP on this, since it's much harder to collide a hash if that hash includes the raw message length as a component - then the final padding becomes a known quantity. (For example, such an algorithm, if comparing a message to a hash, could pre-pad the final block with a known quantity of zeroes before checking the real message length. The result would be that the size is verified before hashing is attempted. It would certainly decrease the chance of collision in that specific case.(

--
grey wolf
LET FORTRAN DIE!
Re:Add size of file by evanbd · 2006-08-27 06:29 · Score: 2, Informative

I'm pretty sure you're confused here. First, note that the padding is a 1 followed by some number of zeros, so no matter what the message actually ends with it is clear what is padding and what is message. That is, in all circumstances adding zeros to the end of the message changes what is being hashed, even when you ignore the file size part. And secondly, the message size as appended onto the message is the *original* message size, *before* any padding occurred. So appending zeros not only changes what the padded file looks like, it also changes the file size string that gets appended after the padding.
I really do think the mathematicians are doing exactly what you guys think they should be.
Re:Add size of file by yaminb · 2006-08-27 06:30 · Score: 1

Yes, they append the file length to the original message. Not to the message digest. I know you're an expert in cryptography, but my simple mind sees it this way. MD5. 1. Takes original message 2. Appended Message = original message plus some added bits (includes the file size) 3. Message Digest = Md5 algorithm run on the Appended Message Would it not be simpler to just include the file size as part of the message digest itself. 1. The original message 2. Message Digest = md5 run on the original message 3. Final message digest = Message digest with original message file size appended to it This way, the original message size is always preserved and is evidently clear. Anyone, even a simpleton like me could then be assured that attacks based on adding/deleting from a file would result in a different md5. That's all I'm saying from my laymen's reading of the MD5 specification.
Re:Add size of file by yaminb · 2006-08-27 06:34 · Score: 1

Yes, they append the file length to the ORIGINAL message. Not to the message digest.
I know you're an expert in cryptography, but my simple mind sees it this way. MD5.

1. Takes original message
2. Appended Message = original message plus some added bits (includes the file size)
3. Message Digest = Md5 algorithm run on the Appended Message

Would it not be simpler to just include the file size as part of the message digest itself.

1. The original message
2. Message Digest = md5 run on the original message
3. Final message digest = Message digest with original message file size appended to it

This way, the original message size is always preserved and is evidently clear. Anyone, even a simpleton like me could then be assured that attacks based on adding/deleting from a file would result in a different md5. That's all I'm saying from my laymen's reading of the MD5 specification.
Re:Add size of file by Anonymous Coward · 2006-08-27 07:33 · Score: 0

SHA1 already does, as part of the padding. MD5 does not, and that's part of the reason that a colliding pair of postscript files could be found so easily.
Re:Add size of file by mrogers · 2006-08-27 09:21 · Score: 1

Hash functions aren't only used for calculating the checksums of files. For example, what if you want to calculate the rolling hash of an incoming stream of data? In the cases where you know the length of the data I agree it's a good idea to include it, but you don't need to change the hash function: just calculate H(length||data) instead of H(data).
Re:Add size of file by Dasein · 2006-08-28 05:45 · Score: 1

If the message size is not digested then it can be changed without detection.

--
You are not a beautiful or unique snowflake -- but you could be if you got off your ass.

OK, what do we use now? by Walter+Carver · 2006-08-27 04:42 · Score: 1

SHA-1 and MD5 are becomming non-safe to use. So, we go to SHA-256 or a completely different algorithm?

Re:OK, what do we use now? by Anonymous Coward · 2006-08-27 05:54 · Score: 0

That would depend on the application. For a webapp I use an md5 sub-string based on timestamp and random int as the password salt for a pass length between 8-24 chars, storing the result in a sha1 hash. Data theft is probable, collisions unlikely.

For PGP or checksumming, use sha-256, whirlpool or whatever has the coolest sounding name (joke).
Re:OK, what do we use now? by kayditty · 2006-08-27 10:43 · Score: 0

It does, indeed, depend upon the application. However, for password hashing, I would recommend bcrypt. OpenBSD implements this in its passwording scheme, and, on the Linux front, there's Openwall GNU/*/Linux. Solar Designer also has what might be needed for application implementation here: http://www.openwall.com/crypt/

Insecure by zlogic · 2006-08-27 04:43 · Score: 5, Informative

SHA-1 was proved to have insecurities years ago. Because of that SHA-2 ("SHA-256", "SHA-384", and "SHA-512") was released back in 2001 as a better version of SHA-1. SHA-2 was tested and no insecurities were found (yet). What's more, SHA-2 is now the official US standard.
Complaining that SHA-1 is insecure is like complaining that Windows 98 is insecure.
Oblig Wikipedia link: http://en.wikipedia.org/wiki/SHA_hash_functions

Re:Insecure by doublebackslash · 2006-08-27 06:28 · Score: 2, Informative

I just want to put a plug out for my good friend whirlpool
Slected as the replacement for SHA-1 years ago by the European NSA, 512 bits long, based on AES.
Who loves ya?

--
md5sum /boot/vmlinuz
d41d8cd98f00b204e9800998ecf8427e /boot/vmlinuz
Re:Insecure by Anonymous Coward · 2006-08-27 09:28 · Score: 0

European NSA? No such agency.
Re:Insecure by doublebackslash · 2006-08-27 09:38 · Score: 1

I was implying that European analogue for the NSA, though I suppose I should have been more specific.

--
md5sum /boot/vmlinuz
d41d8cd98f00b204e9800998ecf8427e /boot/vmlinuz
Re:Insecure by Bronster · 2006-08-27 12:01 · Score: 1

md5sum /boot/vmlinuz
d41d8cd98f00b204e9800998ecf8427e /boot/vmlinuz

Haha, I have cracked your worthless hash and I can tell you that the sha1sum of your very short kernel also begins with a 'd', see:

sha1sum /boot/vmlinuz
da39a3ee5e6b4b0d3255bfef95601890afd80709 /boot/vmlinuz
Re:Insecure by adamofgreyskull · 2006-08-27 21:17 · Score: 1

They never did have a character called Encrypted-ness did they?

Why not more rounds? (was Re:How about this c...) by sowth · 2006-08-27 05:17 · Score: 1

Why not just create an algorithm with more rounds? I'm sure it would have more benefits than just slapping two algorithms together. In fact MD5 and SHA came from the same algorithm anyway (MD4), so I don't see much of a benefit... You may even need less processing time to calculate the hash than just doing both.

Re:Why not more rounds? (was Re:How about this c.. by Name+Anonymous · 2006-08-27 05:37 · Score: 1

Why not just create an algorithm with more rounds? I'm sure it would have more benefits than just slapping two algorithms together.

Thi smight be good is the number that was output was two seperate numbers concatenated together. Otherwise it's still just one number they have to match when they are modifying the hash protected data.

Size does matter by greazer · 2006-08-27 06:09 · Score: 0, Redundant

It sounds like this scheme would alter the size of the original text. Don't most schemes compare the hash and the size of the text?

Re:How about this combination: - Not so good by Anonymous Coward · 2006-08-27 06:10 · Score: 1, Informative

This is broken in O(K 2^(n/2)) time, see Joux's paper "Multicollisions in iterated hash functions".

Horray for validation by QuantumFTL · 2006-08-27 06:20 · Score: 1

It looks like validation of document structure significantly complicates the problem of creating a hash collision (the technique mentioned relied on padding the document with junk at the end, which, IIRC, is not in line with modern XHTML specs). Of course it's best to use a secure hash to begin with, but this does add yet another layer of resistence.

Re:This is NOT a big deal by SanityInAnarchy · 2006-08-27 09:12 · Score: 2, Informative

Not true -- XHTML-compliant browsers will only treat it as XHTML if it's sent with a "Content-Type: application/xhtml+xml" header. This is very nice for keeping your page clean -- view it in Firefox with that header, and it'll parse it as XML, and complain whenever you have a problem with your XML, saving you a trip to the even more pedantic W3C validator. Unfortunately, sending that header will have browsers that don't understand XHTML attempt to download the page, rather than displaying it as HTML. Even worse, browsers aren't consistent in the "Accept:" header they send, which is supposed to help with this -- Firefox prefers XHTML and says so, for instance, but while Safari is perfectly capable of receiving XHTML, it just says "Accept: */*", which says it doesn't prefer any type of document -- I may as well send it a PDF (which it would try to download). Even browsers which indicate a preference have to have "*/*" somewhere in there if they want to be able to download files.

So, there's really no good, standard way of detecting whether to send XHTML as XHTML or HTML. I've tried implementing something on my website, but I doubt that it will really work properly until I break down and attempt to detect specific versions of browsers.

--
Don't thank God, thank a doctor!

A little information theory by bwcbwc · 2006-08-27 10:29 · Score: 3, Informative

Actually, hashes are difficult to secure for general communications purposes without putting a cap on the size of the transmission. In information-content terms, a collision proof hash is equivalent to a lossless compression algorithm.

A hash will either contain all of the non-redundant information in the original content, or some of the information gets lost during the hash. Non-redundant information being defined in an information-theory sense that a given bit is completely random/unpredictable based on the content of preceding bits.

In order for a hash to be completely collision proof, it has to contain all of the non-redundant information contained in the original file. Otherwise information in the orignal message is lost in the hash. And if information is lost from the original message, that creates a possibility of constructing a message that differs only in the information that is removed by the hash. Only if the original message is reconstructible from the hash (plus possible information contained in the hash algorithm itself) will it be collision-proof. You've either got the information-content, or you don't. And if you don't have the content, you can't validate it.

--
We are the 198 proof..

Re:A little information theory by LeDopore · 2006-08-28 06:58 · Score: 1

You said "In order for a hash to be completely collision proof, it has to contain all of the non-redundant information contained in the original file." I don't think that's accurate, since a 256 bit hash of a completely random 512 bit message is 256 bits long, so it cannot contain the 512 bits of information in the original message. I believe you mean that for an N-bit hash function to be completely collision-proof, it must share at least N bits of mutual information with the message. If every bit of the hash is sensitive to every bit of the message in a unique way, then as long as the message has at least a little more information than N bits the hash will be collision-free.

--
Expected time to finish is 1 hour and 60 minutes.

Nothing to see here.. by xquark · 2006-08-27 10:32 · Score: 1

When messages are signed they are already compressed and the
file size is used as part of the final message digest.

If your application doesn't do this then you should use another
applications.

Using the above mentioned processes none of these attacks are
viable, both in the short and long terms. (supposedly)

When receiving the following message:

Jon D0e owe$ Jane Do3 $1000`000 d011ar5.

or

Jon Doe owes Jane Doe $1000000 do11ars.
$4#^$%^&*5#$^()^%ER$%^RT#$3

Would you not ask the sender to resend
their original message?

Arash Partow

--
Arash Partow's Philosophy: Be a person who knows what they don't know, and not a person who doesn't know.

Re:Nothing to see here.. by Jaime2 · 2006-08-27 13:39 · Score: 1

Send the original mesage as a PDF attachment. There's a ton of place to hide data in a PDF and no one would give it a second thought.
Re:Nothing to see here.. by xquark · 2006-08-27 15:48 · Score: 1

The MD should be calculated on the compressed form of the file and its
uncompressed (original) size.

The question is can you tunnel a match for the MD that is also a valid
compressed stream of some compression algorithm X. FIPS 140 doesn't
say to compress but it does say to use a format in the message that
is totally reliant on every bit in the message to be decoded properly
and that does not ignore bits it does not understand. I believe an
amendment to this should be for every erroneous bit it receives it
should add exponentially more errors to the final rendering of the
document. Its a very simple requirement, not very difficult at all,
easy to implement.

The best example I've seen so far for the attack described in the
article is one where someone using the postscript format created two
documents that had the same md5 value. They did this by placing the
garbage between the final piece of the header and the header ending
tag.

Though, even this is not valid attack vector, because a precautious
user would rightfully assume the guy at the other end (sender) run
some program Z that will remove all garbage from the document, so as
a result they could run program Z too on the document, from that a
couple of things should happen, initially the file should not change
if the document has not been tampered (aka is the original document),
and secondly and most importantly the MDs should be the same.

--
Arash Partow's Philosophy: Be a person who knows what they don't know, and not a person who doesn't know.

If you can replace the file... by Kaenneth · 2006-08-27 11:45 · Score: 1

If you can manage to replace a hashed file from a web site with content of your choosing, couldn't you just as well change the hash it's checked against?

Even if the hash and the file came from seperate sources, odds are a user went to 'xyzzysoft.com' from which the link to both the hash and the file are provided; so if xyzzy's page were hacked, they are screwed anyway.

To use a car analogy, to steal a purse in a car, a theif dosn't pick the lock, he smashes the window.

Unless you have a seperate, secure channel for transmitting the hashes, end-to-end, and compare them manually. (such as faxing the hash, then downloading the file) it's silly to use the logic of: File Hash equals Website Hash therefore File is safe, File is the one I wanted, File is really from Xyzzy. Not that hashes are useless, far from it they are excellent to verify against transmission/storage corruption, viruses, and unsophisticated attacks.

There have been hacks such as replacing the system random number generator to cheat in an online game, which would wreak utter havok with any crypto depending on it; a complier virus that adds itself to compliers compiled with an infected compiler, which could silently modify the output of a hash comparing program if a particular sequence was detected. And here's an idea I just thought up, a hacked file decompression library that modifies executables as they are being extracted, after you carefully manually checked the hash of the compressed package you recieved. (if you used a user-priviledge extraction utility for something like an updated kernel mode driver)

Re:If you can replace the file... by devilspgd · 2006-08-27 17:22 · Score: 1

Perhaps... But not always.

Consider this: A patch to an OS or popular application comes out. The patch is subject to peer review before public release, the hash is published. 30 seconds after the public release happens, I replace the file on one of the mirrors (and NOT the hash -- I don't control the main site, only a mirror or two) with a malicious version with the same hash.

Consider bittorrent -- If you could quickly and easily generate new data that matches a known hash, you could join any torrent you want as a seed, but distribute bad data that passes the hash. Your peers wouldn't know any different and would pass the bad data on to their peers too, essentially polluting the entire torrent. Wouldn't the RIAA love that?

Neither of these are possible yet, but being able to generate different content (even if random, it doesn't even need to be malicious code or English words) could be useful in some cases.

--
Give a man a fish, he'll eat for a day, but teach a man to phish...

easy tiger... by Anonymous Coward · 2006-08-27 12:50 · Score: 3, Insightful

I think the key point is this:

No SHA1 collisions have ever been published

whether or not they have been found is a different matter entirely.

Oh no!!! by einhverfr · 2006-08-27 12:52 · Score: 1

Nessie is using encription now? Beware in case you go to Loch Ness....

--

LedgerSMB: Open source Accounting/ERP

Re:Why not more rounds? (was Re:How about this c.. by Jaime2 · 2006-08-27 13:34 · Score: 1

That doesn't always work. For example, two rounds of DES encryption has weaknesses that neither one nor three round suffer from. The only way to count on an encryption or hash algorithm is to have it tested by the general public for many years. Simply increasing a key size or number of rounds is often a crapshoot and you won't know whether you were successful for a long time.

The better alternative is to move to an existing well tested algoritm while the security researchers are testing the future replacements. If there's one place you don't want to be on the bleeding edge, it's with security algorithms.

what? by YesIAmAScript · 2006-08-27 20:02 · Score: 1

It's a hash function, not an encryption.

So how can SHA-2 be secure? It has no proven collisions? I can prove it has collisions. It accepts more than 512 bits of input and only procues 512 bits of output, it must collide.

There have not been shown to be collisions for meaningful messages in SHA-1 yet. And the "break" for it requires on the order of 2^63 operations.

SHA-1 is a long way from dangerous to use at this point.

--
http://lkml.org/lkml/2005/8/20/95

Re:what? by zlogic · 2006-08-27 21:30 · Score: 1

A hash function is secure when it's difficult to do a brute-force password attack, e.g. you have the /etc/passwd contents and yet you still need weeks to get the real passwords. SHA-1 has thousands of iterations even for simple 6-character passwords.
If a bug is found, the function can be simplified so that instead of two weeks the password can be found in two days, because the number of iterations is reduced.

Re:This is NOT a big deal by owlstead · 2006-08-27 20:49 · Score: 1

Ehm, this text was written for the predating 2^63 way of finding collissions and should be considered rather out of date, even though the current attack is only for 64 rounds of SHA-1. Oh, and supplying the reference would also be a good idea. Kudo's for reading the document and following the link though.

http://www.heise-security.co.uk/articles/75686

Passwords are 'files', too by RichiH · 2006-08-27 22:03 · Score: 1

If they really did that as default, you would have a /etc/shadow filled with data about the exact length of each password. Great thing, that ;)

Some Software does include size optionally, other does compute two hashes.

The basic problem though, is that MDn and SHA-n are in the same family and these algorithms have been fundamentally broken for some time now. Instead of slathering on more of old security (Hi, 3DES!), get a new, better and above all different approach like http://en.wikipedia.org/wiki/Whirlpool_(algorithm) . By the time everyone has switched, the old algorithms are completely useless, anyway (Hi, 3DES!)

Yeah, but.. by RichiH · 2006-08-27 22:37 · Score: 1

..good sites will sign the hash with a private key. In this case, having the ability to change the hash, too does not mean it would be desirable for the attacker.

Re:This is NOT a big deal by merlin_jim · 2006-08-28 01:49 · Score: 1

The attack is not about the garbage characters - it's about what came first. The garbage characters ARE the attack themself, they don't need to carry a virus or something to be malicious... Point being, let's say you sign a document saying you're going to give me 15% of revenue. I change it to 20% and start adding random characters at the end until I get a document that maintains the same hash, meaning I can "prove" that the second document is the one you signed...

--
I am disrespectful to dirt! Can you see that I am serious?!

No, they can't by Anonymous Coward · 2006-08-28 02:33 · Score: 0

"Someone could modify the original tarball, for example, introducing a trojan horse, and append some other not useful data to it so the sum matches."

No, they can't. The attack can not take an existing message and modify it in such a way that the modified version has the same hash. These attacks can only generate a prefix and suffix chunk of seemingly random data that you can sandwich your choice of payload between and get the same hash. In other words, you have to carefuly create BOTH messages to get them to hash the same, and it's fairly obvious if someone has done so.

Your existing signed messages are still perfectly safe.

Re:Why not more rounds? (was Re:How about this c.. by malpingu · 2006-08-28 08:16 · Score: 1

I think you're confusing the number of rounds (16) in the internal DES Feistal cipher algorithm with the number of times DES may be applied as a whole with (usually different) keys to assure a longer effective key length. Triple-DES (DES3), for instance, is often applied in encrypt-decrypt-encrypt (EDE) mode using either two or three different 56-bit keys for an effective key size of 112 or 168 bits, respectively.

Re:Why not more rounds? (was Re:How about this c.. by Jaime2 · 2006-08-28 13:35 · Score: 1

I'm not confusing them, I'm just citing an example where "doing it more" ends up making the system less secure. More rounds, more bits, more iterations, it doesn't really matter. What matters is that any lay person's quick fix is likely to cause as much harm as good. It takes a lot of cryptanalysis and a lot of time to test the quality of an encryption algorithm.

My favorite example is when script kiddies pull out some wacky scheme like "Take a message, type it on an image in photoshop, save as JPEG, reverse the bytes in the file, zip it, and xor with 'HELLO WORLD'." They don't realize that a determined skilled attacker can break that much more easily than if it were a proven cipher. If you don't have a lot of experience in cryptography, you shouldn't even try to "improve" an algorithm.

Remember...... ALL encryption algorithms are 100% resistant to every attack the designers thought of, except brute force. If it wasn't resistant to their test attacks, they would have changed it. Therefore, only someone else can verify the strength of an algorithm. It takes many "someone elses" before you can be confident an algorithm is actually good.

Re:Why not more rounds? (was Re:How about this c.. by malpingu · 2006-08-28 20:27 · Score: 1

I agree, generally. Of course, not all multiple iterations of DES (e.g., DES3-EDE-168) result in a weaker cipher than the original. I was rather seeking to clarify the term 'round' which, especially in relation to Feistal ciphers, has a particular meaning that is generally accepted by cryptographers.

"ALL [well-designed] encryption algorithms are 100% resistant ..."

That is to say, some badly constructed algorithms may not even be resistant to some attacks anticipated by the designer. ;-)

that's not what hash function security is about by Paul+Crowley · 2006-09-03 21:46 · Score: 1

No, that doesn't characterize hash function security at all.

First of all, all hash functions of the same size are pretty much equally vulnerable to a brute force attack - that's practically the defining feature of a brute force attack. It's resistance to shortcut attacks that matters.

Secondly, there are loads of properties people want hash functions to have - not only preimage resistance but second preimage resistance, collision resistance, correlation-freedom, multiplication-freedom and more. In fact no-one has successfully formalized all the properties we want our hash functions to have: the formalism that comes closest to capturing it, the Random Oracle Model, is impossible for any instantiable hash function to achieve.

--
Xenu loves you!

Slashdot Mirror

SHA-1 Collisions for Meaningful Messages

128 comments