Slashback: Flashmob, Currency, Verification

← Back to Stories (view on slashdot.org)

Slashback: Flashmob, Currency, Verification

Posted by timothy on Thursday March 25, 2004 @11:59AM from the nothing-important-happened-this-month dept.

The first Slashback in a while, with updates and reactions to previous Slashdot stories, including a Flash-mod supercomputing reminder, the upside of microwave-tested currency, CUPS' user-interface foibles, an alternative to MD5 sums, and more. Read on for the details.

Reminder of your scheduled spontaneous appointment. Zero_K writes "As previously posted on Slashdot and the NY Times, the University of San Francisco's, Computer Science department is building a 'flash mob' supercomputer on April 3rd. On their newly updated official web-site (Main Site, ISO's) the team has now posted the ISO image of their custom morphix that will be used to boot all the computers into the cluster, documentation is on the website (under 'downloads') and on the CD (index.html). I personally plan on downloading and testing this ISO tonight. And after the cluster is taken off line, there will be a massive LAN PARTY (Possibly one of the biggest in San Francisco...) On a 10-Gigabit LAN...Oh sweetness ... So if you are in or around the SF Bay Area on April 3rd, be sure to sign up and bring your laptop or desktop to campus and help make history."

Whaddya mean, "no pun intended"? Rudiger writes "After the dust (no pun intended) has settled around the whole Operation Dust Bunny thing, McAfee updates their signature database classifying Dust Bunny as an application. To be more specific: 'This program is detected as a "potentially unwanted application."' They also say 'This is not a virus or trojan.' Should we leave it to the experts this time?"

Would you read Atlas Shrugged on this screen? An anonymous reader writes "The so-called 'electronic paper,' being a high-clarity monochrome display to become a foundation for comfortable and inexpensive 'electronic papers,' has finally shown its face. The new electronic paper, which looks a bit like an iPod, has 10MB memory, keyboard, Memory Stick PRO slot, voice recorder, speaker, and headphones output, and USB2.0 interface."

(We mentioned the device yesterday, but this link provides better images of it.)

Now they're Pragmatic Publishers as well -- much success! AndyHunt writes "As you may have heard, the Pragmatic Programmers have started their own publishing company (see Slashdot reviews here and here). We've just signed our first outside author: Mike Clark, editor of the JUnit FAQ and developer of JUnitPerf and JDepend. He'll be writing the eagerly-anticipated Pragmatic Project Automation book, the third volume in our Jolt Productivity award-winning series."

Exactly how many bits, Ma'am? And in what order, did you say? jlcooke writes "Two months (almost to the day) after getting slashdotted for an innocent post to sci.crypt - the MD5CRK project has launched. The aim is to get the thousands of applications and websites to drop MD5 for SHA-1 or SHA-256 by finding a counter-example of a security requirement in MD5. Press Release is here."

How to take criticism, by example. slashdot_commentator writes "Eric S. Raymond has recently written a wonderful piece explaining to the Linux zealot why it may not be the operating system of choice of all users. (Or what user aspects open source developers need to focus on to further Linux World Domination.) The op-ed specifically focuses on the CUPS printing system. (But it would be a mistake to dismiss it as a screed against CUPS.) The CUPS authors surprisingly acknowledged ESR's points, and he wrote a followup to the article."

Hitting them where it figuratively hurts. Ian Wilson writes with a followup to the Slashdot post earlier this month on "website thieves stealing content and designs from others, taken from silicon.com. Well, now silicon.com is reporting that it has contacted the offending site's advertisers and forced them to stop paying ad revenues - thus effectively crippling the illegal site - after all, no revenue, no reason to the run the site."

Express your appreciation with PizzaPal. Chuck writes "After you guys published the article on $20 bills exploding when microwaved, a co-worker of mine went to put his soup in the microwave and found a $20 bill in it. Too bad it was an older one, but someone around the office must have left it in there after reading your article. The co-worker then took me out to lunch. Thanks, Slashdot!"

59 of 218 comments (clear)

Flash-mod? by Rorschach1 · 2004-03-25 12:13 · Score: 4, Funny

I've seen that before... it's when I get modded -1 Flamebait within 30 seconds of posting!
Microwave... by ruprechtjones · 2004-03-25 12:18 · Score: 5, Funny

Hmm, just went upstairs and checked my own microwave for cash. Nothing. Maybe I should get my dimwitted roomates to start reading Slashdot.

--
Kip Hawley is an idiot.
1. Re:Microwave... by SupaZeph · 2004-03-25 12:23 · Score: 5, Funny
  
  Maybe I should get my dimwitted roomates to start reading Slashdot.
  
  Don't forget to point out to them for the best results, they need to use a large wad of cash, preferably > $1,000, rather than a solitary $20.
McAfee problems... by lukewarmfusion · 2004-03-25 12:21 · Score: 5, Interesting

The other day, there was a bitTorrent link in the article, and I realized that I didn't have Bit Torrent installed. So when I went to download it, McAfee told me it was Spyware.

Bit Torrent is spyware?

Yet another reason for me to hate McAfee.
1. Re:McAfee problems... by Kenja · 2004-03-25 12:28 · Score: 2, Insightful
  
  Its yoru own fault for having it installed. Yank the thing out by the short hairs and install a real anti virus program.
  
  --
  
  "Have you ever thought about just turning off the TV, sitting down with your kids, and hitting them?"
2. Re:McAfee problems... by Kenja · 2004-03-25 12:36 · Score: 3, Interesting
  
  There are two good AV programs for Windows that I know of, F-Prot and Command. Both will run you around 25$ a year and both run very light. I've had to turn them off when playing games, but other then that they dont have any problems.
  
  --
  
  "Have you ever thought about just turning off the TV, sitting down with your kids, and hitting them?"
3. Re:McAfee problems... by Anonymous Coward · 2004-03-25 12:45 · Score: 4, Informative
  
  AVG
  http://fileforum.betanews.com/detail.php3?fid =1028 312263
  
  Antivir
  http://www.fileforum.com/detail.php3?fi d=103256665 3
  
  Just use a throw-away mail account to register.
4. Re:McAfee problems... by spacecowboy420 · 2004-03-25 12:48 · Score: 3, Interesting
  
  AVG baby - free for personal use, updated FAST found @ grisoft.com. I liked it so much I bought licenses for the entire company. No slips yet - for over a year.
  
  --
  ymmv
5. Re:McAfee problems... by ryanr · 2004-03-25 12:48 · Score: 4, Interesting
  
  According to Bram, McAfee is currently flagging anything that uses the NSIS installer, which BT uses for recent builds. It's a false alarm, as noted.
  
  Further, make sure you download the Official client from the Official site. Suprnova has been purposely running a banner ad for a couple of months now for a BT 3.3 client that IS laden with spyware.
6. Re:McAfee problems... by Verteiron · 2004-03-25 12:51 · Score: 3, Interesting
  
  Try Avast Antivirus. The home version is free for personal use and keeps itself more up-to-date than any other AV I've ever used. The scanning engine is light and fast. The pro version for businesses is extremely powerful and flexible and runs $40.
  
  They're also beta-testing a Linux A/V client, they actually FIX BUGS when people post them to their forum, and all-in-all a few of the other A/V companies could learn from them when it comes to ease-of-use, auto-updating, and product support...
  
  --
  End of lesson. You may press the button.
7. Re:McAfee problems... by IO+ERROR · 2004-03-25 12:52 · Score: 2, Interesting
  
  the original BT client is not spyware (look at the source yourself if you disbelieve) but other BT clients might be.
  
  --
  How am I supposed to fit a pithy, relevant quote into 120 characters?
Electronic Paper by El · 2004-03-25 12:21 · Score: 5, Insightful

Hmm... put an 802.11b interface on this thing, and it won't matter that it has a trivially small amount of memory...

--
"Freedom means freedom for everybody" -- Dick Cheney
1. Re:Electronic Paper by Anonymous Coward · 2004-03-25 12:54 · Score: 2, Funny
  
  Have most people even read 500 books?
  Do editions of Playboy count?
Eh? by Feztaa · 2004-03-25 12:24 · Score: 5, Funny

a co-worker of mine went to put his soup in the microwave and found a $20 bill in it.

He found a $20 bill in his soup?
1. Re:Eh? by Tumbleweed · 2004-03-25 12:33 · Score: 4, Funny
  
  Probably a tip from the fly...
Dustbunny.... by dealsites · 2004-03-25 12:25 · Score: 2, Funny

At least we slashdotted thier site. So I guess there is probably a gap in there where they didn't get all the data they were looking for.

--
Live updates from Slickdeals, Tech Bargains, Bens Bargains, Got|Apex, etc..
md5 is weak? by lukewarmfusion · 2004-03-25 12:28 · Score: 4, Funny

This saddens me. I just finished implementing an md5 password hashing routine for a web application.

At least it's not production yet, so I can switch it over.

See? This is why my bosses should let me read Slashdot at work.
1. Re:md5 is weak? by Tom7 · 2004-03-25 14:27 · Score: 3, Insightful
  
  MD5 is not weak for password hashing.
  
  But why did you bother reimplementing it? There are loads of free, public domain implementations, unless you are working in some fringe language (no shame in that).
E-paper by Fiz+Ocelot · 2004-03-25 12:29 · Score: 3, Interesting

I'd love to be able to condence a lot of my books into something like that, but it's still just too small. It should fold out to two sides for one thing since many books are written in a format with that in mind. (at least text books for classes).
If they can do that, make notes using handwriting easy (no recognition required), I'd love that...
But I bet the main opponents to this would be book publishers who charge exhorbiant amounts for "new editions" where hardly anything was changed. oh well.
So then what... by Alizarin+Erythrosin · 2004-03-25 12:29 · Score: 2, Insightful

If we are trying to get people to move away from MD5 sums, what do we use? CRC?

--
There are only 10 kinds of people in this world... those who understand binary and those who don't
1. Re:So then what... by harlows_monkeys · 2004-03-25 13:07 · Score: 4, Informative
  
  If we are trying to get people to move away from MD5 sums, what do we use?
  SHA1, which you can use via the sha1sum command in the GNU core utilities, probably already installed on most Linux systems.
Re:Flashmobs can be fun! by orthogonal · 2004-03-25 12:29 · Score: 4, Funny

I had 10,000 assholes on my screen and so many being launched I couldn't stop them.

Welcome to Slashdot.

--
Opinions on the Twiddler2 hand-held keyboard?
wanna outsource the SF lan party? by xot · 2004-03-25 12:31 · Score: 4, Insightful

Anyone wanna outsource the infrastructure and SW for the Lan party to us indians? ;-)
Jokes apart, i'd really like to fly down to USA top be a part of the lan party and see how those guys manage things.Its one thing to have a lan party with 100 ppl but using up complete subnets is one different league!

--
Lord of the Binges.
Re:I'll drop MD5 in a heartbeat... by fredrikj · 2004-03-25 12:33 · Score: 2, Informative

SHA-1 isn't really "their" message digest algorithm, they're just recommending it as a replacement for MD5, which they're trying to crack.

You have three different "MD5 sum" utilities that all give different checksums for the same data? If so, then at least two of them aren't actually MD5 utilities, in the sense that they don't compute MD5 sums. *cough*
I love this guy! by spurious+cowherd · 2004-03-25 12:34 · Score: 3, Insightful

The technical details of these tests aren't important, and anybody who writes me arguing for a different set will have fixated on the wrong level of the problem.
The point is that, unlike a command tool for techies that should give them lots of choices, the goal of a GUI is to present the user with as few decision points as possible.
Remember the Macintosh dictum that the user should never have to tell the machine anything that it knows or can deduce for itself.
this is as clueful as it gets. Most app designers should heed him

--
Time flies like an arrow, fruit flies like a banana.
1. Re:I love this guy! by nautical9 · 2004-03-25 16:42 · Score: 2, Insightful
  
  I wholeheartedly agree, more apps should put forth more effort to autodetect and autoconfigure as much as they can, to present the fewest number of options.
  However, it's equally as import to still allow experienced users to bypass any settings with whatever they feel, because try as we might to code perfect autodection routines, there will be times when it is wrong and the user will know better. Bury it behind an "advanced" button or some such, but don't blindly assume the autodection can't possibly be wrong.
  There's nothing more annoying than knowing something is there and working, yet the program refuses to acknowledge it and offers no way to force the issue.
The luxury of ignorance by The+Bungi · 2004-03-25 12:36 · Score: 5, Insightful

I don't normally like ESR but in this case he has really been outstanding, possibly because he's not particularly afraid of "uber-geeks" shouting him down with insults and "RTFM motherfucker" epitaphs.
I see this every single day. The open source community (as it were) is full of people who want to use and like operating systems like Linux and BSD but are just too fucking afraid of even uttering anything that might reveal their ignorance (and I don't use that word in a negative sense) of whatever it it they're trying to accomplish with their computers.
Slashdot and USENET are full of endless threads about how easy it is to do this-or-that and if you haven't figured it out you must be supremely stupid and lazy. "What, you want it in a fucking silver plate?". Normal people (the ones not buying into open source right now) are petrified at this. They eventually either figure out how to do it ($deity bless Google) or just give up.
Without gross generalizations of course, I can't claim that everyone is this way. But there seems to be a troubling majority of zealots who are just so fantastically out there in their claims that [insert technology here] is so easy to use that even a "brain dead Windoze luser" must be able to figure it out, so they just cannot figure out why everyone hasn't dumped "M$". I mean, it's all so easy and efortless.
Maybe this will indeed be a wake up call for everyone.
1. Re:The luxury of ignorance by donnz · 2004-03-25 13:01 · Score: 2, Insightful
  
  I appreciate ERS is trying to raise the bar on UI design which is good, but I do think his comments are extreme. I see the opposite to you, lots of users at work and at home, happily using GNU/Linux desktops and some rather well written and designed end user applications.
  
  So to tar all OSS with the same brush seems pointless and counter productive.
  
  --
  -- Free software on every PC on every desk
2. Re:The luxury of ignorance by The+Bungi · 2004-03-25 13:19 · Score: 2, Interesting
  
  Like I said, I was not generalizing. I agree that there is good, but at the same time I recognize that there is a lot of bad. Do you?
  There are so many little things that cripple non-expert users in Linux. Just off the top of my head, on RH9/GNOME, inserting a CD-ROM brings up a dialog that reads
  Would you like to mount /dev/cdrom?
  Or something like that. I mean, c'mon. If I wanted that I'd be running fwwm or something. Do I want to "mount" "/dev/cdrom"? How the hell should I know?? Or even better, try installing a TrueType font on Linux. Oh my god.
  If more effort was directed towards these things rather than to making yet-another-theme-for-KDE Linux would be vastly more user-friendly and maybe it would be actually giving Windows a run for its money on the desktop.
3. Re:The luxury of ignorance by Spy+Hunter · 2004-03-25 14:16 · Score: 4, Insightful
  
  The problem is that user interface design is not something that can be done by a geek sitting alone in his/her room coding. Even so-called user-interface experts can't fix a user interface by themselves. The one and only key to designing a good user interface is USER TESTING. This means finding other people who have never used your software and observing them as they learn how to use it. It really is crucial to get actual people to use your software while you watch. Without user testing, your user interface will be crap no matter how many self-proclaimed experts pontificate on the merits of your various design choices. 10 minutes of user testing is worth days of speculation about how to make your interface better. However, nobody writing open-source software does usability testing; they make their GUIs by themselves based soley on their preconcieved notions of what a GUI should be like. No matter how well-intentioned the developer is, this process won't produce easy-to-use software, and it won't produce new innovations in user interface design. I am convinced that this is the reason open-source software interfaces suck and are mostly copycats of other software.
  Another big problem with UI design in general is that when things go wrong, there is a tendancy to blame the user instead of the software. "You should have clicked this other button" or "You should have seen this option" or the ever-popular "You should have read the manual". This attitude is not restricted to open-source software developers; you see it everywhere. The fact of the matter is, if one person makes a stupid mistake, it's quite likely that other people will too, so you should account for it in your design no matter how stupid the mistake seems. If you want a good user interface, you have to make sure that even the stupidest mistakes people make are accounted for in your design. The attitude you need to have to design a good UI is: _every_ mistake a user makes is entirely the fault of the interface, because a truly good interface would either eliminate the possibility of making a mistake or at least be smart enough to indicate that you're making a mistake. Obviously it's not possible to meet this ideal, but a lot can be done to eliminate most mistakes users make. On-the-fly spell/grammar checking is a good example of this philosophy.
  
  --
  main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
Wow.. by msimm · 2004-03-25 12:38 · Score: 3, Interesting

ESR just jumped A LOT of points in my book. I haven't read anything so dead on in the community in ages. But add to that his level of tact and his *gasp* sympathy for the user. Wow. Definitely worth the read.

--
Quack, quack.
Re:I'll drop MD5 in a heartbeat... by agentZ · 2004-03-25 12:44 · Score: 2, Informative

Or grab the nifty new (v1.1 released today) md5deep. Computes MD5, works recursively and most any platform too.
Re:I'll drop MD5 in a heartbeat... by The+Bungi · 2004-03-25 12:47 · Score: 5, Insightful

Bullshit. I have code that generates MD5 checksums written in C++ (using Crypto++ and using the CryptoAPI), VB (using the CryptoAPI), C# (using the System.Security.Cryptography.MD5 provider) and Python (using md5 on both Windows and Linux) and they all generate exactly the same digest for the same data as the UNIX checksum utility.
If you wrote code to generate the checksum(s) and it's not working then you have a problem between the keyboard and chair, not with the algorithm. That's a standard that is not OS, platform or language specific.
Re:I'll drop MD5 in a heartbeat... by ryanr · 2004-03-25 12:53 · Score: 2, Informative

Be extraordinarily careful when trying to take a MD5 sum of a text file. Most operating systems will give you different file contents for a text file, depending on how you ask to open and read the file. If you have MD5 utils that aren't explicity requesting all files in binary mode, then they are being sloppy.

You also have to be careful with text files that they aren't being modified on the fly when being transferred between machines.
Re:I'll drop MD5 in a heartbeat... by clifyt · 2004-03-25 12:56 · Score: 3, Interesting

Thats strange -- are you feeding it the same data?

I have a few implementations of MD5 that I use for various apps that ALL give the same results. Sometimes you have to make sure that character sets and otherwise are being processed that same way, and it all comes out the same way.

Lets see -- I have the PHP builtin function, a perl implementation (for systems that don't have it built into the OS), a Javascript one and one that was for just plain ASP (not the .NET -- never used it yet. Hell, I use it to pass off authentication between these languages when I can't get away with using the same language through out. All work exactly the same...and I'm not even that great of a programmer...
How do you know it isn't? by Eevee · 2004-03-25 12:59 · Score: 2, Interesting

Not a flame or anything, but did you check the source for the Bittorrent client you downloaded? SpywareInfo shows there is a Bittorent client floating away with an infection of spyware.

Just for grins, I checked my machine and McAfee ( Virusscan Enterprise 7.0.0, virus defs 4341) didn't complain about ABC [Yet Another Bittorrent Client] 2.6.5 being on my machine. (Nor did AdAware 6.0.) So McAfee doesn't go after all Bittorrent clients.
Finding 1 MD5 collision does break the system... by tstoneman · 2004-03-25 13:13 · Score: 2, Insightful

So you spend all these resources to find one collision amongst 2^128 combinations.... not really that useful. Sure it is significant, but does it really bring down the entire MD5 infrastructure?

To really destroy MD5, you need to either be able to reverse the plaintext from the hash, or build a lookup table where you can get the plaintext from the hash.

Both of these seem infeasible, especially the lookup table, so things like Paypal using MD5, which the web site uses as an example, doesn't seem quite true.
$400 book!!! by iamhassi · 2004-03-25 13:20 · Score: 2, Insightful

"In Japan it will cost about 40,000 ($365). "
for a 7.5" by 5" device with 800x600 4-tone grayscale and 10 megs they want how much??? Damn thing probably doesn't even have a decent processor, can't do 1/10th the things a 5 yr old Palm could do and they're charging $400?!? Did I warp back to 1984? Sure it's not a Mac?
Let Dell copy it and sell them for $149.

--
my karma will be here long after I'm gone
Proprietary form by kisielk · 2004-03-25 13:23 · Score: 2, Insightful

The device supports Sony's BBeB (BroadBand eBook) format and utilizes OpenMG copyright security.

Apart from this, does it support any other format? I'd love to have something like this to read the countless PDF and HTML books I have, but if I had to buy them again in BBeB format, it's not quite as cool.
Drop MD5? No. It depends on the intended use. by pla · 2004-03-25 13:24 · Score: 5, Informative

I frequently use MD5 in my code, for verifying a file's integrity. I do not use SHA-1 or SHA256, because they run a lot slower than MD5, without providing a realistically better guarantee that a file contains what it did at the time of its creation (if 128 bits leaves a significant chance of collision, you have bigger problems than choice of hashing algorithms... Such as how to store over a trillion yottabytes, which corresponds to one bit per 10 picograms assuming you used the entire Earth as a storage device).

Now, cryptographically, MD5 does not have the same "strength" as the SHA256. If you want to prevent tampering, you should most certainly switch to an SHA. But to just check the validity of a large block of data (such that a mere CRC doesn't suffice), MD5 works beautifully.

Additionally, I would point out to those who seem to believe finding a single MD5 collision would invalidate the whole algorithm - BS. For SHA256, going though every possible 257 bit block, you can guarantee a collision. For any hashing algorithm, that will hold true. I don't care if someone came up with a quantum hash (pulled from my posterior, since quantum-blah seems like the word of the day for magical guarantees of computational perfection), you'll still have at least one collision in N+1 bits, where the hash generates N bits.

So can we drop the SHA elitism that seems to have infected people lately? If you want to waste time in your code, go right ahead. But don't fault those of us who actually understand that, outside the realm of hard cryptography, MD5 more than suffices as an all around good hashing algorithm.
Re:Finding 1 MD5 collision does break the system.. by interiot · 2004-03-25 13:27 · Score: 3, Insightful

LOL. You mention in your own post that MD5 is 128 bits long. If you just restrict yourself to documents that are, say, 10mb big, that means there are 2^81920 possible plaintext documents for each MD5 hash. Granted, only some of them will look remotely like english, STILL... 2^81920 is quite enough to come up with many plaintext documents per hash. If you restrict yourself to keys
Re:Finding 1 MD5 collision does break the system.. by fredrikj · 2004-03-25 13:31 · Score: 3, Insightful

As far as I've understood it, the primary purpose is to demonstrate that cracking MD5 is realistic. If this project can then anyone with decent resources (the MD5CRK FAQ claims $100,000 would be enough) can do it. Also, additional collisions will most likely be found soon after the first one (the probability of finding collisions increases), and the data collected from the search can be used for future efforts (e.g. for analysis that might reveal actual statistical flaws in the algorithm).
Re:I'll drop MD5 in a heartbeat... by MenTaLguY · 2004-03-25 13:52 · Score: 2, Informative

MD5 is standardized and portable.

Perhaps some of the utilities you are using consider file metadata when generating the checksum?

Also beware of implicit conversions being done to your data by your I/O libraries, as other posters have noted.

--

DNA just wants to be free...
Error in MD5CRK assumptions by droyad · 2004-03-25 14:03 · Score: 3, Informative

We aim to disprove one of the fundamental requirements of a secure message digest: No two inputs can be found which produce the same digest
That is an incorrect assumption. The fundamental requirement is: It is hard (next to impossible) to find two inputs which produce the same digest (and still make sense
The message digest is usually shorter than the message, so this means that the digest contains less "information" that the message. Which means there will be more than one message for the same digest. This loss of "information" means also that you cant reverse a hash to get the original message and be 100% certain you have the right message. There is an infinite number of messages that produce that hash.
Re:Is it just me..... by gregfortune · 2004-03-25 14:19 · Score: 3, Insightful

Is it really so much to ask that people learn how to use the tool they choose to use properly? Is it so much to ask that people know how to read?

When they shouldn't have to read or choose, it's lacking for an app to make them choose. In ESR's case, he shouldn't have had to make the decision as the system had all the info it needed to answer the question itself.

Yes, this takes more effort on the part of the programmer and that's probably why it's not done yet, but it's near-sighted to argue against a change that only improves the user experience. Not only does Aunt Tilly now have a good chance of getting her printer setup, I don't have to work nearly as hard reading manuals and experimenting with settings to get mine working. Why read the manual if you don't have to?
Re:Drop MD5? No. It depends on the intended use. by jlcooke · 2004-03-25 14:27 · Score: 2, Informative

Almost forgot your comment about speed. SHA-1 is slightly slower then MD5. SHA-256 is slightly slower then SHA-1. SHA-384/512 use 64 bit operations so it is much slower on 32bit systems. In short, you concerns about speed are unfounded. Read on.

Run this command:
openssl speed md5 sha1

I get: ...
The 'numbers' are in 1000s of bytes per second processed.
type 16 bytes 64 bytes 256 bytes 1024 bytes 8192 bytes
md5 13426.71k 46361.18k 124663.83k 222340.64k 286203.62k
sha1 11175.12k 30058.96k 69783.42k 104107.06k 121809.96k

I also ran "time md5sum file94mb" and "time sha1sum file94mb" file 3 times in succession. The performance is much closer.

a959b7de4f11fe89ba57ecc6fe2f6a07 file94mb
real 0m1.070s
user 0m0.860s
sys 0m0.060s

a959b7de4f11fe89ba57ecc6fe2f6a07 file94mb
real 0m1.070s
user 0m0.850s
sys 0m0.070s

a959b7de4f11fe89ba57ecc6fe2f6a07 file94mb
real 0m1.071s
user 0m0.810s
sys 0m0.110s

5d926755ef975a8900b89b514feac9ded29c4477 file94mb
real 0m1.538s
user 0m1.260s
sys 0m0.060s

5d926755ef975a8900b89b514feac9ded29c4477 file94mb
real 0m1.524s
user 0m1.270s
sys 0m0.040s

5d926755ef975a8900b89b514feac9ded29c4477 file94mb
real 0m1.520s
user 0m1.280s
sys 0m0.030s
of course there are MD5 collisions! by Mr.+Slippery · 2004-03-25 14:37 · Score: 5, Informative

Are the MD5CRK folks trolling, smoking crack, or just not explaining themselves very well?
They "aim to disprove one of the fundamental requirements of a secure message digest: No two inputs can be found which produce the same digest - this is also known as a collision."
MD5 gives a 128-bit digest. There are more than 2^128 possible messages. Of course there are collisions. What MD5 claims is that the difficulty of coming up with two messages having the same message digest is on the order of 2^64 operations, and that the difficulty of coming up with any message having a given message digest is on the order of 2^128 operations.
No digest algorithm can claim to be free of collisions; they are many-to-one mappings.

--
Tom Swiss | the infamous tms | my blog
You cannot wash away blood with blood
My comments on the slashback stories mentioned... by magores · 2004-03-25 14:53 · Score: 2, Funny

-Cool
-Duh!
-huh?
-Whoa!

Please feel free to apply to comment of your choice, to the /back story of your choice.... Moderate as appropriate.
Re:Finding 1 MD5 collision does break the system.. by Geoffreyerffoeg · 2004-03-25 14:54 · Score: 2, Insightful

> to...be able to reverse the plaintext from the hash

THE plaintext? Firstly, there cannot be only one plaintext. By the pigeonhole principle, a few byte sum cannot be unique for all multi-megabyte texts.

Besides, if that were possible, MD5 would not be destroyed; it would become the world's best compression.
Maybe this underscores the problem... by dozer · 2004-03-25 15:31 · Score: 3, Insightful

ESR says, "Let's go back to the queue type selection screen. Remember that one? It looks like this: Locally connected, Networked CUPS (IPP), Networked Unix (LPD), Networked Windows (SMB), Networked Novell (NCP), Networked JetDirect". He then goes on to say that all of this should be autodetected and then the irrelevant options grayed out. According to him, each host do a Christmas tree scan (!!) of the local network to see what printer types to prompt for.

First of all, he'd better stay the hell away from my network. I thank goodness that no other (non-script-kiddie) application on this planet performs unprompted scans like this. DHCP, of course, doesn't count. :)

Second, what if the printer is currently down? Or I'm configuring a machine to be installed offsite? I can think of any number of scenarios where I'd want to configure a network printer that isn't currently on the network.

A program should NEVER think that it's smarter than the user. What if CUPS doesn't detect "wvlan0" as a network interface? Well, it would gray out all the network printer options. But that's clearly wrong -- the user *knows* that the machine is networked. If CUPS allowed him to configure the network printer, everything would just work. Note that CUPS probably should put up a warning dialog "Warning: I could not detect a network -- do you want to continue," but it should not prevent or restrict anything.

ESR's solution relies on too much magic and will cause support nightmares. It is too system-dependent -- it might work on Red Hat, but it'll probably break on SuSE. Or an ARM-based machine. Or a token ring network. Etc. And when it breaks, the user will be surprised and have no other recourse than to consult the documentation.

Incidentally, graying something out is almost always wrong because it gives no indication as to why it's grayed out! You should let the user select it, then put up an informative dialog telling the user that what he's doing doesn't make sense, and what he or she might do to fix it. Always, always, always tell WHY.

Yes, the CUPS UI is flawed ("client-error-forbidden! client-error-forbidden!"), but ESR's proposal is even worse. It's a measly six-item menu! If Easy Software did try to implement it, after a ton of programmer time they'd have an interface that is more surprising, less informative, and more fragile. Not a step in the right direction.

The proper way to fix this unfriendly menu is to create a wizard The first page would allow you to select a locally-connected printer or, if there are no unconfigured local printers, a network printer (possibly launching a Samba browser to help). Wizards are great for reducing perceived complexity without reducing functionality.

Creating a good user interface is hard. I think that ESR just proved this. :)
1. Re:Maybe this underscores the problem... by Sabalon · 2004-03-26 04:23 · Score: 2, Informative
  
  I like the idea of my computer auto-detecting any network printers. I don't have my printer linked up to the network as I don't have time to try and figure out how to do it.
  
  I do to. But I also want the ability to say "you're wrong...do it this way" to the computer.
  
  I think you are in the minority. If the printer is currently down then you can't use it so configure it later when you CAN use it.
  
  For home users, this may not be an issue. For an office environment, it may be. There have been a few times dealing with printers where part of the support group is out unboxing them, putting them in place, assigning an IP to it, while the network group is setting the servers up to have that print queue. It'd suck if there were 20 of these and we had to wait until they were all unboxed and setup to add the queue, or get interrupted to do each one as it's setup.
  
  Sure...have the ability to say "I don't think this is the option you are looking for" but at the same time, allow people to override any auto-detection with the caveat that this may no work if you don't know what you're doing. Just don't force them to be at the mercy of the wizard programmer as to what should happen in the real world.
  
  How will it just work if it cannot connect to the network interface?
  
  The point was, I believe, that the wizard may not be smart enough to recognize every type of connection. For instance, if the wizard was written by someone who never say anything but Linux, there may be code that looks for ethx. But on HPUX it's lanx, or whatever Solaris calls its interfaces. Or what if my only connection is a ppp connection (dialup or VPN) and it isn't connected at the time I'm setting it up. Again, it's about saying "this is what it looks like to the wizard, but if you wanna igore those assumptions, go ahead at your own risk."
  
  They user doesn't care about how difficult it is behind. If it's badly designed then it will be system dependant and a support nightmare. It will need a bit of thought in the design, that's all.
  
  Amen brother.
  
  And it isn't just Linux - printers on any Unix system (or any system at all) are ridiculous.
2. Re:Maybe this underscores the problem... by dozer · 2004-03-26 07:45 · Score: 2, Interesting
  
  I don't mean to imply that graying out is always bad. In trivial cases, as you observed, it's amazingly useful. This case, however, is definitely not trivial. Have you ever hunted all over a user interface trying to figure out why the hell a particular menu item is grayed out? I have. It's agonizing. Mac apps from the early 90s were notorious for this.
  
  There have been some solutions in the past. Balloon help did a really good job of explaining WHY a particular menu item was grayed out. It's too bad it worked so poorly and looked so stupid. I've seen Microsoft apps put the info in the status bar when you hover over a menu item. This is good too. But, please, never gray something out if the reason is not immediately obvious.
  
  Wizards aren't MS-specific. Heck, OSX uses them a lot. It's a way of taking the user by the hand and guiding him or her through a complex process. It's true that MS has given Wizards a bad name by using them all over the fricken place, but that doesn't mean they're all bad.
  
  Here's how my proposed wizard would work. My apolgies for the ugliness of the following. I made some really nice ascii art but the lameness filter rejected it.
  
  1. Entry screen:
  
  Local Printers: .-------------. | Epson C80 | | '-------------' [Select Network Printer] [Next]
  
  All local printers would be displayed. The first unconfigured local printers will be preselected. Clicking the Select Network button takes to to step 2, Next takes you to step 4.
  
  2. Select Network Printer Type:
  
  o Internet Printing Protocol o Windows (SMB) o Unix (LPD) [Next]
  
  3. Browse Network Printers
  
  .-------------- | SMB or IPP browser '--------------- | Editable Text Box to display/accept share name | [next]
  
  4. Configure selected printer...
  (insert rest of wizard here).
  
  I can't think of an easier or more capable way of solving this problem. If you can, I'd love to hear it.
  
  I know this will sound trite, but look at Apple's Human Interface Guidelines for some very sound principles in UI design.
  
  I was a Mac developer for 4 years... I can quote those guidelines backwards and forwards. I think I still have a copy in my garage. The problem is, they are definitely showing their age. As you noted, even Apple doesn't strictly ahdere to them anymore. The world is a more complex place now.
The Insanity of Blind Autoconfiguration. by The+Monster · 2004-03-25 15:47 · Score: 3, Interesting

How exactly can you verify that there's not a Windows print server on a non-local subnet that you want to use?
I thought the same thing, and emailed ESR to that effect on the 11th of this month:

I have been saying for some time that the biggest hurdle for Linux right now is the difficulty of configuring the system for a non-geek. But I can't go along with you on this: > If the preceding rules leave just one choice, so inform the user and go straight to the form for that queue type. I spend a fair amount of effort getting character-based tools (Bourne scripts that run on SCO Open Server, AIX, and occasionally HP-UX and Linux, to be precise) for non-technical users to work, including the frequently daunting task of autodetecting configurations to come up with reasonable defaults. I have learned the hard way that autodetection is never 100%. Even Microsoft gets this - their 'Wizards' always have a check box or button for [x] Let me choose/configure/whatever. Just because no Jet Direct is found on your local subnet via autodetection doesn't mean that you don't want to configure printing to it. It might be on the other side of a router. Should autodetection offer the most likely prospects for what the user intends? Absolutely. But there must always be a clearly-labelled way to explore other options. It's easy enough to do... Which printer do you wish to configure? Windows Print Shares: [ ] \\DEXTER\HP HP DeskJet 656c [ ] \\DEEDEE\EPSON Epson Stylus C84 Unix Print Shares [LPD]: [ ] pana@192.168.1.200 Panasonic KXP-1100 HP JetDirect: [ ] 192.168.1.50:9100 HP LaserJet 4L [ ] 192.168.1.50:9101 Dymo SE-300 [ ] 192.168.1.50:9102 Generic Centronics OTHER [ ] I don't see the printer in this list. [ <- Back ] [ Next -> ] [ Cancel ]

He hasn't replied to my email.

--

[100% ISO 646 Compliant]
SVM, ERGO MONSTRO.
Re:Finding 1 MD5 collision does break the system.. by Idarubicin · 2004-03-25 18:27 · Score: 2, Informative

Granted, only some of them will look remotely like english, STILL... 2^81920 is quite enough to come up with many plaintext documents per hash.
Peachy. Where were you going to put the lookup table for that? 2^81920 is on the order of 10^25000. If you could store one of those documents on an atom (attach it with a little dab of glue, okay?) you'd have enough plaintext documents for every atom in this universe...and for every atom to have its own universe of attached atoms...and still have enough documents to be short several orders of magnitude of storage space. Generating the table is left as an exercise for the reader. Cheers.

--
~Idarubicin
MD5CRK will need a few more participants... by Paul+Crowley · 2004-03-25 20:23 · Score: 2, Interesting

By my calculations, at the current rate they'll take over 500 years to produce a collision. They need about a hundred times as many people on board to get anywhere.

The sum I did is

sqrt(-l(0.5)*2*2^128)/(1.325*10^9*86400*365)
51 9.78646399116343804161

N=2^128 is the space they're looking for a collision in. The expected number of collisions found after k items have been produced is very close to k^2/2N, so the probability zero have been found is exp(-k^2/2N) by the Poisson distribution. Assume exp(-k^2/2N) = 0.5 and solve for k, then divide by their declared rate of 1.325 gigaMD5s a second.

I don't know whether this inclines me to give the whole thing up or to climb on board. The latter is probably more fun.

Incidentally, the algorithm they're using to do the search efficiently is pretty cool. Paul C van Oorschot and Michael J Wiener, Parallel Collision Search with Cryptanalytic Applications (pdf)

--
Xenu loves you!
MD5CRK boneheaded by Ed+Avis · 2004-03-25 21:25 · Score: 3, Insightful

According to the MD5CRK site:

We aim to disprove one of the fundamental requirements of a secure message digest: No two inputs can be found which produce the same digest - this is also known as a collision.
That is bullshit. Of course two inputs can be found which produce the same message digest. This is the pigeonhole principle. Now the MD5CRK developers seem like smart people, and so it's more likely that they just haven't explained it very well.

They go on to say

To raise awareness we will find at least two strings of printable text that produce an identical MD5 hash.

But I don't see what that would achieve either: two strings of gibberish that happen to have the same MD5 sum. Find a way to produce two documents which both have meaning (perhaps two pieces of source code, or two different school reports) and have the same signature, and that would be impressive.

--
-- Ed Avis ed@membled.com
1. Re:MD5CRK boneheaded by Ed+Avis · 2004-03-26 03:42 · Score: 2, Interesting
  
  The definition of collision-resistent is that you cannot find ANY inputs x,y st x!=y and H(x) == H(y). None. No exceptions.
  In other words, that the function H is injective. But no message digest function producing a fixed-length digest from an arbitrary-length input can have such a property.
  
  Lets say I could easily generate MD5 collisions on 'random-looking' 128-bit strings ... Would MD5 be considerd broken?
  It depends on how you were doing it 'easily'... if you simply had a great deal of brute force to apply, then you could apply the same brute force to SHA-1 or any other message digest function. Only if you have some way of finding collisions which is better than brute force would this be a weakness in the digest function. (Of course, one can imagine a trivially weak message digest that has only 'A' and 'B' as possible outputs; for that algorithm even a brute force attack is easy enough to worry about, but this isn't the case for MD5 as far as I can tell.)
  
  Looking at the method MD5CRK will be using, it seems they'll just be brute-forcing MD5 by running it in a feedback loop to find a cycle (which must exist). As they say, this could be applied to any function which has finite range and domain (assuming that there is some reasonable mapping from domain back to range: in this case, they can both be treated as strings). Exactly the same technique could be applied to SHA-1. Do they simply mean that SHA-1 has a larger range of output values, so its cycles are probably longer and harder to find?
  
  --
  -- Ed Avis ed@membled.com
MD5 colision demonstration. by rixster · 2004-03-25 23:46 · Score: 3, Interesting

For all those interested in the MD5 signing of a message and how "impossible" it is - take a look at www.cryptool.org and the demonstration under "Individ. Procedures" -> "Attack on the hash value of a signature". You may be (unpleasently) surprised about how easy it is to match two completely different documents to have the same MD5.

--
Two wrongs may not make a right, but three ....
Re:Is it just me..... by jpop32 · 2004-03-26 00:45 · Score: 3, Interesting

Setting up a printer in Linux was one of the first things I did after I figured out how to install it, and surprise surprise all i had to do to get it working was, wait for it, READ THE MANUAL.

And in the Windows, installing a network printer goes like this: Select 'add new printer', click next, check 'network printer', click next, click next, select the printer from the list, click next, click finish, admire the test page printed out on the remote printer. Windows user is done before Linux user read the first page of the manual.

Do you for a second believe that Linux way is better in this respect?

Is it really so much to ask that people learn how to use the tool they choose to use properly? Is it so much to ask that people know how to read?

Joe Q. User (you know, the one Linux needs to win over in order to establish world domination) answers: Yes, and yes. I want to install a printer, not read something. If I wanted to read something, I'd go to the library.