Finding More Than One Worm In the Apple

From whence the headline? by SuperKendall · 2014-05-16 04:20 · Score: 4, Insightful

The "Apple" had only one bug, the Goto Fail bug - since Apple did not use OpenSSL they never had the second bug.

So why is the headline painting Apple as the source of both bugs?

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

Re:From whence the headline? by Anonymous Coward · 2014-05-16 04:42 · Score: 4, Insightful

The "Apple" had only one bug, the Goto Fail bug - since Apple did not use OpenSSL they never had the second bug.
So why is the headline painting Apple as the source of both bugs?
Dude.. chill, it is an actual apple, as in a fruit -- it is a saying. I didn't read the headline your way at all.
Re:From whence the headline? by jeffmeden · 2014-05-16 04:45 · Score: 3, Insightful

They are comparing the test methods that might have cought the Apple SSL "goto fail" bug vs the Heartbleed openssl bug (which was unchecked memory access). How do we know there isn't another SSL bug in either? That's right, we don't. And we won't until testing (automated or otherwise) gets better in both places. Ideally we would have a way to find (and fix) lots of worms in both.
Re:From whence the headline? by jythie · 2014-05-16 05:14 · Score: 1

Though even with automated testing, we still do not 'know'. Striving for perfection is something worth doing, but believing perfection is possible is not.
Re:From whence the headline? by Anonymous Coward · 2014-05-16 05:15 · Score: 2, Informative

It's exactly the original title of the article which is:
"acmqueue - Finding More Than One Worm in the Apple"
Re:From whence the headline? by radtea · 2014-05-16 05:19 · Score: 3, Interesting

And we won't until testing (automated or otherwise) gets better in both places.
I'm skeptical of testing (automated or otherwise), and I think point in TFS is well-taken: testing that would have caught this bug would have involved creating tests that virtually duplicated the system under test.
While some code is susceptible to test-driven development and thorough testing, and that should be done where-ever possible, the resources required to test some code effectively double the total effort required, and maintaining the tests becomes a huge headache. I've worked in heavily-tested environments and spent a significant fraction of my time "fixing" tests that weren't actually failing, but which due to changes in interfaces and design had become out-of-date or inappropriate.
That's not to say that testing can't be done better, but it's clearly a hard problem, and I've yet to see it done well for the kind of code I've worked on over the past 20 years (mostly algorithmic stuff, where the "right" answer is often only properly computable by the algorithm that is supposed to be under test, although there are constraints on correct solutions that can be applied.)
So I'm arguing that a culture of professionalism, that implements best-practices including coding standards and code reviews (possibly automated) that check for simple things like open if statements and unchecked memory access would be lower cost and at least as effective as heavier-weight testing.
This is a static-analysis vs dynamic-analysis argument, and while I certainly agree that dynamic analysis is necessary, both these bugs would have been caught with fairly simple-minded static analyzers checking against well-known coding standards from a decade ago.

--
Blasphemy is a human right. Blasphemophobia kills.
Re:From whence the headline? by phantomfive · 2014-05-16 05:41 · Score: 2

I don't know about the headline, but the other day I ran into a bug on OSX (some commandline tools developed problems with UTF-8). "No problem," I thought, "I'll just report it." I had to create an account to report a bug, which was annoying, but then when I got to the bug reporting website, I found this error message. "LOL" I thought, "but ok, I'll email them." I told them their bug website was having trouble, and they emailed back and said, "please report that through our bug reporting tool."

--
"First they came for the slanderers and i said nothing."
Re:From whence the headline? by rasmusbr · 2014-05-16 05:41 · Score: 2

Okay, but in this case the bug had little to do with the algorithm. The bug was triggered unconditionally for every input.
Re:From whence the headline? by serviscope_minor · 2014-05-16 05:55 · Score: 3, Informative

both these bugs would have been caught with fairly simple-minded static analyzers checking against well-known coding standards from a decade ago.
Except they wouldn't. Coverity out right stated that their static analyzer would not have caught the heartbleed bug.

--
SJW n. One who posts facts.
Re:From whence the headline? by Greyfox · 2014-05-16 06:01 · Score: 2

It's no so much that -- if your coupling is loose enough, you should be able to test the API of any component in your system. But you have to come up with that test. Programmers often have blind spots for things where "no one would ever do that." It might be OK for programmers to come up with basic tests that exercise the API and make sure it's at least marginally functioning as designed, but you also really need to throw some guys at the code who just like to break things. Paid software houses don't even do that very often, much less open source projects.
You also never see software auditing anymore. Everyone says "Oh you don't need that anymore now that we have Fortify," but fortify didn't catch this bug, did it? I did some auditing for Data General back in the '90's and found the buffer overflow in the AT&T telnet server 2 years before the same overflow was found on the Linux one. Fortify might have actually caught that one, since it was a fixed length buffer accepting user input, if anyone had ever thought to run Fortify against that program.

--
I'm trying to teach myself to set people on fire with my mind... Is it hot in here?
Re:From whence the headline? by Cinder6 · 2014-05-16 06:03 · Score: 1

It's an attempt to get more views, I think. I know I clicked the link when I saw it in my RSS feed because I thought, "Holy crap, they found another glaring security whole in Apple products?" Then it's somebody analyzing others' analyses.

--
If you can't convince them, convict them.
Re:From whence the headline? by eulernet · 2014-05-16 06:51 · Score: 1

I totally agree with you: testing is not a panacea.
Another good practice is adding bounds checking when building in debug mode (for example in "assert" statements).
When you develop your code, you use the bounds-checked version, and when in production, the checks are removed.
Also, you seem to forget that OpenSSL is full of legacy code, and legacy code is a not 2 times harder to unit-test but a hundred times !
You need to cut the code into little pieces, but the goal of OpenSSL was to write the fastest code.
So there is a culture problem before writing tests !
Re: From whence the headline? by Anonymous Coward · 2014-05-16 07:12 · Score: 1

Except the TFS is about how a simple unit test could have found it. The quote from Langley is being rebutted, not reinforced.
Re:From whence the headline? by DarwinSurvivor · 2014-05-16 07:15 · Score: 1

We may never know that there isn't, but we very well could know that there is.
Re:From whence the headline? by Anonymous Coward · 2014-05-16 07:34 · Score: 2, Interesting

I seem to remember seeing an article on the NASA coding practice, and they do exactly what the summary suggests: every important feature is implemented twice, with two different algorithms, and they are tested against each other to ensure they produce the same result. They also do formal code reviews of every check-in (no matter how minor), and any bug found is treated as a process problem (i.e. how can we fix the process that allowed this bug in), rather than just a software problem.
As a result they produce code which is as close to perfect as anyone has ever come, and costs about 10x the industry average to develop.
Re:From whence the headline? by 93+Escort+Wagon · 2014-05-16 08:35 · Score: 1

Dude.. chill, it is an actual apple, as in a fruit -- it is a saying. I didn't read the headline your way at all.
Actually, you are wrong. If you read the article, you'll see its main focus is on the "goto fail" bug and what the author perceives as the development shortcomings that allowed it to happen in the first place. The focus is pretty Apple-centric, mainly because he's using the "goto fail" bug as the primary evidence to support his central tenet. However I did not get the impression the author was anti-Apple.
Heartbleed is only mentioned as an afterthought because (as the article mentions) it became public knowledge some time after the author wrote the first draft of the article.
The "finding more than one worm" phrase doesn't appear to refer to Heartbleed at all - it's about (in the author's opinion) changing practices so more bugs can be caught and/or prevented.

--
#DeleteChrome
Re:From whence the headline? by Paradise+Pete · 2014-05-16 15:29 · Score: 1

This is why people hate Apple users.
If something like that make you "hate," then perhaps you should consider raising your level of tolerance.
Re:From whence the headline? by Reziac · 2014-05-17 04:53 · Score: 1

This is why it's not Apple Pi.

--
~REZ~ #43301. Who'd fake being me anyway?
Re:From whence the headline? by davester666 · 2014-05-17 09:22 · Score: 1

link bait. mentioning Apple gets them the most hits.

--
Sleep your way to a whiter smile...date a dentist!
Re:From whence the headline? by oldCoder · 2014-05-19 08:46 · Score: 1

If the system had been designed from scratch to be testable, testing would be easier.
So we can see that testing is a function of specification and design. As many have now realized (TDD etc).
The problem: "If the system had been designed".
It wasn't designed. It grew.

--

I18N == Intergalacticization

Tests can never catch these bugs by Anonymous Coward · 2014-05-16 04:22 · Score: 3, Insightful

For the same reason new viruses will always defeat anti-virus software: Each virus is tested against existing anti-virus programs and only released into the wild when it has defeated all of them.

Re:Tests can never catch these bugs by Gunboat_Diplomat · 2014-05-16 04:33 · Score: 1

For the same reason new viruses will always defeat anti-virus software: Each virus is tested against existing anti-virus programs and only released into the wild when it has defeated all of them.
Not all of them. Malware writers go for the biggest targets for the least effort, and often just test against (and even have active intervention against) the best known/most used free AV programs (Microsoft, Avast, AVG, etc.), since a major volume of users use free solutions, and the major commercial (Symantec, Trend, Kaspersky, F-Secure, McAffee, etc.). It is actually a good bet that if you go with a lesser known commercial AV product you gain a significant protection advantage.
Re:Tests can never catch these bugs by Kimomaru · 2014-05-16 04:44 · Score: 1

Sadly, it's a shame that people put much faith in AV programs given their effectiveness (http://arstechnica.com/security/2014/05/antivurus-pioneer-symantec-declares-av-dead-and-doomed-to-failure/). I think author R.R. Martin has it right (https://www.youtube.com/watch?v=X5REM-3nWHg), keep separate machine for different purposes - one for serious work and one for messing around with. It doesn't feel like a good idea to use one machine for everything.
Re:Tests can never catch these bugs by Gunboat_Diplomat · 2014-05-16 04:57 · Score: 1

Sadly, it's a shame that people put much faith in AV programs given their effectiveness (http://arstechnica.com/security/2014/05/antivurus-pioneer-symantec-declares-av-dead-and-doomed-to-failure/). I think author R.R. Martin has it right (https://www.youtube.com/watch?v=X5REM-3nWHg), keep separate machine for different purposes - one for serious work and one for messing around with. It doesn't feel like a good idea to use one machine for everything.
Symantec is mixing up stuff here to try to position themselves for the new hot profitable APT market. For one; the context of this quote about AV being dead was a WSJ interview with the CEO where he said it in the context if Symantec being able to increase their profit, as AV has become quite cheap and APT is getting all the nice profit margin - it was not said in a context of user need, but in a context of Symantec profit need.
Then they mix up some statistics about targeted advanced hacker attacks (APT), which of course isn't stopped by AV, but it doesn't make the treat from traditional malware any less. All reports and research show that regardless of APT, the threat from standard malware is increasing, not decreasing (just as those hit by Cryptolocker..).
Yes, AV is not 100%. There will be APT type attack that bypass it, and there will be time periods with brand new malware that bypass it. But that last point is often overblown. Well over 90% of actual real world infections are from known malware that would be stopped by a good AV program. Even a condom isn't 100% safe, that doesn't mean that it is meaningless to use a condom.
Re:Tests can never catch these bugs by bluefoxlucid · 2014-05-16 05:01 · Score: 1

Actually, a static checker did find the OpenSSL bug, but nobody used Frama-C to check OpenSSL. Any parametric fuzzing would have caught the OpenSSL bug as well: give it construction of the packet and say, "Vary the data in this fixed length field, vary data and size of this variable-length field." Such tests only account for what types of data come through the program, and may cause strange behavior.
Test-driven development would also have caught Heartbleed. Similar to fuzzing, TDD would produce valid and invalid tests and their valid results. For example, a TLS Heartbeet test would send a valid and an invalid request and check each response against an expected response. Most likely, this process would prevent bugs like Heartbleed; it would later fail in regression, i.e. if the test expects 65000 "HELLO" to close connection but instead gets back a response.
Various tests can and have caught these bugs.

--
Support my political activism on Patreon.
Re:Tests can never catch these bugs by Laxori666 · 2014-05-16 05:02 · Score: 2

Did you actually read the article? The particular apple bug would have been easily caught by testing. In fact, the handshake code was copy-pasted six times in the code, and only one of the copies had the bug... if the developer had thought about about testing at all, that code would have been factored into one function, and even just by doing so the bug would have been less likely.
Re:Tests can never catch these bugs by Kimomaru · 2014-05-16 05:13 · Score: 1

Possible, but even assuming this, the main issue is that AV in general is considered a relevant safety measure when perhaps it should not be. The assumption by itself can lead to a false sense of security. Frankly, I'd rather run multiple VMs on a machine at the very least - MS Windows for games and Debian for serious work. I don't do serious work on a Windows machine or on any Apple device for that matter - I'd rather my OSs and apps be open source and subject to comminity scrutiny.

Two Code Smells by SuperKendall · 2014-05-16 04:27 · Score: 1

After reading TFA, I'm not sure I like the suggested approach to the "fix" in the code by replacing the two if blocks with a common method where you pass in all sorts of parameters.

Yes duplicate code is bad, I agree that's a "code smell" (one of the worst coding terms every to be invented BTW).

But just as odiferous to me, is a method with a billion arguments like the combined extracted method has. Sure duplicate if statements smell bad, but the replacement is worse and also harder to comprehend.

I know it's in theory more testable, but at what cost when the code is more obsfucated? If the code and tests are harder to understand are you really better off?

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

Re:Two Code Smells by bluefoxlucid · 2014-05-16 05:02 · Score: 1

Code smell? What the fuck? What was wrong with "antipattern"?

--
Support my political activism on Patreon.
Re:Two Code Smells by SuperKendall · 2014-05-16 06:06 · Score: 1

I totally agree. It sounds better in every way and is clearer to boot.

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

Testing isn't Perfect by Anonymous Coward · 2014-05-16 04:28 · Score: 1

If you're selling that you coulda/woulda caught all X for X that haven't happened yet, you're selling snake oil. The reality is that this computer stuff is a little harder than it looks to do properly, and if all you have to offer is marketing bullshit and a History of Art degree, maybe you should leave it to the professionals, and push for budget to do things correctly rather than just do them.

Re:Testing isn't Perfect by jeffmeden · 2014-05-16 04:47 · Score: 1

If you're selling that you coulda/woulda caught all X for X that haven't happened yet, you're selling snake oil. The reality is that this computer stuff is a little harder than it looks to do properly, and if all you have to offer is marketing bullshit and a History of Art degree, maybe you should leave it to the professionals, and push for budget to do things correctly rather than just do them.
But PC-Lint has been successful at finding _every_ bug in Dr Dobbs...

Worth repeating... by QuietLagoon · 2014-05-16 04:28 · Score: 4, Interesting

The ultimate responsibility for the failure to detect this vulnerability prior to release lies not with any individual programmer but with the culture in which the code was produced.

.
I've often said that you don't fix a software bug until you've fixed the process that allowed the bug to be created. The above quote is of a similar sentiment.

Re:Worth repeating... by radtea · 2014-05-16 05:06 · Score: 4, Insightful

I've often said that you don't fix a software bug until you've fixed the process that allowed the bug to be created.
One of the things that struck me about the goto fail bug was that it was specifically engineered out of coding best practices in the '90's.
Any reasonable coding standard from that time forbade if's without braces for precisely this reason. And yeah, that's a "no true Scotsman" kind of argument (if a coding standard didn't contain such a clause it was not by my definition "reasonable") but the point still holds: software developers at the time were aware of the risk of open if statements causing exactly this kind of failure, because we had observed them in the wild, and designed coding standards to reduce their occurrence.
So to be very specific about what kind of processes and culture would have prevented this bug: a reasonable coding standard and code reviews would have caught it (much of the code review process can be automated these days), and a culture of professionalism is required to implement and maintain such things.
The canonical attribute of professionals is that we worry at least as much about failure as success. We know that failures will happen, and work to reduce them to the bare minimum while still producing working systems under budget and on time (it follows from this that we also care about scheduling and estimation.)
Amateurs look at things like coding standards and reviews and say, "Well what are the odds of that happening! I'm so good it won't ever affect my code!"
Professionals say, "The history of my field shows that certain vulnerabilities are common, and I am human and fallible, so I will put in place simple, lightweight processes to avoid serious failures even when they have low probability, because in a world where millions of lines of code are written every day, a million-to-one bug is written by someone, somewhere with each turn of the Earth, and I'd rather that it wasn't written by me."
It's very difficult to convince amateurs of this, of course, so inculcating professional culture and values is vital.

--
Blasphemy is a human right. Blasphemophobia kills.
Re:Worth repeating... by pla · 2014-05-16 05:12 · Score: 1

I've often said that you don't fix a software bug until you've fixed the process that allowed the bug to be created. The above quote is of a similar sentiment.

Sounds great! Now just show me a program (more complex than "Hello World") with no bugs.

Yes, culture can play a large role in the frequency and severity of bugs released into production code. But humans make mistakes, simple as that. No amount of code reviews or test suites or BS like "pair programming" can ever get around that basic fact.

Or perhaps as a better example of the problem with that philosophy - Show me a program with no bugs that used OpenSSL. In that case, even the trivial "Hello World" example would have a serious bug completely out of the control of the developer.
Re:Worth repeating... by drinkypoo · 2014-05-16 07:21 · Score: 1

Yes, culture can play a large role in the frequency and severity of bugs released into production code. But humans make mistakes, simple as that. No amount of code reviews or test suites or BS like "pair programming" can ever get around that basic fact.
Uh, you have gone perpedicular to the problem here. Code reviews and test suites are things we have because of that basic fact.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:Worth repeating... by Shatrat · 2014-05-16 07:32 · Score: 1

It's like the Einstein quote, "We can not solve our problems with the same level of thinking that created them'"

--
09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
Re:Worth repeating... by Eunuchswear · 2014-05-16 07:55 · Score: 1

For fucks sake, in the 70's we got rid of programming languages where it was even possible to omit the fucking 'braces' - Algol68 replacing algol 60, Fortran 77 replacing FORTRAN 66.
Then those Pascal and BCPL reborn losers came along, and it all gies go hell.

--
Watch this Heartland Institute video
Re:Worth repeating... by jc42 · 2014-05-16 10:53 · Score: 1

I've often said that you don't fix a software bug until you've fixed the process that allowed the bug to be created. The above quote is of a similar sentiment. Sounds great! Now just show me a program (more complex than "Hello World") with no bugs.
Of course, it has been occasionally pointed out that the canonical "Hello World" program (from the "C Bible") actually has a bug. Granted, it's not one that you're ever likely to observe in the wild, and good luck writing malware to exploit it. But most programmers, even expert C programmers, can't spot it despite being trivially obvious when pointed out. This is actually a fairly nice example of how difficult it can be to write bug-free software, and I'd wonder if it was done intentionally in that book "with malice aforethought". ;-)

--
Those who do study history are doomed to stand helplessly by while everyone else repeats it.
Re:Worth repeating... by Chelloveck · 2014-05-16 11:08 · Score: 1

Have you read the BSD style(9) man page? It specifically recommends omitting unnecessary braces. In an organization which follows this style guide, not only would the lack of braces not be flagged in a code review but if there were braces around a single-statement 'if' clause the reviewers might require that they be removed. Now, given that OSX is derived from BSD...

Use a space after keywords (if, while, for, return, switch). No braces are used for control statements with zero or only a single statement unless that statement is more than a single line, in which case they are permitted.

[...]

Closing and opening braces go on the same line as the else. Braces that are not necessary may be left out. if (test) ....stmt; else if (bar) { ....stmt; ....stmt; } else ....stmt;

(Leading dots added as placeholders because why the hell would anyone ever want to post code samples on a News for Nerds site, anyway?)
It should be noted that the "super-secure" OpenBSD platform ships with this same style guide. FWIW, I agree with you that braces should be mandatory. I think this is a supremely dumb recommendation.

--
Chelloveck
I give up on debugging. From now on, SIGSEGV is a feature.
Re:Worth repeating... by Darinbob · 2014-05-16 11:25 · Score: 1

Probably a lack of code reviews, or if they had code reviews not enough reviewers were being pedantic about style. I can actually understand this, because when I give reviews I do get pushback when pointing out violations of the local coding standards. I suspect a lot of people do code reviews only superficially and avoid any review of design/optimization/style specifically to avoid the stress of arguing about it.
Re:Worth repeating... by linuxrocks123 · 2014-05-17 10:39 · Score: 1

main() { printf("hello, world"); } Missing return from main, undefined behavior after the printf. What do I win? More seriously, I think it's an overstatement that "even expert C programmers can't spot this". It jumped out at me right when I looked at it. I do a lot of C++, but not much C. I had to do some research to make sure it really was undefined behavior in C when I noticed it, but it did jump right out at me, and I wouldn't have let someone off for this in a code review, because it's obviously either undefined or bad style.

--
vi ~/.emacs # I'm probably going to Hell for this.
Re:Worth repeating... by oldCoder · 2014-05-19 09:03 · Score: 1

As you have proven, standards are not enough.
Modern languages implement this stuff in the tools and compiler and language spec. For example in Go, code is formatted automatically, showing the problem.
Dead code warnings would also have prevented this. The struct hashOut is given a value but not used. Tools can detect that sort of error. Even compilers can be built that error out on this. If you are willing to go to newer languages.

--

I18N == Intergalacticization

Neatness counts by mariox19 · 2014-05-16 04:31 · Score: 3, Insightful

if ((err = SSLHashSHA1.update( &hashCtx, &signedParams)) != 0) goto fail; goto fail;

Those familiar with the C programming language will recognize that the first goto fail is bound to the if statement immediately preceding it; the second is executed unconditionally.

Sorry, but it needs to be said: this is sloppy, he-man coding. Is there a problem with using brackets? Is it your carpal tunnel syndrome? Are you charged by the keystroke?

This is how mistakes happen. For shame!

--

quiquid id est, timeo puellas et oscula dantes.

Re:Neatness counts by BronsCon · 2014-05-16 04:36 · Score: 1

if ((err = SSLHashSHA1.update( &hashCtx, &signedParams)) != 0) { goto fail; } goto fail;
Seems as though it still would/could have happened. Would it have been easier to catch? Likely. Still would have happened, though.

--
APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
Re:Neatness counts by Anonymous Coward · 2014-05-16 04:49 · Score: 2, Insightful

if ((err = SSLHashSHA1.update( &hashCtx, &signedParams)) != 0) {
goto fail;
}
goto fail;
Seems as though it still would/could have happened. Would it have been easier to catch? Likely. Still would have happened, though.
True, it COULD have happened. But that's a helluva lot more obvious.
And if you RTFA (I know....), the author really had to contrive a BS example with mismatched braces to make a case against requiring braces on all conditional code even if they're only one line.
If mismatched braces is your "proof" that a code standard that requires braces all the time doesn't help prevent the creation of bugs like this, you're really desperate.
Re:Neatness counts by jeffmeden · 2014-05-16 05:05 · Score: 1

It's not clear to me how using brackets would have helped. The code would have failed even if there were brackets around one or the other or both of the offending statements. And it's not clear if the additional brackets would have increased the likelihood that the mistake would have been noticed.
Brackets around the entire IF would have caused it to work as expected, the second `goto fail;' would simply never get used. Brackets around only the first would have caused the second to look a lot more out of place, at least as much more as two identical lines on identical indents could. Where is the indent regime on this issue anyway? Isn't that even more of a style atrocity than foregoing brackets? Nothing like intentionally making your code look like assembly spit from a debugger to make it easy to maintain.
Re:Neatness counts by jones_supa · 2014-05-16 05:48 · Score: 1

if ((err = SSLHashSHA1.update( &hashCtx, &signedParams)) != 0) goto fail; goto fail;

Making an assignment inside the if test makes it also more ambiguous. I would have gone with:
err = SSLHashSHA1.update(&hashCtx, &signedParams); if (err != 0) { /* code... */ } /* code... */
Re:Neatness counts by spatley · 2014-05-16 11:57 · Score: 1

i will go you one further and say that the more open style of braces would have shown the bug quite clearly if ((err = SSLHashSHA1.update( &hashCtx, &signedParams)) != 0) { goto fail; goto fail; if ((err = SSLHashSHA1.final( &hashCtx, &hashOut)) != 0) { goto fail; } } // if you always put a brace on the line after the evaluation of an if, and tab in, the nesting will be obvious.

Lack of static and structural coverage analysis by postmortem · 2014-05-16 04:37 · Score: 1

So good test should catch this goto fail for sure, either functional test or an unit test. Looks like neither are thorough for the library.

Bot more importantly, if static analysis or structural coverage of code was done, both would point out that there is something wrong with the code.

All of these testing strategies should be done for such s critical piece of software.

Fuzz Testing. Next! by VortexCortex · 2014-05-16 04:38 · Score: 1

Automated unit test stubs with range checking and input fuzzing. Took me two weekends to build one atop Doxygen. If your build environment does not do this you're maliciously stupid.

Unit tests are just one tool by g01d4 · 2014-05-16 04:55 · Score: 1

FTFA:

Compiler and static-analysis warnings also could have detected the unreachable code, though false warnings might have drowned out the signal if such tools weren't already being used regularly.

I'd purpose that these tools weren't being used properly rather than turning the issue into a nail for the unit testing hammer.

Re:It takes brains by jeffmeden · 2014-05-16 04:57 · Score: 4, Insightful

I've been in this field for 20+ years now, and I don't necessarily (in fact, I usually don't) agree with whatever the current trend is (which is probably why my karma is negative). One underlying trend, has been to make software something that can be made by anyone - to remove the requirement of having a special mind that is able to think through algorithms and code. This has generally been accomplished through process, and abstraction. Process - if we can describe a method well enough, then anyone should be able to follow it to it's logical conclusion. Abstraction - we keep adding layers upon layers in an effort to simplify and streamline that which is a complex thing (lots of numbers in sequence to control a microprocessor and it's accompanying hardware). You can probably tell that I'm not a great fan of either - though I'm really really trying to not be a negative type, and to go with the flow more. But I can't help my fundamental feelings that there is just no substitute for a smart individual with a gift of understanding the logic of code. I'm always against process because it takes the gift that i was given and neutralizes it. Personal feelings aside, I just don't think that all the process in the world is ever going to get ahead of the curve that is the battle between perfectly functional software and bugs.

If you make brilliant code that only you can understand, sorry to be harsh but you aren't that brilliant. We definitely need to value people who can generate and perfect algorithms, but do you think anyone would remember/value the Pythagorean Theorem if it was 40 steps long? No, he thought of a (then brilliant) way to do it simply and easily so that one only needs to understand basic math to pull it off. This is what we need more of; a single elegant algorithm that is so short it is hard to misuse is better than 1,000 algorithms that are all so hard to understand that only the author knows exactly how it works and will be forgotten as soon as the particular language or application fades into the past.

-Wall -Werror by Megane · 2014-05-16 04:59 · Score: 4, Interesting

Turning on all warnings and forcing them to errors certainly would have caught the bug in Apple's SSL code. Anyone who just lets warnings fly by in C code is an idiot. Even if the warning is mildly silly, getting it out of the way lets the important warnings stand out. Sensible warnings from C compilers are the very reason we don't use lint anymore. Even then you still have to watch out, because some warnings won't appear at low optimization levels, and I recall hearing that there are a few obscure warnings not turned on by -Wall.

Also, it could have possibly been introduced by a bad merge. One of the things that putting braces on every if/for/while/etc. does is give merges more context to keep from fucking up, or at least a chance to cause brace mismatch.

As for Heartbleed, just the fact that the code wouldn't work with a compile time option to use the system malloc instead of a custom one should have been enough to raise some red flags. Because rolling your own code to do something "more efficiently" than the system libraries never introduces new problems, right?

--
#naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }

Re:-Wall -Werror by roger10-4 · 2014-05-16 05:53 · Score: 1

Need to explicitly add -Wunreachable-code. Annoyingly, "-Wall" doesn't catch this particular error (at least on the versions of gcc I've used).
Re:-Wall -Werror by Qzukk · 2014-05-16 09:55 · Score: 1

One of the things that putting braces on every if/for/while/etc. does is give merges more context to keep from fucking up, or at least a chance to cause brace mismatch.
I'm not so sure. I've lost track of the number of times where patch has chosen a completely random

....} ..} }

to wedge a new } else { in, because at the end of a block, all the braces look the same.
I put an end to that by ending blocks with } // if (foo)

--
If I have been able to see further than others, it is because I bought a pair of binoculars.
Re:-Wall -Werror by Eunuchswear · 2014-05-16 10:38 · Score: 1

macro abuse.
C has a huge design bug - pragmas can't be used in macros.

--
Watch this Heartland Institute video
Re:-Wall -Werror by rabtech · 2014-05-16 11:17 · Score: 2

Turning on all warnings and forcing them to errors certainly would have caught the bug in Apple's SSL code. Anyone who just lets warnings fly by in C code is an idiot. Even if the warning is mildly silly, getting it out of the way lets the important warnings stand out. Sensible warnings from C compilers are the very reason we don't use lint anymore. Even then you still have to watch out, because some warnings won't appear at low optimization levels, and I recall hearing that there are a few obscure warnings not turned on by -Wall.
Let me quote from one of the best-tested and most widely used projects out there, SQLite, from http://www.sqlite.org/testing....

Static analysis has not proven to be especially helpful in finding bugs in SQLite. Static analysis has found a few bugs in SQLite, but those are the exceptions. More bugs have been introduced into SQLite while trying to get it to compile without warnings than have been found by static analysis.

The bolded part has been my experience unfortunately. Static analysis is nearly useless.
An appropriate test for something like an SSL stack is a separate test harness that "fuzzes" the stack by exploring large random combinations of values, some with known good certificates and others with randomly generated (and thus broken) ones. These days one can spin up thousands of VMs, run a massive suite of billions of test cases in parallel over a few hours, then spin them down and spend a relatively small sum of money.
And yes, the test harness for something like this is probably going to exceed the # of lines of code of the actual implementation by an order of magnitude. For really important security-critical stuff like cryptography, SSL/TLS, keychain management, etc it is well worth the effort.

--
Natural != (nontoxic || beneficial)
Re:-Wall -Werror by Plumpaquatsch · 2014-05-18 06:19 · Score: 1

Need to explicitly add -Wunreachable-code. Annoyingly, "-Wall" doesn't catch this particular error (at least on the versions of gcc I've used).
Not only that: "later" versions of gcc (like 4.5.2 & 4.7.3) have removed support for -Wunreachable-code without warning that the flag isn't supported. http://gcc.gnu.org/ml/gcc-help/2011-05/msg00360.html

--
Of course news about a fake are Fake News.

Reeks of a terrible article by rebelwarlock · 2014-05-16 05:14 · Score: 2

We have some lovely elements coming together right here on the slashdot blurb:

1. Stupid pun instead of a descriptive title
2. Full caps in the article excerpt
3. Trying to bring up coding "culture"
4. Assertion that it totally could have been caught beforehand, but they aren't sure exactly how.

Somehow, I don't think I'm missing much by not reading the article.

Re:Reeks of a terrible article by jones_supa · 2014-05-16 05:56 · Score: 1

The young whippersnappers are to blame.

Merge Conflict by znigelz · 2014-05-16 05:15 · Score: 3, Insightful

This is clearly the automatic resolution of a merge conflict by the versioning control software. These are such a nightmare to debug and happen all the time. Developers rarely check their entire change visually post merge. Though this can be found using static analysis that force coding standards (such as forcing the use of brackets or proper indentation for the lexical scope). Though the bugs from automatic conflict resolution can only be really improved through better versioning software. These are without question the worst and most frustrating bugs.

Re:Merge Conflict by phantomfive · 2014-05-16 05:56 · Score: 1

Oh yeah, you are surely right, now that I think about it. If anyone looks at this code, they will easily see which single line caused the problem:

if ((err = ReadyHash(&SSLHashSHA1, &hashCtx)) != 0) goto fail; if ((err = SSLHashSHA1.update(&hashCtx, &clientRandom)) != 0) goto fail; if ((err = SSLHashSHA1.update(&hashCtx, &serverRandom)) != 0) goto fail; if ((err = SSLHasehSHA1.update(&hashCtx, &signedParams)) != 0) goto fail; goto fail; if ((err = SSLHashSHA1.final(&hashCtx, &hashOut)) != 0) goto fail; It's a problem that could have been caught easily with a dead-code static analysis. Or someone looking at the code (maybe Apple doesn't do code review on every check-in?) Or stepping through the code in a debugger (though I guess that's rare to do after it's already been committed). Or a few unit tests on this function.

There are lots of ways this bug could have been avoided.

(PS. Is there no way to insert a space into Slashdot? Is there no way to insert properly indented code? That would be better than working on beta).

--
"First they came for the slanderers and i said nothing."

Re:It takes brains by Pro923 · 2014-05-16 05:17 · Score: 1

I kinda agree... I mean, I never go out of my way to artificially complicate my code. I'm not one of those people that uses macros just for the sake of showing how clever I can make them. The problem is that - to use your example - how many people do we have now that actually learn how to derive the Pythagorean Theorem? How do we build on that? The gifted people that COULD build on it, can't - because they're sandboxed into a process, or a higher level abstraction. My kids - they were in awe of the Saturn V, the F-1 Engine - all of that technology (with the slide rules!) and knowhow that we had back in the 60's (after watching Apollo 13)... I told them that I didn't think we had the capacity to do that anymore. Then I read an article recently where they were looking at the F-1 engine like it was some alien artefact. Point being, I just don't think that we value our smart people anymore because we just stick them into the same process as every dope that has a connection to get his job. We need find the smart people, encourage them with financial incentives and get THEM to write the code.

Quo Bono by ThatsNotPudding · 2014-05-16 05:31 · Score: 1

Who benefitted? The TLAs. Accidental, my ass.

Where's the revision history? by Animats · 2014-05-16 05:35 · Score: 1

The article doesn't give the revision history of how that code got there. Who put it there, when did they put it there, and what did the code look like before and after they put it there. We need the names!

Use compiler warnings by roger10-4 · 2014-05-16 05:41 · Score: 1

It's remarkable how many organizations don't enable aggressive compiler warnings (or worse, ignore or disable them). One of the best practices I've learned is to turn on every warning that you possibly can and use the option that treats all warnings as compiler errors. The code from Apple may have been properly unit tested. However, if this was the result of a bad automated merge, unit tests are often not repeated on the resulting code base headed for system test. The GCC "-Wunreachable-code" option would have caught this type of error.

Overconfidence in unit tests... by Junta · 2014-05-16 05:42 · Score: 1

The article contains the same flaw that people who rabidly declare unit tests as a panacea. The article basically shows that after discovery of a bug, a unit test can retroactively be constructed that would have caught the bug, therefore it's inexcusable that the bug got released, ignoring the fact that is hindsight. Unit tests are not without their utility certainly, but practically speaking you will not be able to construct unit tests that catches every single possible scenario. This is tricky enough for trying to catch functional problems, but for security problems where an adversary is explicitly trying to bend something beyond even what the developer conceived of in design, unit tests become even more tricky. If someone has the foresight in implementing a feature to craft a test case to explicitly try malicious things, then they probably wouldn't have messed up the code in the first place. Of course, there is value in having the first developer with that awareness institute such a test case so that a follow up activity gets checked, but I think in most of the cases the bug came with the first checkin of the function, meaning the developer just never considered the possibility at all. This means they made buggy code and they would have or in fact did also made inadequate test cases. You can't just say 'if Apple had done unit tests, their code would have been perfect!'. There are projects without unit tests that fare pretty well and there are projects with unit tests that fail miserably in terms of quality.

I have heard people claim with a straight face that they now have '100% coverage' through unit tests and then go on to say at-will releases are therefore safe to do without any particular testing.

--
XML is like violence. If it doesn't solve the problem, use more.

Re:Overconfidence in unit tests... by russotto · 2014-05-16 11:06 · Score: 1

The article contains the same flaw that people who rabidly declare unit tests as a panacea. The article basically shows that after discovery of a bug, a unit test can retroactively be constructed that would have caught the bug, therefore it's inexcusable that the bug got released, ignoring the fact that is hindsight.

The function was supposed to check various ways a key exchange message could be screwed up. The minimum set of unit tests appropriate for such a function are pretty clear -- feed it messages that are screwed up in each different way, and make sure all of them fail. And feed it a message that is not screwed up and make sure it succeeds. This won't catch everything, but it would have caught this one.
There are lots of times it's unreasonable to expect a developer to have written a unit test which would have caught the bug; this isn't one of them.

Re:Fuzz Testing. Next! by msclrhd · 2014-05-16 05:50 · Score: 1

They are all tools that can be applied to improve the quality of the code. No one thing is "The Solution".

* Test Driven Development (TDD) is a good approach to ensure that the code you write is testable. This will not work for things like UI code, but other code will benefit.

* Unit Tests can either be developed via a TDD-like approach (easier to do), or after the code is written (harder to do).

* Automated Regression Tests (a superset of Unit Tests) provide good coverage for ensuring code works as expected without involving a large manual testing team. These will only detect the things covered by the automated tests.

* Static Code Analysis tools can pick up a lot of problem areas, but will not detect every problem. These results can be used to identify what tests need to be created to prevent future regression.

* Fuzz testing is good at providing strange data to e.g. a protocol or file format parser. These are intended to be soak tests -- e.g. "does my regular expression parser handle all these strange and possibly invalid constructs". Fuzz testing would have most likely found the heartbleed bug (because it would have permutated the length of data to request). Any failures here should be converted to Unit/Regression tests to ensure that the problem is (a) fixed by any code changes made and (b) does not occur in the future. Fuzz testing will typically find hard to identify bugs (e.g. data races) that are not easy to identify from manually constructed tests or static analysis.

* Manual/ad hoc testing is important as it can uncover bugs that the developers are not aware of.

* Code and Security Reviews help identify potential issues (e.g. if you have someone knowledgeable about SQL injection, they can assess whether some code is vulnerable to that attack).

None of these is a silver bullet, but the more you have the better the code will be.

Re:It takes brains by Junta · 2014-05-16 05:55 · Score: 2

If you make brilliant code that only you can understand

There's a false dichotomy here. He said that only *some* are qualified enough to create solutions to complex problems. You are saying his claim is that only *one* can understand, implying that the problem can't possibly be too hard, and that any hard code to follow is just because the developer is terrible at coding.

As a counter to your example of the Pythagorean Theorem, what about post-graduate math and science? There are tons of things which would make 40 steps seem easy by comparison. Should society forgo those just because only some people are realistically going to be able to understand and apply that correctly?

A very ubiquitous situation is that with the 'anyone can understand it or else it shouldn't exist at all' philosophy, there is no way we'd have cryptographic libraries at all.

I will agree that his stance against processes is a bit too harsh, but I've been around enough to know in some scenarios such a jaded perspective would be perfectly understandable. I've seen some projects that had appropriate and helpful processes that did help quality, but been witness to many many more that had ineffective process that achieved nothing but create busy work while still churning out crap code.

--
XML is like violence. If it doesn't solve the problem, use more.

And how deep can we test? by gwolf · 2014-05-16 06:22 · Score: 1

Of course, it's obvious today that a test for behavior on inconsistent requests should have been done in OpenSSL. As well as a test for each failure cause should have been done by Apple. And next week, when an off-by-one bug bites us on an integer overflow in libfoobar, people will say testing for that condition should have been trivial.

So, yes, some conditions can be found with fuzzers. Of course, fuzzers work in an erratic way, and not all bugs can be triggered by them. But maybe fuzzing our code (more importantly, our security-sensitive code) will yield better results than preparing tests for those components in the system we are aware of.

Then again, properly fuzzing takes quite a bit of time. It is way less fun to watch a fuzzer than to see tests making green check marks...

The culture by computational+super · 2014-05-16 06:59 · Score: 2

Yeah, the "culture" is "hurry up and get it done so you can get on to the next thing because if something takes more than an hour to do it's not worth doing" and it exists in every single software development organization on planet Earth. Until these things actually start costing real money to people with real power, this will continue.

--
Proud neuron in the Slashdot hivemind since 2002.

Simple Unit Test to catch Apple's bug. by Decameron81 · 2014-05-16 08:20 · Score: 1

At least Apple's bug could've been caught with basic unit-testing. This is the snippet of code from Apple's bug:

static OSStatus SSLVerifySignedServerKeyExchange(SSLContext *ctx, bool isRsa, SSLBuffer signedParams, uint8_t *signature, UInt16 signatureLen) { OSStatus err; ...

if ((err = SSLHashSHA1.update(&hashCtx, &serverRandom)) != 0) goto fail; if ((err = SSLHashSHA1.update(&hashCtx, &signedParams)) != 0) goto fail; goto fail; if ((err = SSLHashSHA1.final(&hashCtx, &hashOut)) != 0) goto fail; ...

fail: SSLFreeBuffer(&signedHashes); SSLFreeBuffer(&hashCtx); return err; }

Just implement a unit test with the following logic:

1. When SSLHashSHA1.update() is called, DO NOT return an error.
2. Expect 2 calls to SSLHashSHA1.update() and check the input parameter on each call.
3. Expect 1 call to SSLHashSHA1.final() and check the input parameters are what you'd expect.

That simple unit test would've caught this issue without any need of duplicating code.

--
diegoT

Re:Simple Unit Test to catch Apple's bug. by Decameron81 · 2014-05-16 08:23 · Score: 1

PS: this is pretty obvious while unit-testing but I'll make it clear to avoid any confusion... the real implementation of SSLHashSHA1.update() and SSLHashSHA1.final() would not be called in this unit test, as that'd be outside of the scope of it.

--
diegoT

So , is there a tool that finds every occurrence? by Marrow · 2014-05-16 09:10 · Score: 1

I mean, if unbraced if statements are so deadly, then why are they not outlawed unless specifically allowed by compiler directive. On your head be it.

Bug is somewhere else by gnasher719 · 2014-05-16 10:15 · Score: 2

Ok, writing "goto fail;" twice in a row is a bug. But it's not the real bug. This code was checking whether a connection was safe, and executed a "goto fail;" statement if one of the checks failed. It also executed one "goto fail;" by accident, skipping one of the checks. But one would think that a statement "goto fail;" would make the connecction fail! In that case, sure, there was a bug, but the bug should have led to all connections failing, which would have made it obvious to spot (because the code wouldn't have worked, ever).

So the real problem is a programming style where executing a statement "goto fail;" doesn't actually fail! If a function returns 0 for success / non-zero for failure like this one, it should have been obvious to add an "assert (err != 0)" to the failure case following the fail: label. And that would have _immediately_ caught the problem. There should be _one_ statement in the success case "return 0;" and _one_ statement in the failure case "assert (err != 0); return err;".

Re:It takes brains by spatley · 2014-05-16 13:11 · Score: 1

It takes a genius to write code that can be understood by an idiot.

Microsoft has gone this route for many years by Trax3001BBS · 2014-05-16 16:36 · Score: 1

With buffer overflows, over and over it's said they can be tested for, that there's no just reason for buffer overflows in this day and age.

The fact is it takes more money, more time, and can easily be patched if one pops up.

Slashdot Mirror

Finding More Than One Worm In the Apple

79 of 116 comments (clear)