Optimizations - Programmer vs. Compiler?

← Back to Stories (view on slashdot.org)

Optimizations - Programmer vs. Compiler?

Posted by Cliff on Friday February 25, 2005 @08:46AM from the who-can-obfuscate-better dept.

Saravana Kannan asks: "I have been coding in C for a while (10 yrs or so) and tend to use short code snippets. As a simple example, take 'if (!ptr)' instead of 'if (ptr==NULL)'. The reason someone might use the former code snippet is because they believe it would result in smaller machine code if the compiler does not do optimizations or is not smart enough to optimize the particular code snippet. IMHO the latter code snippet is clearer than the former, and I would use it in my code if I know for sure that the compiler will optimize it and produce machine code equivalent to the former code snippet. The previous example was easy. What about code that is more complex? Now that compilers have matured over years and have had many improvements, I ask the Slashdot crowd, what they believe the compiler can be trusted to optimize and what must be hand optimized?" "How would your answer differ (in terms of the level of trust on the compiler) if I'm talking about compilers for Desktops vs. Embedded systems? Compilers for which of the following platforms do you think is more optimized at present - Desktops (because is more commonly used) or Embedded systems (because of need for maximum optimization)? Would be better if you could stick to free (as in beer) and Open Source compilers. Give examples of code optimizations that you think the compiler can/can't be trusted to do."

55 of 1,422 comments (clear)

Ask the compiler... by inertia187 · 2005-02-25 08:48 · Score: 5, Funny

Programmer: Hey, compiler. How do you like optimizing?
Compiler: Optimizing? Optimizing? Don't talk to me about optimizing. Here I am, brain the size of a planet, and they've got me optimizing inane snippets of code. Just when you think code couldn't possibly get any worse, it suddenly does. Oh look, a null pointer. I suppose you'll want to see the assembly now. Do you want me to go into an infinite loop or throw an exception right where I'm standing?
Programmer: Yeah, just show me the stack trace, won't you compiler?

--
A programmer is a machine for converting coffee into code.
1. Re:Ask the compiler... by llamalicious · 2005-02-25 09:36 · Score: 5, Funny
  
  And then the compiler shared his view of the universe with the programmer, who promptly committed suicide.
Clear Code by elysian1 · 2005-02-25 08:48 · Score: 5, Insightful

I think writing clear and easy to understand code is more important in the long run, especially if other people will have to look at it.
1. Re:Clear Code by normal_guy · 2005-02-25 08:50 · Score: 5, Insightful
  
  That should be "especially _since_ other people will have to look at it."
  
  --
  
  Linux: Free if your time is worthless.
2. Re:Clear Code by daveho · 2005-02-25 08:52 · Score: 5, Insightful
  
  I agree 100%. Write code that is easy to understand and modify, then optimize it, but only after you have profiled it to find out where optimization will actually matter .
3. Re:Clear Code by david+duncan+scott · 2005-02-25 09:23 · Score: 4, Interesting
  
  Yup, that's why your bank throws away all three zillion lines of COBOL every year -- because there's a greater risk in maintenance than in new code.
  
  I wish I could put my hands on an article I read a couple years back on the code in the Space Shuttle. They go at that code base with an attitude that makes the average paranoid look happy-go-lucky. In fact, they approach software engineering kind of like other engineers do -- as if lives depended on it. It's old, it's slow, and it works. (Oh, wait, here it is.) That's how code is maintained.
  
  --
  This next song is very sad. Please clap along. -- Robin Zander
4. Re:Clear Code by Rei · 2005-02-25 09:56 · Score: 5, Insightful
  
  An important lesson that I wish I had learned when I was younger ;) It is crazy to start optimizing before you know where your bottlenecks are. Don't guess - run a profiler. It's not hard, and you'll likely get some big surprises.
  
  Another thing to remember is this: the compiler isn't stupid; don't pretend that it is. I had senior developers at an earlier job mad at me because I wasn't creating temporary variables for the limits of my loop indices (on unprofiled code, nonetheless!). It took actually digging up an article on the net to show that all modern compilers automatically dereference any const references (be they arrays, linked lists, const object functions, etc) before starting the loop.
  
  Another example: function calls. I've heard some people be insistant that the way to speed up an inner loop is to remove the code from function calls so that you don't have function call overhead. No! Again, compilers will do this for you. As compilers were evolving, they added the "inline" keyword, which does this for you. Eventually, the compilers got smart enough that they started inlining code on their own when not specified and not inlining it when coders told it to be inline if it would be inefficient. Due to coder pressure, at least one compiler that I read about had an "inlinedamnit" (or something to that effect) keyword to force inlining when you're positive that you know better than the compiler ;)
  
  Once again, the compiler isn't stupid. If an optimization seems "obvious" to you, odds are pretty good that the compiler will take care of it. Go for the non-obvious optimizations. Can you remove a loop from a nested set of loops by changing how you're representing your data? Can you replace a hack that you made with standard library code (which tends to be optimized like crazy)? Etc. Don't start dereferencing variables, removing the code from function calls, or things like this. The compiler will do this for you.
  
  If possible, work with the compiler to help it. Use "restrict". Use "const". Give it whatever clues you can.
  
  --
  "Lock and load, Brides of Christ!"
5. Re:Clear Code by DJStealth · 2005-02-25 10:05 · Score: 4, Insightful
  
  Take the following example that is clear, but only 1 is considered optimized. Lets say you're traversing a 2D array of data (e.g., an image). for(x=0; x < width; x++) { for(y=0; y < height; y++) { ... } } versus for(y=0; y < height; y++) { for(x=0; x < width; x++) { ... } } The latter piece of code is just as clear as the first; however, will likely run about 50 times faster than the first, due to caching issues. Will the compiler optimize the first piece of code to look like the second? Probably not (tell me if I'm wrong), as there may be a reason to process things in a particular order. In addition, the latter piece of code may actually be less clear, as in some cases, it may not read well to do height before width in the for loop. As a result, you'll still need to write code thinking about optimization.
6. Re:Clear Code by lymc · 2005-02-25 10:35 · Score: 4, Interesting
  
  Now, let me tell you the flip side of that story. I was working on the Viking Mars Lander program long ago (1976). The code for the lander programs was frozen a year before launch (which itself was a year before landing). Some dozen "books" of the assembly code were created and archived for use when the landers finally landed. All went well for about 6 months after landing. Updates were made and duly entered into the master asssembly listings. After six months of this, the listings were so xed out, and all the margin space used with notes, that errors started creeping in. Finally an uploaded patch wrote over part of the antenna pointing table, and the lander was lost, but for a fortuitious accident which allowed the table to be re-established (with much howling and gnashing of teeth). Sometimes new is better, even if it is painful.
7. Re:Clear Code by lubricated · 2005-02-25 10:48 · Score: 5, Insightful
  
  well the first thing to optimize is the algorithm. Use a O(n^2) algorithm that does the same job as an O(e^n) algorithm if you can. Algorithmical optimization makes the most difference. I am working on a program who's speed is directly proportional to how how often a particular function is called. Well, I try to reduce calls to this function by various means, no compiler I've sean can optimize an algorithm, only the implementation of it. With that I'm happy to have the compiler do the work.
  
  --
  It has been statistically shown that helmets increase the risk of head injury.
8. Re:Clear Code by drgonzo59 · 2005-02-25 11:15 · Score: 5, Interesting
  
  Same goes for code that runs on the airplanes (like Boeing passenger aircraft). In fact the developers have to prove that each possible! branch that code could ever take won't lead to unpredictable behavior or crash. If you have 100 independent 'if' statements that is at least 2^100 possibilites. The code they write is very linear, they avoid branching at all cost.
  
  There is a whole are of study involved in correctness checking, which is related to the SAT (Satisfiability) problem.
  
  The operating system choice is also interesting. Linux doesn't even come close to what they need. Having device drivers in the kernel is just not a good idea. It needs to have a separation kernel, at least that is the goal. I presently think they use the INTEGRITY operating system by Green Hills, but I could be wrong.
9. Re:Clear Code by GryMor · 2005-02-25 11:39 · Score: 5, Informative
  
  It's working on a 2d array of data and is presuming that it is ordered as such:
  
  123
  456
  789
  
  data[y][x];
  
  This, in memory, is:
  123456789
  
  Similarly, the accsess order for the second loop is:
  123456789
  
  But for the first one, it is:
  147258369
  
  The first one hits memory sequentially, which is good for caching as each cache line stores a large chunk of sequential memory.
  
  Considering hitiing the cache as oposed to hitting main memory is at least 100 times faster, you'll be lucky if the first loop is only 50 times slower.
  
  This still presumes data stored in the specified order in memory (which is common for image formats, but not the only way things are done).
  
  --
  Realities just a bunch of bits.
You should always... by Anonymous Coward · 2005-02-25 08:49 · Score: 5, Funny

Optimize. Using cryptic, short variable names also shaves valuable microseconds off compile time and run time.
1. Re:You should always... by FyRE666 · 2005-02-25 08:58 · Score: 4, Funny
  
  ... and by god don't let me see anyone using comments - comments are the devil's alphabet soup! Every programmer worth his/her salt knows that source code is self documenting...
  
  --
  Code, Hardware, stuff like that.
2. Re:You should always... by MillionthMonkey · 2005-02-25 09:02 · Score: 4, Funny
  
  But the code compiles so much faster when you turn it all into comments.
3. Re:You should always... by WindBourne · 2005-02-25 09:20 · Score: 4, Funny
  
  Sadly, some will even work better.
  
  --
  I prefer the "u" in honour as it seems to be missing these days.
4. Re:You should always... by ron_ivi · 2005-02-25 09:21 · Score: 4, Informative
  
  Using cryptic, short variable names also shaves valuable microseconds off compile time and run time.
  There's nothing wrong with short variable names.
  Thus quoth Linus in the Linux Coding Style guide.
  Chapter 3: Naming C is a Spartan language, and so should your naming be. Unlike Modula-2 and Pascal programmers, C programmers do not use cute names like ThisVariableIsATemporaryCounter. A C programmer would call that variable "tmp", which is much easier to write, and not the least more difficult to understand. HOWEVER, while mixed-case names are frowned upon, descriptive names for global variables are a must. To call a global function "foo" is a shooting offense. GLOBAL variables (to be used only if you _really_ need them) need to have descriptive names, as do global functions. If you have a function that counts the number of active users, you should call that "count_active_users()" or similar, you should _not_ call it "cntusr()". Encoding the type of a function into the name (so-called Hungarian notation) is brain damaged - the compiler knows the types anyway and can check those, and it only confuses the programmer. No wonder MicroSoft makes buggy programs. LOCAL variable names should be short, and to the point. If you have some random integer loop counter, it should probably be called "i". Calling it "loop_counter" is non-productive, if there is no chance of it being mis-understood. Similarly, "tmp" can be just about any type of variable that is used to hold a temporary value. If you are afraid to mix up your local variable names, you have another problem, which is called the function-growth-hormone-imbalance syndrome. See next chapter.
5. Re:You should always... by rjstanford · 2005-02-25 09:56 · Score: 5, Informative
  
  LOCAL variable names should be short, and to the point. If you have
  some random integer loop counter, it should probably be called "i".
  Calling it "loop_counter" is non-productive, if there is no chance of it
  being mis-understood.
  
  That last clause is an important one that often gets neglected. In fact, you should never, ever, call a variable loop_counter. That's as bad as pure reverse hungarian - it tells you how its used, not what it means.
  
  I suggest that, for all non-trivial cases (and I'd prefer to see people err verbosely than compactly), you should use descriptive names. Not loop_counter, but maybe something like curRow? It doesn't have to be long, but at least then as the loop grows over time someone can understand a piece of code more easily than having to scroll back up to check that you are indeed in the "i" loop. Its even more critical when someone comes along and adds a nested (or containing) loop. Or whatever.
  
  Same with "tmp". If its truly temporary, such as:
  
  int tmp = getFooCount();
  doSomething(tmp);
  
  then it should be removed and rewritten as:
  
  doSomething(getFooCount());
  
  If its not that temporary, give it a real name. If you insist that it is temporary then you may have a scoping issue - having variables useful but only in part of your function could indicate that your function is doing too much work. If you insist its truly temporary, scope it down: ...
  someRandomCode();
  {
  int foo = getFoo();
  doSomething(foo);
  doSomethingElse(foo);
  }
  moreCode(); ...
  
  At least now you've guaranteed that it is temporary. Better yet, just name it usefully.
  
  --
  You're special forces then? That's great! I just love your olympics!
6. Re:You should always... by Trillan · 2005-02-25 10:10 · Score: 4, Insightful
  
  With the greatest respect to Linus, but writing a kernel does not make you the authority on programming. It does make you the authority on what particular style you allow in your CVS tree, but that's it.
  
  I certainly agree that loop_counter is a bad name, though. But rather than use i, I prefer to at least make a note of what sort of objects I'm looping through.
  
  For instance:
  
  int taskI; int taskCount = GetTaskCount(); for (taskI=0; taskI<taskCount; taskI++) { ... }
  
  Code can never be 100% self documenting, but that's no reason not to settle for 0%. Whether you use CamelCase or words_broken_with_underscores is a matter of style, and you should stick with the style of the code base you're working on.
  
  Anyone who can't or won't work with multiple languages or adopt the necessary style for an existing project is a poor programmer. When you create project, you create the rules. When you work on someone else's project, you follow the rules.
Time to post the famous Knuth quote... by xlv · 2005-02-25 08:50 · Score: 4, Informative

Donald Knuth wrote "We should forget about small efficiencies, about 97% of the time. Premature optimization is the root of all evil."
Algorithms, Not Stupid Processor Tricks by American+AC+in+Paris · 2005-02-25 08:50 · Score: 5, Insightful

This is marginally away from the submitter's question, but it warrnats attention:
The sad truth is that, as far as optimization goes, this isn't where attention is most needed.
Before we start worrying about things like saving two cycles here and there, we need to start teaching people how to select the proper algorithm for the task at hand.
There are too many programmers who spend hours turning their code into unreadable mush for the sake of squeezing a few milliseconds out of a loop that runs on the order of O(n!) or O(2^n).
For 99% of the coders out there, all that needs to be known about code optimization is: pick the right algorithms! Couple this with readable code, and you'll have a program that runs several thousand times faster than it'll ever need to and is easy to maintain--and that's probably all you'll ever need.

--
Obliteracy: Words with explosions
1. Re:Algorithms, Not Stupid Processor Tricks by Flyboy+Connor · 2005-02-25 09:16 · Score: 4, Interesting
  
  Quite agree, it's about the algorithm, not about the code.
  One of the finest moments in my programming career was when my boss asked me to see if I could gain a speed improvement in a program that surveyed a huge datastore and generated volumes of text from it. This program had to run once a month, and deliver its result in the same month. The program that was originally written, unfortunately, took three months to run (it started out OK, but the data store had grown considerably). They had asked one of our "best programmers" to create a faster version of the program. He did that by reprogramming the entire thing in assembly (you may now understand why managers thought he was one of the best programmers). It took him six whole months to finish the new version. The resulting program completed the task in just about one month. However, my boss was afraid that when the datastore would grow a bit more, we would again be in trouble. That's when he asked me to look it over. I started by investigating the problem, which at first glance looked like a network traversing problem. I soon realised it could be solved by a nested matrix multiplication (which is, of course, a standard way to discover paths in a network). It was a matrix with about a million rows and columns, but since it contained only zeroes and ones (with a couple of thousands times more zeroes than ones), the multiplication was easy to implement in a fast way. Within half-a-day, I had built a prototype program in a high-level language which did the whole job in a few hours.
  While I am still pleased with this result, I really think it came off so well not because I was so smart, but because the assembly programmer was not really worthy of the name. Still, I often use this as an illustration for students who are writing illegible code and argue that it is so very fast.
2. Re:Algorithms, Not Stupid Processor Tricks by beelsebob · 2005-02-25 10:50 · Score: 4, Informative
  
  That's not really the point being made though - the question really is "is it worth spending the next week trying to make optimisations, or coding a better algorithm to do this."
  And while we're at it - n^2 vs 2^n *is* a big deal when working on 1ghz+ systems... If we have 1000 objects, an operation that takes 10 cycles and an n^2 algorithm, then we get a runtime of 0.01 seconds (10 x 1000 x 1000 cycles ), if we have a 2^n algorithm then we get a runtime in the hours, and no amount of optimizing the code in the loop (even down to one instruction) is going to get us anywhere near the n^2 algorithm.
Clear & Concise Code by kwiqsilver · 2005-02-25 08:52 · Score: 4, Interesting

It's better to write clear, legible code that saves a human minutes of reading, than complex code that might save a computer a few milliseconds of processing time per year, because human time costs more than machine time.
Also the clear code will result in fewer misinterpretations, which will mean fewer bugs (especially when the original author is not the one doing maintenance years later), further reducing costs in dollars, man hours, and frustration.
From the "Patenting Fire" department by slipnslidemaster · 2005-02-25 08:53 · Score: 4, Funny

I just checked the U.S. Patent office and sure enough, just minutes after your post, Microsoft patented "if (!ptr)" as a shorthand for "if (ptr==NULL)".

Prepare to be sued.

--

"What the hell is an aluminum falcon?"
Tradeoffs by Black+Parrot · 2005-02-25 08:53 · Score: 4, Insightful

Hard to measure, but what is the tradeoff between increased speed and increased readability (which is a prerequisite for correctness and maintainability)? And if you can estimate that tradeoff, which is more important to the goals of your application?

As a side note, it is far more important to make sure you are using efficient algorithms and data structures than to make minor local optimizations. I've seen programmers use bizarre local optimization tricks in a module that ran in exponential time rather than log time.

--
Sheesh, evil *and* a jerk. -- Jade
Most people should not bother by El+Cubano · 2005-02-25 08:53 · Score: 5, Insightful

What about code that is more complex? Now that compilers have matured over years and have had many improvements, I ask the Slashdot crowd, what they believe the compiler can be trusted to optimize and what must be hand optimized?
Programmers cost lots more per hour than computer time. Let the compiler optimize and let the programmers concentrated on developing solid maintainable code.
If you make code too clever in an effort to try to pre-optimize, you end up with code that other people have difficulty understanding. This is leads to lower quality code as it evolves if the people that follow you are not as savvy.
Not only that, but the vast majority of code written today is UI-centric or I/O bound. If you want real optimization, design a harddrive/controller combo that gets you 1 GBps off the physical platter (and at a price that consumers can afford).
Beware of habits. by SharpFang · 2005-02-25 08:54 · Score: 4, Interesting

I got in the habit of writing "readable but inefficient" code, taking care that my constructs don't get too sophisticated for the optimizer but then depending on gcc -O3 thoroughly. And then it happened I had to program 8051 clone. Then I learned there are no optimizing compilers for '51, that I'm really tight on CPU cycles, and that I simply don't know HOW to write really efficient C code.
Ended up writing my programs in assembler...

--
45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
Huh by NullProg · 2005-02-25 08:54 · Score: 4, Informative

As a simple example, take 'if (!ptr)' instead of 'if (ptr==NULL)'.

Both forms resolve to the same opcode. Even under my 6502 compiler.

CMP register,val
JNE

Enjoy,

--
It's just the normal noises in here.
1. Re:Huh by DunbarTheInept · 2005-02-25 09:11 · Score: 4, Insightful
  
  Not true. Many CPUs have a unary jump-if-zero, or a jump-if-nonzero operation. Thus the comparasin step can be bypassed since you know you're comparing to zero.
  
  However, any compiler worth anything should find that and optimize it very easily in the case where you're comparing to a constant that evaluates to zero.
  
  --
  Don't label something "offtopic" unless you know the topic well enough to tell what's on topic.
$.02 by MagicM · 2005-02-25 08:54 · Score: 4, Insightful

1) Code for maintainability
2) Profile your code
3) Optimize the bottlenecks

That said, (!ptr) should be just as maintanable as (ptr == NULL) simply because it is a frequently used 'dialect'. As long as these 'shortcuts' are used throughout the entire codebase they should be familiar enough that they don't get in the way of maintainability.
micro optimization by fred+fleenblat · 2005-02-25 08:54 · Score: 4, Insightful

What you're talking about it micro-optimization.
Compilers are pretty good at that, and you should let them do their job.

Programmers should optimize at a higher level: by their choice of algorithms, organizing the program so that memory access is cache-friendly, making sure various objects don't get destroyed and re-created unnecessarily, that sort of thing.
Wrong, wrong, wrong by JoeBuck · 2005-02-25 08:54 · Score: 4, Informative

Don't give advice when you don't know C. C requires that when a 0 is converted to a pointer, the result is NULL, so it is absolutely false to claim that NULL could be defined as -1.
"ptr == 0" must give the same result as "ptr == NULL", always.
1. Re:Wrong, wrong, wrong by Anonymous Coward · 2005-02-25 09:17 · Score: 4, Informative
  
  From the ANSI C specification, section 6.2.2.3:
  
  An integral constant expression with the value 0, or such an expression cast to TypeIs_VoidPointer, is called a null pointer constant. If a null pointer constant is assigned to or compared for equality to a pointer, the constant is converted to a pointer of that type. Such a pointer, called a null pointer, is guaranteed to compare unequal to a pointer to any object or function. A null pointer constant has TypeIs_NULL.
  
  Once again, it should be said:
  
  Don't give advice not to give advice to not give advice when you don't know C when you don't know C when you don't know C.
Those who forget Tony Hoare... by smug_lisp_weenie · 2005-02-25 08:55 · Score: 5, Insightful

...are doomed to repeat the biggest trap in computer programming over and over again:

"Premature optimization is the root of all evil"

If there's only one rule in computer programming a person ever learns, "Hoare's dictum" is the one I would choose.

Almost all modern languages have extensive libraries available to handle common programming tasks and can handle the vast majority of optimizations you speak of automatically. This means that 99.99% of the time you shouldn't be thinking about optimizations at all. Unless you're John Carmack or you're writing a new compiler from scratch (and perhaps you are) or involved in a handful of other activities you're making a big big mistake if your spending any time worrying about these things. There are far more important things to worry about, such as writing code that can be understood by others, can easily be units tested, etc.

A few years ago I used to write C/C++/asm code extensively and used to be obsessed with performance and optimization. Then, one day, I had an epiphany and started writing code that is about 10 times slower than my old code (different in computer language and style) and infinitely easier to understand and expand. The only time I optimize now is at the very very end of development when I have solid profiler results from the final product that show noticable delays for the end user and this only happens rarely.

Of course, this is just my own personal experience and others may see things differently.
Write C for C programmers by swillden · 2005-02-25 08:55 · Score: 5, Insightful

With regard to your example, I can't imagine any modern compiler wouldn't treat the two as equivalent.
However, in your example, I actually prefer "if (!ptr)" to "if (ptr == NULL)", for two reasons. First the latter is more error-prone, because you can accidentally end up with "if (ptr = NULL)". One common solution to avoid that problem is to write "if (NULL == ptr)", but that just doesn't read well to me. Another is to turn on warnings, and let your compiler point out code like that -- but that assumes a decent compiler.
The second, and more important, reason is that to anyone who's been writing C for a while, the compact representation is actually clearer because it's an instantly-recognizable idiom. To me, parsing the "ptr == NULL" format requires a few microseconds of thought to figure out what you're doing. "!ptr" requires none. There are a number of common idioms in C that are strange-looking at first, but soon become just another part of your programming vocabulary. IMO, if you're writing code in a given language, you should write it in the style that is most comfortable to other programmers in that language. I think proper use of idiomatic expressions *enhances* maintainability. Don't try to write Pascal in C, or Java in C++, or COBOL in, well, anything, but that's a separate issue :-)
Oh, and my answer to your more general question about whether or not you should try to write code that is easy for the compiler... no. Don't do that. Write code that is clear and readable to programmers and let the compiler do what it does. If profiling shows that a particular piece of code is too slow, then figure out how to optimize it, whether by tailoring the code, dropping down to assembler, or whatever. But not before.

--
Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
code should be written for people to read by SamSeaborn · 2005-02-25 08:56 · Score: 5, Insightful

"Programs should be written for people to read, and only incidentally for machines to execute."
- Structure and Interpretation of Computer Programs
Not a question that can be asked generally by Sycraft-fu · 2005-02-25 08:56 · Score: 4, Informative

Each compiler is different. Some will optimise things other won't.

In general, however, systems are now fast enought that when in doubt, write the clearest code possible. I mean for most apps, speed is not critical, however for all apps stability and lack of bugs is important and obscure code leads to problems.

Also, for things that are time critical, it's generall just one or two little parts that make all the difference. You only need to worry about optimizing those inner loops where all the time is spent. Use a profiler, since programmers generally suck at identifying what needs optimising.

Keep it easy to read and maintain, unless speed is critical in a certian part. Then you can go nuts on hand optimization, but document it well.
Check out the LLVM demo page by sabre · 2005-02-25 08:58 · Score: 5, Interesting

LLVM is an aggressive compiler that is able to do many cool things. Best yet, it has a demo page here: http://llvm.org/demo, where you can try two different things and see how they compile.

One of the nice things about this is that the code is printed in a simple abstract assembly language that is easy to read and understand.

The compiler itself is very cool too btw, check it out. :)

-Chris
That's a Tony Hoare quote, not Donalded Knuth by Dan+Ost · 2005-02-25 08:59 · Score: 4, Informative

Donald Knuth was quoating Tony Hoare when he said that.

--

*sigh* back to work...
The algorithm that must not be named! by coyote-san · 2005-02-25 09:19 · Score: 4, Funny

Grrr, you named the algorithm that must not be named! Cursed be the name of the fool who thought it would be a good algorithm for introductory students - I've lost count of the number of people convinced that this satan-spawned algorithm is faster than an insertion sort (it's not) and that there's no reason for them to learn to use the qsort() function. N.B., not to implement a quick sort, but to simply call a standard library routine.

The most frustrating thing is that, if you must use the algorithm that must not be named, the bidirectional form of the algorithm is much faster (in practice) than the unidirectional form yet really no more complex to code than the latter if you have any potential as a software developer.

--
For every complex problem there is an answer that is clear, simple, and wrong. -- H L Mencken
If you're not willing to TIME it... by dpbsmith · 2005-02-25 09:30 · Score: 4, Insightful

...then the code isn't important enough to optimize. Plain and simple.

Never try to optimize anything unless you have measured the speed of the code before optimizing and have measured it again after optimizing.

Optimized code is almost always harder to understand, contains more possible code paths, and more likely to contain bugs than the most straightforward code. It's only worth it if it's really faster...

And you simply cannot tell whether it's faster unless you actually time it. It's absolutely mindboggling how often a change you are certain will speed up the code has no effect, or a truly negligible effect, or slows it down.

This has always been true. In these days of heavily optimized compilers and complex CPUs that are doing branch prediction and God knows what all, it is truer than ever. You cannot tell whether code is fast just by glancing at it. Well, maybe there are processor gurus who can accurately visualize the exact flow of all the bits through the pipeline, but I'm certainly not one of them.

A corollary is that since the optimized code is almost always trickier, harder to understand, and often contains more logic paths than the most straightforward code, you shouldn't optimize unless you are committed to spending the time to write a careful unit-test fixture that exercises everything tricky you've done, and write good comments in the code.

--
"How to Do Nothing," kids activities, back in print!
Optimization rules... by Anonymous Coward · 2005-02-25 09:46 · Score: 5, Funny

When I wrote my ray-tracer for the final project of my graphics class, I used gcc -o3 and it optimized my code into Pov-ray, which was sweet. I was done with the project in like ten minutes.

Plus I got extra credit for implementing phong shading. I didn't even try to do phong shading.
Re:Not always. by zaffir · 2005-02-25 09:49 · Score: 4, Insightful

I make my code easy to read for my own sanity. I've lived out this bash.org quote way too many times.

--
"Upon attaching the waterblock to my penis, I began to notice that I know nothing about computers." -- JRockway
Re:Clear Code - Boeing by pagebt · 2005-02-25 09:58 · Score: 4, Funny

And believe me it is a pain in the a$$. Our company did the verification for the code in the microprocessor that controls the locks to the bathroom door on a 777, if the crapper tank is full then the door locks to make sure there isn't an overflow and thus frozen turd/urine meteors that fall from the sky. Every byte of the code MUST be excercised including all error conditions.
Re:Clear Code - Boeing by Anonymous Coward · 2005-02-25 10:05 · Score: 4, Funny

So how many dumps does it take to fill up the crapper tank? I'd hate to be the last QA engineer in line to use the crapper. Also what happens when that last engineer fills up the crapper, does the bathroom door look thus trapping him inside?
Premature Optimization by fizban · 2005-02-25 10:06 · Score: 4, Insightful

Premature Optimization is the DEVIL! I repeat, it is the gosh darn DEVIL! Don't do it. Write clear code so that I don't have to spend days trying to figure out what you are trying to do.

The biggest mistake I see in my professional (and unprofessional) life is programmers who try to optimize their code is all sorts of "733+" ways, trying to "trick" the compiler into removing 1 or 2 lines of assembly, yet completely disregard that they are using a map instead of a hash_map, or doing a linear search when they could do a binary search, or doing the same lookup multiple times, when they could do it just once. It's just silly, and goes to show that lots of programmers don't know how to optimize effectively.

Compilers are good. They optimize code well. Don't try to help them out unless you know your code has a definite bottleneck in a tight loop that needs hand tuning. Focus on using correct algorithms and designing your code from a high level to process data efficiently. Write your code in a clear and easy to read manner, so that you or some other programmer can easily figure out what's going on a few months down the line when you need to add fixes or new functionality. These are the ways to build efficient and maintainable systems, not by writing stuff that you could enter in an obfuscated code contest.

--
+1 Insightful, -1 Troll. What can I say, I'm an Insightful Troll.
valgrind by cyco/mico · 2005-02-25 10:12 · Score: 4, Informative
If in doubt, use valgrind and kcachegrind. One run with callgrind gives you all the information you want:
- How often are functions called (and branches taken)
- Which functions take most of the time
- See the assembler code for each line with a mouse click (no need to guess anymore)
callgrind/kcachegrind is by far the easiest profiling solution I ever tried, and it seems answer more or less all of your questions.
i, j, k, ... by bsd4me · 2005-02-25 10:17 · Score: 4, Informative

I think that most people forget that the reason that i, j, k, etc. are used for loop counters is that unless otherwise declared, I..N default to INTEGER in FORTRAN. This convention just carried over as programmers migrated from FORTRAN to other languages and has been passed down through the ages.

--
(S(SKK)(SKK))(S(SKK)(SKK))
Re:Clear Code - Boeing by Sponge+Bath · 2005-02-25 10:58 · Score: 5, Funny

when that last engineer fills up the crapper, does the bathroom door look thus trapping him inside?
HAL! Open the bathroom door!

I'm sorry Dave, you shouldn't have had that last burrito.
Must be nice by peccary · 2005-02-25 11:09 · Score: 4, Insightful

one product
one customer
420,000 lines
260 staff
no competition
no trade shows
no salespeople selling new features that have never been discussed

It's interesting to talk about their attention to detail, but to hold it up as a model for all software development neglects to consider that they are working under an entirely different set of constraints from most everyone else.
C and "flexibility" of expression operators by GunFodder · 2005-02-25 11:18 · Score: 4, Insightful

I think the example is fine; you just displayed an assumption that highlights one of the quirks of C.

! means "not" or "inverse of"; it is a boolean function. The variable ptr is a pointer; it is a reference to data, which means it isn't really data itself. !ptr shouldn't compute; a boolean operator should only work on boolean data. But C logical comparators are designed to work on everything. You are just supposed to know that 0 == NULL == false. This supposition is totally arbitrary and doesn't hold up in any language with strong typing.

This is what makes C difficult for beginners. Bad code compiles even though it has logical flaws, and ends up failing in mysterious ways.

The second case makes more sense. Equality is an operator that should work on all types of data. NULL is necessary if you are going to abstract data through the use of pointers or objects. Doing away with NULL would be equivalent to eliminating true and false and using 1 and 0 instead. Or eliminating strings and using sequences of ASCII codes. These substitutions are technically correct but in reality they make code unreadable.
Dear Lord by sholden · 2005-02-25 11:35 · Score: 4, Insightful

Ten years of programming in the language and you:

1) Don't know when two things are obviously equivalent to any non-brain dead compiler.

2) Think something other than readability matters.

3) Think the non-idiomatic way of doing something is more readable.

But I'm sure I'm just repeating the comments I can't be bothered reading.
Re:Not always. by Foz · 2005-02-25 13:37 · Score: 5, Insightful

No, you're adopting a black or white approach. You are, in essence, saying that you don't need to comment at all. The original poster was saying that comments needed to be everywhere, on everything. I believe in a middle ground approach.

I comment things that are non intuitive. I comment things that I *think* may be non intuitive. I comment things that I think someone else might have some difficulty understanding, because I happened to be deep into a code burn and consequently wrote something pretty tight, pretty sweet, but also pretty obfuscated. Finally, I comment things that I think *I* may not understand when I go back and look at the code again 3 months from now.

I don't comment every single line... I don't comment simple data structures, loops "/* this is a for loop using the integer variable I */" etc which would be stupid. I do however disassemble the complex portions of my code, describe how I'm dispatching events and best of all *why* I decided to do things a certain way instead of a different way.

I have, however, been handed 30k lines of code with zero documentation and not a single comment anywhere in it, with absolutely no clue at all how it worked and no access to the original programmer and been told "We need such and such fixed|updated|added by friday" and had to spend the entire week basically tracing every single line of code to figure out that the original programmer must have been smoking crack with NO indication of why he wrote things how he did and NO help when he decided to be exceedingly "clever"
in his code. That time was wasted.

Would it have killed him to simply put a comment block explaining his event dispatch model? Or to tell me what his functions and methods did and best of all why they did it?

There *is* a middle ground, believe it or not.

-- Gary F.
/. posters by Saville · 2005-02-25 17:21 · Score: 4, Insightful

"I ask the Slashdot crowd, what they believe the compiler can be trusted to optimize and what must be hand optimized? Give examples of code optimizations that you think the compiler can/can't be trusted to do."

Somehow 99% of the readers took this to mean "What is the difference between NULL and the zero bit pattern and do you think it is a good idea to write clear code and do the profile/algorithm change cycle until there is nothing left to optimize or should I write low level optimized code from the start?"

sigh.. I've only found two comments with code so far after going through hundreds of posts. This is possibly the worst signal to noise ratio I've witnessed on /.