Arrays vs Pointers in C?

← Back to Stories (view on slashdot.org)

Posted by Cliff on Wednesday October 12, 2005 @10:42AM from the optimizations-may-not-be-portable dept.

UOZaphod asks: "A recent sub-discussion on Slashdot (in which, I confess, I was involved) piqued my curiosity because of several comments made about C compiler optimizations. I was informed that said optimizations have made it so that indexing an array with the [] operator is just as fast as using an incremented pointer. When the goal is maximum performance across multiple CPU architectures, can one always assume that this is true?" "Here are my own thoughts on the issue:

For discussion purposes, I present the following two equivalent functions which reverse the contents of a string. Note that these code fragments are straight C, and do not account for MBCS or Unicode.

The first function uses array indexing:

void reversestring_array(char *str) {
int head, tail; char temp; if (!str) return; tail = strlen(str) - 1; for (head = 0; head < tail; ++head, --tail) {
temp = str[tail]; str[tail] = str[head]; str[head] = temp;
}
}

The second function uses pointers:

void reversestring_pointer(char *str) {
char *phead, *ptail, temp; if (!str) return; ptail = str + strlen(str) - 1; for (phead = str; phead < ptail; ++phead,--ptail) {
temp = *ptail; *ptail = *phead; *phead = temp;
}
}

While there are obvious optimizations that could be done for both functions, I wanted to keep them as simple and semantically similar as possible.

Arguments have been made that the compiler will optimize the first example using register indexing built into the CPU instruction set, so that it runs just as fast as the pointer version.

My argument is that one cannot assume, in a multi-architecture environment, that such optimizations will always be available. Semantically, the expression array[index] must always be expanded to *(array + index) when the index is variable. In other words, the expression cannot be reduced further, because the value of the index is unknown at run time.

Granted, when I compiled the above examples on an x86 machine, the resulting assembly for each of the two functions ended up looking very similar. In both cases, I enabled full compiler optimization (Pentium Pro). I will present just the inner loop for each function...

The array function:

forloop: mov bl,byte ptr [esi+edx] mov al,byte ptr [ecx+edx] mov byte ptr [ecx+edx],bl mov byte ptr [esi+edx],al inc esi dec ecx cmp esi,ecx jl forloop

The pointer function:

forloop: mov bl,byte ptr [ecx] mov dl,byte ptr [eax] mov byte ptr [eax],bl mov byte ptr [ecx],dl inc ecx dec eax cmp ecx,eax jb forloop

While this example appears to prove the claim that compiler optimizations eliminate the differences between array and pointer usage, I wonder if it would still be true with more complicated code, or when indexing larger structures.

I'd certainly be interested in hearing more discussion on the matter, accompanied by examples and references."

5 of 308 comments (clear)

Min score:

Reason:

Sort:

Why do you care? by Julian+Morrison · 2005-10-12 10:48 · Score: 4, Insightful

For any real programming task, the question has to be: why do you care baout that? Is it, specifically, a bottleneck in your code as detected with profiling tools?

If it isn't, then don't wank around optimizing for single cycles on a machine that probably bleeds off a million cycles every time you raise a window.
1. Re:Why do you care? by JanneM · 2005-10-12 14:37 · Score: 5, Insightful
  
  For any real programming task, the question has to be: why do you care baout that? Is it, specifically, a bottleneck in your code as detected with profiling tools?
  
  When the programming task is something like real-time image processing (computer vision), this kind of thing can make a serious difference. If 90% of your time is spent running these kinds of loops over and over again, an improvement in time will make a real difference on what combination of methods you have time for; or how exhaustively you can search for features during one frame; or what resolution image you are able to work on.
  
  If your code does something nice and graphical where 99% of the time is spent waiting for the user, sure, you're absolutely right. And if your system is doing something inherently bounded - it works until it's done, then it stops and waits until it's time again - then all you need is to make it fast enough and no faster. But there are real-world systems that today, and for the foreseeable future, are bounded by the available processing power and that can always benefit from any improvement in execution time.
  
  --
  Trust the Computer. The Computer is your friend.
2. Re:Why do you care? by NonSequor · 2005-10-12 19:10 · Score: 4, Insightful
  
  You don't understand the problem. A chemical reaction is only as fast as its slowest step. Catalyzing the other steps will not yield an improvement in reaction rate.
  
  For computer programs, you won't gain anything worthwhile by optimizing code that the computer only spends 1% of its time executing. That's not to say you should do a sloppy job, but you should focus on what matters. Microoptimization techniques (those techniques that involve choices of instructions and their orders rather than changing the algorithm that is used) typically yield very small gains. Microoptimization can yield substantial benefits when used properly in heavily used sections of code, but the time involved in trying to microoptimize everything could be better used to work on macrooptimizations or organizing the code to make it more amenable to later modification.
  
  There's no sense in trying to make your program 0.3% faster when you could be finding a way to make it 20% faster instead.
  
  --
  My only political goal is to see to it that no political party achieves its goals.
I echo the above statements by 21chrisp · 2005-10-12 11:13 · Score: 4, Insightful

These types of opimizations are virtually pointless on modern machines. The increased readability and lower likelihood of programming errors on the array option far outway any speed increase for the pointer option. Plus, as you noticed, the resulting assembly is basically the same. Most likely, both will run at virtually the same speed with modern compilers. Not only would the speed difference be unnoticeable, it would be utterly inconsequential.

IMHO there is no place for pointer arithmetic in modern software. If someone working for me wrote something like the second option, I would ask them to rewrite it.
As Donald Knuth said... by jschmerge · 2005-10-12 13:28 · Score: 4, Insightful

Premature optimization is the root of all evil.

I personally let the compiler writers worry about this type of thing. I'd rather have my code be more readable than fast. That being said, there are many times that I switch back and forth between pointer arithmetic and array indexing within the same program. I'll primarily use the pointer arithmetic for things like simple string processing where it just leads to more compact code, and I'll use the array indexing when I have an actual bonafide array...

In any event, my point is that you should be programming in a way that is maintainable and readable; you shouldn't be worried about shaving tens of clock cycle off of such simple loops. In more complex loops, you probably will not be able to shave nearly as much time off, because your indexes won't always be sitting in a register and the data that the index points to will most likely not even fit into a register. In this case, I don't think anyone will argue with the assertion that this:

(ptr + index)->dataMember

is less readable than this:

ptr[index].dataMember