Comparing G++ and Intel Compilers and Vectorized Code

Posted by timothy on Thursday December 19, 2013 @03:46AM from the different-lenses dept.

Nerval's Lobster writes "A compiler can take your C++ loops and create vectorized assembly code for you. It's obviously important that you RTFM and fully understand compiler options (especially since the defaults may not be what you want or think you're getting), but even then, do you trust that the compiler is generating the best code for you? Developer and editor Jeff Cogswell compares the g++ and Intel compilers when it comes to generating vectorized code, building off a previous test that examined the g++ compiler's vectorization abilities, and comes to some definite conclusions. 'The g++ compiler did well up against the Intel compiler,' he wrote. 'I was troubled by how different the generated assembly code was between the 4.7 and 4.8.1 compilers—not just with the vectorization but throughout the code.' Do you agree?"
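
For readers who have not looked at this before, a loop of the kind a compiler can auto-vectorize might look like the sketch below. The function name `saxpy` and the build line in the comment are illustrative assumptions, not taken from Cogswell's article. Whether a given compiler version actually vectorizes it depends on the flags and the target CPU.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical example: y[i] = a * x[i] + y[i].
// Every iteration is independent of the others, which is what lets a
// vectorizer process several elements per instruction.
// One possible build line (assumption): g++ -O3 -march=native -S saxpy.cpp
void saxpy(float a, const std::vector<float>& x, std::vector<float>& y) {
    assert(x.size() == y.size());
    const float* xp = x.data();
    float* yp = y.data();
    const std::size_t n = y.size();
    for (std::size_t i = 0; i < n; ++i) {
        yp[i] = a * xp[i] + yp[i];
    }
}
```

Reading the assembly that `-S` writes out is the usual way to check whether packed instructions were actually generated.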

24 of 225 comments

Documentation is King by jpschaaf · 2013-12-19 03:51 · Score: 3, Interesting

For better or worse, I've always given the intel compiler the benefit of the doubt. They have access to documents that the GCC folks don't.
1. Re:Documentation is King by Curupira · 2013-12-19 04:27 · Score: 5, Informative
  
  Yeah, on Intel processors. What about AMD and other x86 processors? Don't ever forget that ICC was once caught red-handed disabling important features when the CPUID did not return GenuineIntel...
2. Re:Documentation is King by Runaway1956 · 2013-12-19 05:45 · Score: 4, Interesting
  
  Yep, they have access to some cool documents. It took a lot of work to document the fact that the intel compiler was actually crippling code if it was run on AMD processors. I mean, some suspicious, somewhat paranoid people suspected that intel was crippling code on AMD processors, but it took a good deal of work to actually demonstrate it.
  That is just one of the many reasons I don't use Intel.
  
  --
  "Windows is like the faint smell of piss in a subway: it's there, and there's nothing you can do about it." - Charlie Br
3. Re:Documentation is King by Darinbob · 2013-12-19 07:05 · Score: 3, Insightful
  
  GCC also works with many CPUs that the Intel compiler does not. That includes x86-compatible chips from other vendors, as well as the advanced features in Intel chips that were originally introduced by competing clones. So maybe Intel is nice, but that's irrelevant if you don't even use Intel hardware in your products.
  If Intel really is basing their compiler off of secret architecture documents, then people should be able to deduce what's going on from looking at the generated assembler. I.e., find some goofy generated code that does not seem to make sense given public documents, get a benchmark to compare it, figure out there's a hidden feature, and then make use of it.
4. Re:Documentation is King by Anonymous Coward · 2013-12-19 10:20 · Score: 3, Insightful
  
  If the CPU reports it supports SSE2, and the compiler supports it, I expect it to bloody well use those instructions when told to, not silently produce fucking x87 garbage. Really rocket science apparently.
Not sure why it's troubling. by serviscope_minor · 2013-12-19 03:55 · Score: 5, Insightful

I don't think it's troubling.
Firstly they beat on the optimizer a *lot* between major versions.
Secondly, the compiler does a lot of micro optimizations (e.g. the peephole optimizer) to choose between essentially equivalent snippets. If they change the information about the scheduling and other resources you'd expect that to change a lot.
Plus I think that quite a few interesting problems such as block ordering are NP-hard. If they change the parameters of their heuristic NP-hard solver, that will give very different outputs too.
So no, not that bothered, myself.

--
SJW n. One who posts facts.
1. Re:Not sure why it's troubling. by david.emery · 2013-12-19 04:10 · Score: 5, Informative
  
  Mod parent up +1 insightful.
  Unless you suspect and are trying to debug a code generator error (one of the least pleasant/most difficult debugging experiences I've had), the base assertion that you should understand your compiler's code generation is at best unrealistic, and probably just dumb. Code generation is extremely complex, requiring deep knowledge of both this specific compiler's design and this specific computer's instruction set architecture, how the caches work, pre-fetching approaches, timing dependencies in instruction pipelines, etc, etc. If you do suspect a code generator error, you're best off hiring a compiler expert, at least as a consultant, and being prepared for a long hard slog.
  Maybe 30 years ago, for a PDP-8, you could assert that the C code you wrote had some semblance to the generated machine code. That hasn't been true for a very long time, and C++ is most definitely not C in this regard.
2. Re:Not sure why it's troubling. by zubab13 · 2013-12-19 04:17 · Score: 4, Informative
  
  Just use something like libsimdpp[1] and you are sure that your code stays vectorized between compiler versions. As a bonus, this and similar wrapper libraries give you an option to produce assembly for multiple instruction sets (say SSE2, AVX and NEON) from the same code. [1]: https://github.com/p12tic/libsimdpp
3. Re:Not sure why it's troubling. by loufoque · 2013-12-19 09:29 · Score: 3, Interesting
  
  Explicit vectorization is indeed much more reliable than automatic vectorization, and it will always deliver better performance.
  Interestingly, there seem to be quite a few abstraction layer libraries for SIMD. There are also at least Boost.SIMD (part of NT2 [1]) and Vc [2].
  Several array-handling libraries (NT2 [1], Eigen [3]) also leverage SIMD explicitly.
  Alternatively there are plenty of languages based on C with explicit SIMD programming, like the Intel SPMD Compiler [4].
  If you're interested in SIMD, there is also apparently a workshop being held soon on this subject in Orlando [5].
  [1] https://github.com/MetaScale/nt2
  [2] http://code.compeng.uni-frankfurt.de/projects/vc/
  [3] http://eigen.tuxfamily.org/index.php?title=Main_Page
  [4] http://ispc.github.io/
  [5] https://sites.google.com/site/wpmvp2014/
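  
  To make "explicit vectorization" concrete, here is a minimal sketch using raw SSE intrinsics rather than any of the libraries listed above, whose APIs are not shown here. The helper name `add_arrays` is made up for illustration, and the scalar fallback is an assumption so the code still builds on targets without SSE2.

  ```cpp
  #include <cassert>
  #include <cstddef>
  #if defined(__SSE2__)
  #include <emmintrin.h>  // SSE2 intrinsics (also pulls in the SSE ones)
  #endif

  // Hypothetical helper: out[i] = a[i] + b[i] for n floats.
  void add_arrays(const float* a, const float* b, float* out, std::size_t n) {
      assert(n == 0 || (a && b && out));
      std::size_t i = 0;
  #if defined(__SSE2__)
      // Explicit vector path: four floats per iteration, unaligned loads.
      for (; i + 4 <= n; i += 4) {
          __m128 va = _mm_loadu_ps(a + i);
          __m128 vb = _mm_loadu_ps(b + i);
          _mm_storeu_ps(out + i, _mm_add_ps(va, vb));
      }
  #endif
      // Scalar tail (and the whole loop on targets without SSE2).
      for (; i < n; ++i) {
          out[i] = a[i] + b[i];
      }
  }
  ```

  The wrapper libraries exist largely to hide the `#if` and the per-instruction-set spelling of these calls.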
Very different code by Anonymous Coward · 2013-12-19 03:58 · Score: 4, Interesting

I have worked on a couple of projects that compiled and ran perfectly with GCC 4.6 and 4.7. They no longer run when compiled with the latest versions of GCC. No warnings, no errors during compilation, they simply crash when run. It's the same source code, so something has changed. The same code, when compiled with multiple versions of Clang, runs perfectly. The GCC developers are doing something different and it is causing problems. Now it may be that a very well hidden bug is lurking in the code and the latest GCC is exposing that in some way, but this code worked perfectly for years under older versions of the compiler so it's been a nasty surprise.
1. Re:Very different code by david.emery · 2013-12-19 04:14 · Score: 5, Insightful
  
  Unfortunately, that's not unique to GCC. I've seen this happen with several different compilers for different programming languages over the years. Worse, I've seen it with the same compiler, but different optimizer settings.
  In one case, our system didn't work (segfaulted) with the optimizer engaged, and didn't meet timing requirements without the optimizer. And the problem wasn't in our code, it was in a commercial product we bought. The compiler vendor, the commercial product vendor (and the developer of that product, not the same company as we bought it from) and our own people spent a year pointing fingers at each other. No one wanted to (a) release source code and then (b) spend the time stepping through things at the instruction level to figure out what was going on.
  And the lesson I learned from this: Any commercial product for which you don't have access to source code is an integration and performance risk.
2. Re:Very different code by Mr+Z · 2013-12-19 04:31 · Score: 4, Informative
  
  Actually, the scope of int i changed in C++. Previously, the scope would extend beyond the for. If you enable warnings, G++ will tell you all about it.
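  
  A minimal sketch of the scoping rule being described; the function name `first_zero` is made up for illustration. In standard C++ the loop variable is scoped to the `for` statement, so code that needs the final index has to declare it outside the loop.

  ```cpp
  #include <cassert>

  // Hypothetical helper: index of the first zero in buf, or n if there is none.
  int first_zero(const int* buf, int n) {
      assert(n == 0 || buf);
      // Pre-standard style, rejected by a conforming compiler:
      //   for (int i = 0; i < n; ++i) if (buf[i] == 0) break;
      //   return i;   // error: 'i' is not in scope here
      int i = 0;  // declared outside the for, so it outlives the loop
      for (; i < n; ++i) {
          if (buf[i] == 0) break;
      }
      return i;
  }
  ```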
  
  --
  Program Intellivision!
3. Re:Very different code by david.emery · 2013-12-19 04:33 · Score: 4, Insightful
  
  Well, in part that depends on your market. Most of my work has been in military systems or air traffic systems, where the cost of failure >> lost opportunity cost. That's a point a lot of people forget; not all markets (and therefore the risk calculations for bugs, etc) are created equal.
4. Re:Very different code by drawfour · 2013-12-19 04:54 · Score: 4, Informative
  
  This is why all code should be compiled with the highest warning level enabled, and all warnings should be treated as errors. The compiler can have a very hard time guessing at what you meant, so it's best to be as explicit as you can. If, for some reason, you're positive the code needs to be the way it is, and it is correct, you can always use a "pragma warning(disable)" (with appropriate push/pop semantics) to keep your code compiling clean.
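  
  The `pragma warning(disable)` spelling is MSVC's. As a sketch of the same push/disable/pop pattern for g++ (and Clang), assuming a build with something like `-Wall -Wextra -Werror`; the callback names are hypothetical:

  ```cpp
  #include <cassert>  // only needed by callers that check results with assert()

  #pragma GCC diagnostic push
  #pragma GCC diagnostic ignored "-Wunused-parameter"
  // Hypothetical callback whose signature is fixed by some API;
  // this implementation has no use for 'context'.
  int on_event(int code, void* context) {
      return code * 2;
  }
  #pragma GCC diagnostic pop
  // From here on, -Wunused-parameter is back to its previous state.

  // Alternative that needs no pragma: leave the parameter unnamed.
  int on_other_event(int code, void* /*context*/) {
      return code + 1;
  }
  ```

  Keeping the push/pop pair tight around one function means the warning stays live for the rest of the file.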
5. Re:Very different code by PhrostyMcByte · 2013-12-19 05:03 · Score: 3, Insightful
  
  Your projects were likely doing something which resulted in undefined behavior. It's been extremely rare to have GCC break working standards-compliant code.
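  
  As one illustration of the kind of bug meant here (an assumption for the sake of example, not a diagnosis of the parent's projects): signed integer overflow is undefined behavior in C++, so an after-the-fact overflow check can be optimized away. The function name below is hypothetical.

  ```cpp
  #include <cassert>
  #include <climits>

  // Broken idiom (undefined behavior when a + b overflows):
  //   bool overflowed = (a + b < a);   // intended for b > 0
  // An optimizer may assume signed overflow never happens and fold this to false.

  // Well-defined alternative: compare against the limits before adding.
  bool add_would_overflow(int a, int b) {
      if (b > 0) return a > INT_MAX - b;
      if (b < 0) return a < INT_MIN - b;
      return false;
  }
  ```

  Code relying on the broken form can appear to work for years and then change behavior when the optimizer gets smarter.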
6. Re:Very different code by drawfour · 2013-12-19 06:12 · Score: 3, Insightful
  
  There is a reason for warnings -- it's because you're doing something wrong. Unfortunately, the compiler lets you do it anyway, probably because there is a ton of legacy code that would suddenly "break" if they were errors by default. But that doesn't mean that you should stop trying to fix these issues. Many of these issues only appear to be benign until you stumble upon the exact issue the warning was trying to warn you about. Static code analysis tools are also your friend. That doesn't mean you can blindly trust them -- static analysis tools do have false warnings. But they're way better than inspecting the code yourself. You'll miss something way more times than the analysis tools will give you a false positive.
7. Re:Very different code by 0123456 · 2013-12-19 06:52 · Score: 3, Informative
  
  There is a reason for warnings -- it's because you're doing something wrong.
  Uh, no. It's because you're doing something that may be wrong. If it was wrong, the compiler would give an error, not a warning.
  'if ( a = b )' for example. The compiler warns because you probably meant 'if ( a == b )'. But maybe you didn't.
  There's little reason to write such C code on a modern quad-core 3GHz CPU which spends 90% of its time idle and where the compiler will probably generate the same machine code anyway, but that doesn't make it wrong.
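  
  A short sketch of that case, with hypothetical function names. With g++ `-Wall`, a bare `if (a = b)` draws a "suggest parentheses" warning; the extra pair of parentheses below is the conventional way to say the assignment is deliberate.

  ```cpp
  #include <cassert>  // only needed by callers that check results with assert()

  // Assigns b to a, then tests the assigned value.
  // Writing `if (a = b)` would compile too, but with a warning.
  int assign_then_test(int a, int b) {
      if ((a = b)) {   // intentional assignment; true when b is nonzero
          return a;    // a now holds b's value
      }
      return -1;
  }

  // What the warning assumes you probably meant.
  bool compare(int a, int b) {
      return a == b;
  }
  ```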
8. Re:Very different code by Darinbob · 2013-12-19 08:08 · Score: 3, Informative
  
  Sometimes warnings are false positives as well. Especially when turning warning levels up high they will warn about things that may be indicators of a bug or typo but which actually aren't problems, or in some cases are even intentional. Such as unused variables or parameters; is that a bug or a stylistic choice to not litter the code with extra #ifdef? An unused parameter in general seems an odd thing to complain about, usually the parameter list is fixed in an API or design document whether or not the actual implementation needs all the parameters.
News for nerds or not by symbolset · 2013-12-19 04:05 · Score: 5, Informative

Asking any audience larger than about 20 to compare the qualitative differences of object code vectorization is statistically problematic as the survey group is larger than the qualified population.

--
Help stamp out iliturcy.
Vectorized factorials! by Mr+Z · 2013-12-19 04:23 · Score: 4, Interesting

One amusing thing I discovered is that GCC 4.8.0 will actually unroll and vectorize this simple factorial function: Just look at that output!

--
Program Intellivision!
1. Re:Vectorized factorials! by DickBreath · 2013-12-19 07:19 · Score: 4, Funny
  
  Here is how I do a factorial function. No recursion, no loops, no vectorization needed. It's in Java. Converting this basic idea to C is left as an exercise for advanced readers.
  
  static public long factorial( int n ) {
      switch( n ) {
          case 0:
          case 1: return 1L;
          case 2: return 2L;
          case 3: return 6L;
          // . . . cases 4 to 18 omitted to bypass slashdot filters . . .
          case 19: return 121645100408832000L;
          case 20: return 2432902008176640000L;
      }
      return 0L;
  }
  
  --
  
  I'll see your senator, and I'll raise you two judges.
2. Re:Vectorized factorials! by Mr+Z · 2013-12-19 07:34 · Score: 3, Informative
  
  Why isn't that just a lookup table? My point in mentioning factorial is that there's no point in vectorizing that thing. Even a simple loop would be small compared to the cost of a single L2 cache miss.
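  
  A sketch of the lookup-table version in C++; the table name is made up, and returning 0 for out-of-range input is an assumption chosen to match the switch version above.

  ```cpp
  #include <cassert>
  #include <cstdint>

  // Factorials 0! through 20!; 21! no longer fits in a signed 64-bit integer.
  static const std::int64_t kFactorials[21] = {
      1LL, 1LL, 2LL, 6LL, 24LL, 120LL, 720LL, 5040LL, 40320LL, 362880LL,
      3628800LL, 39916800LL, 479001600LL, 6227020800LL, 87178291200LL,
      1307674368000LL, 20922789888000LL, 355687428096000LL,
      6402373705728000LL, 121645100408832000LL, 2432902008176640000LL
  };

  std::int64_t factorial(int n) {
      if (n < 0 || n > 20) return 0;  // out of range: same result as the switch
      return kFactorials[n];
  }
  ```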
  
  --
  Program Intellivision!
Why is this still a topic? by excelsior_gr · 2013-12-19 04:36 · Score: 3, Interesting

This is 2013 (almost 2014!) why are we talking about vectorization? Why don't people write code in vector notation in the first place anyway? If Matlab and Fortran could implement this 25 years ago, I am sure we are ready to move on now...
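
For comparison, standard C++ does have a whole-array notation of sorts in `std::valarray`. The sketch below uses a hypothetical function name and makes no claim about the generated code: whether a compiler emits SIMD instructions for it is not guaranteed.

```cpp
#include <cassert>
#include <valarray>

// Hypothetical example: whole-array expression, no explicit loop.
std::valarray<double> scale_and_add(double a,
                                    const std::valarray<double>& x,
                                    const std::valarray<double>& y) {
    assert(x.size() == y.size());
    return a * x + y;  // applied element by element
}
```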
Re:yuo Fai7 It by LordByronStyrofoam · 2013-12-19 05:34 · Score: 3, Funny

OMG! What's this goatse doing here?? I thought all these images were taken down by a DMCA notice by the original asshole!

--
Slashdot's name? When my compiler sees /. it generates a warning about a badly formed comment.