A Review of GCC 4.0

I'll tell you what the problem is... by Anonymous Coward · 2005-05-02 04:45 · Score: 5, Funny

Well clearly the problem is that you compiled GCC 4.0.0 with GCC 3.4.3! What I did was go through the GCC 4.0 source code in two seperate windows, fire up hexedit in another, and go through line by line "compiling" GCC 4.0 with the GCC 4.0 source, in my head. I wouldn't recommend doing this with -funroll-loops, my hands started cramping up.

Or you could wait to compile 4.0 until the 3.0 branch makes it to 3.9.9, then it will be close enough anyway. YMMV, people say I give out bad advice, go figure...

Re:I'll tell you what the problem is... by Inkieminstrel · 2005-05-02 05:03 · Score: 3, Interesting

Gee, I would have just compiled 4.0.0 with 3.4.3, then compiled 4.0.0 again with 4.0.0.
Re:I'll tell you what the problem is... by DJCacophony · 2005-05-02 05:12 · Score: 4, Funny

but then the gcc 4 you compiled with gcc 3.4.3 would produce tainted compilations, and the second 4.0.0 compilation would lean towards 3.4.3 because it was compiled with a compiler that was compiled by 3.4.3. You would have to then take the second compilation of 4.0.0 and compile 4.0.0 with it, at which point the similarity to 3.4.3 would make it somewhere along the lines of 3.7.0. If you continue to compile it, while it will never reach 4.0.0, it will approach closely enough that for all intents and purposes, it will be 4.0.0. The forumula is as follows:

V3-V1~2(V3-V2)

where V1 was used to compile V2, and V2 was used to compile V3.

--
Slow Down, Cowboy! It's been 60 minutes since you last successfully posted a comment.
Re:I'll tell you what the problem is... by Anonymous Coward · 2005-05-02 05:12 · Score: 5, Informative

This was meant as a joke, but for those who took this too seriously: if you have ever tried building GCC yourself, you should know that it always recompiles itself.

A gcc "stage 1" build is gcc compiled with your old compiler. The "stage 2" build is gcc compiled with the compiler created in the previous stage. This is the one that gets installed. The "stage 3" build is optional and verifies that the "stage 2" compiler creates the same output as the previous one.
Re:I'll tell you what the problem is... by confused+one · 2005-05-02 05:19 · Score: 2, Interesting

I believe the Linux from Scratch (LFS) folks have found you have to repeat this three (3) times to have, what is effectively, a clean 4.0.0 compile.
Re:I'll tell you what the problem is... by TelJanin · 2005-05-02 05:41 · Score: 2

Unless one of them produces faulty code, compiling 4.0 with 4.0 compiled with 3.4 will suffice.
Re:I'll tell you what the problem is... by rsidd · 2005-05-02 05:54 · Score: 4, Informative

The GP was a joke, but since you're serious, this is exactly what the "bootstrapping" build of gcc does: it builds a stage 1 build with the system compiler, then a stage 2 build with the stage 1 build, then -- if you want -- a stage 3 build with the stage 2 build, and verifies that the stage 2 and stage 3 builds are the same.

Expected by Hatta · 2005-05-02 04:46 · Score: 4, Interesting

It was a long time before GCC 3 got better than 2.95. I expect the same thing will happen here.

--
Give me Classic Slashdot or give me death!

Re:Expected by Rei · 2005-05-02 05:11 · Score: 5, Insightful

I think the problem is that, if I'm not mistaken, he's testing all C code except Povray. The biggest reported improvements in 4.0 were for g++, so using such a small C++ sample base (Povray - one purpose, one set of design principles, few authors) seems bound to produce inaccurate benchmarking.

Further, on his most reasonable C benchmark (the Linux kernel), he only records compile time and binary size, but no performance. I call it the most reasonable benchmark because it has thousands of contributors and covers a wide range of code purposes and individual coding habits - and yet, performance is omitted.

In short, I wouldn't trust this benchmark. Probably the best benchmark would be to build a whole Gentoo system with both, with identical configurations, and check build times and performances ;)

--
Dear Lord: One of your creatures may be hurt tonight. Please let it be the other creature.
Re:Expected by ajs · 2005-05-02 05:17 · Score: 4, Interesting

I'm not convinced that this test shows that gcc4 is less effecitve than gcc3, though.

First off, all of the programs tested are programs that use hand-tooled assembly in the most performance-sensitive code. That has to mean that the compiler is moot in those sections.

A better test would be to compare three things: the hand-optimized assembly under gcc 3 vs the C code (usually there's a configure switch that tells the code to ignore the hand-tuned assembly, and use a C equivalent) under gcc4 vs that same C code under gcc4.

I think you'd see a surprising result, and if the vectorization code is good enough, you should even see a small boost over the hand-tuned assembly (since ALL of the code is being optimized this way, not just critical sections).
Re:Expected by markov_chain · 2005-05-02 05:48 · Score: 5, Funny

Is it that surprising that a Gentoo user thinks of compiling time as the performance metric? :)

--
Tsunami -- You can't bring a good wave down!
Re:Expected by Rimbo · 2005-05-02 06:00 · Score: 5, Interesting

I think the author of the article misunderstands just what happened with GCC 4.0.

The main improvement in GCC 4.0 is implementing Single Static Assignment.

SSA is not an optimization. It is a simplification. If you can assume SSA, then it opens the door to an entire class of optimizations that can help improve your performance without affecting your code's correctness.

That last bit -- optimizing code without affecting correctness -- was a big problem in the days before SSA.

In that regard, SSA is a similar technology to RISC -- it does not speed things up by itself, but it enables speedups for later on.

The lack of SSA is one thing that kept gcc out of the hands of compiler researchers. Now that it does that, academia can start hacking away with gcc, and the delay you expect is the time between implementing SSA and implementing all of the optimizations that really will improve code performance.
Re:Expected by The+boojum · 2005-05-02 06:05 · Score: 4, Informative

Sorry to tell you this, but the review is even mistaken with respect to Povray. Povray is not a C++ program - it's good ol' C. So in fact, none of the programs he benchmarked were C++. The test was exclusively on C code.

As nice as C is, a lot of the improvements in GCC seemed to have been targetted at improving its handling of C++ code. I'd particularly like to know how it fairs with respect to modern C++ style code - massively templated stuff with STL, Boost, traits and policies, smart pointers, lots of small inlined methods, etc. This test tells me nothing about that, and that's where a lot of development is these days.
Re:Expected by Shisha · 2005-05-02 06:05 · Score: 2, Informative

I call it the most reasonable benchmark because it has thousands of contributors and covers a wide range of code purposes and individual coding habits - and yet, performance is omitted.
As someone who has done some kernel (see this old project) and other programming, I would probably disagree with this statement. The code you find in the linux kernel is rather different (think concurrency, locking, I/O waiting, message passing) to the code you'd find in a number crunching application (think for loops that take forever, huge data sets, nested recursion) which would be rather different to the code you find in something like LAME (think DSP code).

As for the coding habits, the kernel develoment process encourages similar coding habits as it makes the code easier for others to read. There would be differences between different subsystems, which brings us to another problem: where do you start benchmarking the kernel as a whole?
Re:Expected by lpp · 2005-05-02 06:13 · Score: 2, Funny

GCC 4.0.0 can't be all that bad. After all, that's one less Gentoo user I have to listen to go on about how awesome their uber-tweaked system is.
Re:Expected by uid8472 · 2005-05-02 06:33 · Score: 2, Informative

POVray used to be written in plain C, but recent versions -- 3.5 and later, I think -- use C++.
Re:Expected by ma_luen · 2005-05-02 06:34 · Score: 5, Informative

I think you are over estimating the interest of the research community in working on gcc. The move from the intentionally underdocumented and ill defined intermediate representations to tree ssa is a huge step for gcc. Unfortunately, there is still no real effort to make the platform attractive to do experimental work on.

The McCat compiler from McGill (which is what gcc borrowed the ssa rep from), C-- or the LLVM project all provide a much nicer platform. The internal representation is clearly documented, there are frameworks and examples for writing new passes and most importantly they all allow for whole program compilation.

Until gcc decides to support some of this the project will continue to be ignored by research groups. This might be fine since research compiler work can be fairly ugly and it is just easier to port what works.

Otherwise I agree that the move to ssa form is a critical step for gcc to take and it will enable it to become a "modern" compiler. More emportantly it will enable the inclusion of the large body of compiler work that is based on ssa forms.

Mark

What about... by elid · 2005-05-02 04:47 · Score: 2, Interesting

...Tiger? Wasn't it compiled with GCC 4.0?

Re:What about... by scotlewis · 2005-05-02 04:56 · Score: 5, Informative

Yes and no. The default compiler is GCC4, however, the kernel and much of the OS (pretty much all of Darwin, in fact) are still compiled with GCC3 because they haven't completely cleared the codebase of GCC3-isms.

That said, remember that the submitter is talking about GCC4 on x86 platforms, and remember that Apple is putting a lot of work into making sure the PowerPC optimizations are as good as possible. Not to mention things like GCC4's auto-vectorization of code to take advantage of the Altivec unit (which has a more noticeable effect than MMXing x86 code).

It would be nice to see some test results for Apple's GCC versions 3 and 4.
Re:What about... by Space+cowboy · 2005-05-02 18:23 · Score: 3, Interesting

It would be nice to see some test results for Apple's GCC versions 3 and 4.

Well, I did have a bunch of results for you, but the CRAPPY LAMENESS FILTER won't let me post them. Apparently I have to use less 'junk' characters (of course the CRAPPY PROGRAMMER didn't define what a 'junk' character is in the error message, so that's NO USE WHAT-SO-EVER.)

So, I guess I'll summarise. gcc version 4 is slightly worse than 3.3, and slightly better when the tree-vectorize option is passed and altivec code is generated.

Simon

--
Physicists get Hadrons!

The performance of compiled code by pclminion · 2005-05-02 04:49 · Score: 5, Informative

This has always bugged me.

Some people spend 10 hours tweaking compiler settings and optimizations to get an extra 5% performance from their code.

Other people spend 2 hours selecting the proper algorithm in the first place and get an extra 500% performance from their code.

To semi-quote The Matrix: One of these endeavors... is intelligent. And one of them is not.

Re:The performance of compiled code by pclminion · 2005-05-02 04:54 · Score: 3, Insightful

At some point you've got the best algorithm, you've profiled, you've hand-optimised, you've got the fastest hardware you can afford....and you *still* need that last 5%. That's when you spend 10 hours tweaking compilers settings...
If you really, positively need an extra 5% performance, you might as well just buy a computer that's 5% faster.
Re:The performance of compiled code by Jeremy.DeGroot · 2005-05-02 04:55 · Score: 3, Insightful

You should do both. Choosing the right algorithms is crucial, no doubt about it. But if you've got a massive database application, that 5% can represent a huge amount of work and be worth the trouble. A little bit of extra performance can, in many cases, go a long, long way towards adding to the value of the software. Both endeavors are intelligent in many (if not most) cases. Performance is important in software, and any little bit you can squeeze out will likely be a big deal.
Re:The performance of compiled code by Stiletto · 2005-05-02 04:57 · Score: 4, Funny

If you really, positively need an extra 5% performance, you might as well just buy a computer that's 5% faster.

You work at Microsoft, right? No? Intel?
Re:The performance of compiled code by Minwee · 2005-05-02 04:59 · Score: 5, Insightful

Unfortunately, including a faster computer with every copy of the code you distribute may be prohibitively expensive.
Re:The performance of compiled code by kfg · 2005-05-02 04:59 · Score: 5, Insightful

And in both groups you will find people who believe that execution speed is the measurement of code quality.

KFG
Re:The performance of compiled code by pclminion · 2005-05-02 05:03 · Score: 4, Interesting

You're obviously a small box user. Have you ever worked in the real world where huge batch runs can take weeks?
Yes.

You think companies should splash out another million or too on new hardware, just because you use a pissy little machine?

I think that companies should re-evaluate their "need" for an extra 5% performance. Here's an idea -- if you need something 10 minutes faster, why not start the process 10 minutes sooner?

5% just gets lost in the noise. You beef up your system, making it 5% faster... And then some retard in production makes a mistake and sets you back six weeks.
Re:The performance of compiled code by mattgreen · 2005-05-02 05:11 · Score: 4, Insightful

It is because it is easier to delve into needlessly technical aspects afforded by compiler settings and 'optimizations' than it is to admit that one's algorithm is not sound. Kids running Gentoo delude themselves into thinking that omitting the frame pointer on compiles is going to make a massive difference in terms of performance, and fail to remember it makes bug hunting far more difficult when applications crash. Additionally, the 5% gain mentioned can be a severe overstatement. I frequent a game programming board, and the widespread use of C++ has led to an abundance of nano-optimization threads, the most amusing of which was an attempt to optimize strlen().

Optimizing every single line of code is a complete waste of time, since the 80/20 rule generally applies. Use a profiler to determine where that 20% is.
Re:The performance of compiled code by The+Snowman · 2005-05-02 05:13 · Score: 3, Interesting

I think that companies should re-evaluate their "need" for an extra 5% performance. Here's an idea -- if you need something 10 minutes faster, why not start the process 10 minutes sooner?

In any large organization, the process gets in the way. Some suit decides the product needs a new feature, or needs to ship sooner, or whatever, and this slowly trickles down to the developers who suddenly are put in crunch time where every minute counts. Schedules and deadlines may change daily. People's jobs may be at risk. Shit happens.

Nobody really likes it, but that is sometimes how we arrive at the point where we "need" an extra 5% performance, where we "need" the program to finish ten minutes sooner. Starting earlier is not always an option, usually because you don't know you even have to start *at all* until the last minute.

--
24 beers in a case, 24 hours in a day. Coincidence? I think not!
Re:The performance of compiled code by IHateSlashDot · 2005-05-02 05:25 · Score: 3, Insightful

The point of this article is compiler optimizations, not algorithm selection. At the point that I look at compiler performance, I've already done all of the algorithm tuning so your point is moot. This is a very interesting benchmark for those that of who already write good code and want the compiler to make the best of it.
Re:The performance of compiled code by BlurredWeasel · 2005-05-02 05:43 · Score: 5, Insightful

I run gentoo (not for performance, but mainly because I am familiar with it, and it is easy), and you know what...I don't bug hunt. And adding -fomitframepointer or whatever the hell the option is (its in my flags somewhere) doesn't cost me anything, makes my system say (made up stat) 5% faster and I am happy. It makes no sense why you should deride me (read: gentooers) as an idiot. We're just end users, and if we can get a little bit of performance for free, well why not.
Re:The performance of compiled code by zCyl · 2005-05-02 06:01 · Score: 2, Insightful

I think that companies should re-evaluate their "need" for an extra 5% performance.

If you're talking about a program written by one person to be run by one person, or written by five people to be run by five people, or a program that will be run a limited number of times or while people are getting coffee, then absolutely you are correct.

But if you're talking about a small group of programmers making an interactive program (including simulations which people wait for the answer to before starting another run) to be run by millions of people, or to be run iteratively millions of times or over an enormous dataset of comparably size, then 5% is absolutely worth it. If you spend 10 manhours tweaking out 5%, and you've gained only a mere 100 milliseconds, then as a whole you've made out quite well after the collective time saved by those millions of people, or by the millions of runs, are accumulated. And often 5% can result in much more time savings than that.

If you really, positively need an extra 5% performance, you might as well just buy a computer that's 5% faster.

If you can afford all the computers that are 5% faster, then do both! Then you get 10%, and double the benefit. If the first 5% makes a significant difference for a certain application, then the second one probably will as well.
Re:The performance of compiled code by caluml · 2005-05-02 06:07 · Score: 2, Funny

Kids running Gentoo delude themselves into thinking that omitting the frame pointer on compiles is going to make a massive difference in terms of performance, and fail to remember it makes bug hunting far more difficult when applications crash.
Gentoo? Crash? No way, dude. It, like, never crashSegmentation fault.

--
Get your own free personal location tracker
Re:The performance of compiled code by m50d · 2005-05-02 06:24 · Score: 4, Funny

Dude, when did you ever see "Segmentation fault" sent over the network? If you're going to use that old joke at least do it r#i*&$£!"^ NO CARRIER

--
I am trolling
Re:The performance of compiled code by damiena · 2005-05-02 07:38 · Score: 3, Funny

AYNARD: It reads, 'Here may be found the last words of Joseph of
Aramathea. He who is valiant and pure of spirit may find the
Holy Grail in the Castle of Segmentation Fault'.
ARTHUR: What?
MAYNARD: '... the Castle of Segmentation Fault'.
BEDEVERE: What is that?
MAYNARD: He must have crashed while carving it.
LAUNCELOT: Oh, come on!
MAYNARD: Well, that's what it says.
ARTHUR: Look, if he was crashing, he wouldn't bother to send it over the network
'aaggggh'. He'd just print it!
MAYNARD: Well, that's what's his comment says
GALAHAD: Perhaps he was dictating.
Re:The performance of compiled code by Brandybuck · 2005-05-02 07:58 · Score: 4, Insightful

If you have a choice of algorithms, then of course use the better algorithm. But for most of the day-to-day code we deal with, we don't have that choice, because we're not dealing with code that has any grand algorithms to it. For example, if I'm writing a GUI frontend to a command line app, what are my choices of algorithms? Not much.

In my real life coding work, the places where algorithm efficiency makes a difference are far outweighed by those places that don't. And of those places that do make a difference, the performance is rarely a critical need. For example, I just coded up some RAMDAC lookup tables, and a difference of algorithm would make a huge difference in efficiency. But this particular routine was triggered by a user event (clicking a button in a config dialog), so that my dogslow but highly readable/understandable algorithm wasn't a bottleneck for anything. In this case tweaking the compiler settings would have given a 5% boost to everything, but a change in algorithm would only have given a 1/10 second boost for an event that would happen approximately once a week or less.

--
Don't blame me, I didn't vote for either of them!
Re:The performance of compiled code by mattgreen · 2005-05-02 13:27 · Score: 3, Insightful

Exactly. You couldn't distinguish an app compiled with frame pointers disabled from one with frame pointers enabled. The reason I point this particular optimization out is that it makes life hell for developers if you end up sending a core dump file in. This is a net loss in my book, since open source software evolves rapidly and being able to assist in reporting bugs is vital.

How is pointing out that one optimization people crow about is largely ineffective being an asshole?
Re:The performance of compiled code by Dr.+Weird · 2005-05-02 16:30 · Score: 2, Insightful

I think that companies should re-evaluate their "need" for an extra 5% performance. Here's an idea -- if you need something 10 minutes faster, why not start the process 10 minutes sooner?
Is this a troll? In most high-performance computing environments (national and local supercomputing centers) -- at least when they are being well-utilized -- they are in non-stop use. You don't start ten minutes sooner because you or another user are hammering the machine with other jobs.

I got your value by Anonymous Coward · 2005-05-02 04:51 · Score: 4, Funny

"Like a baby, we won't really appreciate its value until it's matured a bit."

Does this mean I have to wait until it's 18?

Re:I got your value by Martin+Blank · 2005-05-02 04:55 · Score: 5, Funny

Generally yes, although you'll only have to wait until 16 in some states.

--
You can never go home again... but I guess you can shop there.

Fast KDE compile. by Anonymous Coward · 2005-05-02 04:52 · Score: 4, Informative

It's damn fast for KDE compile as someone tested.

Re:Fast KDE compile. by badfish99 · 2005-05-02 04:56 · Score: 3, Informative

Well, the article you link to starts with the words:
KDE sources now blacklist gcc 4.0.0 because it miscompiles KDE
It must be easy to compile fast if you don't mind getting the wrong answer.
Re:Fast KDE compile. by AArmadillo · 2005-05-02 05:51 · Score: 4, Informative

http://gcc.gnu.org/bugzilla/show_bug.cgi?id=20973
GCC bug report
http://lists.kde.org/?l=kde-cvs&m=111451142117674& w=2
KDE CVS report

It involves some problem with register allocation. It seems only to miscompile KHTML, and there is already a patch attached to the GCC bug report (although the patch just disables the optimization that is causing the problem, rather than fixing the core problem itself).

The Future? by liam193 · 2005-05-02 04:52 · Score: 2, Interesting

Version 4.0.0 is laying a foundation for the future, and should be seen as a technological step forward with new internal architectures and the addition of Fortran 95.

While I know the benefits of Fortran 95 are a big thing, saying it's a technological step forward to incorporate for the first time a 10 year old standard seems a bit ridiculous. When I first saw this article I had to check my calendar to make sure it was May 1st and not April 1st.

Re:The Future? by discordja · 2005-05-02 05:01 · Score: 5, Funny

I can see how that'd throw you off since it's May 2. :)

--
I stole this .sig
Re:The Future? by Anonymous Coward · 2005-05-02 05:07 · Score: 2, Insightful

Considering the glacial pace of change in the Fortran world, using a "standard" created as recently as 1995 is considered to be a radical and reckless act.
Re:The Future? by Lemming+Mark · 2005-05-02 05:29 · Score: 2, Informative

It's not just the addition of F95, which obviously only benefits Fortran users.

GCC4 has a new optimisation architecture called "Tree SSA". This introduces a new representation (well, actually two: GENERIC and GIMPLE, although the latter is a subset if the former) for programs under compilation. The GIMPLE representation is used for advanced high-level optimisations before feeding the code into the compiler "back end" for architecture-specific optimisation and code generation.

The advantages of Tree SSA are multiple:
* cleaner architecture
* allows high level optimisations that were previously hard or impossible to do at the RTL (Register Transfer Language) level used by the backends
* despite being "high level" many optimisations that take advantage of program structure can be made language independent because of the GENERIC / GIMPLE representation

However, it'll take a while for new optimisations that have been enabled by this framework to be written. The idea is that Tree SSA breaks a fundamental barrier to the continued improvement of optimisation in GCC and should yield gains in years to come.

There are some other nifty things in GCC4 like the "mudflap" system for detecting program errors. Enhanced type-checking fussiness is also welcome as far as I'm concerned, even if it results in some compile errors.
Re:The Future? by GauteL · 2005-05-02 09:42 · Score: 2, Informative

It IS a big thing. This is the first freely distributable, readily available compiler of Fortran 95.

Up until now, my PhD work has needed compilers I can't just simply install without high fees, because the academic free license for propriatary compilers still sounds a bit fishy in it's requirements. This is actually a major boost for the Scientific Computing community.

However, lots of people have just NOW started to trust current F95 compilers (lots of academic code are still written in F77). It will be several more years until they trust the GNU Fortran 95 compilers.

Besides, while it is called Fortran 95, does not mean it was actually in heavy use by 1995.

We have this thing on Earth, called tact. by Tim+Browse · 2005-05-02 04:53 · Score: 3, Funny

Like a baby, we won't really appreciate its value until it's matured a bit.

Is that what you say to new parents? :-)

Screenshots? by jmcneill · 2005-05-02 04:54 · Score: 4, Funny

Where are the screenshots?

Re:Screenshots? by wawannem · 2005-05-02 05:09 · Score: 2, Informative

damn... I meant:

Here you go:

bash$ gcc -o test main.c
bash$

Compilation Speed Test by a KDE developer by Anonymous Coward · 2005-05-02 04:55 · Score: 5, Interesting

http://www.kdedevelopers.org/node/view/1004 Qt: -O0 -O2 gcc 3.3.5 23m40 31m38 gcc 3.4.3 22m47 28m45 gcc 4.0.0 13m16 19m23 KDElibs (with --enable-final) -O0 -O2 gcc 3.3.5 14m44 27m28 gcc 3.4.3 14m49 27m03 gcc 4.0.0 9m54 23m30 KDElibs (without --enable-final) -O0 gcc 3.3.5 32m56 gcc 3.4.3 32m49 gcc 4.0.0 15m15 I think KDE and Gentoo people will like GCC 4.0 ;)

kettle? black? by dem3tre · 2005-05-02 04:58 · Score: 2, Insightful

I love the open source movement but I wonder why the following comment is OK for open source projects and not close source?

quote "That said, no one should expect a "point-oh-point-oh" release to deliver the full potential of a product, particularly when it comes to a software system with the complexity of GCC."

I bet no one would dare say that about certain product from Redmond.

Re:kettle? black? by Dan+Berlin · 2005-05-02 05:01 · Score: 5, Informative

Significant difference. If you ask gcc folk (like me), we'd happily tell you that 4.0 will probably be, performance wise, win in some cases, and a lose in others. Anytime you add large numbers of optimizations, it takes a while to tune everything else so that we get good generated code. 4.0 is more a test of the new optimizers than something that is supposed to produce spectacular results in all cases.
Re:kettle? black? by imroy · 2005-05-02 05:39 · Score: 3, Insightful

I think you're a little mixed up there. When we criticize MS, we're often referring to the release of known buggy and badly implementated software to the general public. Instead the submitter of this article is referring to the "full potential" of the new optimization framework in GCC-4.0. It will, in theory, allow for much better optimizations to be performed on internal parse tree. But for now many of the CPU models are incomplete or non-existant, or something like that. The full potential of these optimizations will be delivered in a later release, either 4.0.x or 4.1, or perhaps a little of both. And the GCC team wouldn't have released GCC 4.0 with known, serious bugs.

Or perhaps I've just been trolled. Wouldn't be the first time. I see that this is your first comment on slashdot. Welcome. Just don't troll.
Re:kettle? black? by Miniluv · 2005-05-02 06:13 · Score: 2, Informative

No, what it means is that if you're doing anything that is compiler sensitive, you should test before you deploy. Some people are seeing gains with GCC4, other people are seeing the same, and some are seeing losses. Each group needs to make a decision about a compiler for their situation.
The GCC folks released this with it being well documented that it wasn't going to blow the doors off for everyone in every situation, but instead that this was a major step forward for internals, which should allow them to make some major steps forward that are externally visible soon.
Re:kettle? black? by Grishnakh · 2005-05-02 06:56 · Score: 2, Interesting

The other argument that has been missed being... GCC is not charging you money to upgrade/use 4.0. When you pay a bunch of money, expectations of performance, reliability, and speedy bug-fixes rise.

Exactly. The goals in releasing software are completely different for GCC and MS.

For GCC, like a lot of open-source products, the idea behind releasing all-new x.0 versions is to get it out there so early adopters will start using it, filing bug reports, etc. It's the same reason the Linux kernel releases new even-numbered versions (2.x.0) before they're really ready for mainstream use. If they waited too long, people would avoid them, thinking they're "just development versions", and it'd take forever to get the bugs out. Unlike with commercial software, you need to know what you're doing when you use these open-source products directly, rather than using a packaged distribution. If you want a stable system, don't download GCC or Linux directly and compile from sources. Get the version that comes with your distribution.

MS, on the other hand, wants to get money and marketshare when they release all-new versions of their software. When they release a new version, they do so with the implication that this product is ready for general use, and that all the bugs are worked out (after all, you're paying a lot of money for it, so shouldn't the debugging be part of the price?). This is reinforced by the fact that most updates are not free.

If I buy a car, I expect it to work reliably, and not break down within the warranty term, as long as I perform the required maintenance. If a wheel falls off as soon as I drive out of the dealer's lot, they have to fix it at their own expense. I might even get a new car under the Lemon Law if this happens multiple times. What's more, if the car has any serious defects, these are usually fixed for free under factory recalls.

But when I buy MS software, none of this applies. There is no warranty whatsoever. If it doesn't work, there is no recourse. And if they release a new version that fixes a lot of problems, there's no guarantee it will be free. Did all the Windows ME buyers get a new version of Windows to fix that disaster of an OS? No, they had to upgrade at their own expense.

The bottom line is that the product MS releases is a shrink-wrapped product that is supposedly intended for direct use by the general public, which implies a certain level of fitness. GCC, on the other hand, does not release its product directly to the general public (though they're certainly free to download it and try it out if they wish). Its product is intended for use by software developers (who want to try out the latest, and possibly buggy compiler) and distributions. People who just want to surf the net or write documents should not be concerned with this, nor should developers who want to produce stable code. These people should all be simply using precompiled distributions, and using the software versions provided by them.

The value of a baby by Laxitive · 2005-05-02 05:02 · Score: 5, Funny

"Like a baby, we won't really appreciate its value until it's matured a bit."

Seriously, this is why I don't appreciate babies. At least after about 4 or 5 years, they're useful for mild manual labour. Sure they'll complain and cry, but all you gotta do is tie their dishwashing to the number of fish heads they're allotted that week. Works pretty well, I gotta say. Anyway, at least they're not a net productivity drain like babies are.

Anyway, what I mean to say is: from your description, it looks like I'll be staying away from GCC 4 for a while, too. Goddamn babies.

-Laxitive

Kind of a weird review by Just+Some+Guy · 2005-05-02 05:03 · Score: 4, Interesting

As far as I'm concerned, unless you're using "-Os" because you're deliberately building small binaries at the expense of all else - say, for embedded development - the resulting binary size is completely irrelevant as a compiler benchmark. What if the smaller result uses a slower, naive algorithm (which in this case would mean choosing an obviously-correct set of opcodes to implement a line of C instead of a less-obvious but faster set)?

Second, the runtime benchmarks were close enough to be statistically meaningless in most cases. The author concludes with:

Is GCC 4.0 better than its predecessors?
In terms of raw numbers, the answer is a definite "no".

My take would have been "in terms of raw numbers, it's not really any better yet." It's close enough to equal (and slower in few enough cases that I'd be willing to accept them), though, that I'd be willing to switch to it if I could do so without having to modify a lot of incompatible code. It's clearly the way of the future, and as long as it's not worse than the current gold standard, why not?

--
Dewey, what part of this looks like authorities should be involved?

Re:Kind of a weird review by larien · 2005-05-02 05:13 · Score: 2, Informative

Smaller binaries = quicker load time (less disk I/O or memory being moved around) and smaller memory footprint. Yes, this is mostly in embedded apps where memory sizes might still be in KB rather than GB, but if you're analyzing performance, memory usage is relevant, even if it may not be your primary concern.
Re:Kind of a weird review by Waffle+Iron · 2005-05-02 05:59 · Score: 2, Informative

the resulting binary size is completely irrelevant as a compiler benchmark.
As the ratio of raw CPU MIPS to memory bandwidth and latency continues to increase, systems lean more and more on caches to compensate. Since larger code eats up more of the scarce cache resources to do the same job, small code can be more important than code with the lowest raw instruction clock count. This can be especially important in C++, where redundant code generated by templates can really get out of hand if not properly controlled.

Re:intel compiler by MORB · 2005-05-02 05:07 · Score: 5, Interesting

Intel compiler's reason why it generate faster code is because it does auto-vectorisation (ie, it automatically finds out how to transform some code patterns to take advantage of native vector operation, such as those provided by sse). They started to implement this in gcc 4.0, but it's a veyr first iteration that for what I know is still kinda limited. I'm not even sure it's enabled by default, even in -O3. There are lots of improvement there targeted at gcc4.1.

The ? operator by shreevatsa · 2005-05-02 05:07 · Score: 5, Funny

The worst part is that they now say that the

<?

,

>?

,

<?=

and

>?=

operators are deprecated, and will be removed. Damn, I liked them so much. Sure, they weren't part of the standard, and only a GCC extension, but it's just so much more fun to say

a = b <? c

than to say

a = min(b,c)

or even

a=b<c?b:c

. The best use was saying

a<?=b

instead of the painful

a=min(a,b)

.

Re:The ? operator by keshto · 2005-05-02 05:19 · Score: 3, Interesting

Dude, you've never coded in a commercial environment , have you ? Or are all your company's projects meant to be compiled by a specific version of gcc only, regardless of the OS and architecture? I use gcc exclusively these days, but it's for my research. Back when I was working, we had to code for both VC++ and g++ . Atleast, the ones of us who worked on core-engine code. Fixing some moron's VC++ -specific idiocy sucked.
Re:The ? operator by City+Jim+3000 · 2005-05-02 05:40 · Score: 2, Insightful

I'd say "thank god" after reading that text. And mod your comment "funny".

a=min(a,b)

that is one of the most self-descriptive statements i have ever seen.
Re:The ? operator by shreevatsa · 2005-05-02 06:30 · Score: 4, Funny

I don't get it. I was very serious when I wrote that, still this comment has 60%Funny, and even 20%Troll.
In case you were wondering why anyone would want to use a=min(a,b), you really haven't programmed enough. To take a simplistic example, how would you find the largest integer in an array? (Sure, you can just #include , then say *max_element(a,a+N) and be done with it, but let's suppose you don't want to do that...)
Well, the way to do it would be to write a loop like this:
int largest = 0; for(int i=0;i<N;++i) largest=max(largest,a[i]);
I really think it's faster and better to code the last line as largest>?=a[i]. There is less unnecessary clutter.
Oh well. I guess this comment will now lose all its Funny mod points, but what the heck.
Re:The ? operator by Heywood+Jablonski · 2005-05-02 07:51 · Score: 5, Funny

The worst part is that they now say that the <? , >? , <?= and >?= operators are deprecated...
That's because they were conflicting with the new gphp front-end.
Re:The ? operator by Phleg · 2005-05-02 07:54 · Score: 2, Insightful

If I ever see a developer do something as stupid as this on a job application, there's no way they will ever get a job working for me.

Having clean, readable code is far, far more important than saving a few minutes in total in a project. Using compiler-specific features is generally frowned upon, but acceptable in cases where there are significant performance or time gains. Using a compiler-specific alias to save yourself a few extra keystrokes at the extreme cost of readability is just being lazy, and not thinking about how that code will be maintained in a year.

I feel the same way about the ternary operator, actually. There are a few cases where it's clear enough to be used, and where it saves several lines of typing. However, 95% of the time that people use it, it only makes the code impossible to understand.

--
No comment.
Re:The ? operator by Rhesus+Piece · 2005-05-02 15:57 · Score: 2, Informative

You know... funny doesn't always mean
we're laughing _with_ you. ;-)

Do the new models replace or confuse old ones? by expro · 2005-05-02 05:07 · Score: 3, Interesting

I agree that this compiler is a cornerstone of free software.

But it was very frustrating to me to try to port the compiler to a new platform by modifying existing back ends for similar platforms.

After spending a few months on it (m68k in this case), I could not escape the layers of hack upon cruft upon hack upon cruft, that made it extremely difficult to make even fairly superficial mods because everyone seemed to be using the features differently and all the power seemed lost in hacks that made it impossible to do simple things (for me anyway). I am quite familiar with many assemblers and optimizing compilers.

I hope that the new work makes a somewhat-clean break with the old, otherwise, I would fear yet another layer to be hacked and interwoven, with the other ones that were so poorly fit to the back ends.

I suspect that not all backends are the same and perhaps the same experience would not be true for a more-popular target, but it seems to me it shouldn't be that hard to create a model that is more powerful yet more simple. Such would seem to me to be a major step forward and enable much greateer optimization, utilization, maintainability, etc.

-ftree-* by Anonymous Coward · 2005-05-02 05:08 · Score: 5, Interesting

The whole point of gcc4.0.0 is the tree-ssa thing. The author of this test didn't seem to notice that this stuff doesn't get enabled in -O2 nor -O3, but does have to be enabled by hand. This includes autovectorization (-ftree-vectorice) among other things which may make a difference.

If I was him, I'd repeat the tests again enabling the -ftree stuff when building with gcc4.0.0.

Re:-ftree-* by Florian+Weimer · 2005-05-02 05:56 · Score: 3, Informative

The whole point of gcc4.0.0 is the tree-ssa thing.

True, this is the major infrastructure change which justified the "4".

The author of this test didn't seem to notice that this stuff doesn't get enabled in -O2 nor -O3, but does have to be enabled by hand.

No, most tree-ssa optimizers are enabled implicitly at -O2 (they replace quite a few of the old RTL-based optimizers). Only some numerics code can benefit from loop autovectorization (which has to be enabled explicitly; for most source code, it just increases compile time).

when? why? by ChristTrekker · 2005-05-02 05:10 · Score: 2, Interesting

At what point (of 3's evolution) would you say it surpassed 2.95? Why?

--

Constitutionally Correct

non-x86 arch? by ChristTrekker · 2005-05-02 05:12 · Score: 3, Interesting

What about the performance on MIPS? PPC? C'mon, people...enquiring minds want to know!

--

Constitutionally Correct

Re:What the hell? by Kupek · 2005-05-02 05:16 · Score: 2, Interesting

One of the major changes in the 4.0.0 release is the internal reorganization that allows for more aggressive optimizations. Hence, he tested how the optimized performance of the latest 3.x versus 4.0.0. How do you tell the compiler to optimize? Well, you have to pass it "lots of flags."

This is expected, I think by diegocgteleline.es · 2005-05-02 05:16 · Score: 5, Informative

I found this in the osnews announcement

"Before we get a bunch of complaints about the fact that most binaries generated by GCC 4.0 are only marginally faster (and some a bit slower) than those compiled with 3.4, let me point out a few things that I've gathered from casually browsing the GCC development lists. I'm neither a GCC contributor nor a compiler expert.

Prior to GCC 4.0, the implementation of optimizations was mostly language-specific; there was little or no integration of optimization techniques across all languages. The main goal of the 4.0 release is to roll out a new, unified optimization framework (Tree-SSA), and to begin converting the old, fragmented optimization strategies to the unified framework.

Major improvements to the quality of the generated code aren't expected to arrive until later versions, when GCC contributors will have had a chance to really begin to leverage the new optimization infrastructure instead of just migrating to it.

So, although GCC 4.0 brings fairly dramatic benefits to compilation speed, the speed of generated binaries isn't expected to be markedly better than 3.4; that latter speedup isn't expected until later installments in the 4.x series."

Re:This is expected, I think by Ian+Lance+Taylor · 2005-05-02 06:14 · Score: 2, Informative

That is only partly true. All gcc releases since 1.0 have integrated optimizations across all languages. What gcc 4.0 has added is a new, higher-level framework for language independent optimizations. The new framework, known as tree-ssa, permits more powerful optimization techniques, in particular some of the techniques which have been developed by the compiler research community since gcc was first written in the 1980s. The old language independent optimization framework, known as RTL, is still there and is still used in gcc 4.0.

Like a baby by Anonymous Coward · 2005-05-02 05:25 · Score: 4, Funny

Like a baby, we won't really appreciate its value until it's matured a bit.

"Come here son. Did you know your mother and I almost decided to not keep you when you were born? You were just a baby at the time, you didn't seem to have any value. I mean, seriously, what use is there for a baby? I'm glad we didn't make that mistake.
Now go play outside and don't come back before dinner time, and pick up the trash when you leave."

It really does depend on the code by DoofusOfDeath · 2005-05-02 05:27 · Score: 4, Informative

There was one test case I did for my own use. I've got a small C++ program that's computationally heavey and has a small working set of memory.

On that program (on a P4) I got an 11% reduction in runtime using GCC 4 vs. GCC 3.3.5. This was actually a big deal for me work.

The lesson here: You're mileage with GCC 4.0's improvements may vary from the benchmarks, and you might want to try it on your own code.

Any big spoilers in the review? by GnuPooh · 2005-05-02 05:32 · Score: 4, Funny

I don't wanna read the review it reveals the ending or something. I mean what good is a compiler without some big unexpected surpises?

"Like a baby..." by Gannoc · 2005-05-02 05:36 · Score: 4, Funny

Like a baby, we won't really appreciate its value until it's matured a bit.

Are you kidding? Babies are worth $15,000-$20,000 easily, even if they're female. Once e-Bay stops being a bunch of pussies and we get some open bidding started, I expect their value to go up even higher.

Once again, we see that the ./ editors have no idea what they're writing about.

Re:Gentoo by TelJanin · 2005-05-02 05:44 · Score: 2, Insightful

Only on /. could this get modded up...

He's not a dumbass because he uses Gentoo. It's pretty obvious that he doesn't know what he's talking about. Straight from TFA:

Some folk may object to my use of -ffast-math -- however, in numerous accuracy tests, -ffast-math produces code that is both faster and more accurate than code generated without it.

"I don't know about you, but if I want my math done fast and wrong I'll ask my cat" - Anonymous

Re:Speed/Performance Benchmarks?? by tomstdenis · 2005-05-02 05:46 · Score: 2, Informative

Not always. Usually Intel's CC produces faster code except when you give it code [LibTomMath] that it can't efficiently optimize.

Also as he hates to have pointed out his options aren't always optimal.

Quite a few applications are faster with 3.4.3 on a P4 with "-fno-regmove" as well as -O3. My AES for instance goes down from >500 cycles/block to 380 cycles/block on my Prescott P4 with this switch.

380 cycles/block is faster than Intel CC v8.0 with "-O3 -xP -ip" by about 30 cycles/block.

Also the guy probably didn't try profiling. I can drop a fair chunk of cycles in doing ECC point multiplies on my P4 with GCC by doing a profiled build system. ...

ETC!

Tom

--
Someday, I'll have a real sig.

Tree-ssa is in there by jensend · 2005-05-02 05:51 · Score: 3, Insightful

Unless the GCC documentation is very wrong, the only tree-ssa optimizations in 4.0 which don't get turned on by default at -O3 are -ftree-loop-linear, -ftree-loop-im, -ftree-loop-ivcanon, -fivopts, and -ftree-vectorize. It's true that some of these may be good optimization wins (probably increasing compile time in the process, but that's what the higher optimization levels are all about), but there are plenty of tree-ssa optimization passes being used in these tests.

Auto-vectorization, by the way, does not fall into a "obvious optimization wins which perhaps should be enabled at -O3 by default" category. It can bring very big performance benefits in some situations, but it should be used with caution.

Re:Tree-ssa is in there by Dan+Berlin · 2005-05-02 06:19 · Score: 2, Informative

of the ones you listed, only -ftree-loop-linear and -ftree-vectorize are not on by default. (you also missed a couple, though they slipped my mind)

4.0.0 broke backward compatibility big time by cpghost · 2005-05-02 05:52 · Score: 4, Informative

Recently, a discussion took place on a FreeBSD mailing list wether the project wanted to use GCC 4.0.0 as the system compiler. Some objections where:

KDE would not compile cleanly
Most of the 12.000+ ports would need manual tweaking because of other incompatibilities.
Some C constructs have been obsoleted, requiring huge sweeps over the existing BSD code base.

If I understood it right, We won't have a GCC 4.0.0 system compiler on FreeBSD anytime soon. Installing the gcc40 port is, of course, always possible.

--
cpghost at Cordula's Web.

Re:4.0.0 broke backward compatibility big time by chrysalis · 2005-05-02 06:14 · Score: 4, Insightful

gcc 4.0 just tries to follows standards. If something doesn't compile with gcc 4, don't blame the compiler. The source code was broken at the first place.

--
{{.sig}}
Re:4.0.0 broke backward compatibility big time by mandolin · 2005-05-02 07:26 · Score: 3, Informative

If you actually follow the link, it looks like gcc4 was miscompiling bits of kde. Here is an example.
"not compiling cleanly" may have been a less-accurate description of the problem.
Re:4.0.0 broke backward compatibility big time by JoeBuck · 2005-05-02 08:25 · Score: 4, Informative

In the case of the KDE problem, there were two bugs, one of which was in KDE, but the other of which was in 4.0.0. So it sometimes is appropriate to blame the compiler. That bug is fixed in CVS and will be fixed in 4.0.1.
It should not surprise anyone that the first 0.0 release has some bugs. It's the first release of a compiler with a completely new optimization structure (tree-ssa). I would advise waiting for 4.0.1 for a production-quality release, or go with vendor patches (by making Fedora Core 4 with a 4.0.0 based compiler, Red Hat will probably shake out a few more bugs).

What a crappy review by Fefe · 2005-05-02 06:04 · Score: 3, Insightful

Uh, and this review is helping us... how?

lame uses assembler code for vectorization. One of the new features of gcc 4 is the beginnings of a vectorization model. A good test for gcc 4 would have been to compile some C-only bignum libraries, and Ogg Vorbis! povray is also a good example, but then you need to test more than one specific test-run. Maybe gcc 4 makes radiosity in pov-ray 400% faster at a 2% cost in the rest of the code?

This guy is the Tom's Hardware of Linux reviews, except he doesn't have the annoying ads, and he does not split his lack of content over 30 HTML pages.

The new warnings of gcc 4 have helped me find a bug in my code. That saved me a week. Consider how much faster gcc 4 needs to make pov-ray or lame to save you a week of work!

gcc 4 can now reorder functions according to profile feedback. That should make large C++ projects faster. Also, the ELF visibility should make KDE start much faster. This should have been tested!

Please note that I'm not saying gcc 4 produces faster code. I don't rightly know. I do know it produces smaller code for my project dietlibc, where size matters more than speed.

Funny you should say that - a story about sprintf by Szplug · 2005-05-02 06:11 · Score: 3, Interesting

A prospective professor gave a talk at our school, he'd been working at some grid computing lab. Well their code was underperforming, so they profiled it and found that sprintf was taking 30? 50? 70? % (I forget) of their code time - the machines had to communicate with each other a lot, and they used sprintf to serialize. (It's an easy fix - C++'s stream operators are much faster, since the type is known at compile time.)

Oh yeah, also, for Quake 1, John Carmack hired Michael Abrash, an assembly language guru, to help out. Well Abrash found that GCC's memcpy() (or whatever it was) was copying byte-by-byte instead of by word (or something, I don't remember) and his reimplementation of that alone, doubled the frame rate!

Just some interesting counter examples to keep in mind :)

--
Someday we'll all be negroes

Users?? by bobbuck · 2005-05-02 06:18 · Score: 5, Funny

Does Gentoo have users? I though they only had installers.

Observations on Apple's GCC4 release by Paradox · 2005-05-02 06:30 · Score: 4, Interesting

It isn't a huge deal for most people, but it seems like the new GCC is singificantly better at optimizing for the PowerPC now.

I've been working with the GNU GSL on my mac a lot, and I recently updated to Tiger. The first thing I noticed when I recompiled the GSL with Apple's modified GCC4.0 is the significant and noticable speed increase. With this intense math stuff, doing SVD on 300x200 matricies, and it's shocking how much faster it is. I went from 3-5 seconds down to less than one.

I am not going to post any hard numbers because I haven't rigorously compared them yet, but I'll make some formal comparisons this week.

--
Slashdot. It's Not For Common Sense

Re:Funny you should say that - a story about sprin by Procyon101 · 2005-05-02 06:36 · Score: 2, Insightful

ACK! If 70% of your time is spent in a serialization function call, FORGET about optimizing the function call.... You are WAY too fine grained in your algorithm for effective parallelization. He's have been better off running the whole damn thing serially on a single box methinks. His fancy grid algorithm spent more time doing "grid" stuff than working on his problem!

No, the third run is for finding bugs by Bananenrepublik · 2005-05-02 06:44 · Score: 2, Interesting

If your compiler compiles correctly, a program (leaving floating point inaccuracies aside) should produce the same result no matter what compiler it is compiled with. I.e. a gcc 4.0 should produce the same results no matter if it's itself compiled with 3.4.3 or 4.0.

Re:No, the third run is for finding bugs by Anonymous Coward · 2005-05-02 08:29 · Score: 5, Informative

Yes, as long as it wasn't miscompiled.

Historically, GCC tends to bring out the worst in compilers. That is why when you build GCC, the system compiler will be used once, /without optimizations/ to produce a slow GCC 4.0 which can be used to compile itself. This is done twice (stage 1 compiles stage 2 and stage 2 compiles stage 3) so that 2 and 3 can be compared to ensure that there were no miscompilations, as it is unlikely that a miscompiled compiler will produce correctly executable machine code that replicates exactly.

Unlikely but possible. Look for the paper "Reflections on trusting trust" for a beautiful hack involving intentional miscompilations. The author basically changed the compiler so that when "login" was being compiled, the compiler inserted a back door. And when a new compiler was being compiled, the compiler would insert the code to insert the back door and to change the next compiler. And then no matter how much you checked teh source to either login or the compiler, you would never notice the back door.
Re:No, the third run is for finding bugs by petermgreen · 2005-05-02 10:04 · Score: 4, Informative

you mean http://www.acm.org/classics/sep95/ ?

--
note: i'm known as plugwash most places but i screwd up registering that here somehow in the past and now can't register

Re:Nice analogy... by Just+Some+Guy · 2005-05-02 07:08 · Score: 2, Insightful

Lighten up, Francis.

Signed, a parent of three.

--
Dewey, what part of this looks like authorities should be involved?

Re:Still generating 386 assembly? by Ian+Lance+Taylor · 2005-05-02 07:21 · Score: 4, Informative

You need to read up on the -march, -mfpmath, -mmmx, -msse, -msse2, -msse3, and -m3dnow options.

If you build gcc yourself, you can even make them the default by configuring with an appropriate --with-arch option.

No danger by SuperKendall · 2005-05-02 07:25 · Score: 2, Funny

I don't get it. I was very serious when I wrote that, still this comment has 60%Funny, and even 20%Troll. ....
Oh well. I guess this comment will now lose all its Funny mod points, but what the heck.

To the contray, now it's even funnier.

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

Re:intel compiler by Kupek · 2005-05-02 07:30 · Score: 2, Insightful

Auto-vectorization is not the reason why Intel's compiler is better. It certainly helps, but in my experience, not much. Intel's compiler just does better optimizations across the board. Which is no surprise: Intel is making a compiler for thier chipsets. They have inside knowledge of what's best to do when for a particular chip. Further, Intel's compiler is marketed as the fastest compiler for x86, which as far as I know, is true. Hence, they spend a lot of time on the optimizations.

GCC, on the other hand, has a different goal: get a working compiler on as many platforms as possible.

Re:intel compiler by X · 2005-05-02 07:48 · Score: 2, Informative

Um, if it doesn't make code run faster, what's the point of including it?

Sigh.. I wasn't saying that it doesn't make code run faster. I was saying that it doesn't necessasrily make code run faster. Auto-vectorization is only a win in certain circumstances. There are a whole host of optimizations that only apply in specific circumstances and/or only improve performance in certain circumstances and slow things down in others. If there weren't trade-offs with optimizations, compilers would just have "-O" and wouldn't bother with tons of other optimization flags.

--
sigs are a waste of space

God, this gets old. by stonecypher · 2005-05-02 08:21 · Score: 3, Insightful

You know, I remember when someone did this to GCC 3, comparing against 2.9.5.

4.0.0 is a brand new compiler. Lots of techniques in it are brand new. Lots of tweaks and polish can be applied. If you actually take the time to compare 3.4 to 3.0, you'll find that the gap is bigger than 4.0 to 3.4. Furthermore, if you compare 2.9.5 to 3.0, you'll find 2.9.5 is better than 3.0 by a much wider margin than 3.4 is to 4.0.

This is a misunderstanding of the nature of progress. 4.0 is a brand new compiler with brand new internal behaviors. Lots of things are at the It Works stage, instead of the It's Efficient stage. You can't compare a 3-year polished compiler to a 3-week polished compiler; it's utter nonsense.

If you want to compare 4.0 to something, compare it to 3.0, or sit down.

--
StoneCypher is Full of BS

Re:moronic review by ChaoticCoyote · 2005-05-02 09:59 · Score: 2, Informative

I have no clue what you're talking about.

One benchmark *was* C++ (povray), and, in fact, I use KDE as my desktop. It just so happens that most code in a distribution is written in C.

I have quite a bit of heavily-templatized C++ in my library and customer code, but it is either proprietary (under NDA) or unsuitable to timing. As I state in the article, C++ programmers should seriously consider GCC 4.0 for it's imrpoved compile times, if nothing else.

--
All about me

Re:4.0? by cimetmc · 2005-05-03 00:31 · Score: 2, Informative

The recommended compiler for linux 2.6.x is the main compiler which was used when linux 2.6 was released. The linux developers don't want to officially switch compilers in mid term of a version because that would be a potential additional source of bugs. It does not necessarily mean that newer GCC version would be less good. It's just that it might produce code that would behave differently for whatever reason and it would make development more difficult.

The dicussion about recommended GCC versions of the Linux kernel regularly pops up on the kernel mailing list. For instance, you can see one such a discussion here:
http://groups-beta.google.com/group/linux.kernel/b rowse_frm/thread/c2da87604102e689/5d754728f97e5105

A much better indicator on GCC quality is to see what versions various Linux distributions actually use. For instance, if you take SuSE pro 9.3, it uses GCC 3.3.5.

Marcel

105 of 429 comments (clear)