Performance Benchmarks of Nine Languages

Trig functions... by eples · 2004-01-09 03:30 · Score: 3, Interesting

I am not a compiler nerd (IANACN?), so maybe someone else can answer the following simple question:

Why are the Microsoft languages so fast with the Trig functions?

--
I'm a 2000 man.

Re:Trig functions... by Kingpin · 2004-01-09 03:34 · Score: 4, Funny

They probably cheat and use undocumented native OS calls.

--
Unable to read configuration file '/bigassraid/htdig//conf/14229.conf'
Geocrawler error message.
Re:Trig functions... by bigjocker · 2004-01-09 03:35 · Score: 3, Insightful

What is interesting in these functions is that, as pointed in the article, there seems to be something wrong with Sun's implementation for Java. Removing this test JDK 1.4.2 executes almost on par with Visual C++ (the winner).

This is (once again) proof that Java is not slow, in fact it's really fast. It's slow starting, and yes, consumes more memory than native code, but the gained security, stability and ease of programming (reduced development times) are worth the memory use increase.

Also, the memory use should be addressed by project Barcelona (I believe these will be available in the forthcoming J2SDK 1.5, along with generics, enums, etc).

--
Life isn't like a box of chocolates. It's more like a jar of jalapenos. What you do today, might burn your ass tomorrow.
Re:Trig functions... by Dr.+Evil · 2004-01-09 03:39 · Score: 4, Insightful

Don't forget that it is also percieved as slow since just about any application anyone has seen for a desktop environment written in Java has a sluggish GUI.
Yeah, I know Java's strengths aren't in the Desktop arena, they're in development and the back-end.
Re:Trig functions... by tealwarrior · 2004-01-09 03:41 · Score: 4, Informative

What is interesting in these functions is that, as pointed in the article, there seems to be something wrong with Sun's implementation for Java.

For many math functions java uses a software implementation rather than using the built in hardware functions on the processer. This is to ensure that these function perform exactly the same on different architectures. This probably accounts for the difference in performance.

--
In theory, there is no difference between theory and practice, in practice there is.
Re:Trig functions... by mengel · 2004-01-09 03:48 · Score: 5, Interesting

Probably the Microsoft languages use the Intel trig instructions.
In the case of Java, you find that the Intel floating point trig instructions don't meet the Java machine spec. So they had to implement them as a function.
It all depends if you want accuracy or speed.

--
- "History shows again and again how nature points out the folly of men" -- Blue Oyster Cult, 'Godzilla'
Re:Trig functions... by csnydermvpsoft · 2004-01-09 03:50 · Score: 5, Informative

If more people would use the SWT libraries (part of the Eclipse project) instead of the crappy AWT/Swing libraries, then this misconception would go away. SWT works by mapping everything to native OS widgets if possible, giving it the look, feel, and speed of a native app. I used Eclipse for quite a while before finding out that it is almost 100% pure Java (other than the JNI code necessary for the native calls).
Re:Trig functions... by Doomdark · 2004-01-09 03:53 · Score: 5, Insightful

Don't forget that it is also percieved as slow since just about any application anyone has seen for a desktop environment written in Java has a sluggish GUI.
It's in many ways unfortunate that with JDK 1.2 (Swing) and onwards, Sun pretty much dumped fast native support for GUI rendering. It has its benefits -- full control, easier portability -- but the fact is that simple GUI apps felt faster with 1.1 than they have done ever since (or even more). This is, alas, especially noticeable on X-windows, perhaps since often the whole window is rendered as one big component as opposed to normal x app components (in latter case, x-windows can optimize clipping better).
Years ago (in late 90s, 97 or 98), I wrote a full VT-52/100/102/220 terminal emulator with telnet handling (plus for fun plugged in a 3rd party then-open SSH implementation). After optimizing display buffer handling, it was pretty much on par with regular xterm, on P100 (Red hat whatever, 5.2?), as in felt about as fast, and had as extensive vt-emulation (checked with vttest). Back then I wrote the thing mostly to show it can be done, as all telnet clients written in Java back then were horribly naive, doing full screen redraw and other flicker-inducing stupidities... and contributed to the perception that Java is and will be slow. I thought it had more to do with programmers not optimizing things that need to be optimized.
It's been a while since then; last I tried it on JDK 1.4.2... and it still doesn't FEEL as fast, even though technically speaking all java code parts ARE much faster (1.1 didn't have any JIT compiler; HotSpot, as tests show, is rather impressive in optimizing). It's getting closer, but then again, mu machine has almost an order of magnitude more computing power now, as probably does gfx card.
To top off problems, in general Linux implementation has been left with much less attention than windows version (or Solaris, but Solaris is at least done by same company). :-/

--
I like paying taxes. With them I buy civilization -- Oliver Wendell Holmes
Re:Trig functions... by bigjocker · 2004-01-09 04:09 · Score: 4, Insightful

This is a perfect example of why I don't like java. If I use java then I have to stick to the decisions made by someone else even if they are completely wrong for my situation. But you are free to use yor C/C++ optimized functions in Java. Just make a wrapper class and access them natively, just like Java does access a lot of Math functions:
public static native double acos(double a); public static native double asin(double a);
Those are from the
StrictMath
class, used by the
Math
class. You did know that you have access to Java libraries source code, didn't you? For real math/science stuff java is horrible For real math stick with Fortran, SciLab or Matlab. For Real Time applications use C. No language will suit all needs. Personally I develop enterprise applications using Java and Games using C. Also, what I find humerous is the whole NIO (new IO) stuff. basically Java started out using threads to deal with multiple IOs but due to scaling issues they developed 'new' IO which is basically the equivilent of select! Yeah thats real 'new'!!! The tradittional IO subsystem in Java was a traditional one: sockets, streams, buffers, etc. It Is very scalable, just look at the Tomcat, Jetty and JBoss project before NIO appeared. NIO is just an optimization for very specific tasks, for some stuff you still need a separate Thread for each connection.

--
Life isn't like a box of chocolates. It's more like a jar of jalapenos. What you do today, might burn your ass tomorrow.
Re:Trig functions... by fuzzbrain · 2004-01-09 04:21 · Score: 3, Interesting

SWT libraries use native widgets. I think the parent post is perhaps overstating the speed increase of SWT over the latest versions of Swing. The main thing I like about SWT java apps vs swing apps is that they fit into the rest of the desktop better. Another benefit is that SWT and gcj fit well together. Again the benefit here isn't so much speed as reduced download sizes for those users who don't have java installed already.
Re:Trig functions... by happyfrogcow · 2004-01-09 04:22 · Score: 3, Informative

SWT works by mapping everything to native OS widgets if possible

Isn't that what AWT tried to do originally? I'm just delving into Java for the first time the last few months, but I thought I've read this in "Core Java, Vol. 1"

They say (pg. 236 "Core Java, Vol. 1) that this resulted in a "write once, debug everywhere" problem since you will have different behavior, different limitations and different bugs on each implementation of AWT on each platform
Re:Trig functions... by dnoyeb · 2004-01-09 04:23 · Score: 4, Interesting

Just because Swing is slow does not make it crappy. It meets nicely what it was designed to do. I use swing applications all the time. Today we have 1GHz processors, its not even an issue any longer, but it wont be allowed to die...

Eclipse is nice, I love eclipse. But I dont mistake it as a Swing replacement. AWT has a purpose, as does Swing and SWT, they are all different.

I believe AWT should be as fast as SWT because its also natively implemented.
Re:Trig functions... by Darren+Winsper · 2004-01-09 04:51 · Score: 3, Informative

Actually, I'm going to complain because you are completely wrong. Sun complained about Microsoft changing the language in a way that was incompatible with everybody else's implementation.
Re:Trig functions... by Abcd1234 · 2004-01-09 05:14 · Score: 4, Informative

Sorry, dude, but SWT is nowhere *near* as complete as Swing, in terms of functionality. I know, I've tried to use it. Basically, because SWT was designed more or less specifically with Eclipse in mind, it has massive gaps in it's APIs (for example, the imaging model is *severely* lacking). Worse, it's difficult to deploy, and even more difficult to use, as the documentation is remarkably incomplete. So, as much as I hate to say it, SWT simply can't replace Swing right now, and I don't expect it to any time soon.
Re:Trig functions... by William+Tanksley · 2004-01-09 05:23 · Score: 4, Informative

Enumorators? Reflection?

I'm only a beginner in C# and Java, but I know both have reflection, and the proposed Java 1.5 has enums. Kudos to C# for having them first :-), but Java 1.5 has them better, as first-class objects.

Also .net/IIS is a better platform for webdevelopment.

Better for whom? Why? Doesn't it have the severe shortcoming of platform lockdown?

I can write a c#.net app in 1/4th the code of a java one. Go take a look at Microsoft's petshop program if you do not believe me.

I can write an assembly app in 1/4 the code of a Python one. Assuming, of course, that the Python app wasn't written for small code size... The simile is very accurate; Sun didn't write their petshop for small size.

The Java Petshop reimplementation here spanks both Sun's and Microsoft's petshop in terms of size, and pretty clearly demonstrates that both languages could do better.

BTW, I absolutely love C# -- from what I've done with it so far. My only complaint is that its support is at best halfhearted for other platforms, and I will not allow my work to be tied down to one platform. This is the only thing that kept me from learning K (well, K is portable, the only problem is that it's only available from one vendor, Kx systems). Anyhow, I think C#'s bytecode is far beyond anything Sun's ever going to do with Java.

ALso WIndows2k3 is as stable as Linux now. NT4 is old. The situation has improved dramatically. I have never even seen a blue screen on windows2k yet!

I agree with all of that, but it's not enough. I have seen blue screens and system crashes on 2000 and XP (XP far, FAR FAR more often than 2000). But then I've seen system crashes on Linux, so I'm not just complaining about MS ;-).

-Billy
Re:Trig functions... by shemnon · 2004-01-09 05:37 · Score: 3, Insightful

First off, SWT only performes well on windows, and stack on top of that that the principal native abstractions are taylored to a win32 environment. Based off of that it is easy to see how SWT performes quite nicely on Windows.

Elsewhere it sucks. MacOS, GTK, photon, Motif. Even porrly writeen swing programs outperform on those platforms.

But back to your FUD. Yes, bad programmers make ugly and poor performing GUI code. Swing is no different in that regard. But have you looked at recent swing programs in the 1.4.2 version of the JDK? Tried stuff like CleverCactus (a mail client)? Synced your MP3s on a new RIO? Used Yahoo's Site Builder to make a web site? There are excelent swing progams out there. Many you probobly don't realize are java swing apps!

But since SWT is only in early adopter land we haven't seen the real dogs of GUIs it can make yet, especially since you have to do such arcane and ancient tasks in SWT as managing your own event queue! :( Give the same bad programmer SWT and you won't get a bad GUI instead you will get a non-fucntioning GUI.

--
--Shemnon
Re:Trig functions... by Pieroxy · 2004-01-09 06:03 · Score: 3, Informative

You are so wrong, that it is a shame you posted this one!

AWT has native widgets: Combo box, menu, button, text area, input box, checkbox, etc... Not only primitives.

What you are describing is Swing, not AWT.

Swing relies on the most basic AWT features: Component/Container and drawImage, and re-implement the whole widget sets in Java, relying on these two AWT components.

--
Write boring code, not shiny code!
Re:Trig functions... by be-fan · 2004-01-09 06:23 · Score: 3, Interesting

Actually, this is a myth. Modern GC methods do not introduce a huge amount of overhead, when you take into account a few things:

- Algorithms for allocating memory in a manual management scheme can be quite complicated. Look up on how large glibc's memory allocator is. Memory allocation algorithms for GCs tend to be much simpler, often as simple as a simple pointer increment.

- Deallocation algorithms for manual memory management are often even more complicated than the allocation algorithms. They are nearly always slower. Plus, objects are deallocated one at a time. Deallocation algorithms for GC can be simpler, but most importantly, the GC can deallocate large numbers of objects at once. This is, of course, more efficient.

- Copying GCs can compact holes in memory, which makes for better cache utilization.

Depending on the problem at hand, a GC can be a little slower or about the same. For a functional programming style (which has a particular pattern of memory usage) GC is usually faster than manual management. For programs that tend to operate in phases, allocating large numbers of objects gradually and freeing them at once, GC will also be faster.

The real problem with GC is that it affects latency. Modern GCs only freeze the app for a fraction of a second, but that's a large amount of time for something like a game or movie. There are some work arounds for this, though. Latency-sensitive apps can disable GC and use manual memory management. Or, they can use a real-time garbage collector, which has guaranteed latency, but does incur a large fixed overhead. Of course, these problems can be worked around, as evidenced by the fact that major PS2 games like Jak and Daxter were written in a GC'ed language (Common Lisp). Most proponents of GC will tell you that such work-arounds are a good deal easier than hunting down memory leaks and dangling pointers!

--
A deep unwavering belief is a sure sign you're missing something...
Re:Trig functions... by Nurgled · 2004-01-09 06:40 · Score: 4, Insightful

Faster processors should enable us to achieve more, not achieve the same old stuff much less efficiently.
Re:Trig functions... by Skjellifetti · 2004-01-09 06:59 · Score: 3, Informative

I seem to recall that if the platform doesn't have a widget that SWT needs, it uses its own implementation.

Or just doesn't bother implementing it at all. Try printing from eclipse on Linux.

--
FreeSpeech.org

Accurate? by Nadsat · 2004-01-09 03:31 · Score: 5, Interesting

Not sure of the accuracy. Benchmark is on a loop:

32-bit integer math: using a 32-bit integer loop counter and 32-bit integer operands, alternate among the four arithmetic functions while working through a loop from one to one billion. That is, calculate the following (while discarding any remainders)....

It also relies on the strength of the compiler, not just the strength of the language.

--

The Custom Mary

Why did VB do so bad on IO. by nberardi · 2004-01-09 03:31 · Score: 4, Interesting

Why did VB do so bad on IO compared to the other .Net benchmarks? They were pretty much equal up until the IO benchmarks? Any chance of getting the code published that was used to test this?

Re:Why did VB do so bad on IO. by enkafan · 2004-01-09 03:48 · Score: 5, Informative

Because the guy who wrote the code decided to use the VB6 compatability features instead of the .NET runtime for VB. Why one would do this, I have no idea.
Re:Why did VB do so bad on IO. by pr0c · 2004-01-09 04:25 · Score: 4, Informative

VB.net compiles into the same thing as c#, it SHOULD have benchmarked the exact same and so there would have been no point.

Re:Wow by finkployd · 2004-01-09 03:33 · Score: 3, Interesting

Well, for performance it does. For cross platform compilation it rocks the house. If you really want performance you need to be using something like Intel's C compiler (which oddly was not tested)

Finkployd

Java Performing worse then C by ViolentGreen · 2004-01-09 03:34 · Score: 4, Insightful

I conducted my own tests pitting Java 1.4 against gcc 3.3 and icc 8.0 using his benchmark code, and found Java to perform significantly worse than C on Linux/Athlon.

Why is this a suprise? C has been most commonly used for so long because of it's speed and efficiency. I think anyone who has done much work with either developing or running large scale java programs knows that speed can definitely be an issue.

--
Not everything is analogous to cars. Car analogies rarely work.

Re:Java Performing worse then C by Kingpin · 2004-01-09 03:40 · Score: 4, Insightful

All that matters to anti-Java zealots is speed. The list of benefits coming from using Java is too long to take the speed-only view seriously.

--
Unable to read configuration file '/bigassraid/htdig//conf/14229.conf'
Geocrawler error message.
Re:Java Performing worse then C by finkployd · 2004-01-09 03:46 · Score: 3, Insightful

Not always though, I think the thing people neglect to consider is that there are times when performance and scale are important enough that the benefits of Java do NOT outweigh C, and vice versa.

I feel sad for someone who only has enough room in their world for one computer language.

Finkployd

Under Windows... by Ianoo · 2004-01-09 03:34 · Score: 3, Insightful

I see once again that Eugenia (a supposed pro-Linux pro-BeOS person who doesn't use Windows) has done all her benchmarks [i]under[/i] Windows. I have a feeling that Python would perform a lot better if it was running in a proper POSIX environment (linked against Linux's libraries instead of the Cygwin libs). Probably the C code compiled with GCC would perform a fair bit better too.

Re:Under Windows... by MBCook · 2004-01-09 03:59 · Score: 4, Insightful

Because as we all know VC++ and the other Microsoft languages are so widly available for Linux/BeOS. I'm sorry but your comment is pure troll. It would be interesting to have things like GCC under Linux on the same computer there too, but you can't compare Microsoft's .NET to anything under Linux, because .NET doesn't run under Linux (I know about Mono, but that isn't MS's runtime).

--
Comment forecast: Bits of genius surrounded by a sea of mediocrity.
Re:Under Windows... by Umrick · 2004-01-09 04:03 · Score: 3, Interesting

Actually, I'm quite comfortable with the performance numbers Python turned in. I use Python quite a bit, and for the things the benchmark was run on, it's the kind of area I'd find looking for bottlenecks, and in turn implement in C or C++.

Python's huge win is not in speed, but in the ability to express the program in a very concise and easy to understand way.

The fact that Psyco can provide huge speed ups via a simple import is just icing.
Re:Under Windows... by Waffle+Iron · 2004-01-09 04:55 · Score: 3, Insightful

But these where low-level numeric benchmarks. Except for the I/O one, they wouldn't have changed due to linking against different libraries.
The review article is /.ed now, but from the test names on the summary table it looks like the tests are indeed mostly numeric. Unfortunately, only a small minority of people make their living writing number crunching code.
For the vast majority of business and web-based apps, the bulk of operations involves string manipulation. If an app is compute intensive and not I/O or GUI bound, then the bottleneck is usually creating, modifying and destroying strings. Benchmarks on string handling would be more useful to most developers.
However, doing string manipulation benchmarks isn't so simple. There are at least four approaches to strings, and some languages let you pick any of these:
-- dangerous and very fast: using static buffers and in-place modifications like old-school C
-- somewhat safer and may be fast: using semiautomatic memory management with mutable strings, like C++/STL or C with glib's g_string
-- safer still: using totally automatic memory management with mutable strings, like Ruby or (IIRC) Perl
-- safest: using totally automatic memory management with immutable strings, like Java or Python
Of course, for each problem the algorithms would need to be structured differently to get the maximum possible speed in each of the above four methodologies.
Basically, for string-intensive code, claiming that Java is just as fast as C will always be a false statement if you compare C code written in the first dangerous style vs. Java, which is always written in the fourth and safest style. No matter what technical tricks the VM writers come up with, there is just no way that they'll be able to match C's ability essentially zero-overhead in-place buffer operations over and over in the same spot that stays loaded in the L1 cache. (Actually, you probably could write Java code that operates on raw character arrays, and it might approach the speed of C. But that would probably look even uglier than the C code.)
In the few cases that I've ported a string-intensive high-level-language algorithm to raw low-level C code with few or no mallocs (not a trivial task), I've gotten at least a 10X speedup on the CPU-bound tasks, and at least 10X less memory usage. (Note that I did those tests largely out of curiosity. For most applications, even a 10X speedup is rarely worth the increased development time, bug vulnerabilities or maintenence issues. My opinion is that if you have to write code like this, you should confine it to a C extension library to a high-level language like Python.)
I've found that STL can be faster or slower than Java, depending on how smart you are. It's very easy to inadvertently get C++ to thrash around with needless automatic data copying.
Languages like Perl and Python can be very competetive on string operations if you know how to use their libraries. By using the most powerful operations that work on the largest chunks of data at one time (Python's re.findall(), for example), you take advantage of the fact that the library call is mostly written in C. Bit-banging in a dynamic interpreted language is usually dog slow, as the Python numbers seem to show on the summary chart.
To sum it up, most people write apps whose performance can't be predicted by a few simple language benchmarks, because the way the app is written can affect the performance more than the language it's written in.

.NET Languages and IL by ClubStew · 2004-01-09 03:37 · Score: 3, Interesting

Why benchmark the various ".NET languages" (those languages whose compilers target the CLR)? Every compiler targeting the CLR produces Intermediate Languages, or more specifically MSIL. The only differences you'd find is in optimizations performed for each compiler, which usually aren't too much (like VB.NET allocates a local variable for the old "Function = ReturnValue" syntax whether you use it or not).

Look at the results for C# and J#. They are almost exactly the same, except for the IO which I highly doubt. Compiler optimizations could squeeze a few more ns or ms out of each procedure, but nothing like that. After all, it's the IL from the mscorlib.dll assembly that's doing most the work for both languages in exactly the same way (it's already compiled and won't differ in execution).

When are people going to get this? I know a lot of people that claim to be ".NET developers" but only know C# and don't realize that the clas libraries can be used by any languages targeting the CLR (and each has their shortcuts).

Would like to see... by CaptainAlbert · 2004-01-09 03:40 · Score: 3, Interesting

...some analysis of the code generated by Visual C++ and gcc side by side, particularly for those trig calls. If there's that great a discrepancy between the runtimes, that's a good clue that either one of the compilers is under-optimising (i.e. missing a trick), or the other is over-optimising (i.e. applying some transformation that only approximates what the answer should be). I didn't see any mention of the numerical results obtained being checked against what they ought to be (or even against each other).

As any games/DSP programmer will tell you, there are a million ways to speed up trig providing that you don't *really* care after 6dps or so.

OK, maybe I'm just bitter because I was expecting gcc 3.1 to wipe the floor. :)

--
These sigs are more interesting tha

this is just so bogus by rpeppe · 2004-01-09 03:42 · Score: 4, Insightful

Benchmark code like this does not represent how these languages are used in practice. Idiomatic Java code tends to be full of dynamic classes and indirection galore. Just testing "arithmetic and trigonometric functions [...] and [...] simple file I/O" is not going to tell you anything about how fast these languages are in the real world.

What about coder's performance? by G4from128k · 2004-01-09 03:43 · Score: 3, Interesting

Given the ever accelerating clockspeed of processors, is the raw performance of langauges that big an issue? Except for CPU-intensive programs (3-D games, high-end video/audio editing), current CPUs offer more than enough horsepower to handle any application. (Even 5-year old CPUs handle almost every task with adequate speed). Thus, code performance is not a big issue for most people.

On the other hand, the time and cost required by the coder is a bigger issue (unless you outsource to India). I would assume that some languages are just easier to design for, easier to write in, and easier to debug. Which of these langauges offers the fastest time to "bug-free" completion for applications of various sizes?

--
Two wrongs don't make a right, but three lefts do.

Re:What about coder's performance? by Dalroth · 2004-01-09 04:26 · Score: 4, Insightful

Raw performance will ALWAYS be an issue. If you can handle 100,000 hits per day on the same hardware that I can handle 1,000,000 (and these are not made up numbers, we see this kind of discrepency in web applications all the time), then I clearly will be able to do MORE business than you and do it cheaper. That gives me a competitive advantage from now till the end of time. If you throw more hardware at the problem, well, so can I and I'll still be ahead of you.

Performance realities do not go away, no matter how much we may wish they would. Now, does that mean you're going to go write major portions of your web application in assembly to speed it up? No, probably not. But your database vendor may very well use some tricks like that to speed up the key parts of their database. You sink or swim by your database, so don't say it doesn't matter because it absolutely does.

Anyway, in my day-to-day operations, I can think of quite a few things that get compiled directly to executable code even though they don't have to be. Why would you do this if performance wasn't an issue and we could just throw more hardware at it?

1. Regular expressions in the .NET environment are compiled down to executable code, then executed.

2. XSL transformations in the .NET environment are compiled to a form of executable code (I don't think it's actual .NET byte code, but it may be) and then executed.

3. The XmlSerializer classes creates a special compiled executable specifically created to serialize objects into XML (byte code!!).

And the list just goes on and all of this eventually ends up getting JITed as well. My pages are 100% XML based, go through many transformation steps to get to where they need to be, and on average render in about 70-100ms (depending upon the amount of database calls I need to make and the size of the data). This all happens without spiking our CPU utilization to extreme levels. There is *NO WAY* I could've done this on our hardware if nobody cared about performance.

As always, a good design is the most important factor. But a good design that performs well will always be superior to one that doesn't.

Bryan

Speed or accuracy? by derek_farn · 2004-01-09 03:43 · Score: 5, Interesting

The Java performance is best explained by an article by Prof Kahan: "How JAVA's Floating-Point Hurts Everyone Everywhere" http://www.cs.berkeley.edu/~wkahan/JAVAhurt.pdf also see "Marketing vs. Mathematics" http://www.cs.berkeley.edu/~wkahan/MktgMath.pdf I suspect the relatively poor floating-point performance of gcc is also caused by the desire to acheive accurate results.

Alternative comparison, compiler shootout by SiW · 2004-01-09 03:44 · Score: 5, Informative

Don't forget about the Win32 Compiler Shootout

About the Python performance by ultrabot · 2004-01-09 03:45 · Score: 5, Insightful

Note that Python is pretty easy to extend in C/C++, so that speed critical parts can be rewritten in C if the performance becomes an issue. Writing the whole program in C or C++ is a premature optimization.

--
Save your wrists today - switch to Dvorak

Re:Sitting on a Benchmark by Mathetes · 2004-01-09 03:47 · Score: 3, Informative

Oh wait! C# only runs on one operating system. Can you name any other development languages that only run on ONE OS, boys and girls? Neither can I.

Ximian's Mono has a C# compiler for open OS's:

http://www.go-mono.com/c-sharp.html

Re:They should benchmark development time by ultrabot · 2004-01-09 03:50 · Score: 5, Interesting

Someone should do a study on the time taken to design, implement and debug a resonably complex chunk of code under C++ and Java. I'm pretty sure that the result would show the huge advanatage of Java over C++.

The difference b/w Java and C++ would be dwarfed by the difference b/w Java and Python. Java may be 30-40% more productive than C++, but Python is 1000% more productive than Java. And yes, this applies to larger projects. J2EE may come to its own w/ projects that have hundreds of mediocre programmers, but if you have a mid-size team of highly skilled developers creating something new & unique (something like Zope or Chandler), Python will trounce the competition.

--
Save your wrists today - switch to Dvorak

Read the OSNews thread by be-fan · 2004-01-09 03:50 · Score: 5, Insightful

There were a number of problems with this benchmark, which are addressed in the OSNews thread about the article.

Namely:

- They only test a highly specific case of small numeric loops that is pretty much the best-case scenario for a JIT compiler.

- They don't test anything higher level, like method calls, object allocation, etc.

Concluding "oh, Java is as fast as C++" from these benchmarks would be unwise. You could conclude that Java is as fast as C++ for short numeric loops, of course, but that would be a different bag of cats entirely.

--
A deep unwavering belief is a sure sign you're missing something...

Quoting the results section here... by Jugalator · 2004-01-09 03:50 · Score: 5, Informative

Site was showing signs of Slashdotting, so I'll quote one of the more important sections... Results Here are the benchmark results presented in both table and graph form. The Python and Python/Psyco results are excluded from the graph since the large numbers throw off the graph's scale and render the other results illegible. All scores are given in seconds; lower is better. int long double trig I/O TOTAL Visual C++ 9.6 18.8 6.4 3.5 10.5 48.8 Visual C# 9.7 23.9 17.7 4.1 9.9 65.3 gcc C 9.8 28.8 9.5 14.9 10.0 73.0 Visual Basic 9.8 23.7 17.7 4.1 30.7 85.9 Visual J# 9.6 23.9 17.5 4.2 35.1 90.4 Java 1.3.1 14.5 29.6 19.0 22.1 12.3 97.6 Java 1.4.2 9.3 20.2 6.5 57.1 10.1 103.1 Python/Psyco 29.7 615.4 100.4 13.1 10.5 769.1 Python 322.4 891.9 405.7 47.1 11.9 1679.0

--
Beware: In C++, your friends can see your privates!

Re:Wow by thoolihan · 2004-01-09 03:51 · Score: 3, Insightful

Keep in mind too that these benchmarks were all run on windows. I think gcc plays a lot nicer with glibc compared to the windows native libraries. Also, as pointed out, it's about being portable, not the most optimized compiler.

-t

--
http://unmoldable.com W:"No one of consequence" I:"I must know" W:"Get used to disappointment"

IBM Java by PourYourselfSomeTea · 2004-01-09 03:52 · Score: 3, Interesting

Using the IBM Java VM, I've been able to achieve consistently cutting my runtimes in half over the Sun VM. Anyone currently using the Sun VM for production work should test the IBM one and consider the switch.

My application that I benchmarked is data and network and memory intensive, although not math intensive, so that's what I can speak for. We consistently use 2 GB of main memory and pump a total of 2.5 TB (yes, TB) of data (doing a whole buch of AI style work inside the app itself) through the application over it's life cycle, and we cut our total runtime from 6 days to 2.8 days by switching to the IBM VM.

Not testing languages by xtheunknown · 2004-01-09 03:52 · Score: 3, Insightful

You are not testing the languages, you are testing the compilers. If you test a language with a crummy compiler (gcc sucks compared to commercial optimized C++ compilers) you will think the language is slow, when in fact, the compiler just sucks. The only valid comparisons that can be made are same language, different compilers.

--

They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety.

wrong questions by ajagci · 2004-01-09 03:53 · Score: 4, Insightful

The Java JIT has been comparable to C in performance for many years on certain microbenchmarks. But Java remains a "slow language". Why?

The design of the Java language and the Java libraries means that enormous numbers of calls are made to the memory allocator in idiomatic Java.
The Java language has several serious limitations, such as the lack of true multidimensional arrays and the lack of value classes.

So, yes, you can construct programs, even some useful compute intensive programs, that perform as well or better on Java than they do in C. But that still doesn't make Java suitable for high-performance computing or building efficient software.

Benchmarks like the one published by OSnews don't test for these limitations. Microbenchmarks like those are still useful: if a language doesn't do well on them, that tells you that it is unsuitable for certain work; for example, based on those microbenchmarks alone, Python is unlikely to be a good language for Fortran-style numerical computing. But those kinds of microbenchmarks are so limited that they give you no guarantees that an implementation is going to be suitable for any real-world programming even if the implementation performs well on all the microbenchmarks.

I suggest you go through the following exercise: write a complex number class, then write an FFT using that complex number class, "void fft(Complex array[])", and then benchmark the resulting code. C, C++, and C# all will perform reasonably well. In Java, on the other hand, you will have to perform memory allocations for every complex number you generate during the computation.

Less simple benchmarks by DuSTman31 · 2004-01-09 03:53 · Score: 4, Insightful

The optimisers in sun's Java VM work on run-time profiling - they identify the most run sections of code and use the more elaborate optimisation steps on these segments alone.

Benchmarks that consist of one small loop will do very well under this scheme, as the critical loop will get all of the optimisation effort, but I suspect that in programs where the CPU time is more distributed over many code sections, this scheme will perform less well.

C doesn't have the benefit of this run-time profiling to aid in optimising critical sections, but it can more afford to apply its optimisations across the entire codebase.

I'd be interested to see results of a benchmark of code where CPU time is more distributed..

Re:Wow by be-fan · 2004-01-09 03:53 · Score: 5, Interesting

According to these benchmarks it doesn't.

The short of it is that GCC 3.2.1 is highly competitive with ICC 7.0, except for two cases:

FP-intensive code on the Pentium 4
Code that allows Intel C++ to auto-generate SSE vector code for it

--
A deep unwavering belief is a sure sign you're missing something...

Python Longs are arbitrary precision! by PommeFritz · 2004-01-09 03:56 · Score: 4, Informative

The Python 'long' type is not a machine type such as a 32 or 64 or perhaps even 128 bit integer/long.
It is an arbitrary precision decimal type! That's why Python's scores on the Long test are so much higher (slower) than the other languages.
I wonder what Java scores when the benchmark is reimplemented using BigDecimal instead of the 'long' machine type.
Python uses a highly efficient Karatsuba multiplication algorithm for its longs (although that only starts to kick in with very big numbers).

Re:Wow by LinuxInDallas · 2004-01-09 03:56 · Score: 4, Interesting

There was an interesting article in Dr Dobb's a few months back. They did a performace (C++) comparison of 6 or so compilers, gcc included. The end result was that performace wise (execution AND code size) gcc came in last place in all their testing. However, gcc did win when it came to conformance to the C++ standard as it was the only compiler that supported all the language features.

Re:Speed? No. Stability. Yes by Isochrome · 2004-01-09 04:04 · Score: 4, Insightful

OK, Speed does matter a lot.

But what about type safety? Java has no generic typed containers, like the STL. This means you tend to find errors at runtime instead of at compile time.

I need to know that my code is as safe as possible. I don't want a user to find a bug because my hand tests didn't get 100% code coverage every time.

And how about predictable performance. I would much rather know that this function will tak 200ms all of the time instead of 100ms most of the time a 10 s due to garbage collection occasionally.

Language performance arguments miss the point by tizzyD · 2004-01-09 04:07 · Score: 5, Insightful

Consider what was done years ago with assembly. The performance was incredible, and the amount of superfluous garbage in the code was minimal. Hey, if you wrote the assembly, why would you spend time putting it in?

Then, with more and more languages, especially ones with VMs, you get further and further away from the hardware. The end result: you lose performance. It does more and more for you, but at the expense of real optimizations, the kind that only you can do.

Now the zealots will come out and say, "Language X is better than language Y, see!" To me this argument is boring. I tend to use the appropriate tool for the job. So:

Python for scripts, prototypes, proofs of concept, or components where performance generally is not an issue.
For desktop apps, Visual Basic (yep, most IT apps are in VB). There is no justifiable reason for an IT department group to write a sales force reporting system in C++! If you want C++, go get a job at a software company. Stop wasting money and time making yourself feel like a hotshot. [I'd consider Kylix here if it was based on Basic. Why? Because honestly, Pascal is just about dead, and Basic is the king of the simple app. Let's just live with it and move on. I do want a cross-platform VB . . . ]
For web apps, well, I stick around PHP/ASP.NET. Why? Portability! And moreover, the sticking point in a web-based app is not the UI layer; it's usually the underlying data extraction and formatting. Don't waste your time with lower level languages there. IMHO it's just not worth it. JSP and Java stuff, yuck! Too much time for too little bang.
Java/C# (also consider mono/LISP for most production apps. Why? Portability! I want no vendor holding me by the balls. I want platform independence on the back end, and these are the few ways to achieve it. I'd include Haskell/OCAML here when appropriate. Perl? I'm loathe to use Perl as production, considering most Perl code cannot be understood 2 weeks after it's written. I'd rather take the hit in performance and be able to pass the code to someone else later.
C++/C for components--just components--where performance is at an absolute premium or there exists some critical library that only has this kind of interface. But this step has to be justified by the team, with considerable explanation why a different architecture could not suffice. Otherwise, the team could waste time checking for dangling pointers when instead it could be doing other things, like finishing up other projects.
Assembly? Only when there is not a C complier around. Embedded stuff. Nowadays, you just do not have the time to play.

Yes, my teams use many languages, but they also put their effort to where they get the biggest bang for the buck. And in any business approach, that's the key goal. You don't see carpenters use saws to hammer in nails or drive screws. Wise up!

--
...tizzyd

Re:They should benchmark development time by ultrabot · 2004-01-09 04:07 · Score: 4, Insightful

Productive for you now ... but what about 6 months down the road? What if you want to realize your product to the world, how hard is it to extend it?

The advantages over Java are even increased 6 months down the road. Python code is much more readable and maintainable, hence easier to extend. Dynamically typed object model scales incredibly well.

I used to think the same about Perl vs Java, until I started looking at frameworks like Cocoon and they're all written in Java.

Comparing Perl to Java is foolish, Perl is more like Awk than a general purpose programming language, and not meant for large projects at all.

--
Save your wrists today - switch to Dvorak

trig calls in gcc by ajagci · 2004-01-09 04:11 · Score: 5, Informative

The Pentium trig instructions are not IEEE compliant (they don't return the correct values for large magnitude arguments). gcc errs on the side of caution and generates slow, software-based wrappers that correct for the limitations of the Pentium instructions by default. Other compilers (e.g., Intel and probably Microsoft) just generate the in-line instructions with no correction. When you look at the claimed superiority of other compilers over gcc, it is usually such tradeoffs that make gcc appear slower.

You can enable inline trig functions in gcc as well, either with a command line flag, or an include file, or by using "asm" statements on a case-by-case basis. Check the documentation. With those enabled, gcc keeps up well with other compilers on trig functions.

Re:trig calls in gcc by greg_barton · 2004-01-09 05:17 · Score: 3, Insightful

On top of this, the Java/Linux VM from Sun (and most other Java/Linux VMs) is compiled using gcc. I don't know for sure, but I wouldn't be suprised if Sun erred on the side of correctness and standards compliance, so I'll bet they compiled the VM without inlining any trig function calls.

Windows a good choice for this test by nberardi · 2004-01-09 04:21 · Score: 3, Insightful

Windows was a good choice for this test, because many of the development languages that were used in this test aren't really mature enough in *nix. (i.e. .Net languages and arguably Java) A better test would be doing both tests on both OS's, because GCC is really more optimized twords Linux, while VC++ is more optimized twords Windows. I would have rather seen VC++ vs. Borderland C++, because that is a more real world business example.

These kind of benchmarks are so 1970s by Junks+Jerzey · 2004-01-09 04:22 · Score: 3, Insightful

It is amusing that the obsession with raw speed never goes away, even though computers have gotten thousands of times faster since the the days of the original wisdom about how one shouldn't be obsessed with speed. Programmers put down Visual Basic as slow when it was an interpreted language running on a 66MHz 486. It was still put down as slow when it shared the same machine code generating back-end as Visual C++ running on a 3GHz Pentium 4. And still some people--usually people with little commercial experience--continue to insist that speed is everything.

Here's a bombshell: if you have a nice language, and that language doesn't have any hugely glaring drawbacks (such as simple benchmarks filling up hundreds of megabytes of memory), then don't worry about speed. From past experience, I've found it's usually easy to start with what someone considers to be a fast C or C++ program. Then I write a naive version in Python or another language I like. And guess what? My version will be 100x slower. Sometimes this is irrelevant. 100x slower than a couple of microseconds doesn't matter. Other times it does matter. But it usually isn't important to be anywhere near as fast as C, just to speed up the simpler, cleaner Python version by 2-20x. This can usually be done by fiddling around a bit, using a little finesse, trying different approaches. It's all very easy to do, and one of the great secrets is that high-level optimization is a lot of fun and more rewarding than assembly level optimization, because the rewards are so much greater.

This is mostly undiscovered territory, but I found one interesting link.

Note that I'm not talking about diddly high-level tasks in language like Python, but even things like image processing. It doesn't matter. Sticking to C and C++ for performance reasons, even though you know there are better languages out there, is a backward way of thinking.

Slashdotted by ReadParse · 2004-01-09 04:57 · Score: 4, Funny

They should have written their site in one of the higher-performing languages.

RP

Performance not important? Umm , not quite... by Viol8 · 2004-01-09 04:58 · Score: 3, Interesting

I was a bit surprised by this quote in the article:

"Even if C did still enjoy its traditional performance advantage, there are very few cases (I'm hard pressed to come up with a single example from my work) where performance should be the sole criterion when picking a programming language. I"

I can only assume from this that he has never done or known anyone who has done any realtime programming. If you're going to write something
like a car engine management system performance is the ONLY critiria, hence a lot of these sorts of systems are still hand coded in assembler , never
mind C.

Cost of Hardware vs. Cost of wetware by G4from128k · 2004-01-09 05:04 · Score: 4, Insightful

Raw performance will ALWAYS be an issue. If you can handle 100,000 hits per day on the same hardware that I can handle 1,000,000 (and these are not made up numbers, we see this kind of discrepency in web applications all the time), then I clearly will be able to do MORE business than you and do it cheaper.

You raise excellent points. For many enterprise and server applications, performance is an issue. But I never said one should care nothing abut performance, only that in many applications the cost of the coder also impacts financial results.

For the price of one software engineer for a year (call it 50k to 100k burdened labor rate), I can buy between 20 to 100 new PCs (at $1000 to $3000 each). If the programmer is more expensive or the machines are less expensive, then the issue is even more in favor of worring about coder performance.

The trade-off between the hardware cost of the code and the wetware cost is not obvious in every case. A small firm that can double its server capacity for less than the price of a coder. or the creators of an infrequently-used application may not need high performance. On the other hand, a large software seller that sells core performance apps might worry more about speed. My only point is that ignoring the cost of the coder is wrong.

These different languages create a choice of whether to throw more hardware at a problem or throw more coders at the problem.

--
Two wrongs don't make a right, but three lefts do.

Re:They should benchmark development time by ultrabot · 2004-01-09 05:09 · Score: 3, Funny

Ummm.. Slashdot is written in Perl, as are many other large projects. I've yet to see anything like Slashdot written in Awk.

I heard there was a vote b/w Perl, Awk, Intercal and sed, and Perl won by a narrow margin.

--
Save your wrists today - switch to Dvorak

Not a fair test - Frame Pointers by Compenguin · 2004-01-09 05:16 · Score: 3, Insightful

His benchmark isn't fair, he's omitting the fame pointer on VC++ but not gcc. How is that fair?

Consider the logic... by tizzyD · 2004-01-09 05:19 · Score: 3, Insightful

But that's what you know so well. I bet if I took a team of VB programmers who knew VB as well as you know VC++, and then challenged them to produce a series of business oriented apps, in most cases you'd lose. How many VC++ programmers does it take to create a business app? 2 * VB Programmers usually -OR- how long does it take to create and debug a business app in VC++? 2 * VB time. ;-) JK!

Guido van Rossum noted in an interview the following statistic, and I think it bears considerably on appropriateness:

This [ed: these stats] is all very informal, but I heard someone say a good programmer can reasonably maintain about 20,000 lines of code. Whether that is 20,000 lines of assembler, C, or some high-level language doesn't matter. It's still 20,000 lines. If your language requires fewer lines to express the same ideas, you can spend more time on stuff that otherwise would go beyond those 20,000 lines.
A 20,000-line Python program would probably be a 100,000-line Java or C++ program. It might be a 200,000-line C program, because C offers you even less structure. Looking for a bug or making a systematic change is much more work in a 100,000-line program than in a 20,000-line program. For smaller scales, it works in the same way. A 500-line program feels much different than a 10,000-line program

So then, unless you quantify the types of apps you build, the team you use, and the results that are expected, my experience has shown me that most of the time, for business apps, it's overkill. Now, if you're in a dev team at a software company, well then, I could consider the other side.

--
...tizzyd

Coming in 1.5, but you can do this now by SuperKendall · 2004-01-09 05:27 · Score: 4, Insightful

That's a feature built into Java 1.5, but you can get a test reference implementation which is about 96% of the features now to try it out. It has a really clean syntax and provides the benefit you seek.

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

Tried this with gcj 3.2, here are the results by BreadMan · 2004-01-09 05:39 · Score: 3, Informative

Compiled with gcj Benchmark.java --main=Benchmark -o benchmark, compiled other program with the same optimization level.

Comparison against gcc, gcj and Java 1.4.1 on the same host:

.......... gcc........gcj.......java int.......28,700.....35 ,386....22,953 double....30,000.....73,504....27, 529 long......61,520.....61,729....68,914 trig.. ....11,020....112,497...176,354 io.........1,930. ....16,533....11,297

I was somwhat surprised on the difference in the trig tests, as both appear to use libm. Not surprised that the IO was slower, the Java IO classes are nifty but do add quite a bit of overhead compared fputs/fgets.

(Sorry about the formatting, it was the best I could do)

I just sped the Python version by 7x and 1.5x by b0rken · 2004-01-09 05:40 · Score: 5, Interesting

IMO this benchmark is nonsense, and the way the Python code is written is even worse. I looked at the "trig" and I/O benchmarks. In the i/o benchmark, the output is assembled in the stupidest way possible:

linesToWrite = [myString] for i in range(ioMax - 1): linesToWrite.append(myString)

Changing this to 'linesToWrite = [myString] * ioMax' dropped time on my system from 2830ms to 1780ms (I'd like to note that I/O on my system was already much faster than his *best* I/O score, thank you very much Linux)

In the trig test, I used numarray to decrease the runtime from 47660.0ms to *6430.0ms*. The original timing matches his pretty closely, which means that numarray would probably beat his gcc timings handily, too. Any time you're working with a billion numbers in Python, it's a safe bet that you should probably use numarray!

I didn't immediately see how to translate his other mathematical tests into numarray, but I noted that his textual explanation in the article doesn't match the (python) source code!

(My system is a 2.4GHz Pentium IV running RedHat 9)

--
Hate stupid software on freshmeat? Laugh at

Python Benchmark by SlightOverdose · 2004-01-09 05:40 · Score: 4, Informative

The windows version of Python is much slower. Testing with Python2.3 + psyco on a 2.4ghz p4 running Linux 2.4.20 yeilds impressive results

$ python -O Benchmark.py
Int arithmetic elapsed time: 13700.0 ms with
Trig elapsed time: 8160.0 ms

$ java Benchmark
Int arithmetic elapsed time: 13775 ms

$ java -server Benchmark
Int arithmetic elapsed time: 9807 ms

(n.b. this is only a small subset of the tests- I didn't feel like waiting. Trig was not run for java because it took forever.)

To dismiss a few common myths...
1) Python IS compiled to bytecode on it's first run. The bytecode is stored on the filesystem in $PROGNAME.pyc.

2) the -O flag enables runtime optimization, not just faster loading time. On average you get a 10-20% speed boost.

3) Python is a string and list manipulation language, not a math language. It does so significantly faster than your average C coder could do so, with a hell of a lot less effort.

.NET benchmark flawed. It is faster. by Anonymous Coward · 2004-01-09 05:46 · Score: 3, Insightful

I see just one small issue with the benchmarks. Microsoft claims, that all .NET languages are compilled at the runtime. This means, that the first pass of the execution through the function has a compile time added on top of the execution, which falsifies somewhat the .NET execution time benchmark. I did some simple tests that confirm this. To my surprise, .NET languages are actually faster than Visual C++, Borland C++ or GNU C+ for a simple 1/n series calculation without visible loss of accuracy. Don't ask me how it is possible. I don't know, but it is a fact that my benchmark shows. My best guess would be that the just in time compiler is better in getting code optimized for the CPU in the particular machine it runs or maybe it is better in filling the cache. The key of the benchmark is to write software in such a way that it runs through the function at least two times. The first time it runs just to allow just in time compiler to compile the code and then it runs subsequent times to measure performance. Below is the schematics of my benchmark:

double benchmark(int number_of_iterations);

void main (void)
{
Time start,end;
double outcome;

benchmark(1); // This is to allow .NET "just in time" compiler to compile the benchmark function

for(int i = 1; i < 11; ++i)
{
start = CurrentTime(); // CurrentTime is a placeholder here for a system time function in ticks
outcome = benchmark(i*1000000);
end = CurrentTime();
lprt (i,outcome,end-start); // lprt is a placeholder for a nice formatting print here
}
}

double benchmark (int number_of_iterations)
{
double s,t;

s = 0.0;
t = 1.0;

for(int i = 1; i < number_of_iterations; ++i)
{
s += 1.0/t; // This is the body of the benchmark
t += 1.0;
}
return (s);
}

As you can see above, I run the benchmark function once with counter of 1 and ignore its outcome before starting to measure time. The key is to allow compiler to compile the benchmarking function before running actual benchmark. Once it is done, I run then the benchmark 10 times for succesively larger counter from 1 billion to 10 billion and print number of iterations (in billions), the accuarcy and the time it takes to run. The idea here is that under the assumption that the benchmark time is related to number of iterations as a linear function I can easily find linear best fit function between number of cycles and run time in the form of

time = a * number_of_cycles + b

and then use value of a as a measurement of the benchmark. The value of b is good check, how the benchmark behaves. If it is large, then something went wrong. In my case it was always close to zero. I'm now away from my home computer and I don't have all the compilers, that were tested in this article, so I can't repeat those benchmarks modified to this method at the moment, but you guys might try to do it yourself.

Some people might challenge this by stating that the compile time for .NET is part of the execution time but I disagree. My position on this is, that in most real life cases the software runs into the particular functions many times thus creating long exectution times. It is rare, that a signle function call creates long exectuion time that is annoying to the user.

Best regards.

Delphi & Kilyx by the_mushroom_king · 2004-01-09 06:35 · Score: 3, Insightful

Why are the two best Borland languages never included in benchmarks? Maybe in just the odd-ball that doesn't use C++, Java, or Micro$oft. - TMK

Slashdot Mirror

Performance Benchmarks of Nine Languages

71 of 954 comments (clear)