High-level Languages and Speed

Old debate by overshoot · 2006-07-17 23:21 · Score: 5, Informative

Twenty years ago we were still in the midst of the "language wars" and this was a hot topic. The argument then, as now, was whether a high-level language could be compiled as efficiently as a low-level language like C [1].

Well, we ran our own tests. We took a sizable chunk of supposedly well-written time-critical code that the gang had produced in what was later to become Microsoft C [2] and rewrote the same modules in Logitech Modula-2. The upshot was that the M2 code was measurably faster, smaller, and on examination better optimized. Apparently the C compiler was handicapped by essentially having to figure out what the programmer meant with a long string of low-level expressions.

Extrapolations to today are left to the reader.

[1] I used to comment that C is not a high-level language, which would induce elevated blood pressure in C programmers. After working them up, I'd bet beer money on it -- and then trot out K&R, which contains the exact quote, "C is not a high-level language."
[2] MS originally relabled another company's C complier under license (I forget their name; they were an early object lesson.)

--
Lacking <sarcasm> tags, /. substitutes moderation as "Troll."

Re:Old debate by StarvingSE · 2006-07-17 23:34 · Score: 4, Insightful

C is not a low level language. If you're not directly manipulating the registers on the processor, you are not in a low level language (and forget about the "register" keyword, modern compilers just treat register variables in C/C++ as memory that needs to be optimized for speed).

If anything, C is a so-called mid level language. If it wasn't, you'd be using an assembler instead of a compiler.

--
I got nothin'
Re:Old debate by StrawberryFrog · 2006-07-17 23:49 · Score: 2, Informative

essentially ALL of the strongly typed and structure languages - have pretty much died out.

Uh, Java and C# are strongly typed and structured languages.

--
My Karma: ran over your Dogma
StrawberryFrog
Re:Old debate by CapnOats.com · 2006-07-18 00:07 · Score: 3, Informative

...trot out K&R, which contains the exact quote, "C is not a high-level language."

Actually the quote from my copy of K&R, on my desk beside me is,

C is not a "very high level" language...

emphasis is mine.
Re:Old debate by shreevatsa · 2006-07-18 00:19 · Score: 5, Informative

For what it's worth, at The Computer Language Shootout, OCaml does pretty well. Of course, C is still faster for most things (but note that the really high factors (29 and 281) are in OCaml's favour!), but OCaml is pretty fast compared to Java or Perl. Haskell does pretty well too. Functional programming, anyone?
Of course, these benchmarks measure only speed, are just for fun, and are "flawed", but they are still interesting to play with. If you haven't seen the site before, enjoy fiddling with things to try and get your favourite language on top :)
Re:Old debate by cerberusss · 2006-07-18 00:46 · Score: 2, Informative

It also says in the introduction (next page):
C is a relatively "low level" language.

--
8 of 13 people found this answer helpful. Did you?
Re:Old debate by bloodredsun · 2006-07-18 00:50 · Score: 4, Interesting

If I had mod points I'd certainly mod you informative. Those benchmarks might be synthetic and flawed but as a general illustration of how the various languages differ, that link is fantastic.

Of course I'll just use it for my own ends by convincing my managers that we're using the right languages - "Yes boss you'll see that we use C++ for the stuff that needs to be fast with low memory overhead, Java for the server side stuff, stay the fuck away from Ruby and if you say 'Web 2.0' at me one more time I'll be forced to wham you with a mallet!" ;-)
Re:Old debate by masklinn · 2006-07-18 01:11 · Score: 2, Informative

No they're not, they're statically typed but many languages exist with much stronger type systems (Ada, Modula2, Haskell).

--
"The way we can tell it's C# instead of Haskell is because it's nine lines instead of two." -- wadler
Re:Old debate by masklinn · 2006-07-18 01:14 · Score: 3, Interesting

Haskell also does very well, and Digital Mars' impressive D is consistently in the top spots (one wonders why the hell Soustrup is still trying to improve C++ when he could just switch to D and build from there)

--
"The way we can tell it's C# instead of Haskell is because it's nine lines instead of two." -- wadler
Re:Old debate by Anonymous Coward · 2006-07-18 01:16 · Score: 2, Interesting

The O'Caml examples are not functional, they're imperative. The best-performing Haskell examples are also written imperatively.
Re:Old debate by StarvingSE · 2006-07-18 01:18 · Score: 2, Insightful

Key word is "relatively." C is low level compared to languages such as Java and C#, which do a lot of things such as memory management for you.

--
I got nothin'
Re:Old debate by StarvingSE · 2006-07-18 01:30 · Score: 2, Interesting

Java is hardly going the way of a legacy language. It is heavily used in the business world for web applications, which are becoming much more popular, not less.

And although you call Cobol legacy, it really isn't. Many financial institutions still run applications written in cobol since it is too costly and risky to migrate the old code to a new language. Cobol was meant for the financial industry, and its probably there to stay. Colleges and universities are even starting to teach it again since it is in high demand in the job market right now (older cobol programmers are retiring).

--
I got nothin'
Re:Old debate by Bastian · 2006-07-18 01:35 · Score: 5, Insightful

The article addressed this point by mentioning that the definitions of high and low level language are a moving target. Nowadays I think most people consider assembly language to be its own thing, and the low-level classification has now been shifted into a domain that was once described completely by the term high-level. The term "high-level language" has been replaced by the term "programming language."

If you're going to go with the jargon as it's most often used nowadays (which is a perfectly reasonable thing to do), then C would certainly be about as low as you can get without manipulating individual registers - i.e., without being assembly language.
Re:Old debate by Bastian · 2006-07-18 01:43 · Score: 2, Informative

I'm pretty sure that over my C programming career I've managed (sometimes by accident, sometimes by misguided designs at creating a "clever hack") to cram data of every type into a bin that was reserved for every other type without the use of a cast. C is statically typed, but I wouldn't say it's strongly-typed at all.
Re:Old debate by Anonymous Coward · 2006-07-18 01:48 · Score: 2, Informative

And although you call Cobol legacy, it really isn't. Many financial institutions still run applications written in cobol since it is too costly and risky to migrate the old code to a new language.
Errr, well if they're no longer running it, you can't debate if it's legacy, can you? The code would be gone. Not to be an English nazi or anything, but as a word, legacy just means it was handed down from a predecessor, usually a different generation. I've always assumed this meaning carried over to programming as well.
Re:Old debate by Marcos+Eliziario · 2006-07-18 01:54 · Score: 3, Informative

Not really. Modern pipelined architectures make hard-written assembler slower than compiler generated. A human can't really deal with out-of-order execution.

--
Your ad could be here!
Re:Old debate by Megane · 2006-07-18 01:57 · Score: 2, Informative

Uh, K&R is slightly older than Java or C#... there was no such thing as memory management or virtual machines (as we know them today) back then.
Actually, there were virtual machines back then, just not on micros or minis.
And as far as this high-level/low-level thing goes, I'd call C a "mid-level" language.

--
#naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
Re:Old debate by jacksonj04 · 2006-07-18 01:58 · Score: 3, Insightful

Low level says what you want the system to do. High level says what you want the language (Via compiler, interpreter etc) to make the system do.

--
How many people can read hex if only you and dead people can read hex?
Re:Old debate by rainman_bc · 2006-07-18 01:58 · Score: 4, Interesting

stay the fuck away from Ruby

What's wrong with Ruby, as a replacement for a very ugly language called Perl?

Ruby is an elegant language, fully Object Oriented, and does just as well as Python and Perl...

Ruby On Rails OTOH is a different story and don't want to get into a flame war over it, but Ruby itself is pretty good for a lot of things you'd otherwise write in Perl but don't like the ugliness of Perl...

I've found some people don't get the distinction between Ruby and Ruby on Rails.

--
09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
Re:Old debate by dzfoo · 2006-07-18 02:03 · Score: 2, Informative

>> Uh, K&R is slightly older than Java or C#... there was no such thing as memory management or virtual machines (as we know them today) back then.

Didn't Infocom implemented their "database query system" (which eventually became their famous text-adventure game engine) using a virtual machine they called the Z-machine? As far as I know that system predated Java and C# by a few decades.

http://en.wikipedia.org/wiki/Z-machine

-dZ.

--
Carol vs. Ghost
...Can you save Christmas?
Re:Old debate by masklinn · 2006-07-18 02:11 · Score: 2, Informative

Yep C is very weakly typed (some could say that it's untyped, as is ASM) as only the compiler does some sanity check, and even then it doesn't work too hard at it.

--
"The way we can tell it's C# instead of Haskell is because it's nine lines instead of two." -- wadler
Re:Old debate by NickFitz · 2006-07-18 02:23 · Score: 2, Interesting

The statement "C is not a high level language" is not logically equivalent to the statement "C is a low level language", so the OP is still entitled to his beer money :-)

--
Using HTML in email is like putting sound effects on your phone calls. Just say <strong>no</strong>.
Re:Old debate by exp(pi*sqrt(163)) · 2006-07-18 02:35 · Score: 2, Insightful

Haskell does OK. But compare to Clean, another pure lazy functional language. Clean blows away Haskell most of the time and competes favourably with C, sometimes beating it.

--
Doesn't it make you feel good to know that our freedoms are protected by politicans, lawyers and journalists.
Re:Old debate by fyngyrz · 2006-07-18 03:30 · Score: 2, Informative

C would certainly be about as low as you can get without manipulating individual registers - i.e., without being assembly language.

Actually, I think Forth is a little lower. The RPN nature of the language makes for a considerably closer mapping from language use to stack use for one thing, and for another, Forth atoms tend to be more primitive and more prefab than what a particular expression in C might produce.

C remains my favorite for anything that requires speed. It has always seemed to me that when someone who understands what is going on at the machine level writes C code, they can make quite fast results as compared to someone who has learned C syntax, but doesn't have a sense of what is happening with stacks, LEAs, how a particular problem may map to float, fixed or integer approaches on top of a particular processor or chip set. C++ approaches appear overrated to me. If I want objects, I make them. If I want a *really* high level approach, I use Python.

Basically, give me C or give me Python.

--
I've fallen off your lawn, and I can't get up.
Re:Old debate by MetaKey · 2006-07-18 03:38 · Score: 2, Insightful

To fill out your item [2], the name of the company was Lattice. MS bought the Lattice compiler and renamed it MS C.
It was an early example of the MS method of software development: buy out someone who has a viable product and do a much better job of marketing that product.
I maintain that MS has never been much of a software development company but, rather, a software marketing company. Certainly, the vast majority of their "innovations" have been in marketing. MS tends to incrementally improve on other developers software while being very innovative in their marketing of that software.
Lattice C was an early example. Excel is a mid-life example. A more recent example is the Groove Networks collaboration tool. MS recently bought them and will include the Groove in the next version of Office. They pretty much had to do this as Office is pretty stale. Who really needs a newer version of Word for example? And OpenOffice is coming along and is free. The only way to improve the Office product enough to warrent an upgrade was to add serious collaboration capabilities. And, this being MS we're talking about, the only way to do that was to go out and buy serious collaboration capabilities. Now they'll integrate it into Office and market the bejeezus out of it.
I rest my case...
Re:Old debate by Julian+Morrison · 2006-07-18 03:54 · Score: 3, Interesting

If you looked at the shootout you'd see what was wrong in Ruby: it's just about the slowest serious scripting language. It seems to be using pure bloody-minded interpretation without any bytecode or JIT stage.

Nothing wrong with the language that a proper implementation couldn't cure, basically.
Re:Old debate by Anonymous Coward · 2006-07-18 03:54 · Score: 2, Interesting

Yeah, but Clean is endorsed by a small dutch company (whereas Haskell is promoted by several people with english-sounding names who can write research papers in perfect English) so nobody cares about Clean.
Re:Old debate by letxa2000 · 2006-07-18 04:17 · Score: 2, Insightful

Haven't people generally considered C to be kind of a cross platform assembler? That certainly seems to be the attitude of the Scheme crowd...

Anyone that considers C to be a "cross-platform assembler" probably has never worked in assembler, and almost definitely hasn't done so on more than one platform.
'C' is only "low level" to those that don't get any closer to the hardware than, say, Visual Basic. Anyone that has programmed in assembly language will assure you that 'C' is quite high level. I'd be willing to accept "mid-level", but in reality once you've worked at the assembly level you will realize that there's very little difference between 'C' and Visual Basic. 'C' and VisualBasic are essentially both high-level languages; but 'C' just seems more intimidating than VisualBasic to the VisualBasic programmer. Those that call 'C' mid-level are probably VisualBasic programmers that think 'C' is intimidating so it clearly can't be a high-level language like VB.
When you write a single line in 'C' and realize that that can correspond to hundreds of assembly language instructions, you realize that 'C' is very much a relatively high-level language. When you try to do floating point math on an 8-bit processor with no floating point instructions, you realize that 'C' is very much a high-level language. When you try to add three numbers and multiply it by a fourth, and you come from 'C', you realize that (1 + 2 + 3) * 4 is a heck of a lot more complicated than you imagined.
The main difference between VB and 'C' is that VB gives you more self-contained packages to let you interact with today's GUI's. 'C' gave you printf which was fine for writing to a terminal. VB gives you all kinds of controls to let you do pretty GUI stuff. The concept is exactly the same, and both are high level.
I say all of this having programmed in assembly language, then Basic, then QuickBasic, then 'C', then VisualBasic, and now almost exclusively 'C' and assembly language in truly embedded systems (embedded != Windows or Linux in a small form factor).
Re:Old debate by The_Wilschon · 2006-07-18 04:20 · Score: 4, Informative

Garbage collection, a form of memory management in widespread use today, was invented "around 1959" by John McCarthy as he discovered LISP. This predates K&R, first edition in 1978, by quite a bit.

--
SIGSEGV caught, terminating

wait... not that kind of sig.
Re:Old debate by civilizedINTENSITY · 2006-07-18 04:44 · Score: 4, Informative

Actually, there was, way before C (let alone Java or C#.)

"Lisp is very old language, second only to Fortran in the family tree of high level languages." A Little history

Whereas C (rather like Fortran) wanted to stay "close to the metal", Lisp wanted to transcend metal to get closer to the math. Hence, innante elegance :-)
Towards the end of the initial period, it became clear that this combination of ideas made an elegant mathematical system as well as a practical programming language. Then mathematical neatness became a goal and led to pruning some features from the core of the language. This was partly motivated by esthetic reasons and partly by the belief that it would be easier to devise techniques for proving programs correct if the semantics were compact and without exceptions. The results of (Cartwright 1976) and (Cartwright and McCarthy 1978), which show that LISP programs can be interpreted as sentences and schemata of first order logic, provide new confirmation of the original intuition that logical neatness would pay off.
It is true that Lisp ran inside an interpreter rather than a VM. Still, garbage collection is *old*, and memory management techniques from the 1950s/60s shouldn't be considered a new thing.

Still waiting for the Visual.Lisp.Net, though :-) When UML and visual design paradims are finally swallowed by Lisp, oh what fun times we'll have! ;-)
Re:Old debate by Julian+Morrison · 2006-07-18 06:43 · Score: 2, Insightful

Nah, that's nonsense. Python has string objects, Scheme has continuations. Ruby's still slower.

First-class reentrant continuations and dynamic typing (another major efficiency hog) probably constrain you to, in the best case, the same box as compiled Scheme - about the same as Java.
Re:Old debate by Dr.+Zowie · 2006-07-18 07:13 · Score: 3, Interesting

There are two main problems with Ruby as a replacement for "a very ugly language called Perl":

* It's not sufficiently prettier than Perl

* It's not Perl

Perl may look ugly but it is to most programming languages as English is to most other languages. Perl is a brawling, sprawling mess of borrowed, Hamming-optimized idioms that is extremely ugly from the POV of a syntax engineer and extremely expressive from the POV of a fluent speaker.

Ruby is more like Esperanto - elegant, clean, and spoken by practically no-one because it isn't very expressive.
Re:Old debate by masklinn · 2006-07-18 07:18 · Score: 2, Interesting

How about the fact that you can't use an integer as an array index in Ada and you have to use natural numbers (defined as a positive or null integer), because array indexes can't be negative (in most languages anyway, some -- like Python -- are exceptions to this quite common rule) and you therefore shouldn't be allowed to use a number that might ever be negative as an index. C# merely gives you a warning if your index is explicit (e.g., myArray[-1]) and doesn't do anything otherwise, before throwing an IndexOutOfRangeException at runtime.

That's one measly example, but I find it quite interresting.

So no, C#'s unsafe keyword isn't a factor (and the lack of implicit conversion clearly isn't, if anything implicit conversion is the sure sign of quirky and unsafe type systems).

When people say that an Ada program that compiles will usually work without problem, they're not joking, Ada's type system is so extensive and so strong that it misses very few errors (that it could handle, that is, flaws in your own logic can't be patched by a compiler).

--
"The way we can tell it's C# instead of Haskell is because it's nine lines instead of two." -- wadler
Re:Old debate by lewp · 2006-07-18 08:00 · Score: 3, Funny

Do remember please that when that quote was written in the 1970s, Java and C# would have seemed absurd and unrealistic.

Some things never change.

--
Game... blouses.
Re:Old debate by jgrahn · 2006-07-18 10:41 · Score: 2, Insightful

This is the first valid criticism of C++ vs C I've ever seen. Most complaints about C++ are "it's not what I'm used to" from C programmers, but this is just a fundamental design flaw in C++.

What fundamental design flaw -- that malloc() is less convenient to use in C++? For crying out loud, use new!
Ok, void pointers are less useful in C++ than in C. In my experience, that has been a non-problem. But then I've never tried to program in C with a C++ compiler -- I have enough problems without creating artificial ones.
Re:Old debate by pthisis · 2006-07-18 10:46 · Score: 2, Informative

Well, all I can say is that C++ is (so much) more than just a "stricter" C.

While I agree with your core point, I have to take exception to the implication that C++ is at all a stricter C (even if it's also more). C++ and C are different languages, and C is not a subset of C++. There are valid C programs that are invalid in C++ (even not using things like variables named "new", etc), and features like implicit void casting that C++ lacks. There are programs that are valid C and valid C++ but behave differently.

And that's without getting into features of modern C (variable size arrays, language built-in complex numbers, restricted pointers, etc) that are not in C++ as far as I know.

But as far as your main point, yes, the reason to use C++ is if you want/need C++ features. My original objection was to the suggestion that you just "write C but use a C++ compiler to add namespaces and nothing else". Many of the drawbacks of C++ compared to C are pretty minor, and may be worth the tradeoff if you're going to take advantage of a lot of language features. Writing "C in C++" is just silly, though.

--
rage, rage against the dying of the light
Re:Old debate by chthonicdaemon · 2006-07-18 18:18 · Score: 2, Informative

Automated memory mgmt via garbage collection has been a feature of Lisp and many other languages since the early 1960s http://www-128.ibm.com/developerworks/library/j-jt p10283/

--
Languages aren't inherently fast -- implementations are efficient

Bah by perrin · 2006-07-17 23:24 · Score: 4, Insightful

So we "still can get good performance" from C? The implication is that C will somehow become overcome by some unnamed high-elvel language soon. That is just wishful thinking. The article is not very substantial, and where it tries to substantiate, it misses the mark badly. The claim that C cannot handle SIMD instructions well is not true. You can use them directly from C, or the C compiler can use them through autovectorization, as in gcc 4.1. The claim that C cannot inline functions from another source file is also wrong. This is a limitation in gcc, but other compilers can do it, and IIRC the intel compiler can. It is certainly not "impossible".

Re:Bah by TheRaven64 · 2006-07-17 23:38 · Score: 5, Insightful
The claim that C cannot handle SIMD instructions well is not true. You can use them directly from C, or the C compiler can use them through autovectorization, as in gcc 4.1
You have two choices when using SIMD instructions in C:
1. Use non-portable (between hardware, and often between compilers) intrinsics (or even inline assembly).
2. Write non-vectorised code, and hope the compiler can figure out how to optimally decompose these into the intrinsics. Effectively, you think vectorised code, translate it into scalar code, and then expect the compiler to translate it back.
Compare the efficiency of GCC at auto-vectorising FORTRAN (which has a primitive vector type) and C (which doesn't), if you don't believe me.
The claim that C cannot inline functions from another source file is also wrong. This is a limitation in gcc, but other compilers can do it, and IIRC the intel compiler can. It is certainly not "impossible".
When you pass a C file to a compiler, it generates an object file. It has absolutely no way of knowing where functions declared in the header are defined. You can hack around this; pass multiple source files to the compiler at once and have it treat them as a single one, for example, but this falls down completely when the function is declared in a library (e.g. libc) and you don't have access to the source.
--
I am TheRaven on Soylent News
Re:Bah by Anonymous Coward · 2006-07-17 23:42 · Score: 5, Insightful

C is faster in the same sense that assembly is faster: You have more control over the resulting machine code, so the code can by definition always be faster. You can optimize by hand. But that comes at a price: You have to optimize by hand. That's why C isn't always faster, especially not when it's supposed to be portable. The question isn't whether there could be a faster program in a language of choice, it's whether a language is at the right level of abstraction for a programmer to describe what the program must do and not a bit more. Overspecification prevents optimization. If you write for (int i=0; i<100; i++) where you really meant for (i in [0..99]), how is the compiler going to know if order is important? The latter is much more easily parallelized, for example. C is full of explicitness where it is often not needed. Assembly even more so. That's the problem of low level languages.
Re:Bah by gbjbaanb · 2006-07-18 00:02 · Score: 2, Informative

It seemed to me the article was criticising C and trying to compare Java favourably. ie, C is a low level language that canot be optimised, Java is a high level language that can. roughly.

It didn;t say much at all otherwise, but it did have a nice collection of adverts.

Optimisation:
You don't have to hack around, some compilers do it for you. The new MS compiler does a 'whole program optimisation' where it will link things together from separate object modules. Still cannot handle libraries, but then, that's just an issue that applies to all programs that are split into component parts. (except as the article implies, java that uses the bytecode in class libraries... except when compiled to native code as the first page of the article mentioned as a way to boost speed. Can't have it both ways :-) )
Re:Bah by rbarreira · 2006-07-18 00:17 · Score: 2, Insightful

Use non-portable (between hardware, and often between compilers) intrinsics (or even inline assembly).

Which usually isn't a big problem anyway since the code sections in which that's an advantage are usually quite small and infrequent, so if you really need the performance you can make a very little sacrifice of inserting conditional compiling statements with different code for the platforms which you are interested on.

It's certainly not an ideal solution but it's a very attractive one, and it has the advantage that you can have experts on each CPU optimizing the code of the platform they know best.

--

The AACS key is NOT 0xF606EEFD628B1CA427BEA93A9CA9773F
Re:Bah by eraserewind · 2006-07-18 01:05 · Score: 2, Insightful

Compare the efficiency of GCC at auto-vectorising FORTRAN (which has a primitive vector type) and C (which doesn't), if you don't believe me.
You see this all the time in SW Engineering. If there is a well defined high level API specifying what something is trying to do rather than how it should be efficiently (at the time) done, it will eventually be far more efficient to use the API because it will be get dedicated instructions in the chipset or even be completely implemented in a dedicated HW device whereas the how to do it version will be forever limited by how it's doing it.

For PCs this isn't so obvious, since generic hardware + biggest CPU going tends to get used, but in embedded devices the dedicated hardware is much more often the way to go than the processor upgrade. My last project I can think of two APIs that gave us this benefit immediately without SW effort on our part, and a third area that benefitted by ripping out all the "optimized" code that bypassed the API, and using the (now HW accelerated) API directly.
Re:Bah by sasdrtx · 2006-07-18 01:34 · Score: 2, Insightful

From the first sentence: "The closer to the metal you can get while programming, the faster your program will compile..." WTF? How fast a language compiles has nothing to do with the so-called myth, whcih is that low-level languages allow a good programmer to produce programs that run faster. They may well compile faster (and they probably retain that advantage), but that's beside the point.

Oddly enough, he proceeds to jump back on track and discuss optimization techniques and levels, most of which is OK. But he berates Java for implementing arrays (that's supposed to be an advantage over C and C++, which don't), and ignores the advantages of managed memory provided by a virtual machine.

C. Needs more work.

(yes, that's a pitiful pun.)

--
Most people don't even think inside the box.

High Level by HugePedlar · 2006-07-17 23:24 · Score: 4, Insightful

I remember back in the days of the Atari ST and Amiga, C was considered to be a high-level language. People would complain about the poor performance of games written in C (to ease the porting from Amiga to ST and vice versa) over 'proper' Assembly coded games.

Now I hear most people referring to C and C++ as "low level" languages, compared to Java and PHP and visual basic and so on. Funny how that works out.

I like Assembler. There's something about interacting intimately with your target hardware. It's a shame that it's no longer feasible with today's variety of hardware.

--
Argh.

Re:High Level by radarsat1 · 2006-07-18 00:25 · Score: 4, Insightful

No. Well, generally you'll have faster code if you code it in assembly. But things change when you enter the world of embedded programming... you're right, portability isn't AS important as speed. Sometimes. In certain parts of your program. But I recommend you DON'T disregard portability, even when it comes to microprocessors. In a real-world engineering project, you never know when one day parts will change, parts become obsolete, and you don't want to be left having to translate thousands of lines of assembly code.

Rather, usually whats done is that most of the code is written in C, and only those parts that REALLY REALLY have to be optimized, like interrupt handlers for example, can be done in assembly. People use assembly for routines that, for example, have to take exactly a certain number of instruction cycles to complete.

But it should be avoided as much as possible. It's just not worth losing the portability.

More and more these days, microprocessors are embedding higher level concepts, and even entire operating systems, just to make software development easier.
Re:High Level by rockmuelle · 2006-07-18 01:39 · Score: 2, Interesting

"I like Assembler. There's something about interacting intimately with your target hardware. It's a shame that it's no longer feasible with today's variety of hardwar"

A minor observation about the feasability of working with the target hardware: the two most popular instruction set architectures for commidity hardware, PowerPC and IA-32, have both been stable since the mid 90s. The programming guide for PowerPC processors is still pretty much the same document as it was in 1996, around the same time the PowerPC ISA was defined. IA-32 has undergone some changes with each new major processor family, but is still backwards compatible at the instruction level with processors released in the 80s.

Contrast this with high-level (i.e., non assembly languages). Java has undergone a few major revisions in its 10 year lifespan. C++ has yet to have a compiler that fully implements the spec (think export and the really fun template games). Scripting languages are constantly evolving and sometimes aren't backwards compatible over a 4 year period. Then there's the Microsoft switch to .Net that invalidated billions of lines of VB and VC++ code. Compared to these languages, machine code is incredibly stable and portable (across processor iterations, at least).

Of course, there are architecture considerations for squeezing performance out of code. But, again, these haven't changed much in the last 10 years, either. The memory bus is still the bottleneck and you still get 50-80 instructions for 'free' on each load, even if you're not filling the pipeline completely. If you are doing something that isn't memory bound, it's not that hard to look up the instruction latencies in the manual and code things up to fully utilize the processing units and keep the pipline full. At least, it's no more difficult developing scalable EJB applications for your favorite web application engine...

-Chris

Article is theory not practice - no measurements by ChrisRijk · 2006-07-17 23:25 · Score: 2, Interesting

Not really much "meat" here. The proof is in the pudding as they say - but there's no benchmarks here. Just some minor talk about how things should compare.

I don't agree with the basic premise of the article at all - but I've also written equivalent programs in C and more modern languages and compared the performance.

Inaccurate summary by rbarreira · 2006-07-17 23:26 · Score: 4, Insightful

The task of mapping C code to a modern microprocessor has gradually become increasingly difficult.

This is not true. What they mean, I think, is "the task of mapping C code to efficient machine code has gradually become increasingly difficult".

--

The AACS key is NOT 0xF606EEFD628B1CA427BEA93A9CA9773F

Re:Slashdot by jamie · 2006-07-17 23:27 · Score: 2, Informative

We had to make a change to our 'comments' table schema that would have locked up the site if we had allowed full access. At over 15M rows, this takes some time. Sorry about that.

It's very simple by dkleinsc · 2006-07-17 23:33 · Score: 4, Interesting

The speed of code written in computer language is based on the number of CPU cycles required to carry it out. That means that the speed of any higher-level language is related to the efficiency of code executed by the interpreter or produced by the compiler. Most compilers and interpreters these days are pretty darn good at optimizing, making the drawback of using a higher-level language less and less important.

If you don't believe me, I suggest you look at some of the assembly code output of gcc. I'm no assembly guru, but I don't think I would have done as well writing assembly by hand.

--
I am officially gone from /. Long live http://www.soylentnews.com/

Re:It's very simple by rbarreira · 2006-07-17 23:41 · Score: 4, Informative

I'm no assembly guru, but I don't think I would have done as well writing assembly by hand

I don't believe this as much as the people who I see repeating that sentence all the time...

Not many years ago (with gcc), I got an 80% speed improvement just by rewriting a medium sized function to assembly. Granted, it was a function which was in itself, half C code, half inline assembly, which might hinder gcc a bit. But it's also important to note that if the function had been written in pure C code, the compiler wouldn't have generated better code anyway since it wouldn't use MMX opcodes... Last I checked, MMX code is only generated from pure C in modern compilers when it's quite obvious that it can be used, such as in short loops doing simple arithmetic operations.

An expert assembly programmer in a CPU which he knows well can still do much better than a compiler.

--

The AACS key is NOT 0xF606EEFD628B1CA427BEA93A9CA9773F
Re:It's very simple by hummassa · 2006-07-17 23:59 · Score: 2, Insightful

An expert assembly programmer in a CPU which he knows well can still do much better than a compiler.
FOR ONE FUNCTION. If you programmed the whole system in asm, you'd see that the assembler+you combo would lose so many opportunities for optimization that a good compiler got. And that's the whole point of the article.

--
It's better to be the foot on the boot than the face on the pavement. ~~ tkx Kadin2048
Re:It's very simple by spinkham · 2006-07-18 00:04 · Score: 2, Interesting

True, since they can always start with the compiler output, and are thus will at least do no worse.
The more interesting question is if a person with only passing familiarity with assembly can do better then the compiler, and the answer to that is usually no these days.

--
Blessed are the pessimists, for they have made backups.
Re:It's very simple by jtshaw · 2006-07-18 00:12 · Score: 3, Interesting

Most compilers and interpreters these days are pretty darn good at optimizing, making the drawback of using a higher-level language less and less important.

In the past, most compilers were dreadful at optimizations. Now, they are just horrible. I guess that is an improvement, but I still believe there is a lot of good research to come here.

I do agree that the playing field has become pretty even. For example, with the right VM and the right code you can get pretty good performance out of Java. Problem is "the right VM" depends greatly on the task the program is doing.. certainly not a one vm fits all out of the box solution (ok.. perhaps you could always use the same VM, but app specific tuning is often neccesary for really high performance).

At any rate.. people just need to learn to use the best tool for the job. Most apps don't actually need to be bleedingly fast, so developing them in something that makes the development go faster is probably more important then developing them in something to eek out that tiny performance gain nobody will probably notice anyway.

C is the 3vil by Anonymous Coward · 2006-07-17 23:35 · Score: 4, Funny

Isnt the JIT for java written in C though.

ahah now we know why my java program is so slow. damn C slowing it down.

Great article! by TeknoHog · 2006-07-17 23:38 · Score: 4, Funny

This is exactly what I've been saying over and over, why I think that e.g. Fortran is better than C in many respects. The main point is neatly summarized at the end:

the more information you can give to your optimizer, the better the job it can do. When you program in a low-level language, you throw away a lot of the semantics before you get to the compilation stage, making it much harder for the compiler to do its job.

--
Escher was the first MC and Giger invented the HR department.

It goes both ways by JanneM · 2006-07-17 23:46 · Score: 4, Interesting

Sure, CPU:s look quite a bit different now than they did 20+ years ago. On the other hand, CPU designs do heavily take into account what features are being used by the application code expected to be run on them, and one constant you can still depend on is that most of that application code is going to be machine-generated by a C compiler.

For instance, 20 years ago there was nothing strange about having an actual quicksort machine instruction (VAXen had it). One expectation was still, at the time, that a lot of code would be generated directly by humans, so instructions and instruction designs catering to that use-case were developed. But by around then, most code was machine generated by a compiler, and since the compiler had little high-level semantics to work with, the high-level instructions - and most low-level one's too - went unused; this was one impetus for the development of RISC machines, by the way.

So, as long as a lot of coding is done in C and C++ (and especially in the embedded space, where you have most rapid CPU development, almost all coding is), designs will never stray far away from the requirements of that language. Better compilers have allowed designers to stray further, but stray too far and you get penalized in the market.

--
Trust the Computer. The Computer is your friend.

Re:It goes both ways by pesc · 2006-07-18 00:21 · Score: 4, Informative

20 years ago there was nothing strange about having an actual quicksort machine instruction (VAXen had it).

While the VAX had some complex instructions (such as double-linked queue handling), it did not have a quicksort instruction.

Here is the instruction set manual.

--

)9TSS
Re:It goes both ways by Trailer+Trash · 2006-07-18 01:46 · Score: 2, Informative

For instance, 20 years ago there was nothing strange about having an actual quicksort machine instruction (VAXen had it). One expectation was still, at the time, that a lot of code would be generated directly by humans, so instructions and instruction designs catering to that use-case were developed. But by around then, most code was machine generated by a compiler, and since the compiler had little high-level semantics to work with, the high-level instructions - and most low-level one's too - went unused; this was one impetus for the development of RISC machines, by the way.

As someone else mentioned, there is no quicksort instruction. That's far too complex and involves looping and conditional branching. Probably the most complex of vax instructions was the polyf/polyg instruction, which would compute a polynomial to 7 iterations thus allowing one instruction to compute a trigonometric function. There were also instructions for copying strings up to 64k (and those instructions were interruptable), and instructions to format numbers a la cobol pics. These instructions were generally emulated in the smaller microvaxen and such, but were in microcode on the larger ones. Note that even x86 has a string copy instruction.

Now, here's where you're really wrong. Those instructions weren't put in there as a convenience to humans writing in assembly. Instead, they were put in there as a convenience to compiler writers who could make use of the high-level assembly instructions to ease their code generation. The cobol compiler was almost unnecessary. They had numeric data types to cover it, it was nuts.

They also had instructions to deal with octawords (128 bit integers), and of course the vax allowed accesses of any size integer on any boundary, which could mean a couple of fetches for a particular piece of data. There are assembly instructions to force alignment.

The only non-magic of which I'm aware is that it was "required" that between writing a piece of code into memory and executing it there should be an intervening rei instruction, apparently to clear all caching. I put the word "required" in quotes for a reason. A professor at a college that I attended wrote a very popular Scheme compiler. I mentioned one day to a grad-student friend this requirement, and somehow we ended up getting to the prof. He didn't have that in his compiler and it worked just fine writing to a piece of memory then executing it. I showed him the page in the VAX Architecture Handbook (probably around 276 or 278) and we got a good chuckle.

Anyway, shortly after VAX came out people started to seriously think about simplifying the instruction set and putting more burden on the compilers. I still believe the Alpha is probably the king of risc, ironic given that VAX is the king of cisc. Most of the lessons that VAX taught us were in the negative.

--
Do you have ESP?
Re:It goes both ways by Waffle+Iron · 2006-07-18 02:25 · Score: 3, Interesting

That reminds me of the most specialized machine instruction I ever saw. Back in the 80s I was in a EE lab where we made our own CPUs on breadboards out of AMD bitslice chips, then we implemented the specified instruction set in microcode. A large chunk of the grade was based on the lab instructor running a standard test program on each team's "system" and checking the expected results.
One guy I knew realized that he was never going to get his rig stable enough to run through the whole test, so he set up a single opcode to just dump the entire expected output of the test program to the printer then halt. IIRC, he pulled it off.

High-level languages have an advantage by Bogtha · 2006-07-17 23:47 · Score: 5, Insightful

The more abstract a language is, the better a compiler can understand what you are doing. If you write out twenty instructions to do something in a low-level language, it's a lot of work to figure out that what matters isn't that the instructions get executed, but the end result. If you write out one instruction in a high-level language that does the same thing, the compiler can decide how best to get that result without trying to figure out if it's okay to throw away the code you've written. Optimisation is easier and safer.

Furthermore, the bottleneck is often in the programmer's brain rather than the code. If programmers could write code ten times faster, that executes a tenth as quickly, that would actually be a beneficial trade-off for many (most?) organisations. High-level languages help with programmer productivity. I know that it's considered a mark of programmer ability to write the most efficient code possible, but it's a mark of software engineer ability to get the programming done faster while still meeting performance constraints.

--
Bogtha Bogtha Bogtha

Re:High-level languages have an advantage by Eivind · 2006-07-18 00:01 · Score: 5, Insightful

If programmers could write code ten times faster, that executes a tenth as quickly, that would actually be a beneficial trade-off for many (most?) organisations.
Especially since you can combine. Even in high-performance applications there's typically a only a tiny fraction of the code that actually needs to be efficient, it's perfectly common to have 99% of the time spent in 5% of the code.
Which means that in basically all cases you're going to be better off writing everything in a high-level language and then optimize only those routines that need it later.
That way you make less mistakes, and get higher-quality better code quicker for the 95% of the code where efficiency is unimportant, and you can spend even more time on optimizing those few spots where it matters.
Re:High-level languages have an advantage by StormReaver · 2006-07-18 00:40 · Score: 2, Informative

"If programmers could write code ten times faster, that executes a tenth as quickly, that would actually be a beneficial trade-off for many (most?) organisations."

This sound perfectly reasonable in theory. In practice, however, it's not. Users want speedy development AND speedy execution. I developed a Java image management program for crime scene photos, and the Sheriff Patrol's commander told me flat out: we'll never use this. It's too slow.

I rewrote the program using C++ and Qt, and gained a massive speed improvement. The Sheriff Patrol and detective units have been using it ever since, and they love it. I had been a Java booster for upwards of eight years until then. That was (roughly) three years ago, and I haven't written a line of Java since. I have, however, run my historic Java programs in SUN's most recent JVM. The newer hardware runs it faster, but Qt/C++ still smokes Java. Qt gives me speedy development, and C++ gives me fast execution. It's the best of both worlds.
Re:High-level languages have an advantage by mrsbrisby · 2006-07-18 01:56 · Score: 2, Insightful

The more abstract a language is, the better a compiler can understand what you are doing

Except it doesn't. Nobody has written a compiler that smart, and I don't care what anyone says: I don't think anyone ever will.

Learning how to invent and develop algorithms is important. Learning how to translate those algorithms into various languages is important. And knowing how the compiler will translate those algorithms into machine instructions- and how the CPU itself will process those machine instructions, will yield a lot more performance than choice of languages.

Consider djbfft, one of the fastest FFT implementations, outruns many FFT implementations in Java, Haskell, Lisp, or assembly, and yet it's written in C.

Don't confuse me: I'm not saying C is fast, or C is good, I'm saying djbfft is good. Reordering the instructions in the C code will lower the efficiency- even if the code is otherwise equivelent.

That said, I agree with almost everything else in your post.

Typical Java Handwaving by mlwmohawk · 2006-07-17 23:48 · Score: 5, Insightful

The first mistake: Confusing "compile" performance with execution performance. The job of maping C/C++ code to machine code is trivial.

I've been programming professionally for over 20 years, and for those 20 years, the argument is that computers are now fast enough to allow high level languages and we don't need those dirty nasty assemblers and low level languages.

What was true 20 years ago is still true today, well written code in a low level language tailored to how the computer actually works will always be faster than a higher level environment.

The problem with computer science today is that the professors are "preaching" a hypothetical computer with no limitations. Suggesting that "real" limitations of computers are somehow unimportant.

If computer science isn't about computers, what is it about? I haate that students coming out of universities, when asked about registers and how would they write a multiply routine if they only had shifts and adds, ask "why do I need to know this?"

Software sucks today because software engineers don't understand computers, and that's why languages and environments like Java and .NET will make software worse.

Re:Typical Java Handwaving by iotaborg · 2006-07-18 00:12 · Score: 2, Insightful

If computer science isn't about computers, what is it about?

I was rather under the impression that computer science was the theory of computation, where the computer is simply a tool; just as much as a soldering iron is a tool in electrical engineering.
Re:Typical Java Handwaving by cain · 2006-07-18 00:13 · Score: 5, Insightful

If computer science isn't about computers, what is it about?

"Computer science is no more about computers than astronomy is about telescopes" -- Edsger Dijkstra quotes (Dutch computer Scientist. Turing Award in 1972. 1930-2002)
Sorry, you're arguing against Dijkstra: you lose. :)
Re:Typical Java Handwaving by rbarreira · 2006-07-18 00:23 · Score: 2, Insightful

Of course, an Idiot might write nonsense code in .NET, but that doesn't mean .NET is a bad thing.

I think his point was not that abstractions are bad, but that not knowing what's happening behind the scenes isn't good.
Even to optimize .NET code, sometimes it's good to inspect the generated CIL (or even asm!) code in order to know why something isn't going fast.

--

The AACS key is NOT 0xF606EEFD628B1CA427BEA93A9CA9773F
Re:Typical Java Handwaving by arevos · 2006-07-18 00:31 · Score: 4, Insightful

The first mistake: Confusing "compile" performance with execution performance. The job of maping C/C++ code to machine code is trivial.

I've designed compilers before, and I wouldn't class constructing a C/C++ compiler as "trivial" :)

If computer science isn't about computers, what is it about? I haate that students coming out of universities, when asked about registers and how would they write a multiply routine if they only had shifts and adds, ask "why do I need to know this?"

One could also make the opposite argument. Many computer courses teach languages such as C++, C# and Java, which all have connections to low level code. C# has its pointers and gotos, Java has its primatives, C++ has all of the above. There aren't many courses that focus more heavily on highly abstracted languages, such as Lisp.

And I think this is more important, really. Sure, there are many benefits to knowing the low level details of the system you're programming on; but its not essential to know, whilst it is essential to understand how to approach a programming problem. I'm not saying that an understanding of low level computational operations isn't important, merely that it is more important to know the abstract generalities.

Or, to put it another way, knowing how a computer works is not the same as knowing how to program effectively. At best, it's a subset of a wider field. At worst, it's something that is largely irrelevant to a growing number of programmers. I went to a University that dealt quite extensively with low level hardware and networking, and a significant proportion of the marks of my first year came from coding assembly and C for 680008 processors. Despite this, I can't think of many benefits such knowledge has when, say, designing a web application on Ruby on Rails. Perhaps you can suggest some?

Software sucks today because software engineers don't understand computers, and that's why languages and environments like Java and .NET will make software worse.

I disagree. I think software sucks because software engineers don't understand programming
Re:Typical Java Handwaving by Oligonicella · 2006-07-18 00:36 · Score: 4, Insightful

"The job of maping C/C++ code to machine code is trivial."

Which machine, chum?

"I've been programming professionally for over 20 years..."

OK, bump chests. I've been at it for 35+. And? Experience doth not beget competence. There are uses for low-level languages and those that require them will use them. Try writing a 300+ module banking application in assembler. By the time you do, it will be outdated. Not because the language will change, but because the banking requirements will. Using assembler to write an application of that magnitude is like trying to write an Encyclopedia article with paper and pencil. Possible, but 'tarded.

"Software sucks today because software engineers don't understand computers, and that's why languages and environments like Java and .NET will make software worse."

More like, 'software sucks today for the same reason it always has -- fossized thinkers can't change to make things easier for those who necessarily follow them.' Ego, no more.
Re:Typical Java Handwaving by Erich · 2006-07-18 02:04 · Score: 2, Insightful

That low-level stuff is important if you have code that needs to run fast. Need to multiply a number by a constant? You can use shifts and adds instead. Does the same thing, but takes the processor less time.

INCORRECT
. Shifts and adds are sometimes faster for certain constants. Power of two, maybe power-of-two plus one. But for any arbitrary constant, this is false on most processors. Multipliers are much faster than a stream of many shifts and adds. Furthermore, the compiler should hold the knowledge of when a shift-and-add is better-performing than a multiply for what constant values. And, if you're not using MyPrettySchoolProjectCC, it probably does.
Now what your compiler *really* hopefully knows about is how to make division by a constant into a multiply. That can really save time. Division is an iterative process and is very hard to make fast. Multiplies are highly parallel; you can do large multiplies fully pipelined and with pretty low latency. And you can typically turn a 32 bit / 32 bit divide into a 32x32->64 multiply with the reciprocal. Since you can determine the reciprocal at compile time this is probably a win.
Maybe you just went to a school where they didn't show you how multiplies are actually implemented on modern hardware. Shift registers with accumulators they aren't. This is also potentially a reason why the professor will tell you that you can't outsmart the compiler. The typical college student can't, because he or she doesn't understand enough about how things really work. But any engineer with a decent amount of experience -- or most grad students -- can outsmart a compiler easily.

--
-- Erich
Slashdot reader since 1997

Yes, but is it worth it? by Toreo+asesino · 2006-07-17 23:52 · Score: 2, Informative

Of course, lower-level languages can be faster, but I'd suggest that writing code at a very low-level is rarely worth the extra effort.

Take Quake II for instance; as quoted from the article 'the managed version initially ran faster than the native version' - which would suggest higher-level languages are certainly capable of comparing to that of their lower-level siblings.

Also, take into account the added developer time gained from factors like memory-management being, well, managed, and ever-falling processor & memory prices, and the logical conclusion is usually "write at a higher-level".

There are of course more considerations than these when deciding on a development platform, but essentially, I think there'd have to be very good reasons for writing green-field projects too close to the machine.

--
throw new NoSignatureException();

Single Page Version of the Article by jaaron · 2006-07-17 23:56 · Score: 2, Informative

Here's a print view of the article so that you don't have to keep moving through the pages. Despite that annoyance, it was a good article. I wish there had been more concrete examples though.

--
Who said Freedom was Fair?

Some comments on the article by rbarreira · 2006-07-18 00:04 · Score: 4, Insightful

OK, the article isn't bad but contains a few misleading parts... Some quotes:

one assembly language statement translates directly to one machine instruction

OK, this is nitpicking but there are some exceptions - I remember that TASM would convert automatically long conditional jumps to the opposite conditional jump + an unconditional long jump since there was no long conditional jump instruction.

Other data structures work significantly better in high-level languages. A dictionary or associative array, for example, can be implemented transparently by a tree or a hash table (or some combination of the two) in a high-level language; the runtime can even decide which, based on the amount and type of data fed to it. This kind of dynamic optimization is simply impossible in a low-level language without building higher-level semantics on top and meta-programming--at which point, you would be better off simply selecting a high-level language and letting someone else do the optimization.

This paragraph is complete crap. If you're using a Dictionary API in a so called "low-level language", it's as possible for the API to do the same optimization as it is for the runtime he talks about; and you're still letting "someone else do the optimization".

When you program in a low-level language, you throw away a lot of the semantics before you get to the compilation stage, making it much harder for the compiler to do its job.

That's surely true. But the opposite is also true - when you use an immense amount of too complex semantics, they can be translated into a pile of inefficient code. Sure, this can improve in the future, but right now it's a problem of very high level constructs.

Due to the way C works, it's impossible for the compiler to inline a function defined in another source file. Both source files are compiled to binary object files independently, and these are linked.

Not exactly true I think. Yes, the approach on that page is not standard C, but on section 4 he also talks about some high level performance improvements which are still being experimented on, so...

--

The AACS key is NOT 0xF606EEFD628B1CA427BEA93A9CA9773F

Typical "/." Handwaving by Anonymous Coward · 2006-07-18 00:08 · Score: 5, Insightful

"I've been programming professionally for over 20 years, and for those 20 years, the argument is that computers are now fast enough to allow high level languages and we don't need those dirty nasty assemblers and low level languages."

The "appeal to an expert" fallacy?

"What was true 20 years ago is still true today, well written code in a low level language tailored to how the computer actually works will always be faster than a higher level environment."

It also means that portability becomes ever harder, as well as adaptability to new hardware.

"If computer science isn't about computers, what is it about? I haate that students coming out of universities, when asked about registers and how would they write a multiply routine if they only had shifts and adds, ask "why do I need to know this?""

It's about algorithms. Computers just happen to be the most convienent means for trying them..

"The problem with computer science today is that the professors are "preaching" a hypothetical computer with no limitations. Suggesting that "real" limitations of computers are somehow unimportant."

With the trend towards VM's and virtualization, that "hypothetical" computer comes ever closer.

"Software sucks today because software engineers don't understand computers, and that's why languages and environments like Java and .NET will make software worse."

Now who's handwaving?

Re:Typical "/." Handwaving by 14CharUsername · 2006-07-18 00:54 · Score: 3, Insightful

Now who's handwaving?

I'd say you are. His first statement wasn't a logically fallacy, he was just pointing out this argument has been going on for a long time.
You made a good point about portability, but I think that was your only point. And its easily shot down byt the fact that its just as easy to port a standard C/C++ API to a new environment as it is to port Java/.NET to a new environment.
He made an excellent point about many new graduates not knowing how the CPU actually works and you replied with: "It's about algorithms. Computers just happen to be the most convienent means for trying them.." ??? What the hell does that mean? Handwaving indeed.
His main point was that VM's are always slower compiled machine code. Even if computers are doubling in speed every 18 months or whatever, native machine code will still be faster than virtual machine code.
With the trend towards VM's and virtualization, that "hypothetical" computer comes ever closer.

Right there you have just proven yourself to be an academic. Trends do not make reality. Besides that, what about gcj? If VMs were so great, why would anyone want to compile java to native code? In the real world, people care about performance. Academics are satisfied that a problem has a solution. In the real world we need to be able to get a solution in the minimum amount of time. VMs always take more time.
Now you may continue your handwaving.
Re:Typical "/." Handwaving by Azarael · 2006-07-18 01:14 · Score: 2, Insightful

The "appeal to an expert" fallacy?
I've never come across that fallacy in philosophy class, however, if you mean the "Improper Appeal to Authority" fallacy then it isn't. If the above poster was a movie star or a well known public figure and their comments about the article are being referenced to prove a point (assuming said movie star or public figure isn't an expert programmer), then that would be an improper appeal to authority. In any case, the insight and experience of long time programmer is valuable. Sure they can be wrong but, they still know their stuff front to back. Likely the GP poster knows very well that you can through as much virtualization as you want at a problem, but no matter what, you're still bound to the limitations of the underlying hardware. Maybe at some point hardware with almost infinite flexibility will exist and I'd be surprised if that happened any time soon.
Re:Typical "/." Handwaving by Anonymous Coward · 2006-07-18 04:04 · Score: 3, Insightful

"In the real world we need to be able to get a solution in the minimum amount of time. VMs always take more time."

I'd argue that in the real world (or at least business world) we need the solution to be developed in the shortest amount of time, with the most amount of security. While a VM based language is not guaranteed to provide quicker time / security, in most cases it probably will.

What I didn't see in TFA... by s_p_oneil · 2006-07-18 00:11 · Score: 4, Insightful

I didn't see anything mentioning that many high-level languages are written in C. And I don't consider languages like FORTRAN to be high-level. FORTRAN is a language that was designed specifically for numeric computation and scientific computing. For that purpose, it is easy for the compiler to optimize the machine code better than a C compiler could ever manage. The FORTRAN compiler was probably written in C, but FORTRAN has language constructs that are more well-suited to numeric computation.

Most truly high-level languages, like LISP (which was mentioned directly in TFA), are interpreted, and the interpreters are almost always written in C. It is impossible for an interpreted language written in C (or even a compiled one that is converted to C) to go faster than C. It is always possible for a C programmer to write inefficient code, but that same programmer is likely to write inefficient code in a high-level language as well.

I'm not saying high-level languages aren't great. They are great for many things, but the argument that C is harder to optimize because the processors have gotten more complex is ludicrous. It's the machine code that's harder to optimize (if you've tried to write assembly code since MMX came out, you know what I mean), and that affects ALL languages.

They put the D in DUH by billcopc · 2006-07-18 00:17 · Score: 2, Informative

The main reason C is "faster" than high level languages is because C doesn't cover bad programmers' butts with elaborate type checking, ref counting and garbage collection. Take a properly designed C app with graceful error handling and secure inputs, and you will take a performance hit. Let's face it, most of the code we write in C involves error handling and idiot-proofing, things that most high-level languages have built-in functionality for these boring, repetitive slabs of code we all hate writing.

I see no reason why a high-level application couldn't be compiled as skillfully as a feature-equivalent low-level application. It's just a matter of breaking down the code into manageable building blocks.

--
-Billco, Fnarg.com

Assembler by backwardMechanic · 2006-07-18 00:36 · Score: 4, Insightful

Every serious hacker should have a play with assember, or even machine code. There is real magic in starting up a uP or uC on a board you built yourself, and making it flash a few LEDs under the control of your hand assembled program. I found a whole new depth of understanding when I built a 68hc11 based board (not to mention memorizing a whole bunch of op-codes). Of course, I'd never want to write a 'serious' piece of code in assembly, and it still amazes me that anyone ever did!

Re:Article is theory not practice - no measurement by mrchaotica · 2006-07-18 00:36 · Score: 3, Informative

The proof is in the pudding as they say

No, what they say is "the proof of the pudding is in the eating." (Just pointing it out because most people get it wrong.)

--

"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

Re:high level vs. low level 101 by backwardMechanic · 2006-07-18 00:44 · Score: 2, Insightful

I love these hard definitions of soft concepts. Just because you write down some rules, it doesn't mean we follow them. Any programmer understands roughly what 'high level' and 'low level' mean, but I'm sure we'll all argue over where the boundaries are - they're not well defined. I guess you stopped at 101?

Flawed Argument by logicnazi · 2006-07-18 00:51 · Score: 3, Interesting

The fact that C code is not as close to assembely code as it once was isn't the relevant issue. The question is whether C code is still closer to the assembely than high level languages are. This is undoubtedly true. If you don't believe this try adding constructs to ruby or lisp to let you do low level OS programming and see how difficult it would be.

I'm a big fan of high level languages and I believe eventually it will be the very distance from assembely that high level languages provide that will make them faster by allowing compilers/interpreters to do more optimization. However, it is just silly to pretend that C is not still far closer to the way a modern processor works than high level languages are.

If nothing else just look at how C uses pointers and arrays and compare this to the more flexible way references and arrays work in higher level languages.

--

If you liked this thought maybe you would find my blog nice too:

Imaginary history by dpbsmith · 2006-07-18 00:53 · Score: 5, Interesting

Whoa! This article seems to be making up history out of whole cloth. I'm not even sure where to begin. It's just totally out to lunch.

C was not a reaction to LISP. I can't even imagine why anyone would say this. LISP's if/then/else was an influence on ALGOL and later languages.

C might have been a reaction to Pascal, which in turn was a reaction to ALGOL.

LISP was not "the archetypal high-level language." The very names CAR and CDR mean "contents of address register" and "contents of decrement register," direct references to hardware registers on the IBM 704. When the names of fundamental languages constructs are those of specific registers in a specific processor, that is not a "high-level language" at all. Later efforts to build machines with machine architectures optimized for implementation of LISP further show that LISP was not considered "a high-level language."

C was not specifically patterned on the PDP-11. Rather, both of them were based on common practice and understanding of what was in the air at the time. C was a direct successor to, and reasonably similar to BCPL, on Honeywell 635 and 645, the IBM 360, the TX-2, the CDC 6400, the Univac 1108, the PDP-9, the KDF 9 and the Atlas 2.

C makes an interesting comparison with Pascal; you can see that C is, in many ways, a computer language rather than a mathematical language. For example, the inclusion of specific constructs for increment and decrement (as opposed to just writing A := A + 1) puts it closer, not to PDP-11 architecture, but to contemporary machine architecture in general.

--

"How to Do Nothing," kids activities, back in print!

Re:Imaginary history by masklinn · 2006-07-18 01:56 · Score: 3, Informative

LISP was not "the archetypal high-level language." The very names CAR and CDR mean "contents of address register" and "contents of decrement register," direct references to hardware registers on the IBM 704.

You forgot "CONS" which comes from the IBM cons cells (a 36bit machine word on the 704), which is the block holding both a CAR and a CDR.

The thing is, the names only existed because no one found any better name for them, or any more interresting name (Common Lisp now offers the "first" and "rest" aliases to CAR and CDR... yet quite a lot of people still prefer using CAR and CDR).

LISP has always been a high level language, because it was started from mathematics (untyped lambda calculus) and only then adapted to computers.

And the fact that Lisp Machines (trying to get away from the Von Neumann model) were built doesn't mean that Lisp is a low level language, only that IA labs needed power that the Lisp => Von Neumann machines mappings could not give them at that time.

Lisp is a high level languages, because Lisp abstracts the machine away (no memory management, not giving a fuck about registers or machine words [may I remind you that Lisp was one of the first languages with unbound integers and automatic promotion from machine to unbound integers?])

--
"The way we can tell it's C# instead of Haskell is because it's nine lines instead of two." -- wadler

Dude can't even write a clear sentence by Rinzai · 2006-07-18 00:56 · Score: 2, Insightful

From TFA: The closer to the metal you can get while programming, the faster your program will compile -- or so conventional wisdom would have you believe. In this article, I will show you how high-level languages like Java aren't slow by nature, and in fact low level languages may compile less efficiently.

I believe the phrase the faster your program will compile means "the faster the compiler will translate your program into machine-executable code." Apparently the author means "the compiler will generate faster code." He then makes the same mistake again, equivocating between the process of compilation and the quality of the compiled output.

If you can't manage to write a clear sentence defining what topic you're exploring...what else might you be getting wrong?

Gentoo Stage 1 FTW! by Anonymous Coward · 2006-07-18 01:06 · Score: 2, Funny

I thought this was the exact reason that Gentoo Linux exists.

A stage 1 install will do the following:
1) Compile glibc from source using architecture specific optimizations
2) Compile gcc using the previously compiled optimized glibc and optimize gcc for the architecture
3) Compile everything else using said architecture specific optimized tools
4) ???
5) Profit!

And before all the trolls come in and say how long it takes to compile things, the Gentoo Handbook has several tricks like compiling from RAM, etc... to speed up compile times. I normally don't waste my time with Stage 1 because there are plenty of Stage 3 tarballs I can grab for whatever architecture I may be using at the time.

Back to the point, if your glibc is compiling your code using the MMX registers for memcpy(), memset(), etc... it completely invalidates the point in the article about how those extra registers go unused. Additionally the point he makes about data structures, while valid, is a non-issue given that most serious programmers have taken a Data Structures and Algorithms course where you learn that O(n Log n) is less than O(n^2), and will choose to use trees and hash tables where appropriate.

[sarcasm]Nevermind, he's dead on, no one ever implements a spanning tree in C code[/sarcasm]

I see your point about the vectorization of library code, but point me to a high level language which does not suffer from that flaw given your assertion of closed source libraries. That is true across the board regardless of the language used.

Re:Student Perspective by embracethenerdwithin · 2006-07-18 01:07 · Score: 4, Informative

I thought it might be helpful for a current student to let you know what it is we learn today at my college. I'm a senior Software Engineering major, not a comp sci major. Comp Sci is another department and has a totaly different focus. They focus on super efficent algorithms, we focus on developing large software projects.

My software engineering program has been very Java intensive. My software engineering class, object oriented class, and software testing class were all java based. We dabbled in C# a bit as well.

However, I also had an assembly class, a programming languages class where we learned perl and scheme(this language sucks) and about five algorithms classes in C++. I also had an embedded systems class in both C and assembly(learned assembly MCU code, then did C).

I feel like this is all pretty well rounded; I've learned a bunch of languages and am not really specialized in one. I'd say I am best at Java right now, but I can also write C++ code just fine.

I've never been told a computer has any kind of crazy limitless performance. In embedded systems, I learned about performance. Making a little PIC microcontroller calculate arctan was fun(took literally 30 seconds without a smart solution). I also learned that there is a trade off between several things such as performance, development time, readability, and portability.

We are taught to see languages as tools, you look at your problem and pull a tool out of the tool box that you think fit the problem best. You have to weigh whats important for the project and chose based off of that.

The final thing I'd like to point out is that one huge issue with software today is it is bug ridden. How easy something is a test makes a big difference in my opinion. Assembly and C will pretty much always be harder to test than languages like Java and C#.

I don't think the universities are the problem, at least not in my experience.

Re:Along those lines... by LizardKing · 2006-07-18 01:24 · Score: 2, Informative

One interesting feature the compiler/IDE system I was using at the time (TopSpeed's) had was this concept that all their language compilers (M2, C, C++, etc) all compiled into an intermediate binary form, and their final compiler did very heavy optimizations on that "byte code".

That's no different to most compilers. GCC for instance parses the "frontend" language (C, C++, etc) into an intermediate language and performs most optimisations on that intermediate language before translating it to assembler instructions. Optimisation can be performed in the high level language, and even the assembler, but most is performed at the intermediate level as this way all frontends can potentially benefit.

"The Truth about C++ Revealed" by Rod,+Hot · 2006-07-18 01:24 · Score: 5, Funny

Dusted this off from the rec.arts.humor archive... It seemed appropriate.

From:

Subject: The truth about 'C++' revealed

Date: Tuesday, December 31, 2002 5:20 AM

On the 1st of January, 1998, Bjarne Stroustrup gave an interview to the IEEE's 'Computer' magazine.

Naturally, the editors thought he would be giving a retrospective view of seven years of object-oriented design, using the language he created.

By the end of the interview, the interviewer got more than he had bargained for and, subsequently, the editor decided to suppress its contents, 'for the good of the industry' but, as with many of these things, there was a leak.

Here is a complete transcript of what was was said, unedited, and unrehearsed, so it isn't as neat as planned interviews.

You will find it interesting...

__________________________________________________ ________________

Interviewer: Well, it's been a few years since you changed the world of software design, how does it feel, looking back?

Stroustrup: Actually, I was thinking about those days, just before you arrived. Do you remember? Everyone was writing 'C' and, the trouble was, they were pretty damn good at it. Universities got pretty good at teaching it, too. They were turning out competent - I stress the word 'competent' - graduates at a phenomenal rate. That's what caused the problem.

Interviewer: problem?

Stroustrup: Yes, problem. Remember when everyone wrote Cobol?

Interviewer: Of course, I did too

Stroustrup: Well, in the beginning, these guys were like demi-gods. Their salaries were high, and they were treated like royalty.

Interviewer: Those were the days, eh?

Stroustrup: Right. So what happened? IBM got sick of it, and invested millions in training programmers, till they were a dime a dozen.

Interviewer: That's why I got out. Salaries dropped within a year, to the point where being a journalist actually paid better.

Stroustrup: Exactly. Well, the same happened with 'C' programmers.

Interviewer: I see, but what's the point?

Stroustrup: Well, one day, when I was sitting in my office, I thought of this little scheme, which would redress the balance a little. I thought 'I wonder what would happen, if there were a language so complicated, so difficult to learn, that nobody would ever be able to swamp the market with programmers? Actually, I got some of the ideas from X10, you know, X windows. That was such a bitch of a graphics system, that it only just ran on those Sun 3/60 things. They had all the ingredients for what I wanted. A really ridiculously complex syntax, obscure functions, and pseudo-OO structure. Even now, nobody writes raw X-windows code. Motif is the only way to go if you want to retain your sanity.

[NJW Comment: That explains everything. Most of my thesis work was in raw X-windows. :)]

Interviewer: You're kidding...?

Stroustrup: Not a bit of it. In fact, there was another problem. Unix was written in 'C', which meant that any 'C' programmer could very easily become a systems programmer. Remember what a mainframe systems programmer used to earn?

Interviewer: You bet I do, that's what I used to do.

Stroustrup: OK, so this new language had to divorce itself from Unix, by hiding all the system calls that bound the two together so nicely. This would enable guys who only knew about DOS to earn a decent living too.

Interviewer: I don't believe you said that...

Stroustrup: Well, it's been long enough, now, and I believe most people have figured out for themselves that C++ is a waste of time but, I must say, it's taken them a lot longer than I thought it would.

Interviewer: So how exactly did you do it?

Stroustrup: It was only supposed to be a joke, I never thought people would take the book seriously.

Programs don't need optimization... by aadvancedGIR · 2006-07-18 01:24 · Score: 2, Insightful

as much as development process.

CPU power is available and cheap but time to market is critical. Most of the time, you don't need to do the fastest program ever, but to do a program that works reasonably well and that you can debug easily (some may say it is the same requirement).

C may not be the best tool for any given task but it is a pretty decent swiss army knife that most people know how to use reasonably well.

Disclaimer: I'm not in web devmnt but in embedded real time on DSP. With 8 dedicated ALU (2 mul, 2 add/sub, 2 logic and 2 load/store) running at the same time on the chip, there is still not many good alternatives to C (let the compiler optim and pray) and ASM (massive headhache).

Re:Along those lines... by pfdietz · 2006-07-18 01:36 · Score: 2, Informative

The more recent versions of GCC also perform transformations on a tree-based intermediate form, before converting that into the older RTL form. There are certain high level optimizations that just work better on abstract syntax trees.

Quoted often, but still wrong by Qbertino · 2006-07-18 01:36 · Score: 2, Insightful

Computer science is no more about computers than astronomy is about telescopes" -- Edsger Dijkstra quotes (Dutch computer Scientist. Turing Award in 1972. 1930-2002)

I see this quote everywhere, and just because it's by some semi-famous academic, nodody questions it and takes it for granted. The quote is utter rubish.

With astronomy you have stars, which aren't man made and thus only scarcely understood and the tools we use to look at them, teleskopes, which are man-made. We understand them.

Computers and Computer Science are both things that are entirely man-made. There is no natural phenomenon that we call 'computer' and a science that studies this natural phenomenon called "computer science". It's all one thing. The quote is rubbish and contains no usefull information whatsoever. On the contrary: the conclusion it draws in abolutely false.

--
We suffer more in our imagination than in reality. - Seneca

Re:Quoted often, but still wrong by StrawberryFrog · 2006-07-18 04:42 · Score: 2, Informative

The quote is utter rubish. ... With astronomy you have stars, which aren't man made ... Computers and Computer Science are both things that are entirely man-made. There is no natural phenomenon that we call 'computer' and a science that studies this natural phenomenon called "computer science".

Not. Even. Wrong.

If astronomy was called "telescope science" you'd also forget that it was about ways of looking at the skies. Computers are more flexible that that - they are used to model and study all kinds of natural phenomena. Algorithyms are strictly speaking mathematics, which is a feature of the universe and not "man made" if anything ever was. Computers are used to store and manipulate data about all kinds of things, most of which are not about computers. learning how to do all that is computer science.

--
My Karma: ran over your Dogma
StrawberryFrog

the author... by ynohoo · 2006-07-18 01:38 · Score: 2, Insightful

the author is only a couple of years out of college and he is already well on his way to be becoming a professional troll. I see a bright future for him...

--
need a free COBOL editor for Windows?

Whaaa! My language is better. by Anonymous Coward · 2006-07-18 01:40 · Score: 2, Insightful

People will always argue over 3 things:
1) whether assembly is faster than C
2) whether interpreted languages are faster than C/C++

The real question here is - which type of language does well for your application?

Ultimately C will be faster if a good programmer, who understands the language and the application. However, will he be more productive? I'd never write a third person shooter in python, perl, or java. However, what I might do is add a 3d engine to a python statistics modeling program that's already written in one of those languages. Most people would agree, writing a web interface in C is just insane if you have anything particularly useful you want to write. However, I'll probably write a multi-process webserver in C, just because it makes sense for speed (I know python has a built-in webserver, but there are features it doesn't have. You may be able to write it in python, but will all those features that apache have be fast?).

The bottom line is:
- Define the application you want to build.
- Define your requirements (responsiveness, rhobustness [security,reliability,etc], extendability, deadlines).
- Do a little research with a few languages (just experimentation). Write prototype interfaces in the language, do a little benchmarking, just play with it.
- Make a decision on a language based on what you've found and what's required.

As more high level languages appear (functional languages look very promising), see what those languages have over what's already out there. If it has an applicability to what you're doing, use it.

I'm tired of seeing everyone beat a dead horse. Yes, I know the two arguments:
- X is faster
- Y is just as fast as X, but can do it in less lines of code.

X & Y are different, there's no ignoring it. There's more dimensions to languages than speed and time to market, don't ignore them.

C and Smalltalk is what happened. by LWATCDR · 2006-07-18 01:43 · Score: 4, Informative

C became popular because of Unix. Since you could get the source code for Unix most big universities used Unix in there OS courses. And since it was written in c you where going to learn C if took Computer Science. Textbooks started to assume you knew c. Magazines started to assume you knew c. People wrote free small c compilers and then came GCC, so now you could have a good free c compiler for just about any system. But before GCC all the buzz was about Smalltalk. Smalltalk was the future. OOP was going to replace structured programing. The problem was very few people has a computer that could run Smalltalk. So C++ was born.
A final blow to Modula-2 was simply Borland didn't create a Modula-2 compiler. For many years when you said Pascal you reall meant Turbo or Borland Pascal. Borland was the Pascal company and they add objects to pascal and eventual created Delphi.
I am sure Topspeed has closed up shop. There just isn't much room for compiler makers anymore. You have the free software at the bottom end and the Microsoft Monster at the top. Only a few niche players are left. Ada seems to be a place where a good compiler company can still make a few dollars.

--
See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.

Re:C and Smalltalk is what happened. by glindsey · 2006-07-18 02:20 · Score: 3, Interesting

Well, that's not entirely true: compilers for PCs may not be a big business anymore, but compilers for embedded systems are still a huge business, despite the availability of GCC for many platforms. You need only look at IAR to confirm that...
Re:C and Smalltalk is what happened. by Wudbaer · 2006-07-18 03:21 · Score: 2, Informative

Borland didn't create a Modula-2 compiler

Small nitpick: They did indeed create a Modula-2 compiler - I think even called Turbo Modula-2 - at the end of the 80s for CP/M. I purchased it back then for my C-128 (those where the days *looks at current laptop* - not). However, CP/M then already had begun its way into obsolecence, and Borland's German division needed almost 6 months to deliver the damn thing. When I finally got it it was more or less unusable, as the IDE froze or something like that when you tried to compile something. In that respect it's better to think they never had released this abomination.
Re:C and Smalltalk is what happened. by belmolis · 2006-07-18 04:17 · Score: 2, Interesting

This seems generally to be true, but some small outfits are apparently still making money selling compilers. In the early 90s I used the Power C compiler for DOS. It was a nice compiler and cheap ($20). Recently I was amazed to see that the company, Mix Software, is still in business, with the same low prices. How they do this I have no idea.
Re:C and Smalltalk is what happened. by Anonymous Coward · 2006-07-18 06:33 · Score: 2, Informative

Further: Topspeed was a suite of compilers by JPI (the split away group of Borland -> JP Jensen Partners), They ended up writing the compiler for Clarion (well reused their existing compiler technology as all their compilers: c/c++/pascal/modula2 shared the same obj format and as such they could mix and match languages within a single exe/dll - sound familier!!!), after providing this compiler, they merged with Clarion to form TopSpeed the company, whose main product was Clarion...

The myth of assembly performance by p3d0 · 2006-07-18 01:45 · Score: 4, Insightful

Well, generally you'll have faster code if you code it in assembly.

No, generally you'll have slower code. In a few specific, well-chosen places, you may get faster code. If you had unlimited time, patience, and performance tuning expertise, then you could beat the compiler on a large application, but how realistic is that?

Coding large apps in assembly is usually way beyond the point of diminishing returns in terms of performance.

--
Patrick Doyle
I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....

Don't be so sure by overshoot · 2006-07-18 02:01 · Score: 3, Insightful

Well, generally you'll have faster code if you code it in assembly.

I wouldn't even grant that in the general case.

Amazingly far back (try the 80s) a professor friend of mine had a marvelous example of compiler-generated code where the compiler had done such an amazing job of optimising register use that you had to trace through more than 20 pages of assembler output with colored markers to trace from where the register was loaded to where it was used.

No way I would ever have the huevos to code that way in assembler. On a RISC machine or (Heaven help us) the Itanic it gets lots worse.

--
Lacking <sarcasm> tags, /. substitutes moderation as "Troll."

Um no. by wonkavader · 2006-07-18 02:17 · Score: 2, Informative

No, what they SAY is "The proof is in the pudding" --

From google:

Results 1 - 10 of about 326,000 for "the proof is in the pudding". (0.47 seconds)
Results 1 - 10 of about 118,000 for "the proof of the pudding is in the eating" [definition]. (0.30 seconds)

They're not right, of course, but then, sadly, you're not either, since what people say has changed. It's changed to something nonsensical, which people quote without understanding, which is annoying, like "I could care less!":

Results 1 - 10 of about 2,180,000 for "I could care less". (0.28 seconds)
Results 1 - 10 of about 776,000 for "I couldn't care less". (0.22 seconds)

But "the proof is in the pudding" kind of rolls off the tongue better... like a pudding which tastes nasty and you are therefore gently, but suavely, spitting out.

LANGUAGES are not interpreted by alispguru · 2006-07-18 02:18 · Score: 2, Informative

Most truly high-level languages, like LISP (which was mentioned directly in TFA), are interpreted, ...

Programming languages are not "interpreted". A language IMPLEMENTATION may be based on an interpreter. Every major implementation of Common Lisp today has a complier, and most of them don't even have an interpreter any more - everything, including command-line/evaluator input, is compiled on-the-fly before being executed.

... and the interpreters are almost always written in C. It is impossible for an interpreted language written in C (or even a compiled one that is converted to C) to go faster than C.

Again, this is a property of implementations, not of languages. The highest-performance Common Lisp implementations have scaffolding written in C and assembly, but they do not use a C compiler when they compile Lisp code. They often use non-C ABI conventions for argument passing and stack handling, to make their style of function calling faster.

I don't mean to be harsh, but the "Lisp is slow because it's interpreted" meme is about twenty years out of date. It tends to be spread primarliy by college professors whose last exposure to Lisp was pre-1980, and it really grates on those of us who know better.

--

To a Lisp hacker, XML is S-expressions in drag.

Initially by Vexorian · 2006-07-18 02:20 · Score: 3, Insightful

The article later points out that the native version was running slower due to not using optimization options correctly. And later the native version was running 15% faster than the managed version

--

Copyright infringement is "piracy" in the same way DRM is "consumer rape"

Lisp and operating systems by alispguru · 2006-07-18 02:29 · Score: 2, Insightful

Existing high-level languages, such as LISP, provided too much abstraction for implementing an operating system

Huh? I would argue that commercially successful (as in boxes sold to Fortune 500 companies and used in production) operating systems have been written in three languages:

* Assembly

* C

* Lisp

Are there any commercially successful OSs written in C++ yet?

(revealing my ignorance and posting flamebait, all in one)

--

To a Lisp hacker, XML is S-expressions in drag.

An expert assembly programmer in a CPU... by Terje+Mathisen · 2006-07-18 02:34 · Score: 4, Insightful

I've probably written more assembly than most slashdot readers, and most of what you say is true:

It used to be the case that I could always increase the speed of some random C/Fortran/Pascal code by rewriting it in asm, parts of that speedup came from realizing better ways to map the current problem to the actual cpu hardware available.

However, I also discovered that much of the time it was possible to take the experience gained from the asm code, and use that to rewrite the original C code in such a way as to help the compiler generate near-optimal code. I.e. if I can get within 10-25% of 'speed_of_light' using portable C, I'll do so nearly every time.

There are some important situations where asm still wins, and that is when you have cpu hardware/opcodes available that the compiler cannot easily take advantage of. I.e. back in the days of the PentiumMMX 300 MHz cpu it became possible to do full MPEG2/DVD decoding in sw, but only by writing an awful lot of hand-optimized MMX code. Zoran SoftDVD was the first on the market, I was asked to help with some optimizations, but Mike Schmid (spelling?) had really done 99+% of the job.

Another important application for fast code is in crypto: If you want to transparently encrypt anything stored on your hard drive and/or going over a network wire, then you want the encryption/decryption process to be fast enough that you really doesn't notice any slowdown. This was one of the reasons for specifying a 200 MHz PentiumPro as the target machine for the Advanced Encryption Standard: If you could handle 100 Mbit Ethernet full duplex (i.e. 10 MB/s in both directions) on a 1996 model cpu, then you could easily do the same on any modern system.

When we (I and 3 other guys) rewrote one of the AES contenders (DFC, not the winner!) in pure asm, we managed to speed it up by a factor of 3, which moved it from being one of the 3-4 slowest to one of the fastest algorithms among the 15 alternatives.

Today, with fp SIMD instructions and a reasonably orthogonal/complete instruction set (i.e. SSE3 on x86), it is relatively easy to write code in such a way that an autovectorizer can do a good job, but for more complicated code things quickly become much harder.

Terje

--
"almost all programming can be viewed as an exercise in caching"

More Myth here by wonkavader · 2006-07-18 02:36 · Score: 2, Informative

It's possible to say everything siad in this article -- vaugely, as it is said in this article -- and be right, and yet still dance around the reality.

Take a look yourself on http://shootout.alioth.debian.org/

C's faster than Java. It will probably always generally be so, unless you're trying to run C code on a hardware Java box.

This article says Java, for example, CAN be faster. But it doesn't say "C is almost always faster than Java or Fortran, usually faster than ADA, and C can be mangled (in the form of D Digital Mars, for instance) to be faster than C usually is. Often, Java is a pig, compared to C, BUT THERE ARE TIMES WHEN IT ISN'T. Really. There are times, few and far between, when it's actually, get this, FASTER. It's fun to look for those few times. And if you write programs which do that, that'd be cool. And as processors get wackier and wackier, there will be more and more times where this is true. Meanwhile, if your developers write good code, Java's easier to develop in and debug." Which would be more completely correct.

Excuse, me, now. I have to go back to my perl programming.

SW Industry - Down The Drain by smcdow · 2006-07-18 02:38 · Score: 2, Informative

With the trend towards VM's and virtualization, that "hypothetical" computer comes ever closer.

Yay. With continued displays of attitudes like that, I'm going to leave the industry.

It is getting increasingly difficult to hire S/W engineers that understand that there is an operating system and also hardware beneath the software they write. I need people NOW that can grok device drivers, understand and use Unix facilities, fiddle with DBs, write decent code in C, C++, Java, and shell, and can also whip together a decent WS interface. Someone who does all of those.

WhyTF has the S/W industry become so compartmentalized? I can hire a device driver person, but he won't know anything about web services. I can hire a DB person, but she won't know a damn thing about poking values into registers. I can hire a web-services person, but he will have never worked on a Unix platform before. WTF? Really, WTF?

In short, I can't hire someone who can take ownership of an entire system. It's always, "Well, that's a hardware thing, go ask Foo", "Oh, it looks like the database, need to talk to Bar", "The Web interface is borked, we'll need to bring Baz in", "Hm, it doesn't do this when we run it on Windows" (this one always pisses me off, because they can never explain why, and that's because they know nothing about Unix). How come I can't hire someone who could understand a whole vertical stack (and maintain it, and provide analysis and fixes when something breaks)?

I do this kind of thing now. If I can do it, it can't be that hard. But everybody thinks they have to specialize. THIS IS WHAT'S WRONG WITH THE INDUSTRY.

--
In the course of every project, it will become necessary to shoot the scientists and begin production.

The advantage of Fortran is purely coincidental. by master_p · 2006-07-18 02:46 · Score: 2, Insightful

When Fortran was made, nobody thought that CPUs of 30 years in the future will have vector processing instructions. In fact, as Wikipedia says, vector semantics in Fortran arrived only in Fortran 90.

The only advantage of current Fortran over C is that the vector processing unit of modern CPUs is better utilised, thanks to Fortran semantics. But, in order to be fair and square, the same semantics could be applied to C, and then C would be just as fast as Fortran.

The fact that C does not have vector semantics reflects the domain C is used: most apps written in C do not need vector processing. In case such processing is needed, Fortran can easily interoperate with C: just write your time-critical vector processing modules in Fortran.

As for higher-level-than-C languages being faster than C, it is purely a myth. Code that operates on hardware primitives (e.g. ints or doubles) has exactly the same speed in C, Java and other languages...but higher level languages have semantics that affect performance as much as they can help performance. All the checks VMs do have an additional overhead that C does not have; the little VM routines run here and there all add up to slower performance, as well as the fact that some languages are overengineered or open the way for sloppy programming (like, for example, not using static members but creating new ones each time there is a call).

Depends on the job... by porkchop_d_clown · 2006-07-18 03:35 · Score: 2, Interesting

C is best at what it was designed for - controlling the computer. It used to be that people chose the language to match the app they were writing: For math, use Fortran or APL. For reports use Cobol or RPG. C for flipping bits. Pascal for teaching.

We're where we are today because, for many years, C was the one you could get for free. The others cost hundreds of dollars.

I remember the first time I encountered a computer that shipped from the vendor with GCC instead of a proprietary compiler - it was like seeing a death sentence for Abacus, Lightspeed, and all those other little compiler companies.

--
Clear, Dark Skies

Forth by Drasil · 2006-07-18 03:45 · Score: 2, Interesting

It can be made to be fast, and it can be made to be as high level as you want. I ofter wonder what the world would have been like if more programmers had gone the Forth way instead of the C/*nix way.

New debate by Dzonatas · 2006-07-18 03:54 · Score: 4, Interesting

High level languages have always been compared to cognitive semantics and grammatical styles. That is the higher the level of the language the easier it is for us humans to read and write it. Conversely, the lower the level the language is the more discreet steps are needed to describe an instruction or data.

Speed of program languages or machine languages are not measured by how high or low level they are to us. They are also measured by time to develop and implement the program. The article basically makes a point of it, that it's "better to let someone else" to optimize the low-level code while you write with the high-level language. You could write a super fast machine coded program, but it'll take you much longer to write it than with a simpler higher level language.

The new debate is over datatypes and the available methods to manipulate them. Older hardware gave us the old debate with primitive datatypes and a general set of instructions to manipulate the data. Newer hardware can give us more than just primitives. For example, a unicoded string datatype seen by the hardware as a complete object instead of an array of bytes. With hardware instructions to manipulate unicoded strings, that would pratically take away any low-level implementation of unicoded strings. The same could be done for UTF-8 strings. We could implement hardware support for XML documents and other common protocols. How these datatypes are actually implemented in hardware is the center of the debate.

Eventually, there will be so many datatypes that there will be seperate low-level languages specifically designed for a domain a datatypes. The article makes the point there exists an increase in complexity for newer compliers to understand what was intended by a set of low-level instructions. Today's CPUs have a static limit of low-level instructions. The future beholds hardware implemented datatypes and their dynamic availability of low-level instructions. Newer processors will need to be able to handle the dynamic set of machine language instructions.

Does the new debate conflict with Turing's goal to simply make a processor unit extensible without the need to add extra hardware? For now, we have virtualization.

Re:Along those lines... by orthogonal · 2006-07-18 03:59 · Score: 4, Interesting

Here's an actual data point:

I sped up some C code by unrolling a loop with Duff's Device. Duff's Device, for those who haven't encountered it, makes an ingenious use of the often-maligned C behavior that case statements, in the absence of a break or return statement, fall-through.

Duff's Device takes advantage of the fall-through by jumping into the middle of an unrolled loop of repeated instructions. If eight instructions are unrolled, Duff's Device iterates the loop

count divided by eight (count / 8 )

times, but enters the loop by jumping to the

count mod eight (count % 8)

'the unrolled instruction from the end of the loop. (This sounds complicated, but isn't; just look at the code and it becomes clear.)

The whole point of Duff's Device is speed and locality of code. Speed: because the loop is unrolled, more instructions are executed for each jump back to the top (and jumps are, relatively, expensive, because they mean any preloaded instructions must be tossed out ans re-read. Locality: (hopefully) all the instructions can be cached, so the processor doesn't have to re-read them from memory.

But what gcc does with Duff's Device on ARM targets is just bizarre. gcc uses a jump table (good) to directly change the Program Counter (good, so far). But instead of jumping into the loop (which would be good), gcc uses the jump table to jump to ...

a redundant assignment and ...

an unconditional jump.

Yes, gcc very smartly makes a jump table (which directly changes the Program Counter, just like a jump would) to jump to a jump. This is simply a waste of code and time:

I'd show you the entire assembly code gcc produces, but slashdot won't let me: "Your comment violated the "postercomment" compression filter. Try less whitespace and/or less repetition. Comment aborted." cmp r2, #7 ldrls pc, [pc, r2, asl #2] <-- directly modify the Program Counter making it pc + ( r2 << 2 ) b .L70 .p2align 2 .L79: <-- jump table .word .L71 .word .L72 .word .L73 .word .L74 .word .L75 .word .L76 .word .L77 .word .L78 .L72: <-- first jump table destination mov r1, lr <-- redundant assignment made at every destination b .L80 <-- actual jump into unrolled loop [ 7 repeats of the above, with differnt branch targets elided] .L87: <-- for each iteration of the loop, we're moving exactly 8 halfwords = 4 words ldrh r3, [r0], #2 <-- what would be fastest is to load multiple four words, <-- then shift high words down strh r3, [ip, #0] @ movhi [6 repeats of the above elided] .L80: ldrh r3, [r0], #2 strh r3, [ip, #0] @ movhi sub r1, r1, #1 <-- a subs instruction here would obviate the need for the cmp r1, #0 <-- cmp instruction that follows it, saving a cycle per iteration bgt .L87

Why a jump table just to set up an unconditional jump? Why the redundant mov, which could have been done once, prior to the jump table jump? Who knows, that's what gcc does.

In this particular case, the object is to copy halfwords to a memory address, which address is really mapped to an output device. ARM processors, of course, are optimized for word addresses, so the "best" way to do this would be to load multiple words (LDM), shift the upper

--
Opinions on the Twiddler2 hand-held keyboard?

Re:Article is theory not practice - no measurement by PitaBred · 2006-07-18 05:16 · Score: 3, Funny

How can you have any pudding if you don't eat your meat?

--
My blog. Good stuff (when I remember to update it). Read it.

Some of the real optimization issues by Animats · 2006-07-18 05:21 · Score: 4, Interesting

The article is a bit simplistic.

With medium-level languages like C, some of the language constructs are lower-level than the machine hardware. Thus, a decent compiler has to figure out what the user's code is doing and generate the appropriate instructions. The classic example is

char tab1[100], tab2[100]; int i = 100; char* p1 = &tab1; char* p2 = &tab2; while (i--) *p2++ = *p1++;

Two decades ago, C programmers who knew that idiom thought they were cool. In the PDP-11 era, with the non-optimizing compilers that came with UNIX, that was actually useful. The "*p2++ = *p1++;" explicitly told the compiler to generate auto-increment instructions, and considerably shortened the loop over a similar loop written with subscripts. By the late 1980s and 1990s, it didn't matter. Both GCC and the Microsoft compilers were smart enough to hoist subscript arithmetic out of loops, and writing that loop with subscripts generated the same code as with pointers. Today, if you write that loop, most compilers for x86 machines will generate a single MOV instruction for the copy. The compiler has to actually figure out what the programmer intended and rewrite the code. This is non-trivial. In some ways, C makes it more difficult, because it's harder for the compiler to figure out the intent of a C program than a FORTRAN or Pascal program. In C, there are more ways that code can do something wierd, and the compiler must make sure that the wierd cases aren't happening before optimizing.

The next big obstacle to optimization is the "dumb linker" assumption. UNIX has a tradition of dumb linkers, dating back to the PDP-11 linker, which was written in assembler with very few comments. The linker sees the entire program, but, with most object formats, can't do much to it other than throw out unreachable code. This, combined with the usual approach to separate compilation, inhibits many useful optimizations. When code calls a function in another compilation unit, the caller has to assume near-unlimited side effects from the call. This blocks many optimizations. In numerical work, it's a serious problem when the compiler can't tell, say, that "cos(x)" has no side effects. In C, it doesn't; in FORTRAN, it does, which is why some heavy numerical work is still done in FORTRAN. The compiler usually doesn't know that "cos" is a pure function; that is, x == y implies cos(x) = cos(y). This is enough of a performance issue that GCC has some cheats to get around it; look up "mathinline.h". But that doesn't help when you call some one-line function in another compilation unit from inside an inner loop.

C++ has "inline" to help with this problem. The real win with "inline" is not eliminating the call overhead; it's the ability for the optimizers to see what's going on. But really, what should be happening is that the compiler should check each compilation unit and output not machine code, but something like a parse tree. The heavy optimization should be done at link time, when more of the program is visible. There have been some experimental systems that did this, but it remains rare. "Just in time" systems like Java have been more popular. (Java's just-in-time approach is amusing. It was put in because the goal was to support applets in browsers. (Remember applets?) Now that Java is mostly a server-side language, the JIT feature isn't really all that valuable, and all of Java's "packaging" machinery takes up more time than a hard compile would.)

The next step up is to feed performance data from execution back into the compilation process. Some of Intel's embedded system compilers do this. It's most useful for machines where out of line control flow has high costs, and the CPU doesn't have good branch prediction hardware. For modern x86 machines, it's not a big win. For the Itanium, it's essential. (The Itanium needs a near-omniscient compiler to perform well, because you have to decide at compile time which instructions should be executed

Ugly? Idiomatic! by Paolone · 2006-07-18 09:08 · Score: 2, Insightful

Perl is not ugly, just really really idiomatic. As with all idiomatic languages, you can't grok what something means if you're not exposed to it.
It's just a matter of "if you can't stand the line noise, get out from the code-kitchen!". :)
Even if I can understand easily Perl code, what I can't really stand is C pointer arithmetic if it steps too far...

Less space for alternative vendors by EmbeddedJanitor · 2006-07-18 10:08 · Score: 2, Insightful

I do stuff in embedded space using IAR or GreenHills and gcc. For the most part, the proprietary vendors are losing ground to gcc. The proprietary advantage is shrinking, especially with more modern micros and as gcc improves. For the most part, code that comes out of gcc is no worse than code coming out of IAR or GreenHills. Where the priopritary guys have a real advantage is in better Clib implementations. The Clib, and newlib, that are normally used with gcc are huge and bloaty in comparison.

--
Engineering is the art of compromise.

Re:wasted ink by SanityInAnarchy · 2006-07-18 11:47 · Score: 2, Insightful

Unfortunately, just because a new generation is growing up doesn't mean we'll want to rewrite absolutely everything. It'd be much better if things were developed rationally as soon as possible -- that reduces the total amount of legacy c/c++ code which will ultimately have to be rewritten later.

Besides, it's not a new concept, and if this generation of programmers didn't get it, neither will the new generation, because among the very first generation of programmers were people who understood Lisp machines. Of course, if a new generation really does start using mostly Ruby when the current one can't handle Lisp, we'll know it was those darned parentheses. Just as any sufficiently advanced technology is indistinguishable from magic, any sufficiently advanced language is indistinguishable from Lisp.

It will be funny to see this turned on its head, if there are ever enough, say, Python or Ruby programmers to improve python/ruby compilers/runtimes to where, a couple generations of processors later, it's C that has a lack of optimizations and is actually farther from the hardware. We may actually see a C virtual machine as a necessity!

More practically, I try to work with languages that suit the task at hand, which is really never C unless I'm dealing with a huge existing C codebase.

--
Don't thank God, thank a doctor!

Re:wasted ink by SanityInAnarchy · 2006-07-18 18:32 · Score: 2, Insightful

Oh, hell no.

Java feels way slower than anything else. My college courses were mostly in Eclipse. It runs fast enough, but it takes forever to start, which is true of many, many Java apps.

Which means that when these same programmers end up learning C/C++, they'll think Java is slow because it's "interpreted". I guess there's at least the hope that they'll wind up using C#, and thinking Java is slow because it sucks. Which is good enough, because Java sucks for other reasons, even though it isn't really slow.

But really, with Generics, Java has basically picked up most of the features and syntax of C++, added garbage collection and much more anal retentive restrictions, and called it a whole new language. The bytecode and virtual machine is really not relevant to the awfulness of the language itself -- you can write a perfectly good language for the JVM -- but the JVM, specificalyl, has its own drawbacks, in that it's hard to write more libraries for Java, and many of the existing libraries suck in profound ways compared to C/C++ alternatives, or even .NET.

Frankly, the only good thing about them learning Java is that at least for awhile, their code may be portable, because it's so hard to make OS-specific or arch-specific Java.

--
Don't thank God, thank a doctor!

Re:While we're at it by moro_666 · 2006-07-18 20:21 · Score: 2, Insightful

the problem of yum is in the design of the application, not the language.

python is fast enough for almost any package management quest, but yum is the worst piece of ... that i have seen on that frontier. proper indexes and logical stops would make it much faster. there's your chance to write it. choose whatever language you want. design has to be good.

--

I'd tell you the chances of this story being a dupe, but you wouldn't like it.

Slashdot Mirror

High-level Languages and Speed

124 of 777 comments (clear)