A C++ Library That Brings Legacy Fortran Codes To Supercomputers

Code... by gigaherz · 2013-09-21 04:39 · Score: 3, Informative

...like rice, is not countable. At least not since I learned the word.

Re:Code... by Nerdfest · 2013-09-21 04:40 · Score: 2

It really does lower one's opinion towards the author. If I read TFAs, I wouldn't read this one.
Re:Code... by Anonymous Coward · 2013-09-21 04:46 · Score: 1

in hpc the convention is that while code is indeed a formless pile
a code is also a discrete and countable thing - an application or package, as in we've managed
to port 3 big codes to the new machine
none of which explains why, of all the various communication and decomposition libraries that
sit under such codes, this one got posted
Re:Code... by john.burton1765 · 2013-09-21 05:05 · Score: 3, Insightful

I couldn't agree more. Although the word "codes" is usually a red flag not to bother reading any more in any article or question
Re:Code... by Anonymous Coward · 2013-09-21 05:14 · Score: 0

It's a deep tradition in the numeric community to speak of "codes". I think it sounds stupid too, but everyone does it.
Re:Code... by mjwalshe · 2013-09-21 05:19 · Score: 1

no its not its ungrammatical - so ungrammatical in fact that I as a dyslexic notice it.
Re:Code... by jeremyp · 2013-09-21 05:28 · Score: 2

Not to mention the fact that the author has erased history (well, the summary implies the author has erased history - I haven't read TFA) because the Cray 1 had a vector processing unit and a specially designed compiler to make use of it, and the compiler was for Fortran. This was in 1978 when C++ didn't even exist.

--
All I want is a secure system where it's easy to do anything I want. Is that too much to ask ~~ Randall Munroe
Re:Code... by Anonymous Coward · 2013-09-21 05:47 · Score: 0

In scientific computing, the overwhelmingly accepted term is "a code" to refer to what other fields of computing refer to as an "application" or "program". Language evolves, and each discipline comes up with its own common terms and jargon. Deal with it.
Re:Code... by rubycodez · 2013-09-21 05:49 · Score: 1

never worked in the field of high performance numerical methods?
Re:Code... by gigaherz · 2013-09-21 05:54 · Score: 1

Obviously not. Even after reading the other replies, and realizing that I was partially wrong, I still can't help but feel that it sounds wrong.
The concept of code, to me, is a collection of statements, expressions, functions, packages, etc. Like a bowl of rice, it makes no sense to count the grains for themselves. I can accept that other people understand it differently, but it's not easy not to feel that they are doing it wrong...
Re:Code... by beelsebob · 2013-09-21 06:16 · Score: 0

No, simply wrong. Code is a plural noun. Just like sheep is a plural noun, and rice is a plural noun. The word codes refers to the plural of a completely different meaning of code –i.e. cyphers.
Re:Code... by Anonymous Coward · 2013-09-21 06:17 · Score: 0

In scientific computing, a "code" is a software package. Abaqus and Ansys are finite element "codes".
Re:Code... by Livius · 2013-09-21 06:27 · Score: 2

Code is a mass noun, and it's number is indeterminate, neither singular nor plural.
Re:Code... by dfghjk · 2013-09-21 06:29 · Score: 0

Why concern yourself with the opinions of those still stuck using Fortran because they're afraid of work? They are doing it wrong. Language is useless without agreed upon usage.
To use the rice analogy, you would say "bowls of rice", not "rices". Now, if enough stupid people or even an entire community intentionally made that error, it may well become adopted as correct usage. It would still be ignorant.
Understand that scientists and researchers are not programmers and don't know any better.
Re:Code... by Cito · 2013-09-21 06:33 · Score: 1

it is correct though
similar to how moneys and monies are both correct plurals of money, even though in America people use money to refer to both singular and plural but they should recheck the dictionary.
codes is interchangeable as well http://www.thefreedictionary.com/code
Re:Code... by Anonymous Coward · 2013-09-21 06:50 · Score: 0

No, because a program is not "a code", it is simply "code". Multiple programs are not "multiple codes", they are simply "code". Monies (never "moneys", unless you're attempting to exemplify the illiteracy and progression of stupidity mentioned previously) is valid because money and currency are directly interchangeable terms. When speaking of a items of either, they may be prefixed with counting articles ("a", "some", "many", "one", "two"). Code as a term describing computer programs is not prefixed with a counting article, and is therefore not pluralized.
Re:Code... by boristhespider · 2013-09-21 07:03 · Score: 3, Insightful

Oh now, that's a bit harsh. Programming in Fortran isn't something done because people are afraid of work. I genuinely get tired of the incessant Fortran-bashing by people who -- in my experience, at least -- have almost never, if ever, actually used the language or seen why other people do. In most cases they seem to be repeating jokes their lecturers made about the language, jokes that were first written back when FORTRAN 77, with those stupid capitals and all, was the dominant form.
Now, I'm very much not a fan of F77. In fact, I hate the language. It's clunky and decrepit and not suited for modern programming practices. But it's easy to call from later Fortran standards, and each one has vastly improved the situation. Fortran 2008 is a genuinely nice language. True, it's not OO - though you can force it to act almost as if it is - but not everything has to be forced into OO. What it is is extremely good for numerical work, and dealing with arrays in particular in Fortran is a dream after, say, C, or even C++11. The fact it calls F77 routines without effort or pain also helps, since there genuinely is a vast body of code still in F77. (The oldest I came across was F66, ported directly from Fortran IV. Now that really did need to be rebuilt in something approaching a sane language.)
I'm not saying F2008 is "better" than either C or C++11 -- that's a meaningless statement. But there are things that make it a very nice language to use, and other things -- character strings, I'm looking at you -- that make it distinctly unpleasant. Same as any other language, really.
Re:Code... by mjwalshe · 2013-09-21 07:21 · Score: 1

Not where i come from it isn't and my first job was in the math modeling section of a world leading rnd organization
Re:Code... by Thomasje · 2013-09-21 08:06 · Score: 2

I studied math in college, and many numerical algorithms textbooks refer to software as "codes". It seems to be common practice in the computational mathematics world. I assume it goes back to the days before Fortran, before high-level languages in general, when source code literally consisted of a series of codes.
Re:Code... by K.+S.+Kyosuke · 2013-09-21 08:12 · Score: 1

...like rice, is not countable. At least not since I learned the word.
It's a shiboleth of physicists and lousy journalists.

--
Ezekiel 23:20
Re:Code... by K.+S.+Kyosuke · 2013-09-21 08:21 · Score: 1

True, it's not OO - though you can force it to act almost as if it is - but not everything has to be forced into OO.
That's an almost meaningless statement, unless you define what is "OO" (ten programmers will give you twelve different definitions for that).
Also, the modern replacement for Fortran ought to be Fortress. (Or, more properly, a language using similar algebraic techniques to improve the HPC programming experience, seeing as the original Fortress effort has wrapped up.)

--
Ezekiel 23:20
Re:Code... by boristhespider · 2013-09-21 08:24 · Score: 1

"That's an almost meaningless statement, unless you define what is "OO" (ten programmers will give you twelve different definitions for that)."
Very true. In this exact context I'm meaning an object containing functions that are contained within its namespace. There's almost certainly actually a way to do that in modern Fortran but I'm not aware of it, and instead I use modules as a rough analogue of a class and select the functions from them that I want to use in a particular method. (I can also get around quite a bit of the issue by overloading the function names in the interface, if it seems that important. Normally it doesn't, to be honest.)
Re:Code... by jythie · 2013-09-21 09:28 · Score: 1

That is one of the things that happens when multiple disciplines overlap, sometimes their jargon does not always match up.
Re:Code... by manicb · 2013-09-21 10:51 · Score: 1

"Secret codes" or "cipher codes" are countable. So are "weather simulation codes".
Re:Code... by aldousd666 · 2013-09-21 11:08 · Score: 1

It's funny, that was my first response, before even starting the summary. The writer seems to know nothing of the difference between codes and code. Baby, bathwater, out.

--
Speak for yourself.
Re:Code... by cas2000 · 2013-09-21 12:47 · Score: 3, Informative

actually, "codes" is common usage amongst researchers and has been since at least the 1970s.
most of them are not programmers or geeks or computer scientists, they're researchers or academics or post-grad students who happen to do a little programming or simply use someone else's "codes".
it used to make me cringe every time i heard it when working with academics and researchers and on HPC clusters, but then i got used to it and stopped caring.
and, really, they're not interested in a lecture or why it's a dumb usage of the word. they've got stuff they want to get done ("codes to run") and don't give a damn.
Re:Code... by Anonymous Coward · 2013-09-21 14:02 · Score: 0

Software professionsals say "code", and they assume you're an idiot if you say "codes".
Likewise, academics say "codes," and they assume you're an idiot if you say "code."
p.s. Also note the subtle difference in placement of the punctuation relative to the close-quotes. ;-)
Re:Code... by Half-pint+HAL · 2013-09-22 00:41 · Score: 1

The tradition was for a "code" to be a cipher or other defined means of information formalisation. The use of "a code" as synonymous with "a program" is relatively recent.

--
Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
Re:Code... by Half-pint+HAL · 2013-09-22 00:49 · Score: 1

To use the rice analogy, you would say "bowls of rice", not "rices".
What, you mean like that ignorant error we all make of counting fish, cows etc?

--
Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
Re:Code... by Anonymous Coward · 2013-09-22 02:11 · Score: 0

Code is either short for "instruction code" or "machine code". Sometimes one uses it as a shorthand for "source code" but in this case "code" is still short for the former alternatives.
One instruction code, two instruction codes.
Code is very much countable.
Re:Code... by Sir_Sri · 2013-09-22 09:40 · Score: 1

This is how a lot of people in scientific computing use the phrase.. unfortunately.
My GF is an astrophysicist, and as a computer scientists I cringe every time I hear her colleagues discussing their fortran codes.
Re:Code... by rubycodez · 2013-09-23 03:20 · Score: 1

but one of the definitions of the word code, from centuries past, is "A systematic collection of regulations and rules of procedure or conduct".
Re:Code... by gigaherz · 2013-09-23 06:54 · Score: 1

Yes. Security codes (numbers), building codes (rules), and possibly other meanings, are countable. But on the software context, there seems to be a disagreement.
Re:Code... by Anonymous Coward · 2013-09-23 07:08 · Score: 0

Fortran has had type-bound procedures since the 2003 standard.
Re:Code... by boristhespider · 2013-09-23 07:24 · Score: 1

You learn something every day! I'm going to look into those, thank you.
Re:Code... by rubycodez · 2013-09-23 15:31 · Score: 1

The word "softwares" also is used in exactly the same way as "codes" is by various specialists. There is also the words "internets" though most here would say there can be only one internet. Groups of people get to decide the meanings of words, and a critical mass of them can define the language.

It never ceases to amaze me by Anonymous Coward · 2013-09-21 04:40 · Score: 0, Funny

how tenaciously researchers cling to their old, cludgy fortran code.

Re:It never ceases to amaze me by Mitchell314 · 2013-09-21 04:50 · Score: 3, Insightful

In old codes, you're already familiar with the existing quirks and bugs, and the base is heavily patched up from years of debugging.

--
I read TFA and all I got was this lousy cookie
Re:It never ceases to amaze me by Anonymous Coward · 2013-09-21 04:55 · Score: 3, Insightful

Fortran is by no means outdated. Seriously, check out the new Fortran 2008 standard and its state-of-the-art compilers (e.g. the NAG one).
You'll be blown away by its speed and clean looking code. C++ might have features that fortran lacks (complex template usage seems rather popular), but that doesn't always reduce the development time. At least that my experience.
As long as you're working on scientific projects, fortran is practically unmatched.
Re:It never ceases to amaze me by Anonymous Coward · 2013-09-21 05:05 · Score: 0

Yes, exactly.
FORTRAN has a syntax that was designed for scientific and engineering applications. And as the parent pointed out, compiler design has improved dramatically over the decades.
Spending all that time and effort to change another language just to have a different different syntax makes no sense. It gets you nothing and that time is better spent on actual work - or free time.
And when you think of it, there hasn't been any improvements in programming languages since FORTRAN or COBOL.
Really those languages were to move programmers away from assembly and machine code. And today's "modern" languages really don' t offer much more - except, I guess, more abstraction and it's subsequent overhead.
Re:It never ceases to amaze me by mjwalshe · 2013-09-21 05:21 · Score: 3, Insightful

have you any idea how much it woudl cost to port it to cludgy C++ (which lacks a lot of things needed for scientific computing) you then have to re-qualify All of your models which is both time and resource intensive.
Re:It never ceases to amaze me by Anonymous Coward · 2013-09-21 06:20 · Score: 0

I'm a scientific software developer an have written and maintained a good bit of Fortran 77 and 90 code, but never any modern Fortran. I could never figure out what good websites or books to reference even to learn about the fancier features of modern Fortran. I can see that it's used in some areas, but it's got little use in mine (solid mechanics, fluid mechanics).
Re:It never ceases to amaze me by mikael · 2013-09-21 07:03 · Score: 1

And the funny thing is, game developers who are designing game engines in C++ for multi-core systems also use scripting languages like LUA at the high level. Then you have so many ways of doing parallel processing of arrays in C++; STL vectors (foreach), Intel TBB, Intel ABB, Boost, pthreads, and many others. I can't imagine what it would be like trying to bolt together a dozen or more different utility libraries each using their own favorite blend of parallel processing API's.
I guess Fortran is like the Python language in that there is only one way of writing everything.

--
Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
Re:It never ceases to amaze me by boristhespider · 2013-09-21 07:05 · Score: 1

A pedantic note - it hasn't been called "FORTRAN" since Fortran 90 was introduced. Otherwise it's nice to see people defending it for scientific applications :)
Re:It never ceases to amaze me by boristhespider · 2013-09-21 07:19 · Score: 4, Funny

Not at all. It might be a bit more monocultured than, say, C++ but there are still more than enough ways to skin the same cat that you end up with a ton of cat parts and a mass of confusion.
Re:It never ceases to amaze me by mbkennel · 2013-09-21 08:12 · Score: 2

" I can't imagine what it would be like trying to bolt together a dozen or more different utility libraries each using their own favorite blend of parallel processing API's."

In Fortran you don't. Fortran has the mathematically expected parallel constructions built into the language, and the compiler directives commonly used before things are entirely in the language were reasonably standard.

I think Fortran is very good for quantitative programming and I regret that in my commercial enterprise it is essentially forbidden as alien.
Re:It never ceases to amaze me by Anonymous Coward · 2013-09-21 08:55 · Score: 0

have you any idea how much it woudl cost to port it to cludgy C++ (which lacks a lot of things needed for scientific computing)
There is nothing you can do in fortran that can't be done better in C++
There are tons of things that can be done in C++ that can't be done in fortran. For instance, in C++ I can put a function pointer in a type. You cannot put a function pointer in a derived type in Fortran.
Re:It never ceases to amaze me by jythie · 2013-09-21 09:33 · Score: 1

Esp given how easy it is to link Fortran to C, C++, and ObjC. Granted it has been years since I worked with the stuff, but the last project I was on we jumped between Fortran, C and C++ depending on which had better stuff for any particular part of the program, and GCC compiled them all down into a single binary.
Re:It never ceases to amaze me by jythie · 2013-09-21 09:35 · Score: 1

While there are some add on libraries that can do 'ok', Fortran is generally much better then C or C++ at handling numbers. In applications I have worked on that used both languages, generally you only let the C++ code handle the data when it didn't have to be all that accurate, such as sending it to the UI. But for calculations you would keep it in Fortran.
Re:It never ceases to amaze me by boristhespider · 2013-09-21 09:45 · Score: 1

Very true. For my own comfort I tended to stick within pure Fortran programs - I'm still more comfortable in Fortran than in C or C++ - but there were things that were better to do elsewhere, particularly where I had access to a library that I much preferred (say, in C, which happened quite a bit) to what I easily had available in Fortran. Sure, I could have gone hunting but it was a lot easier to build a trivial wrapper around the C library and just call that. I can't actually remember why I didn't like the Fortran interfaces for the GSL that were around but I ended up building wrappers around quite a bit of that at various times.
Re:It never ceases to amaze me by Lawrence_Bird · 2013-09-21 09:51 · Score: 1

please go back to your java and c++ world and leave Fortran alone - you don't know the first thing about Fortran or Fortran compilers.
Re:It never ceases to amaze me by Aardpig · 2013-09-21 10:40 · Score: 1

Yes you can, since Fortran 2003.

--
Tubal-Cain smokes the white owl.
Re:It never ceases to amaze me by Anonymous Coward · 2013-09-21 15:18 · Score: 0

The choice of programing language has nothing to do with the accuracy of calculations.
NEVER EVER USE FORTRAN FOR SCIENTIFIC CODE
It's a dead language.
Re:It never ceases to amaze me by Anonymous Coward · 2013-09-21 15:54 · Score: 0

This is just wrong. Here's an example that I run into as a physicist rather frequently:
Fortran handles arrays for scientific purposes far more efficiently and clearly than C++ does. For instance, let's say I'm doing a quantum mechanics calculation and I want to construct a multidimensional array that holds a bunch of eigenvalues, and these eigenvalues are indexed by quantum numbers, one of which ranges from 1 to n, one which ranges from 0 to L, and one which ranges from -L to L (and the specific eigenvalues may well depend on the quantum numbers themselves). Fortran allows array indices to start and end with any value you want. In C++, array indices always start at 0 (as far as I know). So in Fortran, you can label your arrays with numbers that make intuitive sense to the problem. In C++, you're required to add a few lines to keep track of the index math, which can obfuscate the code quite a bit (especially when you're debugging). That's one of the big reasons most scientists find it so much easier to write matrix routines in Fortran than in other languages.
Re:It never ceases to amaze me by Anonymous Coward · 2013-09-21 18:47 · Score: 0

Fortran isn't dead. It's immortal.
Re:It never ceases to amaze me by Anonymous Coward · 2013-09-21 18:55 · Score: 0

There are tons of things that can be done in C++ that can't be done in fortran. For instance, in C++ I can put a function pointer in a type. You cannot put a function pointer in a derived type in Fortran.
Et des "templates" de C++ ne sont pas aussi puissants que les "macros" de Common Lisp. So what? Use the language you like. The vast majority of scientific computing has no need for function pointers, and the vast majority of people doing scientific computing probably haven't even heard of function pointers. Here's something you can't do when writing in C++ ... avoid people who think C++ is cool.
Re:It never ceases to amaze me by Half-pint+HAL · 2013-09-22 01:36 · Score: 1

There is nothing you can do in fortran that can't be done better in C++
Yeah? Well there's nothing in C++ that can't be done better in assembler.

--
Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'

Very limited scope by RoverDaddy · 2013-09-21 04:51 · Score: 2

I took a look at TFA and followed up by reading the description of LibGeoDecomp:

If your application iteratively updates elements or cells depending only on cells within a fixed neighborhood radius, then LibGeoDecomp may be just the tool you've been looking for to cut down execution times from hours and days to minutes.

Gee, that seems like an extremely limited problem space, and doesn't measure up at all to the title of this Slashdot submission. It might really be a useful tool, but when I clicked to this article I expected to read about something much more general purpose, in terms of 'bringing Legacy Fortran to Supercomputers'.

By the way, regarding the use of the word 'codes': I don't think English is the first language of this developer. Cut some slack.

--
RETURN without GOSUB in line 1050

Re:Very limited scope by Mitchell314 · 2013-09-21 05:01 · Score: 2

AFAIK a lot of simulation problems are centered around 'update node based on neighbors', like particulate dispersal or flux.

--
I read TFA and all I got was this lousy cookie
Re:Very limited scope by Zero__Kelvin · 2013-09-21 05:07 · Score: 1

FTA:

"My idea for this post was to take an simple 3rd party program and try to marry it with LibGeoDecomp while preserving as much as possible of its original code and structure. The Hello World equivalent for computer simulations is Conway's Game of Life. My choice fell on this code, kindly provided by KTH, Sweden. The code has several advantages:"
In regard to:

"By the way, regarding the use of the word 'codes': I don't think English is the first language of this developer. Cut some slack."
You really think that sentence was written by a person with a tenuous grasp of the English language. Seriously?

--
Guns don't kill people; Physics kills people! - John Lithgow as Dick Solomon on Third Rock From The Sun
Re:Very limited scope by Anonymous Coward · 2013-09-21 05:59 · Score: 0

I am confused, are we talking about the precedent sentence in quotes or are we discussing about the one in the TFA. Your ambiguity is really annoying and it feels amateurish.
Re:Very limited scope by Zero__Kelvin · 2013-09-21 06:17 · Score: 1

That's funny. I was going to say the same thing about your inability to create an account and log in! (Also, there is zero ambiguity; I was in fact quite specific.)

--
Guns don't kill people; Physics kills people! - John Lithgow as Dick Solomon on Third Rock From The Sun
Re:Very limited scope by Anonymous Coward · 2013-09-21 12:35 · Score: 0

That's funny. I was going to say the same thing about your inability to create an account and log in!
You are confused by my unwillingness to create an account ? Holy cow, you should have your temporal lobes checked....
Re:Very limited scope by Half-pint+HAL · 2013-09-22 01:43 · Score: 1

You really think that sentence was written by a person with a tenuous grasp of the English language. Seriously?
Tenuous grasp, no; but non_native != tenuous_grasp. The blog's at a German university. The choice of "marry" over "combine" is slightly unusual, as the idea of choice falling on something. It's very, very good. But it's still most likely not his first language, so pedantic polemics are uncalled for.

--
Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'

How do I put this... by girlintraining · 2013-09-21 05:01 · Score: 1

I think I speak for many geeks when I say....

KHHHAAAAAAAAAAAAAANNNNNN!!!!

That is all.

--
#fuckbeta #iamslashdot #dicemustdie

Misleading title by Anonymous Coward · 2013-09-21 05:11 · Score: 0

High performance Fortran compilers for supercomputers and clusters have been around since before a good portion of the posters here were born. In fact, they often beat compilers for other languages. In certain disciplines like atmospheric science, Fortran is the language for super computing problems, even today. Misleading title.

bigger problems by stenvar · 2013-09-21 05:12 · Score: 1

Seems to me that there are bigger problems when porting Fortran code to C++, like lack of a multidimensional array type in C++, lack of all the other Fortran libraries, and the fact that Fortran code usually still seems to give faster executables than comparable C++ code on numerical applications.

Re:bigger problems by bored · 2013-09-21 06:04 · Score: 1

fortran code usually still seems to give faster executables than comparable C++ code on numerical applications
I don't think this is true anymore. C++ is pretty much the only language that has BLAS libraries that can actually beat the fortran ones. The latest C++ template libraries are using SSE/etc vector intrinsics and are capable of meeting if not exceeding the fortran performance for many applications.
But, if you have a bunch of code in fortran, its probably not worth the trouble to convert it.
Re:bigger problems by mjwalshe · 2013-09-21 07:26 · Score: 1

If since 1979 a language hasn't manged to support things like multidimensional arrays maybe taking it out behind the back of the barn with a shotgun might be the best solution.
Re:bigger problems by oursland · 2013-09-21 08:18 · Score: 1

It's still true. Fortran uses these intrinsics as well, furthermore the way Fortran handles variables is stronger than C/C++, which permits the compiler to perform more aggressive optimizations. Fortran also has convenient syntax for performing common mathematical operations on datasets. Yes, you can replicate this in C++ with operator overloading, but Fortran puts this in at the core language permitting the compiler writers to target these specific operations for optimization.

Lots of existing code is in Fortran and is easiest interfaced in Fortran. In addition Fortran 2008 included things like concurrency in the language that C++ only got in 2011 as a part of the standard library.

The theme of this project is more about "my language (C++) the one true language" than reality.
Re:bigger problems by stenvar · 2013-09-21 08:43 · Score: 1

C++ is pretty much the only language that has BLAS libraries that can actually beat the fortran ones.
Why would any Fortran compiler be using a slower BLAS implementation than the C compiler?

The latest C++ template libraries are using SSE/etc vector intrinsics and are capable of meeting if not exceeding the fortran performance for many applications
Hand-tuned C code is "capable of meeting if not exceeding the fortran performance for many applications", but that doesn't make C a good numerical programming language. The question is whether normal, straightforward numerical code runs faster when written in one or the other language, not whether you can produce fast code if you invest enough time in writing it.
Re:bigger problems by Anonymous Coward · 2013-09-21 09:03 · Score: 0

C++ doesn't lack multidimensional array types, you blithering idiot. They just aren't first-class objects.
Re:bigger problems by Anonymous Coward · 2013-09-21 09:54 · Score: 0

The theme of this project is more about "my language (C++) the one true language" than reality.
Humorously, C++ is the one true language right now because of the exact same reason Fortran used to be -- libraries. Big numerical libraries, big GUI toolkits ... I hate to use it, but if I need to interface with the rest of the world, C++ probably has that native interface. Fast forward 20 years, and I think everyone will be wondering how we ever put up with the crap that is C++.
Regarding this example the blog post talks about... it's interesting as a blog post, and good for someone that wants to use this library (and there may be good reasons to do so), but it smells like crap for a Fortran programmer. The author complains about the Fortran code being slow because it copies an array instead of swapping pointers? Then use pointers in Fortran! They're great! To do otherwise is cheating. The array of 2000x2000 is too small to see really efficient parallel performance? Uh, no, that's plenty big. Shorter code in the end? I suppose, but the starting code was not written efficiently or concisely, and you took out all the comments too. Performance would be better with openMP? Uh, very unlikely these days.

Its code not codes FFS by mjwalshe · 2013-09-21 05:16 · Score: 1

And why would you fuck about with C++ when there is so much missing - just get a book and learn FORTRAN if you need to work in the scientific computing environment.

Re:Its code not codes FFS by Anonymous Coward · 2013-09-21 05:28 · Score: 0

When dealing with many HPC centres, the word 'code' is used instead of 'program' or 'application'. So 'codes' is correct, it simply means that two or more programs are involved.
Re:Its code not codes FFS by Greyfox · 2013-09-21 06:31 · Score: 1

If I were doing new development, which I am, I would use C++ because like Java and Ruby I can develop code in it more quickly, make my libraries more user-friendly (Where the user in this case is another programmer or myself later on) and am more likely to be able to find other programmers who can use those libraries without requiring them to learn an ancient language that just barely qualifies as structured-programming-capable. And yes, I HAVE written fortran code. And assembly, back in the day.
Unlike Ruby (Which I can develop code in VERY quickly) my C++ is type-safe and I can catch a lot of smile programming errors at compile time. I appreciate not finding run-time bugs in my deep space probe when it's 80 million miles from earth. And yes, I also write unit tests for my libraries.
Unlike Java, my C++ code doesn't require a gigantic VM to run in, I know exactly when my resources are being freed, I don't have to worry about someone else on the team using RTTI bullshit (I've never seen an instance of RTTI that wasn't indicative of a terrible design. Would welcome good examples,) and I can revert to the C standard library for full control of the hardware. If the OS can do it, the C standard library probably has a function for it. Mostly I'm just more comfortable in the language, though. In a lot more java-using applications, the decision to use Java was the wrong one. I've had to support a few of those. It's left me with a permanent distaste for the language. A distaste which Oracle does not help with its shenanigans (Attempting to claim copyright on APIs and trying to install some fucking useless toolbar every time it patches my system.)
Note that a lot of this IS subjective opinion and not really a critique of one language or the other. I can program in anything, if I need to. I just happen to like C++. I've posted what I consider to be respectably nifty libraries to github and already have several more planned. It's not like someone can't look at my source code and decide for themselves if it's crap or not.

--
I'm trying to teach myself to set people on fire with my mind... Is it hot in here?
Re:Its code not codes FFS by oursland · 2013-09-21 08:30 · Score: 2

Then you're likely a waste of time and detriment to your team.

I have had conversations with some of my friends who work on the peta-scale clusters and thought much the same as you. But, it turns out, when you're working with that level of system, you're probably addressing some small part of a much, much larger problem that has been largely solved. The existing code that performs 99.9% of your task is written in Fortran and actively developed by a very successful team of researchers. Attempting to rewrite the working, debugged, code so you can work in your favorite language today is not only impossible, but would likely get you removed from the team.
Re:Its code not codes FFS by Anonymous Coward · 2013-09-21 09:08 · Score: 0

You have no idea what the fuck you are talking about. Either your "friends" doesn't exist or they are full of shit.
Re:Its code not codes FFS by Anonymous Coward · 2013-09-21 09:40 · Score: 0

You are c++ villiage idiot!
Re:Its code not codes FFS by jythie · 2013-09-21 09:43 · Score: 1

Many technological decisions are based off how easy it is to find programmers to fill roles, which means if you want to be able to easily hire people onto a project you have to bend to what is generally popular. I worked on several projects that ended up switching languages or even OSes because what we were using made finding candidates more difficult.
Re:Its code not codes FFS by jythie · 2013-09-21 09:46 · Score: 2

Not really. When projects are already dominated by a particular language, esp projects that can have decades or more of legacy design to them, programmers who want to come in and rewrite perfectly good subsystems in their preferred language are not all that well looked upon.
Re:Its code not codes FFS by mjwalshe · 2013-09-21 09:52 · Score: 1

yeah that is how I started at a world leading Rnd organization was told to get a book from company library and teach my self FORTRAN (BTW I was a high school leaver) I would expect any CS graduate to be able to do that - either that or I am a 10x programmer and didn't realize it.
Re: Its code not codes FFS by cwebster · 2013-09-21 10:00 · Score: 3, Informative

Please don't learn FORTRAN, learn Fortran instead. (For the pedantic, all caps is F77. Normal caps is F90 and later.)
Re:Its code not codes FFS by Greyfox · 2013-09-21 11:07 · Score: 1

You didn't really read my post, did you? I'm perfectly fine being presented a complete system written entirely in FORTRAN and being told to support it. If I'm doing NEW DEVELOPMENT, I prefer C++.
Thing about those old systems, they typically weren't written by dumbasses. Most of my career has been following along behind dumbasses cleaning up at them. It's lucrative work, and I'm never hurting for something to do. Every so often I happen upon a system that was actually written by engineers and it's usually delightful when I find one.
Given the ease with which you snap to a judgment, you must be a fucking amazing programmer, so let's see your open source repos. My home page is set to mine.

--
I'm trying to teach myself to set people on fire with my mind... Is it hot in here?
Re:Its code not codes FFS by Khashishi · 2013-09-21 14:10 · Score: 2

That's because you aren't doing development on computationally expensive simulation codes that run on supercomputers. Because then you would use FORTRAN. C++ is such a memory hog, and the memory overhead scales with the number of processors. In FORTRAN, you only allocate what you need to use, and that's important when working with large arrays. Java and Ruby are out of the question.
FORTRAN is not obsolete, because there are currently no other languages that can fill the role. When running simulations that take 100000+ cpu-hours, it's worth the extra coding effort to write it in FORTRAN. Assembly language isn't being considered because generally, these codes need to run on different supercomputers which all have unique architecture. Therefore, optimizing compiling scripts exist for each supercomputer for use with FORTRAN.
Re:Its code not codes FFS by serviscope_minor · 2013-09-21 20:12 · Score: 1

C++ is such a memory hog, and the memory overhead scales with the number of processors.
What on earth are you talking about?

--
SJW n. One who posts facts.

Modern Fortran by Anonymous Coward · 2013-09-21 05:16 · Score: 0

Modern supercomputers all have perfectly adequate Fortran compilers from a variety of vendors, so I'm not sure what problem this library is trying to solve.

And yes, in HPC-world an application is known as 'a code'. Believe it or not the ones I work with have configuration files known as 'input decks', a terminology dating back to the days of punch card input.

Re:Modern Fortran by rubycodez · 2013-09-21 05:45 · Score: 2

more than adequate, Fortran is still the most optimizable language for high performance numeric computation, moreso than C and derived languages
Re:Modern Fortran by gentryx · 2013-09-21 06:39 · Score: 1

Yeah, if your Fortran code already scales on big iron, then LibGeoDecomp probably doesn't have much to offer for you. This article was rather meant as a primer for those who are working on older, sequential Fortran codes which are not yet parallelized, and who don't want to go through all the pains of building an MPI-enabled parallelization for them.

--
Computer simulation made easy -- LibGeoDecomp
Re:Modern Fortran by dfghjk · 2013-09-21 06:43 · Score: 1

Fortran is the most used and therefore the biggest target for continued improvement. Saying it is the "most optimizable" means nothing. As a tools company you aren't going to focus on what none of your customers do.
x86 is the most modern high-performance instruction set by your reasoning. Sometimes alternatives are just not sufficiently compelling, that doesn't mean they are inferior.
Re:Modern Fortran by oursland · 2013-09-21 08:24 · Score: 1

Compilers often cannot make optimizations in C/C++ and similar languages because of how flexible the languages are to the user's needs. Fortran, on the other hand, is more restrictive and the compiler can make guarantees about aliasing and alignment that permit things like autovectorization. This really is a part of the core language, not just the result of monumental resources put at the issue.
Re:Modern Fortran by Anonymous Coward · 2013-09-21 10:24 · Score: 0

Fortran has built-in syntax elements to specify e.g. sequential loops ("For") or parallel loops ("forall"), allows explicit declaration of subroutines/functions without side effects (which thus may be used in forall constructs), etc. A good Fortran coder (I only ever met one) will use these extensively. The compiler then has much more information for optimization then you can ever convey in C or C++.
Re:Modern Fortran by rubycodez · 2013-09-23 03:29 · Score: 1

but many more languages are now more common that Fortran, and most newly created code is C++ and often wrapped by other languages such as python.
x86 most modern?, it may well be. You do realize that x86 processors don't have internals that represent the instruction set (unlike processors of decades ago), it is more accurate to say they emulate the x86 via microcode. The internal architecture is extremely advanced even if the x86 memory and register model is a not a linear clean and orthoganal design.

Send In The Codes by Anonymous Coward · 2013-09-21 05:31 · Score: 0

Isn't it rich?
Are we a pair?
Me here at last on the ground,
You in mid-air.
Send in the codes.

Isn't it bliss?
Don't you approve?
One who keeps tearing around,
One who can't move.
Where are the codes?
Send in the codes.

Just when I'd stopped
Opening doors,
Finally knowing
The one that I wanted was yours,
Making my entrance again
With my usual flair,
Sure of my lines,
No one is there.

Don't you love farce?
My fault, I fear.
I thought that you'd want what I want -
Sorry, my dear.
And where are the codes?
Quick, send in the codes.
Don't bother, they're here.

Isn't it rich?
Isn't it queer?
Losing my timing this late
In my career?
And where are the codes?
There ought to be codes.
Well, maybe next year . . .

Author here. by gentryx · 2013-09-21 05:36 · Score: 3, Informative

The IEEE and Los Alamos National Laboratory seem to have a different opinion on this. And even the Oxford dictionary knows the use of codes. But surely those guys can't even spell gigahertz.

--
Computer simulation made easy -- LibGeoDecomp

Re:Author here. by gigaherz · 2013-09-21 05:49 · Score: 1

I guess I was mistaken in assuming the computer-science way of thinking about the concept of "code" (vs. "program", "algorithm", or "function", which are definitely discrete, countable things) should extended to other fields.
As a side note, although it is true that my nickname was originally a misspelling of "gigahertz", I chose it while I was young, and as a non-native English speaker, my knowledge of the language was lacking. I have been perfectly aware of that fact for a long time, but I chose to maintain the uncorrected version, both for the habit, and because I found it amusing that "herz", when read in German, means "heart".
Re:Author here. by dfghjk · 2013-09-21 06:20 · Score: 1

"I guess I was mistaken in assuming the computer-science way of thinking about the concept of "code" (vs. "program", "algorithm", or "function", which are definitely discrete, countable things) should extended to other fields."
You were not mistaken, the world just grows more poorly educated. Yesterday's illiteracy is today's literacy. Making "code" synonymous with "program" and subsequently requiring a plural form serves no purpose, sounds stupid, and is stupid.
No doubt others wish to "commentate" on the matter so I step down from my soap boxen.
Re:Author here. by Anonymous Coward · 2013-09-21 06:41 · Score: 0

giga-Hertz, you insensitive clod.
Re:Author here. by boristhespider · 2013-09-21 06:42 · Score: 2

I'd suggest you don't be so pious. I'm for protecting the language as much as anyone else but ultimately it evolves. I don't think this is really about the people employing numerical techniques in science becoming "more poorly educated"; I think it's about your field branching out and attracting new jargon and new uses for the old jargon. It's just what happens.
As it happens, I've spent close to ten years in academia where we build "codes" (typically in Fortran -- 90 or more recent if you were lucky; 77 if you weren't) to solve problems. I have since moved into professional development, chiefly in C++ and occasionally in C#. The use of the spoken language changes and the two fields have different ways to express the same concepts. Ultimately I don't really see a major problem with this.
All that said, misuse the word "corn" and I begin to get extremely irritated so I probably shouldn't be so pious myself... :)
[In British English "corn" doesn't mean maize, it means wheat, or occasionally barley -- more accurately, it means the chief arable crop of an area. When English speakers settled North America, that chief arable crop was maize, hence the American usage. What annoys me isn't Americans calling maize "corn" -- which is entirely valid in both North America and in their dialect of English -- but rather the *British* thinking that corn is maize. It isn't, it's wheat. I also wish they'd get off my God damned lawn.]
Re:Author here. by petteyg359 · 2013-09-21 06:43 · Score: 1

I had some common taters for dinner.
Re:Author here. by gentryx · 2013-09-21 06:46 · Score: 1

Just to add another twist: even as an English native speaker I would not be surprised if you spelled Hertz wrong since it's a German name, and because Herz and Hertz are pronounced identically in German, it's even a common misspelling in Germany, too. :-)

--
Computer simulation made easy -- LibGeoDecomp
Re:Author here. by Anonymous Coward · 2013-09-21 07:51 · Score: 0

It doesn't help that a Herz (heart) beats at 1 Hertz, so it would even be a plausible etymology. :-)
Re:Author here. by Half-pint+HAL · 2013-09-22 00:44 · Score: 1

You were not mistaken, the world just grows more poorly educated. Yesterday's illiteracy is today's literacy.
Forsooth, good sirrah, thou hast spake sagely, and shewn thyself more wise that thy wyrd wurds should haue me think.

--
Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
Re:Author here. by Half-pint+HAL · 2013-09-22 00:47 · Score: 1

"Corn" means "grain" (grain is from the French for corn, coming from a common root). "Corn" is only wheat by virtue of wheat being the most common type of grain... a "default", if you will. Similar to how I call a mallard a "duck", because it is the only breed found commonly where I grew up. (Daffy and Donald bost confused me when I was a child...)

--
Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
Re:Author here. by boristhespider · 2013-09-22 01:46 · Score: 1

Calling corn 'wheat' a default is a good way of putting it - it was what I was trying to get across in more words. "Corn" is ultimately related to "Kernel" and does certainly come from "seed", "grain". But the language evolved so that now it's a catch-all for grains, and wheat by default across most of Europe. Of course, I want the kids off my lawn and the language to stop evolving ;)

Very limited indeed by gentryx · 2013-09-21 06:03 · Score: 4, Informative

I took a look at TFA and followed up by reading the description of LibGeoDecomp:

If your application iteratively updates elements or cells depending only on cells within a fixed neighborhood radius, then LibGeoDecomp may be just the tool you've been looking for to cut down execution times from hours and days to minutes.

Gee, that seems like an extremely limited problem space, and doesn't measure up at all to the title of this Slashdot submission. It might really be a useful tool, but when I clicked to this article I expected to read about something much more general purpose, in terms of 'bringing Legacy Fortran to Supercomputers'.

Correct. We didn't try to come up with a solution for every (Fortran) program in the world. Because that would either take forever or the solution would suck in the end. Instead we tried to build something which is applicable to a certain class of applications which is important to us. So, what's in this class of iterative algorithms which can be limited to neighborhood access only?

cellular automata
stencil codes
Lattice Boltzmann methods for computational fluid dynamics (technically a subclass of stencil codes)
Particle in cell codes
Short-ranged n-body simulations

It's interesting that almost(!) all computer simulation codes fall in one of the categories above. And supercomputers are chiefly used for simulations.

By the way, regarding the use of the word 'codes': I don't think English is the first language of this developer. Cut some slack.

Thanks :-) You're correct, I'm from Germany. I learned my English in zeh interwebs.

--
Computer simulation made easy -- LibGeoDecomp

Re:Very limited indeed by jabuzz · 2013-09-22 21:05 · Score: 1

All of which can be programmed as matrix based, using the inbuilt matrix operations of the language (Fortran 90 or later) and which I would expect the compiler to parallelize for me without me doing anything.
Re:Very limited indeed by gentryx · 2013-09-22 21:15 · Score: 1

There is more to it though than just parallelization and vectorization. Are you familiar with cache blocking? If not, here is a great paper on the subject. This is something the compiler won't do for you as it transforms the algorithm. Our library can do this (gives you approx. 2x speedup). The library can do this because it knows more about the problem domain compared to a (generic Fortran) compiler.

--
Computer simulation made easy -- LibGeoDecomp

Old and kludgy makes it harder to port. by Ungrounded+Lightning · 2013-09-21 06:31 · Score: 2

Not only does it cost a LOT to port this stuff and risk errors in doing so, but the cruftier it is the harder (and more expensive and error-prone) it is to port it.

If, instead, you can get the new machines to run the old code, why port it? Decades of Moore's Law made the performance improve by orders of magnitude, and the behavior is otherwise unchanged.

If you have an application where most of the work is done in a library that is largely parallelizable, and with a few tiny tweaks you can plug in a modern multiprocessor-capable library and run it on a cluster, you get another factor of almost as-many-processors-as-I-decide-to-throw-at-it, with small effort and negligible chance of breaking the legacy code.

What a deal!

And it's one less reason to touch the tarbaby of the rest of the working legacy code.

Let the COMPUTER do the work. People are for setting it up - with as little effort as practical - and moving on to something else that is important and can't yet be automated.

Eventually somebody will teach the computers to convert the Fortran to a readable and easily understandable modern language - while both keeping the behavior identical and highlighting likely bugs and opportunities for refactoring. Until then, keeping such applications in the legacy language (unless there's a really good reason to pay to port them) is often the better approach - both for economy and reliability.

--
Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way

Re:Old and kludgy makes it harder to port. by boristhespider · 2013-09-21 07:24 · Score: 1

"Eventually somebody will teach the computers to convert the Fortran to a readable and easily understandable modern language - while both keeping the behavior identical and highlighting likely bugs and opportunities for refactoring."
That language will likely be Fortran 2008 or 2015...
Re:Old and kludgy makes it harder to port. by jythie · 2013-09-21 09:36 · Score: 1

Well, you can always decompile the Fortran into Java.....
Re:Old and kludgy makes it harder to port. by boristhespider · 2013-09-21 09:41 · Score: 1

I've long been tempted to get a .net compiler for Fortran. That would make it *really* easy to build some ugly Java.

The trick is to avoid solving the bigger problems by gentryx · 2013-09-21 06:35 · Score: 1

We're using Boost Multi-array as a multi-dimensional array, so that's not really a problem. And since we call back the original Fortran code users are still free to use their original libraries (some restrictions apply -- not all of these libraries will be able to handle the scale of current supercomputers).

Regarding the speed issue: yeah, that's nonsense today. It all boils down writing C++ in a way that the compiler can understand the code well enough to vectorize it.

--
Computer simulation made easy -- LibGeoDecomp

Fortran works fine with MPI by poodlediagram · 2013-09-21 06:48 · Score: 5, Informative

...and has done for years.

We write a scientific code for solving quantum mechanics for solids and use both OpenMP and MPI in hybrid. Typically we run it on a few hundred processors across a cluster. A colleague extended our code to run on 260 000 cores sustaining 1.2 petaflops and won a supercomputer prize for this. All in Fortran -- and this is not unusual.

Fortran gets a lot of bad press, but when you have a set of highly complex equations that you have to codify, it's a good friend. The main reason is that (when well written) it's very easy to read. It also has lot's of libraries, it's damn fast, the numerics are great and the parallelism is all worked out. The bad press is largely due to the earlier versions of Fortran (66 and 77), which were limited and clunky.

In short, the MPI parallelism in Fortran90 is mature and used extensively for scientific codes.

Re:Fortran works fine with MPI by Anonymous Coward · 2013-09-21 07:01 · Score: 0

And Fortran 2008 brings goodies not found anywhere else.
Re:Fortran works fine with MPI by oursland · 2013-09-21 08:21 · Score: 1

I'm not 100% sure on that. Languages like Go have brought in a lot of the same things, like language-level concurrency. However, Fortran has really been designed to address the problems that are solved on supercomputers first and general language second. This makes it far easier to focus on the task at hand instead of working around limitations in the language.
Re:Fortran works fine with MPI by Anonymous Coward · 2013-09-21 10:12 · Score: 0

Fortran 2008 is fine. I use it all the time. But don't lie... Turing complete is Turing complete. You want something that is unique and awesome? Macros in Common Lisp will knock your socks off.

God... by Anonymous Coward · 2013-09-21 06:54 · Score: 1

You do know that Fortran 2008 has better support for parallelism and concurrency than c++ don't you? Or do you still think everyone is using F77?

Codes code as peoples people by amaurea · 2013-09-21 06:57 · Score: 2

I, too, work in HPC computing, and while I found "codes" very jarring to begin with, I've learned to live with it. I am not sure the "code" vs. "codes" issue it is more grammatically problematic than "people" vs. "peoples". A people (countable) is made up of people (uncountable). Similarly "a code" (countable, but nonstandard) is made up of code (uncountable). Personally I would use "a program" or "a library" instead of "a code", though.

Another related issue is whether "data" is countable or not. I'm used to it being uncountable, with there being more or less of it, but not "several data". But scientific journals in my field prefer the countable version "a datum", "several data", which is arguably more historically correct. That, too, took some getting used to.

"Legacy" Fortran code? by Anonymous Coward · 2013-09-21 07:03 · Score: 1

As long as you need a double C (and forget about C++) standard committee tracker diploma to write a fucking function processing three non-aliased arguments with two-dimensional runtime-sized arrays, a C compiler supporting the most recent standards will produce much worse code than a Fortran compiler compiling some 50-year old Fortran subroutines written by a reasonably good mathematician without much programming experience.

Of course, if we are talking about earlier C standards, not even a standard-following geek has a chance to come close. And if we are talking about C++, we are still waiting for standards that allow efficient code generation for the most basic numerical subroutines.

Try it: write a simplistic Fortran subroutine doing a matrix multiplication for variable-sized multidimensional arrays. Easy to write in a Fortran-IV subset of the language. A moderately talented squirrel could do it between focusing on its nuts.

Then try to coax a C or C++ compiler into generating anything closely efficient. You'll need to revert to recent language standards, and your generated code will still have a much worse product of runtime times incomprehensibility.

That's why extern "Fortran" is still the most important numeric C code ingredient. Because the old libraries were written by genius mathematicians and lousy programmers, and you need genius programmers to get the stuff close to good in C/C++. But double geniuses are hard to come by.

You only have to rewrite it a *little* bit by msobkow · 2013-09-21 07:26 · Score: 4, Insightful

You don't have to rewrite your code entirely, just a little bit.

You only have to restructure the subroutines and change the syntax.

Well, that sounds like rewriting to me. Just because there is a library that might implement the same semantics as FORTRAN's math does not mean that it isn't a rewrite, coming with all the risks for new errors and gotchas that that implies.

--
I do not fail; I succeed at finding out what does not work.

Re:You only have to rewrite it a *little* bit by Anonymous Coward · 2013-09-21 11:30 · Score: 0

My observation exactly! First I scanned the article, and thoiught, "wow an AI to convert Fortran to C!" Then I read a bit more and saw the rewriting involved.
In the cases in which I had to 'convert' fortan to C it was to use the preexisting old sphaghettied fortran in C or C++ libraries. Rather than pull my hair out trying to figure out the fortran programmers intent, I just turn the fortran into a subroutine and connect the fortran obj files to my C code.
You do know that fortran was not designed to be easy to read, reuse or recycle.

Re:The trick is to avoid solving the bigger proble by emt377 · 2013-09-21 07:39 · Score: 1

You never want a compiler to vectorize code. You want interfaces to vectoring hardware that you use to vectorize operations on your data. Just like you don't want compilers to provide multidimensional arrays - memory isn't multidimensional, so there's no natural layout. Instead you implement the arrays you need - even if they look the same the complexity contract and implementation is completely different for statically dimensioned (e.g. template params in C++) vs dynamically dimensioned (can be resized); sparsely populated either an entire row in a dimension, by specific dimension, or by any dimension (for instance only have data in rows 0, 5, 10383484387373, colums -4948484, 0, 338383 - implying sparsely populating only the intersecting cells); where indexes are arbitrary types (say complex), etc. NONE of this has a natural representation. Just like vectored operations in a NUMA architecture require careful data management for maximum throughput - so if you want to apply this to a sparse data set for instance you need to think through how this is to be done rather than just think a compiler can spit it out for you (other than in the most trivial demos that lack real-world requirements).

Efficient software is more than good assembly by gentryx · 2013-09-21 07:39 · Score: 1

Your argument seems to focus mainly on how well a compiler can optimize a given code. But writing efficient software takes more. Ever tried to implement an AMR or 3D cache blocking in Fortran? It's a pain. Object orientation gives your programmers a huge boost in efficiency. And if they can use this efficiency to implement algorithms which converge faster, then this will make your code ultimately run faster. Even the last piece, the arithmetic kernel, can be done efficiently in C++ if you adopt modern libraries like Boost SIMD.

--
Computer simulation made easy -- LibGeoDecomp

Re:Efficient software is more than good assembly by Anonymous Coward · 2013-09-21 20:30 · Score: 0

Your argument seems to focus mainly on how well a compiler can optimize a given code. But writing efficient software takes more.
For number-crunching numerical code: no, that's basically it.

Ever tried to implement an AMR or 3D cache blocking in Fortran? It's a pain.
Most definitely. Any algorithm requiring sophisticated data structures is what is giving old-style Fortran its deserved bad reputation. C/C++ provide the tools for creating data structures on a low level. Including multidimensional arrays (you "just" need to do all of the index arithmetic for more than one dimension manually, and strength reduction can kick in. You're still hampered by aliasing, though).
The problem is that the standard toolsets don't include such things, and that the view on the data structures is not a high level one. Arrays are pointers you can do arithmetic with, and that means that the compiler cannot deduce a lot of high-level invariants for your data structures. And that severely hampers its ability for optimization.
And that's really bad regarding the semantics of multidimensional arrays and/or number crunching. For scalar data flows, C is not too bad.
High quality C++ libraries tend to work around some of the partly intentional gaps. The problem is that each library has its own incompatible framework for implementing missing basic data structures, and that means that you can't just plug them together, which defeats the idea of a library.

Object orientation gives your programmers a huge boost in efficiency. And if they can use this efficiency to implement algorithms which converge faster, then this will make your code ultimately run faster.
If the core of an algorithm is number crunching, then faster convergence is achieved by faster number crunching.

Even the last piece, the arithmetic kernel, can be done efficiently in C++ if you adopt modern libraries like Boost SIMD.
The sobering thing with object orientated programming in C++ is that things usually still don't fit together unless they have been designed from the start to fit together. Which sort of defeats the whole idea.
You have Boost, which is nice. But does not cooperate with anything else, having its own matrix classes.
In Fortran, you can plug together dinosaurs like LINPACK, BLAS, EISPACK, ODEPACK and others in order to solve tasks. And they optimize better than any equivalent written in C.
I'm not a fan of Fortran. But that does not mean that I am blind to half a century of egg that C/C++ unbelievably keeps accumulating on its face. Can you believe that there still is no common way between C and C++ to work with and pass multidimensional arrays of varying size to functions? C got them about a decade ago after far too long, but C++ has not adopted them because of the "it's more fun to let everybody try to do it by hand, because we can" attitude.
Fortran programs are the silverfishes of programming. They are stuck eons behind, but nobody really bothered competing in their ecological niche.

Did you read TFA? by gentryx · 2013-09-21 07:50 · Score: 1

Just asking because otherwise you'd had a better view on how intrusive (or not) this restructuring is. To give some numbers: a while ago we ported a simulation (video here) to the library. The simulation model was about 5000 lines of code. Not much, but the code was highly condensed and had been carefully modeled in the course of 3 years. We ended up having to change less than 100 lines to make it work with LibGeoDecomp. That's a far cry from a rewrite.

--
Computer simulation made easy -- LibGeoDecomp

Re:Did you read TFA? by msobkow · 2013-09-21 09:55 · Score: 1

And so on the basis of one example you're willing to take their word that changing languages doesn't require re-debugging the entire program?
My, my, but you are naive, aren't you?

--
I do not fail; I succeed at finding out what does not work.
Re:Did you read TFA? by msobkow · 2013-09-21 09:57 · Score: 1

Even when you change compilers but keep the same source code you have to redebug complex FORTRAN code, due to idiosyncracies in implementations over the years.

--
I do not fail; I succeed at finding out what does not work.
Re:Did you read TFA? by msobkow · 2013-09-21 10:02 · Score: 1

Sometimes you get new bugs even with the same compiler, just because you changed optimization flags for the build.

--
I do not fail; I succeed at finding out what does not work.

Targeted at Managers by PPH · 2013-09-21 07:52 · Score: 1

Someone has some legacy Fortran code and a task of modifying it. There are two approaches: Port it or work on the existing source. Porting it allows for hiring from a very large (but shallow*) pool of programmers familiar with 'current' languages like C++. Working with the existing code means having to locate resources in a much smaller market. The former are cheap. The latter much more expensive. What to do?

*Good programmers can probably pick up a book and teach themselves Fortran pretty easily. But even in the C++ world, these people are more highly paid. There exists a large supply of people who know one language, but not the concepts of programming in general and are not cross trainable. These people work cheap**.

**Putting this class of people on such a project probably signals disaster.

--
Have gnu, will travel.

Recent experience with old code by spaceyhackerlady · 2013-09-21 07:54 · Score: 1

Reminds me of a recent experience writing a new system to replace a legacy system.

A key part of one of the homegrown network protocols was a CRC. This sounds OK, but the implementation was wrong. I spent a lot of time trying to reverse-engineer just what the original engineers had implemented. The fact that it was written in ADSP2181 assembler didn't help. It had never been an issue before because both ends of the link used the same wrong implementation, so the errors cancelled out.

I ended up writing an instruction-level simulation of the ADSP2181 processor (only needed a handful of instructions) and executing the original code directly. It works fine. Performance isn't an issue, though moving from a 33 MHz DSP chip to an eight-core 2.8 GHz box certainly helps in that department. :-)

...laura

interesting tool, misleading summary by excelsior_gr · 2013-09-21 08:02 · Score: 4, Interesting

It is true that there are a lot of legacy Fortran codes in scientific computing, but chances are that they are already parallel, so this tool won't be much of a use for those supporting them. OpenMP and MPI have been in use in Fortran codes for decades. The summary seems to think that legacy Fortran codes need saving and porting. They don't. They are just fine, number crunching faster than you can say DO CONCURRENT.

Having said that, LibGeoDecomp seems quite nice if you find a piece of serial code and you want to make a rough parallel version of it without much hassle. But if you are writing new code, you can parallelize it natively. Nevertheless, I believe that we must focus our resources in developing the current compilers. The Compaq compiler died in the hands of HP and people moved mostly to the intel compiler, since the open-source community was focused in C++ at the time and the gcc was stuck with the obsolete g77. Then g95 came along, that brought us all the cool stuff of Fortran 90/95, while gfortran was being developed. Now gfortran seems decent, but it still has to match the speed of ifort in order to sit at the cool kids' table. Also, we need the features of the latest Fortran standards. I would gladly use a compiler that is feature-complete, even if the executables are relatively slow, because I will be able to switch into the mindset of the Fortran2008 standard and stop doing things the Fortran95-way while coding. They will then have all the time they need to make it more efficient.

Re:interesting tool, misleading summary by ShakaUVM · 2013-09-22 21:36 · Score: 1

Yeah, we were used mixed Fortran/C++ code when I was in grad school circa 2000. The Fortran code was already parallelized due to a long history of having better parallelization tools than C, so we wrapped it in a C library and linked the object code together. Students in my HPC class didn't know that they were calling Fortran at all when they executed the numerical kernel we provided, just that it worked and was fast.

In my personal experience... by tlambert · 2013-09-21 08:10 · Score: 2

In my personal experience...

Most of the physics code in FORTRAN that I've dealt with are things like relativistically invariant P-P and N-P particle collision simulations in order to test models based on the simultaneous solution to 12 or more Feynman-Dyson diagrams. It's what was used to predict the energy range for the W particle, and again for the Higgs Boson, and do it rather reliably.

The most important part of this code was reproducibility of results, so even though we were running Monte Carlo simulations of collisions, and then post-constraining the resulting pair productions by the angles and momentum division between the resulting particles, the random number stream had to be reproducible. So the major constraint here was that for a reproducible random stream of numbers, you had to start with the same algorithm and seed, and the number generation had to occur linearly - i.e. it was impossible to functionally decompose the random number stream to multiple nodes, unless you generated and stored a random number stream sufficient to generate the necessary number of conforming events to get a statistically valid sample size.

So, it was linear there, and it was linear in several of the sets of matrix math as it was run through the diagrams to filter out pair non-conforming pair production events.

So we had about 7 linearity choke-points, one of which could probably be worked around by pre-generating a massive number of PRNG output far in excess of what would be eventually needed, and 6 of which could not.

The "add a bunch of PCs together and call it a supercomputer" approach to HPC only works on highly parallelizable problems, and given that we've had that particular capability for decades, the most interesting unsolved problems these days are not subject to parallel decomposition (at least not without some corresponding breakthroughs in mathematics).

I converted a crap-load of FORTRAN code to C in order to be able to optimize it for Weitek vector processors plugged into Sun hardware, including the entire Berkeley Physics package, since that got us a better vector processor than was on the Cray and CDC hardware at Los Alamos where the code was running previously, but adding a bunch of machines together would not have improved the calculation times.

Frankly, it seems to me that the available HPC hardware being inherently massively parallel has had a profound effect on constraining the problems we try to solve, and that there are huge, unexplored areas that are unexplored for what amounts to the equivalent of someone looking for their contact lens under the streetlight, rather than in the alley where they lost it, "because the light's better".

Re:In my personal experience... by Anonymous Coward · 2013-09-21 13:37 · Score: 0

See here for a parallel way to deal with your random number generation problem:
http://www.stat.osu.edu/~herbei/GPU/RNG.pdf
Re:In my personal experience... by tlambert · 2013-09-21 14:56 · Score: 1

See here for a parallel way to deal with your random number generation problem:
http://www.stat.osu.edu/~herbei/GPU/RNG.pdf
Thanks; read the paper; it presents three methods, 2 of which are unsuitable for parallel decomposition to an arbitrary number of CPUs (the Mersenne Twistor is not suitable to thread level decomp.), and one of which where you have to really carefully define you m(i). Changing algorithms isn't really an option, unless you are willing to rerun all of your historical computations, since unles you use the same PRNG, there is no guarantee of precise reproducibility, which is one of the issues here.
I think it'd be easier, with today's storage capabilities, to just pre-generate them, but that doesn't get around the dependent matrix operations problems which make it a linear computation after that.

But aside from that, Mrs. Lincoln? by russotto · 2013-09-21 08:24 · Score: 1

Source code modification is required, but mostly limited to restructuring into a new pattern of subroutines.

That's not what I call "limited". More like a rewrite, or at least a salvage operation.

Re:The trick is to avoid solving the bigger proble by stenvar · 2013-09-21 08:48 · Score: 1

We're using Boost Multi-array [boost.org] as a multi-dimensional array

Boost Multi-array doesn't support most modern Fortran array features, so it's useless for porting modern Fortran code to C++: you end up having to rewrite most of the code from scratch.

Regarding the speed issue: yeah, that's nonsense today [ieee.org].

That just shows that with enough effort, you can create efficient special purpose libraries in C++; of course you can. The question is whether straightforward, boring numerical code compiles into fast executables. If you write it using Boost multi-array, it ends up being much slower (not to mention more tedious) than equivalent Fortran code.

Re:The trick is to avoid solving the bigger proble by stenvar · 2013-09-21 08:52 · Score: 1

You never want a compiler to vectorize code.

I most certainly do.

Just like you don't want compilers to provide multidimensional arrays - memory isn't multidimensional, so there's no natural layout

There is a natural layout that handles 99% of all numerical needs. Numerical programmers understand it, and so do compilers.

NONE of this has a natural representation.

You listed a bunch of exceptional cases that should indeed be handled by libraries. But not to support common cases well because of exceptional cases is stupid.

FUD by gentryx · 2013-09-21 08:55 · Score: 1

Care to backup those claims with actual code/numbers? I'm just asking because my FUD alarm just rang. Part of my job is performance engineering. My experience is that if you use C++ correctly, you get code which at least matches Fortran code.

--
Computer simulation made easy -- LibGeoDecomp

Re:FUD by stenvar · 2013-09-21 10:47 · Score: 3, Informative

Care to backup those claims with actual code/numbers?
You claim to be writing high performance code and you don't understand the difference between Boost multi-array and Fortran arrays? I'm sorry, but if you do any kind of high performance computing, you should at least have a decent understanding of one of the major tools used for it, namely modern Fortran. Once you do, you can then make an informed choice, instead of behaving like an immature language zealot.
Here are two places you should start looking:
http://en.wikipedia.org/wiki/Fortran_95_language_features#Arrays_2
http://en.wikipedia.org/wiki/High_Performance_Fortran
(The Fortran code on libdecomp.org is cringe-inducing and inefficient.)
And, FWIW, I'm primarily a C++ programmer, because that's what the market demands, not a Fortran programmer, but at least I know my tools and their limitations.

My experience is that if you use C++ correctly, you get code which at least matches Fortran code.
If you use C, assembly, or Java "correctly", you can usually match Fortran code. That is entirely not the point.

misleading title and story by already_read · 2013-09-21 08:56 · Score: 1

The story makes it sound like there's no support for Fortran in MPI but there totally is: https://computing.llnl.gov/tutorials/mpi/ I recognize that some kind of abstraction layer in a support library on top of MPI might be useful, but let's call it what it is.

Re:The trick is to avoid solving the bigger proble by Anonymous Coward · 2013-09-21 14:04 · Score: 0

All true.

But there is a C++ library, ObjexxFCL, that has Fortran 2008 array semantics and speed. It is used in conjunction with a Fortran to C++ conversion system.

No, I'm taking MY word for it. :-) by gentryx · 2013-09-21 20:56 · Score: 1

Sorry, I should probably have added a disclaimer that I'm involved in the development of the library as my signature apparently doesn't make it obvious enough: I'm the project lead.

So far we've built about a dozen application with LibGeoDecomp, including porting a dozen large scientific codes towards it. You're right that porting a code usually involves debugging. But that's inevitable when parallelizing a previously sequential code anyway. We don't claim to do magic, we just have some cool tricks up our sleeves. And that's a Good Thing(tm). Because those who claim to cast magic usually disperse just b/s while clever tricks can save you weeks (months even) of work. Here is what you don't have to do if you use LibGeoDecomp:

You don't have to write a proven (and correct) parallelization that scales to 1850000 (that's 1.8M) MPI processes.
You don't have to devise your own domain decomposition and load balancing scheme.
You don't have to write scalable parallel IO and application-level checkpoint/restart code.
...and so on and so on. A more complete list is here.

As said, parallelizing a sequential code will almost always involve some sort of debugging, no matter which tool you use. But the library also brings a couple of facilities to ease that transition: 1. you can first adopt the SerialSimulator which performs no parallelization at all, but allows you to check the data transfer and callbacks. 2. you can then transition to those parallelization which run on a single node only (e.g. the CacheBlockingSimulator or the CudaSimulator) to check that there are no race conditions before (3.) you finally more to large scale systems using e.g. the HiParSimulator (used for full system runs on JUQUEEN, an IBM BG/Q and ATM the fastest European machine) or the HpxSimulator (used for runs on TACC's Intel Xeon Phi equipped Stampede; BTW: it's built on HPX, a parallel runtime to C++). 4. Finally you can piggy-back the TestCell onto your model, which will use checksums to validate the data the library gives back to your code.

--
Computer simulation made easy -- LibGeoDecomp

Re:No, I'm taking MY word for it. :-) by jabuzz · 2013-09-22 20:59 · Score: 1

I would just hope that my Fortran90 or better compiler did all that matrix stuff which is built into the language parallel for me automatically without me needing to lift a finger.
I would have thought that any scientific code in Fortran with obvious parallelism would have had any had matrix stuff re factored into the built in language matrix syntax ages ago, and would be a recompile away. I guess I must be missing something where scientific code with significant parallelism can't be expressed in matix form...
Anything that cannot be done this way is surely handled by coarray Fortran which is part of the 2008 standard. http://en.wikipedia.org/wiki/Co-array_Fortran
I am sure your project is useful, but I fail to see why you would not adopt more standard approaches.
Re:No, I'm taking MY word for it. :-) by gentryx · 2013-09-22 21:11 · Score: 1

Co-array Fortran is more generic than LibGeoDecomp, so there are problems you can solve with Coarrays where our library would be of little use. But then again there are algorithms which would require a lot of work just with Coarrays, but are a breeze with LibGeoDecomp. In short: both are solving different problems, albeit there is a certain overlap.

--
Computer simulation made easy -- LibGeoDecomp

QED by gentryx · 2013-09-21 21:15 · Score: 1

So you said Fortran codes we faster than C++ codes and now that's not the point any longer as they really aren't? Great, thanks!

The links you provided show that Fortran has some convenience functions for selecting parts of arrays and applying arithmetics to them. What I didn't see is anything you can't so with Boost Multi-Array and Boost SIMD.

--
Computer simulation made easy -- LibGeoDecomp

Re:QED by stenvar · 2013-09-22 02:14 · Score: 1

So you said Fortran codes we faster than C++ codes
I said no such thing; that doesn't make any sense. What I said is:
Boost Multi-array doesn't support most modern Fortran array features, so it's useless for porting modern Fortran code to C++: you end up having to rewrite most of the code from scratch.
and
That just shows that with enough effort, you can create efficient special purpose libraries in C++; of course you can. The question is whether straightforward, boring numerical code compiles into fast executables. If you write it using Boost multi-array, it ends up being much slower (not to mention more tedious) than equivalent Fortran code.

The links you provided show that Fortran has some convenience functions for selecting parts of arrays and applying arithmetics to them.
Yes, and in addition to that, the compilers know how to do kick-ass optimization on these "convenience functions", vectorize these expressions, and (depending on the compiler) parallelize them, for up to seven dimensions and any in-memory layout, stride, and indexes. In addition, there is a simple and straightforward notation for distributing those computations in Fortran and HPF.
And if all of that were so easy to implement, there would already be C++ libraries doing it, but unfortunately there aren't. Boost multi-array certainly does none of those things. Even if these features weren't so useful for writing readable, high performance numerical code (and they are), they are essential for porting modern Fortran code to C++, because if there isn't anything equivalent, a port requires all of that to be rewritten by hand with loops.
(In addition, your Fortran example and your use case for LibGeoDecomp are piss-poor, but that's a separate issue.)
Re:QED by gentryx · 2013-09-22 08:22 · Score: 1

Sorry, I got overexcited and did see something in your post that apparently wasn't there.
And yet I don't buy into this "OMG, C++ is either clumsy or slow compared to Fortran" FUD (I hope I'm paraphrasing it correctly this time). For a certain (perhaps smallish) domain LibGeoDecomp is such a library which makes it easy to write short, yet (nearly) optimal code with C++.
I don't doubt though that there are use cases where it's hard to come up with a good C++ solution while Fortran would outperform it in both, speed and simplicity.

--
Computer simulation made easy -- LibGeoDecomp
Re:QED by stenvar · 2013-09-22 16:28 · Score: 1

And yet I don't buy into this "OMG, C++ is either clumsy or slow compared to Fortran" FUD
Well, I suggest you learn modern Fortran well. Given that you claim to be writing tools for HPC, you really should.

For a certain (perhaps smallish) domain LibGeoDecomp is such a library which makes it easy to write short, yet (nearly) optimal code with C++.
Not even close. I got a single-core speedup of a factor of two simply by rewriting your Fortran example, which tells me that LibGeoDecomp must have significant overhead somewhere.
I write a significant amount of stencil code, and I don't see myself using LibGeoDecomp; it seems to be both less efficient and more cumbersome than other solutions.
Re: QED by gentryx · 2013-09-22 18:17 · Score: 1

I write a significant amount of stencil code, and I don't see myself using LibGeoDecomp; it seems to be both less efficient and more cumbersome than other solutions.
Well then, what about a challenge? Let's compare code size/performance for a simple example code? You'll use Fortran, I'll use my library.
I suggest a Jacobi-style smoother (v_ {t+1}(x, y, z) = (v_{t}(x, y, z-1) + v_{t}(x, y-1, z) + v_{t}(x-1, y, z) + v_{t}(x, y, z) + v_{t}(x+1, y, z) + v_{t}(x, y+1, z) + v_{t}(x, y, z+1)) * (1.0/7.0)) as the benchmark.
You seem to be an expert on the subject so I assume it won't be much of an effort or that you'll even have a solution readily at hands.

--
Computer simulation made easy -- LibGeoDecomp

Agreed. by gentryx · 2013-09-21 21:24 · Score: 1

If your code is already parallelized, LibGeoDecomp might not have a terrible lot to offer for you. The blog post was by no means directed against Fortran as a language. Instead it advocates a way for folks to bring their existing, sequential Fortran codes to supercomputers without having to spend months doing the parallelization manually.

--
Computer simulation made easy -- LibGeoDecomp

Re:Agreed. by Anonymous Coward · 2013-09-22 00:01 · Score: 0

without having to spend months doing the parallelization manually.
Months? I've done this work on a complex sequential Fortran code... took an afternoon. Parallelization around 98% using something like 64 processors. Piece of cake. If you think that it is really so hard, and have some use for it, feel free to leave information for job offers...

HPC is just a niche market, too by gentryx · 2013-09-21 22:56 · Score: 1

You're right: the current compute architectures we see in HPC are geared at data parallel problems of massive size. Clock speeds are stagnating, sometimes even stepping down (e.g. NVIDIA Kepler has its cores actually clocked slower that Fermi with its hot clock for the shaders). Your description sounds like you'd benefit from a singular core which is tuned for single thread performance (e.g. with really big caches, a large out of order execution window) and runs at 5-10 GHz (which might require liquid nitrogen cooling).

But then again this is another niche, probably even smaller than the current HPC market, so it might not be commerially viable to develop products for it.

--
Computer simulation made easy -- LibGeoDecomp

Moreover... by Anonymous Coward · 2013-09-23 07:15 · Score: 0

There is no absence of tools to support continued use of Fortran. There is an excellent free Gnu compiler for Fortran that implements all of Fortran 1995 (as best I can recall) and most of Fortran 2003. This includes the C interop features, which (possibly with a little bit of interface coding) allow Fortran to be called from C and vice versa without compiler-specific hacking. Intel sells a commercial compiler for Windows that plugs into Visual Studio. I think there's even a .NET implementation.

Slashdot Mirror

A C++ Library That Brings Legacy Fortran Codes To Supercomputers

157 comments