Slashdot Mirror


Ask the Man Behind the NOAA's New Beowulf Cluster

Greg Lindahl sent in this story last September about a massive Alpha Linux cluster that's being built by HPTi for the NOAA's Forecast Systems Laboratories. What Greg forgot to mention when he submitted the original story is that he's the project's chief designer. What with all the Beowulf (and Alpha) interest around here, we figured he'd make a great interview guest, especially now that the project is well under way. Please post your questions below. Answers to 10 - 15 of the highest-moderated ones should appear within the next week.

9 of 87 comments (clear)

  1. The Future of Scientific Programming? by Matt+Gleeson · · Score: 5

    The raw performance of the hardware being used for scientific and parallel programming has improved by leaps and bounds in the past 10-20 years. However, most folks still program these supercomputers much the same way they did in the 80's: Unix, Fortran, explicit message passing, etc.

    You have worked in research with Legion and in industry at HPTi. Do you think there is hope for some radical new programming technology that makes clusters easier for scientists to use? If so, what do you think the cluster programming environment of tomorrow might look like?

  2. In the beginning... by zpengo · · Score: 5

    How did you come to be the project's chief designer? I'm curious to know the background of anyone who gets to work on such an interesting project.

    --


    Got Rhinos?
  3. The Future of the Control Software by PacketMaster · · Score: 5

    I built a Beowulf-style cluster this past semester in college for independent study. One of the biggest hurdles we had was picking out a message passing interface such as MPI or PVM. Configurining across multiple platforms was then even worse (we had a mixture of old Intels, SunSparcs and IBM RS/6000's). What do you see in the future for these interfaces in terms of setup and usage and will cross-platform clusters become easier to install and configure in the future?

    --

    Some people take their .sig way too seriously

  4. Weather forecasting in general. by Matt2000 · · Score: 5

    Ok, a two parter:

    As I understood it weather models are a fairly hard thing to paralleliz (how the hell do you spell that?) because of the interdependence of pieces of the model. This would seem to me to make a Beowulf cluster a tough choice as it's inter-CPU bandwidth is pretty low right? And that's why I thought most weather prediction places chose high end super-computers because of their custom and expensive inter-CPU I/O?

    Second part: Is weather prediction getting any better? Everything I've read about dynamic systems says that prediction past a certain level of detail or timeframe is impossible. Is that true?

    Disclaimer: I might be dumb.

    Hotnutz.com - Funny

    --

  5. What about a dnet type client? by x0 · · Score: 5

    I am curious as to whether (no pun intended...:)) or not you have ever done any testing to see if a distributed.net type enviornment would be useful for your type of work?

    It seems to me that there are more than a few people who are willing to donate spare cpu cycles for various projects. At a minimum. you could concentrate on the client side binaries and not worry as mouch about hardware issues.

    --
    In the immortal words of Socrates, who said; 'I drank what?'
  6. Beowulf Alternatives? by vvulfe · · Score: 5

    Before deciding on a beowulf clusters, what different options did you explore (Cray? IBM?), and what motivated you to choose the Beowulf System?

    Additionally, to what would you compare the system that you are planning to build, as far as computing power is concerned?

    Thanks,
    VVulfe

  7. Biggest whack in the head? by technos · · Score: 5

    Having built a few small ones, I got to know quite a bit about Linux clusters, and about programming for them. Therefore, this question has nothing to with clusters.

    What was the biggest 'WTF was I thinking' on this project? I'd imagine there was a fair amount of lateral space allowed to the designers, and freedom to design also means freedom to screw up.

    --
    .sig: Now legally binding!
  8. Question about maintinance. by Legolas-Greenleaf · · Score: 5
    A major problem with using a beowulf cluster over a single supercomputer is that you now have to administer many computers instead of just one. Additionally, if something is failing/misbehaving/etc., you have to determine which part of the cluster is doing it. I'm interested a] how much of a problem this is over a traditional single machine supercomputer, b] why you chose the beowulf over a single machine considering this factor, and c] how you'll keep this problem to a minimum.

    Besides that, best of luck, and I can't wait to see the final product. ;^)
    -legolas

    i've looked at love from both sides now. from win and lose, and still somehow...

  9. Why alpha? by crow · · Score: 5

    Why did you choose Alpha processors for the individual nodes? Why not something cheaper with more nodes, or something more expensive with fewer nodes? What other configurations did you consider, and why weren't they as good?