LinuxBIOS, BProc-Based Supercomputer For LANL

← Back to Stories (view on slashdot.org)

LinuxBIOS, BProc-Based Supercomputer For LANL

Posted by timothy on Monday October 7, 2002 @04:03PM from the yeah-but-what-about-the-neon-lights dept.

An anonymous reader writes "LANL will be receiving a 1024 node (2048 processor) LinuxBIOS/BProc based supercomputer late this year. The story is at this location. This system is unique in Linux cluster terms due to no disks on compute nodes, using LinuxBIOS and Beoboot to accomplish booting, and BProc for job startup and management. It is officially known as the Science Appliance, but is affectionately known as Pink to the team that is building much of it."

21 of 189 comments (clear)

Not a Beowulf cluster by Anonymous Coward · 2002-10-07 16:14 · Score: 4, Funny

But if you'd replace the expensive high-performance interconnect with a cheap ethernet, then it would be a Beowulf cluster.
LinuxBIOS by Anonymous Coward · 2002-10-07 16:35 · Score: 5, Insightful

I wonder why LinuxBIOS hasn't taken off. I've debated ordering one of their "kits." It seems to me the 3 second boot time of LinuxBIOS should be a selling point for some obscure Linux vendor, but no one really offers it yet.

I really imagine a machine with an 8MB EEPROM/ROM that can be updated as needed, but provides a boot environment and login screen - while spinning the disks in the background. This would make an excellent product.

Why hasn't anyone done this yet?

Curious
Medical (Was:Uses) by srw · 2002-10-07 16:38 · Score: 3, Interesting

A former client who worked at a Cancer Center used a cluster to simulate radiation treatments.
Why not use embedded tech? by Chirs · 2002-10-07 16:46 · Score: 4, Insightful

This sounds like some kind of dual-processor rackmount type solution. Why not go all the way and use something like compactPCI? You can fit 21 cPCI blades into 8U of rackspace.

A standard blade could have up to a couple gigs of ram, a powerpc or p3/p4 cpu, 100BT or 1000BT ethernet, etc, etc.

You boot the things using bootp/tftpboot and then run linux off a ramdisk.

We're using cPCI at work to run VoIP softwitches. Currently we're at over a million calls an hour on a wimpy 450MHz processor.
Lots of chip programming by Anonymous Coward · 2002-10-07 16:47 · Score: 3, Funny

I don't envy the developers... After every revision of LinuxBIOS, they get to reflash 1024 motherboards, which could take a while...
1. Re:Lots of chip programming by Anonymous Coward · 2002-10-07 17:13 · Score: 3, Informative
  
  Not really, a new revision can be flashed with a single utility that can be run on all the nodes in parallel.
Re:Wow... by binaryDigit · 2002-10-07 17:02 · Score: 5, Funny

Think so? Wouldn't a system with disks be more suitable for that

Nah, just one honkin RAMDisk. Could serve up mucho porn/warez, when the feds come knockin, just pull the plug, presto, no evidence :)
Re:Uses by marm · 2002-10-07 17:10 · Score: 5, Interesting
Does anybody know other applications that supercomputers are being used for. I know some do weather predictions.

Ok, non-military uses, off the top of my head:
- mathematical research - simply complicated maths on big numbers
- fluid dynamics modelling - traffic flows, or aerodynamics, or hydrodynamics - this is also tied in quite closely with weather/climate prediction
- statistical modelling - wouldn't you like to know if the stock market is going to go up or down tomorrow, before it happens?
- computational chemistry/biochemistry - protein folding is just the tip of the iceberg - imagine being able to design a molecule and then simulate the effect it will have on the human body, without that substance ever having been actually synthesized or going near a human... this is the future of drug development
- quantum mechanical simulation - related to computational chemistry, imagine taking all those complicated quantum mechanics equations to their logical conclusions, predicting as-yet undiscovered subatomic particles and their behaviour, or to design better magnetic containment fields so that practical fusion energy generation is possible
- good old-fashioned databases and signal processing - when you have hundreds of terabytes of data that you wish to mine for interesting patterns, speed matters
I'm sure there are plenty more applications for supercomputer power - any kind of complicated or chaotic system is a good candidate for modelling, especially when there's more than one unknown variable (multivariate analysis is complicated, to say the least).
betatest: I've uses Bproc and Linux Bios by goombah99 · 2002-10-07 17:27 · Score: 5, Informative

I've been a beta tester on the prototype for this system. It works great. I've seen diskless systems before they all were NFS nighmares, could not scale and had horrible tendencies to cause rippling crashes as one computer after the next timed out on some critical disk based kernel operation it could not complete across a wedged network.
This one, brpoc, is different it is completely stable. You never get NFS wedges. Jobs launch in flash. Plus if you do reboot the whole thing is back up in seconds (literally).
Bproc is an incredibly light weight job submission system. It is so light weight and fast that it changes how you think about sumbitting jobs. Rather than designing long duration jobs and tossing them on queue, you can just run tiny short jobs if you want with no loss to overhead. It makes you re-think the whole idea of batch processing.
when the jobs run they appear in the process list of the master node. That is if you run "top" or "ps" the jobs are listed right there. In fact from the users point of view the whole system looks like just one big computer.

--
Some drink at the fountain of knowledge. Others just gargle.
Re:Uses by afidel · 2002-10-07 17:28 · Score: 3, Interesting

The largest (largest by a long shot it outpowers the rest of the top10 combined) supercomputer in the world is the NEC Earth Simulator in Japan. It is being used to do the most detailed climate modeling ever attempted. Not only that but they are attempting a complete system model which AFAIK has never before been possible. In addition the last couple clusters that I have read about have been for biomedical research, maybe it's just what I read but I believe bioinformatics is going to be one of the biggest pushers of HPC going forward. Genomics is nothing compared to proteonics, mapping the genome probably takes about as much computing power as simulating the folding of one large protein series!

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
Don't do it! by FyRE666 · 2002-10-07 17:33 · Score: 4, Funny

I will personally track down and slaughter the first person to mention a popular clustering architecture, and how one might imagine it...

--
Code, Hardware, stuff like that.
1. Re:Don't do it! by Elbereth · 2002-10-07 18:08 · Score: 3, Funny
  
  Imagine a Mosix cluster of them!
Cluster details by MalleusEBHC · 2002-10-07 17:42 · Score: 4, Funny

Cluster Overview:
* 2050 Intel 2.4GHz Xeon processors

Now when people complain about the United States government being responsible for global warming they will have some good hard facts to use.
Re:Wow... by JesseL · 2002-10-07 17:44 · Score: 3, Funny

Is that an imperial shitload or a metric shitload?

--
"Prefiero morir de pie que vivir siempre arrodillado!"
Re:Uses by foobar104 · 2002-10-07 19:07 · Score: 3, Informative

Wrong. Render farms are neither clusters nor supercomputers. At best, a render farm might be considered an array.

A supercomputer is a single system image. Some people call large clusters "supercomputers," but technically they're wrong.

A cluster is an interconnected group of computers that can communicate with each other. Usually a cluster depends on some kind of software layer to allow programs to run across multiple systems, something like MPI. Clusters are tightly interconnected many-to-many systems.

An array has a single job control system and a number of job execution systems. Batch jobs are submitted by users to the job control system, which doles them out to the various execution systems and then collects the results. The execution nodes don't talk to each other, and one job runs on one execution node at a time. Render farms are basically arrays; each execution node works on rendering a single frame of a multiframe animation. Because each frame can be rendered independently, without any dependencies on the previous and subsequent frames, rendering is particularly well suited to array computing.
Good Stuff by Perdo · 2002-10-07 19:54 · Score: 3, Interesting

"The Science Appliance" as it is dubbed will use dual processor AMD based nodes.

Scary part is that this will be one of the top 5 supercomputers in the world.

Scary because you could buy all the hardware off the shelf for about half a million dollars.

On a lighter note:

"The Linux NetworX cluster will be used solely for unclassified computing, including testing on ASCI-relevant unclassified applications."

I think they mean text mode quake.

I guess they got tired of "Global Thermo-Nuclear War"

--
If voting were effective, it would be illegal by now.
1. Re:Good Stuff by Perdo · 2002-10-07 23:40 · Score: 3, Interesting
  
  Good dual amd boards come with gigabit ethernet. With prices as they are, the nodes can be put together for about $350,000. That would leave $150,000 for 512 ports of gigabit switches. Cisco gigabit 48 port switches run $5,000. Double that and add an additional nic to each box and use a flat neighborhood network (.pdf)
  
  That should give each node about 200 MB/s aggregate bandwidth (the best gigabit ethernet runs at 800 Mb/s or 100 MB/s), easily exeeding what can be achieved with much more expensive solutions.
  
  About the cost of a nice house.
  
  Put into perspective, a cluster that could outperform Japan's earth simulator would cost 2 million in hardware costs. Outperforming Seti@home's 3,000,000 users would require $10,000,000.
  
  I know where my lotto money is going :P
  
  --
  If voting were effective, it would be illegal by now.
Don't be so sure by marm · 2002-10-07 22:36 · Score: 3, Insightful

A supercomputer is a single system image. Some people call large clusters "supercomputers," but technically they're wrong.

Says who?

Once upon a time 'supercomputer' meant 'any computer made by Seymour Cray', and this was reasonable, because he (probably) invented the concept. Then there was the mid-80's loose but widely-accepted definition 'any computing system that can do more than 200 MIPS'. Then MIPS went out of fashion and processors got faster and it was 'anything that does more than a GigaFlop'. Or there's the US Department of Commerce definition which was 'any computing system that does more than 195 Mtops (Million theoretical operations per second)' during the 80's, which then got changed to 1500 Mtops and is probably something different now.

Note that most Linux cluster systems would meet the requirements of most of these - indeed, most single-CPU computers today would meet most of these requirements, which is how Apple manages to get away with calling the G4 a 'supercomputer'.

Really, these days 'supercomputer' means absolutely anything you want it to be, although if I had to define it, I think probably the fairest definition would be 'anything that can run the LINPACK benchmark suite and get on the Top500 list'.

Nice try at creative redefinition though.
1. Re:Don't be so sure by foobar104 · 2002-10-08 03:56 · Score: 3, Informative
  
  The important thing to notice about the word "supercomputer" is that it's singular. A supercomputer is a single system image; this is implicit in the definition. This is not to say that supercomputing clusters aren't worthy; it's just that they're different in important ways from single-system-image supercomputers.
  
  Some classes of problems aren't suited for cluster computation. I won't pretend to be educated enough to tell you exactly which problems can and can't be adapted for cluster computation, but consider the nature of clusters to see my point. Clusters are highly scalable, but the inter-node latency is huge. An interconnect like Myrinet can get your remote messaging latencies down to the microsecond range, but the far more common MPI/PVM-over-Ethernet solution is a thousand times slower than that. This makes it somewhat inefficient for node N to try to access a bank of memory on node M. In order for a cluster to be efficient, each node should have sufficient physical memory to hold it's entire data set, and each node should be able to operate more-or-less autonomously, without having to contact other nodes.
  
  Supercomputers are fundamentally different from clusters. In some cases, you can do the same job with either a supercomputer or a cluster. Some jobs are better suited to clusters, while some are better suited to supercomputers. Some jobs, as I mentioned above, are better suited to arrays than to either clusters or supercomputers. It just depends on the job.
Re:cannot use pink by Neon+Spiral+Injector · 2002-10-08 00:04 · Score: 3, Funny

And what about the pop star who needs a belt for her pants (but I hope she doesn't get one)?
Re:AMD Opteron by Hoser+McMoose · 2002-10-08 00:31 · Score: 3, Informative

Ugg.. I do WISH that people would stop reading "Tom's Hardware", or at least that they would get a clue first and realize that Tom doesn't know dick-all about what he's talking about most of the time.

His comments about heat rising more then 1C/second make NO SENSE AT ALL! It's flat-out wrong! I don't know what orafice he pulled that comment from, but it certainly had no technical backing to it. The chip uses a thermal diode. It will tell you the temperature whenever you poll it. It doesn't matter how fast or slow you poll it, it will give you the temp. You would really have to go out of your way to try to break this sort of data to get it to only be able to handle a 1C/s temp increase.

As for the heat "problem". AMD's AthlonXP chips have a maximum power consumption of roughly 50-70W. Intel's P4's have a maximum power consumption of roughly 50-70W (yes, they consume almost the exact same amount of power, check the data sheets).

For comparison, Intel's Itanium has a maximum power consumption of around 100-130W, and IBM's Power4 is also on the high-side of 100W.