A New Approach To Linux Clusters

Re:Duh! by Anonymous Coward · 2001-08-13 05:01 · Score: 0

I don't know if you know this, but cum streaks if you don't clean it up right away.

Re:Linux by skinney · 2001-08-13 17:05 · Score: 1

I totally agree with your comment. However, my real reasom for posting this is 'cause I have name envy.
heh. wish I would have thought of that...
~Shane www.shanekinney.net

Long Live /.

Re:old ideas come back around, if they are good by Animats · 2001-08-13 06:47 · Score: 2

(CDC 6600) ten Peripheral CPUs (called PPUs)

There was only one PPU on the CDC 6600, equipped with a hardware multiprogramming unit to make it look like ten independent CPUs. The PPU had ten sets of machine state and I/O channels, but only one arithmetic-logic unit (ALU). This was back when the ALU was the expensive part of the CPU.

one of the CPUs should be dedicated to the OS

MacOS 8 on multiprocessors was the last major commercial incarnation of that idea. But that was an ugly hack to put multiprocessing on a uniprocessor OS.

There's a lot to be said for channelized I/O, like mainframes have. All the peripherals look roughly similar to the OS, security is better because peripherals can't write all over memory, and there's less diversity in drivers. Intel tried this, but ran into troubles because they made the channel controllers fully programmable and put a little OS under Windows to run them. Microsoft hates it when you put stuff under their OS.

Re:Plan 9 style architecture? by yukonbob · 2001-08-13 08:05 · Score: 1

Offtopic, but still near-and-dear to the hearts of many readers: How does the licensing for Plan 9 work? I saw a copy of the GPL listed as Exhibit A at the very bottom of their own licensing page.

The licensing looks liberal enough (it's labelled "PLAN 9 OPEN SOURCE LICENSE AGREEMENT" license), but how is it related to the GPL?

-bch

Re:actually it shows why Cray always does so well. by fgodfrey · 2001-08-13 06:49 · Score: 3, Interesting

The Top 500 is not a list of who makes the best machines. It's a list of what real world installations that run LINPAC the best. LINPAC is a benchmark, not a real piece of code. The T3E holds the record for *sustained* performance on real world code. That, in my opinion, and probably the opinions of the people still buying them, is more valuable than any benchmark. The "systems" on the list above the T3E in question are all strongly connected clusters, not single system image machines. Well, the Hitachi and NEC boxes are probably is an SSI. That's not saying they are bad, mind you, just that they aren't single machines. There are still some codes that run best on vector based SMP systems and for those codes, you buy a Cray. Also, for MPI code that communicates *a lot* between nodes, the T3E will run rings around a cluster.

Just to sumarise my basic point: The Top 500 is a benchmark and is not necesarily a good indication of who makes better computers than who.

--
Go Badgers! -- #include "std/disclaimer.h"

this brings up something.. by xtermz · 2001-08-13 04:28 · Score: 3, Interesting

i have been thinking about for quite a while now. Beauwould clusters are all nice and well, but what about joe sixpack with some it background like me who wants to get some sort of cluster going on. what methods are available (be it a simplified beouwoulf cluster or whatever...) for the guy with 3 or 4 old machines who wants to waste some electricity and try his hand at clustering some machines. is it possible to do it without being a CS major, or is it just a matter of having enough time/resources...

--

I lost my concept of community when my community lost all concept of me.

Re:this brings up something.. by gnuLNX · 2001-08-13 05:54 · Score: 2, Informative

Yes. Building a cluster is fairly easy if you know a little linux and some programming. Check out http://www.beowulf.org and some of the other sites about information. I am getting ready to build a 20 node cluster with pentium III 1Ghz processors. I am playing the waiting game right now with the University and shipping company however. If you don't mind writing your own distributed applications ( using CORBA or some other libriaries) then you can set up the cluster as four individual machines. Well actually if you have 4 then one will be the rserver node and 3 will be the slave nodes. the server node will need 2 ethernet cards. It will be the only node that connects to the outside world. The other nodes can then be connected by NFS. One peice of advice. Before you build a cluster you should first decide what you are bui;lding it for. Not all software can scale to parallel computing. You must first design your problem then build your cluster. Many cluster are build and run tailored to the problem they are solving. For instance the cluster I am building will be more like a network of Workstations then a beowulf. But for my particular problem it will work in the same way. There are alot of sites out there pertaining to beowulf clusters. You need some inux experiance and some hacker ethics, but it is dooable by anybody for sure. Have fun

--
what?
Re:this brings up something.. by Meech · 2001-08-13 06:55 · Score: 1

Beowulf Clusters are not that hard to build. the only differnce between building an 18 node cluster and a 4 node cluster is the number of computers. If you have the hardware, some free time, and the patience, then setting one up is not that hard. The hard part is actually doing something with one. Programming in MPI and PVM is not an easy task even with a degree in CS (unless you have no life). Check out Beowulf.org for more info.
Re:this brings up something.. by jageryager · 2001-08-13 06:36 · Score: 2, Informative

This link describes how MOSIX can be best applied.
The best solution for any distributed computing problem depends on that problem. How CPU intensive is the job? How much data will need to be distributed to nodes. Do intermediate processing steps require intermediate answers from other nodes? How fast is the CPU? How fast is the network?
Basically, if you have a lot of processing that could be manually distributing to a bunch of hosts, via rsh, or rlogin, then MOSIX can be used to easily manage/monitor that work with no coding. For harder problems that couldn't be manually distributed you might need MPI or PVM with special code in order to do the equivalent of "threaded" distributed computing.
Manual distribution is often easiest when you have network shared filesystems ( NSF, etc.,) and so is MOSIX.
MPI is short for Message Passing Interface. You can use MPI libs to do interprocess/interhost messaging and I/O on non homogenous networks. MPI does not require shared filesystems, though your own project might use them. MPI is certainly easier to manage when you have shared filesystems. One must be careful to conider the I/O time involved in network read/writes. Also note that multiple network nodes will clobber each other's data if they all try to write to the same file over the network.
MPI, or PVM is often used when the problem of breaking the job into pieces, or putting the results back together is non-trivial. For instance, if you were doing very processor intensive image processing on a large file, you may need to break the image into pieces, or tiles, and then distribute the processing of the tiles, with some processors sharing intermediate results, then stitch the results back into one file and finally write that file.
The approach that Unlimited Scale is using only makes sense in limited cases, i.e.., when computers are " getting bogged down in processing interrupt requests from peripherals." In general, multithreaded processing, or even distributed processing, only makes sense when I/O time is dwarfed by CPU time. SMP machines have an advantage in that they have really fast, communication between nodes, compared to Ethernet. Beowulf clusters have relatively slow communication between nodes. Beowulf clusters can only really be effective in the more CPU bound and less I/O intensive jobs. But if you have a job that can be run faster on a Linux cluster, you can save the big bucks on your initial hardware purchase. The more CPU intesive the processing is, the slower the network you can put up with. SETI is a good example of this. If it takes 12 hours to process a packet of data, then I/O over a 56K modem is OK.

--
"They that give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety"-B.Franklin
Re:this brings up something.. by Raging+Idiot · 2001-08-13 04:52 · Score: 2, Interesting

Normally I just troll, but this is important.
MOSIX works fucking awesome. I used it to compress MP3s. I ripped waves and stored them on my bad-ass machine. Then I ran a at daemon on my slowest machine and ran compression routines in the at q in batch mode. By running on the slowest machine in the group it guaranteed that the jobs would migrate to the faster machines in the group and the slowest machine remained as a "task manager". It would run instances until every machine was busy, then batch would hold jobs until a machine came free, then it would release and on with the show.
It works EXTREMELY well. Try MOSIX for some serious fun.

--

Stupidity never felt so good.
Re:this brings up something.. by baptiste · 2001-08-13 04:33 · Score: 5, Informative

Try a MOSIX Cluster This type of cluster spreads processes out to the machine with the least load. A Beowulf can be done, but to take advantage of it, you have to run custom software that is capable of parallel processing.

--
Top Most Bizarre/Disturbing Error Messages

LNXI has cool solution to cooling Beowulf clusters by cworley · 2001-08-13 05:45 · Score: 2

These folks are using standard rack mounts, fitting 5 standard ATX motherboards in 8u of rack space (no special motherboard needed). They mount them vertically which makes cooling much more efficient when piling large numbers of CPU's into a small space.

--
When I die, please cast my ashes upon Bill Gates -- for once, make him clean up after me!

Re:Request for help by laertes · 2001-08-13 05:46 · Score: 2, Informative

The most obvious solution is to use some sort of byte code, but you said that speed was an issue. If you're using Linux, you might want to look into dl_open, a library call. Dl_open lets you load dynamically linkable libraries at run time.

I would imagine you would use this as follows; first, you'd get some data points with which to calculate from a server. Then, you'd also get the name of a shared library which is on a server (NFS mounted probably). This library has a function name 'calc' or some such that does that calculation. You can then call that function, and post the results somewhere.

I would avoid using MPI or PVM, since those are not designed for farming out data the way you are. You should probably use your own job control protocol. Also, you might want to allow for multiple archetectures, naming the library foo-0.0.1.i386.so and foo-0.0.1.alpha.so and so forth,

--

Yes, I'm still a junky. Are you still a bitch?

From the last Cray notice by HerrGlock · 2001-08-13 04:42 · Score: 2, Interesting

There were a few "Because Linux does not scale well with multiple servers" posts about why someone would use a mainframe as opposed to a Beowulf.

Well, it looks like there are people working on the task. But that's not the real point, the right tool for the right job is the point. A whole lot of processes that do not require another process to finish before the next one is where Beowulfs shine, if you want throughput or process with dependancies then a mainframe is your best bet.

But it's still nice to have an alternative for those of us who cannot afford a mainframe.

DanH

--
Cav Pilot's Reference Page
UNIX - Not just for Vestal Virgins anymore

Re:In the words of Seymour Cray: by Anonymous Coward · 2001-08-13 04:44 · Score: 1, Funny

I'd like to see the chickens have a go at it.

That depends.... by nizo · 2001-08-13 05:51 · Score: 1

If my neighbor just gave me 1024 free chickens because he upgradeded to two oxen, and I didn't like to eat chicken, perhaps getting them to plow a field would be useful?

--
I Am My Own Worst Enemy

Re:In the words of Seymour Cray: by m2 · 2001-08-13 07:02 · Score: 2

Two strong oxen or 1024 chickens?

I don't know. What's the tradeoff? Lots of chicken droppings on your field? The soil can use some nutrients, I'm sure. Can the chickens do it? Have you tried or is it a hunch of yours? Who eats more? The two oxen or the chickens? What about the sleeping place? Sure, those are a lot of chickens, but they don't have a problem if you sit them side to side and they don't have a problem with sleeping in three or four rows. Manageability, yes, that's a problem... now we see a real tradeoff. Being able to replace a chicken if it dies (and chickens are cheap) or one (possibly both) of your oxen, which are not cheap, versus manageability of the two oxen. Sure, feeding the chickens is also a problem, as well as collecting the eggs wow! byproducts! suddenly I can do something with the chickens that I couldn't do with the oxen.

I like this analogy.

In the words of Seymour Cray: by LocalYokel · 2001-08-13 04:33 · Score: 5, Funny

If you were plowing a field, which would you rather use?
Two strong oxen or 1024 chickens?

--

--
E2 IN2 IE?

Re:In the words of Seymour Cray: by Anonymous Coward · 2001-08-13 11:12 · Score: 1, Funny

"You want me to do what with a million monkeys?"

Have them take another crack at writing the DMCA. That first one didn't come out so well.
Re:In the words of Seymour Cray: by Temkin · 2001-08-13 04:43 · Score: 1

If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?

I've also heard this attributed to one of Pyramid's VP's, while drawing comparisons to Sequent systems. Anyone remember Pyramid?

Temkin
Re:In the words of Seymour Cray: by Anonymous Coward · 2001-08-13 06:07 · Score: 0

I believe this is actually a quote from Mike Ess, not Seymour Cray. Or atleast that is what I heard. Seymour only reluctantly embraced more than one oxen.
Re:In the words of Seymour Cray: by alexjohns · 2001-08-13 08:33 · Score: 1

I once saw some celebrity charity thing where they had a bunch of kids (seemed like they were about 7 years old) compete in a tug-of-war with Lou Ferrigno (The Hulk on the old TV shows.) They had Lou just stand still, holding the rope, while they kept adding kids. Guess how many kids it took? Only about 10. I was quite surprised. The obvious strength advantage of an ox over chickens might not be as big as you think.

If your task is just plowing a field, and you just had your standard plows, perhaps oxen would be the best solution. Have you ever tried hitching a bunch of chickens to a bunch of mini-plows? Might be worth a shot. Depending on how big a field, you could easily hire someone else to plow it for you and pay them in a couple of dozen chickens. Or rent a tractor for the price of some chickens.

Overall, if my life didn't depend on getting that field plowed, I might choose the chickens just to see if I could come up with an innovative solution to the problem. It would be more fun than walking behind some oxen, breaking my back. Reminds me of the canonical story of the student given the task of measuring the height of a building with no tools other than a barometer. Seems like the more intellectually interesting solutions would come out of using the chickens.

To sum it up, tilling a field with oxen sounds boring. Trying to achieve the same result with a 1000 chickens sound like fun (for a day or two, anyway.) Then, on to the next project. You want me to do what with a million monkeys?
Re:In the words of Seymour Cray: by pyg · 2001-08-13 09:21 · Score: 1

A permaculturalist would choose the chickens.
Re:In the words of Seymour Cray: by Scooter · 2001-08-13 12:17 · Score: 1

hmm good job field ploughing is not high on my list of computing tasks...
Re:In the words of Seymour Cray: by Black+Parrot · 2001-08-13 05:28 · Score: 3, Funny

> If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?

Either one makes for a fine bar-b-que, once the plowin's done.

--
Sheesh, evil *and* a jerk. -- Jade
Re:In the words of Seymour Cray: by fitz22 · 2001-08-14 02:40 · Score: 1

If you were planting seeds in a field, which would you rather use?
Two strong oxen or 1024 chickens?
Re:In the words of Seymour Cray: by Anonymous Coward · 2001-08-13 05:50 · Score: 0

John receives a phone call.
"Hello," he answers.

The voice on the other end says, "This is Susan. We met at a party about 3 months ago."

John: "Hmm... Susan? About 3 months ago?"

Susan: "Yes, it was at Bill's house. After the party you took me home. On the way we parked and got into the back seat. You told me I was a good sport."

John: "Oh, yeah! Susan! How are you?"

Susan: "I'm pregnant and I'm going to kill myself."

John: "Say, you ARE a good sport."
Re:In the words of Seymour Cray: by dynoman7 · 2001-08-14 00:29 · Score: 1

You have successfully converted your field into a mud hole.

Please reboot.

--
Blarf.
Re:In the words of Seymour Cray: by Anonymous Coward · 2001-08-13 05:59 · Score: 0

One Fall day, Bill was out raking leaves when he noticed a hearse slowly drive by. Following the first hearse, was a second hearse which was followed by a man walking solemnly along, followed by a dog, and then about 200 men walking in single file. Intrigued, Bill went up to the man following the second hearse and asked him who was in the first hearse.

"My wife," the man replied.

"I'm sorry," said Bill. "What happened to her?"

"My dog bit her and she died."

Bill then asked the man who was in the second hearse.
The man replied, "My mother-in-law. My dog bit her and she died as well."

Bill thought about this for a while. He finally asked the man, "Can I borrow your dog?" To which the man replied, "Get in line."
Re:In the words of Seymour Cray: by Anonymous Coward · 2001-08-13 06:35 · Score: 0

There was a married couple who were in a terrible accident. The woman's face was burned severely. The doctor told the husband they couldn't graft any skin from her body because she was so skinny.

The husband then donated some of his skin..... however, the only place suitable to the doctor was from his buttocks. The husband requested that no one be told of this, because after all this was a very delicate matter!

After the surgery was completed, everyone was astounded at the woman's new beauty. She looked more beautiful than she ever did before! All her friends and relatives just raved about her youthful beauty!

She was alone with her husband one day and she wanted to thank him for what he had done. She said, "Dear, I just want to thank you for everything you did for me! There is no way I could ever repay you!!!"

He replied, "Oh don't worry, Honey, I get plenty thanks enough every time your mother comes over and kisses you on your cheek!!"
Re:In the words of Seymour Cray: by Anonymous Coward · 2001-08-13 06:08 · Score: 0

A guy goes to buy a train ticket, and the girl selling tickets has an incredible set of jugs. He says, "Give me two pickets to Titsburgh...umm...I mean, two tickets to Pittsburgh."

He's really embarrassed... The guy in line behind him says, "Relax, pal. We all make Freudian slips like that. Just the other day at the breakfast table I meant to say to my wife, 'Please pass the sugar', but I accidentally said, 'You fucking bitch, you wrecked my life.'"
Re:In the words of Seymour Cray: by Beatbyte · 2001-08-13 04:44 · Score: 1

Neither.
1024 Oxen.

;-)

--
Get paid to code OSS
Re:In the words of Seymour Cray: by phutureboy · 2001-08-13 06:05 · Score: 2, Insightful

Somehow I can't see 1024 chickens all agreeing to go in the same direction at the same time.
Re:In the words of Seymour Cray: by nettdata · 2001-08-13 05:31 · Score: 1

If you were plowing a field, which would you rather use? Two strong oxen or 1024 chickens?

Well, if the chickens were $2 a piece, and the oxen were $25,000 per, and I only had $5,000 to my name, I'd have to say the chickens.

--

$0.02 (CDN)
Re:In the words of Seymour Cray: by kiwaiti · 2001-08-13 04:50 · Score: 2, Funny

You have successfully converted your field into a mud hole.
What next?
Kiwaiti

--
Member of the Legion Of Microsoft Haters

Re:Request for help by tbo · 2001-08-13 05:14 · Score: 2

I think this may be a case where a bit more thinking and literature research about the problem would help a great deal.

We're looking at using MINUIT, a package written by the computing divsion at CERN, as our fitting engine. MINUIT's algorithms are quite advanced, and it's commonly recognized within the physics community as the best general-purpose fitting package out there.

I think you may not realize how complicated the functions we're trying to fit are. Here's the quick and simple version: we study magnetic fields within superconductors and semiconductors on a microscopic level. We do this by using spin-polarized muons or radioactive light ions as a probe, and measuring anisotropy of the emitted decay products. That data then has to be compared against complex models of superconductivity. The computationally expensive part here is calculating the values predicted by the model for a given set of parameters. This has to be done once for each data point to calculate chi-squared, and repeated many times (once in each iteration of the fitting process), each time with different parameters. The models typically contain difficult integrals which must be evaluated numerically thousands of times with very high precision.

Since the function we're trying to fit changes fairly often depending on the sample and measurement techniques used, it's not practical for us to spend huge amounts of time optimizing each individual function to be fitted. The fitting package is already optimized, so the only thing left is to parallelize it.

TERAS has part of it's clusters running linux.... by fuzzel · 2001-08-13 22:58 · Score: 1

Check out SARA: TERAS' is a 1024-CPU system consisting of two 512-CPU SGI Origin 3800 systems. This machine has a peak performance of 1 TFlops (1012 floating point operations) per second. The machine will be fitted with 500MHz R14000 CPUs organized in 256 4-CPU nodes and will possess 1 TByte of memory in total. 10 TByte of on-line storage and 100 TByte near-line StorageTek storage will be available. 'TERAS' will consist of 44 racks, 32 racks containing CPUs and routers, 8 I/O racks and 4 racks containing disks.

The fun part: parts of this huge machine are running Linux :)

For more closeup pictures see: http://unfix.org/news/sara/

Ain't it sweeeeeeeeeeet?

--
http://unfix.org

Parallel Sysplex anyone? by gelfling · 2001-08-13 07:55 · Score: 2

Carve up a big mainframe with 24 or so processors and bind the OS to a few of them. Get another mainframe and use it as a mezzanine backplane to gang together other mainframes each with their OS images bound to specific processors. Run all networking and IO through their own processors to offload the CECs. Add some more ASICs to handle crypto. Toss all console activity off to another 'special' processor. Run a hypervisor over the whole shebang to control all of the guest images.

Voila you've reinvented parallel sysplex with VM for Linux running on 'cheap' hardware.

Except how cheap do they expect it to be?

*yawn* by Max+Entropy · 2001-08-13 04:51 · Score: 0, Offtopic

I tried to post an item on this *months* ago and, of course, it was rejected.

People oughta listen to me...

Request for help by tbo · 2001-08-13 04:37 · Score: 5, Interesting

I'm trying to design a specialized data-fitting program to be used for accelerator-based condensed matter physics (and maybe ultimately other branches of science as well). I need information on adding clustering support to this program. Here's a brief description of what the program does:

The user writes a small chunk of code that calculates the function they're trying to fit the data to. We require the user to code the function him/herself because speed is important, and some of these functions are too difficult for Mathematica or the like to fit. Once the user writes their function, it's linked (dynamically) with the rest of the code. The user then passes in a parameter file, and away it goes.

Many of these fits can take days, and, since they often have to be repeated many times with slight changes to the fitted function or initial parameters, this is a serious concern.

Can this new approach to Linux clusters be used here? We have tons of Linux boxes lying around that are being used for other things, but have lots and lots of spare cycles. We probably couldn't afford a dedicated processing farm, but we could easily live with something like distributed.net where the program transparently takes all the spare cycles.

I know the problem is parallelizable, since each node can calculate the value of the function at a few of the data points, then send back to the "master" the chi-squared contribution of those points. Each iteration of the fitting process, the master sends out the current parameter values, and then the nodes grind away... There's not too much communication required.

One of my big concerns is how to get the user-written function from the "master" computer to all the "slaves". It's unrealistic to expect the user to manually install it on all the machines each time something in the function gets tweaked and it's recompiled. Are there pre-existing standards on how to send code to nodes in a cluster, then have it executed?

Any advice or pointers to good starting places on distributed computing would be much appreciated.

BTW, as a hint to all the other comp sci geeks out there--physics is a great place to find new and challenging computing problems (I'm not claiming this is one). In particular, the particle physics people often have to deal with spectacular data rates, and do extremely complicated event reconstruction. Check it out some time.

Re:Request for help by san · 2001-08-13 04:48 · Score: 5, Informative

Hi
The normal way to operate a cluster is to have a shared (NFS) file system across all the systems, thereby solving the data distribution problem (please note though that this prevents you from doing too much file base IO because it's too slow, you might want to make a local /scratch directory on each node)
Besides the NFS share you'll need some kind of parallel programming library like MPI or pvm, and a job scheduler of some sorts. The libraries you can find on the web (maybe in precompiler RPMS, look for the mpich MPI implementation for a start), and will provide you with a programming framework for doing all the networking and setting up the topology. The scheduler can be as simple as the one provided with MPI/pvm (ie. you name a few hosts and your job gets run on those), or, if there's a number of people accessing the cluster at the same time, you might want to try a real queuer (like gridware).
The parallellization is something you'll have to do yourself and it's the hardest part of clustering.
Hope this helps :-)
Re:Request for help by Anonymous Coward · 2001-08-13 04:42 · Score: 1, Informative

Check out http://www.cs.wisc.edu/condor/mw/
Re:Request for help by mj6798 · 2001-08-13 04:50 · Score: 2, Informative

Can this new approach to Linux clusters be used here?
Use PVM or MPI. Both exist prepackaged for most major Linux distributions.
I'm trying to design a specialized data-fitting program to be used for accelerator-based condensed matter physics.
I think this may be a case where a bit more thinking and literature research about the problem would help a great deal. People solve extremely complex data fitting problems on modern PCs without the need for parallel processing, and there are very sophisticated algorithms for doing this. You should probably talk to local experts in statistics, computer science, and pattern recognition.
Re:Request for help by Anonymous Coward · 2001-08-13 05:42 · Score: 1, Informative

Divide the number of data points by nodes on your network. Associate each data point range with IP's of the available machines. Set it up so when the program runs, it will check the IP of the computer that's running it and process the approriate datapoint.
Set up nfs and then write a bash script that will execute whatever gets sent there.
When you wanted to run something, you would just copy it to the server nfs directory and all the nodes would process their respective datapoints, automatically.
Re:Request for help by Anonymous Coward · 2001-08-13 05:10 · Score: 1, Interesting

This problem is trivially easy if you use any of the well developed architechtures for distributed parallel computing. MPI is the most popular, PVM was, but is going out of style. MPI is a cluster based interprocess communication system + remote program invocation. Look at the websites of the major open source implementations of the MPI standard, MPICH (from Argonne Natl. Labs) or LAM/MPI (Notre Dame Uni.) for more information. There's a ton of tutorials and on-line training courses online. -Colin

Re:I wonder... by Anonymous Coward · 2001-08-13 16:08 · Score: 0

I'm wondering if the rest of the engineers groan when one 1337 h4x0r takes a look at a computer and says:

...

(clears throat)

...

root@desktop:/root# AllYourBeowulfClusterAreBelongToUs

Dimishing Returns..... by Wiwi+Jumbo · 2001-08-13 05:53 · Score: 1

'As the number of CPUs in a Beowulf-style cluster-a group of PCs linked via Ethernet-increases and memory is distributed instead of shared, the efficiency of each processor drops as more are added,'

Does that mean you could reach a point where adding another node actually slows everything down?

Weird....

--
Wiwi
"I trust in my abilities,
but I want more then they offer"

Re:Dimishing Returns..... by Chakat · 2001-08-13 06:34 · Score: 1

Does that mean you could reach a point where adding another node actually slows everything down?

Kind of. You just start blindly adding nodes to a cluster, efficiency drops as the processes are deadlocked more and more, waiting for that relevant network traffic. You may not ever reach negative efficiency, but you will get no gains, therefore, just burning electricity.
Speaking of burning electricity, I wonder if I should start back up implementation of my crazy idea of a PC/Mac/Sun/NeXT/SGI/Alpha/VAX/RS6000 cluster...

--
If god had intended you to be naked, you would have been born that way.
Re:Dimishing Returns..... by pogofish · 2001-08-13 06:50 · Score: 1

Does that mean you could reach a point where adding another node actually slows everything down?
Sure. Just take one example. Imagine a problem that requires the cluster to munch on a large set of data. The initial set up of the problem requires that all the processors get their piece(s) of data. Maybe this is done via an RPC style mechanism, or by reading from a common file share or any number of other techniuqes. Doesn't matter. That initial set up takes up network resources.

At some point, the network (or the shared disk, or something) will become a bottleneck. When it takes more time to get the data through that bottleneck than it takes for the processing to actually complete, then you've reached a point of negative returns: adding nodes decreases performance.

--

A man without a God is like a fish without a bicycle.
Re:Dimishing Returns..... by drnomad · 2001-08-13 07:56 · Score: 1

I'm not sure, but I heard that this is about 128 CPU's. In that case the overhead is so big, that adding another CPU would only slow the whole system down.
They discovered this when designing computer algorithmns for parallel systems. So even when there's no Operating system running the CPU's - only the program (ie "Occam" like), the overall overhead needed for the CPU's to cooperate, and safe concurrency, simply becomes too big.
Unimaginable? What's the deal:
* amount of productive CPU-cycles for overhead (Po) per second
* amount of improductive CPU-cycles for synchronizing waits (Pw) (waits for other CPU's, waits for communication bus, wait for memory access etc. -> concurrency cycles) per second
* amount of real CPU-cycles per second (Rc)
* amount of application productive CPU-cycles (Pa) per second
Now put those into a formula, and suppose that Cs is a constant... or a slow changing constant:
Total overhead cycles: Toc=(c1*N*Po)+(c2*N*Pw) (where N is number of CPU's, c1 and c2 some constants...)
Total application productive cycles per CPU, per second: Pa = Rc - Toc
The formula's aren't probably like this, but I'm trying to give an indication. From "Toc" we now know that Pa reduces (above lineairly - probably not true in real life) with every added CPU.

--
Bizar technology?

Re:Huge Market for Supercomputers Will Come... by MikeyO · 2001-08-13 14:31 · Score: 1

The reason that high speed computing has not taken off is that there are currently no consumer apps that require it.

You've obviously never tried to open a spreadsheet in StarOffice.

*yawn* Already been done by Anonymous Coward · 2001-08-13 05:44 · Score: 0

This idea isn't new at all and has been in practice for a long time in both large machines and in clusters. In clustering, for example, the C-Plant at Sandia uses this type of topology and is on the Top500. I'm pretty sure they have published papers on their topology.

Re:old ideas come back around, if they are good by Black+Parrot · 2001-08-13 05:36 · Score: 2

> In that incarnation, the two central CPUs ran only user applications, while the operating system, with all its interrupts, OS code, and device drivers, would reside nearby in the ten Peripheral CPUs (called PPUs) provided for this purpose.

When I heard a CS professor talk about putting multiple CPUs on a single chip, I suggested that one of the CPUs should be dedicated to the OS, which would mean that it wouldn't even need a FP unit. So for (say) 4 or 8 computers on a chip, one would be a "OS server" and the others would be "application servers". Ditching the FP on the "OS server" might allow an extra-high-performance design for it. And the others would only need context switches when the OS demanded it, rather than one for every stinking interrupt that came along.

--
Sheesh, evil *and* a jerk. -- Jade

My head hurts! by Dutchmaan · 2001-08-13 04:23 · Score: 0, Troll

Can you imagine a Beowulf-cluster of beowulf-cluster posts...?

Re:My head hurts! by ackthpt · 2001-08-13 05:40 · Score: 1

I resisted the urge.
However, after reading the article, I suggest you go back and update all the previous "Imagine a Beowulf Cluster of ..." posts, as they're now rendered out of date.
Thanks.

--

A feeling of having made the same mistake before: Deja Foobar

Remember? by Anonymous Coward · 2001-08-13 08:02 · Score: 0

What happened to OSCAR?

And yet. by ackthpt · 2001-08-13 05:43 · Score: 1

These were Ex-Cray, which perhaps says something in and of itself...

--

A feeling of having made the same mistake before: Deja Foobar

Plan 9 style architecture? by Anonymous Coward · 2001-08-13 05:06 · Score: 1, Interesting

How in general is this different than the approach taken by Plan 9? (http://plan9.bell-labs.com/sys/doc/9.html)

Those Fools! by Mtgman · 2001-08-13 04:30 · Score: 4, Funny

Treating each node as a peer! Don't they know that Peer to Peer networks are stealing from our musicians and corrupting our youth! I just hope they can repent before the heavy hand of justice comse down on them.

Steven

--
-- I have marked myself unwilling to moderate-- I don't have other accounts to artificially inflate the karma of

Re:Mixed architectures. by Anonymous Coward · 2001-08-13 05:48 · Score: 0

Yes, this has been done for a while, usually assignment by hand of the programs that run on which nodes. The hard part is getting an automatic reservation system to figure this out for you. There are some batch schedulers that can schedule this type of stuff for you but you still have to identify which nodes will run what program. In the embedded space this has been an issue for quite a long time. For example an embedded machine may have X number of GPUs (i.e. G4s) and Y number of some DSP (i.e. Sharcs) nodes in the same box and you want the appropriate nodes to do the right things.

Re:actually it shows why Cray always does so well. by Oestergaard · 2001-08-13 07:33 · Score: 2

LINPACK is computationally O(n^3) and O(n^2) wrt. communications, for problem size n.

That's not a completely unfair benchmark - but of course you're right it's a benchmark and therefore it does not cover every possible problem out there. However, it is based on the common linear-algebra routines that are the core of a very large part of the scientific computing problems being run out there.

Really so new? by Root+Down · 2001-08-13 07:50 · Score: 1

It seems to me that they are just using asymmetric multiprocessing on a distributed Linux cluster. (Asymmetric meaning that each CPU has a specific function.) This idea is certainly nothing new, though the novelty might be in using tailored Linux instead of the typical UNIX environment. (Different processors, typically.) Of course, this is an area that has been the focus of a great deal of research, so perhaps they are just attempting to move this idea into the private sector by using the less expensive Linux distros as a viable economic alternative for greater computational power?

Root DOWN
grep what -i sed?

Can you imagine... by Anonymous Coward · 2001-08-13 05:26 · Score: 0

Can you imagine... wait. I think they already did.

Well, then imagine Natalie Portman Naked & Petrified, then it's ALL good.

Brian Moyles by Anonymous Coward · 2001-08-13 10:49 · Score: 0

Brian Moyles is a pussy.

From what I can gather, by Anonymous Coward · 2001-08-13 04:30 · Score: 0

They're going to have some nodes in the cluster dedicated to message routing with a specially optimized version of Linux. Which would mean that adding an extra node would increase the load on the router node but not the other nodes in the cluster. Is this correct? And if so, is this revolutionary stuff? Just that it sounds a bit like applying som common sense to me.

I wonder... by Johnny5000 · 2001-08-13 04:41 · Score: 1, Funny

I'm wondering if the rest of the engineers groan when one of them takes a look at a computer and says:

...

(clears throat)

...

"Imagine a beowulf cluster of these things!"

-J5K

--
The libertarian solution to the failures of capitalism is to apply more capitalism til the failures are fixed.

Darn by Anonymous Coward · 2001-08-13 06:54 · Score: 0

If only I hadn't seen a piece of news about a Beowulf cluster of ex-Cray engineers. I wouldn't have had to make a Beowulf remark.

Cray machines are all about parallel processing by Rosco+P.+Coltrane · 2001-08-13 04:44 · Score: 2

"Looks like Cray engineers think about clustering even when they're not at Cray."

Well, duh, Cray machines are massively parallel processing machines, so they're not clusters in the sense that they don't use network cards and separate computers as basic computing units, <OVERSIMPLIFICATION>the processors talk to each other on the same bus and share the same memory</OVERSIMPLIFICATION>, but basically in either case it's about parallel processing. I *hope* Cray engineers think about clusters. I'd hate to see them think about single Athlon supercomputers ...

--
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash

Re:Cray machines are all about parallel processing by bmajik · 2001-08-13 05:29 · Score: 2

Not So fast.

The CRAY 1, 2, XMP, etc etc are all VECTOR machines. Some of them happen to be parallel vector machines (multiple VECTOR processors)

The T3 series are the MPP boxes. Cray's bread and butter was VECTOR machines though. MPP came about because some problems aren't easily vectorizable (but can run on MPPs, oddly enough).

--
My opinions are my own, and do not necessarily represent those of my employer.

Re:here's another approach... by Anonymous Coward · 2001-08-13 05:30 · Score: 0

That would have been a really nice troll, if it wasn't so obvious. Try harder.

Re:Huge Market for Supercomputers Will Come... by Anonymous Coward · 2001-08-13 09:03 · Score: 0

I think it's great. The guy is notorious, universally shunned on Usenet, but on Slashdot, his posts are moderated through the roof.

He's even better then Signal 11 at showing the flaws in moderation.

Re:actually it shows why Cray always does so well. by Styx · 2001-08-13 04:55 · Score: 1

Their homepage seems to be http://www.unlimitedscale.com/.
Unfortunately, it contains absolutely no info on what hey are up to.
groups.google has a tiny bit more.
And a bit on their funding.

Anybody got any more info?

--
/Styx

No there there by Anonymous Coward · 2001-08-13 05:33 · Score: 0

Why post this article? It says "we're CRAY folks and we're doing cool stuff with LINUX to improve BEOWULF clusters." a) there's absoutely no meat here - just buzz words. b) Almost no one cares about scaling clusters to thousands of nodes for any reason, even fewer for running parallel codes. c) An EFFECTIVE cluster with thousands of nodes running PARALLEL codes isn't a Beowulf. The Beowulf name applies to SINGLE USER clusters assembled from COMMODITY Off-The-Shelf components. Cluster this size spend most of thier time running lots of smaller jobs rather than devoting the whole machine to a single problem (so, not single-user), and you can't scale many (any?) parallel codes to a thousand nodes using commodity hardware.

Gnutella parallel... by Saeger · 2001-08-13 05:33 · Score: 2, Interesting

Sounds to me like they've rediscovered the concept of a supernode where it's acknowledged that not all peers are created equal.

(I know--not the best analogy)

--
Power to the Peaceful

Re:Duh! by Anonymous Coward · 2001-08-13 05:29 · Score: 0

but if you let it sit for a few months it becomes crispy and is easily flaked away with a fingernail, putty knife, or similar scraper. YMMV, depending on the surface, namely, how porous the surface is.

The Real Problem by GrEp · 2001-08-13 06:30 · Score: 2

The real limitation of Beowolf style computing is RAM. Beowolf is great if you have programs that paralellize with little intercommunication and low RAM usage. The bigger problem is RAM. Big iron like Crays/SUNs/SGIs all have about a Terabyte of RAM in one place. When you are trying to do large physics calculations you usually have a huge data set you need to store for every time series. Supercomputers aren't cool just because they are fast, but because they can hold HUGE amounts of data in RAM for easy acess. Until PCs get a few gigs of RAM per box cluster computing is still going to be Kludgy no matter what kind of message passing scheme you use.

--

bash-2.04$
bash-2.04$yes "Don't you hate dialup connections?"| write USERNAME

A new light... by snadsnad · 2001-08-13 04:25 · Score: 0

Seems to shed some new light on the previous post about the Cray SV1 and people dispelling Beowulf.

Re:A new light... by snadsnad · 2001-08-13 04:30 · Score: 0

more exact link to the thread here.

Fixed link by Rimbo · 2001-08-13 05:55 · Score: 2, Funny

Try this:

Furbeowulf

Funny. :)

Mixed architectures. by Demon-Xanth · 2001-08-13 05:40 · Score: 1

Would it be concievably possible to use mixed architectures and assign certain tasks or routines to the architecture best suited for them. Rough example: 200MHz Pentium and 200MHz Cyrix system in the same cluster. Two calculations need to be performed, one interger, one floating point. Send the interger to the Cyrix and the floating point to the Intel. Rather than fight with some platforms being better than others at certain tasks, work WITH that fact.

--
If you think education is expensive, you should try ignorance -- Derek Bok, president of Harvard

Mainframe? by battjt · 2001-08-13 05:30 · Score: 1

Is mainframe really the right term? I thought that mainframe usually refered to a IT machine like an IBM 390. One of my client's has a "mainframe" that is only 11 400 Mz processors. The Dell 8 way MSSQL server has more processing power. (and we have verified this with benchmarks and working with IBM.)

Isn't there a differnt term for a very fast computer? Maybe something like "super computer"?

Joe

--
Joe Batt Solid Design

Re:Mainframe? by HerrGlock · 2001-08-13 06:58 · Score: 1

Note to self, don't post before coffee.

Dan

--
Cav Pilot's Reference Page
UNIX - Not just for Vestal Virgins anymore

A cheaper alternative by HRH+King+Lerxst · 2001-08-13 04:34 · Score: 4, Funny

I like the idea of a furry cluster: furbeowulf.

--
No one got beat up more often than the mimes of the old west!

From this week's Byte by wiredog · 2001-08-13 06:25 · Score: 2

SJVN commentary on distributed computing and some interviews with various people in the field.

--

Best Slashdot Co

Custom software by AdamInParadise · 2001-08-13 06:25 · Score: 2

What is important to realize is that in order to use these boxes as a cluster, you will have to wrote you own custom software. Yep, it means C and C++, and hours of hacking.

But as mentioned in a previous post, Mosix can do that for you, if and only if your program can use several instances at the same time. Compressing MP3s is a good example.

--
Nobox: Only simple products.

Re:actually it shows why Cray always does so well. by 3am · 2001-08-13 05:59 · Score: 1

and it's not the top of the line, even, if i'm not mistaken. i thought that was the t90.

--

A: None. The Universe spins the bulb, and the Zen master merely stays out of the way.

Re:actually it shows why Cray always does so well. by Styx · 2001-08-13 05:03 · Score: 1

The portfolio of Quatris Fund (one of the investors in Unlimited Scale) is small, but interestingly diverse.

--
/Styx

Re:actually it shows why Cray always does so well. by Oestergaard · 2001-08-13 05:21 · Score: 3, Informative

Of the top 500 supercomputers in the world, 47 are vector processor machines - the kind of processors that Cray became famous for.

Not one single of these are made in the U.S.

Cray today is a name. It's a brand. It's not a manufacturer of high performance computers.

Just for the record, IBM produced the two fastest computers currently, Intel the third, IBM the fourth, Hitachi the fifth, SGI, IBM, NEC, IBM, IBM, and Finally, number *11* is a Cray based on the Alpha processor (the T3E).

So, tell me again, who was playing catch-up with who ?

Re:Duh! by Raging+Idiot · 2001-08-14 00:55 · Score: 0

That shit went COMPLETELY over your puny little head. Didn't it?

--

Stupidity never felt so good.

old ideas come back around, if they are good by jkorty · 2001-08-13 05:08 · Score: 2, Informative

From the article:

The idea is to free some computers from getting bogged down in processing interrupt requests from peripherals, while letting a second set of machines run the full operating system, furnishing the cluster with networking, job scheduling, input/output, and other capabilities.

The central design theme of the CDC 6400 was exactly this, and it is a product of the mid sixties. In that incarnation, the two central CPUs ran only user applications, while the operating system, with all its interrupts, OS code, and device drivers, would reside nearby in the ten Peripheral CPUs (called PPUs) provided for this purpose. The central CPUs didn't even have an interrupt capability.

Guess who the CDC6400 designer was? Seymour Cray.

Wouldn't HURD be more suitable for such task? by basic · 2001-08-13 19:59 · Score: 1

Wouldn't HURD be more suitable for such task?

--
Basic

Funny, but about your sig.... by Anonymous Coward · 2001-08-13 06:10 · Score: 0

Would that BBQ be roasted on the 'butterfly ballots' whose results were destroyed by the same democrats who tried to run the clock out and got blocked by SCOTUS? The same democrats who cried foul then destroyed the results (recently) when they couldn't manufacture a coup. THANK GOD FOR THE SCOTUS or we would have had 1960 all over again, with some Daley kin ripping off the citizens of the US.

How does may or shall translate via FSC to CANNOT? Idiot.

actually it shows why Cray always does so well.. by TechnoVooDooDaddy · 2001-08-13 04:26 · Score: 3, Insightful

Cray's engineers seem always willing to consider every possibility, whether it be clusters, p2p, parallel, etc.. showing us that they're considering things well outside of what they're currently offering is also showing us why they're still in the game and even ahead in serious computing power after so many years.. IBM, Sun, etc.. have had their rise and falls, but Cray is always mentioned with reverance...

About as innovative as a MS product... by wadetemp · 2001-08-13 13:22 · Score: 1

As long as Beowulf clusters have been around people have been doing this. In a homebrew system made from varying types and qualities of hardware, are you seriously going to have each node doing the exactly the same task? No... you write your program (and Beowulfs are ALL in the programming) so that each node does what it's best suited for. The node with the big hard drive stores the data, the fast machine gets twice as many work units, the slow machine is devoted to taking user input or receving the end result, etc. To do otherwise would be, well, stupid. The weak link in the system would slow the whole thing down.

Creating job classes in a homegeneous cluster is just as useful. I seem to remember someone working on The Collective project at the University of Idaho was doing this with a genetic application. This cluster is pretty close to being homogeneous.

If you visit the site, the Borg penguins are my handiwork. :)

Hey jchristopher... by Anonymous Coward · 2001-08-13 07:06 · Score: 0

You Are Full Of Shit!

what kind of clustering interconnects by soldack · 2001-08-14 06:37 · Score: 2

I am curious about where clustering will go for the connection between nodes. Ethernet, Fibre Channel and various proprietary formats are around but all have issues. InfiniBand is also on the horizon. While I work with InfiniBand development, I am not involved with any kind of clustering work. I see Sockets over InfiniBand as interesting method for inter-node communication. What do those of you who do work in the field think?

--
-- soldack

Re:Duh! by Anonymous Coward · 2001-08-13 04:58 · Score: 0

Wanna go play hockey on the roof? I brought a ball, just one though.

Cray rules by Anonymous Coward · 2001-08-13 05:18 · Score: 0

As a Cray developer I can tell you this new clusters are going to kick some serious ass.
Sincerely, Mike Bouma

if I understand correctly... by dario_moreno · 2001-08-13 05:22 · Score: 2, Interesting

what these guys want to do is to build, say, a cluster of 2 CPU system where one of the CPUs only computes while the other manages I/O and communications. Indeed, the I/O part is really a problem on Beowulves, and dedicating a CPU on it and communication can be cheaper than dedicated network cards like Myrinet (at 1000 $/port) or SCI, and hi-perf I/O like HiPPi. I wonder though if they can beat the price/performance ratio of the latter the way Beowulves beat on raw Flops the ones of traditional supercomputers.

--
Google passes Turing test : see my journal

Huge Market for Supercomputers Will Come... by Louis+Savain · 2001-08-13 05:20 · Score: 3, Interesting

Mentioned with reverence, but still slowly going bust.

The reason that high speed computing has not taken off is that there are currently no consumer apps that require it. Only a few scientific, research and governmental organizations have a need for it. However, let's say there is a breakthrough in AI technology, it will require googles of CPUs and memory. And when that happens, the market will explode.

People are going to want their mechanical maids, baby sitters, gardeners, chauffeurs, lawyers, companions, stock market experts, and what not. I predict they are going to crave their mechanical servants to the point of pathological obssession.

Don't be so sure this won't happen in your lifetime. In fact, there is every reason to suppose that it might happen anytime. There is an awful lot of minds thinking about intelligence and an awful lot of money being spent on it right now. IMO, the solution to the intelligence problem is probably simple. As Dr. Rodney Brooks of MIT says, "Maybe this is wishful thinking, but maybe there really is something that we're missing." Any day now.

In conclusion, I would recommend that you don't sell your shares in the supercomputing sector just yet.

Re:Huge Market for Supercomputers Will Come... by Anonymous Coward · 2001-08-13 07:24 · Score: 0

Obviously you are full of lies and are just a karma whore.

Not much information by morbid · 2001-08-13 05:08 · Score: 0

There wasn't much at all in that article other than what had been said in the caption on /.
Does anyone know what their system does and how it differs from e.g. Beowulf and MOSIX (or GridWare for that matter)?

I don't suppose anyone will reply since I post at 0.

--
I'm out of my tree just now but please feel free to leave a banana.

Some options by truthsearch · 2001-08-13 05:09 · Score: 2

This article covers three distributed OS options, with some intro explination of the difficulties. I would think the easiest (not necessarily the best) solution could be to use Mosix (listed last in the article) and thread your application to a logical extent. Mosix won't interfere with your current linux boxes, just add it on. The tasks will automatically be load-balanced among the machines.

--
Developers: We can use your help.

Could you imagine... by Uttles · 2001-08-13 07:57 · Score: 0, Troll

A cluster fuck of these?

--

~ now you know

Use a shell script. by hotsauce · 2001-08-13 07:57 · Score: 1

Many of these fits can take days, and, since they often have to be repeated many times with slight changes to the fitted function or initial parameters, this is a serious concern.

From the Beowulf FAQ:

3. Can I take my software and run it on a Beowulf and have it go faster?
[1999-05-13]

Maybe, if you put some work into it. You need to split it into
parallel tasks that communicate using MPI or PVM or network sockets or
SysV IPC. Then you need to recompile it.

Or, as Greg Lindahl points out, if you just want to run the same
program a few thousand times with different input files, a shell script
will suffice.

--
Lies about crimes

Really, really cool clustering by JohnZed · 2001-08-13 11:17 · Score: 2

If you're interested in general-purpose clustering (i.e. you don't want to re-write all your apps to use MPI), I really suggest checking out Compaq's Single System Image Clustering (SSIC)project. For Linux, this is basically in a pre-Alpha state, but the older, UnixWare-based version was very strong.

They also have a good comparison of clustering technology features on this slide. For now, you need a shared SCSI disk that can run GFS or something similar, but it may be possible to hook in PVFS eventually for low-end stuff.

Basically, SSIC is like MOSIX, but with killer high availability features. If a node goes down (from hardware, OS, or application failure), its workload is seamlessly migrated to another, functioning node. On MOSIX, unfortunately, each process has a "home node." If the home node goes down, the process is dead. SSIC also does load balancing by process migration, and all of that good, high scalability stuff.

Anyways, just give a look, and check out their slideshow...

--JRZ

Re:actually it shows why Cray always does so well. by fgodfrey · 2001-08-13 07:47 · Score: 2

Yeah, I will definetly admit that as benchmarks go, LINPAC is a very good one. I was just trying to make the point that it is not the be all/end all of the world of scientific computing :)

--
Go Badgers! -- #include "std/disclaimer.h"

Imagine a ... by sulli · 2001-08-13 08:54 · Score: 1

Cray cluster of (Thing)? is that what we need to say now?

--

sulli
RTFJ.

SGI's linux ccNUMA ... by akb · 2001-08-13 10:11 · Score: 2

... project is already doing it. See their Linux Scalability Project.

Linux by flamedaemon666 · 2001-08-13 16:52 · Score: 1

linux/unix os' rule

--
flamedaemon666

Slashdot Mirror

A New Approach To Linux Clusters

143 comments