A New Approach To Linux Clusters
rkischuk writes: "InformationWeek has an article about a group of ex-Cray engineers working on a new architecture for clustering Linux systems. 'It's not easy trying to build scalable systems from commodity hardware designed for assembling desktop computers and small servers.' Per the article, 'As the number of CPUs in a Beowulf-style cluster-a group of PCs linked via Ethernet-increases and memory is distributed instead of shared, the efficiency of each processor drops as more are added,' but 'Unlimited's solution involves tailoring Linux running on each node in a cluster, rather than treating all the nodes as peers.'" Looks like Cray engineers think about clustering even when they're not at Cray.
I don't know if you know this, but cum streaks if you don't clean it up right away.
I totally agree with your comment. However, my real reasom for posting this is 'cause I have name envy.
/.
heh. wish I would have thought of that...
~Shane www.shanekinney.net
Long Live
There was only one PPU on the CDC 6600, equipped with a hardware multiprogramming unit to make it look like ten independent CPUs. The PPU had ten sets of machine state and I/O channels, but only one arithmetic-logic unit (ALU). This was back when the ALU was the expensive part of the CPU.
one of the CPUs should be dedicated to the OS
MacOS 8 on multiprocessors was the last major commercial incarnation of that idea. But that was an ugly hack to put multiprocessing on a uniprocessor OS.
There's a lot to be said for channelized I/O, like mainframes have. All the peripherals look roughly similar to the OS, security is better because peripherals can't write all over memory, and there's less diversity in drivers. Intel tried this, but ran into troubles because they made the channel controllers fully programmable and put a little OS under Windows to run them. Microsoft hates it when you put stuff under their OS.
Offtopic, but still near-and-dear to the hearts of many readers: How does the licensing for Plan 9 work? I saw a copy of the GPL listed as Exhibit A at the very bottom of their own licensing page.
The licensing looks liberal enough (it's labelled "PLAN 9 OPEN SOURCE LICENSE AGREEMENT" license), but how is it related to the GPL?
-bch
Just to sumarise my basic point: The Top 500 is a benchmark and is not necesarily a good indication of who makes better computers than who.
Go Badgers! -- #include "std/disclaimer.h"
i have been thinking about for quite a while now. Beauwould clusters are all nice and well, but what about joe sixpack with some it background like me who wants to get some sort of cluster going on. what methods are available (be it a simplified beouwoulf cluster or whatever...) for the guy with 3 or 4 old machines who wants to waste some electricity and try his hand at clustering some machines. is it possible to do it without being a CS major, or is it just a matter of having enough time/resources...
I lost my concept of community when my community lost all concept of me.
These folks are using standard rack mounts, fitting 5 standard ATX motherboards in 8u of rack space (no special motherboard needed). They mount them vertically which makes cooling much more efficient when piling large numbers of CPU's into a small space.
When I die, please cast my ashes upon Bill Gates -- for once, make him clean up after me!
I would imagine you would use this as follows; first, you'd get some data points with which to calculate from a server. Then, you'd also get the name of a shared library which is on a server (NFS mounted probably). This library has a function name 'calc' or some such that does that calculation. You can then call that function, and post the results somewhere.
I would avoid using MPI or PVM, since those are not designed for farming out data the way you are. You should probably use your own job control protocol. Also, you might want to allow for multiple archetectures, naming the library foo-0.0.1.i386.so and foo-0.0.1.alpha.so and so forth,
Yes, I'm still a junky. Are you still a bitch?
There were a few "Because Linux does not scale well with multiple servers" posts about why someone would use a mainframe as opposed to a Beowulf.
Well, it looks like there are people working on the task. But that's not the real point, the right tool for the right job is the point. A whole lot of processes that do not require another process to finish before the next one is where Beowulfs shine, if you want throughput or process with dependancies then a mainframe is your best bet.
But it's still nice to have an alternative for those of us who cannot afford a mainframe.
DanH
Cav Pilot's Reference Page
UNIX - Not just for Vestal Virgins anymore
Nice job reposting from phrack, you nigger.
It's a beautiful warm spring day and a man and his wife are at the zoo. She's wearing a cute, loose-fitting, pink spring dress, sleeveless with straps. As they walk through the ape exhibit and pass in front of a very large gorilla, the gorilla goes ape.
He jumps up on the bars, holding on with one hand (and 2 feet), grunting and pounding his chest with the free hand. He is obviously excited at the pretty lady in the wavy dress. The husband, noticing the excitement, suggests that his wife tease the poor fellow.
The husband suggests she pucker her lips, wiggle her bottom, and play along. She does and Mr. Gorilla gets even more excited, making noises that would wake the dead. Then the husband suggests that she let one of her straps fall, she does, and Mr. Gorilla is just about to tear the bars down.
"Now try lifting your dress up your thighs"
... this drives the gorilla absolutely crazy.
Then, quickly the husband grabs his wife by the hair, rips open the door to the cage, slings her in with the gorilla and says, "Now, tell HIM you have a headache."
SHUT THE FUCK UP. Seriously. You are the reason your head hurts. It's all that vacum inside your head trying to cave in your skull. Fucking moron.
Stupidity never felt so good.
I'd like to see the chickens have a go at it.
SHUT UP ya fucking moron!
Stupidity never felt so good.
If my neighbor just gave me 1024 free chickens
because he upgradeded to two oxen, and I didn't
like to eat chicken, perhaps getting them to
plow a field would be useful?
I Am My Own Worst Enemy
I don't know. What's the tradeoff? Lots of chicken droppings on your field? The soil can use some nutrients, I'm sure. Can the chickens do it? Have you tried or is it a hunch of yours? Who eats more? The two oxen or the chickens? What about the sleeping place? Sure, those are a lot of chickens, but they don't have a problem if you sit them side to side and they don't have a problem with sleeping in three or four rows. Manageability, yes, that's a problem... now we see a real tradeoff. Being able to replace a chicken if it dies (and chickens are cheap) or one (possibly both) of your oxen, which are not cheap, versus manageability of the two oxen. Sure, feeding the chickens is also a problem, as well as collecting the eggs wow! byproducts! suddenly I can do something with the chickens that I couldn't do with the oxen.
I like this analogy.
If you were plowing a field, which would you rather use?
Two strong oxen or 1024 chickens?
--
E2 IN2 IE?
I think this may be a case where a bit more thinking and literature research about the problem would help a great deal.
We're looking at using MINUIT, a package written by the computing divsion at CERN, as our fitting engine. MINUIT's algorithms are quite advanced, and it's commonly recognized within the physics community as the best general-purpose fitting package out there.
I think you may not realize how complicated the functions we're trying to fit are. Here's the quick and simple version: we study magnetic fields within superconductors and semiconductors on a microscopic level. We do this by using spin-polarized muons or radioactive light ions as a probe, and measuring anisotropy of the emitted decay products. That data then has to be compared against complex models of superconductivity. The computationally expensive part here is calculating the values predicted by the model for a given set of parameters. This has to be done once for each data point to calculate chi-squared, and repeated many times (once in each iteration of the fitting process), each time with different parameters. The models typically contain difficult integrals which must be evaluated numerically thousands of times with very high precision.
Since the function we're trying to fit changes fairly often depending on the sample and measurement techniques used, it's not practical for us to spend huge amounts of time optimizing each individual function to be fitted. The fitting package is already optimized, so the only thing left is to parallelize it.
Check out SARA: TERAS' is a 1024-CPU system consisting of two 512-CPU SGI Origin 3800 systems. This machine has a peak performance of 1 TFlops (1012 floating point operations) per second. The machine will be fitted with 500MHz R14000 CPUs organized in 256 4-CPU nodes and will possess 1 TByte of memory in total. 10 TByte of on-line storage and 100 TByte near-line StorageTek storage will be available. 'TERAS' will consist of 44 racks, 32 racks containing CPUs and routers, 8 I/O racks and 4 racks containing disks.
:)
The fun part: parts of this huge machine are running Linux
For more closeup pictures see: http://unfix.org/news/sara/
Ain't it sweeeeeeeeeeet?
http://unfix.org
Carve up a big mainframe with 24 or so processors and bind the OS to a few of them. Get another mainframe and use it as a mezzanine backplane to gang together other mainframes each with their OS images bound to specific processors. Run all networking and IO through their own processors to offload the CECs. Add some more ASICs to handle crypto. Toss all console activity off to another 'special' processor. Run a hypervisor over the whole shebang to control all of the guest images.
Voila you've reinvented parallel sysplex with VM for Linux running on 'cheap' hardware.
Except how cheap do they expect it to be?
People oughta listen to me...
Hemos and JonKatz made their way into the lab where the animal experiments were conducted. A cute, fuzzy rabbit had caught their eye...
The evil duo quickly subdued the little lab rabbit. They strapped the now helpless animal's head to the sex table with hot leather. JonKatz had the urge, and removed his pants, which were now buldging. After slipping off his briefs, JonKatz tightly fastened the leather straps and was ready to begin.......
JonKatz began to "grease up". Shoving endless amounts of vaseline and baby oil all around the rabbit's ass, he slid his purple head firmly into the rabbit's tight asshole. Even though the rabbit was slightly unconscious, screams of pain were constanly being emmited. Hemos reached for the chain whip and smacked the rabbit's soft nose until its face was soaked with blood. Now, with the rabbits head drooped over the edge of the table, JonKatz continued his sex hunt. His now tingling cock was pushed deeper and deeper through the thick layers of skin which covered the bowel tract. Five, six, seven, then finally all eight and 3/4 inches were plunged deep within the animal's love canal.
JonKatz's manhood tingled with every slight movement of the now half alive rabbit. He began rhythmically sliding in and out, moaning with pleasure on every thrust. JonKatz worked himself into a hot orgasm. The blood, now coming steadily out of the rabbit's ass with every thrust of JonKatz's pelvis, could be heard dripping on the floor. JonKatz's rate increased and with a final push, he spurted creamy white love gel far up into the rabbit's bleeding ass.
The blood and cum mixed together on the floor, which had now accumulated a large puddle. Unknown to JonKatz, the semen had acted as a powerful enemma for the rabbit and out ushered the contents of its intestine. The stool was loose and soft. It fell to the ground with a soft thud and broke into small pieces. The obnoxious smell caught JonKatz's attention, and no sooner had he fallen to the ground and began licking the large puddle of blood, sperm, and stool. Exited at JonKatz's enthusiasm, Hemos dropped to his knees and also began to slurp the foul mixture.
After cleaning the floor with their tongues, JonKatz and Hemos checked on the battered lab rabbit. It was barely able to hold its head up, as it had lost control of most of its motor fuctions. Feeling no pity for this sexually mistreated animal, they unstrapped it and tossed it across the room, only to make a loud and deep thud against the wall. Its blood soaked fur left spatters of red stains everywhere it touched. Hemos reached for his chain whip, while JonKatz grabbed a pair of rusty hedge clippers (one of the many torture devices carried around for "convenience"). They made their way over to the rabbit. The rabbit was struggling for every last bit of air it could, just gasping and wheezing.
"Awwwww. Poor little thing," Hemos maniacally laughed. He raised his arm and thrust the cold metal whip down, exposing the rabbit's bloody flesh. He kept whacking and whacking at the furry bag of blood. Then, when Hemos stopped to catch his breath, JonKatz stepped over with his rusty hedge clippers. He knelt over the rabbit who was knocking loudly on death's door. JonKatz took a quick glance at the clippers, grinned, and then thrust them deep into the body of the rabbit, obviously hitting many arteries. As the blood squirted into JonKatz's face he moved the clippers around in hopes to find a thick bone to crunch. "Aha! The femur!" he yelled out with excitement. JonKatz wedged the clippers against the bone. He opened them wide......then closed down on them with all his might. The bone could be heard deep inside the rabbit, being mutilated. Death had glazed the bunny's eyes.
The rabbit lay dead, a bloody mess on the floor. Its bodily fluids freely surged across the tiled floor. Then with a look of extreme satisfaction, both JonKatz and Hemos lit up some smokes, gathered their belongings and quietly left the hospital grounds, knowing with confidence that they would strike again, somewhere, soon.
I'm trying to design a specialized data-fitting program to be used for accelerator-based condensed matter physics (and maybe ultimately other branches of science as well). I need information on adding clustering support to this program. Here's a brief description of what the program does:
The user writes a small chunk of code that calculates the function they're trying to fit the data to. We require the user to code the function him/herself because speed is important, and some of these functions are too difficult for Mathematica or the like to fit. Once the user writes their function, it's linked (dynamically) with the rest of the code. The user then passes in a parameter file, and away it goes.
Many of these fits can take days, and, since they often have to be repeated many times with slight changes to the fitted function or initial parameters, this is a serious concern.
Can this new approach to Linux clusters be used here? We have tons of Linux boxes lying around that are being used for other things, but have lots and lots of spare cycles. We probably couldn't afford a dedicated processing farm, but we could easily live with something like distributed.net where the program transparently takes all the spare cycles.
I know the problem is parallelizable, since each node can calculate the value of the function at a few of the data points, then send back to the "master" the chi-squared contribution of those points. Each iteration of the fitting process, the master sends out the current parameter values, and then the nodes grind away... There's not too much communication required.
One of my big concerns is how to get the user-written function from the "master" computer to all the "slaves". It's unrealistic to expect the user to manually install it on all the machines each time something in the function gets tweaked and it's recompiled. Are there pre-existing standards on how to send code to nodes in a cluster, then have it executed?
Any advice or pointers to good starting places on distributed computing would be much appreciated.
BTW, as a hint to all the other comp sci geeks out there--physics is a great place to find new and challenging computing problems (I'm not claiming this is one). In particular, the particle physics people often have to deal with spectacular data rates, and do extremely complicated event reconstruction. Check it out some time.
I'm wondering if the rest of the engineers groan when one 1337 h4x0r takes a look at a computer and says:
...
...
(clears throat)
root@desktop:/root# AllYourBeowulfClusterAreBelongToUs
Weird....
Wiwi
"I trust in my abilities,
but I want more then they offer"
The reason that high speed computing has not taken off is that there are currently no consumer apps that require it.
You've obviously never tried to open a spreadsheet in StarOffice.
score as funny. They tend to face reality more than window developers (like windows is a world class OS, since when?).Elliuson is right for a change. windows is a toy OS on a Toy processor.Try SUN you frickin LAN Admin. Stick to LAN's And stop sucking Bill Gates's dick, your mouth is sore
This idea isn't new at all and has been in practice for a long time in both large machines and in clusters. In clustering, for example, the C-Plant at Sandia uses this type of topology and is on the Top500. I'm pretty sure they have published papers on their topology.
> In that incarnation, the two central CPUs ran only user applications, while the operating system, with all its interrupts, OS code, and device drivers, would reside nearby in the ten Peripheral CPUs (called PPUs) provided for this purpose.
When I heard a CS professor talk about putting multiple CPUs on a single chip, I suggested that one of the CPUs should be dedicated to the OS, which would mean that it wouldn't even need a FP unit. So for (say) 4 or 8 computers on a chip, one would be a "OS server" and the others would be "application servers". Ditching the FP on the "OS server" might allow an extra-high-performance design for it. And the others would only need context switches when the OS demanded it, rather than one for every stinking interrupt that came along.
Sheesh, evil *and* a jerk. -- Jade
Can you imagine a Beowulf-cluster of beowulf-cluster posts...?
ahhh...never mind.
What happened to OSCAR?
...
(clears throat)
...
FUCK YOU!
Thank you very much.
Stupidity never felt so good.
These were Ex-Cray, which perhaps says something in and of itself...
A feeling of having made the same mistake before: Deja Foobar
How in general is this different than the approach taken by Plan 9? (http://plan9.bell-labs.com/sys/doc/9.html)
Mentioned with reverence, but still slowly going bust.
don't install it! use Windows 2000 instead... face it... it's a stable operating system
Get that rats nest off your head, you numbskull -- Wesley Willis
Treating each node as a peer! Don't they know that Peer to Peer networks are stealing from our musicians and corrupting our youth! I just hope they can repent before the heavy hand of justice comse down on them.
Steven
-- I have marked myself unwilling to moderate-- I don't have other accounts to artificially inflate the karma of
Yes, this has been done for a while, usually assignment by hand of the programs that run on which nodes. The hard part is getting an automatic reservation system to figure this out for you. There are some batch schedulers that can schedule this type of stuff for you but you still have to identify which nodes will run what program. In the embedded space this has been an issue for quite a long time. For example an embedded machine may have X number of GPUs (i.e. G4s) and Y number of some DSP (i.e. Sharcs) nodes in the same box and you want the appropriate nodes to do the right things.
LINPACK is computationally O(n^3) and O(n^2) wrt. communications, for problem size n.
That's not a completely unfair benchmark - but of course you're right it's a benchmark and therefore it does not cover every possible problem out there. However, it is based on the common linear-algebra routines that are the core of a very large part of the scientific computing problems being run out there.
It seems to me that they are just using asymmetric multiprocessing on a distributed Linux cluster. (Asymmetric meaning that each CPU has a specific function.) This idea is certainly nothing new, though the novelty might be in using tailored Linux instead of the typical UNIX environment. (Different processors, typically.) Of course, this is an area that has been the focus of a great deal of research, so perhaps they are just attempting to move this idea into the private sector by using the less expensive Linux distros as a viable economic alternative for greater computational power?
Root DOWN
grep what -i sed?
I put my dick in it.
SHUT THE FUCK UP! God please, all the stupid fucking jokes here. It's like listening to the conversation in a high-school boys locker-room. Fuck! FUCK FUCK FUCK! I'm not even supposed to be here today!
Stupidity never felt so good.
Can you imagine... wait. I think they already did.
Well, then imagine Natalie Portman Naked & Petrified, then it's ALL good.
a beowulf cluster of these.
I eat meat, I tickle your feet, I ask for slashdot news it's neet, :)
but today i saw an fbi bird, it tried to eat my honey word.
Red worm ran, into the can, of win doze boxes, then sent some spam,
to see if they could pester the man, who tries to run our nationalized
land.
Read the posts, chase the ghosts, who penetrate our servers and hosts,
and you will come to learn to be, a non-elite computer hacker like me.
if you need help, send me mail, I will gladly flame your tail,
only after youve been inseminated, will my info be disseminated.
That is right, I make light, cuz i dont get none night to night,
but if a girl will come and get me laid, I'll make more funny for all to
read.
Hell-FUCKING-Yeah!
A pregnant woman walks into a bank, and lines up at the first available teller. Just at that moment the bank gets robbed and she is shot three times in the stomach. She was rushed to the hospital where she was fixed up. As she leaves she asks the doctor about her baby. The doctor says,
"Oh! You're going to have triplets. They're fine but each one has a bullet lodged in its stomach. Don't worry though the bullets will pass through their system through normal metabolism."
As time goes on the woman has three children, two girls and a boy. Twelve years later, one of the girls comes up to her mother and says "Mommy, I've done a very weird thing!"
Her mother asks her what happened and her daughter replies, "I passed a bullet into the toilet." The woman comforts her and explains all about the accident at the bank.
A few weeks later, her other daughter comes up to her with tears streaming from her eyes. "Mommy, I've done a very bad thing!" The mother says, "Let me guess. You passed a bullet into the toilet, right?"
The daughter looks up from her teary eyes and says, "Yes, how did you know?"
The mother comforts her child and explains about the incident at the bank.
A month later the boy comes up and says, "Mommy, I've done a very bad thing!"
"You passed a bullet into the toilet, right?"
"No, I was masturbating and I shot the dog"
Brian Moyles is a pussy.
They're going to have some nodes in the cluster dedicated to message routing with a specially optimized version of Linux. Which would mean that adding an extra node would increase the load on the router node but not the other nodes in the cluster. Is this correct? And if so, is this revolutionary stuff? Just that it sounds a bit like applying som common sense to me.
I'm wondering if the rest of the engineers groan when one of them takes a look at a computer and says:
...
(clears throat)
...
"Imagine a beowulf cluster of these things!"
-J5K
The libertarian solution to the failures of capitalism is to apply more capitalism til the failures are fixed.
Since it uses Linux they'll call it a clusterfuck.
Cunning linguists
If only I hadn't seen a piece of news about a Beowulf cluster of ex-Cray engineers. I wouldn't have had to make a Beowulf remark.
Well, duh, Cray machines are massively parallel processing machines, so they're not clusters in the sense that they don't use network cards and separate computers as basic computing units, <OVERSIMPLIFICATION>the processors talk to each other on the same bus and share the same memory</OVERSIMPLIFICATION>, but basically in either case it's about parallel processing. I *hope* Cray engineers think about clusters. I'd hate to see them think about single Athlon supercomputers ...
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash
I think it's great. The guy is notorious, universally shunned on Usenet, but on Slashdot, his posts are moderated through the roof.
He's even better then Signal 11 at showing the flaws in moderation.
Their homepage seems to be http://www.unlimitedscale.com/.
Unfortunately, it contains absolutely no info on what hey are up to.
groups.google has a tiny bit more.
And a bit on their funding.
Anybody got any more info?
/Styx
Why post this article? It says "we're CRAY folks and we're doing cool stuff with LINUX to improve BEOWULF clusters." a) there's absoutely no meat here - just buzz words. b) Almost no one cares about scaling clusters to thousands of nodes for any reason, even fewer for running parallel codes. c) An EFFECTIVE cluster with thousands of nodes running PARALLEL codes isn't a Beowulf. The Beowulf name applies to SINGLE USER clusters assembled from COMMODITY Off-The-Shelf components. Cluster this size spend most of thier time running lots of smaller jobs rather than devoting the whole machine to a single problem (so, not single-user), and you can't scale many (any?) parallel codes to a thousand nodes using commodity hardware.
(I know--not the best analogy)
Power to the Peaceful
first cluster post!
The real limitation of Beowolf style computing is RAM. Beowolf is great if you have programs that paralellize with little intercommunication and low RAM usage. The bigger problem is RAM. Big iron like Crays/SUNs/SGIs all have about a Terabyte of RAM in one place. When you are trying to do large physics calculations you usually have a huge data set you need to store for every time series. Supercomputers aren't cool just because they are fast, but because they can hold HUGE amounts of data in RAM for easy acess. Until PCs get a few gigs of RAM per box cluster computing is still going to be Kludgy no matter what kind of message passing scheme you use.
bash-2.04$
bash-2.04$yes "Don't you hate dialup connections?"| write USERNAME
Seems to shed some new light on the previous post about the Cray SV1 and people dispelling Beowulf.
how 'bout a beowulf cluster of these clusters?
Furbeowulf
Funny. :)
... on Bizzaro World!
saru mo ki kara ochiru
Would it be concievably possible to use mixed architectures and assign certain tasks or routines to the architecture best suited for them. Rough example: 200MHz Pentium and 200MHz Cyrix system in the same cluster. Two calculations need to be performed, one interger, one floating point. Send the interger to the Cyrix and the floating point to the Intel. Rather than fight with some platforms being better than others at certain tasks, work WITH that fact.
If you think education is expensive, you should try ignorance -- Derek Bok, president of Harvard
Is mainframe really the right term? I thought that mainframe usually refered to a IT machine like an IBM 390. One of my client's has a "mainframe" that is only 11 400 Mz processors. The Dell 8 way MSSQL server has more processing power. (and we have verified this with benchmarks and working with IBM.)
Isn't there a differnt term for a very fast computer? Maybe something like "super computer"?
Joe
Joe Batt Solid Design
fp!!!1
I just heard some really sad news on talk radio - Horror/fiction writer Stephen King was found dead in his Maine house this morning. I'm sure we'll all miss him - even if you didn't read his books you've probably enjoyed one of his movies. Truly an American icon.
I like the idea of a furry cluster: furbeowulf.
No one got beat up more often than the mimes of the old west!
Toast!
can you imagine a beowulf cluster of those?!?1 wiat type of clustering are they going to call this, it might be a new troll-word for /. :)
Imagine a Beowulf cluster of - oh, wait a minute...
SJVN commentary on distributed computing and some interviews with various people in the field.
Best Slashdot Co
What is important to realize is that in order to use these boxes as a cluster, you will have to wrote you own custom software. Yep, it means C and C++, and hours of hacking.
But as mentioned in a previous post, Mosix can do that for you, if and only if your program can use several instances at the same time. Compressing MP3s is a good example.
Nobox: Only simple products.
and it's not the top of the line, even, if i'm not mistaken. i thought that was the t90.
A: None. The Universe spins the bulb, and the Zen master merely stays out of the way.
The portfolio of Quatris Fund (one of the investors in Unlimited Scale) is small, but interestingly diverse.
/Styx
Of the top 500 supercomputers in the world, 47 are vector processor machines - the kind of processors that Cray became famous for.
Not one single of these are made in the U.S.
Cray today is a name. It's a brand. It's not a manufacturer of high performance computers.
Just for the record, IBM produced the two fastest computers currently, Intel the third, IBM the fourth, Hitachi the fifth, SGI, IBM, NEC, IBM, IBM, and Finally, number *11* is a Cray based on the Alpha processor (the T3E).
So, tell me again, who was playing catch-up with who ?
Guess who the CDC6400 designer was? Seymour Cray.
Wouldn't HURD be more suitable for such task?
Basic
How does may or shall translate via FSC to CANNOT? Idiot.
Cray's engineers seem always willing to consider every possibility, whether it be clusters, p2p, parallel, etc.. showing us that they're considering things well outside of what they're currently offering is also showing us why they're still in the game and even ahead in serious computing power after so many years.. IBM, Sun, etc.. have had their rise and falls, but Cray is always mentioned with reverance...
As long as Beowulf clusters have been around people have been doing this. In a homebrew system made from varying types and qualities of hardware, are you seriously going to have each node doing the exactly the same task? No... you write your program (and Beowulfs are ALL in the programming) so that each node does what it's best suited for. The node with the big hard drive stores the data, the fast machine gets twice as many work units, the slow machine is devoted to taking user input or receving the end result, etc. To do otherwise would be, well, stupid. The weak link in the system would slow the whole thing down.
:)
Creating job classes in a homegeneous cluster is just as useful. I seem to remember someone working on The Collective project at the University of Idaho was doing this with a genetic application. This cluster is pretty close to being homogeneous.
If you visit the site, the Borg penguins are my handiwork.
can you imagine a cluster of wolves? Sure you can. First they are shaved, then they're coated in a layer of caramel, then rolled in almonds, and finally dipped in chocolate. Wolf clusters!
(Now available at www.hairyanddavid.com)
You Are Full Of Shit!
I am curious about where clustering will go for the connection between nodes. Ethernet, Fibre Channel and various proprietary formats are around but all have issues. InfiniBand is also on the horizon. While I work with InfiniBand development, I am not involved with any kind of clustering work. I see Sockets over InfiniBand as interesting method for inter-node communication. What do those of you who do work in the field think?
-- soldack
As a Cray developer I can tell you this new clusters are going to kick some serious ass.
Sincerely, Mike Bouma
what these guys want to do is to build, say, a cluster of 2 CPU system where one of the CPUs only computes while the other manages I/O and communications. Indeed, the I/O part is really a problem on Beowulves, and dedicating a CPU on it and communication can be cheaper than dedicated network cards like Myrinet (at 1000 $/port) or SCI, and hi-perf I/O like HiPPi. I wonder though if they can beat the price/performance ratio of the latter the way Beowulves beat on raw Flops the ones of traditional supercomputers.
Google passes Turing test : see my journal
Mentioned with reverence, but still slowly going bust.
The reason that high speed computing has not taken off is that there are currently no consumer apps that require it. Only a few scientific, research and governmental organizations have a need for it. However, let's say there is a breakthrough in AI technology, it will require googles of CPUs and memory. And when that happens, the market will explode.
People are going to want their mechanical maids, baby sitters, gardeners, chauffeurs, lawyers, companions, stock market experts, and what not. I predict they are going to crave their mechanical servants to the point of pathological obssession.
Don't be so sure this won't happen in your lifetime. In fact, there is every reason to suppose that it might happen anytime. There is an awful lot of minds thinking about intelligence and an awful lot of money being spent on it right now. IMO, the solution to the intelligence problem is probably simple. As Dr. Rodney Brooks of MIT says, "Maybe this is wishful thinking, but maybe there really is something that we're missing." Any day now.
In conclusion, I would recommend that you don't sell your shares in the supercomputing sector just yet.
they used RAID... i mean, what's the point of clustering all these boxes? so you could be the coolest kid on the block and say "yeah, i 0wn j00?"... what about power issues? quite honestly, i think this is just stupid... meaningless and a waste of a lot of money... but i'm not surprised... things like this happen everyday... literally
Get that rats nest off your head, you numbskull -- Wesley Willis
Nakoruru, what is that, Indian? Go make me a slurpee, Nakoruru, and give me one of those little "wassup" ligthers, too. I love that shit. Is it true you people eat your own children? I think I'll pass on that hot dog, Nakoruru. I don't want to be muching on little Nakoruruette. Hey, what are you doing? Why are you unzipping your pants? Put your pants back on! Oh
There wasn't much at all in that article other than what had been said in the caption on /.
Does anyone know what their system does and how it differs from e.g. Beowulf and MOSIX (or GridWare for that matter)?
I don't suppose anyone will reply since I post at 0.
I'm out of my tree just now but please feel free to leave a banana.
This article covers three distributed OS options, with some intro explination of the difficulties. I would think the easiest (not necessarily the best) solution could be to use Mosix (listed last in the article) and thread your application to a logical extent. Mosix won't interfere with your current linux boxes, just add it on. The tasks will automatically be load-balanced among the machines.
Developers: We can use your help.
A cluster fuck of these?
~ now you know
Many of these fits can take days, and, since they often have to be repeated many times with slight changes to the fitted function or initial parameters, this is a serious concern.
From the Beowulf FAQ:
3. Can I take my software and run it on a Beowulf and have it go faster?
[1999-05-13]
Maybe, if you put some work into it. You need to split it into
parallel tasks that communicate using MPI or PVM or network sockets or
SysV IPC. Then you need to recompile it.
Or, as Greg Lindahl points out, if you just want to run the same
program a few thousand times with different input files, a shell script
will suffice.
Lies about crimes
If you're interested in general-purpose clustering (i.e. you don't want to re-write all your apps to use MPI), I really suggest checking out Compaq's Single System Image Clustering (SSIC)project. For Linux, this is basically in a pre-Alpha state, but the older, UnixWare-based version was very strong.
They also have a good comparison of clustering technology features on this slide. For now, you need a shared SCSI disk that can run GFS or something similar, but it may be possible to hook in PVFS eventually for low-end stuff.
Basically, SSIC is like MOSIX, but with killer high availability features. If a node goes down (from hardware, OS, or application failure), its workload is seamlessly migrated to another, functioning node. On MOSIX, unfortunately, each process has a "home node." If the home node goes down, the process is dead. SSIC also does load balancing by process migration, and all of that good, high scalability stuff.
Anyways, just give a look, and check out their slideshow...
--JRZ
Yeah, I will definetly admit that as benchmarks go, LINPAC is a very good one. I was just trying to make the point that it is not the be all/end all of the world of scientific computing :)
Go Badgers! -- #include "std/disclaimer.h"
Cray cluster of (Thing)? is that what we need to say now?
sulli
RTFJ.
... project is already doing it. See their Linux Scalability Project.
linux/unix os' rule
flamedaemon666