World Wide Cluster
gwjc writes: "There is a pretty good Ian Foster article on Web-based computing clusters at the Nature site. The usual SETI@home, condor, and entropia mentions as well as a few that were news to me such as "Compute against Cancer" and "Fight Aids At Home" with links. I wonder how I go about declaring."
seti@work got me fired
GiraffeSville, a place anyone can call home
How will any prevail? I guess you could run seti@home, distributed.net, prime95, and the rest on one box. Wouldn't that void the point? All this competition waters down each project.
More and more companies are starting these distributed computation projects that feed off newbies' altruistic intentions, but why? Are the companies' motives altruistic as well? Hardly. Whatever drug molecule you calculate will be instantly patented by the corporation without so much as a by-your-leave. At least with those key-cracking contests, the winner got a prize. Here, you just get the shaft, and Entropia Incorporated gets a cut.
I'm sick of corporations and I'm sick of patents. It's getting to the point where I feel like sabbotaging all patent-seeking enterprises, even if it means we'll never find an AIDS drug. You can't do good by doing evil first, no matter what Macchiaveli tells you.
Read the rest of this comment...
After all, whilst the advantages of distributed computing are clear in that they can provide a way of harnessing a lot of computing power for a cheap cost, there are also downsides to this kind of project.
If people are so taken up with this sort of thing, imagine how easily it could be abused. People don't tend to be able to recognise and deal with email viruses, let alone a rogue distributed project that claims to do one thing whilst in actuality do something else. It sounds to me like a perfect opportunity for intelligence agencies to get their software on people's computers without anyone knowing!
How can you tell whether that client you run 24-7 on your home computer is actually helping calculate the next prime number or whether it is scanning all of your net connections and sending the information to a giant government database to be perused at their leisure? Police states like Britain already want to keep records of everything you do, this seems like a damn good way of doing it on the sly.
Personally, I'd be very wary of any piece of software that sits on your PC and has a constant connection to the internet. Unfortunately, most people are too trusting when it comes to their security online...
The thing that scares me is the possibility of said cluster being abused, and hackers using it for ill purpose. Also, what are the implications for privacy? Look at .Net and the like, this is the nest step, and all my private files will be spread all over the Web! Ultimately, the superdupercluster could become conscious and ruin us all! ;-) I would like to see these technologies more strictly controlled. Sharing of data is one thing, but sharing of processing power seems a bit on the dangerous side, don't you agree?
Thanks for reading.
--Anticipation of a New Lover's Arrival, The
Will we get money from it? No.
Will we get our names in the publication? No.
Will we feel good for donating our otherwise wasted CPU cycles to science? Yes!
None of the companies this article mentions will compensate you for your computing time. Until they do, this whole thing isn't going to take off, because there just isn't enough motivation for Joe Q. User to install this software. Why would they install something they don't understand, at a privacy risk and a stability risk, for no immediate personal benefit?
I used the Distributed client for quite a while, but I switched to Seti just because it had a cooler screen saver. I've got a bunch of computers in my office that are usually idle, and this at least looks cool when the PHBs walk by.
At least Entropia (one of the companies in the article) gets that part of the motivation, and provides you with a color screensaver. It's not nearly as good as the Seti one, but it's something.
What's your damage, Heather?
Doesn't it seem like a better idea to use idle computer time to search for a cure for Cancer and AIDS instead of searching for aliens and finding new OGR's? Guess I'm just gonna have to say goodbye to distributed.net!
While many of these projects are worthy, please turn your computer off when you are not using it. It's great to donate spare cycles while you are word processing or whatever, but when you finish, turn off your computer. Many areas of the world (including California) are critically short of electrical power. It doesn't make for a better world to waste electricity when it is short supply. Much of the emergency electricity is now being generated in obsolete highly polluting coal fired power plants. Use less electricity and you can fight pollution and cancer too! So for the sake of everyone, please conserve electricity. Turn you computer (and TV, and lights, etc) off when not in use.
Wow, it's amazing how you basically said nothing new, insighful or interesting and yet got modded up. You know putting buzzwords together does not mean that you are insightful. Did you hear that, crack-smoking moderators?
the ramifications of this on Beowulf clustering?
So if you are running three different projects, or more than one project, then the one that starts first will get all of your idle time, as far as I know.
Well, that depends on the scheduler. Linux, for example, would distribute the CPU time evenly between all project.
I don't know any off-hand, but there are schedulers which don't use a timeslice. On such a system, you could only get one project to run.
And if your distributed client is a screensaver, you can only run one of those at once.
WWTTD?
This article didn't mention Popular Power. Popular Power has the BEST screensaver, plus it WILL pay it's clients as soon as it gets a commercial project. Sign up now to help work on an influenza vaccine
You only run what you are interested in. For example, if you wish to help find (and believe in) alien life, you run SETI@home. If you want to find cool new protein structures, you run Folding@home to help the proteomics researchers. It is simple as that. As with everything in this world, use common sense. After all, we're talking about a cool way of doing things, not about how it will change every man, woman and child's life! Because it probably won't.
Veni, vidi, vici.
as I browse the net very upsetting at times. For example, I've had shit
deposited onto my harddrive (i.e. cookie w/ java booger) which reset my home-
page. The next time I go online, I'm staring at not just one,
but three windows, each trying to sell me something I'm not going to buy regardless.
The idea of allowing someone's program to run on my machine, without my direct
control turns my stomach. Yes I am paranoid, and I think I have good reason to be.
More, bigger, faster and easier are not always better for an individual
user where control is given away to those who have their own agendas.
I like to think I am in control of my PC. I'm learning to take
measures to 'make it so'. I think we all should!
Wow, it's amazing how you basically said nothing new, insighful or interesting and yet got modded up.
Why don't you try:
I, myself, thought the comment was interesting, and would have modded it up myself if I had points right now.
TomatoMan
-- http://frobnosticate.com
Distributed computing of this nature, like many clusters, is only as good as the jobs running on it. While it would be perfect for something that performs the same sequence on mulitple sets of data (ie, SETI or rendering), it is not what you would use for processing that relies on the previous calculation to continue (weather model simulation, optimizations, etc). These are just too bandwith intensive and would leave even the slowest processors mostly idle while saturating even the fastest links.
It is a far better idea to come up with some proposed jobs and determine the best hardware (be it a single large-scale system or a distrubuted cluster or small systems) before telling the world to leave their power-hungry personal computers running 24/7. 400 watt consumption isn't much until you multiply it by the number of PCs owned by the geek community.
I envisage a system were you have resource_buddies-people you have agreed to share idle cpu time with. These buddies would probably need permenent ip add's, but they are becoming increasingly common with broadband links. Anyway, when you are doing something like rendering something with gimp, ior any other cpu intensive task, the kernel module could kick in at a user defined cpu usage level. When active the module could test a few resource buddies to see if any are active and if any have cpu idle time. If it detects idle time it then shares the processing load with the remote system.
Anyway, just a thought, probably never happen
GiraffeSville, a place anyone can call home
We already have such a sandbox which is multi-platform (including Linux.) Although it's not the fastest possible implementation, I'd be much more willing to donate my spare computing cycles if the program were written as a Java applet.
The same restrictions that make Java applets safe also, to me, sound like the restrictions that would make distributed computation safe. They have no access to your local disk. They cannot make network connections, except to the source of the code.
Aside that people think of applets only for displaying graphics, and maintaining one of them up 24/7 would be difficult, are there any reasons why Java applets shouldn't be used for distributed computing?
Never play leapfrog with a unicorn. Or a juggernaut.
The idea of allowing someone's program to run on my machine, without my direct control turns my stomach. Yes I am paranoid, and I think I have good reason to be. More, bigger, faster and easier are not always better for an individual user where control is given away to those who have their own agendas.
you let programs run on your computer without direct control all of the time. in linux their called daemons and in windows they are called services. these are programs that run in the background that you never see. in linux most of them are open source, but since most people (myself included) dont religously read every scrap of the source we really dont know what we are running.
what i'm saying is that you have to assume some ignorance in order to use your computer. you depend on others (for linux it's the gnu folks, kernel hackers, etc.) to check this code. i believe you like to think you are in control of your pc.
use LaTeX? want an online reference manager that
-- john
Being a long time distributed computing advocate, I've used (and crunched many blocks for) GIMPS, distributed.net, Seti@Home, Folding@Home, and United Devices. I'm currently using all my spare cycles for United Devices. Why? Well here's a brief explanation/analysis of the projects I've used:
GIMPS - They have a good, clean client, but the critical problem is that the project has no conceivable benefit.
distributed.net - Probably the best client/site out there, and definitely the largest pre-Seti@home project. However, the cracking of encrypted messages has next to no scientific benefit (it is quite easy to calculate the chance per try of cracking any of their ciphertexts). Recently they've been doing some work with OGRs. Finding new OGRs looks like something that at least has a marginal benefit. On a side note, distributed.net has partnered up with United Devices.
Seti@home - It seems like everyone and their mom is running seti@home. However, reportedly seti@home actually has more clock cycles than they need. (they can only get so much radio info per day to analyze)
Folding@Home - Definitely a lesser known project which is being run by some researchers from Stanford, they analyze proteins. The project definitely has scientific merit, however they're experiencing some growing pains due to their recent popularity. Also their client is definitely beta-esque.
United Devices - This is the project that I'm currently contributing to. (so of course I'm bias) I chose them because they're doing something useful (working on cancer stuff with some researchers from the University of Oxford), have a fairly good client, and have a 'rewards' program for their users. (btw, GIMPS and distributed.net users also have the chance of winning a large cash prize) In addition, UD has partnered up with distributed.net, so it looks like UD just might be the commercial corporation to win the Internet-based distributed computing market.
I hear you. I have invited the daemons and services to do their work
when I installed their OSs on my drive. Sadly, what you say is true.
With Windows, I see it as 'dancing with the devil'. With Linux,
I feel more trusting considering its being open-source. My
greatest concern is for what I could be tricked into habitating my machine.
I have a lot to learn, and I'm keeping my eyes open.
Thanks for your input.
I would like to see a client that securly updates itself. - Last time Seti stopped running because it wanted an upgrade I uninstalled it.
I would like to see more of a trusted central organization that will put out a client that can handle updates and giving you a choice on what project to compute on.
This would also make it simpler to track ladder rankings.
The whole point I got into the distrubuted bit was to see if I could crank out more keys then my buddies. I imagine this worked for alot of other techie-geeks.
At this point I am not going to bother donating cycles until somone comes out with a nice client, (with optional usefull looking screensaver), that is actually working for a good cause such as cancer cure. Ladder rankings would be an added bonus - attraction.
Also clients that PROPERLY support SMP, I seem to remember having to start two instances of a key cruncher inorder to get full effect out of a machine.
On another note I remember a tale about a contracter friend installing the key cruncher software on machines at places he was contracting at. Fergot about it for months after he left that contract. Checked his ranking after he had no machines personaly only to find those office machines were still working away.
I am also curious to see what Google could pump into a project with their spare cpu cycles from many thousands of machines(other then heat up the datacenters).
-Cyril
All babbeling contained here in, may or may not be sensible to the common earthling
Wow, I read one book, and....
/. comment to this book I read a while ago, Wyrm by Mark Fabi. It's set in the days leading up to 01.01.00, and it has an interesting premise. Part of it is a vast, complex, "AI" who is sort of born out of unified processing on the internet. It's basically an accident.
;)
You see, this is the second time in this week that I have been able to relate a
Like some of the other stuff in the book, it's somewhat... far-fetched. But that's OK, because it's still a great book.
-J
Karma: T-rexcellent.
Robert Heinlein saw it all 50 years ago... Just read The Moon is a Harsh Mistress In that book, the Lunar colonies main processing computer (which, after gaining sentience, was called Mike) became sentient after many modules and auxiliary processors were added. It brings up an interesting thought... I really know nothing about neural nets or how they are simulated, but couldn't one of these distributed processing systems be used to simulate a neural net? With enough CPU power behind it could the system develop a somewhat sentient persona?
Hey - how about Mosix or Beowulf across the Inet. Yeah, I know the link is crappy for traffic, but it'd be neat to give it a shot. Anyone game?
Projects such as the Human Genome project may contribute to the formulation of better computational modules for testing, but at the end of the day, we have to remember that these are still only simulated environments.
Just how do I track the billing? By cycle or bandwidth?
How about this -- A place like distributed.net uses some secure method to track who you are and then clocks how many hours you spend running their software, then, every year they submit those hours to your national revenue service which then deducts those hours at a rate from your taxes each year (Any amount I would be happy with, 5 cents would yield $360 dollars for 300 days. 50 cents would yield $3,600 a fair good chunk of change). Maybe even a certain amount for different programs, so companies could pay more or less to have their program get run more often. This brings up the issue of companies monopolizing that computer time. This shouldn't be an issue since the user still has control over whose programs they run and maybe how much CPU time they get. This would also bring about competition, 0.5 to 2 cent hikes between competing companies every week would bring in a nice quadratic curve to the function of your tax deductible income. There is one bad issue which is that organizations that can't afford to pay for the distributed computing would be without these such services. In that case, an incentive program could be put in place where legitimate non-profit organizations get free distributed computing but the user still gets the cash. The process still has the same special interest factor, you can still run SETI or mprime even though it may only be 2.5 cents instead of the 5 cents microsoft will pay to have you send them every thing from the web-sites you visit every day to your keystrokes. The only group I see losing out would be groups between the free and ultra-high rate. Since a non-profit organization gets the service for free, and a high-rate company will get most of the time, what happens to companies that can only afford prices at 10-15% of the highest rates. It's not a perfect idea, but it's a start. It's eCharity. But then again, what are the chances something so practical and logical could actually occur in a government such as the United States :)
I had a FreeBSD box running Seti at work 24/7. I stopped working there 2 months ago but the box still seems to chew through Seti units. :)
My guess is that they don't know what it is doing but since the label on the box says "server" they leave it alone.
Oh well, they would never know how to operate a FreeBSD box anyway.
--------
None of these projects use the HTTP protocol, or hypertext in any form (let alone represented by HTML or one of it's variants.) So what on earth do they have to do with the web?
Is there a slasdot entropia team to FightAIDS@home?
Why the f*ck do some people feel compelled to submit this comment for every slashdot story????
Can't people be a bit more creative?