Slashdot Mirror


TeraGrid v. Distributed Computing

Nevyan writes "After three years of development and nearly a hundred million dollars the TeraGrid has been running at or above most peoples expectations for such a daunting project. On January 23, 2004 the system came online and provided 4.5 teraflops of computing power to scientists across the country. However, the waiting list for TeraGrid is long, including a bidding process through the National Science Foundations (NSF's) Partnerships for Advanced Computational Infrastructure (PACI) and many scientists with little funding but bright ideas are being left behind. While the list of supercomputer sites and peak power is growing how is the world of Distributed Computing faring? "

124 comments

  1. Distributed Computing by Iesus_Christus · · Score: 5, Insightful

    The problem with using distributed computing for everything is that the number of people willing to let others use processing power on their computer is not infinite. It is a very large number, but eventually everyone who wants to/knows how to help out their favorite cause will have something already installed. In addition, the more useful endeavors that use distributed computing, the less users you will get for each, and only the 'interesting' projects will get many users. Who wants to use their computing power to analyze some boring old physics experiment when you could be finding aliens or curing cancer?

    Distributed computing has its uses, but remeber: the public will only be willing to help you as long as they feel like they're contributing to something worthwhile.

    1. Re:Distributed Computing by The_Mystic_For_Real · · Score: 3, Insightful
      This could possibly be beneficial to science, whose purpose is ultimately to serve humanity. This creates something of a democracy in science. Now the public can chose what problems that it wants solved and play a direct role in helping while they sleep.

      I think that this is a Good Thing (TM). Distributed computing has the postential to not only further the cause of science, but to bridge the gap between the public and the scientists.

      --

      _____

      Thank you.

    2. Re:Distributed Computing by Trailwalker · · Score: 3, Interesting
      beneficial to science, whose purpose is ultimately to serve humanity


      Only if they are investigating cannibalism. The purpose of science is the advancement of knowledge. Service to humanity, if it happens is incidental.
    3. Re:Distributed Computing by The_Mystic_For_Real · · Score: 2, Insightful

      You need to remember, humans invented mathematics and science. Numbers are the the relations and functions used in science and mathematics are human creations to describe the universe, much like language, only with more stringent rules.

      --

      _____

      Thank you.

    4. Re:Distributed Computing by samael · · Score: 5, Insightful

      Now the public can chose what problems that it wants solved

      Jesus, there's a horrible thought. I've met the public (and seen it's choice in TV). I'd rather have monkeys choose.

    5. Re:Distributed Computing by NeoSkandranon · · Score: 2, Insightful

      You assume they have good judgement as to which causes will end up benefitting them the most.

      --
      If you can't see the value in jet powered ants you should turn in your nerd card. - Dunbal (464142)
    6. Re:Distributed Computing by hunterx11 · · Score: 5, Insightful
      Mathematics and science are neither arbitrary inventions nor entirely self-evident discoveries. They are our attempts to understand and categorize the universe in the most objective manner possible. I would even argue that language is a much more abstract type of categorization. Despite mutually unintelligible differences in languages, all languages are used to describe the same reality.

      What I'm trying to say is that the semantics of how we describe the universe may be arbitrary, but the universe is objectively describable.

      --
      English is easier said than done.
    7. Re:Distributed Computing by Anonymous Coward · · Score: 1, Insightful
      The purpose of science is the advancement of knowledge.

      Science doesn't have purpose. It's a study of phenomena. At least, that's the scientific method:

      The principles and empirical processes of discovery and demonstration considered characteristic of or necessary for scientific investigation, generally involving the observation of phenomena, the formulation of a hypothesis concerning the phenomena, experimentation to demonstrate the truth or falseness of the hypothesis, and a conclusion that validates or modifies the hypothesis.

      There is no purpose without people. Every scientists I know would never do what they do if they didn't think it would improve humanity. I think many of them are wrong, but that doesn't change the purpose of their work. There are lots of easier ways to make money.

    8. Re:Distributed Computing by Anonymous Coward · · Score: 0
      I've met the public . . . I'd rather have monkeys choose.

      Apes vs monkeys. Not much of a difference. Most of the public doesn't know which is which anyway.

    9. Re:Distributed Computing by Anonymous Coward · · Score: 0

      No. The problem is that there is no automated way to "distribute" computing. So all management has to be done through bureaucracy. The whole thing is in the end just a bunch of clusters. With some standardization in the software configuration though.

    10. Re:Distributed Computing by Alkonaut · · Score: 5, Interesting
      Why aren't websites sponsored by applets doing distributed tasks? I'm thinking mainly websites with huge numbers of visitors, where visitors tend to stay long enough to do any meaningful work. (Like GMail for example). Most people don't use more than 5% of computing power when surfing the web, and an applet is safe and easily distributed.

      Personally, I'd much rather have an applet using 10% of my cpu power instead of an annoying flash banner (which probly itself uses 10% cpu...).

      Obviously someone has to pay for internet content, and that to me would be the least intrusive way. Popup-blockers will be inefficient by the end of the year. Ads will be inside the site content. Or worse still, the popup window is the main window, while the actual content is spawned as "pop under" meaning that if you have a popup stopper, all you get is the ad window...

      Cpu cycles is the perfect internet currency. Everyone who visits a website has them.

    11. Re:Distributed Computing by koakapo · · Score: 2, Interesting

      You're right on here. And to limit the potential even further is the effect that the *cause* might have on the altruism of the user. I for one would think differently about what I do with my spare clock cycles depending on who the end user is. Consider a pure research, potentiallly common good project versus a *pure research* funded by a AN Other Megacorp project. Its still an interesting idea though - BOINC would seem to me to be the next step in allowing different actions (programmed to a certain extent) to be carried out in a distributed fashion.

      --
      ----- Every day we get up and make the choice that the thing we are doing is the most valuable use of our time. -----
    12. Re:Distributed Computing by Anonymous Coward · · Score: 0

      eloquently put!

    13. Re:Distributed Computing by jaavaaguru · · Score: 1

      I wish schools would do their service to humanity and teach people proper grammar and punctuation.

    14. Re:Distributed Computing by ChaosDiscord · · Score: 2, Informative
      The problem with using distributed computing for everything is that the number of people willing to let others use processing power on their computer is not infinite.

      Off-topic. Teragrid is a dedicated distributed computing system. Various research centers are purchasing dedicated clusters to participate. For example, instead of three universities each purchasing a large cluster which will sometimes be idle; each will purchase a slightly smaller cluster and use each other's resources when available. In the particular case of high-energy physics, multiple sites were already collaborating to distribute the monstrous amounts of processing needed. Teragrid attempts to simplify this so that instead of human beings meeting and hand-distributing work, one person can simply run the "process-todays-events.sh" script and know that computers around the world are working on it.

      The holy grail in grid computing is to be able to purchase compute time much like you purchase electricity. You don't build a power-plant next door just because you need power for your manufacturing company. Why should you purchase, maintain, and upgrade a cluster for your, say, DNA sequencing? Why not just purchase computing power from a dedicated company? Need several thousand hours of compute time in a rush (say, because you're up against a deadline)? Rent it! (That's an over simplification, there are obviously reasons you do many things in house. Don't take it as a complete argument, take it as the elevator summary.)

    15. Re:Distributed Computing by LWATCDR · · Score: 1

      "I think that this is a Good Thing (TM). Distributed computing has the postential to not only further the cause of science, but to bridge the gap between the public and the scientists."

      I do not see how this is in anyway bridging the gap between the public and scientists. How is donating your free computer cycles any different than donating a few bucks? You are just donateing a resource cash or cycles to a project.
      I mean it is not like most people running Seti at home know what an FFT is.
      Not a bad thing mind you. And it could help science but it is no great social revolution.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
  2. My Personal Vision by Ignignot · · Score: 3, Interesting

    Is for a different kind of distributed computing client, one that allows you to sign up for different kinds of research programs. For example, you could say "donate half my spare time to aids research, and 1/4 to math reserach, and 1/4 to seti research". Also integrate a method of possible payment for work units completed (and a checking process to remove cheaters) and I think you will have an increase in effeciency in the entire way that we treat computers. Maybe instead of everyone shelling out thousands for top of the line computers whose peak output they only need for 5% of the time, they shell out a lot less for a networked computer that buys time from other people's machines. Clearly this wouldn't work in all applications (particularly those requiring low latency) but with improving network connections I think this is a possible future.

    --
    I submitted this story last night, and it didn't get posted.
    1. Re:My Personal Vision by jpr1nd · · Score: 5, Informative

      The BOINC platform (that seti@home is switching over to) has the ability to divide work between project as you suggest. Though I'm not really sure that there are very many other projects running on it.

    2. Re:My Personal Vision by billstr78 · · Score: 4, Interesting

      IBM is already making that vision a realization.

      They are in beta stages of a massive computation cycle for hire program that will allow organizations without the funding for an entire cluster to purchase cycles provided by a large IBM Power cluster.

      It will allow for a computation cycle market to eventually arise, much like the wheat, corn or gold markets. Companies will compete to provide cheaper cycles, small-time scientists around the world will be able to have thier computation intensive problems solved at a fraction of the current cost possible today.

    3. Re:My Personal Vision by Jeff+DeMaagd · · Score: 1

      I suspect that the money given per work unit would be too small.

      I wouldn't mind it if I could make a little cash to eventually help pay back for the computer, maintainance and energy (and heat removal in the summer), I doubt it would happen. I'd love to see someone prove me wrong.

    4. Re:My Personal Vision by BiggerIsBetter · · Score: 1

      I noticed you wrote "companies". I would have thought the largest resource is personal computers rather than corporate ones... That aside, if the EFF and similar provided clients and benefitted from the proceeds it would be a low hassel way to donate to their projects.

      --
      Forget thrust, drag, lift and weight. Airplanes fly because of money.
    5. Re:My Personal Vision by TheGavster · · Score: 1

      While I think that in the beginning, you might have a large number of small-scale clients, but eventually it will evolve in the same way that agricultural commodities have evolved, that is we'll have massive networks whose sole purpose is to contribute cycles to a for-pay distributed project.

      --
      "Because Science" is one step from "Because old book". Try "Because of my experiment testing my falsifiable assertion".
    6. Re:My Personal Vision by Corporal+Dan · · Score: 5, Informative
      From ClimatePrediction.net:
      Hi, we are still rolling along with BOINC, hoping for an alpha test by the end of the month, beta in July, and hopefully a release in August when David Anderson from SETI/BOINC will be visiting us for a few weeks.

      We threw together a simple sign-up page to be contacted (just once or twice when we're ready for beta testers), so if you want to try out the Windows, Linux, or Mac versions of CPDN please signup here!

      http://climateprediction.net/misc/beta.php
    7. Re:My Personal Vision by BiggerIsBetter · · Score: 1

      Probably true. Intermediary organisations will likely spring up to broker cycles and to aggregate smaller networks into salable commodities. As you say, just like agriculture.

      --
      Forget thrust, drag, lift and weight. Airplanes fly because of money.
    8. Re:My Personal Vision by Anonymous Coward · · Score: 0

      Unfortunately, somebody has to manage and certify those projects, before they can go through the system. It's also unlikely that underdog projects will ever benefit from it.

    9. Re:My Personal Vision by Cramer · · Score: 1

      The BOINC presentation from Madrid claims folding@home is using BOINC. Unless they've recoded the clients without telling anyone, this is incorrect. Folding@home uses Cosm as it's communications framework. There is no reference to BOINC in any of the documentations I've seen from f@h.

    10. Re:My Personal Vision by silicon+not+in+the+v · · Score: 1

      If only you could win lawsuits with distributed computing power. [/wishful]
      I'll bet there tons of people that would donate processing time to helping defend open source.

      --
      We may experience some slight turbulence and then...explode. -Capt. Mal Reynolds
  3. IMHO... by Zx-man · · Score: 0, Redundant

    ...some day there will be available computing solutions powerful enough be themselves not to need distribution of the computations...

    1. Re:IMHO... by SkaterGeek · · Score: 0

      Althought that is true, Their will always be someone who wants to try and decrypt 1024mb encryption. Just for the same reason that those

  4. Looks good to me by DruidBob · · Score: 4, Interesting

    There have been big projects like SETI@home, Great Internet Mersenne Prime Search, RC5-64 and many others.

    There are some like the Casino-21 http://www.climate-dynamics.rl.ac.uk/ and Evolution-at-Home http://www.evolutionary-research.org/ too.

    It's becoming easier to create the required code for distributed projects, and it most certanly has become easier to actaully get them distributed.

  5. Cue the jokes... by ninja0 · · Score: 2, Funny
    Just in time for Doom 3!

    But will it run Longhorn?

    --
    --If the world didn't suck, we'd all fall off.
    1. Re:Cue the jokes... by NanoGator · · Score: 2, Funny

      "But will it run Longhorn?"

      Is Apple giving away an iPod to whoever uses this joke for the 100,000,000th time or something?

      --
      "Derp de derp."
    2. Re:Cue the jokes... by Anonymous Coward · · Score: 0

      But more importantly, imagine a beowolf of these...

      ohh, wait

    3. Re:Cue the jokes... by OneDeeTenTee · · Score: 0

      "But will it run Longhorn?"

      Is Apple giving away an iPod to whoever uses this joke for the 100,000,000th time or something?


      No, but they are giving a free iPod to the 100,000,000th person to ask if Apple is giving away free iPods.

      --
      Stop the world; I need to get off.
    4. Re:Cue the jokes... by Ari_Haviv · · Score: 1

      Yes it's been running Longhorn for the past 20 years...in Japan

      --
      Join Team Mozilla #38050 Folding@home
  6. Grid and Distributed comptuing by Anonymous Coward · · Score: 5, Informative

    Important to remember that the Grid is a _kind_ of distributed computing. But the main thing about The Grid (like The Internet, The Grid is basically TeraGrid in the US + European Data Grid) is that it is suitable for handing off parallel jobs with high intercommunication needs to (i.e. MPI jobs). Not necessarily because these jobs can run across different nodes of the grid (though they can with MPI/Nexus or whatever it's called), but because each "node" in the Grid network is a HUGE MOFO LINUX CLUSTER or similar. The grid gives lots of physicists access to computing resources for parallel processing jobs that would otherwise be sitting idle.

    What /.ers generally mean by distributed computing is a bit different - most apps there are "embarrassingly parallel" ones you can just farm out. They don't need to chatter to eachother, just process some data and send it back to Central.

    1. Re:Grid and Distributed comptuing by Anonymous Coward · · Score: 1, Interesting

      There isn't a single Grid. Grid is a concept not an actual physical infrastructure, a way of working. In fact Grids can be ephemeral and dynamic based on the related concept of Virtual Organisations (VO)

      There are various collections of machines which have been designed to facilitate Grid computing (instances of Grids), TetraGrid being one of them. Some systems or Grids are suitable for some types of jobs, some for others. As you rightly note, for the likes of MPI you need relatively closely coupled nodes.

      Essentially Grid is aimed at being a way to link services (compute, data, visualisation, etc) into a cohesive whole such that ultimately you can have a fire-and-forget interface, with your work going to the place that best suits it, based on additional restrictions such as security, how much you are willing to pay for the results, and how long you are prepared to wait for them.

      The back-end processing can include ad-hoc conglomerations of machines that some see as traditional distributed computing (e.g. machines running BOINC based clients and so on).

      Currently much work is being done on the top-level wrappers that allow the groups of machines to be abstracted using web services and various transactional models based on web services (see www.gridforum.org), services built into multi-component workflows, and so on.

      AaronGTurner

    2. Re:Grid and Distributed comptuing by Anonymous Coward · · Score: 0

      Tere isn't a single internet either. I'm old enough to remember internet being short for internetwork. But the Internet (capital I) was the biggest and best, just as the joint US-EU Grid is the single biggest and best. If you hear a physicist saying "Grid" (rather than a marketroid or compsci nerd), chances are he means the teragrid + eurogrid + whatever the asian thing is called system.

    3. Re:Grid and Distributed comptuing by Anonymous Coward · · Score: 1, Interesting

      As other people have said whether there is "The
      Grid" or Grids is like "The Internet" vs multiple
      IP-protocol networks, including the private ones.

      However, for practical purposes there is one "The
      Grid" which will probably evolve into The Grid
      without the quotes, and that is the worldwide
      LHC Computing Grid, currently spread across North
      America, Europe and northwest Pacicifc Rim.

      Through EGEE (in Europe) and Open Science Grid
      (in the US) LCG technology will spread out into
      the wider scientific and research community.

      Here's one of LCG's current monitoring maps,
      showing the geographical spread of sites which
      are part of the production service today:

      http://goc.grid-support.ac.uk/gppmonWorld/gppmon_m aps/lcg2.html

      AM

    4. Re:Grid and Distributed comptuing by Anonymous Coward · · Score: 0

      I think we are about 5 years from a single Grid in the way we have (to all intents and purposes) a single Internet. And there will still be, even then, internal grids serving companies concerned about security in the same way we have intranets today.

      The model of how Grids will develop will most likely mirror TCP/IP and the Internet.

      In this sense OGSA and its related technolgies are the equivalent of systems built on TCP/IP today - e.g. ftp, http, and so on. What is needed is the demonstration of an application that unlocks the potential in the way the web browser did for the internet. It will come.

  7. The Google Compute Project by BoneThugND · · Score: 5, Informative

    Google's distributed OS has been discussed a lot on Slashdot, but it is more than just a search algorithm on their own servers:

    Google Compute is a feature of the Google Toolbar that enables your computer to help solve challenging scientific problems when it would otherwise be idle. When you enable Google Compute, your computer will download a small piece of a large research project and perform calculations on it that will then be included with the calculations performed by thousands of other computers doing the same thing. This process is known as distributed computing.

    The first beneficiary of this effort is Folding@home, a non-profit academic research project at Stanford University that is trying to understand the structure of proteins so they can develop better treatments for a number of illnesses. In the future Google Compute may allow you to also donate your computing time to other carefully selected worthwhile endeavors, including projects to improve Google and its services.

    - The Google Compute Project

    1. Re:The Google Compute Project by Zx-man · · Score: 1

      Just tell me, will you allow someone to use your computing power for a really _unknown_ purpose?

    2. Re:The Google Compute Project by billstr78 · · Score: 3, Informative

      Hmmm. I think you are confusing the distributed OS additions they've made to Linux for their own clusters with the idle process harvesting of thier Google Toolbar.

      The distributed OS and Filesystem in thier own clusters is far more advanced than a SETI@Home parallel work distribution algorithm. This OS/FS and projects like it are where the grid's heritige lies. There are many problems unique to the grid, but none of it could exist without the distributed system problems first solved in local area clusters.

  8. TeraGrid doesn't use "Public" computers by StateOfTheUnion · · Score: 2, Informative

    Distributed computing has its uses, but remeber: the public will only be willing to help you as long as they feel like they're contributing to something worthwhile. Uh, I'm not sure what this has to do with the TeraGrid . . . The TeraGrid is a distributed computing system . . . but it does not use the "public's" computers. It uses university and computing center machines across the USA (e.g. NCSA, Argonne National Labs, Purdue, etc.) .

    1. Re:TeraGrid doesn't use "Public" computers by billstr78 · · Score: 1

      Ah, but many other "true Grid" projects do. There really is not any distiction between Distributed Computing and the classic definition and implementation of a Grid. Un-federated wide-area computers collaborating to work on many different tasks, scheduled to run efficiently and effectivley.

    2. Re:TeraGrid doesn't use "Public" computers by Kynde · · Score: 1

      >> Distributed computing has its uses, but remeber: the public will only be willing to help you as long as they feel like they're contributing to something worthwhile.

      > Uh, I'm not sure what this has to do with the TeraGrid . . . The TeraGrid is a distributed computing system . . . but it does not use the "public's" computers.

      Oh, the header was "TeraGrid v. Distributed Computing" and the entry ended in a phrase "While the list of supercomputer sites and peak power is growing how is the world of Distributed Computing faring?". And you thought the earlier poster was off mark... :)

      --
      1 Earth is warming, 2 It's us, 3 it's royally bad, 4 we need to take action NOW
    3. Re:TeraGrid doesn't use "Public" computers by Seanasy · · Score: 1

      They are public in the sense that they are publically funded. They are not owned by a private corporation or restricted to classified use.

    4. Re:TeraGrid doesn't use "Public" computers by NeoSkandranon · · Score: 1

      "Public" implies that usage isn't restricted to people who work there

      --
      If you can't see the value in jet powered ants you should turn in your nerd card. - Dunbal (464142)
    5. Re:TeraGrid doesn't use "Public" computers by Seanasy · · Score: 1

      Correct. Usage is not restricted to the people who work there. In fact, 99.999% of the work done on the NSF supercomputing resources is done by people who do not work at the supercomputing centers. Researchers from U.S. institutions apply for time on the supercomputers. The centers are a computing resource for the whole country.

    6. Re:TeraGrid doesn't use "Public" computers by deanj · · Score: 2, Informative

      Well, that's an interesting definition of "public" that doesn't appear to be anyone else's.

      Despite that fact, you are correct... most of the work that's run on those things is done by people that aren't part of the supercomputer centers, or the ANL. (There are a few "chief scientists" that DO run their work there, so I wouldn't say it's the 99.99-whatever% that another poster did).

      It's NOT available for the general public's use though. Even if you work at those places, that doesn't give you ANY certainty that you'll get to run anything on it....it's likely you WON'T, unless you are running tests or something like that.

    7. Re:TeraGrid doesn't use "Public" computers by NeoSkandranon · · Score: 1

      It's NOT available for the general public's use though

      Thanks, you said what I was thinking better than I was able to. I don't care who funds it, very few people are going to call something "public" if the general public cannot access it.

      --
      If you can't see the value in jet powered ants you should turn in your nerd card. - Dunbal (464142)
  9. Just imagine a Beowulf cluster of... by Anonymous Coward · · Score: 2, Funny

    ...Anonymous Cowards!

  10. Payment for Work Units by Iesus_Christus · · Score: 5, Interesting

    The idea of payment for work units is interesting. While it would certainly provide incentive for participating in distributed computing projects, I can see two problems with it already:

    1) Getting the money to pay people. One advantage of distributed computing is that you don't have to pay for time on expensive cluster. That advantage disappears when you pay distributed computing users. Of course, it may still turn out to be cheaper, and there may be users willing to participate for free.

    2) Botnets and profit. We all know of spammers using zombies to peddle goods, and of script kiddies using them to DDoS. What if some enterprising but immoral person decided to use the computing power of his zombies to profit off of the distributed computing payments? With enough zombies, he could easily make a good amount of money off of other people's computers.

    1. Re:Payment for Work Units by SkaterGeek · · Score: 0

      Well, Yea, but he is still helping out science... Which in my mind makes it a grey area :P

    2. Re:Payment for Work Units by Caseylite · · Score: 5, Interesting

      Another way to pay people would be to offer incentives such as allowing me to write off your process time (wear and tear on my system) as a charitable donation to your non-profit group. ~Casey

    3. Re:Payment for Work Units by taernim · · Score: 1

      Simple fix: Provide an authentication method which would require providing real information before you can accept the money. Then track down where all the bots' money is going. Bam, you've got your crook.

      --
      "PC Load Letter? What the $@#% does that mean?!"
    4. Re:Payment for Work Units by Anonymous Coward · · Score: 2, Insightful

      1) Somebody does pay for time on an expensive cluster. They are built and maintained with your (and my) tax money.
      2) Yes, security is a big issue in Grid computing. And it ain't there yet.

    5. Re:Payment for Work Units by Lost+Engineer · · Score: 2, Interesting

      That might be legal already. IANAL, if 60% of my computing power goes to SETI, can I write off that percentage of the depreciation of the box as a charitable donation?

    6. Re:Payment for Work Units by silicon+not+in+the+v · · Score: 1
      2) Botnets and profit. We all know of spammers using zombies to peddle goods, and of script kiddies using them to DDoS. What if some enterprising but immoral person decided to use the computing power of his zombies to profit off of the distributed computing payments? With enough zombies, he could easily make a good amount of money off of other people's computers.
      So? I'd rather the spammers (or "clustered computer users" in this case) used their zombie machines for that. Then it would get the spam to stop. Wow, you could solve two problems at once. Make distributed computing pay better than spamming, and you would stop the spam problem, too.
      --
      We may experience some slight turbulence and then...explode. -Capt. Mal Reynolds
    7. Re:Payment for Work Units by mdfst13 · · Score: 1

      ObAOL: This was my first thought as well. If other distributed projects paid as well or better than spamming, then fewer virus writers would waste time on spam. This would also shift the burden off the system (distributed computing is CPU intensive rather than bandwidth intensive) and leave it purely on the individuals with insecure PCs.

      The only bad possibility is that it might increase the number of zombies. However, this is not necessarily so. It is not evident to me that it would be simple to increase the number of zombies. If it was, why haven't they already done so?

  11. Look, this is really very simple by bersl2 · · Score: 4, Insightful

    If you can divide your problem into very many independent subproblems, clustering or distributed computing will work well. If not, your best bet is a true supercomputer.

    So: SETI@Home splits up its scans into sections, each of which do not depend on any other; therefore, a distributed solution is efficient. However, the Earth Simulator deals with chaotic systems (or so I would assume), which do not independently parallelize; this is where having hundreds of processors and terabytes of RAM and using something like NUMA is greatly more efficient.

    In short: use the right tool for the job.

    1. Re:Look, this is really very simple by billstr78 · · Score: 4, Informative

      As noted in earlier comments, the TeraGrid's individual nodes _are_ NUMA clusters. This allows large, non-parallel computations to be run without individual service level agreements, login coordination and scheduling issues gumming up the process. The TeraGrid is an effort to remove the administrative nightmare's keeping most clusters from being fully utilized and most small-time scientists work from being completed.

    2. Re:Look, this is really very simple by trifakir · · Score: 1
      However, the Earth Simulator deals with chaotic systems (or so I would assume), which do not independently parallelize; this is where having hundreds of processors and terabytes of RAM and using something like NUMA is greatly more efficient.

      AFAU Earth Simulator solves mostly nothing more than a big Finite Element Method problems. Speed-up of such problems depends much on the connection time as normally the FEM solver exchange borders every several iterations or so, while the amount of data is not so much (hence no bandwidth utilization). Now with global distributed computing projects this is a problem as if my neighbours are distributed non-unformly around the globe, I have to wait for the last one to return border data so I can continue with the computation...

  12. Access and Denial by nevyan · · Score: 4, Insightful

    The problem with large projects like TeraGrid, EarthSimulator and other supercomputer sites is that the underfunded _brilliant_ ideas are left behind by those who can afford to pay for or build these centers and sites.

    While TeraGrid is a powerfool tool it is one that thousands of scientists and laboratories are standing in line to use. Meanwhile Distributed Computing is available, cheap and relatively quick.

    While it may look good on your project to say you used a IBM BlueGENE or DeepComp 6800 is it really worth the extra cost and waiting in line for your chance to use?

    True Distributed Computing is the way to go and shows positive results. Now we just need to tinker with it some more!

    1. Re:Access and Denial by billstr78 · · Score: 1

      There are many other Grid projects that don't have the long waiting line of TeraGrid. Much designed with the Globus toolkit allows anybody to contribute CPU power and just about anyone to run jobs. It's less powerfull and is more usefull for highly-parallel computations, but it is more true to the definition of what a Grid really is.

    2. Re:Access and Denial by Seanasy · · Score: 4, Interesting
      The problem with large projects like TeraGrid, EarthSimulator and other supercomputer sites is that the underfunded _brilliant_ ideas are left behind by those who can afford to pay for or build these centers and sites.

      What are you talking about? These are publically funded resources. You apply to the NSF for time on these machines. If you're at a U.S. institution and you have a real need for supercomputing you can get time on these machines.

      While TeraGrid is a powerfool tool it is one that thousands of scientists and laboratories are standing in line to use. Meanwhile Distributed Computing is available, cheap and relatively quick.

      And Distributed Computing can't even begin to solve some of the problems that supercomputers are designed to address.

      While it may look good on your project to say you used a IBM BlueGENE or DeepComp 6800 is it really worth the extra cost and waiting in line for your chance to use?

      Yes. When you want to simulate every molecule of a proteing in a water solution (~17000 atoms worth) you need a supercomputer. DC can't do it.

      True Distributed Computing is the way to go and shows positive results. Now we just need to tinker with it some more!

      DC is neither a religion nor a panacea.

    3. Re:Access and Denial by aminorex · · Score: 2, Insightful

      > When you want to simulate every molecule of a
      > proteing in a water solution (~17000 atoms worth)
      > you need a supercomputer.

      But if you want to simulate a billion molecules,
      DC is the way to go: Then it's not a tightly
      coupled system.

      --
      -I like my women like I like my tea: green-
    4. Re:Access and Denial by Seanasy · · Score: 1

      True, if you're not simulating how those molecules interact. Don't get me wrong, DC is great for a lot of things. My point is that for certain problems DC just isn't up to the task.

    5. Re:Access and Denial by xenocide2 · · Score: 1

      Maybe I'm confused about just what issue is at hand, but I thought the terraGrid was a distributed computing machine itself, located at nine facilities and growing. In fact, the grid part pretty much implies a distributed and omnipresent system. Are you for DC, for TerraGrid, or against both?

      --
      I Browse at +4 Flamebait

      Open Source Sysadmin

    6. Re:Access and Denial by Donny+Smith · · Score: 1

      The problem you describe is not a problem of large projects - that is a problem of small scientists.
      They should go elsewhere. If they're really scientists they probably have a full time job at an institution with some kind of cluster or are affiliated with an institution that can provide access to resources commesurate with the scientist's ability.

      If there's a guy with no previous record who claims he can create AI if they let him use TeraGrid exclusively for 6 months, should he be given access to TeraGrid? I say no.

    7. Re:Access and Denial by CommieOverlord · · Score: 1

      TeraGrid is distributed across several sites, however, most jobs will run at only one site and consume 'x' CPUS. The job scheduler running at the head of the cluster will simply schedule each job to run on the best resource available.

      Most applications will face a performance plateau at

      Most scientific applications are network intensive, and 2 CPUs with a local, high-bandwidth/low-latency connection (GigE, IB, Quadrics) could very well outperform 100 CPUs distributed across the internet.

      Only for very, very easily parallelized applications (like SETI, rendering farms) is DC a viable model. There are very few applications that work like that though.

    8. Re:Access and Denial by Seanasy · · Score: 1

      The TeraGrid is distributed-computing but not like what the poster is calling DC. The poster is talking about things like SETI and encryption cracking. They are massively parrallel -- i.e. problems that can be broken up up into discrete units and worked on by unconnected machines.

      Anyway, I'm for both and against misinformation about what both are. The poster confuses the issue terribly.

    9. Re:Access and Denial by Anonymous Coward · · Score: 0

      As someone mentioned - it would be great for modeling a billion atoms... as long as they are discrete elemnets with little to no interactiosn with each other. In order to figure out how a protein acts in water we need to know how each altom interacts with all of its neighbors. Which means there has to be a lot of communication betweem each atom (or node). In these cases the bottleneck would be the network and not the CPU.

      Here are some things DC would be very good at
      Sorting through 1,000,000 possible results to a specific problem
      Analyzing hundreds of thousands of discrete observations

      Here are some things it would be bad at
      Weather Simulation
      Oceanographic Modeling
      Protein Folding

      The difference between the two classes of problem is how interconnected the components of the solution are. In the DC problems each compute node can come up with a useful answer independent of all the other nodes. For example, the prime search doesn't require a node to communicate with any other node to come up with an answer. In supercomputing applications each computer node interacts with many other nodes talking back and forth and affecting each others results. For example, in weather modeling an area is broken into many small cells - but weather doesn't stay in one cell - something that happens in Cell 1514 will have an impact on Cells 1515, 1519, and 1692-1782 so they all have to communicate their results constantly.

      I suppose its possible DC could do weather modeling but the network overhead would make it a very bad fit.

    10. Re:Access and Denial by ultranova · · Score: 1

      The problem with large projects like TeraGrid, EarthSimulator and other supercomputer sites is that the underfunded _brilliant_ ideas are left behind by those who can afford to pay for or build these centers and sites.

      What are you talking about? These are publically funded resources. You apply to the NSF for time on these machines. If you're at a U.S. institution and you have a real need for supercomputing you can get time on these machines.

      Suppose you're not at a U.S. institution and/or have such a revolutionary idea that whoever allocates the computing time on a supercomputer doesn't understand it ? Or maybe you want to simulate a new kind of uncensorable, untracable, wireless, global communication system ? That would certainly be an extremely important idea, but I'd be surprised if it got public funding, for the simple reason that it would jeopardize the power of every government on the planet... Or maybe you have invented a device that can copy physical objects, but need one final huge calculation done before being able to build it ? Again, it would threaten those in power and would be certain to get no funding...

      The really important inventions are those that empower the public, but they will not get public funding - why would the current elite help dismantle their power structure ?

      Besides, who do you think takes precedence - an old expert in his field, or some previously unknown John Smith ? Does it really matter if the currency to use is prestige and not money ? Chances are that you still can't pay it...

      So what does that leave the brilliant, but unknown John Smith ? It leaves him with distributed computing. It is going to be easier for John Smith sell his idea to Joe Average than to Bill Bureuacrat, simply because Joe won't be fired if it turns out that John Smith's research is humbug, but Bill might be.

      And Distributed Computing can't even begin to solve some of the problems that supercomputers are designed to address.

      Wrong. A supercomputer has an advantage over Distributed Computer in certain tasks because the processors share memory. However, both supercomputers and DC's were designed to do the same thing - solve a lot of parallel tasks simultaneously. There is not a fundamental difference between them. A DC is simply a supercomputer which trades communication speed between CPUs for raw CPU power. Now, if an application requires lots of communication between CPUs (threads) , then it will also require a lot of locking, making it questinable if it was suitable for parallelized execution in the first place.

      I wonder if we aren't going to start seeing 'live' DC computer systems, where the component computers are going to be in real-time communication with the server, acquiring locks, getting pieces of the global (shared) memoryspace and committing back changes. Maybe such a system would have an old-style supercomputer as its heart.

      Yes. When you want to simulate every molecule of a proteing in a water solution (~17000 atoms worth) you need a supercomputer. DC can't do it.

      Are you sure ? If I wanted to simulate this, I'd write a program that calculated the forces each particle exerts on each other given their current position and other attributes, then calculate their position and other attributes in the next iteration based on their current position, speed and those forces.

      Now, to DC that, I'd do the following:

      1. Have the server send each client the current position and state of each atom, as well as telling them what atom is "their" atom(s).
      2. Have each client calculate the forces "their" atom is causing on every other atom.
      3. Send the results to the central server.
      4. Have the server send each client all the forces affecting "their" atom.
      5. Have each client calculate the new position and other attributes of "their atom".
      6. Have the cl
      --

      Forget magic. Any technology distinguishable from divine power is insufficiently advanced.

    11. Re:Access and Denial by Seanasy · · Score: 1

      There is no conspiracy. Supercomputers solve problems that DC cannot begin to address. Your example DC simualtion wouldn't work because of the time it would take to communicate all that information. The latency issue alone would make it unsusable.

      Do you really think only prestigous scientists get to use supercomputers? That the little guy is being intentionally kept down by the supercomputing man? The idea is ludicrous. If someone has a truly 'brilliant idea' they will get on the superomputer.

      There are things that supercomputers can do that no other computing systems can do. If that wasn't true, people would be doing weather forecasting with their Forecasting@Home clients.

      No offense, but you don't understand supercomputing.

    12. Re:Access and Denial by Salis · · Score: 1

      By your logic, I'm famous.

      But I'm not, so your logic is wrong.

      I use supercomputers all the time.

      --
      Favorite /. tagline: "On the eighth day, God created FORTRAN." And it was good.
  13. Why the versus? by Anonymous Coward · · Score: 5, Insightful

    I don't understand why we are asking how a hammer is doing compared to a screwdriver? Both are varied computational models, and are at best architectural descriptions as titles; TeraGrid v. Distributed Computing. They have specific application domains and are used to solve different types of problems. One dealing with non-discrete data and experimental calculations (TeraGrid), the other focused on discrete chunks of data being filtered or rendered and are non-time nor message dependent (Distributed Computing; as defined by the Nevyan's reference). You have two tools in your tool chest. What makes one better than the other? They have completely different jobs that they tackle. They both will be successful. They need not be in competition.

    1. Re:Why the versus? by Anonymous Coward · · Score: 1
      I don't understand why we are asking how a hammer is doing compared to a screwdriver

      We're not, we're asking how TeraGrid is doing compared to Distrubuted Computing. Read the Fine Title!

      What? Oh? Nevermind...

  14. Did I read that right? by rumblin'rabbit · · Score: 3, Insightful

    4.5 Teraflops for $100 million? Surely not. That much compute power can be had for 1/20'th the price. What am I missing?

    1. Re:Did I read that right? by nevyan · · Score: 2, Informative

      NSF award in August 2001 of $53 million for intial funding of four sites: NCSA, SDSC, CACR, ANL.

      Pittsburgh Suprecomputing Center joined in when NSF announced supplementary funding with $35 million.

      $10 million was supplied by NSF in September 2003 adding ORNL, Purdue, Indiana U., and TACC.

      Total: $98,000,000.00 roughly.

      What does government spending on the TeraGrid give you? 4.5 Teraflops distributed...

      Nice.

    2. Re:Did I read that right? by Detritus · · Score: 4, Insightful

      FLOPS are easy, low latency, high bandwidth communications paths are hard.

      --
      Mea navis aericumbens anguillis abundat
    3. Re:Did I read that right? by Anonymous Coward · · Score: 2, Informative

      PSC has 6+TF alone in the TCS cluster, plus a giant NUMA system.. let alone the four DTF sites with 3TF+ *each*. Plus, they all have at least 30Gb/s uplink to each other.

      Yes, the money was spent somewhere, and not on toilet seats.

    4. Re:Did I read that right? by Seanasy · · Score: 3, Informative

      One project on the Teragrid used 17 Teraflops. The poster hasn't done his/her research on the Teragrid.

    5. Re:Did I read that right? by Seanasy · · Score: 3, Informative
      • PSC: 6 TFlops
      • NCSA + Caltech + ANL + SDSC: 15 TFlops

      I would encourage you to visit www.teragrid.org and read more than the front page to learn what the Teragrid is

    6. Re:Did I read that right? by Anonymous Coward · · Score: 0

      I think some people think that you can hook up some 100mbps LAN and that will suffice. Honestly, the node interconnects we are talking about are essentially methods of directly connecting the memory bus of one node to another. These interconnects are actually one of the biggest bottlenecks in these SC systems.

  15. Mac OS X users! by arc.light · · Score: 1, Informative

    Charles Parnot of Stanford University is looking for your spare CPU cycles for his distributed XGrid@Stanford project.

  16. Misinformed by Seanasy · · Score: 3, Informative
    ...and many scientists with little funding but bright ideas are being left behind.

    Care to cite a source?

    When you apply to the PACI program you get a grant of Service Units -- i.e. time on the computers. You don't need huge amounts of funding. The requirements state that you need to be a researcher at a U.S. institution. It also helps if you can show that you actually need and can use that kind of computing power.

    And, please, distributed computing and supercomputing are not synonymous in terms of what problems they address. Distributed computing cannot replace supercomputers in every case. DC is good for a limited set of problems.

    Lastly, an example of Teragrid research: Ketchup on the Grid with Joysticks.

  17. Wolfgrid by admiralfrijole · · Score: 5, Informative
    Wolfgrid, the NCSU Community Supercomputer, is coming along nicely.

    It is based on Apple's XGrid, and uses volunteers from the Mac community here at NCSU, as well as some of the lab macs, and soon we will hopefully have official Linux and Windows clients, maybe even Solaris, to run on more of the computers around campus.

    There is even a really nice web interface that shows the active nodes and their status, as well as the aggregate power of the two clusters.

    Its really nice, anyone who is part of the grid can just fire up the controller and submit a job, I am a part of the lower power grid since my TiBook is only a 667, but I was able to connect up and do the Mandelbrot Set thing that comes with XGrid at a level equal to around 7 or 8 GHz.

    There are some screenshots here

    --
    e to the pi i plus one equals zero
    1. Re:Wolfgrid by Anonymous Coward · · Score: 0

      Apple have kinda misnamed xgrid. It's ad-hoc clustering software. Not that that that's bad in itself, but such a cluster might be just one more node in the Grid proper. We now have apple fanboys running around saying "look we're doing Grid Computing", when in fact they're doing NeXT-like cool-cluster-computing. Yes, it's cool. No, it's not Grid, in itself. Actually, the single most important aspect of the Grid is single sign-on. What MS couldn't coerce people to do with Passport, the Grid has kinda got people to do voluntarily - everyone who uses the Grid has a personal "grid cert", a private/public key pair that identifies them and their programs on the Grid.

    2. Re:Wolfgrid by Anonymous Coward · · Score: 0

      The next challenges will be:-

      1. Better virtual organisation management to make identify management more scalable for the system administrator who wishes to make a system available to a group of users. The physics community seems to be very much involved in this as the problem has bitten them first. Being able to have this level management makes ad-hoc addition or subtraction of resources which are abstracted at the level at which they can be plugged into the wider infrastructure easier.

      2. Negotiation for services. This is the whole gamut of negotiation based on price, service level, time scales, security, bandwidth, and the like. Essentially a multi-pass broker, but covering more aspects than most brokers currently involve.

      3. Intelligent agents for negotiation. This allows intelligent onward delegation and negotation on your behalf. This is the counterpart to a system administrator being easily able to add a machine - you can now set general policies about how you wish to have your job run, and then send it out into the world to find a suitable resource.

      4. On-the-fly construction of remote environments. This increases the number of systems on which your job can run. Ideally this would be via sandboxed virtual servers.

      5. Further improvements in being able to create ad-hoc distributed systems. There is a lot of good work going on in this area. This allows organisations to make best use of their own systems (or offer them out commercially, perhaps).

      Point 2 covers a whole series of issues - there is a lot of work that needs to go into automatic creation of service level agreements that are legally binding, for example.

      AaronGTurner

      (must go and find my password again...)

  18. Meta-programs for distributed computing by giveuptheghost · · Score: 3, Informative

    This is exactly why there's BOINC, distributed.net, Grid.org, etc. that have multiple projects served to one user-installed program: Instead of projects having to compete for real-time resources, they run work units one after the other, and users can pick and choose which projects are run. This should prevent said burnout for most users.

  19. Recommend good cause to donate my free cycles to? by brienv · · Score: 2

    I think the Seti distributed thing is great and all, but I'd like to donate my free cycles to a project with more immediate (and likely) benefits to mankind. Any suggestions?

    -Brien

  20. already done! by zogger · · Score: 3, Funny

    " Just tell me, will you allow someone to use your computing power for a really _unknown_ purpose?"

    Just think, for once redmond got it right! Windows has had that feature for years!

  21. Re:Recommend good cause to donate my free cycles t by antispam_ben · · Score: 4, Informative

    There's this one, it's probably what you want:

    http://www.stanford.edu/group/pandegroup/folding/

    But I'm quite selfish (and actually interested in primes abd or at least know more about them than I do about protiens), and there are entities offering big prize money for big primes, and if one of my machines finds one, I'll get big bucks:

    http://mersenne.org

    --
    Tag lost or not installed.
  22. Re:Imagine a beowulf cluster of.... by antispam_ben · · Score: 0, Offtopic

    Actually Mr Mod, its not Offtopic, Redundant maybe but Offtopic no.

    Unfair! Let's all do some Distributed Metamoderating!

    --
    Tag lost or not installed.
  23. I call BS by Prof.+Pi · · Score: 4, Informative
    True Distributed Computing is the way to go and shows positive results. Now we just need to tinker with it some more!

    It's too bad that whoever modded this Insightful doesn't know much about parallel applications.

    DC is fine and very cost-effective for its niche of applications, which is those that are "embarassingly parallel." This is (somewhat circularly) defined as being very easy to parallelize on a DC machine. What characterizes these apps is very low communications between different tasks, which works for DC because the high network latency doesn't get in the way.

    I've love to see you try to put Conjugate Gradiant (CG) on a distributed system. It involves large matrix-vector multiplies that inherently require lots of vector fragments passing between the processors. CG is one of the 8 NAS Parallel Benchmarks, and if you look at Beowulf papers that use NAS, you'll see that they often leave out CG because performance is so bad. If it's low on a Beowulf, where the network is presumed to be local and dedicated, it will totally suck on anything with a typical high-latency/low-bandwidth network.

  24. Re:Recommend good cause to donate my free cycles t by Feyr · · Score: 0

    download porn

  25. Re:Recommend good cause to donate my free cycles t by Ari_Haviv · · Score: 1

    Yes I like the folding@home project. But you may find something interesting from the projects listed here: http://www.aspenleaf.com/distributed/distrib-proje cts.html

    yes I know my sig is coming up

    --
    Join Team Mozilla #38050 Folding@home
  26. honestly by Anonymous Coward · · Score: 0

    There really is no comparison.

    We're not really just measuring flops; you are limited by bandwidth with distributed computing.

    A distributed client can donate as much cpu as it wants but scientific problems requiring large datasets + processing + sending it back isnt going to compare to supercomputing nodes which have multigigabit network connections and distributed filesystems running at gigabytes/sec.

    Not all problems can be broken into small computational chunks.

  27. Re:Recommend good cause to donate my free cycles t by the+gnat · · Score: 3, Interesting

    Actually, as an experimental and theoretical biophysicist-in-training who knows about proteins, I'd say the folding project is only marginally more useful than the prime number search. Most biology research projects, especially computational ones, has to be sold on the basis of potential benefits to human medicine. Such advertising does not actually mean that medical benefits exist.

    While there's much to learn from studies of protein folding, there's very little medical importance to purely theoretical simulations. Since the delusion that we'll be able to replace laboratory research with really big computers is attractive to people who know nothing about biology, the impact of this type of research gets vastly overstated.

    On the other hand, Folding@Home has already yielded far more interesting results (if not exactly "useful" outside of the world of biophysics) than SETI@Home probably ever will, so go for it.

  28. Longhorn released... by Anonymous Coward · · Score: 0

    in Japan!

  29. Why being so negative? by andr0meda · · Score: 1



    Now the public can chose what problems that it wants solved

    Jesus, there's a horrible thought. I've met the public (and seen it's choice in TV). I'd rather have monkeys choose.


    Well the idea is that, you, as a member of said public, take responsibility more serious instead of just dissing it because others do.

    --
    With great power comes great electricity bills.
  30. Re:Recommend good cause to donate my free cycles t by Patrik_AKA_RedX · · Score: 2, Insightful
    On the other hand, Folding@Home has already yielded far more interesting results (if not exactly "useful" outside of the world of biophysics) than SETI@Home probably ever will, so go for it.
    IMHO seti@home's benefits lay in the spin-offs. Advancements in distributed computing and digital signal processing. I doubt they would find an alien email anytime soon, but they are very likely to come up with better ways to analyse signals. (read: better cell phones and digital cameras)
  31. Re:Recommend good cause to donate my free cycles t by scoser · · Score: 2, Informative

    Since someone has already posted the Aspenleaf list of projects, I'd like to point out my personal favorite, Find-a-Drug. It has actually returned positive anti-cancer and anti-AIDS results that have been lab tested and verified by the National Institute of Health. If that doesn't have immediate benefit to mankind written all over it, I don't know what does.

  32. This isn't Grid Computing. by hal2814 · · Score: 2, Informative

    "The "Grid" portion of the TeraGrid reflects the idea of harnessing and using distributed computers, data storage systems, networks, and other resources as if they were a single massive system." (from the TeraGrid FAQ)

    It looks like TeraGrid is latching onto a catchword in order to boost awareness of their system. What they are describing here is not Grid computing at all. Grid computing was designed to take advantage of all the dead cycles that computers typically have. The idea is that someone might have a large group of computers that do not take full advantage fo their computational cycles (like a large lab for reading e-mails and browsing the Internet). With Grid computing you would take these computers (not some Itanium cluster like TeraGrid is doing) and distribute work accross these nodes that can be performed during otherwise dead cycles. (I have no sources immedeately available but check out Grid computing through the ACM or something and you'll see plenty of info on what Grid computing really is.)

    This is what Seti@home does. It takes underutilized machines and runs computations on them. TeraGrid on the other hand, takes large clusters of otherwise unused machines and lays an abstraction over them that makes them look like one large supercomputer. This is nothing more than a distribution strategy. It looks like a nice distribution system that has the potential to scale well, but it's not Grid computing and it's nothing new.

    1. Re:This isn't Grid Computing. by Anonymous Coward · · Score: 1, Interesting

      ""The "Grid" portion of the TeraGrid reflects the idea of harnessing and using distributed computers, data storage systems, networks, and other resources as if they were a single massive system." (from the TeraGrid FAQ)

      It looks like TeraGrid is latching onto a catchword in order to boost awareness of their system. What they are describing here is not Grid computing at all."


      No, they are right and you are wrong.

      Using spare cycles is one thing you can
      do with Grid technology, but it is not the
      essential quality of Grids. "Grid" was coined by
      Ian Foster et al by analogy with electricity power
      grids. You plug into the wall and "it just works."

      Here is Ian's discussion of what is and is not
      a Grid from a couple of years ago:

      http://www.gridtoday.com/02/0722/100136.html

      AM

    2. Re:This isn't Grid Computing. by deanj · · Score: 2, Informative

      Whoa..you're so off-base on this it's not funny.

      The TeraGrid people (ANL, Ian Foster, etc) are the ones that coined the term "Grid" in the first place!

      You might not like their use of that term, but since they're the ones that came up with it in the first place, they're more right than you are.

  33. Your Gorilla agent by rvw · · Score: 1, Interesting

    Jesus, there's a horrible thought. I've met the public (and seen it's choice in TV). I'd rather have monkeys choose.

    You might be right about those monkeys. In Holland, we have the Beursgorilla (http://www.beursgorilla.nl/). This gorilla decides what stock to buy or sell based on the bananas presented to him. He proves to be better at "advising" than most of the other "real" and expensive advisors.

    For me, the DistributedComputingGorilla might decide what project will run on my computer.

  34. Teragrid runs a lot of Linux by haruchai · · Score: 4, Informative

    It's an all *nix environment presently totalling around 4200 CPUs of which 96 ( in a single cluster)
    is AIX 5.2, 3128 (WOW!!) is on Tru64 (in 2 clusters) and the rest, distributed in 5 clusters
    are some form of Linux.
    Two of the clusters have a second phase which together will add 316 CPUs on Linux.

    As of October 1 of this year, 5 clusters at 3 sites will be added with the OS / CPU breakdown as follows:
    Linux : 1800 CPUs in 3 clusters
    AIX 5.1 : 320 in 1 cluster
    Solaris 9 : 256 in 1 cluster

    That's an awful lot of Unix and a buttload of Tru64 and Linux

    --
    Pain is merely failure leaving the body
  35. Science teaching not at fault- make English easier by justanyone · · Score: 1
    Speaking as a person who loves the diversity and complexity possible in the English language: we've got one Fsck'd up Language!

    To list only a few rants:
    • vastly inconsistent spelling;
    • multiple phonemes attached to one character (I think this is the way to properly note that one letter has lots of different sounds);
    • a complete lack of a distinction between singlular and plural 'you' (a very common situation, solved in the Southern U.S. with "y'all");
    • many homonyms; no representation (alphabetic characters) for sounds common in many other world languages including arabic 'Q'/'K', swahili tounge-click, and hebrew 'chai' (as in Chaunukka).
    On the plus side, we do advantages of:
    • very few grammatical cases except possessive (word endings - for example, word Jack vs. word Jack's, although plural Jacks does confuse!);
    • No silly male/female/neutral distinctions for everyday objects that have no inherent gender (since when should a table be presumed male or femaale?!?!);
    • Large vocabulary - the number of words is very large compared to many other languages, which leads to fewer homonyms;
    • since we have few grammatical cases, rhyming can be more difficult and therefore more beautiful;
    • Verbs do not change endings when used on different subjects (he fell, she fell, we fell, they fell) (vs. french's je vais, tu va, il va, nous avons, etc.).
    Just a rant, but maybe if we all decided to do something about this mother tounge of ours, we might change it for the better (being ease of use, distinctiveness, expressiveness, reduction of confusion, etc.).

    -- Kevin J. Rice

  36. folding proteins: PRIONS by Anonymous Coward · · Score: 0
    uhm... as a non-biophysicist, i may be totally off base, here, but - here goes, anyway...

    PRIONS

    (Creutzfeldt-Jacob Disease [aka Mad Cow], et alia)

  37. Grid computing is a crock of shit by Anonymous Coward · · Score: 0

    Grid computing is a crock of shit.

    It's my job to try and make this crock of shit work.

    Teragrid is one of several grids building on Globus. Globus is, at best, "research grade" software. Frequent re-writes. Critical bugs ignored ("we'll fix it in a later version"). The developers clearly only do the most simplistic of tests and then are surprised when the system doesn't scale into the thousands of concurrant jobs real users nead.

    On top of all of this, no one is seriously attacking the real problem: cross-administrative domain issues. Want to run on a given grid? Well, you'll need to fill out different forms for different sites, obey different policies at different sites, and tweak your tasks to match each sites different configuration. The goal of simply sending your jobs off to run somewhere with minimal effort is still in the depressingly distant future.

    Any Globus based grid that is doing real work at this time is only doing it because there is a small core of techies at the bottom, desperately bailing water. A grid might compose 10 or 20 sites, but a given research will typically only use one or two that they've finally gotten working.

    The grid, the idea of being able to get (perhaps purchasing) compute time as easily as you get electricity, is a noble goal. I think that in the future it will be key to enabling compute-heavy research. But anyone telling you that today's grids are great and wonderful is just blowing smoke up your ass in the hope you'll give them more grant money. This sort of unrealistic happiness just gives us techies ulcers when people discover that everything is broken and that we're bailing water. "Hey, we were told everything was wonderful, what's taking so damn long." And we can't be honest and tell them "You were lied to. This is research grade software, strictly experimental and completely inappropriate for professional use."

    Feh.

    Anonymous for obvious reasons.

  38. Its a Cookbook by Gothmolly · · Score: 1

    The only things which serve humanity are aliens. To say that science has such a purpose places scientists, who are the best and brightest, in the position of slaves. Worse, it enslaves them to the worst and most ignorant - the degree of their evil or ignorance dictates how much moral claim they can place on the scientists. It reminds me of a quote "You have the privilege of strength, but I have the right of weakness!"

    Disgusting. Your Nickname says it all. My only hope is that after I'm long dead, you inherit the world that you're advocating.

    --
    I want to delete my account but Slashdot doesn't allow it.
  39. Current and Future BOINC projects by SETIGuy · · Score: 1
    Though I'm not really sure that there are very many other projects running on it.

    Currently available projects are...

    SETI@home
    Predictor-Protein structure prediction

    Coming soon....

    climateprediction.net
    Folding@home

    Farther in the future (i.e. pending funding)...
    Einstein@home -- a search for gravitational waves.

    In the conceptual stage, since sometime last week...
    neuralnet.net -- studies of the nature of intelligence using neural nets and genetic algorithms

  40. Re:Recommend good cause to donate my free cycles t by Quinn_Inuit · · Score: 1

    On the other hand, Folding@Home has already yielded far more interesting results (if not exactly "useful" outside of the world of biophysics) than SETI@Home probably ever will, so go for it.

    The key word there is "probably." One positive, one single positive, from SETI will be arguably the greatest discovery in human history (certainly in the top 5).

    --

    Stop learning! Only you can prevent esoterrorism.
  41. Re:Science teaching not at fault- make English eas by hesiod · · Score: 1

    > solved in the Southern U.S. with "y'all"

    Who would have guessed that ignorant hicks could improve the English language?

    I suppose it makes sense, though, that someone with less education could come up with simple & obvious solutions (although I'd prefer using a word that didn't make you sound stupid).