Slashdot Mirror


Visions Of The Future Of Grid Computing

CaptianGrid writes "Computing grids, or software engines that pool together and manage resources from isolated systems to form a new type of low-cost supercomputer, have finally come of age. BetaNews sat down with some of the world's leading grid gurus to discuss the significance of such distributed technologies and separate grid hype from grid reality."

28 of 145 comments (clear)

  1. Inspired by? by cakefool · · Score: 5, Funny

    Then more recently we have seen Univa being created, which I am involved as founder and advisor

    Univa
    Univac - a successor of Multivac, the largest computer in Asimovs world.

    Nerds - they get everywhere.

    1. Re:Inspired by? by kst · · Score: 4, Informative

      Um, no, Univac was a real 20th-century computer company (later merged into Unisys). Asimov probably named Multivac after Univac, not vice versa.

  2. X-Grid by qwertphobia · · Score: 5, Informative

    Check out Apple's X-Grid technology!

    It runs on any OSX system, 10.2.8 and up. Put your spare cycles to work.

    Xgrid: High Performance Computing for the Rest of Us

    --
    Never ask for directions from a two-headed tourist! -Big Bird
    1. Re:X-Grid by bpbond · · Score: 4, Informative

      Our research group (at UW-Madison) has been experimenting extensively with Xgrid, and right now (with the generally available version, the beta TP2) our conclusion is: not ready for prime time. Lots of promise, pairing Apple interface simplicity with powerful (and open source) underpinnings. But in its current form, there are too many bugs. You have to implement a lot of your own code and do a bunch of workarounds to really get a big, workable grid going.

      --
      "Science is a tribute to what we can know although we are fallible" -Jacob Bronowski
    2. Re:X-Grid by Red+Leader. · · Score: 3, Insightful

      What on earth are you bothering with X-grid for?

      You have the in-house developed Condor that's amazing!

    3. Re:X-Grid by bpbond · · Score: 4, Informative

      It's a good question. We bothered because

      (i) Curiosity. Xgrid was new, and looked interesting.
      (ii) Potential. Xgrid (final version) is going to be bundled with the upcoming 10.4 release of the Mac OS. That's an awful lot of machines that will have Xgrid preinstalled, with the user basically just needing to click "start."

      B

      --
      "Science is a tribute to what we can know although we are fallible" -Jacob Bronowski
  3. Commoditization of grid computing... by PornMaster · · Score: 5, Interesting

    The article mentions the commoditization of grid computing by adhering to a set of standards, but past a certain point, it makes little sense for IBM or Sun to make their tools interoperable... that makes their consulting value-add on top of grid resources they offer diminish.

    I think that for full standards compliance, you'll need to look to companies which don't offer their own computing resources -- platform-agnostic companies. But then who do you buy the compute resources from? Unless you're buying your own systems for use (which makes "utility computing" less viable), it's a bit of a catch-22.

  4. Plan 9 & Inferno by HyperChicken · · Score: 5, Informative

    If you want examples of operating systems that help with gridding, check out Plan 9 from Bell Labs and it's sister project Inferno. Nice thing about Inferno is that it runs on Linux, Windows, Mac OS X, Plan 9, and on native hardware.

    --
    Free of Flash! Free of Flash!
  5. Not the only way. by Eunuch · · Score: 5, Interesting

    There will never be a substitute for a single box with a lot of CPUs on it. For tightly coupled dataset the latency of a grid will be a limitation.

    --
    Transcend Humanity. Please.
    1. Re:Not the only way. by Rei · · Score: 3, Informative

      True; however, when you think about the big picture, the vast majority of real-world situations have at least *one* parallelism in them, and with even one parallelism, a grid usually makes sense.

      For example, let's say you're trying to determine the best FOO, and running a FOO is a highly serial process. Even though you can't split up running each FOO, you can pass the processing of each FOO test case that you want to run to a different machine.

      True - sometimes, you need the results of your previous run in order to plan your next run, and sometimes your *only* need is a single run of a very simple algorithm for which there can be absolutely no parallelization. However, more often than not, even algorithms that need tightly coupled data have at least *one* stage which can benefit from parallelization - and you really need only one to get the benefits.

      --
      "Lock and load, Brides of Christ!"
    2. Re:Not the only way. by ergo98 · · Score: 3, Interesting

      Don't forget that the overhead of a grid infrastructure - packaging the task up, transmitting it to another machine through a glacially slow network, unpackaging the task, performing the work, packaging the results, transmitting it to another machine, unpacakaging the results, coordinating in the results in some sort of orchestrator.

      Unless your task is signficantly computationally demanding, this overhead can significantly outweigh just doing the task directly, regardless of how parallel the task can be.

  6. Computing grids by Anonymous Coward · · Score: 4, Funny

    Computing grids, or software engines that pool together and manage resources?

    Pure Bolshevism, that's what!

  7. Re:Beowulf cluster by OECD · · Score: 4, Funny

    Actually, this is the great promise of grid computing. Once it becomes ubiquitous, people will stop trying to imagine Beowulf clusters!

    Of course, Soviet Russia broke up, and we still hear about that...

    --
    One man's -1 Flamebait is another man's +5 Funny.
  8. Echoes of Carly by Galuvian · · Score: 5, Funny
    From TFA:
    What we provide is primarily an implementation of Web services standards to allow people to build services, and the primary goal is also for us to provide a set of pre-defined services that allow you to use Web services protocols to interact to request the allocation of compute resources, the creation of computational services and moving the data from one place to another and so forth.

    Does this sound like Carly Fiorina attempting to explain HP's strategy to anyone else?

  9. Grid: loaded word by selectspec · · Score: 4, Informative

    The new generation of marketeers use Grid, but they rarely are refering to what computer science engineers refer to grid clustering. I think the marketeers talk about Grid when they really mean virtual Operating Systems running on abstracted hardware platforms: either a mainframe, or otherwise kick-ass multi-way system that has been virtually partioned, or something like vmware piecing together several x86 style servers.

    Frankly, I don't like the word Grid being applied in this way. However, the latter technology is facinating (virtual OS) and will come to dominate computing in the next few years.

    The basic idea is total abstraction of the application/service from hardware/location. The app gets the resources it needs, can be cloned/replicated to another location for distaster tolerance, and can scale and grow on demand based on needs by simply throwing more hardware modules at it. It's not just limited to computing but also applied to storage and network.

    --

    Someone you trust is one of us.

  10. Oh boy... by Duncan3 · · Score: 4, Interesting

    Look, the bottom line is there is nothing new here, just new sets of buzzwords. You have been able to submit massive computer jobs to IBM or Sun (with their insane $1/cpu-hour), or even most college campuses (the U of Minnesota had such systems) for the last 35 years. MPI/PVM standardized and commoditized the clustering side of things long ago.

    Globus is now "web services" and not "GRID". GRID is so last century. It's far more cool now that it's in Java too. Anyone still working on GRIDs should search/replace immediately!!!

    And did they drop the name of every single business partner they have in that article, or did only I notice that? ;)

    --
    - Adam L. Beberg - The Cosm Project - http://www.mithral.com/
  11. Blackout 2003 by Ced_Ex · · Score: 4, Insightful

    Hasn't the blackout taught us to move away from GRID type setups? If people just created their own power the blackout would have affected us less. Could this principle not be used for home computing? Rely on yourself and not on others?

    --
    Live forever, or die trying.
  12. Solution to Intel heat problems by Anonymous Coward · · Score: 3, Funny

    Just do all your computation in whatever hemisphere is in winter. They can use the heat.

  13. Grids eh! by igrigorik · · Score: 3, Interesting

    Only problem with this kind of setup is in fact it's limited ability to accomplish anything usefull to a consumer or a medium company. While, of course it is an interesting field, and one that needs to be researched, technology like proximity computing (SUN) is what will dictate the technology in the future. It's hard as it is to even get decent multiprocessor scheduling without too much overhead on a single pc, overhead incurred with grids would be enormous (I guess that's why the primary applications would be file storage etc.) Proximity computing on the other hand, is an innovative approach that doesnt try to solve a problem in place, but avoids it all together.

  14. What I want in a grid by argoff · · Score: 3, Interesting

    Is a combinataion of grid and virtualisation.

    Grid in the sense that if my datacenter needs more resources, I just plug in a blamk PC with extra CPU/MEM/Disk and not worry about it. Or if one goes bad, I just rip it out without worring about what it will destroy.

    Virtualisation in sanse that if I need an email server - I just create a virtual one on this grid and let it go, if I need a DNS server - I just create one on this grid and let it go, a web server ... same thing, ldap server same thing. If a server gets under load, it will automatically devote more memory/space/cpu/bandwidth to it as reasonable.

    That is my idea of a true grid.

  15. Great until everyone needs cycles at the same time by G4from128k · · Score: 3, Insightful

    Grids are great for non-time critical computations tasks. But what happens when everyone needs cycles now! My guess is that systems will evolve to give cycles to the highest bidder/highest priority. In such an environment, low-priority tasks will become effectively impossible on a grid - there will always be some higher-priority/higher-paying task that usurps the cycles.

    I wonder how long SETI@home will last if home PC users realize they can "sell" cycles to meet for-pay demand for computational power.

    --
    Two wrongs don't make a right, but three lefts do.
  16. HPC for very specialized problems, maybe. by Rhys · · Score: 5, Informative

    If you've got a problem that's trivially parallelizable, then sure grid computing is great! RC5, seti@home, and similar projects can benefit from grid computing (really, that's what grid computing is -- someone else's code able to run on your machine when it's idle and do work).

    However, don't even begin to think you'll be solving anything that requires any sort of processor to processor communication. Rocket simulation (our local favorite example here at UIUC) for instance is heavily communication based.

    The linpack benchmark that top500 uses also needs a low-latency interconnect to perform really well, so don't expect to see "the grid" sitting up at the #1 supercomputer slot on top500.org anytime soon (or really, ever, unless someone develops FTL networking). Latency on the internet in general (and specifically around the world thru all those switches and latest_slashdot_hot_chick_movie.torrent packets) is nothing near what a supercomputer needs.

    Now, there are research groups looking at ways of making communiation delays less of a problem, including the one I was in while I was in grad school. There's a number of ways to do it, but none of them I've seen are going to take on worldwide-network-latency and survive with their performance intact.

    Even something as "simple" as chess wants to have a fast interconnect - every node that's gotten stranded working on low-priority (bad move) work is a wasted node you may as well not have.

    --
    Slashdot Patriotism: We Support our Dupes!
    1. Re:HPC for very specialized problems, maybe. by magefile · · Score: 3, Insightful

      Maybe it's less efficient (as in your chess example), but the amount available may be able to compensate for it. If your grid is only 60% as efficient as your supercomputer, but you have three times the power (#'s pulled out of the air), then grid is still beneficial.

  17. Convergence of Grid and Virtualized LSB by NZheretic · · Score: 4, Interesting

    Take a pinch of Standard Linux
    Wrap it up in Xen
    Add a touch of SELinux
    And a little bitty bit of Globus
    Oh like a Sandboxed Platform
    Oh Lordy, Lordy, mixed with Free and Open Source Code
    You know you lump it all together
    And you got a recipe for a Multi Vendor Development scene
    It is coming though, you know, you know.

    What we have is a great big melting pot
    Big enough enough enough to take every vendor and all IT's got
    And keep it stirring for a hundred years or more
    And turn out Application Service and Content Providers by the score.

    With apologies to Blue Mink .

  18. Oh come on. by Moderation+abuser · · Score: 3, Informative

    "Grid" technology to do this stuff has been around for decades e.g. NQS, hell NASA gave away PBS in the 80s & 90s.

    The problem is that most of the CPUs out there run Windows, which is currently damned near useless for this kind of thing. It'll require a rewrite of the OS to take proper advantage of the potential of a network of windows boxes for general purpose computing. OTOH, a couple of shell scripts and SGE (http://gridengine.sunsource.net/) does the job on Linux and other Unix systems.

    --
    Government of the people, by corporate executives, for corporate profits.
  19. The network problem; somebody do the maths by zarathustra6625 · · Score: 5, Interesting

    What no one is mentioning is that these big cluster/grids that Sun/IBM are building to later sell over the network are dependant on the ratio between network speed and batch file sizes.

    EXAMPLE: IBM is currently offering CPU/Hour service in Houston to oil and gas companies. Sounds great till you realize the multi-terabyte files that consume such a massive compute service are too big to be readily sent over the network. Instead they use vans to haul tape and disk over to IBM and then run the process on it.

    What is the bandwith of a station wagon? Right now its faster than the internet on a 20 mile drive across Houston.

    But even take it a step further and the ratio remains. What if I wanted to pay Sun/hr for CPUs while I worked on a big Maya render of 200 gigs. By the time I've sent that over cable modem have I gained a ton in performance time?

    The problem I see is that we are making CPU massively parrallel but not networks. So will it EVER make sense to send a massive file to a commercial grid over a singular network connection.

    Somone should do the math.

  20. Re:Grid: loaded word by DeepRedux · · Score: 3, Insightful

    The use of the word "grid" here is in the sense of an electric power grid. The idea is that you should get computing power on demand, just like you get electic power on demand.

  21. Depends on how you utilize it. by jd · · Score: 3, Insightful
    Personally, the way I'd design a high-performance computer is to set up a grid of clusters. Keep things local to a cluster that involve high I/O throughput, and farm out to another cluster anything where the I/O is less important.


    The reason for such an arrangement is that high-speed interconnects are expensive. Building a single cluster that is uniformly very high performance would be horrible for anyone other than a very rich organization to consider.


    On the other hand, grids alone are way too slow to handle the needs of time-critical communication, which is what you have a lot of the time in parallel computing.


    A hybrid, able to place components of a problem according to that component's needs, would seem to be the logical solution. It is also the scalable solution. Clusters often have an upper limit in size. By having grids of clusters, you have a virtually infinite capacity. True, there simply aren't any clusters that have reached the upper limit. Yet. But it's getting tough at the size they are at right now.

    --
    It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)