Slashdot Mirror


Distributed Computing Economics

machaut writes "In a ClusterComputing.org article, Jim Gray, director of Microsoft's Bay Area Research Lab, provides an interesting economic analysis for building distributed systems. When do you choose a grid over a cluster or a supercomputer? When does it pay off to move a task to the data vs moving the data to the task? He takes current hardware and networking costs into account to answer those questions."

130 comments

  1. Warning: by pecosdave · · Score: 5, Funny

    Ungodly numbers of "Beo-Wolf" cluster jokes arriving now!

    --
    The preceding post was not a Slashvertisement.
    1. Re:Warning: by tuber · · Score: 5, Funny

      Well, there's the first one.

    2. Re:Warning: by Anonymous Coward · · Score: 0

      Imagine a beowulf cluster of "Beo-Wolf" cluster jo*BANG*!

      slumps over dead.

  2. Does that include electrical costs? by Anonymous Coward · · Score: 3, Insightful

    How much does it cost to keep hundreds of regular computers (with all their extra peripherals) crunching away vs. a specially designed computer/set of computers.

    1. Re:Does that include electrical costs? by TopShelf · · Score: 3, Interesting

      That shows how this analysis is done from the perspective of the party performing the investigation, as opposed to society as a whole. For instance, Seti@Home's costs in terms of user electrical, maintenance, etc. isn't considered here...

      --
      Stop by my site where I write about ERP systems & more
    2. Re:Does that include electrical costs? by yintercept · · Score: 4, Interesting

      The whole goal of distributed computing is to externalize costs. When someone else bears your cost...then, yeah, it's free. This is the idea behind P2P. P2P is significantly less efficient than specialized servers, but externalizes costs. In some cases, SETI type arrangements use real idle equipment. In other cases, it pushes real costs onto other unwitting groups.

      Your employer has to pay for the electricity if you leave a bunch of computers on at night to help calculate protein folds. It is not necessarily a bad thing...just that an unwitting party is bearing a cost. In many cases the cost is of little consequence...in some cases it is.

    3. Re:Does that include electrical costs? by cicadia · · Score: 2, Informative
      Didja read the article?

      SETI@Home harvested more than a million CPU years worth more than a billion dollars. It sent out a billion jobs of 1/2 MB each. This petabyte of network bandwidth cost about a million dollars. The SETI@Home peers donated a billion dollars of free CPU time and also donated 1012 watt-hours which is about 100M$ of electricity

      No, it doesn't include the value of user-performed maintenance, but as an economic analysis, it would be pretty negligent to not include the value of donated CPU-time and electricity.

      --
      Living better through chemicals
    4. Re:Does that include electrical costs? by Anonymous Coward · · Score: 0

      Most (if not all) people who donate time to SETI@home have a computer anyway, and don't incur any additional maintenance cost from running the client. There probably is a slight additioanl cost in electricity, and that could coneivably mean higher maintenance cost (eg. a fan might break because it's running for longer than intended), but it's negligible.

  3. Whassatnow? by Anonymous Coward · · Score: 5, Funny

    When do you choose a grid over a cluster or a supercomputer?

    When you have a really high-paying job where you are paid to make such decisions.

  4. s@h, et. al.. by i.r.id10t · · Score: 4, Interesting

    .. have already figured it out - let other willing users pay the power bill, bandwidth cost, etc. and crunch the data in their spare time. Seems to be working well for seti@home, etc.

    Of course, if you are working with sensitive data (military stuff, major trade secrets, etc.) your security/privacy needs will outweigh the costs involved with doing it all in house.

    --
    Don't blame me, I voted for Kodos
    1. Re:s@h, et. al.. by jason0000042 · · Score: 3, Insightful

      Also, as long as people are still allowed to decide what runs on their own computer you will have to convince them that they should help you with your distributed computing task.

      SETI@Home worked so well because people want to know the answer. People are interested in the results. If you tried to do a distributed apple browning application nobody would download it.

      --
      i don't like my old sig.
  5. Of course I imagine it helps a lot when it's free by DarthVeda · · Score: 0

    Just imagine how much SETI@Home would be out if they had to pay for all of that...

  6. hmm... by xao+gypsie · · Score: 3, Interesting

    interesting thought, but what is the difference between this and the age old concept of the cost/benefit relationship...? im not trolling, it seems that it is jsut that concept with a tech twist..

    --


    xao
    http://TheHillforum.hopto.org
    1. Re:hmm... by Anonymous Coward · · Score: 2

      Uhh, nothing. It IS a cost/benefit analysis.

    2. Re:hmm... by Mooncaller · · Score: 1

      Its is cost/benifit, but applied to a specific case.

    3. Re:hmm... by f97tosc · · Score: 1

      interesting thought, but what is the difference between this and the age old concept of the cost/benefit relationship...? im not trolling, it seems that it is jsut that concept with a tech twist.. br>
      Sure, it is just cost/benefit but that tech twist is what has been missing for years. Important IT decisions have been left to the preferences of IT people instead of conducting a cost/ benefit analysis for the company as a whole. I think this tech twist is very imporant, and we will absolutely see more of it in the future.

      Tor

  7. Re:Of course I imagine it helps a lot when it's fr by DarthVeda · · Score: 1, Insightful

    Oh I see somebody beat me to the Seti@home first... oh well... off to craking my measly 5 units a day.

  8. Interesting yet shallow economic analysis by Raindance · · Score: 5, Informative

    I was happy that Gray covered SETI@Home as I think the nature of SETI is akin to where certain aspects of distributed computing may go in the future. However, I argue that he left some some key parts of SETI economics at the door; most notably, data integrity and security. As I understand it, *over half* of SETI's processing power, bandwidth, and so forth is used to verify data integrity as it's using untrusted hosts to do it's calculations.

    This doesn't make SETI a poor supercomputer, but it does change the economics of it. An economic model of computing resources which accounts for work done by untrusted hosts as involving different overhead as that done by trusted hosts would be a much more useful metric to think in terms of.

    1. Re:Interesting yet shallow economic analysis by Sourdough · · Score: 1

      Also, in a grid environment where those executing code do not trust those buying cycles, there is more significant overhead because the farmed-out code must run in a sandbox. So when neither party trusts the other, you might reasonably expect to end up with a 200% reduction in speed, or worse.

      Further, there is a really fundamental bit not mentioned at all in the article -- the effect of network latency. Many parallel applications require frequent synchronization between nodes, and performance quickly becomes dominated by latency as it increases. That is why "real" supercomputers typically have interconnect latency on the order of a few microseconds. "Beowulfs" with commodity network gear as an interconnect have tens of microseconds. A widely distributed computation will necessarily have latencies of tens of thousands of microseconds (unless someone figures out how to change the speed of light).

  9. SETI@home by LordoftheFrings · · Score: 2, Insightful

    The articles states that SETI@home has a whopping 54 teraflops of computing power. This is an unfathomable number of cpu cycles, and guess what, it is alled used FOR FREE! This is a great example of how a community of users is willing to sacrifice something (unused cpu cycles and small amounts of bandwidth) to meet some great future goal (contact with extraterrestrials). Did I mention it is FREE? I wonder how much money researchers could save or use in a better fashion if they all used distributed computing instead of expensive (super)computers. Some already have done this, and I know that there are distributed efforts currently processing ways to fight cancer and aids.

    1. Re:SETI@home by flabbergast · · Score: 5, Informative

      The author points out: "The ideal mobile task is stateless (needs no database or database access), has a tiny network input and output, and has huge computational demand."

      "And of course, SETI@Home is a good example: it computes for 12 hours on half a megabyte of input."

      So, for projects that fit this model, then they should save money over supercomputers. But few projects fit this model, with the author mentioning web and data processing, data loading, CFD, ie anything that "generates a continuous and voluminous output stream" as economically unfeasible. So, car companies really do need those supercomputers to virtually crash their cars. =)

    2. Re:SETI@home by brian728s · · Score: 1

      Sky-net used 60 teraflops in t3. SETI is obviously waiting for aliens to contact them so they can increase the efficiency of their processing to squeeze an extra 6 teraflops out. Then they can release the terminators and wait for the government to give them enough money to buy the world from the machines and rule with an iron fist. CONSPIRACY!!!!!

  10. Summary... by Realistic_Dragon · · Score: 2, Insightful

    Greedy overcharging telcos make grid computing over the internet more expensive than traditional supercomputing, unless you can get people to pay for you (SETI).

    --
    Beep beep.
  11. The bad thing about distrubuted computing by fishynet · · Score: 4, Interesting

    All the work must(should) be double checked to make sure everything is correct

    --

    Cats: All your base are belong to us.
    Captain: Take off every sig !!
    1. Re:The bad thing about distrubuted computing by Anonymous Coward · · Score: 0

      How is this related to distributed computing? If you are really putting lives on the line, then all computations should be done twice: by two different teams, with two different development suites, on two different OS/s, on two different hardware platforms.

    2. Re:The bad thing about distrubuted computing by cicadia · · Score: 2, Insightful
      It's not necessarily a bad thing; it just means that distributed computing is better suited to some problems than others.

      There's a whole class of problems which can take a tremendously long time to solve, but for which the solution, once found, can be verified very quickly.

      The distributed.net key-cracking contest was like this -- you don't have to double check every piece of work because once you've found the key, it is trivial to test it to make sure it's right. The OGR project works the same way, and I suspect that SETI uses a similar model.

      If it was true that you had to double-check everything then there would be absolutely no benefit to distributed computing. You'd be better off just building a supercomputer and doing everything just once.

      --
      Living better through chemicals
    3. Re:The bad thing about distrubuted computing by ZorbaTHut · · Score: 1

      Only with untrusted computers, remember. I run a small distributed network at my workplace (the 30 CPUs in the office, about half of which are in the render farm) and don't bother to doublecheck work - if it's wrong, it's my code at fault, and doing it again won't change anything. Nobody's hacking my client for credit - there *isn't* any credit.

      --
      Breaking Into the Industry - A development log about starting a game studio.
    4. Re:The bad thing about distrubuted computing by chrisbtoo · · Score: 1

      The distributed.net key-cracking contest was like this -- you don't have to double check every piece of work because once you've found the key, it is trivial to test it to make sure it's right.

      Almost true. That covers the problem of a "false positive" (client says that this is the correct answer, check it to ensure that it is), but does nothing to counter the problem of a "false negative".

      What if I have "the" RC5-72 key in a block on my harddisk and, for some reason or other, I report that I didn't find the key in there? The rest of the network carries on cracking away for tens of years only to find that they exhausted the keyspace without finding anything - and they have to start again.

      --
      Registering accounts later than some other chrisb since 1997
  12. Connect the Microsoft dots by Rosco+P.+Coltrane · · Score: 2, Insightful

    Microsoft and IBM tout web services as a new computing model - Internet-scale distributed computing.
    They observe that the current Internet is designed for people interacting with computers. Traffic on the future Internet will be dominated by computer-to-computer interactions.


    And that explains why Microsoft has suddenly declared war on spam : they have to free bandwidth for their own .NET message passing. Remember folks, Microsoft never does anything without a reason, and certainly never does anything for the good of anybody else but themselves.

    --
    "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
    1. Re:Connect the Microsoft dots by GoofyBoy · · Score: 1

      >Remember folks, Microsoft never does anything without a reason, and certainly never does anything for the good of anybody else but themselves.

      And how is this different from you or I act?

      --
      The surprise isn't how often we make bad choices; the surprise is how seldom they defeat us.
    2. Re:Connect the Microsoft dots by Khakionion · · Score: 0
      Remember folks, Microsoft never does anything without a reason, and certainly never does anything for the good of anybody else but themselves.
      Excuse me? You don't just do things without a reason when you've got products to sell. And when was the last time you saw Red Hat touting Windows as superior? ALL companies do things for reasons, and for themselves. Take your mindless MS bashing elsewhere.
      --
      OMG! Wau!
    3. Re:Connect the Microsoft dots by Xentax · · Score: 2, Insightful

      I fine theory, but I disagree (and I have a nagging suspicion this is a troll, but either way, it bears answering).

      MS is getting on the anti-spam problem because it helps them, yes. But not for some theoretical future savings on bandwidth costs. They're doing it because taking an active role looks good to customers and investors, both of which are increasingly seeing spam as a real problem and not just something us techies talk about.

      Remember folks, publicly held corporations are *legally* driven by one thing, and ONLY one thing: Maximizing shareholder return.

      That they consequentially minimize other concerns, like the ecology, or customer satisfaction (when it stands in opposition to revenue/profit/market share/etc.), is hardly a surprising consequence. These issues are only worthwhile when the fallout of ignoring them costs more than it does to respect them (and to be seen to respect them). Just ask Best Buy about the economic value of good customer service.

      So, I agree to the extent that MS's motives for fighting spam are less than altruistic. But, as a public corporation, they have NO BUSINESS in benevolence; their shareholders could sue (and win) if they sacrificed shareholder value for some other purpose without very good reasons for doing so.

      Xentax

      --
      You shouldn't verb words.
    4. Re:Connect the Microsoft dots by Rosco+P.+Coltrane · · Score: 4, Insightful

      And how is this different from you or I act?

      I don't know for you, but I make GPL software, I give it away for free and therefore I give time and money to the community, partly to pursue a certain idea of the computer industry I desire.

      In a way, it's just like people who run the Seti@Home client : they don't do it just "to get a free screensaver" like that Microsoft guy narrowly thinks, they also do it because they want to feel part of a greater, more noble effort than just getting rich quick.

      When was the last time Microsoft gave anything open-source or for free that didn't serve one of their short, medium or long term plans ? I mean, it's okay, they're there to make money and they admit it, there's nothing wrong with this goal as long as they try to achieve it morally and legally, but why should it be the same for everybody ?

      --
      "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
    5. Re:Connect the Microsoft dots by Rosco+P.+Coltrane · · Score: 2, Insightful

      I have a nagging suspicion this is a troll

      FYI, I never troll.

      You may be right, they may do it to look good, or they may also do it to free bandwidth for their ever-increasing patches and to make .NET a viable product proposition, like I believe. But whatever the reason, I'm only saying they're doing it for a purpose, and people should cross-read declarations made by big corporation reps to find the motive behind their actions.

      --
      "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
    6. Re:Connect the Microsoft dots by yintercept · · Score: 3, Funny
      Traffic on the future Internet will be dominated by computer-to-computer interactions.

      This is already true. Most email traffic these day seems to be marketers talking to spam filters.

    7. Re:Connect the Microsoft dots by GoofyBoy · · Score: 1

      I was more thinking about how you do give away the code for free for a reason. You have a self-interest in giving the code away, you want the computer industry to be a certain way
      .
      Another way to think about the SETI people is that they are greedy/prideful that THEY could be the one who finds the signal that indicates ET is contacting us.

      Purely self-less acts are very very rare and so could we expect MS (or any corporation) to do so any more than we would ourselves?

      --
      The surprise isn't how often we make bad choices; the surprise is how seldom they defeat us.
    8. Re:Connect the Microsoft dots by Anonymous Coward · · Score: 0

      I give it away for free and therefore I give time and money to the community, partly to pursue a certain idea of the computer industry I desire.

      So you are doing it to benefit yourself. Yes, it benefits others as well but you are getting something out of it.

    9. Re:Connect the Microsoft dots by Xentax · · Score: 1

      people should cross-read declarations made by big corporation reps to find the motive behind their actions

      I couldn't agree more. In the end, it ALWAYS has to boil down to the bottom line, even if some of the line items in question are more intangible than others (like customer/consumer goodwill).

      Xentax

      --
      You shouldn't verb words.
  13. Re:FIX FILE ROLLER! by Anonymous Coward · · Score: 0

    Give us back our "extract to here" option NOW! You can shove your hig up micheal de incasa's ass! Fix split pane in nautlilus too!

    Quit yer bitchin' in the wrong place. The correct place would be in a gnome forum, not /.
    If it means that much to you, either learn to program, or pay someone to fix it for you.

  14. They should rename the lab by Anonymous Coward · · Score: 2, Funny

    to Microsoft Bay Area Research Facility

  15. Spoiler by 4of12 · · Score: 3, Informative

    .
    .
    .
    Conclusions

    Put the computation near the data.

    My own general take on all this is the Moore's Law for CPU/data costs vs time will beat the decrease in network latency costs vs time and we'll generally expect to see communications protocols become more "intelligent" to compensate up for the this barrier that cannot be overcome. BW will be relatively cheap, but the cost of building up and tearing down a connection will remain high enough to discourage multi-exchange handshaking (ie., UDP model vs TCP model).

    --
    "Provided by the management for your protection."
    1. Re:Spoiler by Alien+Being · · Score: 1

      "Put the computation near the data."

      I don't think it's that simple, at least not for a general purpose system.

      The Seti@home app doesn't care about net latency or bandwidth. Non-realtime Video encoding cares about bandwidth but not latency. Finite element analysis cares about both. Intelligent resource management and task classification will be very important.

      I suspect that as the the field develops, we'll see many existing NUMA techniques simply extended outside the box. The network really is the computer.

  16. Obligatory conspiracy theory by Anonymous Coward · · Score: 2, Insightful

    The last time I read an article from the Microsoft Research guys was in Communications of the ACM. The article was about media center computers (in the article, named Mbox) and digital consumer product consolidation/standardization. Of course there was no mention of Apple and just a brief acknowledgement of TiVo.

    As a strange coincidence, HP and others announced their media center PCs shortly afterwards, followed by Microsoft releasing XBox Live.

    Now the same Microsoft researcher is talking about grids and clusters? Expect a Microsoft cluster package soon...

    1. Re:Obligatory conspiracy theory by stiggle · · Score: 2, Interesting

      You mean the one that uses the spare processing power of the XBox live users to number crunch for Microsoft?

    2. Re:Obligatory conspiracy theory by steve_l · · Score: 3, Informative

      no doubt ... to date the Grid is very java centric. Now maybe .NET could deliver a speedup, but the nice thing about Java is (a) the latest 1.4.2 JREs use the PIII SSE and P4 SSE2 register sets for better float and double performance, and (b) you can put some serious unix servers in the grid for bonus speed.

      One thing Jim ignored is cost of software. Because MS effectively charge per-CPU for their system, you cannot afford to build a beowulf cluster on windows, let alone a full grid. So if MS do want to play in grid space, they need a way to price their platform so it makes economic sense. Didnt see that in the paper.

      (nb, MS do clustering already, it is just focused at DBs and big IIS installations, and it costs big numbers)

  17. Cool! by Anonymous Coward · · Score: 0

    Hey, I enjoyed that. Bash it if you will, but the breakdown analysis of the costs was quite interesting. It wasn't MS-fanboy propaganda.

    1. Re:Cool! by Anonymous Coward · · Score: 0

      No, it was Microsoft propaganda period. Reread the analysis, it's all wrong ...

    2. Re:Cool! by Anonymous Coward · · Score: 0

      Perhaps you'd care to elaborate, oh intelligent one?

    3. Re:Cool! by Mooncaller · · Score: 1

      It was informative reading even if it came from the mouth of the devil ( or, I should say, a minor demon).

  18. Strange math by Rosco+P.+Coltrane · · Score: 2, Interesting

    From the article :

    1 Mbps WAN link $100/month

    From this we conclude that one dollar equates to:
    $1=

    1 GB sent over the WAN


    Oh yeah ? 1 Mpbs for a month == 2678400Gb per month == 334800GB per month. 334800GB / 100 == 3.348TB

    From this I conclude that one dollar equates to 3.348 terabytes sent over the WAN.

    Gosh that was scary. I can restart xMule now ...

    --
    "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
    1. Re:Strange math by Leffe · · Score: 1

      Math can be scary at times -_-

    2. Re:Strange math by rekkanoryo · · Score: 1

      Interesting stats. Doing the math like that though can even make dialup seem impressive if you word it right...

    3. Re:Strange math by friendofafriend · · Score: 3, Insightful

      I'm sorry, I don't follow your maths here...
      There are 2678400 seconds in a month (assuming 31 days...), so that makes 2678400 Megabits transmittable in a month, or 334800 MegaBytes. Each of your $100 buys 3348 MB, which is 3.3 GB - same order of magnitude as the author suggests...
      Perhaps you meant 2678400Mb per month.

    4. Re:Strange math by Rosco+P.+Coltrane · · Score: 1

      [FOOT IN MOUTH]
      Ooops, Tera- Giga- Mega- blooper.
      Yes, I meant 3.3GB per dollar
      [/FOOT IN MOUTH]

      --
      "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
    5. Re:Strange math by cyb97 · · Score: 1

      I find your maths interesting... in short terms you write: 1 Mpbs == 3.348TB
      (I also guess that was Mbps), be careful of using to many =='s

  19. FedEx wasn't mentioned by bigpat · · Score: 4, Interesting

    Just last year we were discussing data transfer over the time it would take to overnight some data in a package, worked out that it was faster and wouldn't clog up our line to burn the DVDs and send them through an international package service vs send it over the T1s. I think with all but the largest businesses this is probably still true for larger (Gigabytes) amounts of data. Network costs are too high to be putting data far from where it is to be used. Whether CEOs realize it or not, this has a great effect on the ways businesses with multiple locations structure their company and work together.

    1. Re:FedEx wasn't mentioned by Anonymous Coward · · Score: 1, Funny

      Never underestimate the bandwidth of a station wagon full of CDROMs.

    2. Re:FedEx wasn't mentioned by silas_moeckel · · Score: 1

      FedEx is more cost effective if latency and manpower are not issues. Though if your still using T1's for data your probably dont have much in the way of data transfer needs T1's are horidly expensive compared to there carring capacity. Often for 10-20 times the cost of a T1 I can get an OC3 link thats have 100 times the carring capacity only usefull if you need the move data all the time. There is also the use of Satalites they can get OC3 speeds at a cost of about 500 an hour assuming all the office are visiable under one footprint (domestic). Very rarely will it be cheaper to have remote data father out than a MAN setting.

      --
      No sir I dont like it.
    3. Re:FedEx wasn't mentioned by Anonymous Coward · · Score: 0
      Never underestimate the bandwidth of a station wagon full of CDROMs.

      But the latency really sucks.

    4. Re:FedEx wasn't mentioned by sql*kitten · · Score: 1

      burn the DVDs and send them through an international package service vs send it over the T1s. I think with all but the largest businesses this is probably still true for larger (Gigabytes) amounts of data.

      Yes, I believe that's how SETI@Home gets the raw data from the radio telescope to the data centre, they write it to tape then fly it.

    5. Re:FedEx wasn't mentioned by Anonymous Coward · · Score: 0

      plus, you'd use securicore if you were running this as a grid service. Oh, and you'd need loads of cd drives at the customer end if you used cds, use 50GB tapes instead. AND use entire tapes for error correction.

    6. Re:FedEx wasn't mentioned by bigpat · · Score: 1

      "Often for 10-20 times the cost of a T1 I can get an OC3 link thats have 100 times the carring capacity only usefull if you need the move data all the time."

      Good if you can share costs of an OC-3 with 10 to 20 other companies. Maybe when the office space market heats up a bit, then more buildings will just start providing this service to their tennants. Wonder if the idea of sharing anything would fly in a board room.

    7. Re:FedEx wasn't mentioned by silas_moeckel · · Score: 1

      Buildings have been providing services like this to the internet for years now. I worked with a company back in the mid 90's that braught a DS3 into a building then sold it off to tennants handing them off ethernet and managed firewalls. Even at a DS3 that costs about 10 times as much as a T1 and has 30 times to carring capacity it's a decent return.

      --
      No sir I dont like it.
    8. Re:FedEx wasn't mentioned by bigpat · · Score: 1

      "Buildings have been providing services like this to the internet for years now"

      It is is the "10 times as much" part of what you are saying that I think you understate the importance of.

  20. point of diminishing returns by mechaZardoz · · Score: 1

    Yes, there are plenty of systems available for distributed computing tasks.

    Yes, there are plenty of free CPU cycles.

    But bear in mind that the rate of growth of entrants to the Internet is not growing exponentially as it once was; if anything, current trends are flattening.

    Whereas, if both science and industry fully embrace this mode of problem solving in the next few years, one has to wonder how many aps will it take to render it ineffectual? (1,000? 10,000?) or will we be able to go to this well forever?

  21. One dollar. by Anonymous Coward · · Score: 5, Funny

    Wow, what a world. $1 will now buy:
    1 GB sent over the WAN
    10 Tops tera-CPU instructions
    8 hours of cpu time
    1 GB disk space
    10 M database accesses
    10 TB of disk bandwidth
    1 large beverage
    1 of everything in the $1 store
    1 unlimited phonecall from some 10-10-### phone company.
    5 packets of cool aid
    10 packets of generic cool aid
    2 cans of coke

    When I was a child, data was expensive, and food was cheap...

    1. Re:One dollar. by Leffe · · Score: 1

      More data == I can feed my robot children.

    2. Re:One dollar. by hazem · · Score: 1

      Don't forget $1 = 6 packages of Top Ramen
      and $1 = 10 packages of cheap ramen

  22. "Put the computation near the data" by heironymouscoward · · Score: 3, Interesting

    This is an old maxim of design of any multi-tiered system. The reason is this: computation is largely about selecting and filtering data, before sending the results on to further tiers. This selection and filtering process requires many times more bandwidth towards the data source than it does towards the client layers.
    This only stops being true when there is no significant data, i.e. when the computation creates the data, as in the author's examples of render farms.

    --
    Ceci n'est pas une signature
  23. ROTFLMAO hah by Anonymous Coward · · Score: 0

    You are so funny!!!!!! hahahahha!

    1. Re:ROTFLMAO hah by Anonymous Coward · · Score: 0

      Actually I found it an okay joke.

  24. Re:oh really? by Khakionion · · Score: 0

    when when will this affect you in some way? No harm, no foul.

    --
    OMG! Wau!
  25. Anybody tried it? by LinuxParanoid · · Score: 5, Interesting

    It's a nice piece of analysis. Someone could have done it 8 years ago when Java came out; the facts are not significantly different (The values are different of course but the ratios involved are pretty similar. I did some thinking along these lines back then, and then in 2000 when considering working for a "hot P2P company" that an old acquaintance of mine was running.)

    My thinking went something like this: There are only a few, "niche" applications which need more compute power and which people pay for (distributed rendering, CFD, FEA, maybe a couple others). Maybe you could build that into a 10-30 million dollar business if you overcame a zillion obstacles but it didn't look like a billion or multi-billion dollar business. The applications for which people buy beefy servers, and which have a monetary payback, are mostly database applications. For those, you need to move the entire database near to the number-crunching PC, and that's not really feasible due to the cost of transporting Gigabytes of data or the unlikelihood that the PC's hard disk can store all the giga/terabytes of information potentially relevant for the computation. Not to mention the security problem.

    And Jim Gray's analysis lays out in more precise economic terms why it doesn't make sense. I like how he even calculated the relative merits of a Beowulf-like cluster of PCs versus P2P which I never really analyzed (I lumped them together as basically similar.)

    That said, has anybody even made a stab at designing or implementing a relational database with a P2P architecture? I know that there's Oracle Cluster Server, but I'm thinking of something more low-end and more distributed.

    --LP

    1. Re:Anybody tried it? by gillbates · · Score: 3, Interesting
      That said, has anybody even made a stab at designing or implementing a relational database with a P2P architecture

      Actually, I'm working on something similar for a customer of mine. The real challenge lies in the solutions to the following problems:

      • What happens when a node 'disappears' from the network? The traditional approach is to use a redundant backup, but doing so increases bandwidth usage. By the time you get around to a triple-redundant system, you're effectively working with only a third of your network's bandwidth.
      • As the number of nodes grows, the likelihood that two or more machines will need data not cached locally increases. As the system grows, there comes a point at which the entire system effectively becomes constrained by the slowest responding machine in the entire network. (For example, if I have 2 machines, half of the database will reside on the other machine, meaning that the local cache contains half of the result set, and half travels over the network. However, if I have 10 machines, each machine has only 1/10 of the database, meaning that 90% of the result set must travel over the network. To make matters worse, if each machine is a client as well, then in a 10 machine cluster, the average client will spend 9/10 database io cycles fulfilling requests for other clients, and only 1/10 io cycles performing its own queries.)
      At this point, I'm literally betting my career on solving the above problems. Network bandwidth is the real constraint; I'm currently working on ways to reduce the amount I have to send over the network. (for example, I'm considering an adaptive-locking strategy where a record would be marked for update on the remote server, but the transactions would be 'bundled' and sent across the network in aggregate to reduce network latency.
      --
      The society for a thought-free internet welcomes you.
    2. Re:Anybody tried it? by JamieF · · Score: 2, Insightful

      If you look at the way data works in a cluster, it's pretty clear that spreading it across a big slow network is a bad idea.

      In a DBMS, if all accesses are reads, you basically can just cache the data in every node of the cluster and it's ultra fast. If it's a lot of data, partition it across a large number of machines so that they each cache a subset of the whole database, and direct client hits to the appropriate node.

      The problem comes when you change something in an ACID compliant DBMS - you have to write to a transaction log on disk, then change the block in memory and write it to disk, then to the transaction log on disk again saying the transaction was successful. In a cluster it's worse, because you also have to tell all the other caches that something changed. Maybe you just say "invalidate the object with ID xxxx", maybe you tell it to flush all objects of that type, or maybe you send it the new one. You first get a lock on the row in the DB, then you change it (multiple physical disk writes), then you unlock the row and tell all the caches that they're out of sync.
      Multiply this times several nodes and N transactions/second and all of a sudden you're talking about serious bandwidth. If you can get multicasting going then the bottlneck becomes the write performance of the DB (all nodes still bottleneck on the ability to write transactions to the transaction log on disk), but the database iteself can be partitioned, so you can go really far with this if you implement it correctly, and if your data happens to partition nicely like that.

      So, for something like the gene stuff that was mentioned in the article, it's not unreasonable to think that a user might someday have the whole DB on his/her hard disk. But running a clustered RDBMS over DSL with lots of writes and cache management traffic would be uuuugly. I think that's why it hasn't really been done as a general case.

      For another example, look at SMP vs NUMA architectures. Inter-CPU bandwidth in SMP is critical because cache coherency has to be maintained; in NUMA performance is uneven depending on where the data is but it's still fast. Some problems need access to the whole data set, and others can be easily partitioned.

    3. Re:Anybody tried it? by Skjellifetti · · Score: 1

      Take a look at Mike Stonebraker's work.

    4. Re:Anybody tried it? by Anonymous Coward · · Score: 1, Informative

      No.

      But in principle, the reason you want to go for a P2P based DBMS is not really scalability, which we can do today with 'shared nothing' clusters (lots of Motherboards + local disk connected by Ethernet, rather than 'shared disk' clusters which are lots of diskless Motherboards connected to a SAN/NAS over the network). Rather, its system availability.

      With shared disk clusters you have a central point of failure/synchronization: the disk (or the disk controller). Today's shared nothing DBMSs all adopt a system model which says that 'if one of us is down we are all down'.

      P2P constitutes an architectural model for having a go at problems like scheduled down-time. ie. Let's not take the system down while we upgrade the software/hardware.

    5. Re:Anybody tried it? by gbjbaanb · · Score: 1

      Its quite simple really. consider the 2 db scenario (as more nodes are just more-of-the-same).

      You hold the entire db on each server, all writes are performed to both servers, reads take place on either of them. That doesn't really reduce the network traffic as you still are spending as much as you would with a 1-db system, but now you have reundancy and faster processing (ie reads).

      You hold the entire database spread over the 2 nodes, in which case you have problems if 1 node fails. In this case you really would like to make reads occur on the db where the data is kept, however, you can only do this if the data is partitioned (ie records A-N on one, M-Z on the other), as otherwise a read transaction won't know if the other server holds the required data or if there is no data. (ie. get all records beginning with 'Z', maybe there are no Z records, maybe they're all on the other server).

      If you hold 'pointers' to the records on the other server, you have the problem of updating both servers - once with the data, and then another network hit with a 'the data is stored here' transaction. Those additional transactions can add up to more traffic, especailly as the number of nodes increases.

      IMHO, the only way to reduce network traffic is to hold local caches. This is a problem for write transactions, but each read transaction can hit the server for a 'is my local copy still correct', in which case you may get greatly reduced network traffic, but only if each transaction is for a significantly large amount of data (ie larger than the results of the cache-correct transaction).

      Incidentally, if you bundle transactions to send over the network to maintain data integrity on the servers, you are incraesing latency - the first transaction's data will take a lot longer to travel to its destination if it has to wait for other transactions to complete first. (waiting for a full bus), so unless the amount of data in a transaction is significantly larger than the cache-control transactions, you might as well send it directly.

      Your best bet is to compress the data on the fly, or to connect your network as a 'tree', so data sent to 2 servers can be sent as 1 hit, and is then split closer to those servers. (ie. client A sends to switch/proxy B which sends to servers C and D)

      In the end, its a big trade-off between data size, latency, and data integrity. In other words - depends on what exactly you're trying to do :-)

    6. Re:Anybody tried it? by gillbates · · Score: 1
      You hold the entire db on each server...

      Which is the strategy I'm employing right now. I would, however, like to be able to use this same db design for situations in which the entire db is too big to fit on a single server. I really don't see any point in writing another database if it doesn't overcome any significant limitations of the current offerings. To date, no one has come up with an effective distributed database, and hopefully I'll be able to change that.

      --
      The society for a thought-free internet welcomes you.
    7. Re:Anybody tried it? by gbjbaanb · · Score: 1

      good luck, but please bear in mind that the reason no-one has come up with an effective distributed DB is not through lack of customers, or trying, or research.

      You should remind yourself what problems you're trying to overcome - is it large database, faster processing, or faster networking? Each of those 3 requires a different solution.

      You could say the internet is a distributed database though, and Google is its index.

  26. That's 'cause you are a retard. Put on your helmet by Anonymous Coward · · Score: 0
  27. YUO = SNIVELLING SLASHBOT!!!1 M$ $UX0RZZ!!!11 HAHA by Anonymous Coward · · Score: 0

    Of course you would find it an okay joke, you're too busy sucking off Taco to know what good humor is. Buy a clue, slashbot.

  28. Attack on IBM? by Jacco+de+Leeuw · · Score: 2, Interesting
    Some companies, notably IBM, Salesforce.com, Oracle.com and others are touting outsourcing, or "On Demand Computing," as an innovative way to reduce costs. There are some successes, but many more failures.

    The recurrent theme of this analysis is that "On Demand" computing is only economical for very CPU-intensive (100,000 instructions per byte or a CPU-day per gigabyte of network traffic) applications.

    This must be considered an attack on IBM's fairly visible On-Demand Computing campaign.

    Beowulf clusters have completely different networking economics. [...] That is why rendering farms and BLAST search engines are routinely built using Beowulf clusters.

    This reminds me of those Microsoft-funded TCO reports. They concede that Linux has cost advantages in a very specific field (webhosting; Beowulf clusters), because anyone intuitively knows it's true. For all the rest: use Microsoft stuff. That's what they are saying.

    --
    -------
    Warning: Slashdot may contain traces of nuts.
    1. Re:Attack on IBM? by pcause · · Score: 2, Insightful

      No it isn't an attack on IBM or anyone else. Grey knows what he is talking about and his analysis is just fine. We all need to get past marketing hype and commercials and excitement about "the next big thing" and look at the reality of the numbers. The issue is: are we close to having the infrastructure for generalized "on demand computing"? Grey explains it so what anyone can understand the tradeoffs. Even your CFO, which is the key!

      It is a great article/analysis. Believe it and ignore the hype. SOme day the hype may be true, and Grey even explains how to figure that ut. But for now, only specialized applications are suitable candidates.

    2. Re:Attack on IBM? by sirwired · · Score: 1

      I believe he mis-understands what "on-demand" is all about. He is interpreting it as merely an "enterprise" version of SETI. "On-demand" involves a number of things, including instant provisioning of massive amounts of storage out of a central pool (in the same or multiple locations,depending on requirements), instant steps in CPU and I/O bandwidth, additional server nodes, etc. The SETI style of distributed computing indeed has very limited applications. Being able to quickly increase the size of your IT backend is a bit more useful, and profitable.

      SirWired

  29. DONT LET SPELING STOP YOU FROM READING THE ARTICLE by Anonymous Coward · · Score: 0
  30. moron cyphering the cost of being in the garmeNT by Anonymous Coward · · Score: 0

    disstricked.

    well, it's definitely going to cost a lot more to use the infactdead payper liesense bugwear distributed buy the felonious kingdumb of Godless stock markup FraUDs, than to do it right, but that's how you learn.

  31. I think someones feelings are hurt. by pecosdave · · Score: 1

    I'm sorry if I lessened the impact of your Beo-Wolf cluster joke by posting mine. I really am.

    Hey everybody, I know of an A/C that needs a group hug!

    --
    The preceding post was not a Slashvertisement.
    1. Re:I think someones feelings are hurt. by Anonymous Coward · · Score: 0

      Hint: Learn to spell beowulf.

  32. Another data point by Alomex · · Score: 4, Insightful

    A few years back when Grid computing was all the rage we sat down with some investment partners and worked out the figures. We came pretty much to the same conclusion. The "average" commercial supercomputing application (pharma, oil drilling, simulation) would not benefit from "free" cycles on the network.

    Essentially, any commercial computation valuable enough to require that amount of effort can justify purchasing a hundred thousand node beowulf cluster and run locally. The reduction in network costs, the advantages of total control and tight security more than pay for the difference in computing cost.

    Non-commercial computations such as SETI will benefit from grid computing, and we expect to see more efforts long those lines (RSA, Mersenne, Stanford DNA). But remember, we were thinking about starting a business, and none of those pay for the services, so we moved on.

  33. Jim, the Internet Bubble has burst by mkc · · Score: 2, Insightful

    When SETI@Home spent $10^6 to get everyone to spend $10^8 on electricity alone, how was that a good deal? Have extraterrestrials sent a message that they're about to touch down with a vaccine for AIDS, a formula for cold fusion, a permanent end to unemployment, a sure-fire way to get good representation in government? Could we have spent the money more wisely, Jim?

    If Bill paid you folks to do something more than get technically-challenged investors excited, perhaps our software would work better. (And ASN.1 isn't that bad, by the way. Do I need everything going between my machine and the server to be verbose enough to read by hand? When I encode all my messages in XML, in how many cases will I miss the 10000 to 1 ratio just because the encoding is verbose?)

  34. SETI is Extrememely Costly by yintercept · · Score: 5, Funny

    We only look at the cost of SETI from our perspective here on earth...but if you ever consider the enormous cost space aliens have to incur to make their secret communications appear as background noise, then I think more people would oppose the project.

  35. Re:moron cyphering the cost of being in the garmeN by Anonymous Coward · · Score: 0

    EWe mushed bee ae vharry strAYng pers-on

  36. ,psssst by Anonymous Coward · · Score: 0

    when trolling always leave the main word misspelled. It helps to bring in comments.

  37. So, we're in a Helicopter eh? by croftj · · Score: 2, Funny

    His reasoning sounds good, but what the hey? It sounds pretty obvious that the most cost effective approach is to keep the data close to the CPU doing the crunching.

    --
    -- Many men would appreciate a woman's mind more if they could fondle it
  38. Ethiopean Free Software Developers by Beliskner · · Score: 1
    Computing costs hundreds of billions of dollars per year.
    IBM, HP, Dell, Unisys, NEC, and Sun each sell billions of dollars of computers each year. Software companies like Microsoft, IBM, Oracle, and Computer Associates sell billions of dollars of software per year. Computing is obviously not free
    So what happens to free software? Looks like either the money stays in the corporations by them creating software, paying developers, and billing clients; or the money is in the populous' Bank account so the free software developers create software and buy food out of their own money earned by other means. This implies that it's impossible to have full-time free software developers, as especially in the anti-Socialist United States, people that don't earn money for their work starve to death and get no healthcare as a rule. Where does Linus get his food? Can free software ever have a permanent core developer group that doesn't starve to death? I'm quite concerned - the best software like QNX has had the same full-time specialised core developers for many decades, can Linux only ever be a mishmash of half-baked ideas?
    --
    A caveman dreams of being us, the incalculable power and riches. We dream of being Q, then what?
  39. Read between the lines by t482 · · Score: 2, Funny

    Total Cost of Ownership (TCO) is more than a trillion dollars per year.
    Operations costs far exceed capital costs. Hardware and software are minor parts of the total cost of ownership.
    -- microsoft software is cheap so you should keep buying it. Even if administering it is expensive.

    Megaservices like Yahoo!, Google,et all have relatively low operations staff costs.
    -- Open source if managed properly doesn't need many people. But this formula can't be applied to the propreitary software shit you buy.

    Most applications do not benefit from megaservice economies of scale
    --Most microsoft products. We will still take our chunk of flesh no matter what.

    Outsourcing has often proved to be a shell game - moving costs from one place to another.
    --having a third party vendor manage your Microsoft software for you isn't going to save you much.

    Web services reduce the costs of publishing and receiving information.
    --But you will need a huge support staff to manage it plus lots of licenses. (see above)

    Most Web and data processing applications are network or state intensive and are not economically viable as mobile applications.
    --especially once the MS licensing is thrown in.

  40. Why SETI@Home works... by gatkinso · · Score: 2, Informative

    ...because people like me are willing to donate their computers time and a part of their [and their employers, hawhaw] electric bill.

    I do so because I am interested in the project... not because I feel like I want to help cut someone's computing cost. If SAH was a for profit enterprise my interest would quickly evaporate.

    --
    I am very small, utmostly microscopic.
  41. What's all the fuss? by gooddope · · Score: 1

    If it can't be done on an abacus by an infinite number of monkeys, then it can't be done. It is the simple monkey principal, similar to the duct tape principal.

  42. true, but an explanation is in order. by Vellmont · · Score: 4, Insightful

    I think it's probbably safer to say that seti@home has a huge surplus of computational power, and uses it to verify each result (though it's not strictly necessary to do so). With only one data source (Aerecibo) you can only produce data so quickly, and once you have enough computational power to do the analysis in real time any extra is just surplus that can be used to verify. They did, however later add some extra analysis to the data to take better advantage of the huge surplus of computing power they have.

    The important point though, is that for seti@home each individual workunit, while important isn't critical to the whole project. If a small percentage of workunits aren't computed perfectly it's not catastrophic. In other words there's a certain amount of tolerance for innacuracy. For a project like the OGR (Optimum Golomb Ruler) by distributed.net each workunit must be calculated perfectly, as the goal is to prove which ruler is the optimum one. If workunit isn't verified you haven't really proven anything, since it's possible (and probbably likely) that hardware failure produced an innaccurate result somewhere in the millions of workunits calculated. (Or perhaps a modified client produced innacurate results). Other distributed computing tasks have different amounts of tolerance for innacurate results.

    Your underlying point is a good one though. For some projects the need for integrity of the results is very high, so larger computing power may be necessary to verify each result.

    --
    AccountKiller
  43. What if you got to eat the results??? DELICIOUS! by moogla · · Score: 0

    n/t

    --
    Black holes are where the Matrix raised SIGFPE
  44. He doesn't do it for himself by Anonymous Coward · · Score: 0

    He does it for J.D. "Boss" Hogg!

  45. EphEN BETTAR AND MOR3 CVELER!!!!111 by moogla · · Score: 1

    Is to put crazy shit in the title like you are a 11 year old boy!!!

    --
    Black holes are where the Matrix raised SIGFPE
  46. Judgement Day?? by RugbyHoe · · Score: 1

    ...and SkyNet became self-aware (again) at midnight July 3, 2003...

  47. All this talk about SETI... by warriorpostman · · Score: 3, Informative

    ...but, there's other programs that people might find more socially useful/productive than SETI.

    How 'bout...this from United Devices? They do a variety of biologically related projects, the most popular one, as far as I can tell, being cancer research...I've been running it for almost 2 years, and I have 100,000 points...how many points do you have?

    1. Re:All this talk about SETI... by Anonymous Coward · · Score: 0

      We will probably find ET and colonize Mars before we cure cancer. Like the March of Dime or the feed Africans for pennies a day movements, the "cure cancer with your screensaver" project is more about "awareness raising" and advertising for United Devices than anything else.

  48. Google by The+Creator · · Score: 3, Interesting
    For example, in 2002 Google had an operations staff of 25 who managed its two petabyte database and 10,000 servers spread across several sites.


    And what OS where they using? :)

    --

    FRA: STFU GTFO
  49. This is horribly written by Anonymous Coward · · Score: 0

    Clearly, this is an adaptation of a powerpoint presentation by someone who thinks in powerpoint. Why is the first sentence of every paragraph bold ? Why don't any of the paragraphs lead from one to another ? I think it's because each bold sentence used to be a bullet on a powerpoint presentation, and the paragraph was adapted from his notes on the presentation.

    It reads like a bad sales pitch for a sucky mutual fund.

    The only part that caught my eye:

    "If telecom prices drop faster than Moore's law, the analysis fails. If telecom prices drop slower than Moore's law, the analysis becomes stronger. Most of the argument in this paper pivots on the relatively high price of telecommunications. Over the last 40 years telecom prices have fallen much more slowly than any other information technology. If this situation changed, it could completely alter the arguments here. But there is no obvious sign of that occurring."

    Everyone knows that telecom == fraud. If Grande Communications can offer telephone land line for $8.50 a month, even as they are laying all new infrastructure, then how the hell does SBC and Verizon get away with charging $40 a month for an infrastructure that is already there and has been paid of decades ago ? ( http://www.grandecom.com/ProductsServices/phone_lo cal_rates.jsp )

    It is possible that Jim Gray's suggestion that there is "no obvious sign" of a telecommunications price collapse is too pessimistic. With a tight economy and slow but sure growth in consumer choice, the telecommunications giants could still collapse. It would be nice to see SBC completely implode, and those 300,000 parasites it employs be replaces with dozen companies of a few hundred employees each. The efficiency the economy would gain by not paying Ed Whitacre 90 million dollars a year, and savings of several hundred dollars a year by every household, would offset the burst of unemployed paper pushers.

  50. in other words by rifftide · · Score: 1

    Massive grid computing currently isn't economical for crunching the daily workload of insurance claims... not that it ever occurred to any of us that it would be, but it's nice to hear it from the world's leading expert on TP.

  51. Self-employment by swb · · Score: 2, Interesting

    What if it became a self-employment option?

    In other words, my home office, instead of being a revenue-sucking hole filled with computers was instead a source of distributed computing power I could sell on an ad-hoc basis? I eat the tab for the upkeep and get paid cash per work unit I'm able to get done.

    1. Re:Self-employment by bobbozzo · · Score: 1

      ISTM that once enough companies start offering grid management services that the price paid to the end-users (the node owners) will come way down and then will probably not be enough to cover electricity for the computers & A/C.

      --
      Nothing to see here; Move along.
  52. only half of the equation by pfisher42 · · Score: 2, Interesting

    This is a good analysis of the hardware side of the cost/benefit analysis of distributed computing, but that's nowhere near the full story.

    For example, per the thesis of this article (that network communication is the largest expense of distributed computing), the Salesforce.com model isn't valid. Yet they're a great success story of computational outsourcing. Huh?

    The key, I think, is that outsourcing eliminates distractions, and gets your employees back to working at your company's core competency. You're good at selling widgets, so you let Salesforce.com be your HR department.

    This sort of analysis is useful for hardware, and could also be applied to meatware resources. It's the same essence of the TCO argument from Microsoft which, I think, does have validity.

    When you bill your time at $50 an hour, you'll gladly pay $300 for an operating system rather than use a free one that takes 4 hours to install, 10 hours to learn command line computerese you never wanted to know anyway and many hours dealing with conversion and compatibility hassles.

  53. Turing award by LCamel · · Score: 1

    It would be better if the post also mention that Jim Gray is a Turing award owner.

  54. Opportunity Cost... by NotQuiteReal · · Score: 1
    One cost of distributed computing is picking the wrong donation...

    E.g. If you own a computer, you are most likely a person. People get old and die. You should do medical distributed computing projects since you might benefit from the findings. Looking for aliens is cool and all, but not much in the way of practical application (unless they are beaming encoded drawings of really cool devices).

    On the third hand, if you are a stranded "traveler" (not the Irish kind)... Maybe puting out a "beam me up" signal will be of use.

    --
    This issue is a bit more complicated than you think.
  55. Only criminals would have... by GrodinTierce · · Score: 1
    a Beowulf-class MPI job that simulates crack propagation in a mechanical object
    Arrest them at once!

    Tierce
    --


    Tierce
    Who sponsors your feelings?
  56. Why a grid? Because you're insane. by Anonymous Coward · · Score: 0
    When do you choose a grid over a cluster or a supercomputer?

    When you've been giving lots of funding to do a grid because it's a neat buzzword.

    When you're tired of getting lots of real work done and would rather spend all of your time debugging the dozen layer system, none of which do useful error reporting.

    When you've decided that dealing with multiple sites, each with different requirements to get a login and different requirements for accepted certificate authorities, is fun.

    When you've decided that you love working with alpha quality software that is unsupported because the middleware providers are in the middle of their sixth major rewrite in three years "which will fix all of the problems," but will probably just reintroduce old bugs that you debugged for them and sent them patches for.

    When you have too much time on your hands, so you want to reimplement a front end, because the middleware providers won't provide it any no one else provides a useful front end. Of course, said front end needs to provide resource allocation and reliability, because the middleware people decided to just hand that off to end users.

    When you've got a horde of bored sysadmins with nothing better to do that clean up the "droppings" left behind on disk by the middleware, which regularlly leaves behind unused files but is completely unable to tell you which files are unused.

    I'm working on multiple grid projects, some of which have been running for several years at this point. "The Grid" is a filthy lie. It only works if you control all of the machines, and if you control all of the machines, just set up a cluster, you'll be happier.

  57. Extending the effective life of old hardware by SgtChaireBourne · · Score: 1
    Schools and libraries often have old hardware as a result of low hardware and software budgets. It would be interesting to see how much more kick could be gained from the old hardware by setting up network of QNX machines or other kernel/microkernel with distributed processing capabilities. Many libraries and schools are already familiar with various F/OSS tools like KDE+Mozilla+OpenOffice & co., so swapping out the [micro-]kernel would be unnoticed by the user, except for a possible improved performance.

    In general, office parks, libraries, class rooms, and computer labs have a lot of idle time that could be exploited. Generally in cases like these the power is on anyway during certain hours and maintenance is covered anyway, so distributed computing might be used for many things.

    --
    Beta is broken and the link to classic doesn't work. Stop wasting our time or there won't be anybody left here.
  58. If the mountain won't come to M... by GridGroova · · Score: 1

    This pidgeon-holing of distributed computing (ie. Grid) technologies is flawed thinking - and it would be a worry if IBM and Microsoft have built their "the future is web-services" strategies solely on this type of analysis. What the analysis forgets is that you have to get the data INTO the database before you can compute with it. For many processing tasks, it makes more sense to send the bulk of processing to the data rather than the other way round. The barriers to achieving this aren't in the economies, they're in the architecture of current databases. If you extend this thinking to data in a more general sense you'd have to conclude that file sharing networks aren't efficient ... and neither is the world wide web.

  59. Re:Connect the Microsoft .net bs by ratfynk · · Score: 1
    well put but? How the hell will they sell their incredible online gaming software to servers that can't hop around quick enough because of spam traffic. All bets are off if the band width demands of online gamers+spam reach what are the predicted levels.

    The game console sales are way higher than the online percentage of consol owners. If the MS gamers .net heaven vision does happen without a corresponding drop in spam traffic the good old /. effect will be meaningless. No one except perhaps Taco will get on.

    --
    OH THE SHAME I fell off the wagon and use sigs again!