Slashdot Mirror


Towards an Internet-Scale Operating System

gschoder writes: "Two Berkeley computer scientists (including David P. Anderson of SETI@home) envision an Internet-scale operating system to harness the processing power, networking efficiency, and storage capacity of everyone's computers. Scientific American has their proposal."

23 of 303 comments (clear)

  1. Why buy a computer? by JohnBE · · Score: 1, Interesting

    Won't people just use the minimum specification of machine and leach processing power from the rest of the network?

    --
    e4 e5
  2. Why should I want my computer doing others' work? by Navius+Eurisko · · Score: 2, Interesting

    In Scientific American, the writer gives the example of Mary's computer being ultilized by a Biotech company while it's idle. Another example is a movie that is stored on several hundred people's computers. Why should I let my computer be ultilized for someone else's for-profit work or entertainment when they can do it for themselves?

    It's another thing when a person volunteers to participate (I run SETI@athome) but this proposal sounds like a forced standard upon a consumer.

  3. High latency? by Telastyn · · Score: 4, Interesting

    The only thing I could immagine these things being used for is very high storage, very very parrellized problems. Factoring, travelling salesman (otherwise known as airport scheduling), SETI@home and the such.

    The OS will never be fully "functional" as OSes are considered today, because people will lie and cheat and steal. IMO (read: opinion removed from ass) the only practical use of this would be the equivalent of making a kernel patch that could have a slice of disk, a slice of memory usage, and a slice of bandwidth, and then it would run SETI@home, or whatever code it was instructed to run from the "master".

    If it was not run on public machines I could immagine something akin to Beowulf from the ground up. An OS designed for premeditated clustering. That's not Internet sized though...

  4. P2P makes the inroad more acceptable by 2Flower · · Score: 3, Interesting

    Five years ago, I'd have said no way, this is unfeasible, people would not contribute their storage space and CPU cycles to someone else.

    But now, with server-obfuscated peer to peer systems like AudioGalaxy, it could be possible. Imagine selling people on the idea of a 'universal public hard drive', where all you do is search for a file, then copy it over locally without actually knowing where/who it came from. I doubt there'd be any objections, given how convenient and 'anonymous' it would be. Sacrificing a share of your own hard drive space for cacheing files you might not be interested in would be a small price to pay for that. That's one resource down; do the same thing for CPU cycles (provided we have a killer app reason for people to need more cycles, given high speed processors of today) and other computing resources and the rest will fall in place.

    I doubt it'll go as far as this proposal, at leastnot for a LONG time, but the unthinkable is already becoming the thinkable in some areas.

  5. hmmm by ekephart · · Score: 4, Interesting

    Don't get me wrong the marvels of distributed computing are endless, but why don't we make ourselves more efficient on a smaller scale first. Besides there are some questions to work out.

    "Consider Mary's movie, being uploaded in fragments from perhaps 200 hosts. Each host may be a PC connected to the Internet by an antiquated 56k modem--far too slow to show a high-quality video--but combined they could deliver 10 megabits a second, better than a cable modem."

    Ok, thats nice, how do they propose Mary receive 10Mbps? Get 12 DSL lines? What about the people on dial-up? While people gain access to the internet around the world, those of us with the uber-connections will just leech on them? Now, they talk about the "digital divide" but that is just plain vicious. I'd rather be stickin it to The Man then Uncle Sven in Stockholm. So then what, everyone gets a fast connection -> backbone upgrade -> ATT, MCI, Earthlink, Sprint, etc. spend the money that Amgen would save.

    Also: How would individuals choose who can use their computers resources given their ethical or moral convictions. While I would surely donate my CPU and disks to cancer research or finding larger prime numbers, I don't want the DoD using it to think up new ways to kill people.

    --
    sig
  6. Re:Whats in it for me? by CmdrPinkTaco · · Score: 2, Interesting

    I was thinking of an idea similar to this a while ago and thinking how to get people to get people on a system and how a company could profit from a system such as this. The idea that I came up with is as follows (and granted, this is very much a rough outline):

    Sell computers at or just above cost to consumers in a package that provides all the necessary hardware / software. The end user will be forced to sign an agreement that will provide for them the DSL / cable line at a reduced cost and the computer for the end user. They must also agree (stated within the terms of service, that their computer should always remain on (when reasonable) and when not being used is subject to being used by my company (we'll call it MyCo).

    Now, to offset the costs of the reduced price of computers and the reduced cost of cable / dsl - MyCo then can sell a client to a larger corporation who is interested in large scale computing without having to purchase one. For those of you who are familiar with the supercomputer environment, it isn't uncommon to lease out cycles on a larger scale computer to other entities to help offset the cost of some of the larger super computers. By leasing out the number crunching abilities of the distributed network of computers, this would be able to cover the costs of selling consumer hardware / packages and would allow for large-ish companies to harness the power of a distributed number crunching system.

    Like I said, this is all very preliminary and more of just a thought than anything, but I think that something like this might attract more than just the "geek novelty" users. It would allow consumers to benefit, and would allow other companies to piggy-back on the system without having to make the large investment into a "supercomputer."

    --
    Please give your mod points to others, Im at the cap. They will appreciate it more
  7. Just wait.... by st0rmshad0w · · Score: 3, Interesting

    Until your system and damn near everyone elses is siezed for evidence in some computer crime or some move in the war on terrorism.

  8. Communist, Schmommunist... by RumGunner · · Score: 2, Interesting

    This won't work for the same reason that communism doesn't work. There are too many people who are greedy, manipulative jerks, and more often than not they will take advantage of the rest of us.

    Perhaps if you set up your computer service like a secret society this would work. Then you'd have to know all the users, and would be able to track everything. It would be like the Masons, only with computers.

  9. distributed backup is the killer app by emin · · Score: 4, Interesting
    The article mentions distributed backup as a possible application, but in my mind distributed backup is the killer application.

    Consider a distributed backup program which works roughly as follows.

    • You install the program and give it a certain amount of space on your hard drive.
    • You tell the agent which files or directories you want backed up (e.g. /home).
    • The distributed backup program periodically contacts other computers and swaps encrypted versions of your data for their data.
    • If your machine crashes or you lose data or your city gets nuked, you can easily recover your data from the computers you shared with.

    This type of application would provide at least 3 important benefits for backup. First, its relatively cheap. If you want to backup more data, just buy more local disk space and trade files with more computers. This seems much easier (at least for a home user) than setting up a tape backup system, making sure the tapes get replaced, making sure the tapes get put someplace safe, etc. Second, its much safer than pretty much any backup system you could buy today commericially since your data is literally spread all over the world. Finally, the backup system isn't controlled by any large corporation.

    Obviously there are still some details left to be worked out such as how to let computers who want to trade files find each other (both centralized and distributed options exist analagous to napster and gnutella), how to prevent cheating (having your computer periodically ask its partners for hashes of the data they are backing up should work), how to control redundancy most efficiently (error correcting codes like Reed-Solomon codes or Tornado codes would probably be smarter than just repeating data).

    If you're looking for a great distributed open source project that will make the world a better place, I encourage you to develop prototypes for distributed backup. I plan to develop my own prototype one day, but currently I'm pretty busy with graduate school.

    -Emin

  10. Re:i don't know.. by Anonymous Coward · · Score: 1, Interesting

    ...Yeah, can you imagine the stink people would raise if, instead of their processor time and disk space, their hard earned money were used for all sorts of programs that they have ethical and philisophical objections to. I mean, what a horrible world it would be if other people were using my money to execute citizens convicted on shaky evidence or build some ludicriously expensive impossible missile defense shield. I'm sure glad that type of thing is only the stuff of dystopian science fiction novels. Whew!

  11. Trusted data by SirSlud · · Score: 3, Interesting

    Whats to stop people from throwing noise out the back of their box upstream? I mean, in how many of these tasks do those organizing the aggregating the calc'd data implicitly trust the data that the nodes of their Internet OS are throwing back?

    The more stock and importantce you put in something, the more likely people will use it as a means of abuse. I can envision a world where people who are against a particular scientific task (for whatever reason, ethical, on principal, or whatever), use this Internet OS, and join particular distributed apps simply to throw noise into the upstream ...

    --
    "Old man yells at systemd"
  12. Re:i don't know.. by spoonyfork · · Score: 3, Interesting
    I'm not so sure how i feel about something i own being used for something i don't.

    What if the computer you bought for US$2000 was largely subsidized by the colation of entities that wanted to use your CPU and mass storage when you weren't so that it only cost you like US$1000 or even US$500. Would you participate then? Even if you wouldn't, could you see how someone else might?

    --
    Speak truth to power.
  13. Error correcting codes by yerricde · · Score: 2, Interesting

    Should I guess the missing 40% from the available 60%?

    Yes! Error-correcting codes will make it possible to guess the whole file from fragments that add up to 50%. Mojo Nation already does this.

    --
    Will I retire or break 10K?
  14. can anyone say... by gh05t · · Score: 4, Interesting
    security as we know it no longer exists?
    How many people do you know that are too scared to purchase anything online because they're afraid that some crazy cracker will intercept vital financial information? I know quite a few. We have to keep in mind that a relatively small portion of the overall population will actually see the benefit of this technology; and even fewer will trust it.
    Things that should be considered:
    • security of personal computers
    • security of bank account
    • additional power consumption from computer being left on
    • cost to companies that use the technology
    • cost, if any, for a persons' file backups
    • value of the differences in speed/storage of individuals' computers
    First of all, can the encryption be cracked? with massive distributed computing available your computers cpu cycles may very well be used to crack your own personal encryption scheme that was used to back up your files securely. What kind of bank account access will be given to allow pennies to trickle in? Without proper supervision, how would you know that the pennies trickling out are really legitimately earned? I beleive that there was a case not too many years ago where a programmer created 'bugs' in a banks software that allowed money to trickle into his own bank account unsolicited. Also, can the companies using your pc really pay enough to compensate for the additional power consumption costs of leaving your computer on more frequently? Wouldn't people be more inclined to leave their computers on more often so as to allow more pennies to trickle in? And last of all, how would the value of individuals' computers be judged? Would it truly be fair to allow someone with a Pentium 233MHz and a 3 Gig hard drive to get payed the same rate as someone with an Athlon XP 1900+ and 80 Gig hard drive? I think that it's a cool idea, but too difficult to implement any time soon, if ever.
  15. Storage by esme · · Score: 3, Interesting
    The greatest possibility that I see for using this sort of system is storage. I don't know about the rest of you, but I would glady sell my spare processor cycles to get a robust, secure, frequently-updated backup of my files. I have a backup system (CD-R for my home machines), but I don't keep it updated very well, and certainly not as updated as the system they're talking about could keep it.

    Add to that the fact that when you start dealing with serious amounts of data (~1TB), making backups to tape or any other media starts to get really difficult. If the free disk space on people's computers (I've got around 30 or 40GB free on my home machines) could be put to use to store backups, I'm sure businesses would be willing to pay a significant amount of money for it.

    -Esme

  16. How does one control what one's PC is used for? by Mr.+Neutron · · Score: 3, Interesting
    I'm all for sharing unused CPU power and DSL bandwidth, but what if I think SETI@Home is a waste of time, or have moral objections to my box being used as a repeater to broadcast R-rated movies? Is there going to be a way to itemize every flop and byte, and opt-out of the ones I don't want?

    Probably not.

    --
    dinner: it's what's for beer
  17. Re:Scary... by tonywong · · Score: 4, Interesting

    What about the computer doing things that you are philosophically opposed to? Like nuclear simulations (for China?), or genetic database searching for profiling individuals?

    It can be a lot more scary than you think.

  18. I/O Bound by Waffle+Iron · · Score: 3, Interesting
    It seems to me that most coputational tasks are more I/O bound than processor bound anyway. This scheme would just make the problem worse by moving the computations farther away from the ultimate source and destination of the data being processed.

    Processors faster than 2GHz are dirt cheap today. High-bandwidth connections aren't cheap, and connections to home users are 3 orders of magnitude slower than an internal disk drive channel.

    This kind of thing only seems to make sense for the most geek-oriented scientific types of calculations, and of those only the jobs that are trivially parallelized, like SETI. I don't see everyone changing their OS to support it.

  19. a couple of issues by dutky · · Score: 3, Interesting
    It sounds nice, but I see two big problems:
    1. even if we have lots of unused processor time (which I'm sure we do), pumping the data in to and out of a remote procedure call can consume a lot of bandwidth and result in a huge lag time. Many problems don't distribute well, even when you have relatively high bandwidth connections to send the data over (like multi-GB memory busses), so the problem only gets worse when you use a measley network pipe or modem line. (processor memory bus bandwidth tends to be in the 5-10 Giga-bit range, even the best home internet access is only 10-100 Mega-bits)

    2. the steady state of a hard drive is full. There just isn't going to be enough spare, on-line, storage space on folks' desktops to give any appreciable amount out to share. If you have to deal with the bloat of a self healing encoding, the problem only gets worse.

      Consider the case of N users, each with one hard drive of size X. They share out half of their hard drive space, but a file takes three times as much space to store on the distributed system than it does purely locally (for the self-healing encoding). The total hard drive space available to the group is now N*X/2 + 1/3*N*X/2 = N*X*4/6, or just over half the actual total space on the network. The average space available to any single user is the total available space on the network divided by the number of users, or just over half the actual space on the individual user's local hard drive.

      That doesn't sound like too good a deal to me. Admittedly, I will be getting some extra reliability, but given how many home user's back-up their data on a regular basis, I don't think reliability is worth much (at least to home users).


    At first blush, it sounds like a nice idea, but I don't think the economics are going to support it. It will always be easier and cheaper for the folk that actually need more storage or processing power to just go out and buy it, especially while Moore's law is in effect. For anyone else, it just doesn't matter.
  20. Been there, done that by Jim+McCoy · · Score: 3, Interesting

    The article mentions distributed backup as a possible application, but in my mind distributed backup is the killer application.

    While this is not directly mentioned by David Anderson in his article I know for a fact that this is something that United Devices is interested in because late last year Mojo Nation was in discussion with UD to provide just this sort of service to its users.

    This sort of distributed backup is what the current private branch of the Mojo Nation codebase does, with a little taskbar app that sits in the background and distributed backed up files to peers within the enterprise. One major benefit that your post missed is that the majority of the data stored on hard drives within an enterprise is redundant data (e.g. multiple copies of MS Word, etc.) and with a distributed backup system you only need to keep a few copies of such files around for restores. You can back up 99% of your data while only needing 10-15% of the available space on individual PCs.

    In what is turning out to be one of life's interesting ironies, the company that was most intrested in this UD/MojoNation pairing was Enron's bandwidth trading group (mostly for storing medical imaging data and distributed corporate backups.) When Skilling left Enron just before the whole accounting scandal started to blow up the Enron guys became "unavailable" so things never moved forward, but you can be certain that this sort of a distributed data storage and backup system will appear again.

    Jim

  21. Intended use... by nologin · · Score: 2, Interesting

    I'm not so worried about the technical side of things, but more along the lines of intended use...

    Could someone queue a job to crack a encrypted password file, or a document stolen from the government? I imagine that with 150 million computers using their spare cycles, this job could be done with relative ease. This is definitely an issue that the authors have failed to address in their proposal.

    The legal rammifications alone makes this prohibitive. Is a person who's computer did 0.1% of an illegal activity just as liable as someone who did 10%, 25%, 50% or as liable as the person who submitted the job? Can you even fully control what kind of jobs your system is doing using this proposed infrastructure?

    It may be a great idea for say X machines inside a large corporation, but there is already some alternatives to fill that need. I just don't see how they can work out the logistics of issues such as the one I present above, when they have to also worry about technical and financial issues that such a system would bring with it.

  22. Re:It's been done, and no one uses it by Salamander · · Score: 3, Interesting
    Frankly, "high latencies and frequent failures" are why such an idea is impractical, regardless of whether or not the theoretical problems can be solved (and i argue that they already have been solved).

    Hm. So we have a set of "theoretical" problems, for which it's doubtful that solutions exist. Except that you say they've already been solved...and apparently they're not just theoretical either. Truly, you have a dizzying intellect.

    Local disk space is far, far cheaper and more robust than network storage!

    Cheaper, yes. More robust? For what value of "robust"? Are we talking about data that only exists in one place, or in multiple places? Which one's more resistant to the type of failure that takes out a whole site? Please provide a definition by which something that exists only on your machine (whose mere existence is only known locally) is more robust than something that exists in multiple places.

    How long will it take to transmit a few dozen gig via DSL?

    Irrelevant. In any but the most stupidly designed distributed data stores, most data would be served out of a local cache under most conditions. In many, the next step would be to serve it out of another geographically-local machine over a fast LAN connection. Just because you personally can't think of a distributed-storage architecture any better than traversing the globe for every datum doesn't mean that better architectures don't exist.

    there is no reason to not use a user-level process to manage the data exchange

    Really? Ever try to do mmap-style I/O over Napster? How about plain old open/read/write over Gnutella? Byte-range locking within a Freenet file? Hmmm. If you want to talk about solved problems, how about ideas like VFS layers and network-protocol abstractions? To provide generalized, transparent access to data, on a par semantically with the sort of access that you get with a local filesystem, your "user-level process" isn't going to cut it. Not by a long shot. That's like going back to the days when every application needed its own library just to get keyboard input or draw stuff on the screen. This kind of thing belongs, at least partially, inside the operating system so that all applications can use all equivalent protocols without special linkage; see my file-sharing manifesto for a fuller explanation.

    --
    Slashdot - News for Herds. Stuff that Splatters.
  23. Re:A question of trust by renehollan · · Score: 3, Interesting
    True enough, and as I'm very much a libertarian, I afgree with you. But...

    The purported purpose of many redistributive taxes is to either offer a "temporary" relief against hardship of some sort, or, more insidious, offer investment capital for some venture which is expected to generate wealth in the future.

    Historically, private charity (when not the victim of dollars that go toward taxes instead of the charity) does a better job of taking care of the poor and destiture than does government.

    As for "investment capital", if the venture were worthy of funding, private investors would do so, for a share of the expected gains.

    Sometimes, of course, the government wins, or at least had a miniscule investment in something that wins big (think "Al Gore's" Internet). And I've seen many a slashdotter argue where government should "invest" -- NASA being a favorite "charity" (because they do cool stuff, I suppose). So, we slashdotters, as a group, are not immune to the lure of redistributed tax dollars. The big problem here, is that no matter how small the "government's" (i.e. taxpayers) investment, they claim ownership, lock, stock, and barrel, citing that "it wouldn't be if not for Uncle Sam [substitute your government as appropriate]".

    Perhaps not as soon, but worthwhile things do get tended to by the private sector "when the time is right" (yes, to expect to profit, of course). The private sector tends to be far more responsive as well, espescially in innovative new technologies exploited by startups.

    So, no, I am not any friend of government redistributive taxation, but I do think we should have strong counter arguments for all the "justifications" for it.

    --
    You could've hired me.