Slashdot Mirror


Making Use of Terabytes of Unused Storage

kernspaltung writes "I manage a network of roughly a hundred Windows boxes, all of them with hard drives of at least 40GB — many have 80GB drives and larger. Other than what's used by the OS, a few applications, and a smattering of small documents, this space is idle. What would be a productive use for these terabytes of wasted space? Does any software exist that would enable pooling this extra space into one or more large virtual networked drives? Something that could offer the fault-tolerance and ease-of-use of ZFS across a network of PCs would be great for small-to-medium organizations."

21 of 448 comments (clear)

  1. Not without heavy utilization of other resources by Mostly+a+lurker · · Score: 4, Insightful

    If you have a very robust local network with plenty of spare capacity, and can accept a performance hit on the client computers, I am sure some kind of linked filesystem would be possible. In most practical situations, I think this idea would be a non-starter.

  2. Do you really have control of the boxes? by Marc+Rochkind · · Score: 5, Insightful

    If they're in a computer room, then such a scheme might work. But, if they're on user's desks, you don't really have control. They're subject to filling up, being shut off, being knocked about, crashing, etc. I don't think in this case you would really get the reliability that the diversity and independence would suggest.

    --Marc

    1. Re:Do you really have control of the boxes? by cbart387 · · Score: 2, Insightful

      Or is this just trying to salvage something you can't really use in order to create a solution for a problem that doesn't really exist? Bingo Bango Bongo! If you read the submitter's question, it simplies to:
          a) Is there there something productive I can be doing?
          b) How to do it?

      Everything else is fluff that tends to lead slashdot readers off on tangents, flamewars, Emacs Vs Vi (emacs), KDE vs GNOME (gnome)
      --
      Lack of planning on your part does not constitute an emergency on mine.
  3. Maybe move with the times? by line-bundle · · Score: 2, Insightful

    You could try to use something like "Localhost Azureus" for distributed data storage. The only problem will be that it will cost you in terms of processor and network hogging.

    Is it cost effective to reclaim that (small) space? Probably not. My suggestion is to realize that no-one tries to save clock cycles any more and maybe this is the way disk storage is probably heading that way.

  4. Space is not that important any longer by eebra82 · · Score: 4, Insightful

    It's a very interesting question, but from my point of view, hard drive space is so ridiculously cheap nowadays that it is utterly pointless to look for a useful application that will fill it up.

    Let's assume that the average computer has 80 GB of storage. Multiply that by 100 and you get 8 TB of space. That's what you can get into one or two computers nowadays without plunging out too much cash.

    What's more interesting is how much processing power you have as well as how fast the internet connection is.

    1. Re:Space is not that important any longer by jaxom · · Score: 5, Insightful

      I disagree with this and face this question all the time in work. Disks are cheap, storage systems aren't. If this is for a business that requires reasonable uptime, then the only solution would be to implement a SAN using Fibre Channel or iSCSI and then take out the drives. With the right array, all of a sudden those drives become superfluous (you decide if boot from SAN is right for you), management is easier and you'll be able to get a lot of reuse out of the drives.

      Now a lot of people will start to question the cost of doing all of this and it isn't cheap, however you have to analyze the data correctly. We migrated 200 servers from DAS to a SAN and had our money back within 12 months. Add on top of that the implementation of VMs, all of a sudden those 200 went to 20. That's a big difference in cost of ownership.

    2. Re:Space is not that important any longer by STrinity · · Score: 2, Insightful

      The solution is obvious -- the company should have just one or two 80 gig hard drives that employees connect to via Unix terminals.

      --
      Les Miserables Volume 1 now up with my reading of
    3. Re:Space is not that important any longer by LWATCDR · · Score: 5, Insightful

      Yep a better question is Why do all these PCs have harddrives?
      If they are really only using it for the OS, a few applications, and a few docs why not use diskless workstations?
      Less power, heat, and fewer things to break.
      In other words don't use all those drives, get ride of all of them.

      --
      See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
    4. Re:Space is not that important any longer by jpetts · · Score: 2, Insightful

      You're talking about different things: for example, I have just put together a NAS device using SATA disks that offers out volumes as iSCSI targets over GigE.

      SATA is a drive interface spec. NAS is a generic description of a type of storage device. iSCSI is a communication protocol, as is GigE.

      It's being used as storage for an Oracle database server used by around a hundred simultaneous users.

      By buying commodity parts from Fry's I managed to get 3T usable for under $2000.

      Oh, and I had fun building it.

      --
      Call me old fashioned, but I like a dump to be as memorable as it is devastating - Bender
  5. Dumbest question yet... by Aaron32 · · Score: 1, Insightful

    This is the dumbest /. question I've seen. Decentralized network storage pooled together with no means of practical management? Sign me up! Oh yeah, let's rely on the ditzy end users to help make sure it doesn't crash. I'm sure everyone will leave their computers on 100% of the time so you can make use of it. Don't tell anyone at work of your idea, they might not ever stop laughing.

    1. Re:Dumbest question yet... by ZeroPly · · Score: 2, Insightful
      You haven't put any thought into this - it takes about 20 seconds to answer your concerns given an introductory class in OS design.

      Obviously computers will crash or be turned off. We have this wonderful concept in architecture design called "redundancy" which we can use to address problems like that:

      Assume the probability of any computer being offline is d(c_n). For some computers you will have d(c) very low, such as user out of town often, other will have d(c) quite high, either the user leaves it on all the time or it has background processing to do.

      Computing and updating d() is fairly easy given any modern management tool. Then create clusters of computers with a required availability so that you stripe data across the componenet computers taking into account d() of each computer. Availability of the cluster would be a function of your modified striping algorithm. When you save data, you just choose what availability you would settle for, and the right cluster is chosen.

      Let me answer your next question in advance: if this is so obvious why is no one producing a product that's cheap and easy to implement? Because you'd have about 25 patent trolls lined up at the courthouse - too many teeth, not enough ass.

      --
      Support microSD: in a post 9/11 world, it is unwise to carry your data on media that you cannot comfortably swallow.
  6. Re:Not without heavy *use* of other resources by Anonymous Coward · · Score: 2, Insightful

    Please stop typing words like "utilization" when you mean "use". You sound like a PHB trying to sound smarter than he really is and you make it a pain for people to read what you write, especially non-Anglophones. Read George Orwell's essay on this topic.

  7. Looking at the problem another way... by pedantic+bore · · Score: 4, Insightful

    You might want to ask yourself why, after more than a decade of research and countless papers and prototypes that address this problem, your PCs storage are still underutilized...

    It's harder than it looks to get something reliable. Your PCs have extra capacity because it's cheap, but mining that capacity is not cheap. As other posters have pointed out, putting together (or just purchasing) a server with a few TB of storage is simpler and cheaper, less prone to getting wiped out by a virus, easier to manage and backup.

    --
    Am I part of the core demographic for Swedish Fish?
  8. Re:Not without heavy *use* of other resources by DarrenBaker · · Score: 3, Insightful

    Hrmm... Funny, he didn't come across that way to me at all. You, however, come across as a pompous linguistic Nazi, much like Orwell. If you compose sentences for people who don't have command of the language, then you are really quite delusional.

    As is my understanding, resources are utilised, while tools are used. He was correct in its usage.

  9. Grog likes it simple by upside · · Score: 2, Insightful

    Great, let's all dumb down to the lowest common denominator. English is a rich language and all the better for it. If you're too lazy to learn it, your choice. I'm a non-native speaker but prefer a vibrant, expressive language to some "for-dummies" international pidgin.

    --
    I'm sorry if I haven't offended anyone
  10. Re:Not without heavy *use* of other resources by Anonymous Coward · · Score: 0, Insightful

    The difference is that I wasn't nasty about it, I explained a problem and gave him a link to an essay about it. You, on the other hand, called Orwell and me names, attacked a straw-man, and said something incorrect about the words that is trivially debunked by glancing at a dictionary.

  11. Re:AFS by Secure+Endpoints · · Score: 2, Insightful

    AFS would be applicable if you were interested in turning each end user workstation into a centrally managed AFS server and dedicate storage for holding replicated readonly volumes. I wouldn't store single instance read-write volumes on a machine that at the mercy of an end user to turn on or off. I would also be resistant to deploying centrally managed storage on end user controlled machines in any case due to the access control issues. Anything that is stored on a machine that the end user has physical control over can be accessed by the end user.

    As others have pointed out, storage is so inexpensive these days. 8TB can be obtained for a few thousand dollars and managed in a much more reliable manner.

  12. Re:you need a better gun by ScrewMaster · · Score: 2, Insightful

    Detailed knowledge of any technological artifact will make you better at using it, maintaining it, knowing when to use it, whether it's an automobile or an AK-47. Yes, some people find guns interesting to a greater degree than others (I don't, personally, nor do I own one) but whatever floats one's boat. Let me ask: do you find someone that has an advanced knowledge of computers creepy? Probably not, if you're on Slashdot ... but there are many that do, until they need him.

    When the time comes that I need a brain to pick, it's those "creepy" nerd types that I seek out. They're the ones most like to be able to help. Maybe you're anti-gun, and the fact that some people are not is offensive to you, I don't know. Regardless, you should look at people who know much more than you about a given subject as a potentially valuable resource, not an object of scorn.

    --
    The higher the technology, the sharper that two-edged sword.
  13. May I recommend against this? by IanDanforth · · Score: 2, Insightful

    Having tried this in college, I can tell you a couple things.

    1. You will noticeably reduce the lifespan of the discs. (Which can anger cost conscious supervisors)

    2. Doing ongoing hardware maintenance, because of this reduced lifespan, on closed, used by others, boxes is a *serious* pain.

    Storage setups make hot swapping discs easy, trying to do this with full blown systems just gets tiresome. The solution I eventually came up with was the following.

    Implement a two tiered hardware replacement cycle where you reduce the time a user is allowed to keep any hard drive in their box before replacement. Then using the still reasonably good drives, create a centralized storage solution in which the drives can live out the rest of their useful spans. Data security, user happiness, and redundancy are all good selling points of this system. You still have to deal with monkeying around in user boxes but if it's on a schedule and it nets you more drives, it's not so bad.

    -Ian

  14. Re:Typical IT guy by AlecLyons · · Score: 2, Insightful

    You make it sound like it's a bad policy keeping all business data somewhere properly managed. It won't mitigate any damage done to your company or your career because you told them to be careful. People will store data in the most convenient location, thats not stupidity - just human nature.

  15. Re:It's been done by Microsoft: DFS NameSpaces by darrenkw · · Score: 3, Insightful

    Been there also and I disagree on the "just works" part. We're using it successfully but we've run into issues with losing files from some of the computers. Let's say that the admin changes permissions on somebodies directory so that they can write to it also. DFS will think that the file with the changed permissions is the newer one and blow the other one away. I hesitate to call that "just working".