Ask Slashdot: DIY NAS For a Variety of Legacy Drives?
An anonymous reader writes "I have at least 10 assorted hard drives ranging from 100 GB to 3 TB, including external drives, IDE desktop drives, laptop drives, etc. What's the best way to setup a home NAS to utilize all this 'excess' space? And could it be set up with redundancy built-in so a single drive failure would cause no data loss? I don't need anything fancy. Visibility to networked Windows PCs is great; ability to streak to Roku / iPad / Toshiba etc would be great but not necessary. What's the best way to accomplish this goal?"
Those older drives are probably failures just waiting to happen. With the cost of the hard drive space continually dropping, just use new drives. It's not worth screwing around with old ones for anything other than salvaging old data off them, even though the urge to do so is strong in the more frugal among us.
FreeNAS or OpenFiler.
I think FreeNAS (the BSD based one) is lighter and easier, as OpenFiler seems to be going in a more "fully featured" direction with less support for older hardware, but they're both good.
maybe put them in machine, make a glusterFS ?
Not streak to iPad. Stream. Streaking to iPad would require cleaning supplies at the point of impact.
Look at FreeNAS or Unraid. Unraid has a 3-drive limit IIRC for the free version, but supports an unlimited amount of drives for the non-free version.
What you're suggesting is a colossal waste of power. Just buy a new drive and junk your ancient old drives.
If you use Windows, the forthcoming Windows 8 "Storage Spaces" feature appears to be perfect for situations like this. http://blogs.msdn.com/b/b8/archive/2012/01/05/virtualizing-storage-for-scale-resiliency-and-efficiency.aspx
FreeBSD has fast ZFS support which is wonderful file system to fight data loss.
If you just use LVM and group all your disks together into one PV, that would make the array appear as "one big drive" to the system.
Redundancy (RAID) would not work so well because your array would be limited by the smallest disk in the array. Sure, raid the 300GB to the 1TB, but you end up with a RAID-1 array of 300G.
Join the Slashcott! Feb 10 thru Feb 17!
Ah ha! Who else amongst you has a huge surplus of huge hard drives going unused, now that netflix streaming has displaced 60% of all the crud you had spinning idle in a closet the 3 years before you signed up?
My storage requirements went from about 3 terabytes to about 30 gigabytes over the past 2 years. I believe I am the archetype and that I am doing the same thing as the average geek. I suspect there are piles of huge disks sitting offline because of this streaming displacement.
It cost me about 18 dollars a month to leave my x86 file server online, idle (killawatt meter, nh rates); netflix is cheaper than that.
Come on, who else has a comment related to this.
slashdot: where everyone yells sarcastic metaphors to themselves to understand the issue
FreeNAS can use ZFS for aggregating multiples drives, independent of size, technology etc, all with varying degrees of protection.
It's by far the best solution to your case.
Flavio
Do you care about your electricity bill at all? If you do, it'll probably be cheaper over the course of 6-12 months to buy a simple NAS box or a cheap atom board and plug in a couple of 2TB hard drives.
However even if you replace the word "streak" with "steak," the sentence still makes no sense.
Windows Home Server (V1) - mix and match to your hearts content and all the addins you can eat for adding features.
I want a list of atrocities done in your name - Recoil
If you have pairs of drives with reasonably similar size and performance specs, you could deploy them in RAID 1, or RAID 5 if you have three or more similar drives, and have some redundancy. FreeNAS, OpenFiler, or Nexenta will all work, but you're still rolling the dice in a rigged game, man. Old hard drives are for target practice.
http://www.winsupersite.com/article/windows8/windows-8-feature-focus-storage-spaces-142537
Well, lots of things.
Media can suck up a lot of drive space...even if it is all legal!! You might want to rip all your CDs to various formats (flac for good stereo in the living room, mp3s for ipod or car).
Ripping your dvds/blurays...to watch conveniently. Then with all this, you might like a few backups so you don't lose all that ripping work too easily.
I'm about to buy a new high end DSLR....storing pictures....HD video for production...archiving the raw video parts as well as the finished products. Still photos....originals...plus processed ones....redundancy copies for these can eat up a lot of space.
There's a ton of stuff you might want that will eat up space in today's digital world....
Light travels faster than sound. This is why some people appear bright until you hear them speak.........
Ditch your small drives. A 3TB is equal to 30 100GB drives, so there is absolutely no point in keeping them.
Group your drives into similar sizes and assign into a few different raid arrays. The array will only use the capacity of the smallest drive connected.
I would use RAID 1 if you only have 2 drives in the array. Raid 5 for 4 drives or more.
Hard drives are very reasonable priced these days. It will be much more cost effective to purchase new sata drives then attempt to use any IDEs at all.
Kevin
Why am I the only one saying this? Setup Greyhole, throw a bunch of disks at it, and enjoy! And to all those saying "those drives are going to die soon", you can actually tell Greyhole that you consider a drive "broken" and it will still use most of its storage (albeit redundantly) until it does die and have to be removed.
1. Throw away everything that isn't a standard-sized SATA drive.
2. Buy a Drobo (http://www.drobo.com/products/professionals/drobo-fs/index.php).
3. Put the five (or eight) largest drives in the Drobo.
4. Throw away the rest of the drives.
5. When you get a drive that is larger than the smallest drive in your Drobo, pull the smaller drive out and insert the larger drive.
6. Find peace in the universe.
When I was young and foolish, I tried to keep every drive spinning, even long after its time had passed. I had *nix boxes stuffed with drives and SCSI-attached arrays. I learned a lot about drive management and system administration but, mostly, I learned that there is a value to my time and my time isn't best utilized playing disk administrator.
Drobo doesn't pay me a dime and I am still more excited about Drobo than any technology product since TiVo.
Cheers,
Matt
Maybe the power consumption problem could somehow be worked out by starting and stopping disks based on idle timeouts? I don't know how well that kind of setup would play with a RAID configuration, but perhaps there's some other method too.
I still kinda like the concept of keeping old hardware running for ecological reasons (making new stuff takes a lot of power and resources). And it would be interesting to find some kind of solution for this case even though it's gonna be somewhat hacky. My two cents is to consider putting all the IDE disks behind IDE->SATA converters to make the whole setup SATA-only, to streamline it a bit.
Windows Home Server v1 (2003). Drive extender will lump all sizes of hard drives into a JBOD. Turn on folder duplication for redundancy. Full integration with Windows 7. Use Plex media streaming to stream to an iPad. That's my setup and works great.
2006 called, and it's pissed that MS stole all those features from ZFS....
If only "common" sense was actually that common...
I've been using unRAID for years and it's a great solution for a small home NAS box. If you ever change your mind about using it, you simply turn your parity drive into a regular Linux boot disk, and the remaining drives are just regular Reiserfs2 filesystems. Most RAID systems and/or software would require much gymnastics to de-RAID them, if it could be done at all.
In addition, hardware-based striped RAID makes you dependent on the RAID controller; if it dies and you can't find a replacement compatible with the original's striping mechanism, your data just disappeared.
Why are you combining 100GB and 3TB drives? First of all, the 3TB drive is litterally 30 times the size (giving you a space increase of 3%). Second of all, the 100GB is probably fairly old, so shouldn't even be trusted as stable. You are going to spend more on the ATA adapter for that drive than the value of the space it provides. Currently a 3TB drive costs about $100, that's $0.03/GB which means that 100GB drive is worth ... wait for it ... $3. Sata to IDE adapters run about $9 a piece.
I've been in the same situation, it was only a year ago that I was running on multiple 10GB drives and an old 120GB laptop drive because I only had IDE in my server. So I went to newegg and got a low powered an E350-onboard-cpu motherboard (doesn't even need a fan) for $130, 8GB of ram (I use ZFS) for $50 and a 2TB drive for $70 (drives have gone up since then, but not terribly high) and threw the thing into an old case with a cheap power supply. That's basically an entire system with about 15 times the storage space as my old one for $250 shipped to my front door and the system can take 5 more drives without so much as an expansion card.
Once you specify "a", the "d" is redundant.
Don't use FreeNAS. Normally I would recommend it because it's very straightforward and does a good job of what it does, but it's not ideal for your exact situation. It has a fairly old version of the ZFS file system. You'll want a new enough version to use a RAIDZ configuration. This way you can put all the drives in one pool, and it will automatically juggle redundancy and parity to allow a single drive to fail without losing data. That has the end result of presenting some fraction of your total storage as one single mount point.
This will maximize the amount of usable storage you get while keeping things redundant and failure-resistant.
FreeNAS only supports simple striping or mirroring. With striping you'll lose the whole pool if one of your old drives fails, so it's right out. Mirroring works best with matched pairs of drives with the same capacity, and you'll be wasting space to drive pairs of mismatched sizes. Also, even in the best case scenario mirroring leaves you with only half the usable space relative to the total capacity of all your drives.
Some things to keep in mind:
- RAIDZ doesn't allow shrinking pools (I think). So every time one of your old drives dies you need to immediately replace it with *something.* That could get expensive if they're old and you have a lot of them.
- You could probably replace the capacity of all these old drives with $300 of new drives. Is it worth it? Especially knowing that you'll have to replace them with modern drives as they fail anyway, and will likely soon enough end up with a crazy glut of drive space, and an empty bank account from buying all the replacement drives.
- There's a PPA for ZFS on FUSE available for Ubuntu Server, that would probably make setting something like this up pretty trivial.
Seriously, buy a Synology NAS, dump all your data on it and call it a day. The cheapest 2 bay model they make is straight up badass for its price point.
Good-bye
Full disclosure: I am the developer
Check out: http://stablebit.com/DrivePool
It's a software disk pooling solution that combines any number of disks of any size into one big virtual pool. You can designate certain folders to be duplicated on the pool. Any files placed in duplicated folders will be stored on 2 disks at the same time.
The implementation is a hard core NT kernel driver with a virtual disk. There is a full NT kernel storage stack, no user mode hacks here.
Unlike RAID and similar solutions, all your pooled files are stored as standard NTFS files on each individual disk in the pool. This means that you can simply plug in any pooled disk to any system that can read NTFS to get at your files in case disaster strikes.
It's commercial software, $20 USD per server.
Put all of the small drives in a JBOD array and use the 3TB as an internal backup because RAID is not a backup solution.
Use FreeNAS or OpenFiler.
Drobo performance sucks (with more than one concurrent user).
Low-end core i3 processor and lots of RAM because RAM is cheap these days.
Competition Good, Monopoly Bad.
look at amahi.org, it is a turn-key Home Server based on fedora and greyhole as it's replication engine.
Dump anything less than a TB except one drive and you are set.
You set the replication level by share and it keeps a full copy on each drive until the replication count is reached for that file on that share.
Example:
you have 4 1TB drives and 1 500Gb drive.
You have the share photo configured to replicate on each drive.
You have replication off on the video share.
You have a replication level of two on the mp3 share.
When you store a photo greyhole write it to your 5 drives.
When you store a video it goes on a random drive.
When you store a mp3 it goes to 2 random drives.
So if you lose a drive you should loose about 25% of your videos, 6.25% of your mp3 and 0% of your pictures.
Jehovah be praised, Oracle was not selected
Though it's not free it is cheap and works well. It supports allot of hardware and is efficient with resources. I have gotten it to run SCSI, ATA, USB, FC-AL, SATA, and SAS drives all at once. A build usually only takes about 5 minutes including a basic configuration. I have been running it for over 7 years now with no real issues to speak of. Mike
It looks like l'm going to have to read up on this stuff again.
Given the spread of drive ages I'd definitely want redundancy, and given the variety of capacities, a traditional RAID system isn't going to cut it. I'm actually thinking of cloud computing technology, with it's attendent abilities to duplicate data(and services!) across sites of uneven capability, even optimize resources across different locations to optimize resources.
Basically, you'd be looking a 'cloud' of HDs, with an underlying system that's aware of the different drives and a directive to, say, keep all data on at least 3 different drives. How it does that depends on the alogorithms, but it isn't structured so rigidly like traditional RAID.
I don't read AC A human right
Powering 10 old harddrives for some time is going to be much more expensive than just getting a new one. A modern drive uses about 5W on average. these oldies probably use much more. 10 drives using 10 watts at $0.10 per kwh will set you back $87 per year. You do the math.
0x or or snor perron?!
I'll add that having a single machine for backups is very convenient. I have a FreeBSD ZFS machine in the basement, and I run CrashPlan on it as well as netatalk so it can pretend to be a Time Capsule. Whenever I fix a friend/relative's computer I make them install CrashPlan on their own computer and point them to my server. Sure it uses some of my drive space up, but it saves me hours (days?) of time when their machine dies.
As you point out, all that digital crap sure adds up - and I have a fair amount of ill-gotten media as well. It's really nice to just tell SabNZBd to just download each episode of your favorite show and not have to worry about cleaning up the space right away.
W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
Pogoplugs are great, can plug in 4 drives via USB or more with a USB hub. I paid $25 for mine, can't really go wrong.
"If you have nothing to hide, you have nothing to fear." - Every fascist, ever
Use drobo if you are time poor and money rich, use btrfs if you are time rich and money poor.
Btrfs's capabilities are nothing short of amazing. Here is a vid about it:
http://www.youtube.com/watch?v=9bQc_z-Cb7E
Well you could wait 2 months for a release candidate of an os that few people will touch before the first service pack...
Or you could use ZFS, which has had those features for years already and is supported on several stable tried and tested platforms.
http://spamdecoy.net - free throwaway anonymous email - avoid spam!
My music collection alone is in the 100's of GBs. It's not at all inconceivable to need this much storage if you're trying to digitize your physical media collection. I'd probably need 20+ TB's to rip everything I own on CD/DVD/BD, I'm just waiting until you can get a good brand for about $20/TB or so.
If you don't care about the electricity costs, Illumian, NexentaCore, OpenIndiana or Solaris 11 with Napp-it on top makes a great ZFS file server. If has native CIFS (Windows) file sharing and can share AFS (Apple) and NFS with no problem. ZFS is by far the best file system around, and lets you combine different size drives with no problems. Napp-it turns the whole thing into a web managed appliance.
You can even stream with the minidlna (lightweight), ushare or mediatomb addons! Rock solid and very fast if you give it ample RAM.
My advice would be to find some inexpensive USB or eSATA drive enclosures for the smaller drives and just use them as off-line storage.
Take some data you don't need instant access to, put it on one drive, and make an identical copy on a second for backup. Put them in a corner and only power them up when needed.
Or just use the smaller drives as partial backup for a larger NAS. Can be handy if you suddenly need to grab a collection of files and go.
Like everyone else is saying, no sense keeping them spinning and eating up power. Might even think twice about the larger drives unless they are power efficient models.
The answer to your question is ZFS on FreeBSD.
Look into unraid from lime technology. Same basic premise as drobo but not a ridiculously priced rip off, predates drobo too, plus it is a linux-based distro so you can mess with it if you want, but it isn't necessary. best feature, besides not carrying about differences in drive sizes, is that the data isn't really in a raid, it just appears that way and if one drive fails, you can use parity to recover your data, and even if you cannot, you only lose the data that was on the failed disk since the underlying file system is just ReiserFS.
...after you wipe them, and buy a real NAS like a ReadyNas, Synology, etc. smallnetbuilder is a great resource for this.
Alternatively, use FreeNAS and build your own, with recent drives.
Create a Truecrypt file filling each old drive, after a full format. Use for full (not incremental backups) every 6 months, starting with the smallest sizes (to use them up). Then put them in your Mum's garage, suitably labelled.
Last tip for backups. Do "dir /on /s > backup_2012_04_23" for each drive after filling it, and keep the list on your main machine, so you can see if you've got a copy of something (and where) before fishing around.
There are several distributed file systems which are able to cope with a varied array of drive and speeds, and will try to keep as much redundancy as possible.
On need to read a little bit and pick the best candidate up.
"Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
Let's say a drive uses about 10W of power. Running 10 drives uses 100W or, about 73kWh/month, which to me would cost about 9EUR. The cheapest 1TB drive costs ~80EUR, so buying a new drive instead of using 10 old ones would pay off in 9 months. But if I do not have that 80EUR now, it's either wait 9 months and then buy a hard drive or grab an axe, go outside and ask kind strangers for money. If my drives are 200GB instead of 100GB the payoff time is longer (because a 2TB drive is more expensive).
I recently had 20 drives across three machines. I was using a combination of raid5, iscsi, mhddfs, and samba. Machine1 mounts the iscsi devices from the other two machines, and then mhddfs combines them into one virtual filesystem. Samba is then used to share files out with laptops.
What I found is that network card drives in 3.2 kernels are currently in a horrible state. They crash left and right under real load. This is after trying different brands, tweaks, version of drivers, etc. In addition it seems iscsi client in Fedora 16 is also not in a great state. Independent of network issues, I would still get failures. The machine running CentOS 6.2 used to be Fedora 16, but was converted to make things more stable.
My latest plan is to do basically the same thing I was doing before, but on a smaller scale. I am going to retire the 1tb drives in Machine3, and replace them with the 2tb drives from Machine1. I am also going to convert Machine3 to CentOS 6.2 for stablility. Then Machine2 will mount Machine3's iscsi device, and use mhddfs and samba. This reduces the number of machines involved from three to two, and takes Fedora 16 out of the mix. It will also reduce the number of drives involved in mass storage from 20 to 14.
I plan to add two 2tb drives to Machine1, for storage, but it end up being only a desktop.
Machine1
Fedora 16
6x2tb raid5
Machine2
CentOS 6.2
5x1.5tb raid5
3tx2tb raid5
Machine3
Fedora 16
6x1tb raid5
Havoc Penington, the bane of my Linux desktop.
unRAID will do this and in fact I often come home and find all my drives spun down as nothing is accessing them. Can put them all in one array too and the only "waste" will be the single parity drive. Been running it for at LEAST 5-6 years or more and have never lost data...
Build it, Drive it, Improve it! Hybridz.org
- how much power does the PC use when running
- how much does each drive use
- how likely is it that one of the older drives fail
- how much work is it to replace the drive, including the "cost" of the downtime
I would assume that over - say - three years the power usage of a regular PC (especially on that has enough room and power for 10 drives) will easily outweigh the cost for some low-power NAS and something like 3-4 large hard drives, which will most likely run without a glitch for something like 3-5 years ... at least that's the results I came out at, before picking up a 4-bay QNAP NAS that I put 4 1.5TB drives in ... mind you, I do like to tinker with hard and software, but at some points, it's just not worth it ...
People have mentioned electricity and such. They have mentioned the utility of the smaller drives etc...
:)
:)
Let's be frank... any home built PC with the ability to flood a gigabit Ethernet line or two will actually be quite power hungry. These Atom based NAS boxes aren't actually using straight ITX Atom boards, they have precisely what is needed, nothing more, nothing less. They are extremely power efficient as well.
FreeNAS, OpenFiler and several others out there are awesome tools. I have a FreeNAS server myself... though my home file server is running Windows 7 with 8x2TB with a SAS RAID controller, my iSCSI box for VMWare is running FreeNAS. I love it... but to be honest, it is not power efficient. If I spent 6 months to a year focussed on tuning FreeNAS to this specific system, I might be able to get power usage and the function set up to what I could have gotten from QNAP for example for a few hundred bucks.
For my super important file storage, I will very likely this week, after years of FreeNAS simply buy a two drive box with 3TB drives in RAID1 and put it on a different floor of the house. My network in the house is now made up of multiple Cisco Catalyst gigabit switches with four cable etherchannel between each floor. So, a little NAS with a RAID1 that supports etherchannel would be great for backing up parts of my server
No... my house doesn't look like a mess... I have very clean cabling and nice looking racks that blend with the furniture
The best thing I ever did with my old drives was to give them to my kids along with a set of screw drivers and a hammer. They a great time and cleaned up everything when they were done.
Except drives don't use 10W. Looking at one I pulled from a machine recently to put a bigger one in, it's rated maximum consumption is 8W, but in actual use (1) it'll only use that much occasionally -- while it's both spinning its platters up from stationary and seeking at the same time -- and (2) in an application like this it'll likely be in a power saving mode 75% or more of the time. I'd hazard about 1.5W is a much more reasonable figure to use as an estimated average.
Synology products are so cheap these days, and synology has a proprietary SHR (synology hybrid raid) structure that allows you to mix and match any kind of drive you want, from 100 gigs all the way to 3tb as you mentioned with redundancy built in - consider it a modified raid 5 that allows you to mix and match drives from different sizes and vendors.
This summer, Infinit (infinit.io) will be released. The program aggregates local storage across all of your devices and allows you to access the data on them at anytime directly through your file management system and eventually mobile devices. In addition, if your friends or colleagues create Infinit networks with their storage space, you can exchange storage space to ensure that your files are replicated on trusted nodes outside of your own network. All files in the Infinit network are encrypted and distributed in bits across the network so only you have access. You can find out when it's available via twitter @infinitdotio or via FB http://www.facebook.com/infinitdotio
...is why people is still actually saving everything they download? If you have the bandwidth, storing would be only limited to what you really could need fast; and talking in terabyte terms seems unreasonable. If you have the band but capped downloads, well you should only need some kind of temporary storage, sized accordingly to the cap. If you have small bandwidth, that's not your problem.