What is the Ideal Low-end NAS Solution?
Mark asks: "As demand for storage continues to grow and prices continue to drop, network attached storage (NAS) devices are popping up everywhere...from large enterprises to restaurants to small offices and homes. Several vendors are now offering low-end NAS solutions targeted at SOHO users, with varying results. Most of them are just standard PC components and standard IDE hard drives running Linux, but the price tag on these often far oustrips what one would expect to pay for the parts. Hence, people all over the world (myself included) are building their own NAS machines at home at a fraction of the cost. Beyond support for RAID, CIFS, NFS, HTTP, and FTP, what would the ideal home NAS operating system include? And more importantly, what should it leave out to avoid conflicts, security vulnerabilities, and instability? Are there any Linux/*BSD/other distributions out there optimized specifically for NAS applications? What does the ideal NAS distribution look like to you?"
A NAS is little more than a box of hard drives with a NIC attached. They get a nifty web-based interface or somesuch to make it real simple to setup and they often come in small packages, but is that worth the premium? You could buy a small-ish desktop/tower case and probably build your own very cheaply. Setting up Samba on Linux with simple "everyone can write" access is braindead simple.
Do you need a web-based interface? Do you need hot-swappable drives with auto-rebuild? Do you need a 2U rackmount or other small-ish case? (Remember, need is a very strong word.) If you can't answer yes then save yourself a few grand and do it yourself.
On the flip side, if you DO need that stuff, I've been very pleased with Fastora. Good interface, easy setup and lots of options. We got a 1.337TB unit (8x250GB hard drives in RAID5, one drive as a hot spare) with 2x100Mb NIC and 1x1Gb NIC for around $7,000.
There is a difference between "insightful" and "inciteful" other than spelling.
Might have a look at Mitel (formerly e-smith) SME Server. I've been using it for my file server at home, email, and to host a few domains for a couple of years now. Good stuff, pretty secure, can also be your router/gateway. One ther I haven't looked at, but I intend to check out soon, is BlueQuartz. Not really a distro, but the results of Sun open-sourcing the Sobalt RaQ550 network appliance. There's a binary install kit for a basic Redhat/Fedora setup, source, and many howto's out there...
I would consider two OS's for a low-end home NAS.
First OS:
Debian GNU/Linux
Why? 1) Easy to update. 2) Wide selection of packages. 3) Possible to do a minimal install and have a pretty bare-bones OS.
Second OS:
OpenBSD
Why? 1) Security. 2) Security. 3) Security.
Just for anyone else reading who gets similar ideas, he's got some big errors.
This is incorrect. The reason you only put 1 device per channel is because with IDE, only one device on a channel can be active at once. It has nothing to do with the likelihood of failure. Even if that weren't true, his assumption is silly - a single drive is much more likely to break than a single channel on a controller.
This erroneous assumption carries through his entire implementation and has crippled it's performance (as seen in the benchmarks - 36MB/s ? That's pathetic for an 6 disk RAID0 array - effectively what is is for disk reads). Using the "hardware" RAID on the card is another mistake, tying the array forever to that particular brand and model of disk controller.
Folks, if you're setting up honkin' great big RAID arrays at home and don't want to pay for decent RAID controllers like 3wares, *use software RAID*. The CPU overhead is insignificant and the bonus of being able to move the array between arbitrary machines and not having to worry about a disk controller failure permanently making your data inaccessible is more than worth it.
I had always used reiserfs for everything, but having been recently asked to set up a small bunch of inexpensive file servers, I took the time to research which filesystem is best able to survive a crash or power outage. The few recent tests I've found suggest that of XFS, JFS, reiserfs, and ext3 (ordered), ext3 had the by far best recovery rate, and reiserfs had the worst among the journaled filesystems tested. In one, where a disk intensive app was run and the system was reset several several seconds later, ext3 survived over 300 power cycles with minimal damage, while reiserfs became unbootable after 10 cycles, and the rest did better but came nowhere near ext3.
After a few days of disbelief and frantic googling, I decided to make the switch to ext3. Now if I can only get approval to purchase UPS's for the servers.
As for which distribution to use, we tested Slackware 10, Fedora Core 2, and finally chose CentOS.