Slashdot Mirror


The Amazing $5k Terabyte Array

An anonymous reader writes: "Running out of space on your local disk? How about a Terabyte array for only a few thousand dollars. This article at KCGeek.com shows how to put together 1000 Gigs of hard drive space for the cost of a few desktop computers." I could rip my entire anime collection for instant access! Rip all my CDs and still have .9 Terabytes left! Maybe Mirror Usenet! I guess the simple truth is that now that 100 gig drives are a couple hundred bucks, we now have the ability to store anything we reasonably could need (unless you define "Reasonable" as "I need to store DNA Sequences").

11 of 448 comments (clear)

  1. Actually by IAmATuringMachine! · · Score: 5, Interesting

    Actually a DNA sequence is only about 3GB for a human - you're anime DVDs might take more space, at least until you compress them. Then again, DNA should be fairly trivial to compress highly. Let Z = CA, Y = TG, .....

    --
    "Computer Science is no more about computers than astronomy is about telescopes."
    -E. W. Dijkstra
  2. 3Ware Escalade IDE-RAID by rdl · · Score: 5, Interesting

    I've been using these for a long time (6200 dual-port in hardware-mirror, up to the 8-port cards for large disk configs), and they're very fast and reliable. Cheap, too.

    $500 for an 8-port 64-bit RAID controller, looking to the host like a single scsi device per logical volume, seems like the best deal available. Along with a motherboard with sufficient slots for gig-e and these cards (easy to get 4 64-bit slots...maybe you can get more with 3-4 buses), and a 4U rackmount case with 16 drive bays, and you can have 4U of rackmount storage for $5k, too.

    I've been using setups like this for clients, as well as for private file storage (divx, mp3, backups, etc.), and know of people using them for USENET news servers (one of the most demanding unix apps for reasonably priced hardware).

    It goes without saying you want a journaled file system or softupdates when you have disks this size, and ideally keep them mounted read-only, and divided into smaller partitions, whenever possible. e2fsck on a 300GB partition with hundred of open files is painful.

  3. 2TB for $8300 by GigsVT · · Score: 5, Interesting

    Inspired by Slashdot's earlier story that was nearly identical, and with the help of Peter Ashford from ACCS, we built two servers, both with capacities well over a TB, for around $8000 each. They have the capacity to expand to 3TB if need be.

    Story here

    As far as performance:
    (from my memory)
    EXT3: About 16MB/Sec block write, 45MB/sec block read
    ReiserFS: About 20MB/sec block write, 130MB/Sec block read (that's no typo).
    XFS: About 30MB/sec block write, 85MB/sec block read.

    It seems that file system plays a large role in performance. The arrays are three RAID5 in hardware using Linux software RAID0 on top of the RAID5 arrays to tie them together.

    IDE RAID controllers are 3ware Escalade 7810. Write performance can be greatly increased by using 7850 cards that have more cache.

    We stuck with XFS, Reiserfs had a bigfile bug, files created over 2GB would lock up the computer basically. XFS in general seemed much more mature, reiserfs seems more like someone's college thesis project, that they never cleaned up to be production grade.

    We experimented with different RAID0 stripe sizes, the hardware RAID5 stripe size is fixed at 64k, there are 7 active disks in each array and one hot spare. Stripe size tweaking seemed to mostly trade off read for write speed, within a certain range of values, with a taper off in performance at either extreme, (down around 8k stripes, or over 1024k stripes)

    We eventually went with 1024k stripes. That is what the benchmarks above reflect. The variance in file system performance could very well be due to interactions with stripe size, but there seemed to be common themes (reiser always read fastest no matter what stripe, XFS was always better at writes)

    I have been in so many arguments with SCSI zealots on here over this RAID... I wish people would understand what price/performance ratio means. IDE isn't a superior technology, but every now and then, it is the right tool for the job, when price is a goal too.

    --
    I've had enough abrasive sigs. Kittens are cute and fuzzy.
  4. Uhm, redundancy in posting? by VWswing · · Score: 3, Interesting

    Is this any more special than the last time
    slashdot announced an amazing terabyte arrayHere

    Seriously though.. People's numbers are pretty far off. This can be done for about 3000.. Pricewatch
    has 160 gig drives for $259 .. 10 of these would give you over 1 terabyte in useable space in raid 1.. Or if you just cared about write performance, 6 of them for $1554 would give you a terabyte of useable storage.. another $600 to throw together a cheap pc and cheap ide raid cards.. you get it for under $2500.. big deal.

    Lately I'm realizing how awful IDE really is.. I finally got around to throwing 2 36 gig ultra 160 drives on my box with an adaptec scsi card, running ext3 on top of a raid mirror.. more space than I need (I just keep all my mp3s on an IDE raid.. since my dragon motherboard has ide raid built in).. Since I've gone to scsi life has been happy. I can do things while compiling, while vacuuming my db, etc..

    Funny how mac used scsi before the rest of us, huh?

    --
    "And how can this be? For he is the ..."
  5. Why use expensive online storage? by JoeShmoe · · Score: 3, Interesting

    Aren't these types of systems more for archiving massive amounts of data than actively working on it? I mean, how much data can a computer actively process anyway? Wouldn't a 100GB drive meet just about any processing demands (genome tracking, video editing, etc)?

    Why not use slower but MUCH cheaper offline storage? I really like the design goal of

    http://www.dvdchanger.com/

    You can easily get 1TB of storage with such a device for less than $1000. True, only one person can access it at a time but that is only because PowerFile wants to charge more for so-called "networked version".

    In theory, if someone could figure out how to build on of these things, you could throw in a two or three CD/DVD drives for accessing and a 20GB hard drive to buffer images. Boom. Now you have the perfect storage backbone for a house-wide media center. I just wish Linksys or someone would throw a linux thinserver onto of the PowerFile hardware and get me something cheap and network-ready.

    - JoeShmoe

    .

    --
    -- I wonder which will go down in history as the bigger failure: the War on Drugs or the War on Filesharing
  6. Oops! by Paul+Johnson · · Score: 4, Interesting
    Sorry, I just noticed a thinko in the discussion of IDE drive costs. The DLT costing assumed 2:1 compression. The disk cost didn't. Assuming compression we can squash 120GB onto a 60GB drive, requiring only 9 drives for a full backup, and 20 drives overall (a couple of spares is always a good idea). Thats £2200 for IDE backup, which is actually cheaper than the DLT solution.

    Does anyone out there actually use IDE drives like this? It seems a pretty obvious thing to do.

    Paul.

    --
    You are lost in a twisty maze of little standards, all different.
  7. thousand hours of video? by peter303 · · Score: 3, Interesting

    Video is the most bulky storage people would save. How much would people want to save for re-viewing? First you have the time-shifting stuff like TiVo/Replay- perhaps a few tens of hours at most. Then you would be your favorite movies and TV series. As video-phone improves you might be saving some hours of friends and relatives video conversations. With infinite storage, the constraint becomes need and time to view all that stuff. And you'll probably be wanting to spend your time looking at new stuff. So I'd guess most people's real needs would be hundreds to a thousand hours. At 1-2 BG per hour, your talking about a terabyte or two.

    I don't include the argument that you'd have trouble finding old stuff. Computer software is more clever at organizing things - far better than material storage. A good recent example of this is Apple's "iPhoto" that much more convenient for organizing thousands of photos than physical albums.

  8. There's a reason they called it "Terraserver" by jefp · · Score: 3, Interesting

    I've wanted a terabyte of storage since the mid-1970s, when I realized that there were approximately a trillion square meters on the Earth's surface. Store one byte of grayscale image for each square meter and that's a terabyte of data right there.

    Of course these days I'd want 3TB so I could store color images.

  9. Re:The Amazing $5k Terabyte Array by edmudama · · Score: 3, Interesting

    > They really really need to design a IDE-II
    > specification that gives the SCSI performance
    > traits to IDE.

    They already have it -- tag command queueing has been in the ATA spec for years, since ATA-5 I think. Most vendors either have command queueing IDE drives, or are coming out with them soon.

    http://www.t13.org for more info on the various ATA specifications

    --eric

    --
    More data, damnit!
  10. Do it for half with Pricewatch by mangoless · · Score: 3, Interesting

    Storage solution: 1TB RAID5 storage array (Prices are from Pricewatch) Quantity Price Subtotal Intel Celeron 700 MHz w/ Socket 370 MB, UDMA 100, AGP VIDEO 8~64MB shared only, Sound, 56K AMR Modem, 10/100 Network in MidTower case w/Powersupply 1x$135.00=$135.00 Power Magic PCI IDE U/ATA100 RAID Controller w/Cable 4x$22.00=$88.00 Maxtor 4G160J8 5400/133 8x$259.00=$2,072.00 60.0GB EIDE Ultra DMA 5400 1x$85.00=$85.00 Total: $2,380.00 - Mangoless

    --
    [a mango-free monkey]
  11. Better performance.. by tcc · · Score: 4, Interesting

    Get a 3ware escalade card in march they'll support 48bits-LBA in the new firmware, you'll be able to hookup those 160GB monsters in raid-0 (or raid-5) with a tenfold increase in performance, without taking up all the PCI slots.

    the TX2 is a nice little card, but you can only use 2 drives per board for getting the "full speed" (else if you use master/secondary, 4 drives will give you the raid speed of 2 in stripe) and then you'd have to stripe your raid-0 drives in software. Instead of wasting PCI slots and using an underperforming card, you pay a couple of bucks more and you get the real thing with full speed and hardware raid5.

    There are a lot of raid benchmarks at storagereview.com as well. IDE raid is so damn cheap.

    --
    --- Metamoderating abusive downgraders since my 300th post.